WhyzTech Blog

Social Media, Computational Linguistics and AI 2011-11-11T14:46:42Z

Intellectual Property Protection: Patents

Fri, 11/11/2011 - 14:46
I’ve spent seven years developing gnowit.com. It arose out of my PhD thesis, and employs the codebase and architecture I put together over the years. Gnowit.com contains some novel work on sentiment classification, auto-categorization of information, core-topic extraction and ‘noise-cancelling’ (let’s leave it at that for now). As we’ve come out of stealth mode two weeks [...]

Solrj error with new httpclient

Tue, 09/06/2011 - 09:02
I refactored my codebase to use the new Apache HttpClient 4 component (from the org.apache.http package). This broke Solrj, which depends on the Apache HttpClient 3.x components. The resulting error was the cryptic ‘java.lang.ClassNotFoundException: org.apache.commons.httpclient.HttpException’. The fix was to reintroduce the commons-httpclient-3.1.jar file and explicitly pass in the older HttpClient when creating the Solrj connection. import org.apache.solr.client.solrj.impl.CommonsHttpSolrServer; import [...]
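
A quick way to confirm this kind of breakage before Solrj is ever touched is to ask the class loader whether the legacy 3.x classes are still visible. The following is only a minimal sketch using the standard library; the `ClasspathCheck` class and its `isClassPresent` method are hypothetical names introduced for illustration, not part of Solrj or HttpClient.

```java
// Hypothetical diagnostic helper; not part of Solrj or Apache HttpClient.
final class ClasspathCheck {
    private ClasspathCheck() {}

    /** Returns true if the named class can be located by this class's loader. */
    static boolean isClassPresent(String className) {
        try {
            // initialize=false: only locate the class, do not run its static initializers.
            Class.forName(className, false, ClasspathCheck.class.getClassLoader());
            return true;
        } catch (ClassNotFoundException | LinkageError e) {
            return false;
        }
    }

    public static void main(String[] args) {
        // Class named in the error message above (HttpClient 3.x, commons-httpclient jar).
        String legacy = "org.apache.commons.httpclient.HttpException";
        // Core client interface of HttpClient 4.x (org.apache.http package).
        String current = "org.apache.http.client.HttpClient";
        System.out.println("HttpClient 3.x present: " + isClassPresent(legacy));
        System.out.println("HttpClient 4.x present: " + isClassPresent(current));
    }
}
```

If the first line reports false while Solrj is on the classpath, the commons-httpclient jar has probably been dropped, which would match the ClassNotFoundException described above.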

Promoting Code from R&D to Engineering: The 3 Decimal Place Barrier

Tue, 08/23/2011 - 17:08
One of the watershed events that mark a successful R&D project is the handover to engineering. This step generally involves a fair amount of integration effort, and occasionally requires that the code be rewritten to employ databases or web services, or to conform to data contracts with other modules. Sometimes, this also requires that the logic be rewritten [...]