Data Mining and Information Retrieval
Trying to decipher data on the web
I took the following courses (among others) at The University of Illinois at Chicago (UIC):
Query Processing in Database Systems
In Analysis of News Articles using Metasearch Engines we find sentences that are unique to Conservative and Liberal News Providers. Sometimes, it is interesting to see how a controversial topic (such as 'Stem Cell Research', 'War on a country') receives opposite opinions in different newspapers.
Search Engine Evaluation was a term-long project - an attempt to determine which of the top search engines - Google, Yahoo! and Live - would give 'satisfactory' results for a given query. It involved all students in class running queries on the three search engines and evaluating the search engines. I wrote a program that learns the data obtained from the evaluation and guess, for a given query, how the three search engines would perform.
MS-GSP Algorithm mines for sequential patterns using multiple minimum supports. It is a level-wise search algorithm, based on the GSP and the Apriori Algorithms. An implementation of this algorithm using C# can be found here.