Data Mining and Information Retrieval
Trying to decipher data on the web
I took the following courses (among others) at The University of Illinois at Chicago (UIC):
Data Mining and Text Mining and
Query Processing in Database Systems
Projects:
In Analysis of News Articles using Metasearch Engines we find sentences that are unique to Conservative and Liberal News Providers. Sometimes, it is interesting to see how a controversial topic (such as 'Stem Cell Research', 'War on a country') receives opposite opinions in different newspapers.
Search Engine Evaluation was a term-long project - an attempt to determine which of the top search engines - Google, Yahoo! and Live - would give 'satisfactory' results for a given query. It involved all students in class running queries on the three search engines and evaluating the search engines. I wrote a program that learns the data obtained from the evaluation and guess, for a given query, how the three search engines would perform.
MS-GSP Algorithm mines for sequential patterns using multiple minimum supports. It is a level-wise search algorithm, based on the GSP and the Apriori Algorithms. An implementation of this algorithm using C# can be found here.