Thesis
With some of the classes I've taken and having talked to a few professors and fellow students, my thesis topic is starting to solidify. I'm considering an online application to find complex target material using data mining, specifically, natural language processing (NLP). I've found a worthy freeware application and a few other references that seem promising, but much more research is needed.
At this point though, and while the past few years of my career involved extraction and reporting, with sometimes sophisticated regexes, this new application will ideally find historical target material, given a set of complex characteristic sets. With the interesting information found, I then plan on using statistics to determine the likelihood of an event happening again in the future.
The application will have to be build robust enough to have multiple application potentials (eg, maybe apply it towards both engineering or scientific domains). I do have a target market in mind already and since an SDM thesis has to be half technology and half business oriented, what I have at this point is a good start. More later.
Labels: "data mining", "Natural language processing", NLP, thesis


