Search This Blog

Tuesday, September 15, 2009

Analyzing in Text Mining

Text Mining is Mine the data in the form of text. Source of data is usually derived from the documents. The goal is to find words that can represent what is in the document so that it can be inter-connectedness analysis of documents

phase of text mining are:
Tokenism
Filtering
Stemming
Tagging
Analyzing

what is Analyzing ?
Finding how much connection between the words between documents
Term Frequency-Inversed Document Frequency (TF-IDF) is the simplest algorithm is usually used for scoring

the process of TF-IDF

No comments:

Calendar