• Categorization: automatically assigns a document to one or more predetermined categories, based on lexical analysis
• Clustering: groups documents by content to provide an overview, reveal hidden similarities, and speed up the search for similar or related information
• Genre identification: determines the type of document from characteristics of its language, format, and content
• Metadata extraction: identifies key “features” of a document and extracts them
• Language identification: automatically recognizes the language in which a document is written
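
As a rough illustration of categorization by lexical analysis, the sketch below assigns a document to every predetermined category whose vocabulary it matches. The category names, the term lists, and the `min_hits` threshold are hypothetical and chosen only for the example. A production system would typically learn or curate its vocabularies rather than hard-code them.

```python
import re

# Hypothetical category vocabularies, assumed purely for illustration.
CATEGORY_TERMS = {
    "finance": {"bank", "loan", "interest", "stock"},
    "sports": {"match", "team", "score", "league"},
}


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def categorize(text, min_hits=2):
    """Return, sorted by name, every category whose vocabulary shares
    at least `min_hits` distinct tokens with the document."""
    tokens = set(tokenize(text))
    return sorted(
        name
        for name, terms in CATEGORY_TERMS.items()
        if len(tokens & terms) >= min_hits
    )


if __name__ == "__main__":
    print(categorize("The bank raised the interest rate on every loan."))
    print(categorize("The team owner took a bank loan after the match."))
```

Because the function returns a list rather than a single label, one document can fall into several categories at once, or into none when no vocabulary reaches the threshold.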