More than simple categorization or entity extraction software, Luxid Annotation Server is built around Luxid Annotation Factory, a robust and scalable natural language processing pipeline that supports rapid and reliable content enrichment.
A broad range of extraction techniques
Luxid Annotation Server supports a variety of use cases, including document annotation, indexing and categorization. It includes language identification, segmentation, tokenization, morphological analysis and part-of-speech tagging. The output of this analysis is passed on to Luxid Annotation Server’s information extraction layer, which combines a wide range of extraction techniques: morpho-syntactic reasoning, statistics, thesaurus-, taxonomy- and ontology-based extraction, machine learning and rule-based extraction.
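To make the two-stage flow concrete, here is a minimal sketch in Python of a linguistic analysis step (segmentation and tokenization) feeding a thesaurus-based extraction step. It is an illustration only, not Luxid Annotation Server code: the names Annotation, THESAURUS, segment, tokenize, extract and run_pipeline are hypothetical, and the naive regular expressions stand in for far more capable analysis components.

```python
import re
from dataclasses import dataclass


@dataclass
class Annotation:
    label: str   # entity type taken from the thesaurus
    text: str    # surface form as it appears in the document
    start: int   # character offset where the match begins
    end: int     # character offset just past the match


# Hypothetical thesaurus mapping lower-case terms to entity types.
THESAURUS = {"acme corp": "Company", "paris": "Location"}


def segment(text):
    """Naive sentence segmentation: split after '.', '!' or '?'."""
    return [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]


def tokenize(sentence):
    """Naive tokenization: runs of word characters, or single punctuation marks."""
    return re.findall(r"\w+|[^\w\s]", sentence)


def extract(text, thesaurus=THESAURUS):
    """Thesaurus-based extraction: case-insensitive whole-term lookup."""
    found = []
    for term, label in thesaurus.items():
        pattern = r"\b" + re.escape(term) + r"\b"
        for m in re.finditer(pattern, text, flags=re.IGNORECASE):
            found.append(Annotation(label, m.group(0), m.start(), m.end()))
    return sorted(found, key=lambda a: a.start)


def run_pipeline(text):
    """Run linguistic analysis first, then the extraction layer."""
    sentences = segment(text)
    return {
        "sentences": sentences,
        "tokens": [tokenize(s) for s in sentences],
        "annotations": extract(text),
    }
```

Calling run_pipeline("Acme Corp opened an office in Paris. Sales grew.") yields two sentences, their token lists, and one Company and one Location annotation with character offsets. A production pipeline would pass richer analysis output (lemmas, part-of-speech tags) to the extraction layer instead of raw text.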
Corpus-level analysis
At the corpus level, Luxid Annotation Server performs categorization (classifying documents into predefined categories) and clustering (grouping similar documents into dynamically created clusters).
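The difference between the two operations can be sketched in a few lines of Python. This is a simplified illustration under stated assumptions, not the product's method: categorization is approximated by keyword overlap with hypothetical predefined categories (CATEGORIES), and clustering by a greedy grouping on Jaccard similarity with an arbitrary threshold.

```python
import re

# Hypothetical predefined categories, each described by a few keywords.
CATEGORIES = {
    "finance": {"merger", "shares", "bank"},
    "biology": {"protein", "gene", "cell"},
}


def words(doc):
    """Lower-cased set of alphabetic words in a document."""
    return set(re.findall(r"[a-z]+", doc.lower()))


def categorize(doc, categories=CATEGORIES):
    """Assign the predefined category with the most keyword hits, or None."""
    doc_words = words(doc)
    scores = {name: len(doc_words & kw) for name, kw in categories.items()}
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None


def jaccard(a, b):
    """Jaccard similarity of two sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def cluster(docs, threshold=0.3):
    """Greedy clustering: join the first cluster whose seed document is
    similar enough, otherwise start a new cluster. Returns lists of indices."""
    clusters = []
    for i, doc in enumerate(docs):
        doc_words = words(doc)
        for members in clusters:
            if jaccard(doc_words, words(docs[members[0]])) >= threshold:
                members.append(i)
                break
        else:
            clusters.append([i])
    return clusters
```

The categories are fixed before any document is seen, whereas the clusters emerge from the documents themselves: adding or removing documents can change how many clusters exist and what they contain.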
Luxid Annotation Factory and Skill Cartridges
At the core of Luxid Annotation Factory is a wide range of specialized extraction modules called Skill Cartridges that support entity extraction across multiple domains and applications.
Each Skill Cartridge focuses on a specific context: general-interest entities such as companies, people, locations, or dates, or more domain-specific entities such as proteins or genes in a biology context. Beyond simple structured information extraction, Skill Cartridges can also extract links and relations between entities (for example, a merger between two companies or a chemical reaction between two compounds), including the roles played by the entities in the relation, their attributes and any other contextual information mentioned in the content.
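As an illustration of what a relation with roles and attributes might look like as data, the following Python sketch models the merger example. Everything in it is an assumption made for illustration: the Entity and Relation classes, the role names party_1 and party_2, the year attribute and the single regular-expression pattern are hypothetical and do not reflect the actual output format or rules of any Skill Cartridge.

```python
import re
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Entity:
    text: str   # surface form found in the content
    type: str   # entity type, e.g. "Company"


@dataclass
class Relation:
    type: str                                        # relation type, e.g. "Merger"
    roles: dict                                      # role name -> Entity
    attributes: dict = field(default_factory=dict)   # extra context, e.g. year


# Hypothetical pattern: "<Company> announced|completed a merger with <Company> [in <year>]"
MERGER = re.compile(
    r"(?P<a>[A-Z]\w+) (?:announced|completed) a merger with "
    r"(?P<b>[A-Z]\w+)(?: in (?P<year>\d{4}))?"
)


def extract_mergers(text):
    """Return one Relation per merger mention matched by the pattern."""
    relations = []
    for m in MERGER.finditer(text):
        roles = {
            "party_1": Entity(m.group("a"), "Company"),
            "party_2": Entity(m.group("b"), "Company"),
        }
        # The year is optional context; record it only when it is mentioned.
        attributes = {"year": m.group("year")} if m.group("year") else {}
        relations.append(Relation("Merger", roles, attributes))
    return relations
```

For the sentence "Alpha announced a merger with Beta in 2011.", extract_mergers returns a single Merger relation whose roles point to the two Company entities and whose attributes record the year. Keeping roles and attributes separate from the entities lets the same entity take part in several relations.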
The Luxid Skill Cartridge Library is a collection of off-the-shelf Skill Cartridges that address areas of recurring interest such as personal names, locations, corporate information and relationships, news categorization, biology, medicine, chemistry, homeland security, and others.
Existing Skill Cartridges can be easily customized, and new ones can be developed from scratch.