Thursday, May 27, 2010

LingPipe 3.9.2

LingPipe is a suite of Java libraries for the linguistic analysis of human language.

Feature Overview

LingPipe's information extraction and data mining tools:
  • track mentions of entities (e.g. people or proteins);
  • link entity mentions to database entries;
  • uncover relations between entities and actions;
  • classify text passages by language, character encoding, genre, topic, or sentiment;
  • correct spelling with respect to a text collection;
  • cluster documents by implicit topic and discover significant trends over time; and
  • provide part-of-speech tagging and phrase chunking.

Architecture

LingPipe's architecture is designed to be efficient, scalable, reusable, and robust. Highlights include:
  • Java API with source code and unit tests;
  • multi-lingual, multi-domain, multi-genre models;
  • training with new data for new tasks;
  • n-best output with statistical confidence estimates;
  • online training (learn-a-little, tag-a-little);
  • thread-safe models and decoders for concurrent-read exclusive-write (CREW) synchronization; and
  • character encoding-sensitive I/O.
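
To make the CREW bullet concrete, here is a minimal sketch of concurrent-read exclusive-write locking in plain Java. It does not use LingPipe's own classes: CrewTokenModel, train, and estimate are hypothetical names chosen for illustration, and the locking comes from the standard java.util.concurrent.locks.ReentrantReadWriteLock. The same pattern also shows the learn-a-little, tag-a-little idea, since training and estimation can be interleaved safely.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.concurrent.locks.ReentrantReadWriteLock;

// Hypothetical toy model (not a LingPipe class): token counts guarded by a CREW lock.
class CrewTokenModel {
    private final Map<String, Integer> counts = new HashMap<>();
    private int total = 0;
    private final ReentrantReadWriteLock lock = new ReentrantReadWriteLock();

    // Exclusive write: a training update blocks readers and other writers.
    void train(String token) {
        lock.writeLock().lock();
        try {
            counts.merge(token, 1, Integer::sum);
            total++;
        } finally {
            lock.writeLock().unlock();
        }
    }

    // Concurrent read: any number of threads may hold the read lock at once.
    double estimate(String token) {
        lock.readLock().lock();
        try {
            if (total == 0) {
                return 0.0; // nothing seen yet
            }
            return counts.getOrDefault(token, 0) / (double) total;
        } finally {
            lock.readLock().unlock();
        }
    }
}
```

In this sketch many threads can call estimate at the same time, while each call to train briefly takes exclusive access, so readers never see a half-applied update.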

