Welcome to the SharpNLP Project

SharpNLP is a collection of natural language processing tools written in C#. Currently it provides the following NLP tools:
  • a sentence splitter
  • a tokenizer
  • a part-of-speech tagger
  • a chunker (used to "find non-recursive syntactic annotations such as noun phrase chunks")
  • a parser
  • a name finder
  • a coreference tool
  • an interface to the WordNet lexical database

SharpNLP 1.0.2529 Beta

This is the latest release of SharpNLP. To use it you will need to download either the runtime binaries or the source code (SharpNLP 1.0.2529 Beta), plus the model files (NBin Models 1.0.0 Production). Detailed instructions can be found in the Installation Instructions.

What is SharpNLP

OpenNLP is both the name of a group of open source projects (http://opennlp.sourceforge.net) related to natural language processing (NLP), and the name of a library of NLP tools written in Java (http://sourceforge.net/projects/opennlp) by Jason Baldridge, Tom Morton, and Gann Bierner. SharpNLP is a C# port of the Java OpenNLP tools, plus additional code to facilitate natural language processing.

All of these tools are driven by maximum entropy models processed by the SharpEntropy library.

In addition, SharpNLP contains a WordNet database access library called SharpWordNet.

How can I help?
Developer Guide

More information can be obtained from these CodeProject articles:


Last edited Dec 13, 2006 at 8:23 PM by richardn, version 9