Skip to content
#

nlp-library

Here are 247 public repositories matching this topic...

transformers
NielsRogge
NielsRogge commented Nov 15, 2021

🚀 Feature request

Currently, the EncoderDecoderModel class in PyTorch automatically creates the decoder_input_ids based on the labels provided by the user (similar to how this is done for T5/BART). This should also be implemented for TFEncoderDecoderModel, because currently users should manually provide decoder_input_ids to the model.

One can take a look at the TF implementation

Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).

  • Updated Feb 8, 2021
  • Python

PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language model. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP specific servers. PyNLPl most notably features a very extensive library for working with FoLiA XML (Format for Linguistic Annotation).

  • Updated Mar 13, 2019
  • Python
OCTIS

Improve this page

Add a description, image, and links to the nlp-library topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the nlp-library topic, visit your repo's landing page and select "manage topics."

Learn more