nlp-resources

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact through natural language. In 1950, Alan Turing published the article "Computing Machinery and Intelligence", which proposed a measure of intelligence now called the Turing test. More recent techniques, such as deep learning, have produced strong results in language modeling, parsing, and other natural-language tasks.

Here are 71 public repositories matching this topic...

fititnt commented May 31, 2018

Mac-Morpho is a corpus of Brazilian Portuguese texts annotated with part-of-speech tags. Its first version was released in 2003 [1], and since then, two rev

shyamupa commented Jan 23, 2019

Thanks for setting this up, really makes life easier for preprocessing.

It might be useful to say that the WSJ path should point to the mrg part of the parsed corpus in PTB, something like:

WSJ=.../LDC99T42/treebank_3/parsed/mrg/wsj/

There are other options like parsed/prd/wsj, which do not work with the script provided.

The below exports in file ./bin/basic/get_data.sh do no

Created by Alan Turing

Wikipedia