keyword
Here are 166 public repositories matching this topic...
-
Updated
Sep 29, 2019
-
Updated
Apr 27, 2020 - HTML
I would like to process a corpus of documents with the TF-IDF model. My corpus is a single txt file in which each line is a document. That works as input for any model in pke, but TF-IDF needs a document frequency matrix. The pke utilities can generate one, but they accept only an input_dir in which each file is a document. It would be convenient to have an option to supply the documents as a single file, as for all the other models.
Th
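To make the request above concrete, here is a minimal sketch of counting document frequencies when every line of one file is treated as a document. It uses only the Python standard library and is not pke's implementation: the function name `document_frequency` and the simple `\w+` tokenizer are assumptions made for illustration, and it does not reproduce pke's own candidate selection, normalization, or output file format.

```python
import re
from collections import Counter


def document_frequency(lines):
    """Count, for each token, how many documents (lines) contain it.

    `lines` is any iterable of strings, one document per string,
    for example an open text file. Hypothetical helper, not part of pke.
    """
    df = Counter()
    n_docs = 0
    for line in lines:
        # A set, so each token counts at most once per document.
        tokens = set(re.findall(r"\w+", line.lower()))
        if not tokens:
            continue  # skip blank lines instead of counting empty documents
        n_docs += 1
        df.update(tokens)
    return n_docs, df


if __name__ == "__main__":
    docs = ["the cat sat", "The dog sat", "", "a cat"]
    n_docs, df = document_frequency(docs)
    print(n_docs, df.most_common(3))
```

For a real corpus, an open file object can be passed directly, e.g. `with open("corpus.txt", encoding="utf-8") as f: document_frequency(f)`, where `corpus.txt` stands in for the one-document-per-line file described in the issue.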
-
Updated
Apr 25, 2019 - Python
-
Updated
May 26, 2020 - JavaScript
-
Updated
May 22, 2020 - JavaScript
Traceback (most recent call last):
  File "", line 1, in
  File "/data1/kt/work/KR-WordRank/krwordrank/word/_word.py", line 211, in extract
    rank, graph = self.train(docs, beta, max_iter, vocabulary, bias)
  File "/data1/kt/work/KR-WordRank/krwordrank/word/_word.py", line 319, in train
    self.scan_vocabs(docs)
  File "/data1/kt/work/KR-WordRank/krwordrank/word/_word.py",
-
Updated
May 24, 2020 - Python
The current solution opens a panel and puts the documentation there. Sublime Text 3 offers better ways to display documentation to users. Consider whether those features could be used.
-
Updated
Sep 11, 2019 - Python
-
Updated
Sep 12, 2017 - CSS
The word segmentation step could probably be optimized.
import re
import jieba

def segment(sentence, cut_all=False):
    sentence = sentence.replace('\n', '').replace('\u3000', '').replace('\u00A0', '')
    sentence = ' '.join(jieba.cut(sentence, cut_all=cut_all))
    return re.sub('[a-zA-Z0-9.。::,,))((!!??”“\"]', '', sentence).split()  # could strip these characters first, then segment
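The comment in the snippet above suggests stripping characters first and segmenting afterwards. A minimal sketch of that ordering follows. It is an illustration, not the repository's code: the name `segment_clean_first` and the injected `cut` callable are assumptions, and unwanted characters are replaced with a space rather than deleted so that neighbouring words are not glued together before segmentation. The output may therefore differ from the original function on some inputs.

```python
import re

# Same character class as the original snippet, compiled once for reuse.
_NOISE = re.compile('[a-zA-Z0-9.。::,,))((!!??”“\"]')
# Characters the original removes outright: newline, ideographic space, NBSP.
_DROP = str.maketrans('', '', '\n\u3000\u00A0')


def segment_clean_first(sentence, cut):
    """Strip unwanted characters first, then segment.

    `cut` is any callable that maps a string to an iterable of tokens
    (hypothetical parameter, so the sketch runs without a tokenizer installed).
    """
    sentence = sentence.translate(_DROP)
    # Assumption: substitute a space, not '', to keep word boundaries.
    sentence = _NOISE.sub(' ', sentence)
    # join + split discards the whitespace-only tokens a tokenizer may emit.
    return ' '.join(cut(sentence)).split()


if __name__ == "__main__":
    print(segment_clean_first('abc 你好,世界\n', str.split))
```

With jieba installed, the tokenizer from the original snippet could be passed as `lambda s: jieba.cut(s, cut_all=False)`. Cleaning first means the tokenizer sees less text, but whether that is actually faster would need to be measured.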
-
Updated
Apr 23, 2020 - CSS
-
Updated
Apr 18, 2019 - Python
-
Updated
Aug 30, 2016 - JavaScript
-
Updated
Aug 29, 2018 - Python
-
Updated
Jan 29, 2019 - PHP
-
Updated
Mar 16, 2020 - JavaScript
-
Updated
May 16, 2020 - Python
Verify that the page and line numbers are correct when the document is split into sentences.
library(pdfsearch)
pdf_url <- "https://arxiv.org/pdf/1610.00147.pdf"
search_result_sent <- keyword_search(pdf_url, keyword = c('measurement error'),
                                     path = TRUE, remove_hyphen = TRUE, convert_sentence = TRUE)
-
Updated
Aug 14, 2017 - JavaScript
I could not think of a good way to work on a PR, so I am creating an issue instead.
This is what the README says regarding delight and projectile:
However this under certain circumstances would have serious performance issue, an