Here are 229 public repositories matching this topic...
Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
Updated
Jan 7, 2022
Python
Updated
Apr 5, 2022
Python
Final Weibo Crawler. Scrape anything from Weibo: comments, Weibo contents, followers, anything. The Terminator
Updated
Oct 25, 2019
Python
Awesome Chatbot Projects, Corpus, Papers, Tutorials. Chinese Chatbot =>:
Updated
Feb 10, 2020
Python
Corpora for training Chinese and English dialogue systems (Datasets for Training Chatbot System)
Updated
Sep 23, 2020
Python
Updated
Oct 11, 2020
Python
Chatbot in 200 lines of code using TensorLayer
Updated
Oct 5, 2021
Python
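A collection of high-quality Chinese pre-trained models: state-of-the-art large models, fastest small models, and specialized similarity models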
Updated
Jul 8, 2020
Python
Collections of Chinese NLP corpus
Updated
Dec 28, 2020
Python
Updated
Jun 23, 2021
Python
An Integrated Corpus Tool With Multilingual Support for the Study of Language, Literature, and Translation
Updated
Mar 12, 2022
Python
Updated
Dec 29, 2021
Python
CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark (a benchmark for Chinese medical information processing)
Updated
Mar 29, 2022
Python
❤️ Emotional First Aid Dataset: psychological counseling Q&A and chatbot corpus
Updated
Apr 7, 2022
Python
UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language
Updated
Apr 8, 2022
Python
Preprocessed Python functions and docstrings for automated code documentation (code2doc) and automated code generation (doc2code) tasks.
Updated
Jul 13, 2020
Python
Tools for ASR Corpus Generation from Online Video
Updated
Feb 10, 2019
Python
Corpus of Russian news articles collected from Lenta.Ru
Updated
Feb 26, 2021
Python
Updated
Aug 7, 2021
Python
Python toolkit for the Chinese Language Understanding Evaluation (CLUE) benchmark
Updated
Feb 9, 2022
Python
A summary of text corpora that can be used for hands-on chatbot training, covering different languages including Chinese and English
Updated
Jun 7, 2018
Python
This repository contains code and metadata of the How2 dataset
Updated
Jun 12, 2020
Python
Japanese text8 corpus for word embedding.
Updated
Oct 4, 2017
Python
An Open-Source Package for Chinese Open-domain Conversational Chatbot (a Chinese chit-chat dialogue system with one-click deployment of a WeChat chit-chat bot)
Updated
Dec 7, 2020
Python
A General Purpose NLP library for Turkish
Updated
Feb 28, 2022
Python
A list of ~98,000 German nouns and their grammatical properties compiled from WiktionaryDE as a CSV file. Plus a module to look up the data and parse compound words.
Updated
Jan 21, 2022
Python
Word2vec (word to vectors) approach for the Japanese language using Gensim and MeCab.
Updated
Jan 25, 2022
Python
Convert a PDF via OCR to a TXT file in UTF-8 encoding
Updated
Apr 6, 2022
Python
Corpus of Annual Reports in Japan
Updated
Dec 19, 2020
Python
Japanese Realistic Textual Entailment Corpus (NLP 2020, LREC 2020)
Updated
Oct 15, 2020
Python