A collection of natural language processing projects & tool libraries & resources, divided into sub-fields and sorted out, automatically updated every week
This resource list contains 100 python natural language processing related projects & tool libraries & resources. These resources are divided into 11 different sub-sections. These projects have currently received 180K likes on github. All tool resources will automatically collect information from GitHub and the tool maintenance platform every week, and update and sort out the display. This list is completed with reference to the best-of template. Everyone is welcome to submit PRs to enrich this list.
facebookresearch-fastText24K·Code facebookresearch’s fastText tool library by @facebookresearch | text_classification(brightmart)7.4K · Code Various Chinese text classification models implemented by brightmart by @brightmart | facebookresearch-pytext6.3K · Code pytext tool library for facebookresearch by @facebookresearch |
PaddleNLP5.2K · Code PaddlePaddle NLP Tool Library by @PaddlePaddle | Paddle-ERNIE5.1K · Code ERNIE implemented by PaddlePaddle by @PaddlePaddle | Chinese-Text-Classification-Pytorch3.8K · Code Various Chinese text classification models implemented by Pytorch by @649453932 |
Bert-Chinese-Text-Classification-Pytorch2.5K · Code Chinese text classification model based on transformer/BERT implemented in Pytorch by @649453932 | Text_Classification1.6K · Code kk7nc Text_Classification by @kk7nc | NeuralNLP-NeuralClassifier(Tencent)1.4K · Code Tencent text classification open source project, including multiple model implementations and complete model construction process by @Tencent |
pytorch-sentiment-analysis3.6K · Code Sentiment analysis modeling implemented by pytorch by @bentrevett | Aspect-Based-Sentiment-Analysis1.6K · Code Implementation of sentiment analysis based on graph convolutional networks with different granularities. by @songyouwei | baidu-Senta1.6K · Code Baidu’s Senta sentiment analysis project by @baidu |
BDCI2019-Sentiment-classification440 · Code The first place solution in BDCI2019 Sentiment Analysis Competition by @cxy229 | Customer_Satisfaction_Analysis330 · Code Consumer Satisfaction Analysis Modeling by @CarryChang | chinese-customer-review34 · Code Chinese review website review data by @lsvih |
Chinese-DeepNER700 · Code Deep learning named entity recognition project by @z814081807 | BERT-for-Sequence-Labeling-and-Text-Classification410 · Code Bert-based sequence annotation and text classification by @yuanxiaosc | sequence_tagging390 · Code Sequence annotation implementation by @qiufengyuyi |
Macadam300 · Code Macadam is a natural language processing toolkit based on Tensorflow (Keras) and bert4keras, focusing on text classification, sequence annotation and relationship extraction. by @yongzhuo | seq2annotation80 · Code Implementation scheme of sequence annotation by @howl-anderson |
DeepPavlov5.8K · Code Deep learning end-to-end dialogue system by @deepmipt | haystack5.2K · Code End-to-end deep learning question and answer retrieval system by @deepset-ai | baidu-AnyQ2.4K · Code Baidu's AnyQ high-frequency question retrieval question and answer system by @baidu |
Chatbot_CN1.1K · Code A chatbot based on the financial-judicial field (both in the nature of chatting) by @charlesXu86 | Chinese-Chatbot-PyTorch670 · Code Dialogue robot implemented based on pytorch by @Doragd | 58-qa_match310 · Code Implementation of 58’s matching question answering system by @wuba |
pytorch-question-answering240 · Code Implementation of pytorch question and answer system by @kushalj001 | gpt2-bert-reddit-bot180 · Code Implementation of reddit question and answer robot based on gpt2 and bert by @lots-of-things |
ASRT_SpeechRecognition5.7K · Code Chinese speech recognition system based on deep learning by @nl8590687 | TensorFlowTTS2.8K · Code Text-to-speech project implemented by tensorflow by @TensorSpeech | NVIDIA-waveglow2K · Code NVIDIA’s waveglow speech synthesis project by @NVIDIA |
TransformerTTS920 · Code Implementation of tts text-to-speech based on transformer by @as-ideas | noisereduce720 · Code Intelligent noise reduction project by @timsainb | Speech-Transformer690 · Code Speech-Transformer speech recognition project by @kaituoxu |
Multilingual_Text_to_Speech580 · Code Multilingual text-to-speech implementation by @Tomiinek | inaSpeechSegmenter520 · Code Voice intelligent segmentation project by @ina-foss | Speech-enhancement390 · Code Speech enhancement project by @vbelz |
OpenTransformer310 · Code Transformer-based speech recognition by @ZhengkunTian | tacotron2100 · Code tacotron2 speech synthesis project by @ide8 |
QASystemOnMedicalKG4.2K · Code Medical knowledge graph construction and question and answer reasoning by @liuhuanyong | KnowledgeGraphData4.1K · Code Knowledge graph data collection by @ownthink | awesome-knowledge-graph3.4K · Code awesome knowledge map information by @husthuke |
Agriculture_KnowledgeGraph3.2K · Code Agricultural knowledge map project by @qq547276542 | KnowledgeGraphCourse3K·Code Southeast University Knowledge Graph Course by @npubird | Financial-Knowledge-Graphs1.9K · Code Financial knowledge graph project by @jm199504 |
stock-knowledge-graph1.6K · Code Stock Financial Knowledge Map Project by @lemonhu | knowledge-graphs1.3K · Code Knowledge graph by @shaoxiongji | NLP-Knowledge-Graph1.2K · Code NLP knowledge graph information by @lihanghang |
QASystemOnMedicalGraph860 · Code Question and answer system based on medical knowledge graph by @zhihao-chen |
GPT2-Chinese5.2K · Code GPT2 Chinese generation by @Morizeyao | texar2.3K · Code texar NLP tool library for text generation by @asyml | gpt2-ml1.4K · Code gpt2 Chinese text generation by @imcaspar |
Texygen840 · Code text generation by @geek-ai | texar-pytorch720 · Code pytorch implementation of texar by @asyml | transformer-xl-chinese610 · Code transformer-xl Chinese text generation by @GaoPeng97 |
awesome-text-generation460 · Code awesome text generation information by @ChenChengKuan | aichpoem160 · Code poetry generation by @wangjiezju1988 | CoupletAI140 · Code AI couplet generation by @WiseDoge |
gpt-2-poetry84 · Code gpt2 poetry generation by @kylemcdonald | QueryGeneration83 · Code Similar question generation based on deep learning by @YunwenTechnology |
awesome-text-summarization1.2K · Code awesome text summary generation information by @icoxfog417 | bert-extractive-summarizer1K·Code Extractive text summary generation based on BERT by @dmmiller612 | GPT2-NewsTitle690 · Code GPT2 news headlines automatically generated by @liucongg |
Text-Summarization-Papers460 · Code Papers related to text summarization and abstract generation by @neulab | seq2seq-summarizer350 · Code seq2seq text summary generation by @ymfa | GPT2-Summary290 · Code GPT2 text summary generation by @qingkongzhiqian |
Bart_T5-summarization150 · Code Text summary generation based on Bart-T5 by @renatoviolin | TranSummar130 · Code Extractive text summary generation based on transformer by @lipiji | nnsum100 · Code Text summary based on deep learning by @kedz |
Neural-Abstract-Text-Summarization100 · Code LeafNATS text summary generation framework by @tshi04 |
mt-dnn2K · Code Text understanding based on multi-task deep learning network by @namisan | Adversarial-Training-for-Natural-Language-Understanding230 · Code Text understanding implementation based on adversarial training by @zhuchen03 | microsoft-Natural-Language-Understanding120 · Code Text understanding based on multi-task deep learning network (Microsoft) by @microsoft |
capsule-mrc88 · Code Capsule-based perspective reading comprehension model by @freefuiiismyname |
sktime5.6K · Code scikit machine learning time series forecasting by @alan-turing-institute | neural_prophet2.5K · Code Prophet time series prediction based on neural network by @ourownstory | pytorch-forecasting2.2K · Code pytorch timing estimation by @jdb78 |
pytorch-ts740 · Code Time series prediction based on pytorch by @zalandoresearch | transformer-time-series-prediction610 · Code Transformer-based time series prediction by @oliverguhr | DeepLearning-For-Time-Series-Forecasting560 · Code Microsoft deep learning time series prediction implementation by @Azure |
sktime-dl560 · Code scikit deep learning time series classification and regression by @sktime | awesome-time-series510 · Code awesome time series forecast data by @cuge1995 | Deep-Time-Series-Prediction380 · Code Deep learning time series prediction by @EvilPsyCHo |
scikit-hts200 · Code scikit hierarchical time series forecasting library by @carlomazzaferro |
tensorflow-nmt6.1K · Code tensorflow neural network machine translation library by @tensorflow | OpenNMT-py5.7K · Code OpenNMT neural network machine translation library by @OpenNMT | facebookresearch-UnsupervisedMT1.5K · Code FacebookResearch’s unsupervised machine translation library by @facebookresearch |
OpenNMT-tf1.3K · Code OpenNMT neural network machine translation library (tensorflow version) by @OpenNMT | Nematus780 · Code nematus neural network machine translation library by @EdinburghNLP | translators760 · Code Machine translation library by @UlionTse |
EasyNMT610 · Code EasyNMT neural network machine translation library by @UKPLab | joeynmt560 · Code Neural network machine translation by @joeynmt |
MatchZoo3.7K · Code MatchZoo library containing multiple text matching models by @NTMC-Community | awesome-neural-models-for-semantic-match750 · Code awesome neural network semantic matching information by @NTMC-Community | text_matching620 · Code Text matching model implementation by @terrifyzhao |
TextMatch580 · Code TextMatch text matching implementation by @zhaogaofeng611 | eBay-Sequence-Semantic-Embedding450 · Code eBay's sequence semantic embedding tool library by @eBay | deepmatcher440 · Code Deep learning text matching library by @anhaidgroup |
alibaba-simple-effective-text-matching330 · Code ACL2019 paper "Simple and efficient text matching" implementation by @alibaba-edu | alibaba-simple-effective-text-matching-pytorch270 · Code ACL2019 paper "Simple and efficient text matching" implementation (pytorch version) by @alibaba-edu | Semantic-Search100 · Code Semantic retrieval system by @renatoviolin |