ThoughtSource vs PIXIU
Metric | ThoughtSource | PIXIU
---|---|---
Mentions | 1 | 6
Stars | 845 | 423
Growth | 1.4% | 5.7%
Activity | 8.4 | 8.9
Last commit | 11 months ago | 8 days ago
Language | Jupyter Notebook | Jupyter Notebook
License | MIT License | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
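The text above does not give exact formulas, so the following is only a minimal sketch of how metrics of this kind could be computed. The function names, the exponential decay, and the 90-day half-life are all assumptions made for illustration; they are not the site's actual method. The only properties taken from the description are that growth compares stars month over month, that recent commits weigh more than older ones, and that a score of 9.0 corresponds to the top 10% of tracked projects.

```python
from datetime import date


def month_over_month_growth(stars_now: int, stars_month_ago: int) -> float:
    """Percentage change in stars relative to the count one month earlier."""
    if stars_month_ago <= 0:
        raise ValueError("stars_month_ago must be positive")
    return 100.0 * (stars_now - stars_month_ago) / stars_month_ago


def recency_weighted_commits(commit_dates, today, half_life_days=90.0):
    """Sum of per-commit weights, where a commit's weight halves every
    `half_life_days` days (assumed decay, not a documented formula)."""
    return sum(0.5 ** ((today - d).days / half_life_days) for d in commit_dates)


def activity_score(raw: float, all_raw) -> float:
    """Map a raw score to a 0-10 scale by percentile rank: the share of
    tracked projects with a strictly lower raw score, times 10."""
    below = sum(1 for r in all_raw if r < raw)
    return 10.0 * below / len(all_raw)
```

With hypothetical inputs, `month_over_month_growth(1014, 1000)` gives a growth of about 1.4%, and a project whose raw score beats 9 of 10 tracked projects gets `activity_score` 9.0, which matches the "top 10%" reading of the scale.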
What are some alternatives?
medmcqa - A large-scale (194k), Multiple-Choice Question Answering (MCQA) dataset designed to address real-world medical entrance exam questions.
spacy-llm - 🦙 Integrating LLMs into structured NLP pipelines
hate-speech-and-offensive-language - Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017
Baichuan-13B - A 13B large language model developed by Baichuan Intelligent Technology
PLOD-AbbreviationDetection - This repository contains the PLOD Dataset for Abbreviation Detection released with our LREC 2022 publication
Baichuan-7B - A large-scale 7B pretraining language model developed by BaiChuan-Inc.
goodreads - code samples for the goodreads datasets
chatgpt-extractive-shortener - Shortens a paragraph of text with ChatGPT, using successive rounds of word-level extractive summarization.
datasets - 🎁 5,400,000+ Unsplash images made available for research and machine learning
spaCy - 💫 Industrial-strength Natural Language Processing (NLP) in Python
happy-transformer - Happy Transformer makes it easy to fine-tune and perform inference with NLP Transformer models.
ARElight - Granular Viewer of Sentiments Between Entities in Massively Large Documents and Collections of Texts, powered by AREkit