SaaSHub helps you find the best software and product alternatives Learn more →
Top 23 Python Chinese Projects
-
funNLP
中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、历史名人词库、诗词词库、医学词库、饮食词库、法律词库、汽车词库、动物词库、中文聊天语料、中文谣言数据、百度中文问答数据集、句子相似度匹配算法集合、bert资源、文本生成&摘要相关工具、cocoNLP信息抽取工具、国内电话号码正则匹配、清华大学XLORE:中英文跨语言百科知识图谱、清华大学人工智能技术系列报告、自然语言生成、NLU太难了系列、自动对联数据及机器人、用户名黑名单列表、罪名法务名词及分类模型、微信公众号语料、cs224n深度学习自然语言处理课程、中文手写汉字识别、中文自然语言处理 语料/数据集、变量命名神器、分词语料库+代码、任务型对话英文数据集、ASR 语音数据集 + 基于深度学习的中文
-
Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
Huatuo-Llama-Med-Chinese
Repo for BenTsao [original name: HuaTuo (华驼)], Instruction-tuning Large Language Models with Chinese Medical Knowledge. 本草(原名:华驼)模型仓库,基于中文医学知识的大语言模型指令微调
-
awesome-pretrained-chinese-nlp-models
Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合
-
Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
-
OFA
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
-
Cornucopia-LLaMA-Fin-Chinese
聚宝盆(Cornucopia): 中文金融系列开源可商用大模型,并提供一套高效轻量化的垂直领域LLM训练框架(Pretraining、SFT、RLHF、Quantize等)
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Qwen: https://github.com/QwenLM/Qwen
Project mention: Baichuan 7B reaches top of LLM leaderboard for it's size (New foundation model 4K tokens) | /r/LocalLLaMA | 2023-06-17GitHub: baichuan-inc/baichuan-7B: A large-scale 7B pretraining language model developed by BaiChuan-Inc. (github.com)
Could probably whip up a python script real quick with this library: https://github.com/mozillazg/python-pinyin. Probably need some extra logic to deal with heteronyms. Not sure what your goal is.
Huatuo-Llama-Med-Chinese https://github.com/SCIR-HI/Huatuo-Llama-Med-Chinese
Project mention: Would this method work to increase the memory of the model? Saving summaries generated by a 2nd model and injecting them depending on the current topic. | /r/LocalLLaMA | 2023-06-09
Project mention: Cornucopia-LLaMA-Fin-Chinese: NEW Textual - star count:263.0 | /r/algoprojects | 2023-07-31
Project mention: How to type Jyutcitzi? 【RIME keyboard installation manual】? | /r/CantoneseScriptReform | 2023-12-07Please follow instructions at https://github.com/rime/rime-cantonese/wiki and https://github.com/rime/rime-cantonese/wiki/新手安裝教程 In a nutshell, download and install using the following files: Mac: mac-2021.05.16-installer.pkg Windows: windows-sfx-2021.05.16-installer.exe Linux: Download and run ibus-install.sh Please check to ensure that RIME Cantonese is properly installed before proceeding to Step 3.
Python Chinese related posts
-
What the heck is so great about this model?
-
New open-source LLM model Qwen 72B surpasses GPT4 in 4 of 10 benchmarks
-
Qwen (通义千问) chat and pretrained large language model by Alibaba Cloud
-
Cornucopia-LLaMA-Fin-Chinese: NEW Textual - star count:263.0
-
Baichuan IA de China
-
Cornucopia-LLaMA-Fin-Chinese: NEW Textual - star count:221.0
-
Cornucopia-LLaMA-Fin-Chinese: NEW Textual - star count:221.0
-
A note from our sponsor - SaaSHub
www.saashub.com | 16 May 2024
Index
What are some of the best open-source Chinese projects in Python? This list will help you:
Project | Stars | |
---|---|---|
1 | funNLP | 64,583 |
2 | Qwen | 11,430 |
3 | chinese-xinhua | 10,668 |
4 | GPT2-Chinese | 7,366 |
5 | pkuseg-python | 6,432 |
6 | Baichuan-7B | 5,644 |
7 | 汉字拼音转换工具(Python 版) | 4,699 |
8 | Huatuo-Llama-Med-Chinese | 4,296 |
9 | awesome-pretrained-chinese-nlp-models | 4,279 |
10 | text-classification-cnn-rnn | 4,083 |
11 | Baichuan2 | 3,949 |
12 | Chinese-CLIP | 3,701 |
13 | ark-pixel-font | 2,982 |
14 | DeepKE | 3,001 |
15 | Baichuan-13B | 2,967 |
16 | OFA | 2,337 |
17 | TencentPretrain | 985 |
18 | ttskit | 997 |
19 | xpinyin | 809 |
20 | Cornucopia-LLaMA-Fin-Chinese | 547 |
21 | rime-cantonese | 501 |
22 | DataCLUE | 145 |
23 | cwg | 70 |
Sponsored