In the field of natural language processing, pretrained language models (Pretrained Language Models) have become a very important basic technology. This warehouse mainly collects some high-quality Chinese pre-training models, Chinese multi-modal models, and Chinese large language models that are currently public on the Internet. and other content (thanks to the guy who shared the resources), and will continue to update...
To download the HuggingFace warehouse model in China, it is recommended to use the HuggingFace mirror address: https://hf-mirror.com/
Change log
General basic large model
Vertical foundation large model
Universal dialogue model
Vertical dialogue model
Multimodal dialogue large model
Large model evaluation benchmark
Online experience model
Open source model library platform
Open source dataset library
Open source Chinese instruction data set
Embedding
Other-Awesome
备注
ND: Non-Causal Decoder or Prefix LM
CD: Causal Decoder
ED: Encoder-Decoder
Large-scale basic models: Only models
大于7B
parameters are listed in the table.
Model | size | time | language | field | download | Project address | Institution/Individual | Architecture | literature | Remark |
---|---|---|---|---|---|---|---|---|---|---|
XVERSE-MoE | 255B/A36B | 2024-09 | Chinese and English | Universal | ?HF | XVERSE-MoE-A36B | xverse-ai | MoE | ||
Qwen-2.5 | 0.5/1.5/3/7/14/32/72B | 2024-09 | Chinese and English | Universal | ?HF | Qwen2.5 | QwenLM | CD | Blog | |
Tele-FLM | 52B/102B/1TB | 2024-07 | Multilingual | Universal | [?HF] | / | CofeAI | CD | Tele-FLM Technical Report | |
meta-llama-3.1 | 8/70/405B | 2024-07 | Multilingual | Universal | [?HF] | llama3 | meta-llama | CD | ||
internlm2.5-Base | 7B | 2024-07 | Chinese and English | Universal | [?HF] | InternLM | InternLM | CD | Technical Report | |
MAP-NEO-Base | 2/7B | 2024-06 | Chinese and English | Universal | ?HF | MAP-NEO | multimodal-art-projection | CD | Paper | |
Nemotron-4-Base | 340B | 2024-06 | Multilingual | Universal | ?HF | / | NVIDIA | CD | technical report. | |
Index-Base | 1.9B | 2024-06 | Chinese and English | Universal | ?HF | Index-1.9B | bilibili | CD | Report | |
Qwen2-Base | 0.5/2/5/7/72B | 2024-06 | Multilingual | Universal | ?HF | Qwen2 | QwenLM | CD | Blog | |
GLM-4-Base | 9B | 2024-06 | Multilingual | Universal | ?HF | GLM-4 | THUDM | / | ||
Yi-1.5-Base | 6/9/34B | 2024-05 | Chinese and English | Universal | ?HF | Yi-1.5 | 01-ai | CD | Paper | |
DeepSeek-V2-Base | A21B/236B | 2024-05 | Chinese and English | Universal | ?HF | DeepSeek-V2 | deepseek-ai | MOE | Paper | |
Llama-3-Base | 8/70B | 2024-04 | Multilingual | Universal | ?HF | llama3 | Meta Llama | CD | ||
Zhinao-Base | 7B | 2024-04 | Chinese and English | Universal | ?HF? | / | Qihoo Technology | CD | ||
XVERSE-MoE | A4.2B/25.8B | 2024-04 | Chinese and English | Universal | ?HF | XVERSE-MoE-A4.2B | xverse-ai | MoE | ||
SoftTiger-Base | 13/70B | 2024-04 | Chinese and English | Universal | ?HF | TigerBot | TigerResearch | CD | ||
HammerLLM | 1.4b | 2024-04 | Chinese and English | Universal | ?HF | HammerLLM | DataHammer | |||
Mengzi3-Base | 13B | 2024-04 | Chinese and English | Universal | ?HF | Mengzi3 | Langboat | CD | ||
Breeze-Base | 7B | 2024-02 | Chinese and English | Universal | ?HF | / | MediaTek Research | |||
TowerBase | 7/13B | 2024-02 | Multilingual | Universal | [?HF] | / | Unbabel | CD | ||
Qwen1.5-Base | 0.5/1.8/4 7/14/32/72/110B | 2024-02 | Chinese and English | Universal | [?HF] | Qwen1.5 | Qwen | / | Blog | |
LongAlign-Base | 6/7/13B | 2024-02 | Chinese and English | Universal | [?HF] | LongAlign | THUDM | / | Paper | |
Chinese-Mixtral-Base | 8x7B | 2024-02 | Chinese and English | Universal | [Baidu] [?HF] | Chinese-Mixtral | Yiming Cui | MOE | ||
iFlytekSpark-Base | 13B | 2024-01 | Chinese and English | Universal | mindspore | / | iFlytek | CD | ||
Orion-Base | 14B | 2024-01 | Multilingual | Universal | [?HF] | Orion | OrionStarAI | CD | Paper | RAG Plugin |
YaYi2-Base | 30B | 2023-12 | Multilingual | Universal | [?HF] | YAYI2 | wenge-research | CD | Paper | |
Aquila2-Base | 7/34/70B | 2023-12 | Chinese and English | Universal | [?HF] | Aquila2 | FlagAI | CD | ||
Alaya-Base | 7B | 2023-12 | Chinese and English | Universal | [?HF] | Alaya | DataCanvas | CD | ||
Qwen-Base | 1.8/7 14/72B | 2023-12 | Chinese and English | Universal | [?HF] | Qwen | Alibaba Cloud | CD | Paper Report Report2 | |
DeepSeek-Base | 7/67B | 2023-11 | Chinese and English | Universal | [?HF] | DeepSeek-LLM | deepseek-ai | CD | ||
Yuan-2.0 | 2/51 102B | 2023-11 | Chinese and English | Universal | Baidu [?HF] | Yuan-2.0 | IEIT-Yuan | CD | ||
Alaya-Base | 7B | 2023-11 | Chinese and English | Universal | [?HF] | Alaya | DataCanvasIO | CD | ||
Yi-Base | 6/9/34B | 2023-11 | Chinese and English | Universal | [?HF] | Yi | 01.AI | CD | ||
XVERSE-Base | 7/13 65B | 2023-11 | Multilingual | Universal | [?HF] | XVERSE | Yuanxiang Technology | CD | ||
Nanbeige-Base | 16B | 2023-11 | Chinese and English | Universal | [?HF] | Nanbeige | Nanbeige LLM Lab | CD | ||
LingoWhale | 8B | 2023-11 | Chinese and English | Universal | [?HF] | LingoWhale-8B | DeepLang AI | CD | ||
Skywork-base | 13B | 2023-10 | Chinese | Universal | [?HF] | Skywork | SkyworkAI | CD | Paper | |
BlueLM-Base | 7B | 2023-11 | Chinese and English | Universal | [?HF] | BlueLM | vivo AI Lab | CD | ||
Chatglm3-base | 6B | 2023-10 | Chinese and English | Universal | [?HF] | ChatGLM3 | THUDM | ND | ||
Ziya2-Base | 13B | 2023-10 | Chinese and English | Universal | [?HF] | Fengshenbang-LM | IDEA Institute | CD | ||
OpenBA-LM | 15B | 2023-09 | Chinese and English | Universal | [?HF] | OpenBA | OpenNLG Group | ED | Paper | |
TigerBot-Base-70B | 80B | 2023-09 | Multilingual | Universal | [?HF] | TigerBot | Hubo Technology | CD | Paper | |
FLM | 101B | 2023-09 | Chinese and English | Universal | [?HF] | / | CofeAI | CD | ||
falcon | 7/40 180B | 2023-09 | Multilingual | Universal | [?HF] | / | Technology Innovation Institute | CD | ||
Baichuan2 | 7/13B | 2023-09 | Chinese | Universal | [?HF] | Baichuan2 | Baichuan Intelligence | CD | ||
Chinese-LLaMA-2-16K | 7/13B | 2023-08 | Chinese and English | Universal | [?HF] | Chinese-LLaMA-Alpaca-2 | Yiming Cui | CD | ||
YuLan-LLaMA-2 | 13B | 2023-08 | Chinese and English | Universal | [?HF] | YuLan-Chat | Renmin University of China | CD | ||
Aquila-Base-33B | 33B | 2023-08 | Chinese and English | Universal | TODO | Aquila | FlagAI | CD | ||
TigerBot-Base-13B | 13B | 2023-08 | Multilingual | Universal | [?HF] | TigerBot | Hubo Technology | CD | ||
Linly-Chinese-LLaMA-2 | 7/13B | 2023-07 | Chinese and English | Universal | [?HF] | Linly | Shenzhen University Computer Vision Institute | CD | ||
Chinese-LLaMA-2 | 7B | 2023-07 | Chinese and English | Universal | [?HF] | Chinese-LLaMA-Alpaca-2 | Yiming Cui | CD | ||
Jiang-base | 13B | 2023-07 | Chinese | Universal | [?HF] | / | Not knowing the wisdom | CD | ||
wx | 7/13B | 2023-07 | Chinese | Universal | [?HF] | / | Blue whale national number | CD | ||
Llama2 | 7/13 70B | 2023-07 | Multilingual | Universal | [?HF] | llama | Meta | CD | Paper | |
PolyLM | 13B | 2023-07 | Multilingual | Universal | [?HF] | PolyLM | Bodhidharma Academy | CD | Paper | |
Baichuan-13B | 13B | 2023-07 | Chinese | Universal | [?HF] | Baichuan-13B | Baichuan Intelligence | CD | ||
TigerBot | 7B | 2023-07 | Multilingual | Universal | [?HF] | TigerBot | Hubo Technology | CD | ||
InternLM-base | 7/20B | 2023-07 | Chinese | Universal | [?HF] | InternLM | Shanghai Artificial Intelligence Laboratory | CD | report | |
MPT | 7/30B | 2023-06 | Multilingual | Universal | [?HF] | llm-foundry | MosaicML | CD | ||
Baichuan | 7B | 2023-06 | Chinese and English | Universal | [?HF] | baichuan-7B | Baichuan Intelligence | CD | ||
Chinese-Falcon | 7B | 2023-06 | Chinese and English | Universal | [?HF] | Linly | Shenzhen University Computer Vision Institute | CD | Blog | |
AtomGPT | 13B | 2023-06 | Chinese and English | Universal | [?HF] | / | atomic echo | CD | ||
Aquila | 7B | 2023-06 | Chinese and English | Universal | [?HF] | Aquila | FlagAI | CD | ||
Chinese-LLaMA | 33B | 2023-06 | Chinese and English | Universal | [?HF] | Chinese-LLaMA-Alpaca | Yiming Cui | CD | ||
TigerBot | 7B | 2023-06 | Multilingual | Universal | [?HF] | TigerBot | Hubo Technology | CD | ||
Panda-OpenLLaMA | 7B | 2023-05 | Chinese and English | Universal | [?HF] | pandallm | dandelionsllm | CD | ||
Panda | 7/13B | 2023-05 | Chinese and English | Universal | [?HF] | pandallm | dandelionsllm | CD | ||
OpenLLaMA | 13B | 2023-05 | Chinese and English | Universal | [?HF] | Linly | Shenzhen University Computer Vision Institute | CD | ||
BiLLa-LLM | 7B | 2023-05 | Chinese and English | Universal | [?HF] | ikB | Zhongli Li | CD | ||
Ziya-LLaMA-Reward | 7B | 2023-05 | Chinese and English | Universal | [?HF] | Fengshenbang-LM | IDEA Institute | CD | ||
YuYan | 11B | 2023-04 | Chinese | Universal | [?HF] | / | NetEase Fuxi | CD | Paper | |
Chinese-LLaMA | 7/13/33B | 2023-04 | Chinese | Universal | [?HF] | Linly | Shenzhen University Computer Vision Institute | CD | Blog | |
OpenChineseLLaMA | 7B | 2023-04 | Chinese and English | Universal | [?HF] | OpenChineseLLaMA | OpenLMLab | CD | ||
MOSS-003 | 16B | 2023-04 | Chinese and English | Universal | [?HF] | MOSS | Fudan University | CD | ||
BBT-2-Text | 13B | 2023-04 | Chinese | Universal | Apply | BBT-FinCUGE-Applications | supersymmetry | CD | Paper | |
BBT-2-Text | 12B | 2023-04 | Chinese | Universal | Apply | BBT-FinCUGE-Applications | supersymmetry | CD | Paper | |
Chinese-LLaMA | 13B | 2023-04 | Chinese and English | Universal | [?HF] | Chinese-LLaMA-Alpaca | Yiming Cui | CD | ||
flan-ul2 | 20B | 2023-03 | Multilingual | Universal | [?HF] | ul2 | ED | Paper | ||
CPM-Bee | 10B | 2023-01 | Chinese and English | Universal | [?HF] | CPM-Bee | OpenBMB | CD | ||
BLOOM | 176B | 2022-11 | Multilingual | Universal | [?HF] | Megatron-DeepSpeed | BigScience | CD | Paper | |
BLOOMZ | 176B | 2022-11 | Multilingual | Universal | [?HF] | Megatron-DeepSpeed | BigScience | CD | Paper | |
flan-t5-xxl | 11B | 2022-11 | Multilingual | Universal | [?HF] | t5x | ED | paper | ||
CPM-Ant+ | 10B | 2022-10 | Chinese and English | Universal | BMB | CPM-Live | OpenBMB | CD | blog | |
GLM | 130B | 2022-10 | Chinese and English | Universal | Apply | GLM-130B | Tsinghua University | ND | paper | |
CPM-Ant | 10B | 2022-09 | Chinese | Universal | [?HF] | CPM-Live | OpenBMB | CD | blog | |
GLM | 10B | 2022-09 | Chinese | Universal | [?HF] | GLM | Tsinghua University | ND | paper | |
Source 1.0 | 245B | 2021-09 | Chinese | Universal | API | Yian-1.0 | wave | CD | paper | |
CPM-2 | 10/11/ 200B | 2021-06 | Chinese | Universal | Apply | CPM | Zhiyuan Research Institute | ED | paper | |
PanGu-Alpha | 13/200B | 2021-05 | Chinese | Universal | [?HF] | PanGu-Alpha | Pengcheng Laboratory | CD | paper | |
PLUG | 27B | 2021-04 | Chinese | Universal | Apply | AliceMind | Alibaba | ED | ||
GPT-3 | 13/30B | 2021-04 | Chinese | Universal | TODO | GPT-3 | Bodhidharma Academy | CD |
[Back to Top]
Open source basic models in various vertical fields
Model | size | time | language | field | download | Project address | Institution/Individual | Architecture | literature | Remark |
---|---|---|---|---|---|---|---|---|---|---|
Qwen-2.5 | 1.5/7B | 2024-09 | Chinese and English | code | ?HF | Qwen2.5 | QwenLM | CD | Blog | |
Qwen-2.5 | 1.5/7/72B | 2024-09 | Chinese and English | math | ?HF | Qwen2.5 | QwenLM | CD | Blog | |
Tongyi-Finance-Base | 14B | 2023-11 | Chinese | finance | ModelScope | Tongyi Finance-14B | Tongyi financial model | CD | ||
ChiMed-GPT | 13B | 2023-10 | Chinese | medical | [?HF] | ChiMed-GPT | University of Science and Technology of China | CD | Paper | |
CodeShell-base | 7B | 2023-10 | Chinese and English | code | [?HF] | codeshell | WisdomShell | CD | ||
WiNGPT-base | 7B | 2023-09 | Chinese | medicine | [?HF] | WiNGPT2 | Winning Health AI Research | CD | ||
Xuanyuan | 70B | 2023-09 | Chinese | finance | [?HF] | Xuanyuan | Du Xiaoman | CD | Report | |
CodeLLAma | 7/13/ 34B | 2023-08 | Multilingual | code | [?HF] | codellama | Meta Research | CD | Paper | |
educhat-base-002 | 7/13B | 2023-06 | Chinese and English | educate | [?HF] | EduChat | East China Normal University | CD | ||
AquilaCode-NV | 7B | 2023-06 | Chinese and English | code | [?HF] | Aquila | FlagAI | CD | ||
AquilaCode-TS | 7B | 2023-06 | Chinese and English | code | [?HF] | Aquila | FlagAI | CD | ||
LaWGPT | 7B | 2023-05 | Chinese and English | law | [?HF] | LawGPT | Pengxiao Song | CD | ||
CodeGeeX | 13B | 2022-06 | Multilingual | code | Apply | CodeGeeX | Tsinghua University | CD | blog |
[Back to Top]
Large language model with capabilities such as question answering and dialogue.
Model | size | time | language | field | download | Project address | Institution/Individual | Architecture | literature |
---|---|---|---|---|---|---|---|---|---|
Athene-V2-Chat | 72B | 2024-11 | Chinese and English | Universal | ?HF | / | Nexusflow | CD | Blog |
Athene-V2-Agent | 72B | 2024-11 | Chinese and English | Tool call | ?HF | / | Nexusflow | CD | Blog |
Hunyuan-Large | A52/389B | 2024-11 | Chinese and English | Universal | ?HF | Tencent-Hunyuan-Large | Tencent | MoE | Paper |
Aya-Expanse | 8/32B | 2024-10 | Multilingual | Universal | ?HF | / | Cohere For AI | CD | |
Granite 3.0 | 1/2/3/8B | 2024-10 | Multilingual | Universal | ?HF | granite-3.0-language-models | ibm-granite | CD | Paper |
Granite 3.0-MoE | 1B/3B/A400M | 2024-10 | Multilingual | Universal | ?HF | granite-3.0-language-models | ibm-granite | MoE | Paper |
TeleChat2 | 115B | 2024-09 | Chinese and English | Universal | ?ModelScope | TeleChat2 | Tele-AI | CD | |
Qwen-2.5 | 0.5/1.5/3/7/14/32/72B | 2024-09 | Chinese and English | Universal | ?HF | Qwen2.5 | QwenLM | CD | Blog |
XVERSE-MoE | 255B/A36B | 2024-09 | Chinese and English | Universal | ?HF | XVERSE-MoE-A36B | xverse-ai | MoE | |
DeepSeek-V2.5 | 236B/A21B | 2024-09 | Chinese and English | Universal | ?HF | DeepSeek-V2 | deepseek-ai | MOE | Paper |
MiniCPM3 | 4B | 2024-09 | Chinese and English | Universal | ?HF | MiniCPM | OpenBMB | CD | MiniCPM Paper |
C4AI Command R+ 08-2024 | 104B | 2024-08 | Multilingual | Universal | ?HF | / | CohereForAI | CD | |
JIUTIAN-Chat | 39/A13B | 2024-07 | Chinese and English | Universal | ?MS | / | China Mobile JiuTian-AI | MOE | |
meta-llama-3.1 | 8/70/405B | 2024-07 | Multilingual | Universal | [?HF] | llama3 | meta-llama | CD | |
internlm2.5-chat | 7B | 2024-07 | Chinese and English | Universal | [?HF] | InternLM | InternLM | CD | Technical Report |
Mistral-large-insruct-2407 | 123B | 2024-07 | Multilingual | Universal | ?HF | / | Mistral AI | blog post | |
DeepSeek-V2-Chat-0628 | 236B | 2024-07 | Chinese and English | Universal | ?HF | DeepSeek-V2 | deepseek-ai | MOE | Paper |
C4ai-command-r-plus | 104B | 2024-07 | Multilingual | Universal | ?HF | / | CohereForAI | CD | |
Gemma-2-chat | 9/27B | 2024-06 | Multilingual | Universal | ?HF | / | CD | ||
MAP-NEO-Chat | 2/7B | 2024-06 | Chinese and English | Universal | ?HF | MAP-NEO | multimodal-art-projection | CD | Paper |
GEB-Chat | 1.3B | 2024-06 | Chinese and English | Universal | ?HF | / | GEB-AGI | CD | Paper |
Nemotron-4-Chat | 340B | 2024-06 | Multilingual | Universal | ?HF | / | NVIDIA | CD | technical report. |
Index-Chat | 1.9B | 2024-06 | Chinese and English | Universal | ?HF | Index-1.9B | bilibili | CD | Report |
Qwen2-MoE | 57B/A14B | 2024-06 | Multilingual | Universal | ?HF | Qwen2 | QwenLM | MoE | Blog |
Qwen2-Chat | 0.5/2/5/7/72B | 2024-06 | Multilingual | Universal | ?HF | Qwen2 | QwenLM | CD | Blog |
GLM-4-Chat | 9B | 2024-06 | Multilingual | Universal | ?HF | GLM-4 | THUDM | / | |
Skywork-MoE | 16/A22B/146B | 2024-06 | Chinese and English | Universal | ?HF | Skywork-MoE | SkyworkAI | MoE | Tech Report |
Yuan2.0 | 40/A3.7B | 2024-05 | Chinese and English | Universal | ?HF | Yuan2.0-M32 | IEIT-Yuan | MOE | Paper |
Star-Chat | 52B | 2024-05 | Chinese and English | Universal | ?HF | TeleChat-52B | Tele-AI | CD | |
LingLong | 317M | 2024-05 | Chinese and English | Universal | ?HF | linglong | nkcs-iclab | CD | |
Sailor | 14B | 2024-05 | 7 languages | Universal | ?HF | sailor-llm | sail-sg | CD | Paper |
Nanbeige2 | 8/16B | 2024-05 | Chinese and English | Universal | ?HF | Nanbeige | Nanbeige | CD | |
Yi-1.5-Chat | 6/9/34B | 2024-05 | Chinese and English | Universal | ?HF | Yi-1.5 | 01-ai | CD | Paper |
DeepSeek-V2-Chat | A21B/236B | 2024-05 | Chinese and English | Universal | ?HF | DeepSeek-V2 | deepseek-ai | MOE | Paper |
XVERSE-MoE | A4.2B/25.8B | 2024-05 | Chinese and English | Universal | ?HF | XVERSE-MoE-A4.2B | xverse-ai | MOE | |
Llama3-zh | 8/70B | 2024-04 | Chinese and English | Universal | ?HF | / | / | CD | llama3 Chinese list |
Llama3-Chinese-Chat | 8B | 2024-04 | Chinese and English | Universal | ?HF | / | Shenzhi Wang | CD | |
Llama-3-Chat | 8/70B | 2024-04 | Multilingual | Universal | ?HF | llama3 | Meta Llama | CD | |
Zhinao-Chat | 7B | 2024-04 | Chinese and English | Universal | ?HF? | / | Qihoo Technology | CD | |
MiniCPM-MoE | 8x2B | 2024-04 | Chinese and English | Universal | ?HF | MiniCPM | OpenBMB | MoE | |
Nanbeige2-Chat | 8B | 2024-04 | Chinese and English | Universal | ?HF | Nanbeige | Nanbeige LLM Lab | CD | |
Sailor | 7B | 2024-04 | Multilingual | Universal | ?HF | sailor-llm | Sea AI Lab | CD | Paper |
Mengzi3-Chat | 13B | 2024-04 | Chinese and English | Universal | ?HF | Mengzi3 | Langboat | CD | |
Qwen-MoE | 2.7B | 2024-03 | Chinese and English | Universal | ?HF | Qwen1.5 | Qwen | MoE | Blog |
Command-R | 35B | 2024-03 | Multilingual | Universal | ?HF | / | CohereForAI | CD | |
Breeze-Instruct | 7B | 2024-02 | Chinese and English | Universal | ?HF | / | MediaTek Research | ||
aya-101 | 13B | 2024-02 | Multilingual | Universal | ?HF | / | Cohere For AI | CD | Paper |
ChemLLM | 7B | 2024-02 | Multilingual | Universal | ?HF | / | AI4Chem | CD | Paper |
TowerInstruct | 7/13B | 2024-02 | Multilingual | Universal | [?HF] | / | Unbabel | CD | |
Qwen1.5-Chat | 0.5/1.8/4/ 7/14/32/72/110B | 2024-02 | Chinese and English | Universal | [?HF] | Qwen1.5 | Qwen | / | Blog |
MiniCPM | 2B | 2024-02 | Chinese and English | Universal | [?HF] ModelScope | MiniCPM | OpenBMB | / | Report |
LongAlign-Chat | 6/7/13B | 2024-02 | Chinese and English | Universal | [?HF] | LongAlign | THUDM | / | Paper |
Chinese-Mixtral-Chat | 8x7B | 2024-02 | Chinese and English | Universal | [Baidu] [?HF] | Chinese-Mixtral | Yiming Cui | MOE | |
iFlytekSpark-Chat | 13B | 2024-01 | Chinese and English | Universal | mindspore | / | iFlytek | CD | |
rwkv-5-world | 0.1/1/ 3/7B | 2023-01 | Multilingual | Universal | [?HF] | RWKV-LM | BlinkDL | URL | |
Orion-Chat | 14B | 2024-01 | Multilingual | Universal | [?HF] | Orion | OrionStarAI | CD | Paper |
internlm2-chat | 7/20B | 2024-01 | Chinese and English | Universal | [?HF] | InternLM | InternLM | CD | Report |
Chinese-Mixtral | 8x7B | 2023-01 | Chinese and English | Universal | [?HF] | / | HIT-SCIR | CD-MOE | |
Telechat | 7/12B | 2024-01 | Chinese and English | Universal | [?HF] | Telechatx | Tele-AI | CD | Report |
kagentlms | 7/13B | 2024-01 | Chinese and English | Universal | [?HF] | KwaiAgents | KwaiKEG | ||
YaYi2-Chat | 30B | 2023-12 | Multilingual | Universal | [?HF] | YAYI2 | wenge-research | CD | Paper |
SUS-Chat | 34/72B | 2023-12 | Chinese and English | Universal | [?HF] | SUS-Chat | SUSTech-IDEA | CD | |
Aquila2-Chat | 7/34/70B | 2023-12 | Chinese and English | Universal | [?HF] | Aquila2 | FlagAI | CD | |
Alaya-Chat | 7B | 2023-12 | Chinese and English | Universal | [?HF] | Alaya | DataCanvas | CD | |
Qwen-Chat | 1.8/7/ 14/72B | 2023-12 | Chinese and English | Universal | [?HF] | Qwen | Alibaba Cloud | CD | Paper Report Report2 |
DeepSeek-Chat | 7/67B | 2023-11 | Chinese and English | Universal | [?HF] | DeepSeek-LLM | deepseek-ai | CD | |
Yi-Chat | 6/34B | 2023-11 | Chinese and English | Universal | [?HF] | Yi | 01.AI | CD | |
Alaya-Chat | 7B | 2023-11 | Chinese and English | Universal | [?HF] | Alaya | DataCanvasIO | CD | |
OrionStar-Yi-Chat | 34B | 2023-11 | Chinese and English | Universal | [?HF] | OrionStar-Yi-34B-Chat | OrionStarAI | CD | |
Nanbeige-Chat | 16B | 2023-11 | Chinese and English | Universal | [?HF] | Nanbeige | Nanbeige LLM Lab | CD | |
OpenChat 3.5 | 7B | 2023-11 | Chinese and English | Universal | [?HF] | openchat | OpenChat | CD | Paper |
XVERSE-Chat | 7/13B | 2023-11 | Multilingual | Universal | [?HF] | XVERSE | Yuanxiang Technology | CD | |
AndesGPT | 7B | 2023-11 | Chinese | Universal | [?HF] | AndesGPT-7B | OPPO-Mente-Lab | CD | |
SeaLLM-Chat | 13B | 2023-11 | Multilingual | Universal | [?HF] | SeaLLMs | SeaLLMs | CD | |
BlueLM | 7B | 2023-11 | Chinese and English | Universal | [?HF] | BlueLM | vivo AI Lab | CD | |
Skywork-chat | 13B | 2023-10 | Chinese | Universal | [?HF] | Skywork | SkyworkAI | CD | Paper |
Zephyr | 7B | 2023-10 | Multilingual | Universal | [?HF] | alignment-handbook | Hugging Face H4 | CD | Paper |
Mistral | 7B | 2023-10 | Multilingual | Universal | [?HF] | mistral-src | Mistral AI | CD | Paper |
chatglm3 | 6B | 2023-10 | Chinese and English | Universal | [?HF] | ChatGLM3 | THUDM | ND | |
Zhiyin-chat | 7B | 2023-10 | Chinese and English | Universal | [?HF] | Zhiyin | Institute of Acoustics, Chinese Academy of Sciences | CD | |
Ziya2-Chat | 13B | 2023-10 | Chinese and English | Universal | [?HF] | Fengshenbang-LM | IDEA Institute | CD | |
Vulture | 40/180B | 2023-10 | Multilingual | Universal | [?HF] | / | VILM-AI | TODO | |
Vulture | 3/7/ 40/180B | 2023-09 | Multilingual | Universal | [?HF] | / | VILM | CD | |