European OpenLLM Projects
One LLM to free them all ;-)
Did you know that English tokens represent more than 90 % of generalist LLMs training data ?
We’re OpenLLM Europe ??, an Open Source community committed to empower LLM projects in all European languages, specifically medium and low-resource languages. We aim to build the first multimodal multilingual european model with partners all over the continent.
- OpenLLM-Europe ??
- Discord: https://discord.com/invite/b5UQTWQn
- Contact:[email protected] - https://github.com/OpenLLM-Europe
Our work is 100% open and fits in with ALT-EDIC's mission, which you can discover here: https://language-data-space.ec.europa.eu/related-initiatives/alt-edic_en
- ALT = for Alliance for Language Technologies EDIC
- EDIC = European Digital Infrastructure Consortium
The mission of the ALT-EDIC is to develop a common European infrastructure in Language Technologies, focussing particularly on Large Language Models. It seeks to improve European competitiveness, increase the availability of European language data and uphold Europe's linguistic diversity and cultural richness. The ALT-EDIC is a multi-country project, run and funded by the Member States who have agreed to join it. By pooling resources, the members should achieve the critical mass of data and other resources needed to create and finetune Large Language Models, which any single member would find difficult to do alone.
OpenLLM Europe ?? is thus making its contribution to identifying and attempting to federate national initiatives to create LLMs or learning datasets. Our goal is to federate, create together & promote open source and sovereign Generative AI digital commons.
Here is a list of Open Source projects in AI (mostly LLMs) that we have gathered during our research.
Feel free to use it to build great things together.
Feel free to amen it and add projects that we missed. PR are welcome !
Feel free to join our Discord server
Bulgarian initiatives ?? :
Croatian initiatives ?? :
- CroAI - https://www.linkedin.com/posts/croai_large-language-models-have-demonstrated-impressive-activity-7167796231417520128-AlDs/
Czech initiatives ?? :
Danish initiatives ?? :
- Danish foundation models - https://www.linkedin.com/in/saattrupdan/
- Danskgpt - Contact:[email protected]
Duch initiatives ?? :
English initiatives ?? :
- Stability AI - Multilingual ? - https://stability.ai/contact
- NOUS Research - Contact:[email protected]
Estonian intiatives ?? :
Finnish initiatives ?? :
French initiatives ?? :
- Le Bon LLM - https://www.linkedin.com/company/le-bon-llm/
- OpenLLM France - Contact:[email protected] - https://www.openllm-france.fr
German initiatives ?? :
Greek initiatives ?? :
Hungarian initiatives ?? :
Irish initiatives ?? :
Italian initiatives ?? :
Latvian initiatives ?? :
Lithuanian initiatives ?? :
- EMBEDDIA - Contact:[email protected]
- Tilde AI powered langage technologies - Contact:https://www.linkedin.com/in/andrejs-vasiljevs/
Maltese initiatives ?? :
- BERTu - https://www.linkedin.com/in/claudia-borg-ai/
Norvegian initiatives ??:
Polish initiatives ?? :
Portuguese initiavtives ?? :
Romanian initiatives ?? :
Serbian initiatives?? :
- Serbian LLM - Serbian ?? - https://www.linkedin.com/in/aleksagordic/
Slovak initiatives ?? :
- KInit - https://www.linkedin.com/in/juraj-bezdek-6b521346/
- Blip.solution - Contact:[email protected]
Slovenian initiatives ?? :
Spanish initiatives ?? :
Swedish initiatives ?? :
Multilingual European Open Source project ?
- HPLT - Contact:[email protected]
- Unbabel - Contact:https://communityonboarding.unbabel.com/signup/step/0
- Occiglot - Contact:brack.cs.tu-darmstadt.de
- TrustLLM - Contact:[email protected]
- Luxembourg Institute of Science and technology - Luxembourg ?? - Contact:[email protected]
- Sosnitskij - https://www.linkedin.com/in/said-azizov-6b5a82256/
- Evidently AI - Multilingual ? - Discord: https://discord.gg/evidentlyai - Contact:[email protected]
- YugoGPT - ?????????? - Discord: https://discord.gg/yugogpt - https://www.linkedin.com/in/aleksagordic/
Open Source LLM projects based outside of Europe
- LangFuse US project using european languages ?? - Contact:[email protected]
- Sayhan - Turkish ?? - https://www.linkedin.com/in/sayhan-yalva%C3%A7er-0617641b1/
- Sestek - Turkish ?? - Contact:[email protected]
- AI Forever - Armenian ?? - https://www.linkedin.com/in/said-azizov-6b5a82256/
- Yandex YaLM 100B - Russian and English ???? - Contact:[email protected]
- EleutherAI - International collaboration using english ? - Discord:https://discord.com/invite/zBGx3azzUn - Contact:[email protected]