SaaSHub helps you find the best software and product alternatives Learn more →
Llm-colosseum Alternatives
Similar projects and alternatives to llm-colosseum based on common topics and language
-
instinct.cpp
instinct.cpp is a framework for developing AI Agent applications (RAG, Chatbot, Code interpreter) powered by language models.
-
enterprise-h2ogpte
Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
indonlu
The first-ever vast natural language processing benchmark for Indonesian Language. We provide multiple downstream tasks, pre-trained IndoBERT models, and a starter code! (AACL-IJCNLP 2020)
-
SKAB
SKAB - Skoltech Anomaly Benchmark. Time-series data for evaluating Anomaly Detection algorithms.
-
Awesome_Satellite_Benchmark_Datasets
Supplementary material for our paper "THERE IS NO DATA LIKE MORE DATA" is provided.
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
llm-colosseum reviews and mentions
- LLM Colosseum
- Evaluate LLMs in Real Time with Street Fighter III
-
LLM Colosseum: Make LLMs fight in SFIII
Hello guys,
Tired of current boring LLMs benchmark ? I'm sharing with you a fun project built during the Mistral AI SF hackathon.
Using a RL framework, we made LLMs fight against each other in real time in Street Fighter III. You can find the repo here : https://github.com/OpenGenerativeAI/llm-colosseum.
Aside from the fact that it's very funny to see Mistral and others performing Hadouken, we found that it is a great way to benchmark language models. They need to quickly understand their environment and take actions accordingly.
With >400 fights, check out the ELO ranking on the HF space here : https://huggingface.co/spaces/junior-labs/llm-colosseum
-
A note from our sponsor - SaaSHub
www.saashub.com | 27 Apr 2024
Stats
OpenGenerativeAI/llm-colosseum is an open source project licensed under MIT License which is an OSI approved license.
The primary programming language of llm-colosseum is Jupyter Notebook.
Sponsored