All Model Leaderboards (that I know)

This page summarizes the projects mentioned and recommended in the original post on /r/LocalLLaMA.

  • llm-leaderboard

    A joint community effort to create one central leaderboard for LLMs.

  • llm-humaneval-benchmarks

  • chain-of-thought-hub

    Benchmarking large language models' complex reasoning ability with chain-of-thought prompting

  • Chain-of-Thought Hub https://github.com/FranxYao/chain-of-thought-hub - these results are mostly gathered, although Yao Fu, the author, is working on specific CoT runs

  • llm-jeopardy

    Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives. Hence, a higher number means a more popular project.

Related posts

  • Is the ChatGPT and Bing AI boom already over?

    1 project | news.ycombinator.com | 2 Sep 2023
  • Meta is preparing to launch a new open source coding model, dubbed Code Llama, that may release as soon as next week

    1 project | /r/LocalLLaMA | 20 Aug 2023
  • GPT-3.5 and GPT-4 performance in Open LLM Leaderboard tests?

    1 project | /r/LLMDevs | 5 Jun 2023
  • PullRequestBenchmark Challenge: Can AI Replace Your Dev Team?

    1 project | news.ycombinator.com | 10 Apr 2024
  • PRBenchmark – Expert PR Review Capabilities Equals Expert PR Creation Capability

    1 project | news.ycombinator.com | 5 Apr 2024