[P] AlpacaEval : An Automatic Evaluator for Instruction-following Language Models

This page summarizes the projects mentioned and recommended in the original post on /r/MachineLearning

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • alpaca_eval

    An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

  • an automatic evaluator that is easy to use, fast, cheap and validated against 20K human annotations. It actually has a higher agreement with majority vote of humans than a single human annotator! Of course, our method still has limitations which we discuss here!

  • alpaca_farm

    A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.

  • AlpacaEval dataset: 805 instructions, which are a simplification of AlpacaFarm's evaluation set.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Python Notebooks for Fundamentals of Music Processing

    5 projects | news.ycombinator.com | 2 Jun 2024
  • RAG with Groq and Llama 3

    1 project | news.ycombinator.com | 31 May 2024
  • Enhancing Data Security with Role-Based Access Control of Qdrant Vector Database

    1 project | dev.to | 31 May 2024
  • Open Sustainable Technology

    1 project | news.ycombinator.com | 30 May 2024
  • Explaining in Style: Training a GAN to Explain a Classifier in StyleSpace

    1 project | news.ycombinator.com | 30 May 2024