[D] Is all the talk about what GPT can do on Twitter and Reddit exaggerated or fairly accurate?

This page summarizes the projects mentioned and recommended in the original post on /r/MachineLearning

Scout Monitoring - Free Django app performance insights with Scout Monitoring
Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.
www.scoutapm.com
featured
InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
  • JARVIS

    JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

  • https://github.com/microsoft/JARVIS this has that capability as far as I know, I don’t know the individual hugging face model it utilizes for whatever task because there are a ton

  • guardrails

    Adding guardrails to large language models.

  • not vouching for it, but I know this is at least a thing that exists and I like the general idea: https://github.com/shreyar/guardrails

  • Scout Monitoring

    Free Django app performance insights with Scout Monitoring. Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.

    Scout Monitoring logo
  • TruthfulQA

    TruthfulQA: Measuring How Models Imitate Human Falsehoods

  • I agree they show that you can brute-force mimick uncertainty estimates to some degree, and that the model is generally well calibrated (though on what is basically a set of trivia questions, so YMMV)... yet:

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Guardrails AI

    1 project | news.ycombinator.com | 30 Dec 2023
  • Does anyone have an example of a langchain based customer facing agent like a cashier/waitress?

    1 project | /r/LangChain | 28 Jul 2023
  • Ask HN: People who were laid off or quit recently, how are you doing?

    3 projects | news.ycombinator.com | 20 Apr 2023
  • For consistent output from vicuna 13b

    1 project | /r/learnmachinelearning | 9 Apr 2023
  • GitHub – ShreyaR/guardrails: Adding guardrails to large language models

    1 project | news.ycombinator.com | 14 Mar 2023