DeepEval – Unit Testing for LLMs

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com.

  • deepeval

    Discontinued: Unit Testing For LLMs [Moved to: https://github.com/confident-ai/deepeval] (by mr-gpt)

  • bettertest

  • agentops

    Open-source Python SDK for agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks such as CrewAI, LangChain, and AutoGen.

  • promptfoo

    Test your prompts, models, and RAGs. Catch regressions and improve prompt quality. LLM evals for OpenAI, Azure, Anthropic, Gemini, Mistral, Llama, Bedrock, Ollama, and other local & private models with CI/CD integration.

  • agenta

    The all-in-one LLM developer platform: prompt management, evaluation, human feedback, and deployment all in one place.

    I'd add ours too, although we're trying to be an end-to-end one-stop platform.

    https://github.com/agenta-ai/agenta

  • ai-notes

    Notes for software engineers getting up to speed on new AI developments. Serves as a datastore for https://latent.space writing and product brainstorming, with cleaned-up canonical references under the /Resources folder.

    Added to my notes! https://github.com/swyxio/ai-notes/

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives, so a higher number means a more popular project.

Related posts

  • RAG for Medical Research

    1 project | news.ycombinator.com | 21 Oct 2023
  • Patterns for Building LLM-Based Systems and Products

    6 projects | news.ycombinator.com | 1 Aug 2023
  • Talkd/dialog open source project has been selected for 2024 GitHub Accelerator

    1 project | news.ycombinator.com | 26 May 2024
  • talkd.ai got accepted into the Github Accelerator! (also our first official release)

    2 projects | dev.to | 23 May 2024
  • Multi AI Agent Systems Using OpenAI's New GPT-4o Model

    4 projects | news.ycombinator.com | 17 May 2024