Python llmops

Open-source Python projects categorized as llmops

Top 23 Python llmops Projects

  • jina

    ☁️ Build multimodal AI applications with cloud-native stack

  • Project mention: Jina.ai: Self-host Multimodal models | news.ycombinator.com | 2024-01-26
  • vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

  • Project mention: AI leaderboards are no longer useful. It's time to switch to Pareto curves | news.ycombinator.com | 2024-04-30

    I guess the root cause of my claim is that OpenAI won't tell us whether or not GPT-3.5 is an MoE model, and I assumed it wasn't. Since GPT-3.5 is clearly nondeterministic at temp=0, I believed the nondeterminism was due to FPU stuff, and this effect was amplified with GPT-4's MoE. But if GPT-3.5 is also MoE then that's just wrong.

    What makes this especially tricky is that small models are truly 100% deterministic at temp=0 because the relative likelihoods are too coarse for FPU issues to be a factor. I had thought 3.5 was big enough that some of its token probabilities were too fine-grained for the FPU. But that's probably wrong.

    On the other hand, it's not just GPT, there are currently floating-point difficulties in vllm which significantly affect the determinism of any model run on it: https://github.com/vllm-project/vllm/issues/966 Note that a suggested fix is upcasting to float32. So it's possible that GPT-3.5 is using an especially low-precision float and introducing nondeterminism by saving money on compute costs.

    Sadly I do not have the money[1] to actually run a test to falsify any of this. It seems like this would be a good little research project.

    [1] Or the time, or the motivation :) But this stuff is expensive.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • OpenLLM

    Run any open-source LLMs, such as Llama 2, Mistral, as OpenAI compatible API endpoint in the cloud.

  • Project mention: First 15 Open Source Advent projects | dev.to | 2023-12-15

    13. OpenLLM by BentoML | Github | tutorial

  • BentoML

    The most flexible way to serve AI/ML models in production - Build Model Inference Service, LLM APIs, Inference Graph/Pipelines, Compound AI systems, Multi-Modal, RAG as a Service, and more!

  • Project mention: Who's hiring developer advocates? (December 2023) | dev.to | 2023-12-04

    Link to GitHub -->

  • ragflow

    RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

  • Project mention: DeepSeek-V2 integrated, RAGFlow v0.5.0 is released | news.ycombinator.com | 2024-05-07
  • phidata

    Memory, knowledge and tools for LLMs

  • Project mention: Phidata: Add memory, knowledge and tools to LLMs | news.ycombinator.com | 2024-05-06
  • ragas

    Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines

  • Project mention: Show HN: Ragas – the de facto open-source standard for evaluating RAG pipelines | news.ycombinator.com | 2024-03-21

    congrats on launching! i think my continuing struggle with looking at Ragas as a company rather than an oss library is that the core of it is like 8 metrics (https://github.com/explodinggradients/ragas/tree/main/src/ra...) that are each 1-200 LOC. i can inline that easily in my app and retain full control, or model that in langchain or haystack or whatever.

    why is Ragas a library and a company, rather than an overall "standard" or philosophy (eg like Heroku's 12 Factor Apps) that could maybe be more robust?

    (just giving an opp to pitch some underappreciated benefits of using this library)

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  • zenml

    ZenML πŸ™: Build portable, production-ready MLOps pipelines. https://zenml.io.

  • Project mention: FLaNK AI - 01 April 2024 | dev.to | 2024-04-01
  • giskard

    🐒 Open-Source Evaluation & Testing for LLMs and ML models

  • Project mention: Show HN: Evaluate LLM-based RAG Applications with automated test set generation | news.ycombinator.com | 2024-04-11
  • llm-app

    LLM App templates for RAG, knowledge mining, and stream analytics. Ready to run with Docker,⚑in sync with your data sources.

  • Project mention: How to use LLMs for real-time alerting | dev.to | 2023-11-20

    Answering queries and defining alerts: Our application running on Pathway LLM-App exposes the HTTP REST API endpoint to send queries and receive real-time responses. It is used by the Streamlit UI app. Queries are answered by looking up relevant documents in the index, as in the Retrieval-augmented generation (RAG) implementation. Next, queries are categorized for intent: an LLM probes them for natural language commands synonymous with notify or send an alert.

  • AGiXT

    AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.

  • Project mention: Conversational "memory loss"? | /r/LocalLLaMA | 2023-07-07

    If you are more interested in AI assistants check out AGiXT. It has some really cool features but it is under heavy development. Not everything works jet and updates break sometimes already working functions. But it is still far better than babyAGI and other proof of concepts.

  • uptrain

    UpTrain is an open-source unified platform to evaluate and improve Generative AI applications. We provide grades for 20+ preconfigured checks (covering language, code, embedding use-cases), perform root cause analysis on failure cases and give insights on how to resolve them.

  • Project mention: A Developer's Guide to Evaluating LLMs! | dev.to | 2024-05-14

    You can create an account with UpTrain and generate the API key for free. Please visit https://uptrain.ai/

  • openllmetry

    Open-source observability for your LLM application, based on OpenTelemetry

  • Project mention: FLaNK-AIM Weekly 13 May 2024 | dev.to | 2024-05-13
  • cognita

    RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry

  • Project mention: FLaNK AI Weekly for 29 April 2024 | dev.to | 2024-04-29
  • LLMStack

    No-code platform to build LLM Agents, workflows and applications with your data

  • Project mention: Vanna.ai: Chat with your SQL database | news.ycombinator.com | 2024-01-14

    We have recently added support to query data from SingleStore to our agent framework, LLMStack (https://github.com/trypromptly/LLMStack). Out of the box performance performance when prompting with just the table schemas is pretty good with GPT-4.

    The more domain specific knowledge needed for queries, the harder it has gotten in general. We've had good success `teaching` the model different concepts in relation to the dataset and giving it example questions and queries greatly improved performance.

  • lanarky

    The web framework for building LLM microservices

  • Project mention: Lanarky: Deploy LLM applications in production, built on FastAPI | news.ycombinator.com | 2023-06-10
  • llm-guard

    The Security Toolkit for LLM Interactions

  • Project mention: llm-guard: The Security Toolkit for LLM Interactions | /r/blueteamsec | 2023-09-19
  • agenta

    The all-in-one LLM developer platform: prompt management, evaluation, human feedback, and deployment all in one place.

  • Project mention: Top Open Source Prompt Engineering Guides & ToolsπŸ”§πŸ—οΈπŸš€ | dev.to | 2024-05-02

    Agenta is an end-to-end LLMOps platform. It provides tools for prompt engineering and management, evaluation, human annotation, and deployment.

  • langcorn

    ⛓️ Serving LangChain LLM apps and agents automagically with FastApi. LLMops

  • NeumAI

    Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.

  • Project mention: Show HN: Neum AI – Open-source large-scale RAG framework | news.ycombinator.com | 2023-11-21

    Interesting to see that the semantic chunking in the tools library is a wrapper around GPT-4. Asks GPT for the python code and executes it: https://github.com/NeumTry/NeumAI/blob/main/neumai-tools/neu...

  • DataDreamer

    DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. β€€ πŸ€–πŸ’€

  • Project mention: FLaNK AI - 01 April 2024 | dev.to | 2024-04-01
  • llmflows

    LLMFlows - Simple, Explicit and Transparent LLM Apps

  • Project mention: Show HN: LLMFlows – LangChain alternative for explicit and transparent apps | news.ycombinator.com | 2023-07-29
  • burr

    Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, persist, and execute on your own infrastructure.

  • Project mention: Building an Email Assistant Application with Burr | dev.to | 2024-04-26

    Burr is a lightweight python library you use to build applications as state machines. You construct your application out of a series of actions (these can be either decorated functions or objects), which declare inputs from state, as well as inputs from the user. These specify custom logic (delegating to any framework), as well as instructions on how to update state. State is immutable, which allows you to inspect it at any given point. Burr handles orchestration, monitoring and persistence.

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python llmops related posts

  • A Developer's Guide to Evaluating LLMs!

    1 project | dev.to | 14 May 2024
  • Should I add CLA to my Open-source project?

    2 projects | news.ycombinator.com | 4 May 2024
  • Pydantic Logfire

    7 projects | news.ycombinator.com | 30 Apr 2024
  • AI leaderboards are no longer useful. It's time to switch to Pareto curves

    1 project | news.ycombinator.com | 30 Apr 2024
  • Show HN: Cognita – open-source RAG framework for modular applications

    3 projects | news.ycombinator.com | 27 Apr 2024
  • Show HN: OpenLIT – Open-Source LLM Observability with OpenTelemetry

    1 project | news.ycombinator.com | 27 Apr 2024
  • Building an Email Assistant Application with Burr

    6 projects | dev.to | 26 Apr 2024
  • A note from our sponsor - SaaSHub
    www.saashub.com | 18 May 2024
    SaaSHub helps you find the best software and product alternatives Learn more β†’

Index

What are some of the best open-source llmops projects in Python? This list will help you:

Project Stars
1 jina 20,121
2 vllm 19,344
3 OpenLLM 8,920
4 BentoML 6,603
5 ragflow 7,404
6 phidata 6,023
7 ragas 4,874
8 zenml 3,685
9 giskard 3,192
10 llm-app 2,526
11 AGiXT 2,469
12 uptrain 2,015
13 openllmetry 1,328
14 cognita 1,320
15 LLMStack 1,140
16 lanarky 942
17 llm-guard 870
18 agenta 865
19 langcorn 822
20 NeumAI 785
21 DataDreamer 667
22 llmflows 621
23 burr 463

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com