Python generative-ai

Open-source Python projects categorized as generative-ai

Top 23 Python generative-ai Projects

  • LLaMA-Factory

    Unify Efficient Fine-Tuning of 100+ LLMs

  • Project mention: FLaNK-AIM Weekly 06 May 2024 | dev.to | 2024-05-06
  • jina

    ☁️ Build multimodal AI applications with cloud-native stack

  • Project mention: Jina.ai: Self-host Multimodal models | news.ycombinator.com | 2024-01-26
  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • haystack

    :mag: LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

  • Project mention: Haystack DB – 10x faster than FAISS with binary embeddings by default | news.ycombinator.com | 2024-04-28

    I was confused for a bit but there is no relation to https://haystack.deepset.ai/

  • NeMo

    A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

  • Project mention: [P] Making a TTS voice, HK-47 from Kotor using Tortoise (Ideally WaveRNN) | /r/MachineLearning | 2023-07-06

    I don't test WaveRNN but from the ones that I know the best that is open source is FastPitch. And it's easy to use, here is the tutorial for voice cloning.

  • BentoML

    The most flexible way to serve AI/ML models in production - Build Model Inference Service, LLM APIs, Inference Graph/Pipelines, Compound AI systems, Multi-Modal, RAG as a Service, and more!

  • Project mention: Who's hiring developer advocates? (December 2023) | dev.to | 2023-12-04

    Link to GitHub -->

  • krita-ai-diffusion

    Streamlined interface for generating images with AI in Krita. Inpaint and outpaint with optional text prompt, no tweaking required.

  • Project mention: A quick Krita/ComfyUI LCM live painting tip | /r/StableDiffusion | 2023-12-08

    I have been playing a lot with Krita's SD plugin https://github.com/Acly/krita-ai-diffusion - that uses ComfyUI as it's API source.

  • TaskingAI

    The open source platform for AI-native application development.

  • Project mention: TaskingAI: AI-native app development platform | news.ycombinator.com | 2024-01-30
  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  • llmware

    Providing enterprise-grade LLM-based development framework, tools, and fine-tuned models.

  • Project mention: More Agents Is All You Need: LLMs performance scales with the number of agents | news.ycombinator.com | 2024-04-06

    I couldn't agree more. You should check out LLMWare's SLIM agents (https://github.com/llmware-ai/llmware/tree/main/examples/SLI...). It's focusing on pretty much exactly this and chaining multiple local LLMs together.

    A really good topic that ties in with this is the need for deterministic sampling (I may have the terminology a bit incorrect) depending on what the model is indended for. The LLMWare team did a good 2 part video on this here as well (https://www.youtube.com/watch?v=7oMTGhSKuNY)

    I think dedicated miniture LLMs are the way forward.

    Disclaimer - Not affiliated with them in any way, just think it's a really cool project.

  • h2o-llmstudio

    H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://h2oai.github.io/h2o-llmstudio/

  • Project mention: Paid dev gig: develop a basic LLM PEFT finetuning utility | /r/LocalLLaMA | 2023-06-02
  • jupyter-ai

    A generative AI extension for JupyterLab

  • Project mention: 🪄 Put magic in your Notebook w/ Jupyter-AI | dev.to | 2024-02-14

    This notebook is dedicated to a (not so) short jupyterlab/jupyter-ai unboxing so anyone can enjoy this kind of magic (and much much more):

  • xTuring

    Build, customize and control you own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/TgHXuSJEk6

  • Project mention: I'm developing an open-source AI tool called xTuring, enabling anyone to construct a Language Model with just 5 lines of code. I'd love to hear your thoughts! | /r/machinelearningnews | 2023-09-07

    Explore the project on GitHub here.

  • YiVal

    Your Automatic Prompt Engineering Assistant for GenAI Applications

  • Project mention: YiVal——Unlocking Your Data's Power to Create Customized GenAI Apps | /r/u_YiVal | 2023-11-16

    - 🤖Github:https://github.com/YiVal/YiVal/pull/189

  • dbrx

    Code examples and resources for DBRX, a large language model developed by Databricks

  • Project mention: Hello OLMo: A Open LLM | news.ycombinator.com | 2024-04-08

    One thing I wanted to add and call attention to is the importance of licensing in open models. This is often overlooked when we blindly accept the vague branding of models as “open”, but I am noticing that many open weight models are actually using encumbered proprietary licenses rather than standard open source licenses that are OSI approved (https://opensource.org/licenses). As an example, Databricks’s DBRX model has a proprietary license that forces adherence to their highly restrictive Acceptable Use Policy by referencing a live website hosting their AUP (https://github.com/databricks/dbrx/blob/main/LICENSE), which means as they change their AUP, you may be further restricted in the future. Meta’s Llama is similar (https://github.com/meta-llama/llama/blob/main/LICENSE ). I’m not sure who can depend on these models given this flaw.

  • SDV

    Synthetic data generation for tabular data

  • Project mention: Synthetic data generation for tabular data | news.ycombinator.com | 2024-02-27

    Can someone help me understand the licensing of this?

    https://github.com/sdv-dev/SDV/blob/main/LICENSE

    It was MIT licensed up until 2022 where it was changed to what it is now, where they say that it will become MIT again 4 years after release... but is that from when the license was changed or the first release of the software in GitHub?

  • coffee

    Build and iterate on your UI 10x faster with AI - right from your own IDE ☕️

  • Project mention: AI Grant Traction in OSS Startups | dev.to | 2024-02-01

    Coframe

  • openllmetry

    Open-source observability for your LLM application, based on OpenTelemetry

  • Project mention: FLaNK-AIM Weekly 13 May 2024 | dev.to | 2024-05-13
  • PyRIT

    The Python Risk Identification Tool for generative AI (PyRIT) is an open access automation framework to empower security professionals and machine learning engineers to proactively find risks in their generative AI systems. (by Azure)

  • Project mention: FLaNK 04 March 2024 | dev.to | 2024-03-04
  • cognita

    RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry

  • Project mention: FLaNK AI Weekly for 29 April 2024 | dev.to | 2024-04-29
  • LLMStack

    No-code platform to build LLM Agents, workflows and applications with your data

  • Project mention: Vanna.ai: Chat with your SQL database | news.ycombinator.com | 2024-01-14

    We have recently added support to query data from SingleStore to our agent framework, LLMStack (https://github.com/trypromptly/LLMStack). Out of the box performance performance when prompting with just the table schemas is pretty good with GPT-4.

    The more domain specific knowledge needed for queries, the harder it has gotten in general. We've had good success `teaching` the model different concepts in relation to the dataset and giving it example questions and queries greatly improved performance.

  • canopy

    Retrieval Augmented Generation (RAG) framework and context engine powered by Pinecone

  • Project mention: Build a simple RAG chatbot with LangChain... | dev.to | 2024-05-17

    To create a PineCone account, sign up via this link: https://www.pinecone.io/

  • aiconfig

    AIConfig is a config-based framework to build generative AI applications.

  • Project mention: VS Code: Prompt Editor for LLMs (GPT4, Llama, Mistral, etc.) | news.ycombinator.com | 2024-03-08

    doesn't collect prompts and there's a way to disable telemetry as well - https://github.com/lastmile-ai/aiconfig/blob/8a5a59d47cef474...

  • quix-streams

    A Python library for building containerized ML and Generative AI applications with Apache Kafka.

  • Project mention: Show HN: Streaming DataFrames–a Pandas-like syntax for real-time data | news.ycombinator.com | 2024-04-23
  • factool

    FacTool: Factuality Detection in Generative AI

  • Project mention: How to Detect AI Hallucinations | dev.to | 2024-05-03

    FACTOOL is a task and domain-agnostic framework designed to tackle the escalating challenge of factual error detection in generative AI. It is a five-step tool-augmented framework that consists of claim extraction, query generation, tool querying, evidence collection, and verification. FACTOOL uses tools like Google Search, Google Scholar, code interpreters, Python, and even LLMs themselves to detect factual errors in knowledge-based QA, code generation, math problem solving, and scientific literature review writing. It outperforms all other baselines across all scenarios and is shown to be highly robust in performing its specified tasks compared to LLMs themselves.

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python generative-ai related posts

  • Build a simple RAG chatbot with LangChain...

    2 projects | dev.to | 17 May 2024
  • everything-ai: the power of AI, on your computer

    2 projects | dev.to | 14 May 2024
  • Show HN: Open-Source SlackAI app for those who don't want to pay $10/user/month

    2 projects | news.ycombinator.com | 9 May 2024
  • How to Detect AI Hallucinations

    5 projects | dev.to | 3 May 2024
  • Show HN: Cognita – open-source RAG framework for modular applications

    3 projects | news.ycombinator.com | 27 Apr 2024
  • Gemini API 102: Next steps beyond "Hello World!"

    5 projects | dev.to | 24 Apr 2024
  • Show HN: Streaming DataFrames–a Pandas-like syntax for real-time data

    1 project | news.ycombinator.com | 23 Apr 2024
  • A note from our sponsor - InfluxDB
    www.influxdata.com | 18 May 2024
    Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →

Index

What are some of the best open-source generative-ai projects in Python? This list will help you:

Project Stars
1 LLaMA-Factory 21,791
2 jina 20,121
3 haystack 13,883
4 NeMo 10,179
5 BentoML 6,603
6 krita-ai-diffusion 4,732
7 TaskingAI 4,837
8 llmware 3,839
9 h2o-llmstudio 3,614
10 jupyter-ai 2,889
11 xTuring 2,525
12 YiVal 2,439
13 dbrx 2,421
14 SDV 2,153
15 coffee 1,355
16 openllmetry 1,328
17 PyRIT 1,289
18 cognita 1,320
19 LLMStack 1,140
20 canopy 895
21 aiconfig 862
22 quix-streams 809
23 factool 767

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com