Python multimodal-deep-learning

Open-source Python projects categorized as multimodal-deep-learning

Top 9 Python multimodal-deep-learning Projects

  • BentoML

    The easiest way to serve AI/ML models in production - Build Model Inference Service, LLM APIs, Multi-model Inference Graph/Pipelines, LLM/RAG apps, and more!

  • Project mention: Who's hiring developer advocates? (December 2023) | dev.to | 2023-12-04

    Link to GitHub -->

  • pytorch-widedeep

    A flexible package for multimodal-deep-learning to combine tabular data with text and images using Wide and Deep models in Pytorch

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • Time-LLM

    [ICLR 2024] Official implementation of " πŸ¦™ Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"

  • Project mention: karpathy/llm.c | news.ycombinator.com | 2024-04-08

    Yes general LLM models can be used for time series forecasting:

    https://github.com/KimMeen/Time-LLM

  • CLoT

    CVPR'24, Official Codebase of our Paper: "Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation". (by sail-sg)

  • Project mention: CVPR 2024 Survival Guide: Five Vision-Language Papers You Don’t Want to Miss | dev.to | 2024-04-15

    GitHub

  • DeepViewAgg

    [CVPR'22 Best Paper Finalist] Official PyTorch implementation of the method presented in "Learning Multi-View Aggregation In the Wild for Large-Scale 3D Semantic Segmentation"

  • CapDec

    CapDec: SOTA Zero Shot Image Captioning Using CLIP and GPT2, EMNLP 2022 (findings)

  • Project mention: Open source – Unsupervised captioning getting closer to supervised captioning | news.ycombinator.com | 2024-04-20
  • VQASynth

    Compose multimodal datasets 🎹

  • Project mention: Show HN: VQASynth – pipelines to synthesize VQA datasets | news.ycombinator.com | 2024-02-23
  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  • 3DCoMPaT-v2

    3DCoMPaT++: An improved large-scale 3D vision dataset for compositional recognition

  • LIMoE-pytorch

    PyTorch implementation of LIMoE

  • Project mention: Pix2tex: Using a ViT to convert images of equations into LaTeX code | news.ycombinator.com | 2023-11-03

    Makes me wonder what the SOTA is for open source efforts along these lines.

    I have heard about "mixture of experts" as being a potentially important advance, and also of course about multimodality. So I found this: https://github.com/YeonwooSung/LIMoE-pytorch

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python multimodal-deep-learning related posts

  • Open source – Unsupervised captioning getting closer to supervised captioning

    1 project | news.ycombinator.com | 20 Apr 2024
  • [D] 3DCoMPaT Challenge: Tag materials and parts on 3D Models. 3K$ USD price pool

    1 project | /r/MachineLearning | 10 May 2023
  • Reverse engineer Stable Diffusion images

    2 projects | news.ycombinator.com | 8 Feb 2023
  • [R] [CVPR 2022 Oral] Learning Multi-View Aggregation In the Wild for Large-Scale 3D Semantic Segmentation

    2 projects | /r/MachineLearning | 11 May 2022

Index

What are some of the best open-source multimodal-deep-learning projects in Python? This list will help you:

Project Stars
1 BentoML 6,627
2 pytorch-widedeep 1,250
3 Time-LLM 842
4 CLoT 242
5 DeepViewAgg 216
6 CapDec 175
7 VQASynth 82
8 3DCoMPaT-v2 69
9 LIMoE-pytorch 46

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com