Top 9 Python multimodal-deep-learning Projects
-
BentoML
The easiest way to serve AI/ML models in production - Build Model Inference Service, LLM APIs, Multi-model Inference Graph/Pipelines, LLM/RAG apps, and more!
-
pytorch-widedeep
A flexible package for multimodal-deep-learning to combine tabular data with text and images using Wide and Deep models in Pytorch
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
Time-LLM
[ICLR 2024] Official implementation of " π¦ Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"
-
CLoT
CVPR'24, Official Codebase of our Paper: "Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation". (by sail-sg)
-
DeepViewAgg
[CVPR'22 Best Paper Finalist] Official PyTorch implementation of the method presented in "Learning Multi-View Aggregation In the Wild for Large-Scale 3D Semantic Segmentation"
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Link to GitHub -->
Yes general LLM models can be used for time series forecasting:
https://github.com/KimMeen/Time-LLM
Project mention: CVPR 2024 Survival Guide: Five Vision-Language Papers You Donβt Want to Miss | dev.to | 2024-04-15GitHub
Project mention: Open source β Unsupervised captioning getting closer to supervised captioning | news.ycombinator.com | 2024-04-20
Project mention: Show HN: VQASynth β pipelines to synthesize VQA datasets | news.ycombinator.com | 2024-02-23
Project mention: Pix2tex: Using a ViT to convert images of equations into LaTeX code | news.ycombinator.com | 2023-11-03Makes me wonder what the SOTA is for open source efforts along these lines.
I have heard about "mixture of experts" as being a potentially important advance, and also of course about multimodality. So I found this: https://github.com/YeonwooSung/LIMoE-pytorch
Python multimodal-deep-learning related posts
-
Open source β Unsupervised captioning getting closer to supervised captioning
-
[D] 3DCoMPaT Challenge: Tag materials and parts on 3D Models. 3K$ USD price pool
-
Reverse engineer Stable Diffusion images
-
[R] [CVPR 2022 Oral] Learning Multi-View Aggregation In the Wild for Large-Scale 3D Semantic Segmentation
Index
What are some of the best open-source multimodal-deep-learning projects in Python? This list will help you:
Project | Stars | |
---|---|---|
1 | BentoML | 6,627 |
2 | pytorch-widedeep | 1,250 |
3 | Time-LLM | 842 |
4 | CLoT | 242 |
5 | DeepViewAgg | 216 |
6 | CapDec | 175 |
7 | VQASynth | 82 |
8 | 3DCoMPaT-v2 | 69 |
9 | LIMoE-pytorch | 46 |
Sponsored