YOLO-World
openvino_notebooks
YOLO-World | openvino_notebooks | |
---|---|---|
3 | 80 | |
3,442 | 1,991 | |
13.4% | 5.1% | |
9.0 | 9.9 | |
6 days ago | 1 day ago | |
Python | Jupyter Notebook | |
GNU General Public License v3.0 only | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
YOLO-World
-
A History of CLIP Model Training Data Advances
2024 is shaping up to be the year of multimodal machine learning. From real-time text-to-image models and open-world vocabulary models to multimodal large language models like GPT-4V and Gemini Pro Vision, AI is primed for an unprecedented array of interactive multimodal applications and experiences.
- FLaNK Stack Weekly 19 Feb 2024
-
Making My Bookshelves Clickable
Post author here. I like this idea. I plan to explore it and make a more generic solution. I'd love to have a point-and-click interface for annotating scenes.
For example, I'd like to be able to click on pieces of coffee equipment in a photo of my coffee setup so I can add sticky note annotations when you hover over each item.
For the bookshelves idea specifically, I would love to have a correction system in place. The problem isn't so much SAM as it is Grounding DINO, the model I'm using for object identification. I then pass each identified region to SAM and map the segmentation mask to the box.
Grounding DINO detects a lot of book spines, but often misses 1-2. I am planning to try out YOLO-World (https://github.com/AILab-CVC/YOLO-World), which, in my limited testing, performs better for this task.
openvino_notebooks
- FLaNK-AIM Weekly 06 May 2024
- FLaNK AI Weekly 18 March 2024
- FLaNK Stack Weekly 19 Feb 2024
- FLaNK Stack Weekly 12 February 2024
- FLaNK Stack 05 Feb 2024
-
Optimum Intel OpenVino Performance
Also, credits for using zram in your VM setup; that's a smart hack for memory management. Have you tried tweaking other models like the ones in this OpenVINO notebook?
- FLaNK Stack Weekly 06 Nov 2023
- Trouvez-la plus vite
- Change your voice. FreeVC offers one-shot voice conversion, no text transcript required. Explore how OpenVINO powers AI solutions, see the code on GitHub.
- Vous aurez la banane
What are some alternatives?
chdb - chDB is an embedded OLAP SQL Engine 🚀 powered by ClickHouse
deepeval - The LLM Evaluation Framework
super-gradients - Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.
starcoder - Home of StarCoder: fine-tuning & inference!
open_model_zoo - Pre-trained Deep Learning models and demos (high quality and extremely fast)
netron - Visualizer for neural network, deep learning and machine learning models
trieve - All-in-one infrastructure for building search, recommendations, and RAG. Trieve combines search language models with tools for tuning ranking and relevance.
lst-bench - LST-Bench is a framework that allows users to run benchmarks specifically designed for evaluating Log-Structured Tables (LSTs) such as Delta Lake, Apache Hudi, and Apache Iceberg.
FLiPStackWeekly - FLaNK AI Weekly covering Apache NiFi, Apache Flink, Apache Kafka, Apache Spark, Apache Iceberg, Apache Ozone, Apache Pulsar, and more...
ProPainter - [ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
tailspin - 🌀 A log file highlighter
openchat - OpenChat: Advancing Open-source Language Models with Imperfect Data