Metric | glami-1m | vision-transformer-from-scratch |
---|---|---|
Mentions | 3 | 1 |
Stars | 64 | 93 |
Growth | - | - |
Activity | 2.8 | 4.9 |
Last commit | 12 months ago | 11 months ago |
Language | Jupyter Notebook | Jupyter Notebook |
License | Apache License 2.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
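The site's actual formulas are not given here, so the following is only a minimal sketch of how such metrics could be computed. It assumes an exponential decay for commit recency, a percentile rank scaled to 0-10 for activity, and a simple relative change for star growth. All function names, the 30-day half-life, and the scaling are hypothetical choices made for illustration.

```python
from bisect import bisect_left
from math import exp, log


def recency_weighted_commits(commit_ages_days, half_life_days=30.0):
    """Sum commit weights, where a commit's weight halves every half_life_days.

    The exponential decay and the 30-day half-life are assumptions.
    """
    decay = log(2) / half_life_days
    return sum(exp(-decay * age) for age in commit_ages_days)


def activity_score(raw, all_raw_scores):
    """Map a raw score to a 0-10 scale by percentile rank (assumed scaling).

    A result of 9.0 means 90% of tracked projects have a lower raw score,
    i.e. the project is in the top 10%.
    """
    ranked = sorted(all_raw_scores)
    below = bisect_left(ranked, raw)  # number of projects strictly below raw
    return 10.0 * below / len(ranked)


def month_over_month_growth(stars_prev, stars_now):
    """Relative change in stars between two consecutive months."""
    if stars_prev == 0:
        return None  # growth is undefined without a baseline
    return (stars_now - stars_prev) / stars_prev


# Hypothetical data: ages (in days) of recent commits for three projects.
projects = {"a": [1, 3, 10], "b": [200, 400], "c": [5, 40, 90, 120]}
raw = {name: recency_weighted_commits(ages) for name, ages in projects.items()}
scores = {name: activity_score(r, raw.values()) for name, r in raw.items()}
```

Because the score is a rank rather than an absolute count, it only has meaning relative to the other tracked projects, which is consistent with the description of activity as a relative number.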
glami-1m
- Glami-1M: A Multilingual Image-Text Fashion Dataset
- [R] GLAMI-1M: A Multilingual Image-Text Fashion Dataset
  Found relevant code at https://github.com/glami/glami-1m
vision-transformer-from-scratch
- [P] Implementing Vision Transformer (ViT) from Scratch using PyTorch
  GitHub: https://github.com/tintn/vision-transformer-from-scratch
What are some alternatives?
torchscale - Foundation Architecture for (M)LLMs
continual-pretraining-nlp-vision - Code to reproduce experiments from the paper "Continual Pre-Training Mitigates Forgetting in Language and Vision" https://arxiv.org/abs/2205.09357
notebooks - Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like Grounding DINO and SAM.
super-gradients - Easily train or fine-tune SOTA computer vision models with one open source training library. The home of YOLO-NAS.
pythoncode-tutorials - The Python Code Tutorials
Transformers-Tutorials - This repository contains demos I made with the Transformers library by HuggingFace.
One-Piece-Image-Classifier - A quick image classifier trained with manually selected One Piece images.
maxvit - [ECCV 2022] Official repository for "MaxViT: Multi-Axis Vision Transformer". SOTA foundation models for classification, detection, segmentation, image quality, and generative modeling...
blended-latent-diffusion - Official implementation for "Blended Latent Diffusion" [SIGGRAPH 2023]
HugsVision - HugsVision is an easy-to-use Hugging Face wrapper for state-of-the-art computer vision
gem - A PyTorch-based library to evaluate learning methods on small image classification datasets
computervision-recipes - Best Practices, code samples, and documentation for Computer Vision.