The largest multilingual image-text classification dataset. It contains fashion products.
Why do you think that https://github.com/tintn/vision-transformer-from-scratch is a good alternative to glami-1m?