| | llama-mistral | megablocks |
|---|---|---|
| Mentions | 5 | 6 |
| Stars | 373 | 1,083 |
| Growth | - | 3.0% |
| Activity | 8.4 | 8.7 |
| Latest commit | 6 months ago | 8 days ago |
| Language | Python | Python |
| License | GNU General Public License v3.0 or later | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
llama-mistral
- Inference code for Mistral and Mixtral hacked up
- French AI startup Mistral secures €2B valuation
No. Without the inference code, the best we can do is guess at its implementation, so any benchmark figures we get could be quite wrong. It does seem better than Llama2-70B in my tests, which rely on the work done by Dmytro Dzhulgakov[0] and DiscoResearch[1].
But the point of releasing over BitTorrent is the effervescence it creates: hobbyist research and early attempts at MoE quantization are already underway[2], and they are benefiting from the community.
[0]: https://github.com/dzhulgakov/llama-mistral
[1]: https://huggingface.co/DiscoResearch/mixtral-7b-8expert
[2]: https://github.com/TimDettmers/bitsandbytes/tree/sparse_moe
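For context on what those implementation guesses look like: Mixtral is widely believed to route each token through 2 of 8 expert feed-forward networks per layer. Below is a minimal top-2 routing sketch in PyTorch, with illustrative dimensions and a plain MLP standing in for the real expert block; this is an assumption-laden toy, not code from either repo.

```python
# Toy top-2 MoE routing sketch (assumed Mixtral-style; not the released code).
import torch
import torch.nn as nn
import torch.nn.functional as F

class TopTwoMoE(nn.Module):
    def __init__(self, dim=4096, hidden=14336, n_experts=8, top_k=2):
        super().__init__()
        self.gate = nn.Linear(dim, n_experts, bias=False)  # router
        # Plain MLP experts here; the real model likely uses SwiGLU FFNs.
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(dim, hidden), nn.SiLU(), nn.Linear(hidden, dim))
            for _ in range(n_experts)
        )
        self.top_k = top_k

    def forward(self, x):  # x: (num_tokens, dim)
        logits = self.gate(x)                                  # (tokens, experts)
        weights, idx = torch.topk(logits, self.top_k, dim=-1)  # 2 experts per token
        weights = F.softmax(weights, dim=-1)                   # renormalize over top-2
        out = torch.zeros_like(x)
        for e, expert in enumerate(self.experts):
            token_ids, slot = (idx == e).nonzero(as_tuple=True)
            if token_ids.numel():                              # tokens routed to expert e
                out[token_ids] += weights[token_ids, slot].unsqueeze(-1) * expert(x[token_ids])
        return out

moe = TopTwoMoE()
y = moe(torch.randn(5, 4096))  # 5 tokens in, 5 expert-mixed outputs back
```

The sketch only shows the routing arithmetic; everything outside the FFN (attention, norms) is reportedly shared with a dense Llama-style transformer, which is why forks of the Llama inference code were a natural starting point.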
- Code to run Mistral - mixtral-8x7b-32kseqlen
- New Mistral models just dropped (magnet links)
Someone made this. https://github.com/dzhulgakov/llama-mistral
- Mistral 8x7B 32k model [magnet]
If anyone can help with running this, it would be appreciated. Resources so far:
- https://github.com/dzhulgakov/llama-mistral
megablocks
- FLaNK AI - 01 April 2024
- Mistral has released a new 87GB model
- Megablocks-Public
This is a fork of https://github.com/stanford-futuredata/megablocks. Should link to the original when possible, per the HN posting guidelines.
- New Mistral models just dropped (magnet links)
I guess with 40+ GB of VRAM (until quantized) and MegaBlocks as the runtime. https://github.com/stanford-futuredata/megablocks
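Back-of-the-envelope arithmetic behind that VRAM guess, using the commonly cited ~46.7B total parameter count for Mixtral 8x7B (an assumption, not a measured figure):

```python
# Rough weight-memory estimate for Mixtral 8x7B (no KV cache or activations).
# ~46.7B total parameters is the commonly cited figure; treat it as an assumption.
total_params = 46.7e9

for name, bytes_per_param in [("fp16/bf16", 2), ("int8", 1), ("4-bit", 0.5)]:
    gib = total_params * bytes_per_param / 1024**3
    print(f"{name:9s} ~{gib:5.1f} GiB")

# fp16/bf16 ~ 87.0 GiB  -> matches the ~87GB release size mentioned above
# int8      ~ 43.5 GiB  -> the "40+ GB of VRAM" ballpark
# 4-bit     ~ 21.8 GiB  -> why quantization makes single-GPU inference plausible
```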
- MegaBlocks: Efficient Sparse Training with Mixture-of-Experts
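That last mention is the MegaBlocks paper itself. Its core idea, roughly: MoE routing produces unevenly sized per-expert token batches, and padding or dropping tokens to force fixed batch sizes wastes compute or hurts quality; MegaBlocks instead expresses the expert computation as block-sparse matrix multiplies so no tokens are dropped. A dense toy emulation of that grouped computation (illustrative only; the real library uses custom block-sparse kernels):

```python
# Dense toy emulation of "dropless" grouped expert computation. MegaBlocks
# realizes this with block-sparse kernels; this loop only mimics the math.
import torch

num_tokens, dim, n_experts = 16, 8, 4
x = torch.randn(num_tokens, dim)
assign = torch.randint(n_experts, (num_tokens,))   # router picks, unevenly
w = torch.randn(n_experts, dim, dim)               # one weight matrix per expert

order = torch.argsort(assign)                      # group tokens by expert
grouped = x[order]
counts = torch.bincount(assign, minlength=n_experts)

out = torch.empty_like(grouped)
start = 0
for e in range(n_experts):
    end = start + int(counts[e])
    out[start:end] = grouped[start:end] @ w[e]     # variable-size matmul, no padding
    start = end

result = torch.empty_like(out)
result[order] = out                                # scatter back to token order
```

Replacing this Python loop with a single block-sparse matmul over the grouped tokens is, as I understand the paper, the part MegaBlocks' kernels provide.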
What are some alternatives?
llama.cpp - LLM inference in C/C++
speedb - A RocksDB-compliant, high-performance, scalable embedded key-value store
megablocks-public
lapdev - Self-Hosted Remote Dev Environment
tracecat - 😼 The open source alternative to Tines / Splunk SOAR. Build AI-assisted workflows, orchestrate alerts, and close cases fast.