justynasty@lemmy.kya.moe to

LocalLLaMA@sh.itjust.worksEnglish · 11 months ago

Mistral 7B OpenOrca released

5

25

Mistral 7B OpenOrca released

justynasty@lemmy.kya.moe to

LocalLLaMA@sh.itjust.worksEnglish · 11 months ago

5

Open-Orca/Mistral-7B-OpenOrca · Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

This release is trained on a curated filtered subset of most of our GPT-4 augmented data.

HF Leaderboard evals place this model as #2 for all models smaller than 30B at release time, outperforming all but one 13B model.

GGUF files:

Mistral-7B-OpenOrca-GGUF

Warning (if I’m not mistaken):

Llama.cpp hasn’t assigned high priority tag to the sliding window. Axolotl replaced Mistral’s attention block by a “simple” flash attention.

That implies, in my opinion, that the new releases do not capitalize on the speedup claimed by Mistral developers.

We can’t expect the new versions to be faster than Llama, because there is no sliding attention to speed up inference.

Chat

noneabove1182@sh.itjust.worksM
link
fedilink
English
arrow-up
2·
11 months ago
Ah good point, definitely looking forward to it being implemented then

LocalLLaMA@sh.itjust.works

localllama@sh.itjust.works

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !localllama@sh.itjust.works

Community to discuss about LLaMA, the large language model created by Meta AI.

This is intended to be a replacement for r/LocalLLaMA on Reddit.

Visibility: Public

This community can be federated to other instances and be posted/commented in by their users.

1 user / day
8 users / week
45 users / month
587 users / 6 months
2 local subscribers
2.18K subscribers
230 Posts
829 Comments
Modlog