Join Our Discord (940+ Members)

Mixtral 8x7b - Mistral AI

Mixtral 8x7B: High-quality sparse model, 6x faster than Llama 2 70B. Best cost/performance, multilingual, fine-tunable. Outperforms GPT3.5.

Model Overview

  • High-quality sparse mixture-of-experts model
  • Licensed under Apache 2.0
  • Outperforms Llama 2 70B with 6x faster inference
  • Best cost/performance trade-offs, matching or surpassing GPT3.5

Capabilities

  • Handles a context of 32k tokens
  • Supports English, French, Italian, German, and Spanish
  • Strong performance in code generation
  • Fine-tunable for instruction-following tasks, achieving 8.3 on MT-Bench

Sparse Architectures

  • Sparse mixture-of-experts network
  • Decoder-only model with 8 distinct parameter groups
  • Router network chooses experts for token processing
  • 46.7B total parameters, uses only 12.9B parameters per token

Performance Comparison

  • Outperforms Llama 2 70B and GPT3.5 on most benchmarks
  • Efficient models compared to Llama 2 family
  • Detailed results provided for performance overview

Bias and Language

  • Less bias on BBQ benchmark compared to Llama 2
  • Displays positive sentiments on BOLD with similar variances

Instructed Models

  • Releases Mixtral 8x7B Instruct optimized for instruction-following
  • Reaches a score of 8.30 on MT-Bench, comparable to GPT3.5

Open-Source Deployment

  • Submitted changes to vLLM project for open-source deployment
  • Skypilot enables vLLM endpoints deployment on any cloud instance

Platform Usage

  • Mixtral 8x7B available on the mistral-small endpoint in beta
  • Early access registration for generative and embedding endpoints

Acknowledgements

  • Thanks to CoreWeave and Scaleway teams for technical support during model training.

Follow AI Models on Google News

An easy & free way to support AI Models is to follow our google news feed! More followers will help us reach a wider audience!

Google News: AI Models

Related Posts

Rauw Alejandro RVC Model AI Voice

Rauw Alejandro RVC Model AI Voice

Discover the groundbreaking collection of AI-generated songs by Rauw Alejandro! Made with state-of-the-art VITS Retrieval technology, these tracks blend a variety of genres and languages to showcase the model’s versatility.

Kim Jong Un RVC Model AI Voice

Kim Jong Un RVC Model AI Voice

Experience a unique blend of styles & languages with AI Kim Jong Un’s collection of mind-blowing tracks!

Childish Gambino RVC Model AI Voice

Childish Gambino RVC Model AI Voice

Introducing AI Childish Gambino’s latest collection of songs, produced by a community of AI enthusiasts.