LLM

This is a port of several models from:

https://github.com/ml-explore/mlx-examples/blob/main/llms/mlx_lm/models/

using the Hugging Face swift transformers package to provide tokenization:

https://github.com/huggingface/swift-transformers

The Models.swift provides minor overrides and customization -- if you require overrides for the tokenizer or prompt customizations they can be added there.

This is set up to load models from Hugging Face, e.g. https://huggingface.co/mlx-community

The following models have been tried:

mlx-community/Mistral-7B-v0.1-hf-4bit-mlx
mlx-community/CodeLlama-13b-Instruct-hf-4bit-MLX
mlx-community/phi-2-hf-4bit-mlx
mlx-community/quantized-gemma-2b-it

Currently supported model types are:

Llama / Mistral
Gemma
Phi

See Configuration.swift for more info.

See llm-tool

LoRA

Lora.swift contains an implementation of LoRA based on this example:

https://github.com/ml-explore/mlx-examples/tree/main/lora

See llm-tool/LoraCommands.swift for an example of a driver and llm-tool for examples of how to run it.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

LLM

LoRA

Files

README.md

Latest commit

History

README.md

File metadata and controls

LLM

LoRA