This is a port of several models from:
using the Hugging Face swift-transformers package to provide tokenization:
Models.swift provides minor overrides and customization. If you need overrides for the tokenizer or prompt customizations, add them there.
The code is set up to load models from Hugging Face, e.g. https://huggingface.co/mlx-community.
The following models have been tried:
- mlx-community/Mistral-7B-v0.1-hf-4bit-mlx
- mlx-community/CodeLlama-13b-Instruct-hf-4bit-MLX
- mlx-community/phi-2-hf-4bit-mlx
- mlx-community/quantized-gemma-2b-it
Currently supported model types are:
- Llama / Mistral
- Gemma
- Phi
See Configuration.swift for more info.
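As a rough illustration of how a loaded model might be routed to one of the supported types, the sketch below reads the `model_type` field of a Hugging Face `config.json` and maps it to an architecture. It assumes the listed models use the strings `llama`, `mistral`, `gemma`, and `phi` in that field. The names `SketchArchitecture`, `SketchBaseConfig`, and `sketchArchitecture(forConfigJSON:)` are hypothetical and are not taken from Configuration.swift.

```swift
import Foundation

// Hypothetical names throughout; this is not the code in Configuration.swift.
enum SketchArchitecture: String {
    case llama  // assumed to cover both Llama and Mistral checkpoints
    case gemma
    case phi
}

// Decodes only the field needed for dispatch; other keys in config.json are ignored.
struct SketchBaseConfig: Decodable {
    let modelType: String

    enum CodingKeys: String, CodingKey {
        case modelType = "model_type"
    }
}

// Returns nil when the model type is not one of the supported ones.
func sketchArchitecture(forConfigJSON data: Data) throws -> SketchArchitecture? {
    let config = try JSONDecoder().decode(SketchBaseConfig.self, from: data)
    switch config.modelType {
    case "llama", "mistral":
        return .llama
    case "gemma":
        return .gemma
    case "phi":
        return .phi
    default:
        return nil
    }
}
```

Returning `nil` for an unknown type keeps the choice of error handling with the caller. A real loader would more likely throw a descriptive error.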
See llm-tool for usage.
Lora.swift contains an implementation of LoRA based on this example:
See llm-tool/LoraCommands.swift for an example of a driver and llm-tool for examples of how to run it.
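For readers new to LoRA, the idea is to freeze a layer's weight matrix and train only a low-rank update added to its output. The plain-Swift sketch below shows that forward pass on `[Double]` arrays. It does not use MLX and is not the code in Lora.swift. The names `SketchLoRALinear`, `matVec`, and the `scale` parameter are illustrative assumptions.

```swift
// Plain-Swift sketch of a LoRA-adapted linear layer.
// All names are hypothetical; this is not the API of Lora.swift.
typealias Matrix = [[Double]]

// Multiplies a matrix (array of rows) by a vector.
func matVec(_ m: Matrix, _ v: [Double]) -> [Double] {
    m.map { row in zip(row, v).reduce(0.0) { $0 + $1.0 * $1.1 } }
}

struct SketchLoRALinear {
    let weight: Matrix  // frozen base weight, outDim x inDim
    var loraA: Matrix   // trainable, rank x inDim
    var loraB: Matrix   // trainable, outDim x rank
    let scale: Double   // scaling applied to the low-rank update

    // y = W x + scale * B (A x)
    func callAsFunction(_ x: [Double]) -> [Double] {
        let base = matVec(weight, x)
        let update = matVec(loraB, matVec(loraA, x))
        return zip(base, update).map { $0 + scale * $1 }
    }
}
```

With `loraB` initialized to zeros, as is common, the adapted layer starts out identical to the frozen one. Training then moves only `loraA` and `loraB`, which together hold far fewer parameters than `weight` when the rank is small.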