💡 FastChat is an open platform for training, serving, and evaluating chatbots based on large language models. This tutorial shows you how to use FastChat for a simple fine-tuning demonstration.

1. Installation

  1. Clone the repository: git clone https://github.com/lm-sys/FastChat.git
  1. Navigate into the project directory: cd FastChat
  1. Create a new conda environment: conda create -n fastchat python=3.10
  1. Activate the environment: conda activate fastchat
  1. Verify the pip and Python paths to ensure that packages will be installed in the new environment: which pip and which python
  1. Install FastChat:
      pip3 install --upgrade pip
      pip3 install -e ".[model_worker,webui]"
      To speed up the installation, you can use the Tsinghua mirror by appending
      -i https://pypi.tuna.tsinghua.edu.cn/simple to the command. For example:
      pip3 install -e ".[model_worker,webui]" -i https://pypi.tuna.tsinghua.edu.cn/simple
  1. Optional packages
      • For inference with fine-tuned models: pip install -e ".[model_worker,llm_judge]"
      • For memory-efficient training with FlashAttention:
        • Ensure you have:
          • CUDA 11.6 or above. Use nvcc --version to check the CUDA version.
          • PyTorch 1.12 or above. Check with python -c "import torch; print(torch.__version__)"
        • Install FlashAttention: pip install flash-attn
      • If you want to use LoRA or QLoRA, install the required packages: pip install deepspeed bitsandbytes scipy

2. Download the model weights

Refer to https://zhuanlan.zhihu.com/p/648377325 for more instructions.
  1. We use Llama 2 models for fine-tuning. Start by filling out Meta's access request form to obtain access to the weights.
  1. After downloading the weights, convert them to the Hugging Face Transformers format using the conversion script.
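
     As a sketch, the conversion script that ships with Transformers can be invoked like this (the /path/to/... directories are placeholders, and --model_size should match the checkpoint you downloaded):

     ```shell
     # Hedged example: convert raw Llama 2 weights to the Transformers format.
     python -m transformers.models.llama.convert_llama_weights_to_hf \
         --input_dir /path/to/downloaded/llama-2 \
         --model_size 7B \
         --output_dir /path/to/llama-2-7b-hf
     ```

     The resulting output directory is what you pass to model_name_or_path in the fine-tuning step below.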

3. Fine-tune Llama 2

Use the script scripts/train_vicuna_7b.sh. Modify the following as per your setup:
  1. nproc_per_node: the number of GPUs to use;
  1. model_name_or_path: the path of the model to fine-tune;
  1. data_path: the path of the fine-tuning dataset.
If you do not want to use FlashAttention, replace fastchat/train/train_mem.py with fastchat/train/train.py.
Note: Use the default dataset data/dummy_conversation.json for a test run, or use your own dataset in the same format.
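
For orientation, the kind of command the script launches looks roughly like this (a sketch only: the paths are placeholders and the exact hyperparameters come from train_vicuna_7b.sh itself):

```shell
# Hedged sketch of a full fine-tuning launch via torchrun.
torchrun --nproc_per_node=4 --master_port=20001 fastchat/train/train_mem.py \
    --model_name_or_path /path/to/llama-2-7b-hf \
    --data_path data/dummy_conversation.json \
    --output_dir output_vicuna_7b \
    --bf16 True \
    --num_train_epochs 3 \
    --per_device_train_batch_size 2 \
    --gradient_accumulation_steps 16 \
    --learning_rate 2e-5 \
    --model_max_length 2048 \
    --gradient_checkpointing True \
    --lazy_preprocess True
```

Here --nproc_per_node corresponds to your GPU count, and swapping train_mem.py for train.py disables the FlashAttention path as described above.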
     
Others:
  1. If you want to modify the prompt, edit get_conversation_template("vicuna") in fastchat/train/train.py.
  1. If you want to use LoRA, launch the LoRA training entry point fastchat/train/train_lora.py instead.
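
A LoRA launch might look like the following sketch (flag names follow fastchat/train/train_lora.py; the model path and DeepSpeed config path are placeholders for your own files):

```shell
# Hedged sketch of a LoRA fine-tuning launch with DeepSpeed.
deepspeed fastchat/train/train_lora.py \
    --model_name_or_path /path/to/llama-2-7b-hf \
    --data_path data/dummy_conversation.json \
    --output_dir output_vicuna_7b_lora \
    --lora_r 8 \
    --lora_alpha 16 \
    --lora_dropout 0.05 \
    --deepspeed /path/to/deepspeed_config.json
```

This is why the earlier optional step installs deepspeed and bitsandbytes: LoRA/QLoRA training in FastChat goes through DeepSpeed.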

Tips

  1. Quick test run: You can reduce the number of layers in the model to make it smaller, ensuring a faster run to verify that everything is set up correctly without waiting for full-scale training.
    1. Open the config.json file located in the model directory.
    2. Modify num_hidden_layers to 2 or another small number.
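
The two steps above can be sketched as a small shell snippet (run inside the model directory; for illustration it creates a toy config.json if none exists):

```shell
# Shrink the model for a quick smoke test by editing config.json.
# With a real model the file already exists; the toy one here is only for illustration.
[ -f config.json ] || echo '{"num_hidden_layers": 32}' > config.json
python - <<'EOF'
import json

with open("config.json") as f:
    cfg = json.load(f)
cfg["num_hidden_layers"] = 2  # or another small number
with open("config.json", "w") as f:
    json.dump(cfg, f, indent=2)
EOF
```

Remember to restore the original config.json (or re-download it) before a real training run.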
       

Bugs??

  1. Version mismatch: If the CUDA versions reported by nvcc -V and nvidia-smi don't align, point your shell at the toolkit under /usr/local/ that matches the driver.
    1. Note: Replace cuda-12.2 with your version, ensuring it exists in /usr/local/.
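
A common fix (an assumption based on standard CUDA setups, not a FastChat-specific command) is to put the matching toolkit first on your PATH:

```shell
# Make nvcc resolve to the toolkit that matches the driver shown by nvidia-smi.
# Replace cuda-12.2 with the version present under /usr/local/.
export PATH=/usr/local/cuda-12.2/bin:$PATH
export LD_LIBRARY_PATH=/usr/local/cuda-12.2/lib64${LD_LIBRARY_PATH:+:$LD_LIBRARY_PATH}
```

Add these lines to your shell profile (e.g. ~/.bashrc) if you want the change to persist across sessions.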
       
Happy Chatbotting with FastChat! 😃
       