You can now train a 70b language model at home
2024-03-11
The article introduces an open source system from Answer.AI that combines FSDP (Fully Sharded Data Parallel) with QLoRA (quantized low-rank adaptation) to train a 70b large language model efficiently on a desktop computer with standard gaming GPUs. This puts large-model training within reach of smaller labs and individual researchers, in line with Answer.AI's mission to democratise AI development.
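A rough back-of-the-envelope calculation helps show why both techniques are needed. The sketch below is illustrative only: it assumes 24 GB gaming cards and a two-GPU desktop (neither figure is stated in the summary above), and it counts only the memory for the base model weights, ignoring activations, adapter weights, optimizer state, and quantization metadata.

```python
def weight_memory_gb(n_params: float, bits_per_param: float) -> float:
    """Memory needed to hold the weights alone, in GB (1 GB = 1e9 bytes)."""
    return n_params * bits_per_param / 8 / 1e9


N_PARAMS = 70e9       # a 70b-parameter model
GPU_MEMORY_GB = 24    # assumption: memory of one high-end gaming GPU
N_GPUS = 2            # assumption: a two-GPU desktop

# 16-bit weights: far larger than the combined memory of both cards.
fp16_gb = weight_memory_gb(N_PARAMS, 16)

# 4-bit quantized weights (the quantization side of QLoRA).
int4_gb = weight_memory_gb(N_PARAMS, 4)

# Sharding the quantized weights evenly across GPUs (the FSDP side).
per_gpu_gb = int4_gb / N_GPUS

print(f"16-bit weights: {fp16_gb:.1f} GB")
print(f"4-bit weights:  {int4_gb:.1f} GB")
print(f"per GPU after sharding across {N_GPUS} GPUs: {per_gpu_gb:.1f} GB")
print(f"fits on one {GPU_MEMORY_GB} GB card unsharded: {int4_gb <= GPU_MEMORY_GB}")
print(f"fits per card once sharded: {per_gpu_gb <= GPU_MEMORY_GB}")
```

Under these assumptions, quantization alone leaves the weights too large for a single card, and sharding alone leaves 16-bit weights too large for both cards together; only the combination brings the per-GPU share under the card's capacity, with the remaining headroom left for everything the sketch ignores.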