VinBERT
1.0.0
VinBERT is a combination of two powerful Vietnamese language models: Vintern-1b-v2 and PhoBERT. With VinBERT, we create a language model optimized to better serve applications in the Vietnamese language, including tasks such as text classification, entity extraction, and more.
Data parallelism and Model parallelism are supported with backend ncclData parallelism are supported with backend xlaml.p4d.24xlarge and ml.trn1.32xlarge instances. pip install -r requirements.txtml.p4d.24xlarge)vantufit/flash-attn-cudadocker pull vantufit/flash-attn-cudaexport INSTANCE=ml.p4d.24xlarge
python training.pyml.trn1.32xlarge)export INSTANCE=ml.trn1.32xlarge
python training.py Implement Tensor parallelism with neuronx_distributed
Monitoring training process