update train-text-from-scratch with tokenization, sample selection and shuffling from finetune

xaedes 2023-09-15 23:45:54 +02:00
parent cc60b3f639
commit ab56b63b27
No known key found for this signature in database
GPG key ID: 30030EDD817EA2B1

File diff suppressed because it is too large.