> you can't meaningfully modify them given there is almost no information availa...

chasd00 · on July 23, 2024

From what i understand the training data and careful curation of it is the hard part. Everyone wants training data sets to train their own models instead of producing their own.

mesebrec · on July 23, 2024

Indeed, fine-tuning is still possible, but you can only go so far with fine-tuning before you need to completely retrain the model.

This is why Silo AI, for example, had to start from scratch to get better support for small European languages.