I’m not sure this is true. They could be trained based on published works prior to a certain date as the formal writing style, eg Project Gutenberg, then layer on the recent internet to better capture modern stylistic trends.
Ultimately, the models will always require fine tuning, and selecting which data set you use for early training has a very large impact on the overall performance of the model. Additional knowledge and trendiness can be learned after the fact.
I’m not sure this is true. They could be trained based on published works prior to a certain date as the formal writing style, eg Project Gutenberg, then layer on the recent internet to better capture modern stylistic trends.
Ultimately, the models will always require fine tuning, and selecting which data set you use for early training has a very large impact on the overall performance of the model. Additional knowledge and trendiness can be learned after the fact.