"Faith and Fate: Limits of Transformers on Compositionality"
https://arxiv.org/abs/2305.18654
"Comparing Humans, GPT-4, and GPT-4V On Abstraction and Reasoning Tasks"
https://arxiv.org/abs/2311.09247
"Embers of Autoregression: Understanding Large Language Models Through the Problem They are Trained to Solve"
https://arxiv.org/abs/2309.13638
"Pretraining Data Mixtures Enable Narrow Model Selection Capabilities in Transformer Models"
https://arxiv.org/abs/2311.00871