Preliminary Evidence: we are closer to AGI than it appears.
If prompted with "concepts" on propositional logic, GPT4's lowest score on ConceptARC goes from 13% GPT4 vs 86% Human, to 100% GPT4, without training examples.
This performance jump extends to ALL text benchmarks.
[Around January 4, 2024] I will release a full chess engine based on GPT4, whose code / prompts anyone will be able to inspect… GPT4's "performant output" willbeat every other chess engine in existence in a tournament of any size.
https://twitter.com/kenshin9000_/status/1734238211088506967