Large language models can do jaw-dropping things. But nobody knows exactly why... - The Diigo Meta page

www.technologyreview.com/...s-amazing-but-nobody-knows-why

This link has been bookmarked by 21 people. It was first bookmarked on 04 Mar 2024, by someone privately.

02 Apr 24

Matt Bower
"Large language models can do jaw-dropping things. But nobody knows exactly why."

AI genAI generative MIT explanation
28 Mar 24

Juergen Plieninger
ki ai
Stella Porto
language models artificial intelligence
- they do more or less the same thing as a much better understood statistical construct called a Markov chain,
- He thinks there could be a hidden mathematical pattern in language that large language models somehow come to exploit: “Pure speculation but why not?”
- grokking and double descent are in fact aspects of the same phenomenon
27 Mar 24

Jan Eggers
Emergent abilities - where do they come from? How is a machine that predicts tokens able to produce intelligent answers?

ki sprachmodelle emergenz intelligenz grokking
26 Mar 24

karlhorky
llms ai artificialintelligence aiinacademia
18 Mar 24

Christopher Kent
AI LLMs Machine+learning 2024 generative+AI grokking
13 Mar 24

Francois Guite
Figuring it out is one of the biggest scientific puzzles of our time and a crucial step towards controlling more powerful future models.

AI LLM understanding explainability grokking
06 Mar 24

Vahid Masrour
artificialintelligence
