This link has been bookmarked by 21 people . It was first bookmarked on 04 Mar 2024, by someone privately.
-
02 Apr 24Matt Bower
"Large language models can do jaw-dropping things. But nobody knows exactly why.
" -
28 Mar 24
-
-
they do more or less the same thing as a much better understood statistical construct called a Markov chain,
-
He thinks there could be a hidden mathematical pattern in language that large language models somehow come to exploit: “Pure speculation but why not?
-
grokking and double descent are in fact aspects of the same phenomenon
-
-
27 Mar 24Jan Eggers
Emergente Fähigkeiten - woher kommen sie? Wie eine Maschine, die Tokens vorhersagt, dazu in der Lage, intelligente Antworten zu produzieren?
-
26 Mar 24
-
18 Mar 24
-
13 Mar 24Francois Guite
Figuring it out is one of the biggest scientific puzzles of our time and a crucial step towards controlling more powerful future models.
-
06 Mar 24
Would you like to comment?
Join Diigo for a free account, or sign in if you are already a member.