• 1 Post
  • 15 Comments
Joined 2 years ago
cake
Cake day: June 21st, 2023

help-circle


  • It’s not it’s biological origins that make it hard to understand the brain, but the complexity. For example, we understand how the heart works pretty well.

    While LLMs are nowhere near as complex as a brain, they’re complex enough to make it extremely difficult to understand.

    But then there comes the question: if they’re so difficult to understand, how did people make them in the first place?

    The way they did it actually bears some similarities to evolution. They created an “empty” model - a large neural network that wasn’t doing anything useful or meaningful. But it depended on billions of parameters, and if you tweak a parameter, its behavior changes slightly.

    Then they expended enormous amount of computing power tweaking parameters, each tweak slightly improving its ability to model language. While doing this, they didn’t know what each number meant. They didn’t know how or why each tweak was improving the model. Just that each tweak was making an improvement.

    Unlike evolution, each tweak isn’t random. There’s an algorithm called back-propagation that can tell you how to tweak the neural network to make it predict some known data slightly better. But unfortunately it doesn’t tell you anything about the “why” this tweak is good, or “what” each parameter change means. Hence why we don’t understand how LLMs work.

    One final clarification: It’s not a complete black box. We do have some understanding of how LLM works, mostly on high level. Kind of like we have some basic understanding of how a brain works. We understand LLMs much better than brains, of course.


  • It’s not that nobody took the time to understand. Researchers have been trying to “un-blackbox” neural networks pretty much since those have been around. It’s just an extremely complex problem.

    Logistic regression (which is like a neural network but with just one node) is pretty well understood - but even then sometimes it can learn some pretty unintuitive coefficients and it can be tricky to understand why.

    With LLMs - which are enormous by comparison - it’s simply not a tractable problem to understand how it works in detail.











  • I see it as compensating for disadvantages people have. So, if one student has lower test scores, but achieved them despite going to an underfunded school and having a part-time job, then that student scores are actually more impressive than someone else who scored better, but had private tutors throughout high school. Once you account for people’s disadvantages, you should naturally get more diverse student body.

    And of course minority students have disadvantages that should be accounted for. But they don’t affect everyone the same, and racial quotas is a very lazy way to do this. Instead, admissions should look at the individual circumstances of each student.