How to teach things well
post by Neel Nanda (neel-nanda-1) · 2020-08-28T16:44:27.817Z · LW · GW · 17 commentsThis is a link post for https://www.neelnanda.io/blog/mini-blog-post-18-how-to-teach-things-well
Contents
Introduction Framing How to teach Teaching 1-on-1 Teaching Maths Teaching Applied Rationality/Life Advice Conclusion None 17 comments
(This is a post on my thoughts on good teaching techniques from a daily blogging project, that I thought might be of interest to LessWrong readers)
Introduction
This is a blog post on how to teach things well. I’ll mostly be focusing on forms of teaching that involve preparation and structure, like talks and tutoring, but these ideas transfer pretty broadly. I think teaching and explaining ideas is an incredibly important skill, and one that most people aren’t great at. I’ve spent a lot of time practicing teaching ideas, and I think I’ve found a bunch of important ideas and approaches that work well. I’m giving a talk next week, so I’ll initially focus on how to give good talks, but try to outline the underlying concepts and high-level ideas of teaching. And then talk about how these can transfer to contexts like tutoring, and to teaching specifically maths or applied rationality - the main areas I have actual teaching experience with.
Note: I mostly care about teaching concepts and ideas, and teaching things to people who genuinely want to learn and be there, so my advice will focus accordingly.
I think it’s useful to think about good teaching even if you don’t intend to spend much time teaching - learning and teaching are flip sides of the same process. I’ve found that even when in the role of a student, understanding what good teaching looks like can often fix a lot of the shortcomings of a bad teacher!
Framing
The key insight of this post is that good teaching requires you to be deliberate, and keep the purpose in mind: learning is a process of information compression. When you’re learning something new, you essentially receive a stream of new information. But human cognition doesn’t work by just storing a flood of information as is. The student takes in the information stream, extracts out the important ideas, converts it to concepts, and stores those in their mind. This is a key distinction, because it shows that the job of a teacher is not to give the student information, it’s to get the student to understand the right concepts. Conveying information is only useful as a means to an end to this goal.
In practice, it often works to just give a stream of information! Good students have learned the skill of taking streams of information and converting it to concepts. Often this happens implicitly, they student will absorb and memorise a lot of data, and over time this forms into concepts and ideas in their head automatically. But this is a major amount of cognitive labour. And a good teacher will try to do as much of it as possible, to let the student focus their cognitive labour on the important things.
My underlying model here is that we all have a web of concepts in our minds, our knowledge graph. The collection of all the concepts we understand, all of our existing knowledge and intuitions, connected together. And you have learned something when you can convert it to concepts and connect it to your existing understanding. This means not just understanding the concept itself, but understanding where it fits into the bigger picture, where to use it, etc.
The final part there is key - if the student leaves with a good understanding of the ideas in the abstract, but no idea when to think about the ideas again, it’s no better than if they’d learned nothing at all. We call on our knowledge when something related triggers, so in order for a lesson to be useful, you need to build those connections and triggers in the student’s mind.
A key distinction to bear in mind is ideas being legible vs tacit. A legible idea is something concrete that can easily be put into unambiguous words, eg how to do integration by parts. While tacit knowledge is something fuzzier and intuitive, eg recognising the kinds of integrals where you’d use integration by parts in the first place - essentially the intuitions you want the student to have. This is a good distinction to bear in mind, because legible knowledge is much easier to convey, but often your goal is to convey tacit knowledge (at least, it should be!). And there’s a lot of skill to conveying tacit knowledge well, and making it as legible as possible without losing key nuance. And different techniques work better for the two kinds. A lot of my issues with the Cambridge maths course is an extreme focus on legible knowledge over tacit - the underlying intuitions and motivations.
How to teach
There are two key problems when teaching, that any good teaching advice must account for:
- Limited ways to convey information
- The ideas I’m teaching are stored as implicit concepts in my mind, but in order to convey them, I must translate them into language. This language maps to ideas in my head according to all of my implicit knowledge and worldview, but translates into the student’s head according to their implicit knowledge and worldview. This often creates errors
- And converting concepts to language is inherently lossy, ideas have a lot of tacit nuance that is hard to capture
- Essentially, words can only convey legible knowledge, and I need to figure out how to hack this to convey tacit knowledge. Or how to find alternate information channels
- Alternately, I need to have error checking mechanisms to notice when I’ve failed to convey something well.
- Typical Mind Fallacy
- A key part of learning is combining knowledge you hear with the ideas already in your head.
- But I only have access to what’s in my head, not what’s in the student’s. And by definition this is different - I already understand what I’m teaching!
- This is a crippling blow to my ability to teach, and I need to be constantly aware of this and trying to build models of what’s in the student’s heads, and how they receive what I’m saying.
Here are some of the most important tools I have for addressing these problems:
- High-level picture
- The student’s knowledge graph is big. So the first, and most important part, is identifying which part of the graph they should add these ideas to
- Thus you should always highlight where this fits in to the bigger picture. Which questions are we currently trying to answer? Why is this idea interesting at all? Where can the student use this? What are its limitations?
- This should always be the first thing when introducing a new idea
- Prioritising information
- Learning is information compression. This fundamentally means that the student needs to be paying selective attention. When learning something new, the Pareto Principle always applies - 80% of the importance lies in 20% of the ideas.
- Identifying this 20% is significant cognitive labour, because it’s not immediately obvious. They need to pay attention to everything and later filter.
- But, the question of “what matters here” is tacit knowledge that I already have! This talk is high labour for the student, but easy for me. Thus, the most important thing a teacher can possibly do is to highlight what matters and what doesn’t, to tell the student where to focus their attention.
- In practice, you should always be saying “this is really, really important” or “these are just fiddly details” or “this is a bit of a niche edge case” etc. It is extremely hard to do this too often.
- And give more time to the important things. If you say an important point, write it down and put a box around it. Explain why it’s important and how it fits into the bigger picture. Give an alternate explanation, or an example.
- This is really easy to miss if you think of teaching as information transfer - where your goal is to tell the student everything and hope they figure out what to pay attention to themselves.
- Another tool: Have frequent summaries, highlighting the key points in the previous section.
- This is further useful to highlight connections. Some ideas will fit into their knowledge graph more easily than others, and pointing out connections can leverage the easy connections to make hard ones easier
- I’m a big of the advice “say what you’re going to tell them, tell them, and say what you’ve just told them”, I think it’s a good way to implement this principle
- The fundamental principle behind this is that students retain a tiny fraction of what they hear. If you give a one hour talk, they’ll retain maybe a few minutes worth of content. This is a fundamental fact of the learning process, and the only thing you can do about it is to control what they retain. Focus their attention on the important parts
- This mindset helps me identify what’s important. Set a 5 minute timer, and write down everything important that you want people to retain. These are the key points around which your talk should be structured!
- Everything else you say should be intended to help these key points stick better - to highlight connections between them and to existing knowledge, to ensure good information transfer, and to convey the tacit knowledge underlying the key points
- Another key skill of prioritisation is cutting things. If one part is irrelevant, or it’s boring and fiddly, cut it! It’s painful to not talk about everything cool, but you have limited time - if you don’t actively prioritise, you aren’t avoiding trade-offs, you’re just ceding control to “whatever you leave last”
- Anyone who’s gone to a talk by me knows that I have yet to internalise this lesson
- If there’s one point you retain from this post, let it be this one - this is incredibly important, and the main mark of a good teacher vs a mediocre one
- Understand pre-requisites
- A consequence of the Typical Mind Fallacy, is that it’s super easy to forget that your students don’t have all the context you have! This manifests as people teaching ideas without the pre-requisites.
- The underlying idea: the ideas you teach are in your knowledge graph, and are built upon existing ideas - these are the pre-requisites. You need to figure out whether the students
- A common secondary mistake is to understand pre-requisites, and then try to explain all of them!
- A good framing here is inferential distance. The inferential distance of a new idea is the number of steps of new concepts someone must understand before they can understand the new idea. Eg, to teach a young kid about the quadratic formula, they first need to understand the idea of polynomials, for which they need to understand algebra - this is three inferential steps.
- A general rule of thumb: never teach things with more than 2 inferential steps. An idea just learned is shaky, and doesn’t yet have good connections built. It’s very, very hard to anchor new ideas onto new ideas.
- This is really hard to get right. Pre-requisites require you to have a good model of somebody else’s mind. A useful technique is often to do a practice run on someone from your target demographic, and ask them to flag everything that confuses them.
- Further, for groups, this can be an intractable problem, everyone has different prior knowledge. You need to have a clear picture in your head of who the talk is aimed at.
- Students should learn actively, not passively
- It’s really easy for a student to just passively sit in a stream of information, taking none of it in. This achieves neither of your goals, because to form connections, compress information and connect it to their knowledge graph, they need to be putting in some cognitive labour.
- Good ways to encourage this: explicitly telling them that this is important, giving exercises and time to think through them, asking the audience regular questions and giving them some time to think.
- Question asking has the failure mode where most people won’t volunteer, or just zone out a bit - I lack great solutions to this problem
- This is much easier to handle in smaller settings, I’m bad at handling it in talks. The main solution I have is just to be engaging and keep the pace going well.
- Often people will zone out, so having regular breaks, and check-points of “if you weren’t following, we’re changing topic so it doesn’t matter” can help to rectify this
- Even short, 30s-120s breaks can be helpful! Encourage people to get up and stretch.
- Breaks never feel important, but they really, really help
- Give examples
- Examples are an insanely powerful tool for teaching things well, and people rarely use them enough. I have never given a class with too many examples (and believe me, I’ve tried)
- “You can teach a class with no content, only examples; you can’t teach a class with only content, no examples”
- Why examples are awesome:
- Examples are an excellent way to resolve lossy information transfer - they’re a completely different channel of communication than normal. If nothing else, they serve as an error check
- Examples are a great way to transfer tacit knowledge, without necessarily making it legible - this is what it means to build intuition
- Examples can help fit things in to the bigger picture, they can motivate the ideas, and locate where they fit in to the student’s knowledge graph
- By giving the student a pool of motivating examples, they can often generate the ideas themselves by generalising from the examples
- Examples can bridge the gap from “understanding the knowledge in the abstract” to “understanding where to use these ideas, and where they should come up”
- How to use them?
- Often when giving a point, I’ll give a micro-example to give context to it - eg “sometimes straight lines aren’t enough to model data, eg with a quadratic”. This should be quick and effortless, the example should make immediate sense to the students, with 0 inferential steps
- (I can’t believe it’s taken me 18 posts to get to my first nested example :( )
- Examples can be good at the start, to motivate things and show the questions we’re trying to answer. It can be good to give an example, and then constantly refer back to it as we generalise the example into a concept
- After introducing a complex idea, go through a long example and illustrate which parts of the example embody the complex idea
- Use examples to illustrate the importance and relateability of an idea - eg if explaining how to think about good planning to students, give an example of a student with a deadline crisis that they missed - everyone relates to this
- Examples contain a lot of information, so the idea of information prioritisation applies strongly here - tell the students what to pay attention to in the example, and why it’s interesting
- Visual information and diagrams
- Often tacit knowledge manifests as a literal picture in my head - draw this!
- This is another good alternate communication channel
- Our visual memory and processing is often much better than our abilities with language - this can work well for clarified confusing and complex parts
- Pacing
- An easy mistake that I often fall into is to give a section the amount of time it takes me to say it. I convert the ideas into words, and just read through what I’ve come up with.
- This is the fallacy of viewing teaching as giving an information stream! Time should be allocated for people to process and compress information, and they need more time for hard parts and less time for easy parts
- This is hard to get right intuitively - when you understand an idea, it feels easy!
- A good trick: make a high-level summary, and rate each section out of 5 for difficulty. Then go and explicitly give more time to those sections - eg add more intuitions, say the key ideas more, give more examples
- Note - pacing doesn’t mean the speed at which you speak, it’s about the time you give to different ideas!
- Note that difficulty =/= importance. If one part is hard, but unimportant, cut it. Or give a brief overview, explain the important idea, and say “don’t sweat the details”
- Another trick - explicitly tell students which ideas are important and worth paying attention to, and which aren’t
- If doing a practice run (which you totally should), regularly check in with the test audience about pacing - the default state of the world is that you get pacing wrong
- It's key to get pacing right - people zone out in slow, easy sections, and get lost in fast, hard sections. Your job as the teacher is to keep as many people as possible absorbing information at the optimum rate.
- It’s very hard to give accurate time estimates for things - my trick is to have a bonus section at the end which I’ll cut if need be, and to pace in the moment according to my existing notes and my intuitions
- Understand the mindset behind a question
- When somebody asks a question, the default response is to answer it. This is a failure to be deliberate! The student asks a question because they’re confused about something, and your goal is to resolve that confusion - answering the question directly is just a means to an end.
- This is an important distinction, because often questions are weird. They’re confused, or don’t quite make sense, or are asking about unimportant things. This manifests, for me, as the student’s mind not making sense. And it’s easy to get frustrated, or just to answer the dumb question directly. But this is ineffective.
- A related effect - somebody asks a question that isn’t the real question they want to ask. Eg, a student at a university open day who asks “how many A-Levels did you do?”, when what they really care about is “how many A-Levels do I need to do to get in”
- Your goal should be to understand the state of mind from which that question made sense - once you’ve done this, you can often resolve the confusion directly, or answer the question they really care about.
- Do this by asking clarifying questions, trying to answer and saying “did that answer your question?”, giving them several interpretations of what they’re really asking and asking whether any resonate, paraphrasing the question back to them, etc.
- The main trigger to look for here is a note of confusion - the question feeling a bit off, or out of nowhere, something isn’t quite making sense.
- It’s a delicate balance between doing this and moving on with the talk - try to gauge whether many people share the same confusion, if not, just move on
- Often doing this can uncover Pedagogical Content-Knowledge, common ways that people misunderstand the ideas you’re teaching. It’s super valuable to collect these, because then you can recognise them in future and dissolve them directly.
- Illusion of transparency
- A consequence of the typical mind fallacy - it’s easy to think you’ve clearly communicated knowledge when you really, really haven’t. As a consequence, you need to put a lot of effort into being grounded and calibrated - because often a confused audience won’t feel like a confused audience to you.
- Ask questions! Especially ones that highlight the key ideas, eg “how to do easy thing X” or “what was the key idea in here?”
- Do hand-polls - ask people to indicate their understanding by putting their hand up high if it’s clear, and low if it’s less clear. This is a good technique, because most people will actually do it, unlike “does anyone have any questions?” or “is this making sense to people?”
- Seek feedback and iterate
- Teaching is hard. You’re fundamentally trying to convey tacit knowledge, via lossy and low bandwidth communication channels, into an alien mind that you have very limited access to. The default state of the world is that you suck at this
- The solution is to regularly ask for specific, actionable feedback and calibration, and to actually put meaningful effort into updating on this! Feedback is one of the main ways you can better understand the mind of a student.
Teaching 1-on-1
Practicing tutoring and explaining things one on one can often be more valuable! I think a great use of time for most students is to do tutoring - it’s pretty fun, you get paid decently, and you get way better at explaining ideas. And the ability to explain an idea clearly in a conversation is an amazingly applicable skill - I use this all the time in daily life.
The main difference is that it’s a lot easier to get them to be active, and it’s much easier to adapt the pace and difficulty well. Essentially, invert all of the ideas in my post on how to learn from conversation
- The key technique is asking the student to paraphrase what you’ve said back to you
- This forces them to be active, and to process information
- It identifies errors, and helps you to correct them
- Often you can then recognise th
- It helps build a model of what’s going on in their mind
- It helps you calibrate the difficulty and pacing
- If you’re a tutor who doesn’t do get the student to this, I think you’re missing out on a major free win
- This is also super effective when explaining ideas to friends, though can seem a bit rude
- Get the student to tell you the key points/ideas in what you’ve said
- Get the student to generate examples, especially typical examples
- Here, understanding the mindset behind the question is even more important. You should always do this when they ask a question, especially if they seem dissatisfied with your answer.
Teaching Maths
- It’s easy to neglect the tacit information - the intuitions, underlying concepts, motivations. This is terrible. One of the most important parts of teaching maths well is to convey this high-level overview
- Every proof can be heavily compressed. Most proofs have some key ideas, followed by repeatedly doing the obvious thing. “Repeatedly doing the obvious thing” will inevitably be compressed in the student’s mind, so you should skip saying it at all, and just give the key ideas
- Examples, especially motivating examples, are incredibly important. It’s really, really hard to learn a new concept without having a clear motivating example in mind.
- Examples teach tacit knowledge well - they illustrate what you can and can’t do, and why you care about ideas
- After learning rigour for a while, you’ll end up with post-formal intuitions, where you mostly ignore rigour and think intuitively, but can drop into rigour if need be. Most of the cognitive labour in maths is reaching this point, and a good teacher will try to give as much of the post-formal intuition as possible
- Maths, especially pure maths, is often formalising an intuition. Probability is the formal study of uncertainty. Groups are the formal study of symmetries. Topology is the formal study of continuous deformations (things which don’t rip or glue). Pointing this out is vital
- A good way to find these is to notice which questions the topic newly lets you answer. This is a great way to motivate things!
- Diagrams are awesome
- Often you begin being able to answer a type of question with a lot of tacit knowledge, and are expected to pick all this up with examples. Often 80% of this can be captured in an explicit algorithm - this is a great way to make tacit knowledge legible.
Teaching Applied Rationality/Life Advice
- These are much more about tacit knowledge than explicit, so these should be done in a workshop format with a big focus on exercises
- Emphasise that everything is highly personal and subjective - all ideas should be adapted to your mind and your circumstances
- Pairing people up works well for getting them to actually do the exercises!
- Ideally, boil down the tacit knowledge to a rough algorithm, and alternate explaining steps and getting the students to practice them
- The impact of the class is mostly students retaining key insights and mental habits - what the “time when I should apply this idea” feels like from the inside. This is the small fraction of information they’ll retain. Thus it’s your job to boil down the idea to these habits, say this explicitly, and structure the class to reinforce them
- Much of the impact comes from the students taking action after the class - this is hard! You want to emphasise this, and minimise barriers
- Give time for the students to generates lists of ways to apply the ideas, and how they’re relevant in their day to day life
- Give time for them to set reminders for actions taken after the class
- Make it feel actionable - it’s easy to think an idea is important, but for it to feel abstract. Eg, to think that prioritisation is a good idea, but to never get round to it.
- Emphasise how the idea fits into everyday life and everyday problems
- Give a lot of examples of how to use it - this can form connections like “oh! I never thought of using it for that”
- This conveys the tacit knowledge of when to use the idea!
- Emphasise relatability and importance - give examples of a bad situation where the technique wasn’t applied, and make it feel visceral and relatable
Conclusion
If you’re planning on teaching something in the future, I hope these thoughts were useful! But even if not, I think these skills transfer excellently to explaining things in everyday life. And that thinking about teaching can make you a much more effective learner.
I find that often, as a student, I can help the teacher be more effective by asking the right questions - asking them which information is the most important, checking that my understanding is correct by paraphrasing back, asking them for the motivations and higher-level picture. The feeling of “something not fitting into my knowledge graph well” can be made into a pretty visceral one. And realising the habits of students that hinder them from learning, like being passive instead of active, and not trying to do information compression themselves, can help me recognise when I fail to do those things!
17 comments
Comments sorted by top scores.
comment by Liron · 2020-08-29T18:24:41.595Z · LW(p) · GW(p)
My favorite part was the advice to highlight what’s important, and it helped that you applied your own advice by highlighting that the most important part of your lesson is the advice to highlight the most important part of your lesson.
I’ve previously attempted to elaborate on why examples are helpful for teaching: https://www.lesswrong.com/posts/CD2kRisJcdBRLhrC5/the-power-to-teach-concepts-better [LW · GW]
comment by Neel Nanda (neel-nanda-1) · 2021-12-15T10:51:21.635Z · LW(p) · GW(p)
Self review: I'm very flattered by the nomination!
Reflecting back on this post, a few quick thoughts:
- I put a lot of effort into getting better at teaching, especially during my undergrad (publishing notes, mentoring, running lectures, etc). In hindsight, this was an amazing use of time, and has been shockingly useful in a range of areas. It makes me much better at field-building, facilitating fellowships, and writing up thoughts. Recently I've been reworking the pedagogy for explaining transformer interpretability work at Anthropic, and I've been shocked at how relevant all of this is.
- A related idea is that of the Pareto Frontier [LW · GW]. Most people are bad at teaching, this leads to eg Research Debt in academia. I'm a pretty great teacher, but not exactly world-class. But I'm a great mathematician, and trying to become a great AI Safety researcher, and there are very, very few people who are great at both - this gives me a lot of room to explore my comparative advantage by eg writing field-building docs
- I wish I'd better emphasised just how useful a skill this is
- A lot centres on teaching in specific contexts. This is reasonable, since it's what I know, but I wish I'd better clarified what would and would not generalise - I'm afraid people who see this post will bounce off as it's not relevant to them
- I wish I'd given more caveats about teaching gone wrong. My experiences teaching younger people who view me as high-status is that it's very easy to appear over-confident. I try to caveat what I say, but I tend to present as fairly confident, and people often take me way too seriously. While the techniques I present here are v effective at teaching, they have the flipside of better inserting my knowledge into the student's system 1 and bypassing some of their mental filters, which can be bad and eg lead to groupthink and lowered agency.
- Some, such as Socratic method, are better on this front by at least giving me chances to notice if what I'm teaching is wrong
- Sometimes it may be good to deliberately be a bad teacher, to teach the students agency and give them room to grow. on their own and to form their own ideas. It's worth checking for this - I just reflexively use good teaching technique nowadays and it's hard to suppress
- Some ideas such as a knowledge graph are vague intuitions that it would have been good to operationalise more
With all that said, I'd only been blogging for 3 weeks when I wrote this post, and I wrote it in an afternoon, so I'm really happy with this as an artefact to come out of that! I am so, so happy I decided to do a month of daily blogging
comment by Ian Televan · 2021-07-31T19:56:00.130Z · LW(p) · GW(p)
In my experience teachers tend to only give examples of typical members of a category. I wish they'd also give examples along the category border, both positive and negative. Something like: "this seems to have nothing to do with quadratic equations, but it actually does, this is why" and "this problem looks like it can be solved using quadratic equations but this is misleading because XYZ". This is obvious in subjects like geography, (when you want to describe where China is, don't give a bunch of points around Beijing as examples, but instead draw the border and maybe tell about ongoing territorial conflicts) but for some reason less obvious in concept-heavy subjects like mathematics.
Another point on my wishlist: create sufficient room for ambition. Give bonus points for optional but hard exercises. Tell about some problems that even world's top experts don't know how to solve.
comment by romeostevensit · 2020-08-29T09:12:37.586Z · LW(p) · GW(p)
Too many great points to call out. Love this post. My own model for teaching a particular topic from reviewing some pedagogy research was
Pattern break, starting with a vivid hook that gets people's attention by being surprising in some way
Several examples from which the pattern can be inferred
Drawing the student's attention back and forth between the common elements of the examples
Creating a toy example of the core concept that has moving parts the student can then move themselves to see how other parts move (conceptually)
Anchoring the new set of intuitions with a succinct anchor phrase or image that ideally has conceptual hooks into the relevant problem domains so that the concept automatically gets triggered in the situations in which it is useful
This is all much easier said than done, but is a good skeleton for when you really really want people to get something.
Replies from: neel-nanda-1↑ comment by Neel Nanda (neel-nanda-1) · 2020-08-29T10:00:49.433Z · LW(p) · GW(p)
Thanks!
Anchoring the new set of intuitions with a succinct anchor phrase or image that ideally has conceptual hooks into the relevant problem domains so that the concept automatically gets triggered in the situations in which it is useful
Strongly agreed, I've been very pleasantly surprised by how valuable this approach is. I think having a clear label to important intuitions is one of the really valuable things I've gotten from the rationalist community. When writing blog posts, I try fairly hard to give clear labels to the key ideas and to put them in bold.
Creating a toy example of the core concept that has moving parts the student can then move themselves to see how other parts move (conceptually)
I'd be curious to see any examples of this you have in mind? I'm super excited about this as a form of learning, but struggle to imagine a specific example for anything I've tried teaching. This seems better suited to tutoring 1 on 1 than to larger groups/talks, I think?
Replies from: Liron, romeostevensit↑ comment by Liron · 2020-08-29T18:32:32.406Z · LW(p) · GW(p)
Re examples of toy examples with moving parts:
Andy Grove’s classic book High Output Management starts with the example of a diner that has to produce breakfasts with cooked eggs, and keeps referring to it to teach management concepts.
Minute Physics introduces a “Spacetime Globe” to visualize spacetime (the way a globe visualizes the Earth’s surface) and refers to it often starting at 3:25 in this video: https://youtu.be/1rLWVZVWfdY
↑ comment by romeostevensit · 2020-08-29T15:31:18.087Z · LW(p) · GW(p)
It's more scalable with remote learning where each student can access an animation with sliders that they can move themselves. This is extremely valuable for helping math concepts click IME. The intuitions get tuned by directly seeing how some output varies with an input. Otherwise there is manually going through several dimensions, what happens if we vary this vs if we vary that etc.
comment by johnswentworth · 2020-08-29T19:18:23.967Z · LW(p) · GW(p)
I'm always a bit frustrated when people talk about a "knowledge graph"; the concept seems obviously useful, but also obviously incomplete. What precisely are the nodes and edges in the graph? What are the type signatures of these things?
I was thinking about this over breakfast. Here are some guesses.
One simple model is that each "node" in the graph is essentially a trigger-action plan [? · GW]. There's a small pattern-matcher, for example a pattern which recognizes root-finding problems with quadratic functions. When the pattern matches something, it triggers a bunch of possible connections - e.g. one connection might be a pointer to the quadratic equation, another might be a connection to polynomial factorization, etc. Each of those is itself either another node (e.g. the quadratic equation node) or, in the base-case, a simple action to take (e.g. writing some symbols on paper).
In this model, teaching involves a few different pieces:
- Creation of the node itself - just giving it a name and emphasizing importance can help
- Refining the pattern - e.g. practice recognizing root-finding problems with quadratic functions. Examples are probably the best tool here.
- Installing the "downstream" pointers to other concepts, and "upstream" pointers from other concepts to this one. "Downstream" pointers would be things like "here's a list of tricks you can use to solve this sort of problem", "upstream" pointers would be things like "this is itself a root-finding method, so look for quadratics when you need to solve equations, and also use other equation-solving tools like adding a number to both sides".
- Giving weight to upstream/downstream concepts - i.e. indicating which connections are more/less important, so they're properly prioritized in the list of "actions" triggered when a pattern is detected.
- Building the habit of actually checking for the pattern, and actually triggering the "actions" when the pattern is matched. I.e. practice, preferably on a fairly wide variety of problems to minimize "out-of-distribution"-style failures.
So that's one model.
That model seems to capture a lot of useful things about procedural knowledge graphs, but it seems like there's a separate kind of knowledge graph for world-models. The part above is analogous to a program (it guides what-to-do), whereas a world-modelling knowledge graph would be analogous to the contents of a database; it's the datastructure on which the procedural knowledge graph operates. My current best model for a world-modelling knowledge graph looks something like this [? · GW] - it's a causal model recursively built out of reusable sub-models.
Teaching components of the world-modelling knowledge graph would involve somewhat different pieces:
- We'd typically be teaching some prototypical submodel, a building block to use in many different places in the world-model. For instance, in introductory physics these submodels would be things like "masses" and "inclined planes" and "masses on inclined planes".
- Teaching the submodel itself means walking through the components of the little causal subgraph the model specifies - e.g. how masses on inclined planes behave.
- The submodel will have some pattern-matcher associated with it, for recognizing components of the real world to which the submodel applies. This means examples, to practice recognizing e.g. "masses" and "inclined planes" and "masses on inclined planes".
- The submodel will itself have submodels, and these are the pointers out to other nodes. E.g. if there's a submodel for the prototypical mass-on-inclined-plane problem, then it should have a pointer to a "point mass" submodel. Here, the real key is the connections which are not present - e.g. the point-mass submodel doesn't care about the shape of the object in question, it's just approximated as a point.
It feels like there should be a clean way to unify the procedural and world-modelling knowledge graphs. I'm not sure what it is. I'm sure somebody will argue that it's all procedural and the world-modelling is just embedded in a bunch of procedures, but I'm not convinced; it sure feels like there's a graph of data on which the program operates. I could see it working the opposite way, though... maybe it's all world-modelling, and part of the world-model is something like "model of the best way to solve this problem", and our "procedural" behavior is actually just prediction on that part of the world model (sort predictive-processing-esque).
Replies from: Lanrian↑ comment by Lukas Finnveden (Lanrian) · 2020-08-31T18:35:27.892Z · LW(p) · GW(p)
There is some research on knowledge graphs as a data-structure, and as a tool in AI. Wikipedia and a bunch of references.
comment by Mathisco · 2020-08-29T13:43:47.008Z · LW(p) · GW(p)
As I grow older I spend more and more time teaching. I can concur with all points in this post. Sadly it contained no diagrams.
Diagrams are truly awesome. Great diagrams are absolutely amazing. High level summary diagrams are the best. I spend most of my time at work now drawing and explaining diagrams.
comment by Gunnar_Zarncke · 2020-08-31T18:04:45.205Z · LW(p) · GW(p)
Thank you for your comprehensive post. It makes a lot of good points and does a good job of relating them to well-known terminology here. But I am missing sources. Teaching and learning can have counterintuitive effects and we should consider that.
A good overview of what is known about the effectiveness of teaching methods (though mostly in the school-level) is covered in Visible Learning: A Synthesis of Over 800 Meta-Analyses Relating to Achievement
Replies from: Gunnar_Zarncke↑ comment by Gunnar_Zarncke · 2020-08-31T18:05:56.098Z · LW(p) · GW(p)
Interestingly, I just read a thread about Project Follow Through that counterintuitively showed that Direct Instruction is effective (effect size 0.58) but almost never used.
Main source is Theory of Instruction: Principles and Applications.
comment by FallibleDan · 2020-09-01T15:08:49.582Z · LW(p) · GW(p)
Thank you for your outline and pearls. Getting more skillful at framing, as you point out, is a key mindset. The framing of teaching depends on the learner's various states (current abilities in the subject domain(s); physical, social and emotional states, etc.) and the learner's context. Teaching requires that the teacher adjust to the learner's current states and the learner's context, and select the appropriate frames.
One perhaps obvious frame is to think of teaching as "that which enables learning." What enables learning?
Imprinting to the body, including, as pointed out, by doing. Sometimes called "getting the learner's skin in the game." The body is a human's interface with what is, so of course learning relies on bodies. Repetition is a particularly powerful way of getting the body's attention: "Oh, I guess this isn't just a one-off - I keep coming across this experience so I guess I'd better adapt to it." Examples: athletic or musical performance training, doing problem sets in engineering, etc. The body's strategy, including its brain, is that, for the long run (literally), an efficient response to repeated experiences is to hypertrophy muscles/neural pathways.
Example: paraphrasing, as pointed out, is a way to check whether the learner is keeping up. See whether the student "follows." The phrase, "Do you follow me?" uses the language of the body.
Engaging affective valences (joy, fear, longing, satisfaction, appreciation of beauty (e.g., maths concepts are often beautiful)). An appropriate emotional valence is crucial for long-term memory.
Engaging social or intrasocial valences - how can one belong, join, nurture or protect? 99.99% of the human operating system can be regarded as, "mammalian," but, like water to a fish, it's ubiquity makes it invisible to us. Yet who optimally trains or learns in a social vacuum or executes or performs in a social vacuum? Huge stadia and social media platforms and the fact that we love to hear and tell stories are more obvious testimony to the importance of social valences.
- e.g., working in a group (including see one, do one, teach one); getting students to "pair and share" as a way to anchor student mindsets into a learning mode
- e.g., working with a future self, an idealized or shadow self, or with an in-dwelling parent, child, friend, advocate or mentor
Copying the best or what has been honed over time by linking with culture. Think of culture as being the ancient apperceptive mass of humanity's experience and learning. Engage with culture, e.g., by looking up the recent (in English, most words have a Germanic or Latin origin; scientific or technical words may, in addition, have a Greek origin) and more ancient (Proto-indo-european - the payoffs in PIE are often massive) etymologies of any new word and every key concept.
In sum, teaching is often more successful if it has actionable 'relevance' (recent etymology of relevant = "apropos;" ancient etymology = "that which lightens [a burden]"). Learning is easier if the learner or learner's body senses that something is useful or unburdening to him or her or to his or her "group," especially in ways in which the body (including emotion) or culture (especially language) have already provided hooks to latch on to.
comment by Crackatook (peterson-yook) · 2020-08-29T04:20:47.798Z · LW(p) · GW(p)
While delivering knowledges is a major part of education, I just want to mention that post-education is very important because most students stop learning about subjects. When I was young, I joined a science summer camp and had fun and challenging time, but after camp my life hadn't changed at all. Today I reflect past memories and think I may become different person if my learning is continued even after no teachers are around.
Finding outside resources, planning a research or engaging with a community can be desirable habits for students. Also questioning a big picture and real-life applications help students learn about other subjects.
comment by vmehra · 2022-01-07T04:01:47.416Z · LW(p) · GW(p)
>The final part there is key - if the student leaves with a good understanding of the ideas in the abstract, but no idea when to think about the ideas again, it’s no better than if they’d learned nothing at all.
Hard disagree. As you will agree, learning about the world is also supposed to bring excitement and pure joy. That enjoyment doesn't have to always translate into a conscious idea about revisiting the topic again. I have taken a few lectures where my small audience was fully engaged, asking questions and definitely enjoyed their time. But I doubt the subject matter was of direct relevance to most of them in their regular working life. They may occasionally think about it but doubt it's much. Does that mean those lectures were a complete waste of time for them? I don't think so. Just because those lectures didn't fundamentally change their thinking doesn't mean they were useless. And I assure you that these people were "people who genuinely want to learn and be there".
Thank you for writing this, lots of useful advice and ideas to think about.