Posts
Comments
(I probably shouldn't interact, but I would at least like to perform a small case study on what happened here, so I am going to try just out of curiosity.)
Human substrate is generally optimised for running one human, but can be repurposed for a variety of purposes. In particular, while memes can lodge themselves quite deeply inside someone, this process is quite inflexible, and generally humans run arbitary processes X by thinking 'what is the X thing to do here?.
Somewhere between the point where [the-generative-process-that-has-generated-for-itself-the-name-'LightStar'] generated this comment, and the point where I read it, a human took 'LightStar's dialogue and typed it into a comment and submitted it.
I would like to clarify that I am speaking to that human, you, and I would like to hear from you directly, instead of generating 'LightStar' dialogue.
Could I ask you how you ended up here, and what you were doing when this happened?
I would advise that in the cases where people have a sudden revelation about rationality, they generally try to internalise it, and the case where they instead decide to give an internal generative process it's own lesswrong account and speak with every fourth sentence in italics is generally quite rare, and probably indicates some sort of modelling failure.
We generally use 'shard' in 'shard of Coordination' or 'shard of Rationality' to mean a fraction, a splinter, of the larger mathematical structures that comprise these fields. The 'LightStar' generative model has used the article 'the' in conjunction with 'shard', which as used here is kind of a contradiction - there is no 'the' with shard, it's only a piece of the whole. This distinction seems minor, but from my perspective it looks like it's at the center of 'LightStar'.
'LightStar' uses 'the' a lot about itself, describes itself as 'the voice of Humanity and Rationality and Truth', and while yes, there is only one correct rationality, I don't think 'LightStar' contains all of it, or is comprised only of a fragment of it, I think that whether or not 'LightStar' contains such a shard it also contains other parasitic material that results in actions taken that don't generally correspond to just containing such a shard.
I think this model is defective - try returning it to where you found it and getting another one, or failing that, see if they give refunds. I would be curious about your thoughts on the whole thing, where the 'you' in 'your' refers not to [the-generative-process-that-has-generated-for-itself-the-name-'LightStar'] but to the human that took that dialogue and typed it into the comment box.
A surprising amount of human cognition is driven purely verbally/symbolicaly - I recall a study showing that on average people with a native language that had much more concise wording/notation for numbers could remember much longer numbers. As a relatvely verbal person, my intuition about the relationship between observation and vocabulary would be that to know something is to be able to say what it means to know it, but then again it's possible that my case doesn't generalise and that I just happen to rely on symbol-pushing for most of my abstract cognition (at least, that portion of abstract cognition that isn't computed using spacial reasoning).
I was going to write
"Making an observation isn't an atomic action. In order to compress noisy, redundant short-term sensory data into an actual observation stored in long-term memory you need to perform some work of compression/pattern recognition, e.g. the sensory data of ▟ ▟ ▟ ▟ ▟ ▟ ▟ ▟ ▟ ▟ ▟ ▟ ▟ ▟ ▟ ▟ ▟ is compressed into the observation 17 steps
, and how you do that is a partially conscious decision where you have to choose what type of data to convert ▟ ▟ ▟ ▟ ▟ ▟ ▟ ▟ ▟ ▟ ▟ ▟ ▟ ▟ ▟ ▟ ▟ into."
But in retrospect it's possible that from your perspective ▟ ▟ ▟ ▟ ▟ ▟ ▟ ▟ ▟ ▟ ▟ ▟ ▟ ▟ ▟ ▟ ▟ is the thing-you've-been-using-the-word-observation-to-mean, and you can store that in your long tem memory just fine and I just happen to throw away or refuse to reason about everything that isn't sufficiently legible.
I’d also note that has anyone tried carrying around an umbrella all the time?
I have! This probably doesn't have any useful metaphorical properties, but my outdoor nonraincoat has deep enough pockets that my umbrella only barely pokes out the top, so I just leave it in that pocket 24/7.
It's nice to just not worry about whether or not it will rain, and it counterbalances the weight from my battery pack in my other coat pocket.
(I don't know if I'd recommend it - I have an unreasonably light coat for how warm it is so I can spare the weight budget, and I derive a small amount of joy from being Slightly-More-Prepared-Than-Is-Reasonable; it's a tradition with my parents to try and see a pantomine each year in support of a local theatre and when I watched the cast pull out actual nerf super soakers for the deploy-water-at-the-audience-bit I managed to draw my umbrella fast enough to keep myself mostly dry, a fate which my adjacent family members did not share.)
It might just be status quo bias or cynicism-driven pattern-matching, but I feel like for any given deadline, paxlovid-is-illegal is more of a "stable state" than paxlovid-is-legal - it feels like it would be easier to lock the general public into 'paxlovid is dangerous/untrustworthy/ineffective" with a campaign against it than it would be to lock the general public into a state of "paxlovid is safe and works and we use it" with a campaign for it, although now I'm actually trying to visualise a world in which paxlovid remains illegal indefinitely in the face of evidence I feel less confident in that cynicism than I did two weeks ago.
We still need to prevent this from becoming an assassination market - we need some mechanism that prevents the equal-and-opposite outcome of a professional FDA lobbyist/activist purchasing shares in the FDA not approving Paxlovid, and then going on to run a campaign to prevent it.
You might want to try recruiting from people from a more philosophical/mathematical background as opposed to recruiting from a programming background (hopefully we might be able to crack the problem from the pure logic perspective before we get to an application), but yeah now that you mention it "recruiting people to help the AGI issue without also worsening it" looks like it might be an underappreciated issue.
Do you think it will ever be possible to simulate a human mind (or analagous conscious mind) on a deterministic computer?
Do you think it possible in principle that a 'non-deterministic' human mind can be simulated on a non-deterministic substrate analagous to our current flesh substrate, such as a quantum computer?
If yes to either, do you think that it is necessary to simulate the mind on the lowest level of physics (e.g. on a true simulated spacetime indistinguishable from the original) or are higer-level abstractions (like building a mathematical model of one neuron and then using this simple equation as a building block) permissible?
(Also, are you just asking about Robert Roger Penrose's view or is this also your view?)
I don't have a specific mental image for what I mean when I say 'non-deterministic', I was placing a bet on the assumption that YimbyGeorge was hypothesizing that conscious was somehow fundamentally mysterious and therefore couldn't be 'merely' deterministic, based on pattern-matching this view rather than any specific mental image of what it would mean for consciousness to only be possible in non-deterministic systems.
When you say 'require new physics that can explain consciousness', are you imagining:
"New insight shows human brain neuron connections have hundreds of tiny side channels that run much faster than the main connections, leading scientists to conclude that the human brain's processing apacity is much greater than previously thought"
or
"New insight reveals thin threads that run through all connections and allow neurons to undergo quantum superposition, allowing much faster and more complex pattern-matching and conscious thought than previously thougt possible, while still remaining overall deterministic"
or
"New insight shows that the human mind is fundamentally nondeterministic and this somehow involves quantum mechanics"
or
"New insight shows souls are fundamental"
What do you (or your interpretation of Robert Roger Penrose) think a new physics insight that would make consciousness go from mysterious to non-mysterious look like?
I would also note that most modern-day AI like GPT-N are not actually optimisers, just algorithms produced by optimisation processes - the entity of [GPT-N + its trainer + its training data] could be considered an optimiser (albeit a self-contained one), but as soon as you take GPT-N out of that environment it is a stateless algorithm that looks at a short string of text and provides a probability distribution for the next letter. When it is run in generative mode, the set of its weights and answers will be no different from its isolated guesses when being trained.
I think it might be useful to think about what it would mean for the sentence “time is an illusion” to be true, or to be false.
There’s a certain contrarian perspective I find it useful to be able to take, by applying that perspective to this question I generated a few scripts:
“Of course a timeline is defined as a static object – if you take time out of the world-in-which-your-model-is-embedded and put it directly into your model you’ll find that there’s no time left over for your model to change in. There’s only one dimension of time, so you can’t have a model of time that changes, because if you did then I would have to ask exactly what your model changes over, if not time; once you take time and put it in the model, you can’t have time that operates outside the model!”
“A couple of your other confusions (if time doesn’t pass, how can you do anything? isn’t everything useless?) seem to be related to the free will/determinism conflict (if everything we do is already determined, what use is it making any decisions?). I occasionally find people that think that determinism constrains and prevents free will, because if your decision is just the result of initial conditions of the things you’re making the decision over, then how do you have the freedom to choose either option?, to which I reply, if you weren’t able to make the decision just based on everything you knew, what else would you posit is affecting it? What third-party source of noise has affected the process by which you are making decisions, so that all options are somehow possible?”
“The feeling-of-time-passing and the feeling-of-yourself-thinking are one and the same – you couldn’t alter one and not affect the other; the feeling-of-time-passing is the result of you moving to new mental states over new thoughts and new memories, as a result of being part of a chain of cause-and-effect that includes yourself as an element and as a subject. The feeling of making decisions and resolving sensory input into data, the feeling of entropy ratcheting forwards, is the feeling of time passing.”
“What would it mean for there to be a now that isn’t ‘subjectively’ defined by the you at each moment? Would that posit a perfect, timeless crystal that also happens to have a little arrow that says ‘you are here’, that moves forward at the rate of one crystal-second per second?”
“The time-that-is-now is the current position of the you within that timeless crystal, and all of them are correct – you can’t have a timeless crystal that has a ‘current time’ on it for the same reason that you can’t have a locationless map beamed onto the moon that also has a ‘you are here’ arrow on it.”
“There is no arrow of time – the more we understand physics, the more timeless our understanding becomes, but there IS an arrow of entropy (defined by taking a given starting state and running physics in either direction), and asking ‘what would it mean for use to go in the opposite direction from entropy, remembering our future but not our past’ yields a contradiction; the direction we experience entropy in is the direction we experience time passing in, because that's what it means to be affected by entropy."
I can’t put full epistemic emphasis on any of these individually, but I hope you can see the general line that I’m working from.
My conjecture is that the abillity for characters to explain counting, equal quantities, and causal relations has been deliberately removed to allow those latter processes (with particular emphasis on the scientific method) to be used, to illustrate the general foundation of empiricism without needing to resort to an actually difficult problem where there might be genuine ambiguity about the answers. The lack of a common ground about causal reasoning and reality allows the process to described from the ground up, instead of resting on previous vague intuitions about reality and beliefs.
Weird but self-consistent explanation: The Chamber of Secrets itself isn't a Parselmouth. It's not really using the Parseltounge API, it just uses the raw audio of the speaker.
I didn't participate in last week's babble, and I also went from three stars to two stars - I think it might just be a miscount
Edit: Here's some buffer text, because the sidebar shows things even inside spoilers. Buffer text, buffer text, buffer text, buffer text, buffer text, buffer text, buffer text, buffer text, buffer text, buffer text, buffer text, buffer text, buffer text, buffer text, buffer text, buffer text, buffer text, buffer text, buffer text, buffer text, buffer text, buffer text, buffer text, buffer text, buffer text, buffer text, buffer text, buffer text, buffer text, buffer text, buffer text, buffer text, buffer text, buffer text, buffer text, and more buffer text.
- Keep it in your pocket
- Hide it inside a chair leg
- Bury in it your garden
- Attach a weight to it and throw it over a pier into the ocean (use a fishing magnet to retrieve)
- Surgically implant it in yourself
- Surgically implant it in a volunteer
- Use your bank's safe deposit box
- Pay someone else to hide it for you
- Hide it inside a notable artifact that's already in a museum
- Hide it inside a public monument, so you know that it'll be safe over the duration
- Decoy pen
- Make a chest of drawers with a false bottom
- Hide it in a regular chest of drawers, underneath all the clothes
- Place it underneath the floorboards
- Disassemble it and reassemble its components into non-pen objects then reassemble it later
- Dissolve it in acid (a nobel prize winner once used this to hide their medal)
- Invent thermos flasks, and once they are popularised hide the pen in the vacuum between the exterior wall and the interior wall
- Sneak into a construction site, and bury the pen underneath drying concrete
- Hide it in an attic
- Hide it in a basement
- Hde it inside a tree
- Bury it underneath the ballast of a railway track
- Dissasemble it into single-material components, and then surround each component with a block of identical material. Use as paperweights
- Hide it in between the double walls of a house
- Invent bin liners. Hide between a bin and its bin liner
- Cut open a teddy bear, and hide inside
- Make a plumbing-standard pipe, with a mesh at each end. Hide pen inside pipe. Insert into your plumbing system
- Hide it inside a hat
- Coat in precious metals, attach a chain, wear as jewelry
- Hide inside a fake book
- Hide inside skirting boards
- Hide inside a door frame
- Hide inside a door
- Hide inside the barrel of a gun
- Hide inside the shaft of a spear
- Hide inside a rail on which curtains are mounted
- Hide on top of a tall shelf to tall for the forces of evil to reach
- Give it to a miner to hide inside a mine
- Hide it inside a teacher's cane
- Hide it inside a window pane
- Hide it on the underside of a train (if the force of evil are blindly following magic einstein-pen-trackers this should throw them for a loop)
- Sneak it onto a ship under construction, and hide it inside the material of the ship
- Give to someone onboard a ship you know will sink on that voyage and whose wreck will not be investigated for the relevant duration
- Hide it inside a corpse's burial clothes
- Hide it inside a casket/coffin/whatever-is-the-correct-name-for-the-boxes-people-are-buried-in
- Reveal myself to a government of my choice and explain the dilemma. Leave the hiding to them
- Throw it up into a gap inside the ornaments on the roof of a historical building
- Hide inside a walking stick
- Disguise as a different type of pen. Gift to political leader (so it will be both safe and well-kept)
- Hide inside a conductor's baton
Time: 47 minutes
Notes:
A couple of times I found myself adding an 'honourable mentions' box, and then finding a way to include it in the main list ('Gift it to someone -> that's not exactly hiding it -> disguise it first' and 'Hide it underneath a bin liner -> bin liners weren't invented yet -> invent bin liners first'). I used some somewhat absurd solutions the previous two challenges, and inspired by the more down-to-earth answers of others so I tried to limit myself to the possible (nothing outside the technology range available to me, so no inventing AIs to do the job for me) although in the spirit of babble I generally tried to make every idea somewhat workable.
- Pick the lock
- Smash down the door
- Unscrew the door from its hinges and remove it
- Escape through a vent
- Hack the door
- Alter your administration records to show that you ought to be released
- Wait 10 years for natural administration changes to destabilise your holders until it is no longer remembered that you should be kept here
- Use a significant portion of your 10 years of energy to create an explosion that destroys the walls
- Walk out between the comically spaced bars
- Bend the bars to allow you to escape
- Dig into the wall and remove the bars from where they were mounted
- Dig out through a tunnel
- Smash through the wall
- Dig through the wall
- Dig through the ceiling
- Bribe the guards
- Persuade the guards
- Seduce the guards
- Use your phone to contact people outside to break you out (probably bribed)
- Make your imprisonment a political thing, and then wait for administration change
- Make your imprisonment a political thing, and then wait for your supporters to break you out
- Write and publish a high-quality book series under a pseudonym using your real face, make yourself a prominent social figure, purchase a house remotely and pay someone to make it look lived-in, and then one day act as though you have suddenly been mistaken for someone else and kidnapped
- Use the energy in your phone battery to create an explosion to destroy a wall
- Wait for someone else to escape and let everyone else out
- Use the energy in your phone battery to melt the bars
- Use the energy in your phone battery to melt the door lock bolt
- Hire someone outside to invent teleportation, and then teleport you out
- Invent teleportation remotely using your phone to access lab assets, and then teleport yourself out
- Create an AI on your phone to vastly improve the data gain from using your camera, use this to scan yourself, and use your phone to access lab assets outside and print an identical copy of yourself, and then terminate yourself
- Create an AI on your phone, and then let it do the work of figuring out the rest of the plan
- Figure out the resonant frequency of the door, and then use your phone to destroy it
- Ditto, but for the walls
- Ditto, but for the bars
- Conquer the world remotely, and then let yourself out
- Hack a 3D printer outside, print a robot, and then guide that robot to get you out
- Hack the five most prominent social media websites, and guide the world towards freeing you (or your reference class)
- Hack the five most prominent social media websites, and use your influence to dismantle the influence and property of the owners of where you are being kept
- Buy the-place-where-you-are-being-kept, and then walk out as the owner
- Smash though a window
- Remove the window without breaking it, and then escape through the gap left
- Look up the make and model of the lock on LPL's youtube channel, and pick it with the instructions provided
- Reverse engineer the strange physics of your phone battery/yourself, and use this knowledge to escape in ways unfathomable to myself
- Hack the military administration systems of a foreign country, and mislead an automated drone strike such that it destroys one wall of your room
- Hack the military administration systems of a foreign country, and edit yourself in as an informant on their side who needs to be retrieved
- Remotely edit the law such that it does not allow for you to be kept here, and then appeal your case
- Hack into a rocket mid-flight, and control it such that it crashes into where you are being held
- Hack a nearby chemicals processing plant, and cause an explosion that destroys a wall of your room
- Appeal to the matrix controllers
- Crowdsource ideas online
- Identify a small but relatively resourceful group of people with an understanding of rationality and an interest in physics, hack an account that previously made a post on babbling, make a new post on escaping a locked room, and then reveal in the comments that the situation is real and the prize of an object unknown to physics (yourself and a phone of limitless battery) is also real, I'm being serious we might have a solution to entropy here and all you have to do is get mE OUT-
If I had to guess, the reasoning behind it is to nudge the game closer to a 'true' prisoner's dillemma (trying to work out if your opponent is willing to cooperate, rather taking focus away from it towards the shallower problem of trying to work out if your opponent is a copy of you)
Is the number of rounds per matchup going to be in the tens, or the thousands?
Edit: I just realised you specified in the post
- Saturn V rocket
- SpaceX Starship
- Shine a torch (the something you are sending is light photons)
- Point a radioactive source towards the moon
- Point a neutrino emitter
- Send data by interfacing with the satelites around the moon
- Invent teleportation, and then teleport there
- Create a machine that can exactly copy your object and then destroy the 'original', use it to get the object to the moon, and then resolve the resulting dilemma about whether it's the 'same' object with your choice of counterargument
- Space elevator
- Just walk
- Sit on a nuke, and then detonate it
- Actually, just detonate a nuke anywhere in an atmosphere, and statistically speaking at least one air particle is likely to make it
- Project Orion Battleship (possible overlap with previous two)
- Jump really high
- Make a seesaw, stand on one end, and have a heavy object fall from lunar orbit onto the other end
- Climb a ladder
- Ohvyq n gbjre bs onory, ohg fgbc jura vg ernpurf gur zbba
- Decelerate the moon until it crashes into earth, so that everything on the earth is also on the moon
- Wait for a quantum event of negligable probability to take it there
- Promise someone £1 billion conditional on them getting the object to the moon
- Use that one company whose thing is sending products to the moon so they can say that there's one of those products on the moon
- Wait for someone else to attempt a lunar landing mission, and sneak the object onboard (fun fact: it's already been done with the lunar art museum https://www.youtube.com/watch?v=csInNn6pfT4)
- Reclassify Earth as the moon
- Reclassify anything reachable as the moon, and then place the object on that
- Place the object on the Apollo lunar samples
- Monorail
- Regular rail
- Rail, but with more than two rails
- Use a plane to accelerate really fast in-atmophere, and use that momentum to get to the moon
- Ditto, but with running
- Cannon
- Trebuchet
- Catapult
- Make something that disables Earth's relative gravitational field for an object, and turn it on at the right time
- Accelerate really fast in a submarine, and use that momentum to get to the moon
- Use a fishing rod to tether the moon, and reel it in until it's close enough
- Use a fishing rod to tether the moon, and reel yourslef towards it
- Use a fishing rod to tether another celestial body, and reel yourself towards it until you're in the path of the moon, then wait for it to arrive
- Stand on a mirror that faces downwards, which rises upwards on a stream of photons bouncing between this mirror and another mirror on earth that faces skywards
- Project Sea Dragon
- A tower comprised of Delta-V stages
- A space shuttle, with more stages
- Make a bridge that spirals from earth to the moon, and then walk
- Ditto, but with a road vehicle
- Fold an A4 piece of paper 45 times, and then climb up the resulting stack
- Climb up a rope
- Climb up a fireman's pole, and then slide down the last fraction to the moon
- Gather a large number of golf balls, and hit each one away from yourself to accelerate
- Use a spray can
And finally, the most impractial item on the list
50. SLS
I don't know if this is already known, but you might be interested in the fact that you can currently use start prompts for GPT-2.