Posts

Correcting human error vs doing exactly what you're told - is there literature on this in context of general system design? 2022-06-29T21:30:05.753Z
Steganography and the CycleGAN - alignment failure case study 2022-06-11T09:41:28.841Z
Free course review — Reliable and Interpretable Artificial Intelligence (ETH Zurich) 2021-08-10T16:36:39.737Z
Jan Czechowski's Shortform 2021-07-19T21:55:01.201Z

Comments

Comment by Jan Czechowski (przemyslaw-czechowski) on Jan Czechowski's Shortform · 2022-07-04T14:53:17.645Z · LW · GW

What can EA learn from Scouting

epistemic status: random notes for writing something more structured in the future

After EA fUnconference in Berlin, I'm thinking more and more about what we can learn from the scouting movement for purpose of community building/cooperation. Some elements characteristic of scouting (note, it's a rough idea, I don't think all of them are worth reproducing):

EDIT: to clarify (as I'm afraid now in this chaotic draft this may be misunderstood) this is not a list of things I observed on fUnconference. There there was only a partial implementation of the 'small groups' idea, as indicated in the text. Everything else is my memories from scouting times, and I never observed them in any EA group.

  • Arbitrary splitting larger groups into smaller troops (up to 10 people) with an appointed leader. On fUnconference this was implemented as "Huddle" groups, they were used only for daily checkups (how is everyone doing, does everybody feel welcome and is having fun) and once as (soft) default split for team-based games.
  • Assigning lots of meaning to color, name, and symbols related to your small group / larger community
  • Incorporating music both for general singing together but also specific songs for specific occasions (morning, before a meal, evening)
  • Also, spread and evolution of songs - various smaller communities will sing similar things but will develop slight differences in details in time. This gives you both sense of belonging to the whole movement (because you know the same songs) and your local group (because you sing them in a very specific way)
  • Rituals/traditions related to gathering around the campfire - leader of the gathering, a special role for the person sustaining the fire, a special role for the person starting the fire, singing together, order of picking songs to sing, order of speaking, gestures indicating you want to speak next
Comment by Jan Czechowski (przemyslaw-czechowski) on Jan Czechowski's Shortform · 2022-06-29T21:21:52.802Z · LW · GW

That's an interesting post, thanks. But I think you can bring value with your creations even if you're neither pareto optimal nor one-dimension top performer. So if you write scifi stories, you might still write some interesting stuff and deliver inspiring idea even if you don't have unique mixture of non-usual-sci-fi-writer traits in any tangible sense. You're not the best skateboard-scifi writer. You're just an average scifi writer and it's fine. So my point is in this case you might still need motivation. Maybe to develop your unique style or hit the niche market in the future. But if from the very beginning you compare yourself to superhumans boosted to the right side of the distribution by their unique genome or other irreproducible factors, you might never even start.

Comment by Jan Czechowski (przemyslaw-czechowski) on Jan Czechowski's Shortform · 2022-06-28T22:05:08.155Z · LW · GW

Coping with being average: local group. If you want to produce some content like writing, you're very likely to take popular writers as a baseline. But in times of the Internet, those are super-high-performers from the far end of the distribution. You and I are just not as good them (I'm 99% sure the reader is not in top 1%).

This might make you lose motivation. You will produce less then your reference point. Your content will feel medicore. You will get orders of magnitude smaller audience.

Idea how to cope better: instead of taking the whole internet, take a random sample of 100 people from your reference group and select top achievers (in the desired area) from this 100. They will probably be only slightly better than you.

A simple mechanism partially emulating this random sampling is your local group.

Comparing yourself to people who are closer to you on performance axis will probably be more productive. They might have struggled with similar problems as you only recently. They might have some tricks that are applicable in your case. You have higher chance of reaching their level 2 years from now.

In other words, if you want to be a better writer, aspire to be like the best writer in your local EA group, not like EY or Scott Alexander.

Comment by Jan Czechowski (przemyslaw-czechowski) on What's the "This AI is of moral concern." fire alarm? · 2022-06-14T18:38:54.607Z · LW · GW

Hm, I was also thinking of moral value of children in this context. At least in my perception, important part of the moral value is the potential to become a conscious, self-aware being. In what sense does this potential translate to artificially created beings?

Maybe if in neural network parameter space there's a subspace of minds with moral value, also points close to this subspace would have moral value?

Comment by Jan Czechowski (przemyslaw-czechowski) on What's the "This AI is of moral concern." fire alarm? · 2022-06-14T18:33:56.739Z · LW · GW

I think the idea with internal activations manipulation is interesting. It might require some refinement - I think activations of encoder-decoder transformer model are a function of inputs, and they change with every token. At first, the input is your prompt, then it's your prompt + generated tokens. So the protocol / task for GPT3 would be: generate now 5 tokens, so with the last generation this logit is maximized? Also, it depends on hyperparameters of beam search which are controlled by human

Comment by Jan Czechowski (przemyslaw-czechowski) on alexlyzhov's Shortform · 2022-06-13T21:28:57.570Z · LW · GW

I just logged in there, thanks - however they have a couple of disclaimers that this is not a place for beginner / technical support questions. But I suppose it's more to avoid people asking "how do I install pytorch?" rather than AI safety questions, so maybe it will suit the needs of OP

Comment by Jan Czechowski (przemyslaw-czechowski) on alexlyzhov's Shortform · 2022-06-13T21:23:26.840Z · LW · GW

There's an AI safety camp slack with #no-stupid-questions channel. I think people stay there even after the camp ends (I'm still there although this year edition ended last week). So you can either apply for next years edition (which I very much recommend!) or maybe contact organizers if they can add you without you being AISC participant/alumni? Just a disclaimer, I'm not sure how active this slack is between camps, and it might be that lot of people leave after the camp ends.

Comment by Jan Czechowski (przemyslaw-czechowski) on Jan Czechowski's Shortform · 2022-06-13T21:18:42.453Z · LW · GW

Thought experiment: Can question answering language model without memory, instantiated separately for every session (like GPT-3) be pursuing a goal? Does such setup exclude agency? ----Imagine you put thousand super-geniuses in prison, each in separate cell. You will call them in random order to interrogation room and always ask only one question, listen to the answer and send them back. Each person will be called only once.----- Super-geniuses are allowed to devise a strategy before you lock them. Their goal is to manipulate you to release them. Will they manage? If they are intelligent enough, the fact that each one answers only one question without knowing how many others were interrogated before and what their answers were shouldn't be a unworkable problem.

Comment by Jan Czechowski (przemyslaw-czechowski) on Probability that the President would win election against a random adult citizen? · 2022-06-01T22:14:29.042Z · LW · GW

Current president of Poland went from being relatively unknown parliament member (I'd bet, less than 5% of general public knew who he was? Also there's another public figure with the same last name, so maybe even some people who knew the name were confusing him with the other guy) to being the president within one year. So with longer timescale, big support from one of leading political powers, and some political backlog it is possible. There are surely similar stories around the world, of outsiders rising to leaders. Of course what you're asking is much more extreme, but that provides some staring datapoint.

Comment by Jan Czechowski (przemyslaw-czechowski) on MikkW's Shortform · 2021-10-01T20:08:12.745Z · LW · GW

I'm not really sure if food shakes are the cheapest nutrition possible. They are optimized for time and convenience. Yeah they are probably cheaper than eating out, but I'm not sure how low you can get with cooking yourself from basic products, theoretically scaling up for a larger group of people. I guess thinks like armies and monasteries might have it figured out. Maybe interesting to check what's the average price of feeding a soldier / monk?

Comment by Jan Czechowski (przemyslaw-czechowski) on TurnTrout's shortform feed · 2021-10-01T20:05:07.136Z · LW · GW

As usual, the deadweight loss is quadratic in the size of the tax.

Can you give some clarifications for this concept? I'm not sure what you mean here.

Comment by Jan Czechowski (przemyslaw-czechowski) on How to Sleep Better · 2021-10-01T19:59:19.900Z · LW · GW

Thanks for this piece of advice - I actually invested in custom earplugs, it cost me around 60eur and 2h of visits and commute. They are not really perfectly comfortable in the night, but they help in the morning, when my Wife gets up earlier than me, she doesn't wake me up anymore.

Comment by Jan Czechowski (przemyslaw-czechowski) on Jan Czechowski's Shortform · 2021-10-01T19:54:52.101Z · LW · GW

Well, for me 'logging in to work after 10AM' seems much worse than 'starting to get ready to sleep after 10PM'. It's a very natural and strong Schelling fence. Therefore the idea to play in the mornings rather than nights.

Comment by Jan Czechowski (przemyslaw-czechowski) on The Best Software For Every Need · 2021-09-20T13:45:13.743Z · LW · GW
  • for ripgrep. This is my standard code analysis and config searching tool, I use it multiple times a day. Always feels like papercut when I work in a server with only grep
Comment by Jan Czechowski (przemyslaw-czechowski) on Sherrinford's Shortform · 2021-08-01T13:38:49.520Z · LW · GW

Did you consider looking at it rather from "options" than "goals" perspective? Rather than defining goals and looking for the optimal path to get there, you can look at /brainstorm exploitable options that you have available and seem to have high returns. And then prioritize them. I recently spent half a day writing down cool ideas for things to do, then collected them in todoist, and since then, whenever I have time I go through them. And add something new.

Comment by Jan Czechowski (przemyslaw-czechowski) on What are some triggers that prompt you to do a Fermi estimate, or to pull up a spreadsheet and make a simple/rough quantitative model? · 2021-07-26T20:49:12.238Z · LW · GW

Recent spreadsheet situations:

  • I had a free day and I didn't have much inspiration on how to spend it. So I decided to sit down and rethink my goals, habits. This made me realize that I have a time-tracking record collected from the last 8 months, so it's a good moment to put them in a spreadsheet and analyze
  • I was organizing a birthday party and wanted to invite various groups of friends. I didn't want the party to be too big, but I wanted to know when I still have space to invite something more. So I created a spreadsheet, grouping people in columns with the probability that they will actually show up (got an invitation but no answer: 30%, confirmed no: 0%, confirmed yes: 90%).
  • I was organizing a weekend stag do for a friend, with ~10 participants. There were various activities and costs involved, and different people paid for different things. So I asked everyone to put the costs in a spreadsheet and then made a detailed calculation about who put how much money, who should carry costs for what (not everyone used both nights of accommodation for example), calculated a balance for everyone involved.
Comment by Jan Czechowski (przemyslaw-czechowski) on What are some triggers that prompt you to do a Fermi estimate, or to pull up a spreadsheet and make a simple/rough quantitative model? · 2021-07-26T20:38:52.044Z · LW · GW

I notice that with regards to many things I always think of at least one of the following aspects:

  • Money
  • Time
  • Risk

As each of those is quantifiable, it prompts me to actually put some numbers on the given problem.

Comment by Jan Czechowski (przemyslaw-czechowski) on Failing safely is the anomaly · 2021-07-26T20:34:37.906Z · LW · GW

I wonder how much professional sports and general engagement in artificial conflicts are (anti-)correlated with actual conflicts in the given group? I always considered sports club identification a civilizational device to satisfy tribal needs for ingroup vs outgroup conflicts without causing any real conflicts.

Comment by Jan Czechowski (przemyslaw-czechowski) on Rana Dexsin's Shortform · 2021-07-26T20:21:13.820Z · LW · GW

I happen actually to be in my very specific allocated time for "30 mins of LW reading and writing". But usually, this site is a procrastination hole for me, so thanks. Still, I must say, a very life-improving procrastination hole.

Comment by Jan Czechowski (przemyslaw-czechowski) on Sherrinford's Shortform · 2021-07-26T20:16:56.085Z · LW · GW

The best book I have ever only read the review of: Selfish Reasons to Have More Kids Seems to have a good summary of evidence for upbringing if you're interested in the subject.

Comment by Jan Czechowski (przemyslaw-czechowski) on Jan Czechowski's Shortform · 2021-07-26T20:14:57.811Z · LW · GW

Video Games protocol: I like video games. Some of them are really life-changing stories. I have some titles on my list that have the potential to be really cool adventures. However, I'm hesitant to try them due to some considerations:

  • They are very immersive experiences and will not let you do anything else at the same time. I like mixing activities
  • A standard AAA RPG playthrough takes around 100 hours. One can argue that the same amount of positive experience can be taken from one good book (~10 hours)
  • Lastly, video games are so addictive it's hard for me to stop playing once I start. I always play longer than planned. When making a break I want to go back to the game as soon as possible.

I'm the most concerned with the last point right now. I have some ideas to manage that better, and I'm open to new ones:

  • When playing make sure there's always wallclock time displayed within eyesight. Preferably integrated on your screen. I think I once found a Skyrim mode especially for that
  • Include some accountability (ask your partner to remind you the game time is over) or time tracking with daily reflection
  • Only play in the mornings, before work. If you want to play longer, you have to get up earlier. And you cannot play too long because, well, you need to get to work.
Comment by Jan Czechowski (przemyslaw-czechowski) on Jan Czechowski's Shortform · 2021-07-19T21:55:01.911Z · LW · GW

Bayesian Signaling: good way to think about signaling is handcrafting a piece of evidence, that for the other person will be objectively strong evidence for the claim that you're making. Hearing X saying "I'm pretty smart" is a weak evidence for the hypothesis "X is smart". Seeing a Harvard's degree with X's name on it is much stronger evidence. Hearing X saying "I'm a millionaire" is a weak evidence for the hypothesis "X is a millionaire". Receiving a 10000$ gift from them is much stronger evidence.

Comment by Jan Czechowski (przemyslaw-czechowski) on Is the argument that AI is an xrisk valid? · 2021-07-19T15:18:30.014Z · LW · GW

I didn't read your full paper yet, but from your summary, it's unclear to me how such understanding of intelligence would be inconsistent with the "Singularity" claim

  • Instrumental superintelligence seems to be feasible - a system that is better at achieving a goal than the most intelligent human
  • Such system can also self-modify, to better achieve its goal, leading to an intelligence explosion
Comment by Jan Czechowski (przemyslaw-czechowski) on How to Sleep Better · 2021-07-16T20:47:37.753Z · LW · GW

I've heard about that, but I think it only make sense to try if you diagnose yourself as a mouth- breather?

Comment by Jan Czechowski (przemyslaw-czechowski) on How to Sleep Better · 2021-07-16T20:45:53.885Z · LW · GW

Anyone really had good long-term experience with earplugs? I use them during vacations in various conditions and it's usually a life saver, but after a couple of nights my ears get sore and I dread the thought of putting in earplugs AGAIN. Don't really feel it's something I can use at home on regular basis. Also at home I usually don't have problems with noise, so maybe it's not a high priority intervention.

Comment by Jan Czechowski (przemyslaw-czechowski) on How to Sleep Better · 2021-07-16T20:43:32.512Z · LW · GW

Thank you for the post.

  • It motivated me to really invest more time and effort into sleep quality - although I think it's pretty good it should be worth exploring if my productivity can be improved by better sleep
  • Summarized most of the things I knew already: sleep cycles, avoid screens etc
  • Motivated me to do something with the things I knew already (it's 11pm and I'm writing this on my full-brightness phone) *Motivated me to try something new: an app or a fancy gadget
Comment by Jan Czechowski (przemyslaw-czechowski) on Could MMRPGs be used to test economic theories? · 2021-07-14T09:46:01.899Z · LW · GW

I think the Corrupted Blood incident in WoW might be an interesting context here (not exactly economy but still multi-agent phenomenon). It was apparently used in research for epidemic modeling.

Comment by Jan Czechowski (przemyslaw-czechowski) on Reasons for Punishment · 2021-07-14T09:32:17.804Z · LW · GW

In "Making a Statement" category you mean punishing may be a signalling strategy? That if I punish somebody for X (especially a friend or family member) I send an expensive signal that I'm really against X? "Signalling" looks like a common word around here, so it might be worthwhile to reword it using this vocabulary.

Comment by Jan Czechowski (przemyslaw-czechowski) on Reasons for Punishment · 2021-07-14T09:29:22.057Z · LW · GW

Not sure if you've seen the Anti Social Punishment post: https://www.lesswrong.com/posts/X5RyaEDHNq5qutSHK/anti-social-punishment In short, research shows some people tend to punish pro-social behaviours. To take vengeance on others for making the feel uncomfortable with their own antisocial behaviours, I guess? Not sure if it fits your list as a separate category but still an interesting context.

Comment by Jan Czechowski (przemyslaw-czechowski) on Reasons for Punishment · 2021-07-14T09:24:35.207Z · LW · GW

Hm, I would say Discouragement as you described is more generic: X happens and you want it to happen less, so you discourage it. My idea was to underline the Discouragement taken to the extreme level. X can happen, and you don't want it to ever happen. So you make a precommitment / ultimatum. In this sense Discouragement is the effect of ongoing, regular punishment, Precommitment is the effect of punishment that might happen in the future. But let's not discuss the words too much- in your future post please organize and describe as you wish. Looking forward to reading those btw.

Comment by Jan Czechowski (przemyslaw-czechowski) on Reasons for Punishment · 2021-07-12T22:48:08.239Z · LW · GW

After reading it and some comments I also see:

  • Precommitment - you precommited to punish X to avoid X, hoping you will never have to execute the threat. I guess in this sense "punishment" as a word makes sense even for something that is never executed.
  • Social absolution - if somebody undergoes an official punishment, the people he encounters later may be more willing to integrate him into the group - without the official punishment there would be a neverending, unofficial exclusion
Comment by Jan Czechowski (przemyslaw-czechowski) on Chess and cheap ways to check day to day variance in cognition · 2021-07-07T16:54:57.203Z · LW · GW

I think once on LW I found checking the reaction time with some app as a proposed fast and uncomplicated benchmark to investigate cognition correlations with time of the day, sleep deprivation etc.

Comment by Jan Czechowski (przemyslaw-czechowski) on Chess and cheap ways to check day to day variance in cognition · 2021-07-07T16:52:01.722Z · LW · GW

How many games you play daily, and what time do you play them? If you have data from different times of the day, do you noticed any patterns? I once had a feeling that with some tasks at work I can "unblock" and really get the job done only after 5 or 6PM. But never put any serious effort into observing it for a longer period of time.

Comment by Jan Czechowski (przemyslaw-czechowski) on Consequences of Bayesian Epistemology? · 2021-07-07T16:41:15.706Z · LW · GW

I think for me the main takeaway was that to have better beliefs about the world I don't have to look for Proofs but rather Evidence. So if I try to evaluate hypothesis/belief H based on some observed reality R, I shouldn't ask: "does R prove/disprove H"? Or: "can H explain R"? But rather: "how complicated is H's explanation of R"? And then update my beliefs about H and then move on and look for further evidence.

Comment by Jan Czechowski (przemyslaw-czechowski) on Rationality Yellow Belt Test Questions? · 2021-07-07T16:25:29.359Z · LW · GW

I'm recently considering if problems like ones from International Olympiad in Linguistics (https://ioling.org/) can be a good exercise or test for some aspects of Rationality. See example problems here: https://ioling.org/booklets/iol-2018-indiv-prob.en.pdf Usually you are given 10-20 sentences in some obscure or ancient language with translations and are tasked with translating some more lines. The generic strategy would be:

  • Generate some hypotheses about how the language works
  • Think of base probabilities and ways to test the ideas
  • Test selected ideas using given data
  • Update your beliefs
  • Repeat

In my opinion this kind of problems is a sweet spot between exactness and "soft" understanding of the world and language in general. Physics and mathematics problems might be too much to the "exactness" end of the scale.

I don't really expect that the Rationalist Test as you described it would really contain IOL-like problems, but hopefully it gives some inspiration