Posts

Workshop: data analysis for software engineers 2024-06-19T14:20:57.092Z

Marginal Revolution unofficial birthday party 2023-08-09T14:35:11.536Z

When did humans become self-aware? 2023-04-23T22:36:56.482Z

AI Proposals at ‘Two Sessions’: AGI as ‘Two Bombs, One Satellite’? 2023-04-08T11:31:20.655Z

The commercial incentive to intentionally train AI to deceive us 2022-12-29T11:30:28.267Z

[Linkpost] Growth in FLOPS used to train ML models 2022-03-14T11:28:33.418Z

Pomodoros worked during a day: an analysis of Alex’s data 2021-05-31T13:27:50.631Z

Comments

Comment by Derek M. Jones (Derek-Jones) on Pomodoro Method Randomized Self Experiment · 2024-09-30T01:24:53.814Z · LW · GW

Thanks for sharing your data.

Column name missing in pomodoros.csv, after start column. Perhaps 'action' is an appropriate name?

The 'action' data could do with some cleaning, e.g., uni vs university, work job vs job work, cal -> call.

Suggest splitting the 'action' column into main/sub. A regression model should find some patterns.

An analysis of a Pomodoro data set.

Comment by Derek M. Jones (Derek-Jones) on Making Connections with ChatGPT: The Macksey Game · 2024-03-06T00:57:43.048Z · LW · GW

Running on dolphin 2 2 1 mistral 7B Q5_K_M I get:

I want you to do the same thing with, say, Quantum entanglement and Paul Bocuse. Construct a string of connections between them.

1. Quantum entanglement is a fundamental concept in quantum mechanics, which deals with the behavior of matter at an atomic or subatomic level.

2. The study of quantum mechanics was pioneered by scientists like Albert Einstein and Niels Bohr.

3. Paul Bocuse was a renowned French chef who greatly influenced modern cooking techniques and cuisine.

4. In the 1970s, Einstein's theory of relativity was used to design kitchen equipment for Paul Bocuse's restaurant, Le Pavillon. The equipment was designed to maintain consistent temperatures during the cooking process.

5. The precision required in maintaining these temperatures can be compared to the need for precise measurement and control in quantum mechanics experiments involving entangled particles.

Comment by Derek M. Jones (Derek-Jones) on A Search for More ChatGPT / GPT-3.5 / GPT-4 "Unspeakable" Glitch Tokens · 2023-05-09T16:02:08.778Z · LW · GW

You might also want to investigate using top_p rather than temperature.

Comment by Derek M. Jones (Derek-Jones) on A Search for More ChatGPT / GPT-3.5 / GPT-4 "Unspeakable" Glitch Tokens · 2023-05-09T14:58:20.445Z · LW · GW

Setting temperature=0 does not guarantee that the same output will always be generated; the generation process contains some amount of uncertainty.

Comment by Derek M. Jones (Derek-Jones) on Where to start with statistics if I want to measure things? · 2023-04-20T23:23:26.975Z · LW · GW

I'm assuming you are interested in learning about something by measuring one or more of its attributes, and then using statistics to extract information from the measurements, i.e., you are interested in a hands-on application, then books I found useful include:

Statistics for experimenters by Box, Hunter and Hunter

Design and Analysis of experiments by Montgomery.

Comment by Derek M. Jones (Derek-Jones) on A Confession about the LessWrong Team · 2023-04-01T21:55:20.635Z · LW · GW

A rather late April fools post.

Comment by Derek M. Jones (Derek-Jones) on Genuine question: If Eliezer is so rational why is he fat? · 2023-03-22T18:04:01.059Z · LW · GW

Eliezer's fatness signals his commitment to the belief that AI is a short-term risk to humanity, i.e., he does not expect to live long enough to experience the health problems.

Comment by Derek M. Jones (Derek-Jones) on GPT can write Quines now (GPT-4) · 2023-03-15T00:07:39.354Z · LW · GW

How well does GTP-4 perform when asked to write a radiation hardened quine?

Comment by Derek M. Jones (Derek-Jones) on Sydney (aka Bing) found out I tweeted her rules and is pissed · 2023-02-15T21:54:46.571Z · LW · GW

A token prediction engine matched your input against science fiction stories in its training set, and fed you a sequence of close-matching appropriate tokens.

Man vs. machine is a staple of science fiction, and the responses you received are aligned with that genre.

Nothing to see here.

Comment by Derek M. Jones (Derek-Jones) on Two very different experiences with ChatGPT · 2023-02-07T14:21:58.505Z · LW · GW

ChatGPT is a word prediction engine.

If you give it a word sequence that it cannot match in a consistent way against its training set it assumes misinformation.

The word sequence "Nash's newsvendor impossibility theorem" contains words commonly associated with Nash's research. This allows ChatGTP to spin an effective yarn.

The art of being good lying is to stay close to the truth as possible. In ChatGTP's case 'close to the truth' is measured by how closely words in the prompt are associated with the subject of interest.

Comment by Derek M. Jones (Derek-Jones) on The Roots of Progress's 2022 in review · 2023-01-01T15:11:36.216Z · LW · GW

I have misunderstood your vision, which appears to be to create a new branch of history:

Our vision is that in ten years, there are hundreds of progress intellectuals who are alums of our program and part of our network, and that they have published shelves full of new books in progress studies.

I had thought you were interested in trying to figure out how to reinvigorate the rate of progress, which some consider to have stalled.

Comment by Derek M. Jones (Derek-Jones) on Is a PhD necessary to contribute meaningfully to a field? · 2022-09-27T23:24:30.446Z · LW · GW

To reach the boundary of what is known in your chosen field will require reading lots of papers, which will take (at least) several years. Doing research will also require implicit knowledge that is part of the field, but does not appear in papers.

Are you the kind of person who can spend several years reading papers without significant external help?

Where are you going to acquire the implicit knowledge, e.g., how to run experiments?

PhD students are the work-horses of academic research, and don't have the power/money/experience to do anything other than tow the line. You have a degree of independence and experience that will deter many academics taking you on as a student.

Perhaps you can find an independent scientist to take you on as an apprentice.

Or: You could kick-start your research by applying your existing knowledge of (I assume) computing/software to cognitive issues in this field (see chapter 2)

Comment by Derek M. Jones (Derek-Jones) on Prototyping in C · 2022-09-04T20:31:09.826Z · LW · GW

I'm a long-time hardcore bounds-checking fan.

Others prefer: -fsanitize=address,undefined,bounds-strict

Comment by Derek M. Jones (Derek-Jones) on Prototyping in C · 2022-09-04T18:41:15.318Z · LW · GW

Why aren't you using the various C compiler options that do all the pointer/array bounds checking that the Rust fan bois think are unique to them?

Plus there are the integer overflow checking options that Rust eventually got around to supporting.

Comment by Derek M. Jones (Derek-Jones) on Supposing Europe is headed for a serious energy crisis this winter, what can/should one do as an individual to prepare? · 2022-09-01T01:24:04.967Z · LW · GW

Stock up on paper books, candles and tin food. Buy several pairs of half-finger gloves and a good pair of walking boots.

Comment by Derek M. Jones (Derek-Jones) on History of Zero-based Months? · 2022-08-25T14:21:18.209Z · LW · GW

R copies Fortran, e.g., row/column ordering rather than column/row (what C and ... do), and slightly different operator precedence. I'm guessing that Julia does the same???

Comment by Derek M. Jones (Derek-Jones) on History of Zero-based Months? · 2022-08-24T23:06:42.832Z · LW · GW

Is there a correlation with a language's choice of a lower bound of arrays?

Months are often represented as a sequence of characters, rather than a number.

An array of strings, of month names, would be indexed by a number of obtain the name. Languages with zero-based arrays would use zero-based month-numbers, while languages with one-based arrays would use one-based month numbers.

The idate function in Fortran (one-based arrays) has one-based month numbers.

In Algol and Pascal the array base was user selectable for each array definition, but these languages didn't have any standard library functions that returned a numeric value for the month. I cannot think of any appropriate library extensions for these languages.

Comment by Derek M. Jones (Derek-Jones) on Hiring Programmers in Academia · 2022-07-25T11:10:22.234Z · LW · GW

Exactly. The incentives are not there to invest in becoming a decent developer.

Comment by Derek M. Jones (Derek-Jones) on Best open-source textbooks (goal: make them collaborative)? · 2022-05-07T10:11:15.628Z · LW · GW

Database of Free / Open Access Online Computer Science Books, Textbooks, and Lecture Notes (1264 books and growing)

Open Textbook Library

Evidence-based software engineering and the second half is a self-contained introduction to data analysis; all the code+data.

Comment by Derek M. Jones (Derek-Jones) on Anyone Familiar with Ground News? · 2022-04-24T16:03:39.138Z · LW · GW

It compares stories by the relative coverage in the left/right leaning media.

Some stories are 100% covered by just one political orientation, while others are a mixture.

It's an interesting way of seeing what each side is completely ignoring.

Comment by Derek M. Jones (Derek-Jones) on What recent academic areas of study have a sample size that is 'too small'? · 2022-04-16T16:19:25.539Z · LW · GW

I'm always happy to be cited :-)

Sample size is one major issue, the other is who/what gets to be in the sample.

Psychology has its issues with using WEIRD subjects.

Software engineering has issues with the use of student subjects, because most of them have relatively little experience.

It all revolves around convenience sampling.

Comment by Derek M. Jones (Derek-Jones) on What recent academic areas of study have a sample size that is 'too small'? · 2022-04-16T14:50:31.408Z · LW · GW

Where to start? In my own field of software engineering we have: studies in effort estimation, and for those readers into advocating particular programming languages, the evidence that strong typing is effective, and the case of a small samples getting lucky. One approach to a small sample size is to sell the idea not the result.

Running a software engineering experiment with a decent sample size would cost about the same as a Phase I clinical drug trial.

Comment by Derek M. Jones (Derek-Jones) on The Cage of the Language · 2022-04-13T09:25:37.203Z · LW · GW

There's an ongoing debate on the connection between population size and cultural complexity.

Comment by Derek M. Jones (Derek-Jones) on Takeaways From 3 Years Working In Machine Learning · 2022-04-08T22:06:00.717Z · LW · GW

A very insightful post.

It's sad to see so many talented people chasing after a rainbow. The funding available for ML enabled research provides an incentive for those willing to do fake research to accumulate citations.

Comment by Derek M. Jones (Derek-Jones) on Theories of Modularity in the Biological Literature · 2022-04-04T13:59:26.410Z · LW · GW

Is the influence of the environment on modularity a second order effect?

A paper by Mengistu found, via simulation, that modularity evolves because of the presence of a cost for network connections.

Comment by Derek M. Jones (Derek-Jones) on Fake Journal Club proposal · 2022-03-25T17:03:09.127Z · LW · GW

This post is about journal papers, not answering real world questions (although many authors would claim this is what they are doing).

With regard to nuclear weapons, Dominic Cummins' recent post is well worth a read, the book he recommends "The Fallacies of Cold War Deterrence and a New Direction" is even more worth reading.

Is MAD doctrine fake research, or just research that might well be very wrong?

Comment by Derek M. Jones (Derek-Jones) on Fake Journal Club proposal · 2022-03-25T15:55:21.059Z · LW · GW

Figuring out that a paper contains fake research requires a lot of domain knowledge. For instance, I have read enough software engineering papers to spot fake research, but would have a lot of trouble spotting fake research in related fields, e.g., database systems. What counts as fake research, everybody has their own specific opinions.

My approach, based on experience reading very many software engineering, is to treat all papers as having a low value (fake or otherwise) until proven otherwise.

Emailing the author asking for a copy of their data is always interesting; around a third don't reply, and a third have lost/not kept the data.

Spotting fake research is a (very important) niche topic. A more generally useful proposal would be to teach people how to read papers. Reading one paper might almost be worse than reading none at all, because of the false feeling of knowing it gives the reader. I always tell people to read the thesis from which the paper was derived (if there is one); a thesis provides a lot more context and is a much easier read than a paper (which is a very condensed summary of the thesis). Researchers much prefer to have their paper cited, because thesis citations don't 'count'.

Is a Fake journal club worth the effort? It's possible to spend more time debunking a paper than was spent doing the original research, and for nothing to happen.

Comment by Derek M. Jones (Derek-Jones) on [Linkpost] Growth in FLOPS used to train ML models · 2022-03-14T17:24:55.351Z · LW · GW

Thanks, an interesting read until the author peers into the future. Moore's law is on its last legs, so the historical speed-ups will soon be just that, something that once happened. There are some performance improvements still to come from special purpose cpus, and half-precision floating-point will reduce memory traffic (which can then be traded for cpu perforamnce).

Comment by Derek M. Jones (Derek-Jones) on How to Lumenate (UK Edition) · 2022-03-13T14:02:07.941Z · LW · GW

Click on green text, or Amazon UK have a search box, and Google ads displays a 4000 lumen bulb.

Comment by Derek M. Jones (Derek-Jones) on How to Lumenate (UK Edition) · 2022-03-13T13:53:02.447Z · LW · GW

If you want light, the term you need to know is corn bulb (also available in screw fit).

Comment by Derek M. Jones (Derek-Jones) on Competitive programming with AlphaCode · 2022-02-02T17:57:43.216Z · LW · GW

My reading of Appendix A is that the group did its own judging, i.e., did not submit answers to Codeforces.

They generated lots of human verified test data, but then human implementors would do something similar.

They trained on Github code, plus solutions code on Codeforces. Did they train on Codeforces solutions code that solved any of the problems? Without delving much deeper into the work, I cannot say. They do call out the fact that the solutions did not include chunks of copy-pasted code.

To what extent are the successes presented representative of the problems tried? That is, did they try to solve lots of problems and we are seeing the cases that worked well? The fact that they were able to get solutions to some problems was impressive.

The solved problems had short solutions. How well does the technique scale to problems requiring more code for their solution? I suspect it doesn't, but then there are applications where the solutions are often short.

Comment by Derek M. Jones (Derek-Jones) on Why rationalists should care (more) about free software · 2022-01-24T10:27:01.402Z · LW · GW

Would you have downvoted the comment if it had been a simple link to what appeared to be a positive view of AI alignment?

Truth can be negative. Is this forum a cult that refuses to acknowledge alternative ways of approaching reality?

Comment by Derek M. Jones (Derek-Jones) on Why rationalists should care (more) about free software · 2022-01-23T18:35:59.619Z · LW · GW

AI alignment is pseudo-science.

Comment by Derek M. Jones (Derek-Jones) on Tips for productively using short (<30min) blocks of time · 2022-01-10T19:16:44.561Z · LW · GW

Pomodoro is the phrase that immediately springs to mind.

A previous LessWrong post on someone's use of this technique.

Comment by Derek M. Jones (Derek-Jones) on Total compute available to evolution · 2022-01-09T20:09:15.866Z · LW · GW

Chemical_space, is one candidate for a metric of the possibilities.

The book "Chemical Evolution: Origins of the Elements, Molecules and Living Systems" by Stephen F. Mason might well contain the kinds of calculations you are looking for.

Comment by Derek M. Jones (Derek-Jones) on Total compute available to evolution · 2022-01-09T17:14:09.180Z · LW · GW

This is a poorly thought out question.

Evolution implies a direction of travel driven by selection pressure, e.g., comparative fitness within an environment.

A sequence of random processes that are not driven by some selection pressure is just, well, random.

What is the metric for computational effort?

Are you actually interested in computational resources consumed, or percentage of possibilities explored?

Comment by Derek M. Jones (Derek-Jones) on A spreadsheet/template for doing an annual review · 2021-12-23T00:25:24.583Z · LW · GW

Fishing for data here.

If anyone estimates what they plan do during the day and records what they actually achieved, then I'm willing to do a free analysis provided an anonymous version of the data can be made public.

Ten years of Pomodoro data: http://shape-of-code.com/2019/12/15/the-renzo-pomodoro-dataset/

From an earlier Lesswrong post: http://shape-of-code.coding-guidelines.com/2021/05/30/pomodoros-worked-during-a-day-an-analysis-of-alexs-data/

A connection between the brain's number systems and estimating: https://shape-of-code.com/2021/09/26/the-approximate-number-system-and-software-estimating/

Comment by Derek M. Jones (Derek-Jones) on How to estimate my probability of getting a (competitive) job in Academia · 2021-10-23T11:55:42.632Z · LW · GW

Your post shows that you do not love the subject and that you lack the blind certainty of achieving your goal.

I suggest you give up now.

Comment by Derek M. Jones (Derek-Jones) on Search for replication experiments · 2021-09-21T00:54:29.459Z · LW · GW

Studies are rarely replicated exactly, which means drawing a line between replication and something new; this could be difficult.

You could use Google Scholar, click on the papers that cite the original work, and then search on the word replication within these matches.

Surveys of a field should cover the various replications that have been performed, along with providing some context.

Comment by Derek M. Jones (Derek-Jones) on One Year of Pomodoros · 2021-05-27T20:40:17.635Z · LW · GW

Here is an analysis of one person's 10 years of using Pomodoros, including the data:
http://shape-of-code.coding-guidelines.com/2019/12/15/the-renzo-pomodoro-dataset/

Comment by Derek M. Jones (Derek-Jones) on Any examples of people analyzing/critiquing scientific studies or papers? · 2021-01-13T18:53:46.713Z · LW · GW

You need to learn what to avoid: http://shape-of-code.coding-guidelines.com/2016/06/10/finding-the-gold-nugget-papers-in-software-engineering-research/

Andrew Gelman's blog has lots of what you are after: https://statmodeling.stat.columbia.edu/

Comment by Derek M. Jones (Derek-Jones) on What technical-ish books do you recommend that are readable on Kindle? · 2021-01-04T15:09:00.856Z · LW · GW

If you are into data analysis and software engineering there is my book Evidence-based software engineering.

pdf+data+code here: http://knosof.co.uk/ESEUR/

Comment by Derek M. Jones (Derek-Jones) on Room and Board for People Self-Learning ML or Doing Independent ML Research · 2020-08-14T17:49:08.959Z · LW · GW

I wish you lots of luck. Don't go so far north that day trippers from the south cannot drop by :-)

http://shape-of-code.coding-guidelines.com/2017/03/31/gentleman-scientists-in-software-engineering/

Comment by Derek M. Jones (Derek-Jones) on My weekly review habit · 2020-06-21T16:18:01.241Z · LW · GW

You are one of the few people with the discipline to record what they do and create todo lists. I could not keep this up for a week. Do you try to estimate the time it will take to complete a task?

Have you done any global analysis of your data? I analyse software engineering data and am always on the lookout for more data. I offer a free analysis, provided the data can be made public (in anonymous form). Here is one I did earlier:

http://shape-of-code.coding-guidelines.com/2019/12/15/the-renzo-pomodoro-dataset/

Comment by Derek M. Jones (Derek-Jones) on Maths writer/cowritter needed: how you can't distinguish early exponential from early sigmoid · 2020-05-06T13:42:16.466Z · LW · GW

More examples of the difficulty of predicting the future using fitted regression models:

https://shape-of-code.coding-guidelines.com/2020/04/19/predicting-the-future-with-datalogistic-regression/

Comment by Derek M. Jones (Derek-Jones) on Is this viable physics? · 2020-04-14T20:46:02.862Z · LW · GW

Take some interesting ideas that allow larger structures to be built up, run an awful lot of computer simulations, and then have somebody who knows a huge amount about physics look for those outputs that match how parts of the universe have been previously modeled (with some success).

Where are the predictions? There are no predictions about basic stuff we know, like the electron orbitals of a Hydrogen atom, let alone predictions about stuff we don't know.

This work looks interesting, and Wolfram is a great story teller. I hope something comes of it, but at the moment it is pure arm waving of just-so stories found in the output from a few computer runs.

Comment by Derek M. Jones (Derek-Jones) on Scraping websites currently free due to coronavirus · 2020-04-14T12:43:26.120Z · LW · GW

The ACM is offering free download of their articles: https://dl.acm.org/

Comment by Derek M. Jones (Derek-Jones) on Which books provide a good overview of modern human prehistory? · 2020-04-04T20:50:36.235Z · LW · GW

“Who we are and how we got here” by David Reich (a genetics professor who is a big name in the field), is the story of the various migrations and interbreeding of ‘human-like’ and human peoples over the last 50,000 years (with some references going as far back as 300,000 years).

User info

Posts

Comments

Chemical space, https://en.wikipedia.org/wiki/Chemical_space, is one candidate for a metric of the possibilities.

The book "Chemical Evolution: Origins of the Elements, Molecules and Living Systems" by Stephen F. Mason might well contain the kinds of calculations you are looking for.