Posts

ACX Montreal Meetup April 27th 2024 2024-04-23T03:58:53.933Z
ACX Montreal Meetup April 13th 2024 2024-04-10T20:22:48.603Z
ACX Meetups Everywhere Spring 2024, Montreal, QC 2024-03-22T23:15:15.085Z
ACX Montreal Meetup March 30th 2024 - Spaced Repetition 2024-03-22T20:25:48.374Z
ACX Montreal Meetup March 16th, 2024 - Film Screening 2024-03-11T20:02:46.282Z
ACX Montreal Meetup March 2nd 2024 2024-02-27T04:07:30.784Z
ACX Montreal Meetup February 17th 2024 2024-02-13T21:41:12.768Z
ACX Montreal Meetup February 3rd 2024 2024-01-30T21:21:08.045Z
ACX Montreal Meetup January 20th 2024 2024-01-15T21:37:32.131Z
ACX Montreal Meetup January 6th 2024 2024-01-02T05:56:45.955Z
Montreal Solstice Celebration 2023-12-16T04:48:36.311Z
ACX Montreal Meetup December 23rd 2023 2023-12-16T04:27:07.052Z
ACX Montreal Meetup December 9th 2023 2023-12-03T21:38:33.729Z
ACX Montreal Meetup November 25th 2023 2023-11-21T04:10:46.848Z
ACX Montreal Meetup November 11th 2023 2023-11-04T05:54:36.779Z
ACX Montreal Meetup October 28th 2023 2023-10-25T05:21:20.258Z
ACX Montreal Meetup October 14th 2023 2023-10-02T17:04:29.008Z
ACX Montreal Meetup September 30th 2023 2023-09-21T16:06:20.320Z
ACX Montreal Meetup September 2nd 2023 2023-08-25T21:57:20.176Z
ACX Meetups Everywhere Fall 2023, Montreal, QC 2023-08-14T07:42:51.739Z
ACX Montreal Meetup August 19th 2023 2023-08-10T04:11:04.026Z
ACX Montreal Meetup August 5th 2023 2023-07-29T05:34:29.361Z
ACX Montreal Meetup July 22nd 2023 2023-07-11T02:17:11.615Z
ACX Montreal Meetup July 9th 2023 2023-06-29T04:47:00.972Z
ACX Montreal Meetup June 24th 2023 2023-06-19T22:48:23.348Z
ACX Montreal Meetup June 10th 2023 2023-06-01T04:18:40.184Z
[Venue Change] ACX Montreal Meetup May 27th 2023 2023-05-17T03:00:00.977Z
ACX Montreal Meetup April 22th 2023 2023-04-02T22:58:41.375Z
Introducing AlignmentSearch: An AI Alignment-Informed Conversional Agent 2023-04-01T16:39:09.643Z

Comments

Comment by BionicD0LPH1N (jumeaux200) on ACX Montreal Meetup November 11th 2023 · 2023-11-06T06:12:20.939Z · LW · GW

Thanks for noticing and pointing it out!

Comment by BionicD0LPH1N (jumeaux200) on Introducing AlignmentSearch: An AI Alignment-Informed Conversional Agent · 2023-08-12T23:50:13.449Z · LW · GW

The API token was cancelled, sorry about that. The most recent version of the chatbot is now at https://chat.stampy.ai and https://chat.aisafety.info, and should not have the API token issue.

Comment by BionicD0LPH1N (jumeaux200) on Introducing AlignmentSearch: An AI Alignment-Informed Conversional Agent · 2023-04-28T18:31:02.890Z · LW · GW

I'm glad to hear you're trying to catch up with the alignment ecosystem!

It is still supposed to be live and active, and it still works for me. Are you sure you have https://alignmentsearch.up.railway.app? If so, then I'm not sure what's going on, it worked for everyone who I know that tried. If you have a different link, maybe we've been linking to the website incorrectly somewhere so please share the link you do have.

Edit: just realized you weren't speaking of https://alignmentsearch.up.railway.app, I thought it was a standalone comment. I'm getting the same 404 error for the aisafety.world link.

Comment by BionicD0LPH1N (jumeaux200) on Introducing AlignmentSearch: An AI Alignment-Informed Conversional Agent · 2023-04-02T20:15:15.280Z · LW · GW

Thanks!

  1. Just did. :)
  2. We've seen the channel, yes, though haven't messaged in it or anything.
Comment by BionicD0LPH1N (jumeaux200) on Introducing AlignmentSearch: An AI Alignment-Informed Conversional Agent · 2023-04-02T20:07:18.897Z · LW · GW

Thanks for the comment!

At this point, we don't have a very clear plan, other than thinking of functionalities and adding them as fast as possible in an order that seems sensible. The functionalities we want to add include:

  • Automatic update of the dataset relatively often.
  • Stream completions.
  • Test embeddings using SentenceTransformers + Finetuning instead of OpenAI for cost and quality, and store them in Pinecone/Weaviate/Other (tbd); this will enable us to use the whole dataset for semantic search, and for the semantic similarity to have more 'knowledge' about technical terms used in the alignment space, which I expect to produce better results. We also want to test and add biases to favor 'good' sources to maximize the quality of semantic search. It's also possible that we'll make a smaller, more specialized dataset of curated content.
  • Add modes and options. HyDE, Debate, Comment, Synthesis, temp, etc. Possibly add options to make use of GPT-4, depending on feasibility.
  • Figure out how to make this scale without going bankrupt.
  • Add thumbs-up/down for A/B testing prompt, the bias terms, and curated vs uncurated datasets.
  • Add recommended next questions the user can ask, possibly taken from a question database.
  • Improve UX/UI.

We have not taken much time (we were very pressed for it!) to consider the best way to onboard collaborators. We are communicating on our club's Discord server at the moment, and would be happy to add people who want to contribute, especially if you have experience in any of the above. DM me on Discord at BionicD0LPH1N#5326 or on LW.

The current version has trouble answering basic AI safety questions.

That's true sometimes, and a problem. We observe fewer such errors on the full dataset, and are currently working on having that up. Additional modes, like HyDE, and the bias mentioned earlier, might further improve results. Getting better embeddings + finetuning them on our dataset might improve search. Finally, when the thumbs up/down feature is up, we will be able to quickly search over a list of possible prompts we think might be more successful, and find the ones that reduce bad answers. Overall, I think that this is a very solvable problem, and are making rapid progress.

About curating the dataset (or favoring some types of content), we agree and are currently investigating the best ways to do this.

About walking people through the extended alignment bingo, this is a feature we're planning to add. Something that might make sense is to have a slider for 'level-of-expertise', where beginners would have more detailed answers that assume less knowledge, and get recommended further questions that guide them through the bad takes bingo.

The feedback function for wrong answers is one of our top priorities, and in the meantime we ask you give the failing question-answer pairs in our form.