Reforming rot13
post by DanielVarga · 2011-11-04T23:41:21.315Z · LW · GW · Legacy · 35 commentsContents
35 comments
rot13.com is a service frequently used here to hide spoilers. I really hate it, though. If I had the time, I would build a simple, but much better alternative. Maybe somebody has more time to do that, so I'll share my rough specification:
- Dynamic. The encoding happens on the client automatically, whenever the content of the input textbox changes. No Submit key needed.
- Fully client-side. No server-side script, just a static page with Javascript.
- Decoded content can be reached through an encoded URL. Instead of the receiver pasting the ciphertext into the rot13.com textbox, the sender can provide an encoded URL, and the receiver simply clicks on the link. This is slightly less convenient for the sender, but is much more convenient for the (hopefully many) receivers.
- Many people can decrypt parts of rot13 ciphertexts without conscious effort. This defeats the purpose. The new service could provide some alternative encryption that is not a substitution cipher. It should still be a reciprocal cipher, that is a nice property of rot13.
- rot13 fails for non-ASCII characters. (rot13.com leaves them intact.) The alternative encryption should work for all Unicode strings.
- Strong, password-based encryption could also be added as an extra feature. I could send you a very long URL, and tell you a password on the phone, or only tell you the password at a later date.
The only important design problem I don't know how best to solve is making the encryption work for Unicode, with the following three constraints: making it a reciprocal cypher, outputting visually nice strings, and making it map ASCII to ASCII. One possible solution is to drop the constraint that it is a reciprocal cypher. For this service it is probably not crucial anyway: the ciphertext can be base64, with some escape prefix distinguishing it from plaintext.
After writing the above, I found this LW thread: Does anyone else find ROT13 spoilers as annoying as I do? There were several suggestions there, and two of the commenters, sketerpot and LightningRose even coded their own solutions to the spoiler problem. Each solution had some merit, and LightningRose's in particular was far superior to the rot13.com site I used to use, but basically, they only dealt with the third point of my proposal.
Is there anything like what I envision? Is anyone interested in building it? What changes or extra features would you like to see?
35 comments
Comments sorted by top scores.
comment by Vladimir_Nesov · 2011-11-05T22:58:25.465Z · LW(p) · GW(p)
The purpose of rot13 is hiding of spoilers. An improvement over current situation should be in methods for hiding spoilers, not in methods for managing rot13, which is a lost purpose. (I realize the post applies to use of rot13 alternative on arbitrary sites, but that particular conversation shouldn't be on LW.)
What is needed is support for spoiler markup and hidden-until-activated rendering. This is how it's implemented as an extension to Markdown on Stack Exchange.
Replies from: HonoreDB, DanielVarga, lessdazed↑ comment by DanielVarga · 2011-11-06T14:55:20.532Z · LW(p) · GW(p)
I agree that site-specific solutions are superior (if they are actually implemented). I agree that considering this, my original post was quite off-topic. Fortunately, my original off-topic thread about a proposed general solution was hijacked into an on-topic conversation about the best LW-specific solution. I think satt and Misha gave a quite acceptable temporary solution that would deserve some more visibility. Meanwhile, HonoreDB implemented my proposal (big thanks, HonoreDB!). So everything turned out fine.
↑ comment by lessdazed · 2011-11-06T07:22:27.499Z · LW(p) · GW(p)
It's done in this thread.
comment by satt · 2011-11-05T01:15:35.165Z · LW(p) · GW(p)
I think one alternative works right now, although it's not elegant: use Markdown's ability to add a title attribute to links for spoilers. Make a link (a spurious one pointing nowhere, if necessary) with the bracket syntax and then, after the URL, add a space and your spoiler text in double quotes. The Markdown would look something like
[visible link text](http://fake.a.url/or_you_could_link_Wikipedia "spoiler shown when link is hovered over")
An example: hover over this text.
This isn't a disinterested suggestion, by the way. I find ROT13 far more annoying than spoilers so I'd prefer a simple hover-to-reveal trick.
Replies from: pedanterrific↑ comment by pedanterrific · 2011-11-05T04:33:20.126Z · LW(p) · GW(p)
This brings up something I've been meaning to figure out for a while: is it possible to get to hovertext (on images or hyperlinks) on smartphones? Touchscreens don't really support mouseover, after all.
If not (and all my research indicates not), this would be rather annoying for me, as that's how I most frequently access the site.
Replies from: RobinZ, satt↑ comment by RobinZ · 2012-10-05T19:11:51.567Z · LW(p) · GW(p)
I was thinking about this question today and had an idea: instead of putting a fake URL, link to a ROT-13 site like decode.org with a URL that decrypts to the desired text. Thus, if you wanted to say "Snow is white" in a spoilered fashion:
[Spoiler](http://decode.org/?q=Fabj+vf+juvgr "Snow is white").
equals
↑ comment by satt · 2011-11-05T13:26:27.496Z · LW(p) · GW(p)
Good question. I don't have a smartphone and don't know. And now I think a bit more about it, I wonder about screen readers. If screen readers indiscriminately read out links' title text then people using them would automatically get spoiled.
comment by ArisKatsaris · 2011-11-05T23:21:20.301Z · LW(p) · GW(p)
First of all, if you're using Firefox, just install some addon like "LeetKey" which allows you to rot13 by selecting the text and choosing ROT13 from the popup menu, or even by using some shortcut key. It's stupid to have to go to a different site to decode, regardless of whether it's javascript-based or server-side scripted.
Secondly, we just need an enhancement of the available markdown tags like siodine suggests in a comment below.
Replies from: Xachariah↑ comment by Xachariah · 2011-11-06T11:40:08.175Z · LW(p) · GW(p)
I came here to suggest LeetKey too. It's client side, and supports a number of reversible transformations. Most of the encryptions work with all types of punctuation (morse/rot13 being the exceptions). Some people might be able to decrypt rot13, a handful morse, and a couple hexadecimal; nobody is going to accidentally read AES encryption. It supports password based encryption/decryption for those encryptions that require it. Also, it's free and easy to use. The following encryptions all took about 2 seconds each. If we were all using leetkey, swapping to a new spoiler type would be as easy as picking and choosing.
Here's a spoiler in Rot13. Urer'f n fcbvyre va Ebg13.
Here's a spoiler in base64. SGVyZSdzIGEgc3BvaWxlciBpbiBiYXNlNjQu
Here's a spoiler in AES encryption. (Key: lesswrong)
XqqfeDMBITAhITAhLV5g2FnewrRa20y8iuzAA/AttnBO8f40jPAUYDulZnYyVi16mco=
You can also type in reverse, if you really want. .tnaw yllaer uoy fi ,esrever ni epyt osla nac uoY
Edit: Removed binary, morse, DES encryption, etc because they looked really annoying on the page and that many examples were unnecessary.
comment by siodine · 2011-11-04T23:44:47.869Z · LW(p) · GW(p)
Replies from: Vladimir_Nesov, drethelin↑ comment by Vladimir_Nesov · 2011-11-05T23:59:14.710Z · LW(p) · GW(p)
(Dislike presence of tooltip text obscuring the text already appearing in the mouseover area. No need for the many flavors of specialized spoilers.)
↑ comment by drethelin · 2011-11-05T01:13:14.757Z · LW(p) · GW(p)
Strongly agree, though I don't think it's worth the effort to implement a new spoilers standard on the boards considering how rarely I see any posts containing spoilers AND how few people can naturally read rot13
Replies from: siodine↑ comment by siodine · 2011-11-05T03:45:14.385Z · LW(p) · GW(p)
Here's the code. Should work for lesswrong too, I think.
comment by HonoreDB · 2011-11-05T18:11:29.156Z · LW(p) · GW(p)
I've written something to your spec, tested it briefly in a couple browsers. It uses a bitwise not on the unicode values, basically the generalized version of rot13. It's technically still a substitution cipher, but due to the way high-value unicode tends to display it's going to be much more difficult to accidentally learn.
Replies from: HonoreDB, DanielVarga↑ comment by DanielVarga · 2011-11-06T14:54:42.351Z · LW(p) · GW(p)
That's very cool, thanks a lot! I think it has a chance to become the de facto standard solution to a very obscure problem, if it gets some momentum. :) I don't think the bitwise-not is the best choice, though. Right now I don't have any time to improve it, but I'll try to make others interested, if you don't mind.
Replies from: HonoreDB↑ comment by HonoreDB · 2011-11-08T23:48:20.657Z · LW(p) · GW(p)
Go ahead. Feel free to steal, modify, redistribute, etc. Also, I made a new version that enables passwords and maps printable ascii to printable ascii. You seem to have nerd-sniped me.
Replies from: DanielVarga↑ comment by DanielVarga · 2011-11-09T20:32:56.041Z · LW(p) · GW(p)
That's very-very cool. And I even learned a new phrase.
comment by dlthomas · 2011-11-05T02:08:01.879Z · LW(p) · GW(p)
It's been packaged on *NIX systems forever. On Debian and Ubuntu, it's in the package "bsdgames".
Like most traditional unix utilities, it reads from stdin and writes to stdout, so if you have xclip or xsel installed you can copy the text in question, then run
xsel -ob | rot13 | xsel -ib
and then paste the rot13'ed (or un-rot13'ed) text.
You could even bind it to a key combo without too much trouble.
comment by Incorrect · 2011-11-05T00:01:05.336Z · LW(p) · GW(p)
Many people can decrypt parts of rot13 ciphertexts without conscious effort.
How does one happen to acquire this skill?
Replies from: None, lessdazed↑ comment by lessdazed · 2011-11-05T03:36:39.574Z · LW(p) · GW(p)
People are differently good at linguistic skills. My experience in yeshiva demonstrated this to me. Aramaic is a language related to Hebrew, and in yeshiva it is very, very often read and never written or spoken, nor are new thoughts generated in it. Hebrew is used in Israel, the outside world for Americans learning in Israel, so it will be read frequently, and spoken and heard sometimes. English will be how Americans communicate with each other, even when learning things in Hebrew or Aramic. Yiddish is heard and spoken to various degrees in different places.
One guy at my yeshiva was, as a gimmick or parlor trick, able to speak fluently about mundane topics in (Babylonian Jewish) Aramaic without any paying any special attention to trying to learn to do it, in a way that seemed magical to me. This guy also learned to understand, read, speak, and write in Yiddish and Hebrew with apparently zero effort. In general, he wasn't very smart. Some non-Americans developed perfect understanding of spoken English without being able to speak much at all. Some native Hebrew speakers were able to read Aramaic after just a bit of practice, others learned much more slowly than even average English speakers, as the Hebrew speakers were confounded by the similarities amid differences. And, of course, most people had their separate skills in a language partially correlated instead of entirely correlated or uncorrelated, and their Aramaic and Hebrew skills loosely correlated.
The stereotypical case is of an illiterate person who can't read or write, making one think that those skills are inherently harder than listening or speaking. I don't think that any of the four skills is inherently the hardest or easiest for all humans.
I expect people's ability to learn rot13 to vary greatly.
Replies from: Bill_McGrath↑ comment by Bill_McGrath · 2011-11-05T09:29:00.725Z · LW(p) · GW(p)
A more lowbrow example:
The webcomic "Order of the Stick". for a while, had a character who was unable to speak intelligibly as the result of a curse. This character's speech bubbles were full of meaningless strings of letters. However, as I read it it became pretty clear that there was some kind of substitution going on. It was a simple substitution cipher, so it was possible with some effort to make sense of this character's dialogue. The cipher would change every few strips, and I'm not aware of any clue or key given within the comic itself.
Figuring out how to read this had a little to do with knowing the typical frequency of letters in English, but was probably more down to context (predicting what the character might say) and the fact that the length and capitalization of the words remained intact.
I just checked an old episode there, and was very quickly able to decipher the unintelligible dialogue, but this was almost entirely down to knowing the context. I remember when I initially read it, I more or less memorized the ciphers without specific effort.
Predicting what a rot13ed text might say could be a factor, and in my experience it was not particularly difficult to assimilate similar ciphers once I was reasonably confident my decryptions were correct.
comment by djcb · 2011-11-05T14:04:52.350Z · LW(p) · GW(p)
People using emacs to edit text boxes, for example when using It's all text! or org-protocol can use M-x rot13-region (after the text-to-be-rot13'ed is marked/selected).
Interestingly, there's also M-x toggle-rot13-mode, which, when enabled, translates the current window to rot13, including your input.
For non-ascii encoding, maybe base64 would be useful? M-x base64-encode-region and M-x base64-decode-region are your friends here. If people find substitution cyphers to easy tor read, the text could simply be reverted (M-x reverse-region). That should work well enough for most people...
Edit: quick hack for reversed base64:
(defun toggle-reverse-base64-region (b e)
"Instead of rot13, we can use reversed base64 to hide text. The
encoded text is prefixed with \"rb64:\"."
(interactive "r")
(let*
((str (buffer-substring b e))
(match (string-match "^rb64:\\(.*\\)" str))
(revstr (lambda (str) (concat (reverse (string-to-list str)))))
(code
(if match
(funcall revstr (base64-decode-string (match-string 1 str)))
(concat "rb64:" (base64-encode-string (funcall revstr str))))))
(kill-region b e)
(insert code)))
comment by MichaelHoward · 2011-11-05T13:46:15.929Z · LW(p) · GW(p)
Instead of the receiver pasting the ciphertext into the rot13.com textbox...
Make an easy-to-get-to bookmark in your browser, and paste this into where you'd normally put the link to the web site:
javascript:inText=window.getSelection()+'';if(inText=='')%7Bvoid(inText=prompt('Phrase...',''))%7D;if(!inText)%7BoutText='No%20text%20selected'%7Delse%7BoutText='';for(i=0;i%3CinText.length;i++)%7Bt=inText.charCodeAt(i);if((t%3E64&&t%3C78)%7C%7C(t%3E96&&t%3C110))%7Bt+=13%7Delse%7Bif((t%3E77&&t%3C91)%7C%7C(t%3E109&&t%3C123))%7Bt-=13%7D%7DoutText+=String.fromCharCode(t)%7D%7Dalert(outText)
Now, select the text you want to rot13 and click this bookmark.
I can't remember where I got this from, it's been in my browser for ages.
[Edit: someone added a different one below, so I guess you have a backup if either doesn't work on your browser].
Replies from: dbauppcomment by see · 2011-11-05T01:33:09.053Z · LW(p) · GW(p)
I'm personally much happier using a rot13 bookmarklet rather than having to use an external site to decode the spoiler text. If your alternative comes with a bookmarklet I can use to decode, I'm willing to use that; otherwise, I'd rather people stick with rot13 coding.
Replies from: dbaupp↑ comment by dbaupp · 2011-11-05T02:20:51.119Z · LW(p) · GW(p)
I agree that a bookmarklet is preferable to external applications. And in that vein, I've adapted this one so that it does in-place encoding/decoding both of normal text and text inside textboxes (like this: na rknzcyr).
javascript:function%20convert(inText){var%20outText='',t;for(i=0;i64&&t96&&t77&&t109&&t<123))t%20-=%2013;outText%20+=%20String.fromCharCode(t);}return%20outText;}var%20sel=window.getSelection().getRangeAt(0);if(sel==''){var%20act=document.activeElement;if(act.type=="textarea"){var%20start=act.selectionStart,end=act.selectionEnd;if(start!=end){var%20prefix=act.value.substring(0,start);var%20middle=convert(act.value.substring(start,end));var%20suffix=act.value.substring(end);act.value=prefix;act.value%20+=%20middle;void(act.value%20+=%20suffix);}}else{void(inText=prompt('Phrase...',''));if(inText)alert(convert(inText));}}else{out=convert(sel.toString());sel.deleteContents();var%20span=document.createElement("span");span.innerHTML=out;sel.insertNode(span);}
(I've only tested it on newer versions of Firefox and Chrome)
Replies from: seecomment by [deleted] · 2011-11-04T23:54:08.485Z · LW(p) · GW(p)
If you're going to be giving a link to another website anyway, then what's the point of the encoding?
Replies from: DanielVarga↑ comment by DanielVarga · 2011-11-05T00:11:51.748Z · LW(p) · GW(p)
If the other website stores the content, like pastebin.com or sketerpot's solution, then the content will be irreparably lost when the website goes offline. If the content is in the URL unencoded, then mouseover uncovers it.
Replies from: None↑ comment by [deleted] · 2011-11-05T01:59:19.497Z · LW(p) · GW(p)
Assuming I have my Markdown syntax correct, that can be avoided by providing alternate mouseover text.
Edit: it seems that this is a partial solution... my browser still shows the URL when I mouse over the link, but it's out of the way and easy to ignore.
comment by DanielLC · 2011-11-05T01:33:39.103Z · LW(p) · GW(p)
Does anyone want to volunteer for hosting it?
Why would it need to be a reciprocal cipher? From what I understand, you go to the website and type it to encode. You click a link to decode.
There's a significant advantage to it not being reciprocal if it's encoding non-standard characters. If it's reciprocal, it would have to be based on a set of all printable characters. If not, you could do whatever crazy stuff you want, then add the escape() command so what you end up with is printable.