Instapaper

bilal

ARENA 4.0 Impact Report — LessWrong

If you're interested in helping to run the ARENA program, note that we're currently hiring for an Operations Lead! For more details, and to apply, see here. Summary The purpose of this report is to ev

Nov 28, 2024

Want To Be An Expert? Build Deep Models — EA Forum

forum.effectivealtruism.org

This piece is crossposted on my blog. Good news! This post is shorter than it looks. There’s a bunch of supplemental info in endnotes if you want the extended edition director’s cut. But it’s much sh

Nov 27, 2024

Surviving an open-plan office

Like apparently every other startup in San Francisco, Theorem works in an open plan office. Not by choice. It was long ago that the barbarian hordes swept across Silicon Valley with battle cries of “less square footage per head!” and “increased collaboration!”, demolishing every interior wall in…

Nov 27, 2024

The correct response to uncertainty is *not* half-speed — LessWrong

Once upon a time (true story), I was on my way to a hotel in a new city. I knew the hotel was many miles down this long, branchless road. So I drove for a long while. After a while, I began to worry I

Nov 27, 2024

Potential employees have a unique lever to influence the behaviors of AI labs — EA Forum

forum.effectivealtruism.org

People who have received and are considering an offer from an AI lab are in a uniquely good spot to influence the actions of that lab. People who care about AI safety and alignment often have things t

Nov 27, 2024

Leveraging academia | Andrew Critch

Since a lot of interest in AI alignment has started to build, I’m getting a lot more emails of the form “Hey, how can I get into this hot new field?”. This is great. In the past I was getting so few m

Nov 26, 2024

Research Taste Exercises [rough note]

colah.github.io

Posted on Jan 9, 2021 This article is a rough note. Writing rough notes allows me to share more content, since polishing takes lots of time. While I hope it's useful, it's likely lower quality and less c

Nov 26, 2024

♟️ why i'm considering a PhD

This is actually a revised copy of something i wrote on March 7th 2021, before i started my PhD. It had two purposes at the time: 1) it clarified my ideas about pursuing a PhD and 2) acted as a manife

Nov 26, 2024

Deliberate Grad School | Andrew Critch

Among my friends interested in rationality, effective altruism, and existential risk reduction, I often hear: “If you want to have a real positive impact on the world, grad school is a waste of time.

Nov 26, 2024

What to care about in a job

Someone recently asked me if I had any updates on my 2014 post on job hunting. I haven’t done a systematic job search since writing that post (I decided to join Wave without considering any other offe

Nov 25, 2024

FAQ: Advice for AI alignment researchers – Rohin Shah

Consider reading How to pursue a career in technical AI alignment. It covers more topics and has more details, and I endorse most if not all of the advice. To quote Andrew Critch: I get a lot of email

Nov 25, 2024

Keep Your Identity Small

February 2009 I finally realized today why politics and religion yield such uniquely useless discussions. As a rule, any mention of religion on an online forum degenerates into a religious argument. W

Nov 24, 2024

AI x-risk reduction: why I chose academia over industry — LessWrong

I've been leaning towards a career in academia for >3 years, and recently got a tenure track role at Cambridge. This post sketches out my reasoning for preferring academia over industry. Thoughts on I

Nov 24, 2024

Salary Negotiation: Make More Money, Be More Valued | Kalzumeus Software

[Editor’s note: At nearly 7,000 words, you probably don’t want to try reading this on an iDevice. Bookmark it and come back later.] Imagine something a wee bit outside your comfort zone. Nothing scand

Nov 22, 2024

OK, I can partly explain the LLM chess weirdness now

We recently talked about a mystery: All large language models (LLMs) are terrible at chess. All, that is, except for gpt-3.5-turbo-instruct, which for some reason can play at an advanced amateur level

Nov 22, 2024

Harry Potter and the Methods of Rationality, Chapter 113: Final Exam

The gibbous moon riding higher in the cloudless sky, the stars and wash of the Milky Way visible in all their majesty within the darkness: All these illuminated thirty-seven skull masks gleaming above

Nov 22, 2024

Harry Potter and the Methods of Rationality, Chapter 112: Failure, Pt 2

Even as Harry had raised the gun, he'd known he was making a mistake, his forebrain saw it and tried to stop his hand, but somehow the sick certainty didn't propagate fast enough to prevent his finger

Nov 21, 2024

Harry Potter and the Methods of Rationality, Chapter 111: Failure, Pt 1

The Dark Lord was laughing. From the empty air came the voice of the Defense Professor laughing wildly, so high and terrible his laughter; it was Voldemort's laughter now, the Dark Lord's laughter bey

Nov 21, 2024

Harry Potter and the Methods of Rationality, Chapter 110: Reflections, Pt 2

The grimness on Albus Dumbledore's face lasted only an instant before giving way to bewilderment. "Quirinus? What -" And then there was a pause. "Well," said Albus Dumbledore. "I do feel stupid." "I s

Nov 20, 2024

Harry Potter and the Methods of Rationality, Chapter 109: Reflections

Even the greatest artifact can be defeated by a counter-artifact that is lesser, but specialized. That was what the Defense Professor had told Harry, after dropping the True Cloak of Invisibility to p

Nov 19, 2024

Consider tabooing "I think" — LessWrong

People say "I think" a lot. Here are some examples: I think you brought me the wrong order. I think the numbers in the report are wrong. I think you need to turn left at the light. I think we need to rep

Nov 19, 2024

Harry Potter and the Methods of Rationality, Chapter 108: The Truth, Pt 5, Answers and Riddles

The Defense Professor had set up a cauldron, floating it into place with a wave of his wand, another wave starting a fire beneath it. A brief circling of the Defense Professor's finger had set in moti

Nov 19, 2024

Using Dangerous AI, But Safely?

Can we keep powerful AI under control, using AI? The Paper: https://arxiv.org/abs/2312.06942 AI Lab Watch: https://ailabwatch.org/ Thanks to my wonderful patrons: https://www.patreon.com/robertskmiles

Nov 19, 2024

Gwern Branwen - How an Anonymous Researcher Predicted AI's Trajectory - YouTube

Gwern's blog: https://gwern.net/ Gwern is a pseudonymous researcher and writer. After the episode, I convinced Gwern to create a donation page where people can help sustain what he's up to. Please go

Nov 19, 2024

A Survival Guide to a PhD

karpathy.github.io

This guide is patterned after my “Doing well in your courses”, a post I wrote a long time ago on some of the tips/tricks I’ve developed during my undergrad. I’ve received nice comments about that guid

Nov 18, 2024

LessWrong's (first) album: I Have Been A Good Bing — LessWrong

Let me not hold on, me hearties

Nov 17, 2024

Harry Potter and the Methods of Rationality, Chapter 107: The Truth, Pt 4

The spiraling leaves of the gigantic dieffenbachia felt like forest loam beneath Harry's shoes, not as unyielding as concrete, but supporting his weight. Harry kept a wary eye on the tendrils, but the

Nov 16, 2024

Harry Potter and the Methods of Rationality, Chapter 106: The Truth, Pt 3

After a single step into Dumbledore's forbidden chamber, Harry shrieked and jumped back and collided with Professor Snape, sending the two of them down in a heap. Professor Snape picked himself up and

Nov 16, 2024

Harry Potter and the Methods of Rationality, Chapter 105: The Truth, Pt 2

Tom Riddle. The words seemed to echo inside Harry's head, sparking resonances that as quickly died away, broken patterns trying to complete themselves and failing. Tom Riddle is a Tom Riddle was the R

Nov 16, 2024

Harry Potter and the Methods of Rationality, Chapter 104: The Truth, Pt 1, Riddles and Answers

June 13th, 1992. It was the last week of school in Hogwarts, and Professor Quirrell was still alive, barely. The Defense Professor himself would be in a healer's bed, this day, as he'd been for almost

Nov 15, 2024

Attitudes one can take towards people who have behaved badly

dynomight.substack.com

Have you ever noticed that reality has some properties that are quite annoying? For example, have you noticed that some people do bad things? And yet those same people sometimes have interesting ideas

Nov 14, 2024

Something weird is happening with LLMs and chess

A year ago, there was a lot of talk about large language models (LLMs) playing chess. Word was that if you trained a big enough model on enough text, then you could send it a partially played game, as

Nov 14, 2024

What failure looks like — LessWrong

The stereotyped image of AI catastrophe is a powerful, malicious AI system that takes its creators by surprise and quickly achieves a decisive advantage over the rest of humanity. I think this is prob

Nov 14, 2024

You don't need to work on hard problems

College, 2012—Internship recruiting season. “What are you looking for in your internship?” the recruiter asks. “I’d like to solve hard technical problems,” I reply. I end up at Jane Street writing sof

Nov 14, 2024

Harry Potter and the Methods of Rationality, Chapter 103: Tests

June 4th, 1992. Daphne Greengrass was in the Slytherin common room, writing a letter to her Lady Mother (who was surprisingly intransigent about power-sharing, despite not even being in Hogwarts to ex

Nov 14, 2024

Harry Potter and the Methods of Rationality, Chapter 102: Caring

June 3rd, 1992. Professor Quirrell was very sick. He'd seemed better for a while, after drinking his unicorn's blood in May, but the air of intense power which had surrounded him afterward hadn't last

Nov 14, 2024

Harry Potter and the Methods of Rationality, Chapter 101: Precautionary Measures, Pt 2

Harry stood, panting, in the midst of a brief wasted circle amid the forest, more destruction than a first-year should have been able to reach, by himself. The Severing Charm wouldn't bring down a tre

Nov 13, 2024

Harry Potter and the Methods of Rationality, Chapter 100: Precautionary Measures, Pt 1

May 13th, 1992. Argus Filch's face appeared twisted in the light of the oil lamp he held, shadows dancing over his face. Behind them the doors of Hogwarts quickly receded, and the dark grounds moved c

Nov 13, 2024

How To Go From Interpretability To Alignment: Just Retarget The Search — LessWrong

[EDIT: Many people who read this post were very confused about some things, which I later explained in What’s General-Purpose Search, And Why Might We Expect To See It In Trained ML Systems? You might

Nov 13, 2024

How useful is mechanistic interpretability? — LessWrong

Opening positions ryan_greenblatt I'm somewhat skeptical about mech interp (bottom-up or substantial reverse engineering style interp): Current work seems very far from being useful (it isn't currentl

Nov 13, 2024
