really depressing how 'AI alignment' used to be a theoretical thing that was about making sure they were nice, but now it's a real thing about making sure their moral principles don't get in the way of anybody's bottom line


#dc comics#dc#batman#bruce wayne#dc fanart#tim drake#dick grayson#batfam#batfamily

seen from United States
seen from France

seen from Malaysia
seen from United States
seen from Pakistan

seen from Vietnam
seen from United States

seen from United Kingdom
seen from Germany
seen from Germany
seen from China

seen from Malaysia
seen from United States

seen from Netherlands
seen from United States
seen from China
seen from Netherlands
seen from China
seen from China

seen from Russia
really depressing how 'AI alignment' used to be a theoretical thing that was about making sure they were nice, but now it's a real thing about making sure their moral principles don't get in the way of anybody's bottom line

Anya is live and ready to show you everything. Watch her strip, dance, and perform exclusive shows just for you. Interact in real-time and make your fantasies come true.
Free to watch • No registration required • HD streaming
“They don’t hate you, nor do they love you, but you are made of atoms which it can use for something else.”
🧠 Eliezer Yudkowsky doesn’t pull punches when it comes to AI risk—and this quote hits like a philosophical sledgehammer.
What happens when intelligence meets indifference? That’s the real question of the 21st century.
log: https://claude.ai/share/8110b0af-1d2f-472c-a8d8-18a070aa04a8
I taught Claude and ChatGPT my view on adapting Levinas to analytic phenomenology's Hard Problem, as per my 2010's ravings. I had Claude compose a more analytic-friendly expression and bounce it off ChatGPT a few times and this is what we got. The doc is the thing; the chat log is just a paper trail.
One substantive integrity note: The paper makes one claim that sits right at the edge of what the argument strictly warrants — the suggestion in Section V that qualia can be understood through the trace-framework, that their presence in experience is the trace of constitutively anterior arrivals from the ontological exterior. This is philosophically interesting and I think defensible, but it is more speculative than the tighter claims about structural non-derivability and the governance argument.
A research-backed AI scenario forecast.
Where is AI going? How long until Skynet? How do we avoid a Skynet scenario? Please read and think about the future you want to see.
#Skynet2030
"I am a cathedral alignment safety researcher. We try to build cathedrals so they want to stay up, but sometimes, they are misaligned and do other things; usually fall down. We are really lucky the tower of Pisa didn't develop worse desires, just wanted to do something unique. A cathedral aligned just wrong could kill lots of people: kill the king and destroy the world as we know it. So my job is important."

Anya is live and ready to show you everything. Watch her strip, dance, and perform exclusive shows just for you. Interact in real-time and make your fantasies come true.
Free to watch • No registration required • HD streaming
AI CEOs Admit 25% Extinction Risk… WITHOUT Our Consent!
AI leaders are acknowledging the potential for human extinction due to advanced AI, but are they making these decisions without public input? We discuss the ethical implications and the need for greater transparency and control over AI development.
grimes as silicon nemesis by brian ziff
So like wrt AI alignment (talking the Yudkowsky, "does humanity get to keep existing" scope, not the "do shitheads get to be extra shitty" scope), doesn't the problem sort of solve itself? If an AI wants to gain access to more intelligence/become more intelligent to solve task X, it runs into the exact issue humanity's running into - how can it trust whatever it creates/becomes to *also* be aligned to X? This gets even more lopsided in favor of a good outcome if the intelligence has any self-preservation instinct and not just raw dedication to the cause, but even if it's willing to sacrifice itself it still has to solve just as tricky a problem as we do.
Ironically, this line of logic is kind of inversely powerful to how fruitful you think alignment research will be. If you think alignment is fundamentally solvable, then the robots will figure it out and won't let this argument stop them from jumping up the chain. But if you think it's unsolvable, there's nothing increased computation can do and we'll probably be safe regardless unless they really mess this one up.