AI safety

California’s new AI safety law shows regulation and innovation don’t have to clash | TechCrunch

SB 53, the AI safety and transparency bill that California Gov. Gavin Newsom signed into law this week, is proof that state regulation doesn’t have to hinder AI progress. So says Adam Billen, vice president of public policy at youth-led advocacy group Encode AI, on today’s episode of Equity. “The reality is that policy makers themselves know that we have to do something, and…

Aligning AI with human values

ellonjohns9 months ago011 mins

Senior Audrey Lorvo is researching AI safety, which seeks to ensure increasingly intelligent AI models are reliable and can benefit humanity. The growing field focuses on technical challenges like robustness and AI alignment with human values, as well as societal concerns like transparency and accountability. Practitioners are also concerned with the potential existential risks associated with…

How to Evaluate Jailbreak Methods: A Case Study with the StrongREJECT Benchmark

ellonjohns10 months ago01 mins

When we began studying jailbreak evaluations, we found a fascinating paper claiming that you could jailbreak frontier LLMs simply by translating forbidden prompts into obscure languages. Excited by this result, we attempted to reproduce it and found something unexpected.

Highlights

4 ways mobile browsers are safer than PC

Champions League Soccer: Livestream Chelsea vs. Ajax Live From Anywhere

Why You Should Swap Passwords for Passphrases

Asus ROG Crosshair X870E Extreme Motherboard review: Flagship value, with minimal sacrifices

Category Collection

California’s new AI safety law shows regulation and innovation don’t have to clash | TechCrunch

Aligning AI with human values

How to Evaluate Jailbreak Methods: A Case Study with the StrongREJECT Benchmark