Effective Altruism News
Effective Altruism News
- In my most recent post, I introduced a corrigibility transformation that could take an arbitrary goal over external environments and define a corrigible goal with no hit to performance. That post focused on corrigibility and deception in training, which are some of the biggest problems in AI alignment, but the underlying mechanism has broader applicability.
- AI policy and the unique advantages of individual donors
- Researchers rely on a wide range of tools throughout the research lifecycle for storing data, analyzing results, writing papers, and sharing outcomes. The OSF helps bring those tools into one place, making it easier to organize, share, and connect your work.
- For Pride, kind of. Are bisexuals canonically late? Probably.
- Sometimes working on animal issues feels like an uphill battle, with alternative protein losing its trendy status with VCs, corporate campaigns hitting blocks in enforcement and veganism being stuck at the same percentage it's been for decades.
- Don't have your self-image be as the emotionally-troubled, intellectually complex protagonist
- Just past the halfway point of 2025, Liz Wheeler offers an update to our supporters and fellow advocates about our latest research, newest resources, and what’s in store for the rest of the year. The post Faunalytics’ 2025 Mid-Year Report appeared first on Faunalytics.
- Just pretend you're someone else
- Large, charismatic mammals receive the bulk of attention and funding despite numerous other species facing a much greater risk of extinction. The post You Win Some, You Lose Many: Conservation Bias Fails The Most Vulnerable appeared first on Faunalytics.
- IUDs generate a lot of strong opinions. Why?
- Apply now! Mentorship for ambitious professionals | Job Blast 🚀 Apply by July 25 and get matched with a mentor that can transform your career for free ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏...
- Dr Estee Torok, Senior Program Officer in Surveillance, Data & Epidemiology in Malaria / Global Health at the Gates Foundation visited Target Malaria Uganda based at the Uganda Virus Research Institute. Dr Torok was welcomed by the Head of the Entomology Department and Principal Investigator of Target Malaria Project, Dr Jonathan Kayondo, who provided an […].
- What candidate truths do you find most disturbing?
- EA Forum Digest #249 Hello!. Next week is Career Conversations Week on the Forum! Consider writing about your job, or thinking about what you’d like to ask advisors from Probably Good, 80,000 Hours, Animal Advocacy Careers and Successif in our AMA next week (it'll be pinned on the front page on Monday).
- Achieving a 10-minute warning would save thousands of lives
- Open science practices are embedded throughout Amélie Godefroidt’s research, which explores public opinion during and after wars, civil conflicts, and terrorist attacks. As a Postdoctoral Researcher and Lecturer at the KU Leuven Centre for Research on Peace and Development in Belgium, she regularly engages with ethically and methodologically complex material and highly sensitive data, and...
- On the margins of the 2025 AI for Good Summit, the Simon Institute for Longterm Governance (SI) co-hosted a Geneva Security Debate on the…
- Late on Monday night, July 14, 2025, the ninth richest man in the world broke some momentous news: The US government would allow him, Nvidia CEO Jensen Huang, to sell H20 processors to Chinese customers again. To people following the Trump administration and its seemingly unending announcements and reversals of trade restrictions, this might not […]...
- That’s what this blog is about: the unexpected happiness that comes from helping others and making a difference in the world. And I’m not talking about just giving your attention to and being aware of the problems in the world, but giving your time and money, strategically and generously, to actually solve those problems for others.
- Climate and Nutrition Data Analysis Intern admin_inox Wed, 07/16/2025 - 10:30 vacancy_id SYS-1286 location London (UK), Delhi (IN) Contract type Intern Duration Other Frontend apply URL https://jobs.gainhealth.org/vacancies/1286/apply/ Closing date Wed, 07/23/2025 - 12:00 Department Programmes about_the_role <p>The Global Alliance for Improved Nutrition (GAIN) is seeking a...
- Originally submitted as a project for the AI Safety Fundamentals AI Governance course in April 2025; edited and somewhat expanded for publication on the EA Forum in July 2025. Thanks to Yip Fai Tse, Arturs Kanepajs, Max Taylor, Constance Li, Kevin Xia, Adrià Moret and Sam Tucker-Davis for advice and suggestions before and/or after the writing of this piece. Introduction.
- Newspaper announcements that give a taste of relationship breakdowns. The post Runaway wives in 1700s Pennsylvania appeared first on Otherwise.
- MSEP is a free, open-source platform for designing and simulating atomically precise nanomechanical systems — a tool for exploring the foundations of future physical technologies.
- At Greener by Default, you will be working towards the ambitious goal of completely transforming institutional foodservice - and the food system as a whole - by making plant-based food the default. We recognize that the fate of humans, animals, and ecosystems are bound together, and strive to create a food system that will allow all life on earth to flourish.
- This analysis provides an accessible reading of the EU’s new Codes of Practice for General-Purpose AI, which help model providers comply with the AI Act’s provisions. Our focus is on the Safety and Security Code. The post What’s in the European Union’s Codes of Practice for Governing General-Purpose AI? appeared first on The Future Society.
- Thanks to Neel Nanda and Helena Casademunt for feedback on a draft. In a previous post, I argued that some interpretability researchers should prioritize showcasing downstream applications of interpretability work; I call this type of research practical interpretability. How should we pick promising downstream applications for practical interpretability researchers to target?
- Twitter | Paper PDF. Seven years ago, OpenAI five had just been released, and many people in the AI safety community expected AIs to be opaque RL agents. Luckily, we ended up with reasoning models that speak their thoughts clearly enough for us to follow along (most of the time).
- How Trump is killing millions of people and why few people care
- Key Takeaways Ethical persuasion is a learnable skill. Persuasion is a skill like any other, and it is possible both to learn how to do...
- I spend a lot of time talking to philanthropists about how to donate, and one trend I have noticed, particularly with smart but new-to-grantmaking folks, is the trend below.
- Cause humility, no AI moratorium, Gavi needs funding View this email in your browser Hello! Our favourite links this month include: The US withdrew funding from Gavi, which vaccinates half of the world’s children.
- We're looking for an excellent Development Manager to join our remote team, starting September 2025. The post HLI is recruiting a Development Manager! appeared first on Happier Lives Institute.
- Guinea pigs are popular companion animals, but little is known about their welfare in homes. Researchers examined data from over 1,000 guardians to see what kind of care their guinea pigs are given. The post How Do We Have Healthy, Happy Guinea Pigs? appeared first on Faunalytics.
- The post Rebuilding after apocalypse: What 13 experts say about bouncing back appeared first on 80,000 Hours.
- On the margins of the 2025 AI for Good Summit, the Simon Institute for Longterm Governance (SI) organized an event on the International AI…
- Compassion in World Farming International is the leading international farm animal welfare charity, campaigning to improve the lives of millions of farm animals through advocacy, lobbying for legislative change, and positive engagement with the global food industry. Our established international Food Business programme aims to raise baseline standards for farm animals by securing commitments,...
- Thanks to support from Sightsavers and other organisations, millions of people in Senegal are no longer at risk from losing their sight to the eye disease.
- No one should have to suffer, no matter where they are. And if we can help more people with the same donation, then that’s the right thing to do. Watch Part I and Part II of The Skeptic in our profile now. Or check out our giving pledges, to give to the charities that can help others the most at gwwc.org/pledge...
- It'll be too slow and too late for the timeframes that we need to decarbonise.
- ✻[Perfect stillness]✻.
- Anna and Ed are co-first authors for this work. We’re presenting these results as a research update for a continuing body of work, which we hope will be interesting and useful for others working on related topics. TL;DR:
- Previously, we've shared a few higher-effort project proposals relating to AI control in particular. In this post, we'll share a whole host of less polished project proposals. All of these projects excite at least one Redwood researcher, and high-quality research on any of these problems seems pretty valuable. They differ widely in scope, area, and difficulty. Control.
- Tl;dr how can I improve my literature-review based posts? I write a fair number of blog posts that present the data from scientific papers. There’s a balancing act to this- too much detail and people bounce off, too little and I’m misleading people. I don’t even think I’m on the pareto frontier of this- probably … Continue reading "What do you Want out of Literature Reviews?"...
- Linkpost as I am mostly writing on substack now. TL;DR: NGOs have a unique advantage over for-profits in their ability to cooperate and build common goods, yet few fully leverage "ecosystem thinking.". This approach extends beyond an organization's direct impact to consider the entire field's health.
- This is a write-up of a brief investigation into shutdown resistance undertaken by the Google DeepMind interpretability team. TL;DR: Why do models sometimes resist shutdown? Are they ignoring instructions to pursue their own agenda – in this case, self-preservation? Or is there a more prosaic explanation?
- This post is a companion piece to a forthcoming paper. This work was done as part of MATS 7.0 & 7.1. Abstract. We explore how LLMs’ awareness of their own capabilities affects their ability to acquire resources, sandbag an evaluation, and escape AI control.
- The Houthis sank two ships last week, new powerful open source model released.
- Maximizing respect for others' self-regarding preferences
- There is an important consultation happening right now on revising EU animal welfare laws - covering cages, import standards, male chick culling, and welfare indicators. If you're an EU Citizen, I encourage you to take 15-30 minutes over the next few days to respond. Deadline: July 16th (midnight CET).
- It's surprisingly easy to be a much more ethical omnivore
- what universal human experiences are you missing?
- Animals aren’t just reactive — they form expectations and learn from experience. Just like in humans, this affects how they feel. The post More Than Instinct, Animals Have Expectations appeared first on Faunalytics.
- As the world’s leading animal photojournalism agency, We Animals (WA) advocates for animals through photojournalism. Our global investigations and stories expose our complex relationships with animals, create ethical and cultural shifts in society, and empower human capacity for compassion and change.
- Iceland Foods committed to eliminating eyestalk ablation and implementing pre-slaughter electric stunning across their own-label prawn range by the end of 2027. ICAW has been running a campaign against Iceland for several months inclusive of digital actions, in-person demonstrations (including 70 people gathering in London in May) and other pressure tactics. Only 3 retailers have yet to...
- Empirical AI security/safety projects across a variety of areas
- deliberate practice for portraiture
- Summary: I think organisations using Rethink Priorities’s (RP’s) mainline welfare ranges, at least Ambitious Impact (AIM), Animal Charity Evaluators (ACE), the Animal Welfare Fund (AWF), and RP, should consider effects on soil nematodes, mites, and springtails. I believe these are the driver of the overall effects of the vast majority of interventions.
- Plus, steganography and future superintelligences
- Greetings from a world where…...
- Since January I’ve applied to ~25 EA-aligned roles. Every listing attracted hundreds of candidates (one passed 1,200). It seems we already have a very deep bench of motivated, values-aligned people, yet orgs still run long, resource-heavy hiring rounds. That raises three things: Cost-effectiveness:
- Each line represents the trend in time horizon for one benchmark smoothed spline with s=0.2, k=1. Lines have a range on the y-axis equal to the range of task lengths actually in the benchmark (2nd% to 98th% quantiles). Summary: In the paper Measuring AI Ability to Complete Long Software Tasks (Kwa & West et al.
- The World Bank classifies countries into four income groups based on average income per person. This article explains how these groups are defined.
- As we mark one year since the launch of Mieux Donner, we wanted to share some reflections on our journey and our ongoing efforts to promote effective giving in France. Mieux Donner was founded through the Effective Incubation Programme by Ambitious Impact and Giving What We Can. TLDR: Prioritisation is important.
- The ASPCA is hiring a Senior Manager, Impact Measurement and Data Science to support the development and execution of evaluation strategies for various programs that aim to improve the lives of animals. We are looking for a collaborative and creative critical thinker, committed to the mission and values of the organization. Applications are due 7/20.
- #ai #alignment #aisafety #openai
- this week in security — july 13 edition CitixBleed 2 under attack, 'Hafnium' hacker arrested, Jack Dorsey's not-so-'secure' messaging app, Gemini accessing other Android apps, and more. ~this week in security~. a cybersecurity newsletter by @zackwhittaker volume 8, issue 28 View this email in your browser | past issues | RSS ~ ~ THIS WEEK, TL;DR. Everyone...
- Easy productivity advice
- Hardware is a huge part of the AI game right now - access to chips, the geopolitics of Taiwan - and it's because they need hundreds, then thousands, then tens of thousands and maybe millions more to train the next biggest models. #airisk #aiprogress #compute
- A parody
- Here, I talk about what America used to stand for, and how we are losing it.
- Key takeaway. We need less breadth and more depth. Much of the early growth and success of the movement at the intersection of AI, animals and digital minds has come from exchanging ideas, people and resources across these fields. We think we are now approaching a point where more specialisation and a greater focus on action will lead to the most valuable outcomes. Rather not read?
- On Tuesday, the TSA — a federal agency not known for its generosity — gave American travelers a gift: They will no longer have to take off their shoes when going through airport security. “I think most Americans will be very excited to see they will be able to keep their shoes on,” said Homeland […]...
- TLDR: On a VERY rough Sanity check , GiveWell’s New Incentives “lived saved” estimate seems twice as high as is plausible. On seeing this wonderful graph from a GiveWell Back-check of New Incentives, I thought to myself wow New Incentives saved 27,000 lives - that’s impressive but feels high. So I decided to spend a surprisingly fun 90 minutes of my life doing a back-check of a back-check.
- ⚠️ Découvrez du contenu EXCLUSIF (pas sur la chaîne) ⚠️ ⇒ https://the-flares.com/y/bonus/ ⬇️⬇️⬇️ Infos complémentaires : sources, références, liens... ⬇️⬇️⬇️ Le contenu vous intéresse ? Abonnez-vous et cliquez sur la 🔔 Vous avez aimé cette vidéo ? Pensez à mettre un 👍 et à la partager.
- On most questions about the future, I don’t hold a strong view. I read the aggregate prediction of forecasters on Metaculus or Manifold Markets and then I pretty much believe whatever it says. Various attempts have been made to forecast existential risk.
- An Annotated Guide to My AI Safety Satire
- Climate advocates must double down on pragmatic industrial strategies to make clean energy a winning business for all
- This seems like an important piece of work - an RCT on the use of AI capabilities for developers. The TL;DR of the paper is. The devil is in the details of course. This post by Steve Newman does a good job of working through some of them. I have highlighted some considerations from it: The methodology was as follows:
- A $50,000 first place prize for essays exploring consciousness
- Hi all!. We’re happy to share that Super Festval, a supermarket brand part of Grupo Beal (former “Companhia Beal de Alimentos’), has officially published a commitment to exclusively source pork from group housing systems during gestation in Brazil by 2028, considering preferably preimplantation systems where sows are housed in stalls for no longer than 7 days. You can read the announcement in...
- Thank you to @Jacintha Baas, @Judith Rensing, and the @CE team for your help in editing and improving this post. Introductory Context. Hi, I’m Trish. This is my first post on the EA forum.
- Finalist #3 in the Review Contest
- Transformer Weekly: SB 53’s revamp, Peter Kyle on AGI, and a movie about Ilya Sutskever...
- Just talk like a person!
- It is a truth universally acknowledged that a rationalist, in possession of a vague understanding of Bayes’ Theorem, must be in want of some woo.
- The welfare needs of Japanese quails are understudied compared to other farmed birds. What do we know, and what more can we learn?. The post Cage-Free Housing For Japanese Quails appeared first on Faunalytics.
- Check Your Dialectical Privilege
- no attempts
- Anders Sandberg joins me to discuss superintelligence and its profound implications for human psychology, markets, and governance. We talk about physical bottlenecks, tensions between the technosphere and the biosphere, and the long-term cultural and physical forces shaping civilization.
- From the beginning, Elon Musk has marketed Grok, the chatbot integrated into X, as the unwoke AI that would give it to you straight, unlike the competitors. But on X over the last year, Musk’s supporters have repeatedly complained of a problem: Grok is still left-leaning. Ask it if transgender women are women, and it […]...
- 7 charts about AI deployment
Loading...