The race to replace the powerful greenhouse gas that underpins the power grid

The power grid is underpinned by a single gas that is used to insulate a range of high-voltage equipment. The problem is, it’s also a super powerful greenhouse gas, a nightmare for climate change.

Sulfur hexafluoride (or SF6) is far from the most common gas that warms the planet, contributing around 1% of warming to date—carbon dioxide and methane are far better known and more abundant. However, like many other fluorinated gases, SF6 is especially potent: It traps about 20,000 times more energy than carbon dioxide does over the course of a century, and it can last in the atmosphere for 1,000 years or more.

Despite its relatively small contribution so far, emissions of the gas are ticking up, and the growth rate has been climbing every year. SF6 emissions in China nearly doubled between 2011 and 2021 and now account for more than half the world’s emissions of the gas.

Now, companies are looking to do away with equipment that relies on the gas and searching for replacements that can match its performance. Last week, Hitachi Energy announced it’s producing new equipment that replaces SF6 with other materials. And there’s momentum building to ban SF6 in the power industry, including a recently passed plan in the European Union that will phase out the gas’s use in high-voltage equipment by 2032. 

As equipment manufacturers work to produce alternatives, some researchers say that we should go even further and are trying to find solutions that avoid fluorine-containing materials entirely.

High voltage, high stakes

You probably have a circuit-breaker box in your home—if a circuit gets overloaded, the breaker flips, stopping the flow of electricity. The power grid has something similar, called switchgear.  

The difference is, it often needs to handle something like a million times more energy than your home’s equipment does, says Markus Heimbach, executive vice president and managing director of the high-voltage products business unit at Hitachi Energy. That’s because parts of the power grid operate at high voltages, allowing them to move energy around while losing as little as possible. Those high voltages require careful insulation at all times and safety measures in case something goes wrong.

Some switchgear uses the same approach as your home circuit-breaker box: the air around it acts as insulation. But when it’s scaled up to handle high voltage, the equipment ends up gigantic and requires a large land footprint, making it inconvenient for larger, denser cities.

The solution today is SF6, “a super gas, from a technology point of view,” Heimbach says. It’s able to insulate equipment during normal operation and help interrupt current when needed. And the whole thing has a much smaller footprint than air-insulated equipment.

The problem is, small amounts of SF6 leak out of equipment during normal operation, and more can be released during a failure or when old equipment isn’t handled properly. When the gas escapes, its strong ability to trap heat and its long lifetime make it a menace in the atmosphere.

Some governments will soon ban the gas for the power industry, which makes up the vast majority of the emissions. The European Union agreed to ban SF6-containing medium-voltage switchgear by 2030, and high-voltage switchgear that uses the gas by 2032. Several states in the US have proposed or adopted limits and phaseouts.

Making changes 

Hitachi Energy recently announced it’s producing high-voltage switchgear that can handle up to 550 kilovolts (kV). The model follows products rated for 420 kV that the company began installing in 2023—more than 250 have been booked by customers to date, Heimbach says.

Hitachi Energy’s new switchgear replaces SF6 with a gas mixture that contains mostly carbon dioxide and oxygen. It performs as well as SF6 and is just as safe and reliable, but it has a much lower global warming potential, trapping 99% less energy in the atmosphere, Heimbach says.

However, for some of its new equipment, Hitachi Energy still uses some C4-fluoronitriles, which help with insulation, Heimbach says. These gases make up a small fraction of the mixture—less than 5%—and they’re less potent than SF6, Heimbach says. But C4-fluoronitriles are still powerful greenhouse gases, up to a few thousand times more potent than carbon dioxide. These and other fluorinated substances could soon be in trouble too—chemical giant 3M announced in late 2022 that it would stop manufacturing all fluoropolymers, fluorinated fluids, and PFAS-based additive products by 2025.

In order to eliminate the need for fluorine-containing gases, some researchers are looking into the grid’s past for alternatives. “We know that there’s no one-for-one replacement gas that has the properties of SF6,” says Lukas Graber, an associate professor in electrical engineering at Georgia Institute of Technology.

SF6 is both extremely stable and extremely electronegative, meaning it tends to grab onto free electrons, and nothing else can quite match it, Graber says. So he’s working on a research project that aims to replace SF6 gas with supercritical carbon dioxide. (Supercritical fluids are those at temperatures and pressures so high that distinct liquid and gas phases don’t quite exist.) The inspiration came from equipment that used to use oil-based materials—instead of trying to grab electrons like SF6, supercritical carbon dioxide can basically slow them down.

Graber and his research team received project funding from the US Department of Energy’s Advanced Research Projects Agency for Energy. The first small-scale prototype is nearly finished, he adds, and the plan is to test out a full-scale prototype in 2025.

Utilities are known for being conservative, since the safety and reliability of the electrical grid have high stakes, Hitachi Energy’s Heimbach says. But with more SF6 bans coming, they’ll need to find and adopt solutions that don’t rely on the gas.

How “personhood credentials” could help prove you’re a human online

As AI models become better at mimicking human behavior, it’s becoming increasingly difficult to distinguish between real human internet users and sophisticated systems imitating them. 

That’s a real problem when those systems are deployed for nefarious ends like spreading misinformation or conducting fraud, and it makes it a lot harder to trust what you encounter online.

A group of 32 researchers from institutions including OpenAI, Microsoft, MIT, and Harvard has developed a potential solution—a verification concept called “personhood credentials.” These credentials prove that their holder is a real person, without revealing any further information about the person’s identity. The team explored the idea in a non-peer-reviewed paper posted to the arXiv preprint server earlier this month.

Personhood credentials rely on the fact that AI systems still cannot bypass state-of-the-art cryptographic systems or pass as people in the offline, real world. 

To request such credentials, people would have to physically go to one of a number of issuers, like a government or some other kind of trusted organization. They would be asked to provide evidence of being a real human, such as a passport or biometric data. Once approved, they’d receive a single credential to store on their devices the way it’s currently possible to store credit and debit cards in smartphones’ wallet apps.

To use these credentials online, a user could present them to a third-party digital service provider, which could then verify them using a cryptographic protocol called a zero-knowledge proof. That would confirm the holder was in possession of a personhood credential without disclosing any further unnecessary information.
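To make the idea of proving possession without disclosure concrete, here is a minimal, hypothetical Python sketch of a Schnorr-style zero-knowledge proof of knowledge—one classic building block for checks like this. It is not the scheme from the researchers’ paper: the tiny parameters and function names are invented for illustration, and a real credential system would use audited cryptographic libraries, much larger groups or elliptic curves, and would also bind the proof to a trusted issuer’s endorsement.

```python
# Toy Schnorr-style zero-knowledge proof: the credential holder proves
# knowledge of a secret bound to a public value, without revealing the secret.
# Illustrative parameters only -- real systems use large groups or elliptic
# curves and audited libraries, never tiny hand-picked primes like these.
import secrets

P = 2039   # small safe prime (p = 2q + 1), toy-sized for readability
Q = 1019   # prime order of the subgroup we work in
G = 4      # generator of the order-Q subgroup mod P

def keygen():
    """Holder's secret x and public value y = g^x mod p (stored with the credential)."""
    x = secrets.randbelow(Q - 1) + 1
    y = pow(G, x, P)
    return x, y

def prove(x, challenge_fn):
    """Holder's side: commit, receive a challenge, respond."""
    r = secrets.randbelow(Q - 1) + 1
    t = pow(G, r, P)        # commitment
    c = challenge_fn(t)     # verifier-chosen challenge
    s = (r + c * x) % Q     # response; reveals nothing about x on its own
    return t, c, s

def verify(y, t, c, s):
    """Verifier's side: check g^s == t * y^c (mod p) without ever learning x."""
    return pow(G, s, P) == (t * pow(y, c, P)) % P

# Demo: the service provider learns only that the holder knows the secret
# behind the credential's public value -- nothing else about their identity.
secret_key, credential_public = keygen()
t, c, s = prove(secret_key, challenge_fn=lambda _t: secrets.randbelow(Q))
print(verify(credential_public, t, c, s))   # True
```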

The ability to filter out anyone other than verified humans on a platform could be useful in many ways. People could reject Tinder matches that don’t come with personhood credentials, for example, or choose not to see anything on social media that wasn’t definitely posted by a person. 

The authors want to encourage governments, companies, and standards bodies to consider adopting such a system in the future to prevent AI deception from ballooning out of our control. 

“AI is everywhere. There will be many issues, many problems, and many solutions,” says Tobin South, a PhD student at MIT who worked on the project. “Our goal is not to prescribe this to the world, but to open the conversation about why we need this and how it could be done.”

Possible technical options already exist. For example, a network called Idena claims to be the first blockchain proof-of-person system. It works by getting humans to solve puzzles that would be difficult for bots within a short time frame. The controversial Worldcoin program, which collects users’ biometric data, bills itself as the world’s largest privacy-preserving human identity and financial network. It recently partnered with the Malaysian government to provide proof of humanness online by scanning users’ irises to create a code. As in the concept of personhood credentials, each code is protected using cryptography.

However, the project has been criticized for using deceptive marketing practices, collecting more personal data than acknowledged, and failing to obtain meaningful consent from users. Regulators in Hong Kong and Spain banned Worldcoin from operating earlier this year, while its operations have been suspended in countries including Brazil, Kenya, and India. 

So fresh concepts are still needed. The rapid rise of accessible AI tools has ushered in a dangerous period in which internet users are hyper-suspicious about what is and isn’t true online, says Henry Ajder, an expert on AI and deepfakes who is an advisor to Meta and the UK government. And while ideas for verifying personhood have been around for some time, these credentials feel like one of the most substantive ideas for how to push back against encroaching skepticism, he says.

But the biggest challenge the credentials will face is getting enough platforms, digital services, and governments to adopt them, since they may feel uncomfortable conforming to a standard they don’t control. “For this to work effectively, it would have to be something which is universally adopted,” he says. “In principle the technology is quite compelling, but in practice and the messy world of humans and institutions, I think there would be quite a lot of resistance.”

Martin Tschammer, head of security at the startup Synthesia, which creates AI-generated hyperrealistic deepfakes, says he agrees with the principle driving personhood credentials: the need to verify humans online. However, he is unsure whether it’s the right solution or whether it would be practical to implement. He also expresses skepticism over who would run such a scheme.  

“We may end up in a world in which we centralize even more power and concentrate decision-making over our digital lives, giving large internet platforms even more ownership over who can exist online and for what purpose,” he says. “And given the lackluster performance of some governments in adopting digital services, and autocratic tendencies that are on the rise, is it practical or realistic to expect this type of technology to be adopted en masse and in a responsible way by the end of this decade?” 

Rather than waiting for collaboration across industries, Synthesia is currently evaluating how to integrate other personhood-proving mechanisms into its products. Tschammer says the company already has several measures in place. For example, it requires businesses to prove that they are legitimate registered companies, and will ban and refuse refunds to customers found to have broken its rules.

One thing is clear: We are in urgent need of ways to differentiate humans from bots, and encouraging discussions between stakeholders in the tech and policy worlds is a step in the right direction, says Emilio Ferrara, a professor of computer science at the University of Southern California, who was not involved in the project. 

“We’re not far from a future where, if things remain unchecked, we’re going to be essentially unable to tell apart interactions that we have online with other humans or some kind of bots. Something has to be done,” he says. “We can’t be naïve as previous generations were with technologies.”

AI’s impact on elections is being overblown

This year, close to half the world’s population has the opportunity to participate in an election. And according to a steady stream of pundits, institutions, academics, and news organizations, there’s a major new threat to the integrity of those elections: artificial intelligence. 

The earliest predictions warned that a new AI-powered world was, apparently, propelling us toward a “tech-enabled Armageddon” where “elections get screwed up”, and that “anybody who’s not worried [was] not paying attention.” The internet is full of doom-laden stories proclaiming that AI-generated deepfakes will mislead and influence voters, as well as enabling new forms of personalized and targeted political advertising. Though such claims are concerning, it is critical to look at the evidence. With a substantial number of this year’s elections concluded, it is a good time to ask how accurate these assessments have been so far. The preliminary answer seems to be not very; early alarmist claims about AI and elections appear to have been blown out of proportion.

While there will be more elections this year where AI could have an effect, the United States being one likely to attract particular attention, the trend observed thus far is unlikely to change. AI is being used to try to influence electoral processes, but these efforts have not been fruitful. Commenting on the upcoming US election, Meta’s latest Adversarial Threat Report acknowledged that AI was being used to meddle—for example, by Russia-based operations—but that “GenAI-powered tactics provide only incremental productivity and content-generation gains” to such “threat actors.” This echoes comments from the company’s president of global affairs, Nick Clegg, who earlier this year stated that “it is striking how little these tools have been used on a systematic basis to really try to subvert and disrupt the elections.”

Far from being dominated by AI-enabled catastrophes, this election “super year” has so far been pretty much like every other election year.

While Meta has a vested interest in minimizing AI’s alleged impact on elections, it is not alone. Similar findings were also reported by the UK’s respected Alan Turing Institute in May. Researchers there studied more than 100 national elections held since 2023 and found “just 19 were identified to show AI interference.” Furthermore, the evidence did not demonstrate any “clear signs of significant changes in election results compared to the expected performance of political candidates from polling data.”

This all raises a question: Why were these initial speculations about AI-enabled electoral interference so off, and what does it tell us about the future of our democracies? The short answer: Because they ignored decades of research on the limited influence of mass persuasion campaigns, the complex determinants of voting behaviors, and the indirect and human-mediated causal role of technology. 

First, mass persuasion is notoriously challenging. AI tools may facilitate persuasion, but other factors are critical. When presented with new information, people generally update their beliefs accordingly; yet even in the best conditions, such updating is often minimal and rarely translates into behavioral change. Though political parties and other groups invest colossal sums to influence voters, evidence suggests that most forms of political persuasion have very small effects at best. And in most high-stakes events, such as national elections, a multitude of factors are at play, diminishing the effect of any single persuasion attempt.

Second, for a piece of content to be influential, it must first reach its intended audience. But today, a tsunami of information is published daily by individuals, political campaigns, news organizations, and others. Consequently, AI-generated material, like any other content, faces significant challenges in cutting through the noise and reaching its target audience. Some political strategists in the United States have also argued that the overuse of AI-generated content might make people simply tune out, further reducing the reach of manipulative AI content. Even if a piece of such content does reach a significant number of potential voters, it will probably not succeed in influencing enough of them to alter election results.

Third, emerging research challenges the idea that using AI to microtarget people and sway their voting behavior works as well as initially feared. Voters seem to not only recognize excessively tailored messages but actively dislike them. According to some recent studies, the persuasive effects of AI are also, at least for now, vastly overstated. This is likely to remain the case, as ever-larger AI-based systems do not automatically translate to better persuasion. Political campaigns seem to have recognized this too. If you speak to campaign professionals, they will readily admit that they are using AI, but mainly to optimize “mundane” tasks such as fundraising, get-out-the-vote efforts, and overall campaign operations rather than to generate novel, highly tailored content.

Fourth, voting behavior is shaped by a complex nexus of factors. These include gender, age, class, values, identities, and socialization. Information, regardless of its veracity or origin—whether made by an AI or a human—often plays a secondary role in this process. This is because the consumption and acceptance of information are contingent on preexisting factors, like whether it chimes with the person’s political leanings or values, rather than whether that piece of content happens to be generated by AI.

Concerns about AI and democracy, and particularly elections, are warranted. The use of AI can perpetuate and amplify existing social inequalities or reduce the diversity of perspectives individuals are exposed to. The harassment and abuse of female politicians with the help of AI is deplorable. And the perception, partially co-created by media coverage, that AI has significant effects could itself be enough to diminish trust in democratic processes and sources of reliable information, and weaken the acceptance of election results. None of this is good for democracy and elections. 

However, these points should not make us lose sight of threats to democracy and elections that have nothing to do with technology: mass voter disenfranchisement; intimidation of election officials, candidates, and voters; attacks on journalists and politicians; the hollowing out of checks and balances; politicians peddling falsehoods; and various forms of state oppression (including restrictions on freedom of speech, press freedom and the right to protest). 

Of at least 73 countries holding elections this year, only 47 are classified as full (or at least flawed) democracies, according to the Economist Democracy Index data published by Our World in Data, with the rest being hybrid or authoritarian regimes. In countries where elections are not even free or fair, and where political choice that leads to real change is an illusion, people arguably have bigger fish to fry.

And still, technology—including AI—often becomes a convenient scapegoat, singled out by politicians and public intellectuals as one of the major ills befalling democratic life. Earlier this year, Swiss president Viola Amherd warned at the World Economic Forum in Davos, Switzerland, that “advances in artificial intelligence allow … false information to seem ever more credible” and present a threat to trust. Pope Francis, too, warned that fake news could be legitimized through AI. US Deputy Attorney General Lisa Monaco said that AI could supercharge mis- and disinformation and incite violence at elections. This August, the mayor of London, Sadiq Khan, called for a review of the UK’s Online Safety Act after far-right riots across the country, arguing that “the way the algorithms work, the way that misinformation can spread very quickly and disinformation … that’s a cause to be concerned. We’ve seen a direct consequence of this.”

The motivations to blame technology are plenty and not necessarily irrational. For some politicians, it can be easier to point fingers at AI than to face scrutiny or commit to improving democratic institutions that could hold them accountable. For others, attempting to “fix the technology” can seem more appealing than addressing some of the fundamental issues that threaten democratic life. Wanting to speak to the zeitgeist might play a role, too.

Yet we should remember that there’s a cost to overreaction based on ill-founded assumptions, especially when other critical issues go unaddressed. Overly alarmist narratives about AI’s presumed effects on democracy risk fueling distrust and sowing confusion among the public—potentially further eroding already low levels of trust in reliable news and institutions in many countries. One point often raised in the context of these discussions is the need for facts. People argue that we cannot have democracy without facts and a shared reality. That is true. But we cannot bang on about needing a discussion rooted in facts when evidence against the narrative of AI turbocharging democratic and electoral doom is all too easily dismissed. Democracy is under threat, but our obsession with AI’s supposed impact is unlikely to make things better—and could even make them worse when it leads us to focus solely on the shiny new thing while distracting us from the more lasting problems that imperil democracies around the world. 

Felix M. Simon is a research fellow in AI and News at the Reuters Institute for the Study of Journalism; Keegan McBride is an assistant professor in AI, government, and policy at the Oxford Internet Institute; Sacha Altay is a research fellow in the department of political science at the University of Zurich.

Here’s how ed-tech companies are pitching AI to teachers

This story is from The Algorithm, our weekly newsletter on AI. To get it in your inbox first, sign up here.

This back-to-school season marks the third year in which AI models like ChatGPT will be used by thousands of students around the globe (among them my nephews, who tell me with glee each time they ace an assignment using AI). A top concern among educators remains that when students use such models to write essays or come up with ideas for projects, they miss out on the hard and focused thinking that builds creative reasoning skills. 

But this year, more and more educational technology companies are pitching schools on a different use of AI. Rather than scrambling to tamp down the use of it in the classroom, these companies are coaching teachers how to use AI tools to cut down on time they spend on tasks like grading, providing feedback to students, or planning lessons. They’re positioning AI as a teacher’s ultimate time saver. 

One company, called Magic School, says its AI tools like quiz generators and text summarizers are used by 2.5 million educators. Khan Academy offers a digital tutor called Khanmigo, which it bills to teachers as “your free, AI-powered teaching assistant.” Teachers can use it to assist students in subjects ranging from coding to humanities. Writing coaches like Pressto help teachers provide feedback on student essays.  

The pitches from ed-tech companies often cite a 2020 report from McKinsey and Microsoft, which found teachers work an average of 50 hours per week. Many of those hours, according to the report, consist of “late nights marking papers, preparing lesson plans, or filling out endless paperwork.” The authors suggested that embracing AI tools could save teachers 13 hours per week. 

Companies aren’t the only ones making this pitch. Educators and policymakers have also spent the last year pushing for AI in the classroom. Education departments in South Korea, Japan, Singapore, and US states like North Carolina and Colorado have issued guidance for how teachers can positively and safely incorporate AI. 

But when it comes to how willing teachers are to turn over some of their responsibilities to an AI model, the answer really depends on the task, according to Leon Furze, an educator and PhD candidate at Deakin University who studies the impact of generative AI on writing instruction and education.

“We know from plenty of research that teacher workload actually comes from data collection and analysis, reporting, and communications,” he says. “Those are all areas where AI can help.”

Then there are a host of not-so-menial tasks that teachers are more skeptical AI can excel at. They often come down to two core teaching responsibilities: lesson planning and grading. Many companies offer large language models that they say can generate lesson plans conforming to different curriculum standards. Some teachers, including in some California districts, have also used AI models to grade essays and provide feedback on them. For these applications of AI, Furze says, many of the teachers he works with are less confident in its reliability.

When companies promise time savings for planning and grading, it is “a huge red flag,” he says, because “those are core parts of the profession.” He adds, “Lesson planning is—or should be—thoughtful, creative, even fun.” Automated feedback on creative skills like writing is controversial too: “Students want feedback from humans, and assessment is a way for teachers to get to know students. Some feedback can be automated, but not all.” 

So how eager are teachers to adopt AI to save time? In May, a Pew Research Center poll found that only 6% of teachers think AI provides more benefit than harm in education. But with AI changing faster than ever, this school year might be when ed-tech companies start to win them over.

Now read the rest of The Algorithm


Deeper learning

How machine learning is helping us probe the secret names of animals

Until now, only humans, dolphins, elephants, and probably parrots had been known to use specific sounds to call out to other individuals. But now, researchers armed with audio recorders and pattern-recognition software are making unexpected discoveries about the secrets of animal names—at least with small monkeys called marmosets. They’ve found that the animals will adjust the sounds they make in a way that’s specific to whoever they’re “conversing” with at the time.  

Why this matters: In years past, it’s been argued that human language is unique and that animals lack both the brains and vocal apparatus to converse. But there’s growing evidence that isn’t the case, especially now that the use of names has been found in at least four distantly related species. Read more from Antonio Regalado.

Bits and bytes

How will AI change the future of sex? 

Porn and real-life sex affect each other in a loop. If people become accustomed to getting exactly what they want from erotic media, this could further affect their expectations of relationships. (MIT Technology Review)

There’s a new way to build neural networks that could make AI more understandable

The new method, studied in detail by a group led by researchers at MIT, could make it easier to understand why neural networks produce certain outputs, help verify their decisions, and even probe for bias. (MIT Technology Review)

Researchers built an “AI scientist.” What can it do?

The large language model does everything from reading the literature to writing and reviewing its own papers, but it has a limited range of applications so far. (Nature)

OpenAI is weighing changes to its corporate structure as it seeks more funding 

These discussions come as Apple, Nvidia, and Microsoft are considering a funding round that would value OpenAI at more than $100 billion. (Financial Times)

The UK is building an alarm system for climate tipping points

The UK’s new moonshot research agency just launched an £81 million ($106 million) program to develop early warning systems to sound the alarm if Earth gets perilously close to crossing climate tipping points.

A climate tipping point is a threshold beyond which certain ecosystems or planetary processes begin to shift from one stable state to another, triggering dramatic and often self-reinforcing changes in the climate system. 

The Advanced Research and Invention Agency (ARIA) will announce today that it’s seeking proposals to work on systems for two related climate tipping points. One is the accelerating melting of the Greenland Ice Sheet, which could raise sea levels dramatically. The other is the weakening of the North Atlantic Subpolar Gyre, a huge current rotating counterclockwise south of Greenland that may have played a role in triggering the Little Ice Age around the 14th century. 

The goal of the five-year program will be to reduce scientific uncertainty about when these events could occur, how they would affect the planet and the species on it, and over what period those effects might develop and persist. In the end, ARIA hopes to deliver a proof of concept demonstrating that early warning systems can be “affordable, sustainable, and justified.” No such dedicated system exists today, though there’s considerable research being done to better understand the likelihood and consequences of surpassing these and other climate tipping points.

Sarah Bohndiek, a program director for the tipping points research program, says we underappreciate the possibility that crossing these points could significantly accelerate the effects of climate change and increase the dangers, possibly within the next few decades.

By developing an early warning system, “we might be able to change the way that we think about climate change and think about our preparedness for it,” says Bohndiek, a professor of biomedical physics at the University of Cambridge. 

ARIA intends to support teams that will work toward three goals: developing low-cost sensors that can withstand harsh environments and provide more precise, much-needed data about the conditions of these systems; deploying those and other sensing technologies to create “an observational network to monitor these tipping systems”; and building computer models that harness the laws of physics and artificial intelligence to pick up “subtle early warning signs of tipping” in the data.
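ARIA’s call doesn’t specify the algorithms, but one standard family of early-warning indicators in the tipping-point literature relies on “critical slowing down”: as a system loses resilience, its fluctuations tend to become larger and more sluggish. The hypothetical Python sketch below computes two such indicators—rolling variance and lag-1 autocorrelation—on a synthetic series; it illustrates the kind of statistical signal such models hunt for, not the program’s actual methods.

```python
# Hedged sketch of a classic early-warning analysis (not ARIA's method):
# rising variance and rising lag-1 autocorrelation in a time series are
# commonly used indicators of "critical slowing down" near a tipping point.
import numpy as np

def rolling_indicators(series, window=100):
    """Rolling variance and lag-1 autocorrelation over a sliding window."""
    variances, autocorrs = [], []
    for end in range(window, len(series) + 1):
        chunk = series[end - window:end]
        variances.append(np.var(chunk))
        autocorrs.append(np.corrcoef(chunk[:-1], chunk[1:])[0, 1])
    return np.array(variances), np.array(autocorrs)

# Synthetic demo: noise whose "memory" slowly increases, mimicking a system
# losing resilience. Both indicators should trend upward over time.
rng = np.random.default_rng(1)
n = 2000
x = np.zeros(n)
for t in range(1, n):
    ar = 0.2 + 0.7 * t / n              # autocorrelation slowly ramps up
    x[t] = ar * x[t - 1] + rng.normal()

var_trend, ac_trend = rolling_indicators(x)
print(var_trend[0], var_trend[-1])      # variance rises
print(ac_trend[0], ac_trend[-1])        # lag-1 autocorrelation rises
```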

But observers stress that designing precise early warning systems for either system would be no simple feat and might not be possible anytime soon. Not only do scientists have limited understanding of these systems, but the data on how they’ve behaved in the past is patchy and noisy, and setting up extensive monitoring tools in these environments is expensive and cumbersome.

Still, there’s wide agreement that we need to better understand these systems and the risks that the world may face.

Unlocking breakthroughs

It is clear that the tipping of either of these systems could have huge effects on Earth and its inhabitants.

As the world warmed in recent decades, trillions of tons of ice melted off the Greenland Ice Sheet, pouring fresh water into the North Atlantic, pushing up ocean levels, and reducing the amount of heat that the snow and ice reflected back into space. 

Melting rates are increasing as Arctic warming speeds ahead of the global average and hotter ocean waters chip away at ice shelves that buttress land-based glaciers. Scientists fear that as those shelves collapse, the ice sheet will become increasingly unstable. 

The complete loss of the ice sheet would raise global sea levels by more than 20 feet (six meters), submerging coastlines and kick-starting mass climate migration around the globe.

But at any point along the way, the influx of water into the North Atlantic could also substantially slow down the convection systems that help to drive the Subpolar Gyre, because fresher water isn’t as dense and so is less prone to sinking. (Saltier, cooler water readily sinks.)

The weakening of the Subpolar Gyre could cool parts of northwest Europe and eastern Canada, shift the jet stream northward, create more erratic weather patterns across Europe, and undermine the productivity of agriculture and fisheries, according to one study last year. 

The Subpolar Gyre may also influence the strength of the Atlantic Meridional Overturning Circulation (AMOC), a network of ocean currents that moves massive amounts of heat, salt, and carbon dioxide around the globe. The specifics of how a weakened Subpolar Gyre would affect the AMOC are still the subject of ongoing research, but a dramatic slowdown or shutdown of that system is considered one of the most dangerous climate tipping points. It could substantially cool Northern Europe, among other wide-ranging effects.  

The tipping of the AMOC itself, however, is not the focus of the ARIA research program. 

The agency, established last year to “unlock scientific and technological breakthroughs,” is a UK answer to the US’s DARPA and ARPA-E research programs. Other projects it’s funding include efforts to develop precision neurotechnologies, improve robot dexterity, and build safer and more energy-efficient AI systems. ARIA is also setting up programs for developing synthetic plants and exploring climate interventions that could cool the planet, including solar geoengineering. 

Bohndiek and the other program director of the tipping points program—Gemma Bale, an assistant professor at the University of Cambridge—are both medical physicists who previously focused on developing medical devices. At ARIA, they initially expected to work on efforts to decentralize health care.

But Bohndiek says they soon realized that “a lot of these things that need to change at the individual health level will be irrelevant if climate change truly is going to cross these big thresholds.” She adds, “If we’re going to end up in a society where the world is so much warmer … does the problem of decentralizing health care matter anymore?” 

Bohndiek and Bale stress that they hope the program will draw applications from researchers who haven’t traditionally worked on climate change. They add that any research teams proposing to work in or around Greenland must take appropriate steps to engage with local communities, governments, and other research groups.

Tipping dangers

Efforts are already underway to develop greater understanding of the Subpolar Gyre and the Greenland Ice Sheet, including the likelihood, timing, and consequences of their tipping into different states.

There are, for instance, regular field expeditions to measure and refine modeling of ice loss in Greenland. A variety of research groups have set up sensor networks that cross various points of the Atlantic to more closely monitor the shifting conditions of current systems. And several studies have already highlighted the appearance of some “early warning signals” of a potential collapse of the AMOC in the coming decades.

But the goal of the ARIA program is to accelerate such research efforts and sharpen the field’s focus on improving our ability to predict tipping events. 

William Johns, an oceanographer focused on observation of the AMOC at the University of Miami, says the field is a long way from being able to state confidently that systems like the Subpolar Gyre or AMOC will weaken beyond the bounds of normal natural fluctuations, much less say with any precision when they would do so. 

He stresses that there’s still wide disagreement between models on these sorts of questions and limited evidence of what took place before they tipped in the ancient past, all of which makes it difficult to even know what signals we should be monitoring for most closely.

Jaime Palter, an associate professor of oceanography at the University of Rhode Island, adds that she found it a “puzzling” choice to fund a research program focused on the tipping of the Subpolar Gyre. She notes that researchers believe the wind drives the system more than convection, that its connection to the AMOC isn’t well understood, and that the slowdown of the latter system is the one that more of the field is focused on—and more of the world is worried about.

But she and Johns both said that providing funds to monitor these systems more closely is critical to improve scientific understanding of how they work and the odds that they will tip.

Radical interventions

So what could the world do if ARIA or anyone else does manage to develop systems that can predict, with high confidence, that one of these systems will shift into a new state in, say, the next decade?

Bohndiek stresses that the effects of reaching a tipping point wouldn’t be immediate, and that the world would still have years or even decades to take actions that might prevent the breakdown of such systems, or begin adapting to the changes they’ll bring. In the case of runaway melting of the ice sheet, that could mean building higher seawalls or relocating cities. In the case of the Subpolar Gyre weakening, big parts of Europe might have to look to other areas of the world for their food supplies.

More reliable predictions might also alter people’s thinking about more dramatic interventions, such as massive and hugely expensive engineering projects to prop up ice shelves or to freeze glaciers more stably onto the bedrock they’re sliding upon. 

Similarly, they might shift how some people weigh the trade-offs between the dangers of climate change and the risks of interventions like solar geoengineering, which would involve releasing particles in the atmosphere that could reflect more heat back into space.

But some observers note that if enough fresh water is pouring into the Atlantic to weaken the gyre and substantially slow the broader Atlantic current system, there’s very little the world can do to stop it.

“I’m afraid I don’t really see an action you could take,” Johns says. “You can’t go vacuum up all the fresh water—it’s not going to be feasible—and you can’t stop it from melting on the scale we’d have to.”

Bale readily acknowledges that they’ve selected a very hard problem to solve, but she stresses that the point of ARIA research programs is to work at the “edge of the possible.” 

“We genuinely don’t know if an early warning system for these systems is possible,” she says. “But I think if it is possible, we know that it would be valuable and important for society, and that’s part of our mission.”

What this futuristic Olympics video says about the state of generative AI

The Olympic Games in Paris just finished last month and the Paralympics are still underway, so the 2028 Summer Olympics in Los Angeles feel like a lifetime from now. But the prospect of watching the games in his home city has Josh Kahn, a filmmaker in the sports entertainment world who has worked in content creation for both LeBron James and the Chicago Bulls, thinking even further into the future: What might an LA Olympics in the year 3028 look like?

It’s the perfect type of creative exercise for AI video generation, which came into the mainstream with the debut of OpenAI’s Sora earlier this year. By typing prompts into generators like Runway or Synthesia, users can generate fairly high-definition video in minutes. It’s fast and cheap, and it presents few technical obstacles compared with traditional creation techniques like CGI or animation. Even if every frame isn’t perfect—distortions like hands with six fingers or objects that disappear are common—there are, at least in theory, a host of commercial applications. Ad agencies, companies, and content creators could use the technology to create videos quickly and cheaply.  

Kahn, who has been toying with AI video tools for some time, used the latest version of Runway to dream up what the Olympics of the future could look like, entering a new prompt in the model for each shot. The video is just over one minute long and features sweeping aerial views of a futuristic version of LA where sea levels have risen sharply, leaving the city crammed right up to the coastline. A football stadium sits perched on top of a skyscraper, while a dome in the middle of the harbor contains courts for beach volleyball. 

The video, which was shared exclusively with MIT Technology Review, is meant less as a road map for the city and more as a demonstration of what’s possible now with AI.

“We were watching the Olympics and the amount of care that goes into the cultural storytelling of the host city,” Kahn says. “There’s a culture of imagination and storytelling in Los Angeles that has kind of set the tone for the rest of the world. Wouldn’t it be cool if we could showcase what the Olympics would look like if they returned to LA 1,000 years from now?”

More than anything, the video shows what a boon the generative technology may be for creators. However, it also indicates what’s holding it back. Though Kahn declined to share his prompts for the shots or specify how many prompts it took to get each take right, he did caution that anyone wishing to create good content with AI must be comfortable with trial and error. Particularly challenging in his futuristic project was getting the AI model to think outside the box in terms of architecture. A stadium hovering above water, for example, is not something most AI models have seen many examples of in their training data. 

With each shot requiring a new set of prompts, it’s also hard to instill a sense of continuity throughout a video. The color, angle of the sun, and shapes of buildings are difficult for a video generation model to keep consistent. The video also lacks any close-ups of people, which Kahn says AI models still tend to struggle with. 

“These technologies are always better on large-scale things right now as opposed to really nuanced human interaction,” he says. For this reason, Kahn imagines that early filmmaking applications of generative video might be for wide shots of landscapes or crowds. 

Alex Mashrabov, an AI video expert who left his role as director of generative AI at Snap last year to found a new AI video company called Higgsfield AI, agrees on the current failures and flaws of AI video. He also points out that good dialogue-heavy content is hard to produce with AI, as it tends to hinge upon subtle facial expressions and body language. 

Some content creators may be reluctant to adopt generative video simply because of the amount of time required to prompt the models again and again to get the end result right.

“Typically, the success rate is one out of 20,” Mashrabov says, but it’s not uncommon to need 50 or 100 attempts. 

For many purposes, though, that’s good enough. Mashrabov says he’s seen an uptick in AI-generated video advertisements from massive suppliers like Temu. In goods-producing countries like China, video generators are in high demand to quickly make in-your-face video ads for particular products. Even if an AI model might require lots of prompts to yield a usable ad, filming it with real people, cameras, and equipment might be 100 times more expensive. Applications like this might be the first use of generative video at scale as the technology slowly improves, he says. 

“Although I think this is a very long path, I’m very confident there are low-hanging fruits,” Mashrabov says. “We’re figuring out the genres where generative AI is already good today.”

Coming soon: Our 2024 list of Innovators Under 35

To tackle complex global problems such as preventing disease and mitigating climate change, we’re going to need new ideas from our brightest minds. Every year, MIT Technology Review identifies a new class of Innovators Under 35 taking on these and other challenges. 

On September 10, we will honor the 2024 class of Innovators Under 35. These 35 researchers and entrepreneurs are rising stars in their fields pursuing ambitious projects: One is unraveling the mysteries of how our immune system works, while another is engineering microbes to someday replace chemical pesticides.

Each is doing groundbreaking work to advance one of five areas: materials science, biotechnology, robotics, artificial intelligence, or climate and energy. Some have found clever ways to integrate these disciplines. One innovator, for example, enlists tiny robots to reduce the amount of antibiotics required to treat infections.

MIT Technology Review has published its Innovators Under 35 list since 1999. The first edition was created for our 100th anniversary and was meant to give readers a glimpse into the future, by highlighting what some of the world’s most talented young scientists are working on today.

This year, we’re celebrating our 125th anniversary and honoring this 25th class of innovators with the same goal in mind. (Note: The 2024 list will be made available exclusively to subscribers. If you’re not a subscriber, you can sign up here.)

Keep an eye on The Download newsletter next week for our announcement of the new class. You can also meet some of them at EmTech MIT, which will take place on September 30 and October 1 on MIT’s campus in Cambridge, Massachusetts.

If you can’t wait until then, we’ll reveal our Innovator of the Year during a live broadcast on LinkedIn on Monday, September 9. This person stood out for using their ingenuity to address a power imbalance in the tech sector (and that’s the only hint you get). They’ll join me on screen to talk about their work and share what’s next for their research.

A new way to build neural networks could make AI more understandable

A tweak to the way artificial neurons work in neural networks could make AIs easier to decipher.

Artificial neurons—the fundamental building blocks of deep neural networks—have survived almost unchanged for decades. While these networks give modern artificial intelligence its power, they are also inscrutable. 

Existing artificial neurons, used in large language models like GPT-4, work by taking in a large number of inputs, adding them together, and converting the sum into an output using another mathematical operation inside the neuron. Combinations of such neurons make up neural networks, and their combined workings can be difficult to decode.

But the new way to combine neurons works a little differently. Some of the complexity of the existing neurons is both simplified and moved outside the neurons. Inside, the new neurons simply sum up their inputs and produce an output, without the need for the extra hidden operation. Networks of such neurons are called Kolmogorov-Arnold Networks (KANs), after the Russian mathematicians who inspired them.

The simplification, studied in detail by a group led by researchers at MIT, could make it easier to understand why neural networks produce certain outputs, help verify their decisions, and even probe for bias. Preliminary evidence also suggests that as KANs are made bigger, their accuracy increases faster than networks built of traditional neurons.

“It’s interesting work,” says Andrew Wilson, who studies the foundations of machine learning at New York University. “It’s nice that people are trying to fundamentally rethink the design of these [networks].”

The basic elements of KANs were actually proposed in the 1990s, and researchers kept building simple versions of such networks. But the MIT-led team has taken the idea further, showing how to build and train bigger KANs, performing empirical tests on them, and analyzing some KANs to demonstrate how their problem-solving ability could be interpreted by humans. “We revitalized this idea,” said team member Ziming Liu, a PhD student in Max Tegmark’s lab at MIT. “And, hopefully, with the interpretability… we [may] no longer [have to] think neural networks are black boxes.”

While it’s still early days, the team’s work on KANs is attracting attention. GitHub pages have sprung up that show how to use KANs for myriad applications, such as image recognition and solving fluid dynamics problems. 

Finding the formula

The current advance came when Liu and colleagues at MIT, Caltech, and other institutes were trying to understand the inner workings of standard artificial neural networks. 

Today, almost all types of AI, including those used to build large language models and image recognition systems, include sub-networks known as multilayer perceptrons (MLPs). In an MLP, artificial neurons are arranged in dense, interconnected “layers.” Each neuron has within it something called an “activation function”—a mathematical operation that takes the neuron’s combined input and transforms it in some pre-specified manner into an output.

In an MLP, each artificial neuron receives inputs from all the neurons in the previous layer and multiplies each input with a corresponding “weight” (a number signifying the importance of that input). These weighted inputs are added together and fed to the activation function inside the neuron to generate an output, which is then passed on to neurons in the next layer. An MLP learns to distinguish between images of cats and dogs, for example, by choosing the correct values for the weights of the inputs for all the neurons. Crucially, the activation function is fixed and doesn’t change during training.
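As a rough illustration of that textbook picture—not code from the MIT group—here is a minimal Python/NumPy sketch of a single MLP layer, with the weighted sum feeding a fixed activation function:

```python
# Minimal sketch of one MLP layer: a weighted sum of inputs, then a fixed
# (non-learnable) activation function applied inside each neuron.
import numpy as np

def relu(z):
    # A common fixed activation function; it never changes during training.
    return np.maximum(0.0, z)

def mlp_layer(inputs, weights, biases):
    """inputs: (n_in,), weights: (n_out, n_in), biases: (n_out,)."""
    z = weights @ inputs + biases   # each neuron's weighted sum of all inputs
    return relu(z)                  # the fixed activation turns that sum into an output

# Toy forward pass through two layers; only the weights and biases are learned.
rng = np.random.default_rng(0)
x = rng.normal(size=4)                                   # e.g. 4 input features
h = mlp_layer(x, rng.normal(size=(8, 4)), np.zeros(8))   # hidden layer, 8 neurons
y = mlp_layer(h, rng.normal(size=(2, 8)), np.zeros(2))   # output layer, 2 neurons
print(y)
```

During training, only the weights and biases change; the activation function itself stays fixed—the part of the design that KANs rethink.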

Once trained, all the neurons of an MLP and their connections taken together essentially act as another function that takes an input (say, tens of thousands of pixels in an image) and produces the desired output (say, 0 for cat and 1 for dog). Understanding what that function looks like, meaning its mathematical form, is an important part of being able to understand why it produces some output. For example, why does it tag someone as creditworthy given inputs about their financial status? But MLPs are black boxes. Reverse-engineering the network is nearly impossible for complex tasks such as image recognition.

And even when Liu and colleagues tried to reverse-engineer an MLP for simpler tasks that involved bespoke “synthetic” data, they struggled. 

“If we cannot even interpret these synthetic datasets from neural networks, then it’s hopeless to deal with real-world data sets,” says Liu. “We found it really hard to try to understand these neural networks. We wanted to change the architecture.”

Mapping the math

The main change was to remove the fixed activation function and introduce a much simpler learnable function to transform each incoming input before it enters the neuron. 

Unlike the activation function in an MLP neuron, which operates on the combination of many inputs, each simple function outside the KAN neuron takes in one number and spits out another number. Now, during training, instead of learning the individual weights, as happens in an MLP, the KAN just learns how to represent each simple function. In a paper posted this year on the preprint server arXiv, Liu and colleagues showed that these simple functions outside the neurons are much easier to interpret, making it possible to reconstruct the mathematical form of the function being learned by the entire KAN.
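A correspondingly minimal, hypothetical sketch of the KAN idea is below. The actual paper parameterizes each edge’s function with B-splines; for readability this sketch uses simple piecewise-linear curves, but the structural point is the same: each edge learns its own one-dimensional function, and the neuron merely adds up the results.

```python
# Sketch of a KAN-style layer: each edge applies its own small learnable
# one-dimensional function to one input, and the "neuron" just sums the results.
# Real KANs parameterize these edge functions with B-splines; here we use
# piecewise-linear interpolation over fixed knots to keep the idea visible.
import numpy as np

class KANLayerSketch:
    def __init__(self, n_in, n_out, n_knots=8, x_min=-3.0, x_max=3.0, seed=0):
        rng = np.random.default_rng(seed)
        self.knots = np.linspace(x_min, x_max, n_knots)   # fixed grid of knot positions
        # One learnable curve per (output, input) edge: its values at the knots.
        self.values = rng.normal(scale=0.1, size=(n_out, n_in, n_knots))

    def forward(self, x):
        """x: (n_in,) -> (n_out,). Each output is a plain sum of edge functions."""
        n_out, n_in, _ = self.values.shape
        out = np.zeros(n_out)
        for j in range(n_out):
            for i in range(n_in):
                # Evaluate edge (j, i)'s learnable 1-D function at x[i].
                out[j] += np.interp(x[i], self.knots, self.values[j, i])
        return out

layer = KANLayerSketch(n_in=4, n_out=3)
print(layer.forward(np.array([0.5, -1.2, 2.0, 0.0])))
```

Because every edge function depends on just one input, each learned curve can be plotted and inspected on its own—the property the team leans on when it talks about interpretability.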

The team, however, has only tested the interpretability of KANs on simple, synthetic data sets, not on real-world problems, such as image recognition, which are more complicated. “[We are] slowly pushing the boundary,” says Liu. “Interpretability can be a very challenging task.”

Liu and colleagues have also shown that KANs get more accurate at their tasks with increasing size faster than MLPs do. The team proved the result theoretically and showed it empirically for science-related tasks (such as learning to approximate functions relevant to physics). “It’s still unclear whether this observation will extend to standard machine learning tasks, but at least for science-related tasks, it seems promising,” Liu says.

Liu acknowledges that KANs come with one important downside: it takes more time and compute power to train a KAN, compared to an MLP.

“This limits the application efficiency of KANs on large-scale data sets and complex tasks,” says Di Zhang, of Xi’an Jiaotong-Liverpool University in Suzhou, China. But he suggests that more efficient algorithms and hardware accelerators could help.

Anil Ananthaswamy is a science journalist and author who writes about physics, computational neuroscience, and machine learning. His new book, WHY MACHINES LEARN: The Elegant Math Behind Modern AI, was published by Dutton (Penguin Random House US) in July.

A new smart mask analyzes your breath to monitor your health

Your breath can give away a lot about you. Each exhalation contains all sorts of compounds, including possible biomarkers for disease or lung conditions, that could give doctors a valuable insight into your health.

Now a new smart mask, developed by a team at the California Institute of Technology, could help doctors check your breath for these signals continuously and in a noninvasive way. A patient could wear the mask at home, measure their own levels, and then go to the doctor if a flare-up is likely. 

“They don’t have to come to the clinic to assess their inflammation level,” says Wei Gao, professor of Medical Engineering at Caltech and one of the smart mask’s creators. “This can be lifesaving.”

The smart mask, details of which were published in Science today, uses a two-part cooling system to chill the breath of its wearer. The cooling turns the breath into exhaled breath condensate (EBC). 

EBC, essentially a liquid version of someone’s breath, is easier to analyze, because biomarkers like nitrite and alcohol content are more concentrated in a liquid than in a gas. The mask design takes inspiration from capillary action in plants, using a series of microfluidic modules that create pressure to push the EBC fluid around to sensors in the mask.

The sensors are connected via Bluetooth to a device like a phone, where the patient has access to real-time health readings.

“The biggest challenge has always been collecting real-time samples. This problem has been solved. That’s a paradigm shift,” says Rajan Chakrabarty, professor of Environmental and Chemical Engineering at Washington University in St. Louis, who was not involved in the research.

The Caltech team tested the smart mask with patients, including several who had chronic obstructive pulmonary disease (COPD) or asthma or had just gotten over a covid-19 infection. They were testing the masks for comfort and breathability, but they also wanted to see if the masks actually worked at tracking useful biomarkers throughout a patient’s daily activities, such as exercise and work. 

The mask picked up on higher levels of nitrite in patients who had asthma or other conditions that involved inflamed airways. It also picked up on higher alcohol content after a patient went out drinking, which demonstrates another potential application of the mask. Analyzing breath this way is more accurate than the typical breathalyzer test, which involves a patient blowing into a device; blowing can produce imprecise results because alcohol in saliva can end up in the sample.

The researchers hope this is just the beginning. They plan to test the masks on a larger population, and if all goes well, commercialize the masks to get them out to a wider audience. They hope the mask will be a platform for broader application, where sensors for a range of biomarkers could be slotted in and out. 

“What I would like to be able to do is take off their sensors, put in my sensors, and this becomes the building block for doing all other types of development,” says Albert Titus, professor and chair of the Department of Biomedical Engineering at the University at Buffalo, who wasn’t part of the Caltech team. “That’s where I’d like to see it go.”

For example, it may be possible to measure ketones in the breath—high levels can be a sign of diabetes—or glucose, to help people with diabetes monitor their condition.

“The mask can be reconfigured for many different applications,” says Gao.

How machine learning is helping us probe the secret names of animals

Do animals have names? According to the poet T.S. Eliot, cats have three: the name their owner calls them (like George); a second, more noble one (like Quaxo or Cricopat); and, finally, a “deep and inscrutable” name known only to themselves “that no human research can discover.”

But now, researchers armed with audio recorders and pattern-recognition software are making unexpected discoveries about the secrets of animal names—at least with small monkeys called marmosets.  

That’s according to a team at Hebrew University in Israel, who claim in the journal Science this week to have discovered that marmosets “vocally label” their monkey friends with specific sounds.

Until now, only humans, dolphins, elephants, and probably parrots had been known to use specific sounds to call out to other individuals.

Marmosets are highly social creatures that maintain contact through high-pitched chirps and twitters called “phee-calls.” By recording different pairs of monkeys placed near each other, the team in Israel says they found the animals will adjust their sounds toward a vocal label that’s specific to their conversation partner. 

“It’s similar to names in humans,” says David Omer, the neuroscientist who led the project. “There’s a typical time structure to their calls, and what we report is that the monkey fine-tunes it to encode an individual.”

These names aren’t really recognizable to the human ear; instead, they were identified via a “random forest,” the statistical machine learning technique Omer’s team used to cluster, classify, and analyze the sounds.
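The team’s exact features and pipeline aren’t described here, but to give a flavor of how a random-forest analysis of this kind can work, below is a minimal, hypothetical Python sketch using scikit-learn: invented acoustic features for each phee-call are used to predict which monkey the call was addressed to, and cross-validated accuracy above chance would hint that the calls carry addressee-specific structure.

```python
# Illustrative sketch only: a random forest predicting which monkey a call is
# addressed to from simple acoustic features. The data and feature names are
# invented; this is not the Hebrew University team's actual pipeline.
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(42)
n_calls = 300
# Hypothetical per-call features, e.g. duration, mean pitch, pitch slope, etc.
features = rng.normal(size=(n_calls, 6))
# Label: identity of the monkey being addressed (4 possible partners).
addressee = rng.integers(0, 4, size=n_calls)

model = RandomForestClassifier(n_estimators=200, random_state=0)
# If calls really carry addressee-specific structure, accuracy should beat the
# ~25% chance level; with this purely random toy data it will hover near chance.
scores = cross_val_score(model, features, addressee, cv=5)
print(scores.mean())
```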

To prove they’d cracked the monkey code—and learned the secret names—the team played recordings at the marmosets through a speaker and found they responded more often when their label, or name, was in the recording.

This sort of research could provide clues to the origins of human language, which is arguably the most powerful innovation in our species’ evolution, right up there with opposable thumbs. In years past, it’s been argued that human language is unique and that animals lack both the brains and vocal apparatus to converse.

But there’s growing evidence that isn’t the case, especially now that the use of names has been found in at least four distantly related species. “This is very strong evidence that the evolution of language was not a singular event,” says Omer.

Some similar research tactics were reported earlier this year by Mickey Pardo, a postdoctoral researcher, now at Cornell University, who spent 14 months in Kenya recording elephant calls. Elephants sound alarms by trumpeting, but in reality most of their vocalizations are deep rumbles that are only partly audible to humans.

Pardo also found evidence that elephants use vocal labels, and he says he can definitely get an elephant’s attention by playing the sound of another elephant addressing it. But does this mean researchers are now “speaking animal”? 

Not quite, says Pardo. Real language, he thinks, would mean the ability to discuss things that happened in the past or string together more complex ideas. Pardo says he’s hoping to determine next if elephants have specific sounds for deciding which watering hole to visit—that is, whether they employ place names.

Several efforts are underway to discover if there’s still more meaning in animal sounds than we thought. This year, a group called Project CETI that’s studying the songs of sperm whales found they are far more complex than previously recognized. It means the animals, in theory, could be using a kind of grammar—although whether they actually are saying anything specific isn’t known.

Another effort, the Earth Species Project, aims to use “artificial intelligence to decode nonhuman communication” and has started helping researchers collect more data on animal sounds to feed into those models. 

The team in Israel say they will also be giving the latest types of artificial intelligence a try. Their marmosets live in a laboratory facility, and Omer says he’s already put microphones in monkeys’ living space in order to record everything they say, 24 hours a day.

Their chatter, Omer says, will be used to train a large language model that could, in theory, be used to finish a series of calls that a monkey started, or produce what it predicts is an appropriate reply. But will a primate language model actually make sense, or will it just gibber away without meaning? 

Only the monkeys will be able to say for sure.  

“I don’t have any delusional expectations that they will talk about Nietzsche,” says Omer. “I don’t expect it to be extremely complex like a human, but I would expect it to help us understand something about how our language developed.”