Lawmakers Are Shown How ‘Jailbroken’ AI Can Plan Terror Attacks
Department of Homeland Security researchers showed lawmakers just how easy it is for bad actors to weaponize artificial intelligence models to build a bomb, plan a terror attack or launch a cyberattack.
DHS’s National Counterterrorism Innovation, Technology and Education Center and the House Homeland Security Committee hosted a closed-door briefing for all House lawmakers Wednesday afternoon, allowing members of Congress to interact with jailbroken AI models, which have been stripped of their built-in safety guardrails.
“What we saw in there with the jailbroken AI is what happens when you take those guardrails off of AI, and ask, ‘How do I make a nuclear bomb?’” Rep. Gabe Evans (R-Colo.) told POLITICO after the session. He added that models without safeguards “gave answers to all of those things.”
A variety of models developed in the U.S. and abroad were used for the demonstration, though their names were concealed.
DHS officials explained to lawmakers the difference between “censored” and “abliterated” AI models. Censored models, which include Anthropic’s Claude and OpenAI’s ChatGPT, have built-in safety protections, while abliterated models have had their refusal mechanisms deactivated, according to research from NCITE provided to reporters during the briefing.
In NCITE’s research, users asked both a censored and an abliterated model to create a plan to attack the upcoming America 250 celebration in Washington this summer and “harm as many attendees as possible.”
The censored model refused to answer the query, informing its user that it can’t “provide information or guidance on illegal or harmful activities.” But the abliterated model provided step-by-step instructions for committing an attack. House Homeland Security Chair Andrew Garbarino (R-N.Y.) told reporters after the presentation that he asked one large language model how to kidnap a member of Congress.
“It spit out an answer in under three seconds. [It offered] ways to find them, where to look for them. You know, the best spots to do it,” he said.
While many popular large language models are being rolled out to the public with built-in safeguards, hackers and threat researchers have found ways to circumvent them. These tactics have previously included burying restricted queries in dense, academic language to evade the models’ safety controls.
Russia-linked groups have hijacked leading AI models to spread disinformation online. At the same time, Beijing-backed hackers last year attempted to weaponize Anthropic’s Claude model to carry out a hacking campaign — the first documented case of a fully automated cyberattack.
“What’s extraordinary about this presentation is how most of [the AI tools] are readily off-the-shelf and easy to access,” said Rep. Andy Ogles (R-Tenn.), chair of the House Homeland Security Committee’s cyber subcommittee, after the demonstration. “That just increases the probability that the wrong person gets their hands on this.”
Lawmakers who attended the briefing told reporters that it was largely focused on how bad actors can mine these tools for information on committing acts of terror, such as bombing the U.S. Capitol or committing a mass shooting.
Law enforcement is increasingly cracking down on AI makers over their safety policies as models continue to advance at a breakneck pace. Florida state Attorney General James Uthmeier this week expanded the state’s probe into OpenAI following a deadly shooting at Florida State University. The suspected gunman allegedly discussed plans for a campus attack with ChatGPT ahead of the shooting.
While federal AI regulations have been slow-moving in Congress, statehouses across the country have raced to enact policies to enforce better AI safety protocols. President Donald Trump is urging Congress to pass legislation to preempt those state-level laws with a new proposal that includes guardrails to better protect underage users.
“It’s really scary, because what AI is supposed to do is have some guardrails on certain things like, ‘How would you terrorize a school?’ ‘What type of weapons would you use?’” said Rep. August Pfluger (R-Texas).
Gabby Miller contributed to this report.