Exclusive: Anthropic's Claude AI model takes on (and beats) human hackers

""Originally it was just me at a hotel realizing that PicoCTF had started and being like, 'Oh, I wonder if Claude could do some of these challenges,'" said Lucas."

""Claude was able to solve most of those challenges and get in the top 3% of PicoCTF," he said."

"In one competition, Claude solved 11 out of 20 progressively harder challenges in just 10 minutes."

"Claude isn't alone. Across the industry, AI agents are proving they're already achieving near-expert levels of offensive cybersecurity work."

Claude, developed by Anthropic, was tested in hacking competitions and showed remarkable performance by achieving top positions. Keane Lucas entered Claude into the PicoCTF competition, where it solved most challenges quickly and reached the top 3%. The use of AI in cybersecurity challenges has proven that such agents can independently solve complex problems with minimal assistance. In one event, Claude displayed rapid problem-solving skills, showing the growing potential of AI in offensive cybersecurity roles. The success of Claude highlights a trend of AI capabilities advancing towards expert levels in security tasks.

#ai #cybersecurity #offensive-security #claude #hacking-competitions

Read at Axios

Unable to calculate read time

Collection

[

...

]

Exclusive: Anthropic's Claude AI model takes on (and beats) human hackersExclusive: Anthropic's Claude AI model takes on (and beats) human hackers Briefly

Exclusive: Anthropic's Claude AI model takes on (and beats) human hackers
Exclusive: Anthropic's Claude AI model takes on (and beats) human hackers
Briefly