{"id":712436,"date":"2026-02-19T04:30:20","date_gmt":"2026-02-19T01:30:20","guid":{"rendered":"https:\/\/buradabiliyorum.com\/en\/openai-pits-ai-agents-against-each-other-to-red-team-smart-contracts\/"},"modified":"2026-02-19T04:30:20","modified_gmt":"2026-02-19T01:30:20","slug":"openai-pits-ai-agents-against-each-other-to-red-team-smart-contracts","status":"publish","type":"post","link":"https:\/\/buradabiliyorum.com\/en\/openai-pits-ai-agents-against-each-other-to-red-team-smart-contracts\/","title":{"rendered":"OpenAI pits AI agents against each other to red team smart contracts"},"content":{"rendered":"<p style=\"float:right;margin:0 0 10px 15px;width:240px\"><img decoding=\"async\" src=\"https:\/\/images.cointelegraph.com\/images\/840_aHR0cHM6Ly9zMy5jb2ludGVsZWdyYXBoLmNvbS91cGxvYWRzLzIwMjYtMDEvMDE5YmMxOTItZTE1Zi03ODg0LWExMDAtNTY5NDBmYmYyNTM1LmpwZw==.jpg\" alt=\"OpenAI pits AI agents against each other to red team smart contracts\" class=\"type:primaryImage\"><\/p>\n<p>OpenAI said it is becoming increasingly important to evaluate the performance of AI agents in \u201ceconomically meaningful environments\u201d as their adoption grows.<\/p>\n<p>OpenAI has launched a new benchmark that evaluates how well different AI models detect, patch, and even exploit security vulnerabilities found in crypto smart contracts.<\/p>\n<p>OpenAI <a href=\"https:\/\/cdn.openai.com\/evmbench\/evmbench.pdf\" rel=\"noopener nofollow\" target=\"_blank\">released<\/a> the \u201cEVMbench: Evaluating AI Agents on Smart Contract Security\u201d paper on Wednesday, in collaboration with crypto investment firm Paradigm and crypto security firm OtterSec, to evaluate how much the AI agents could theoretically exploit from 120 smart contract vulnerabilities.<\/p>\n<p>Anthropic\u2019s Claude Opus 4.6 came out on top with an average \u201cdetect award\u201d of $37,824, followed by OpenAI\u2019s OC-GPT-5.2 and Google\u2019s Gemini 3 Pro at $31,623 and $25,112, 
respectively.<\/p>\n<p><span style=\"color: black;\"><a style=\"color: #ff9900;\" href=\"https:\/\/cointelegraph.com\/news\/openai-benchmark-ai-agents-detect-smart-contract-flaws?utm_source=rss_feed&#038;utm_medium=feed&#038;utm_campaign=rss_partner_inbound\" target=\"_blank\" >Source<\/a><\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>OpenAI said it is becoming increasingly important to evaluate the performance of AI agents in \u201ceconomically meaningful environments\u201d as their adoption grows. OpenAI has launched a new benchmark that evaluates how well different AI models detect, patch, and even exploit security vulnerabilities found in crypto smart contracts. 
OpenAI released the \u201cEVMbench: Evaluating AI Agents on&#8230;<\/p>\n","protected":false},"author":1,"featured_media":712437,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"fifu_image_url":"https:\/\/images.cointelegraph.com\/cdn-cgi\/image\/f=auto,onerror=redirect,w=1200\/https:\/\/s3.cointelegraph.com\/uploads\/2026-01\/019bc192-e15f-7884-a100-56940fbf2535.jpg","fifu_image_alt":"","footnotes":""},"categories":[1],"tags":[],"class_list":["post-712436","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-general"],"_links":{"self":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/712436","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/comments?post=712436"}],"version-history":[{"count":0,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/712436\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media\/712437"}],"wp:attachment":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media?parent=712436"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/categories?post=712436"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/tags?post=712436"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}