Back to AI information
OpenAI releases Aardvark: A GPT-5-powered "proxy security researcher" who has begun private beta testing.

OpenAI releases Aardvark: A GPT-5-powered "proxy security researcher" who has begun private beta testing.

AI information Admin 66 views

OpenAI has announced the release of Aardvark, a GPT-5-powered "proxy security researcher" tool, now in private beta. Aardvark continuously analyzes codebases, builds threat models, locates vulnerabilities based on changes submitted, and assesses exploitability. After reproducing the vulnerabilities in a sandbox, it calls Codex to generate fixes, which are then submitted for human review and merging along with step-by-step explanations and code annotations. The system is geared towards development and security teams, emphasizing providing actionable remediation suggestions without disrupting development processes.

According to official disclosures, Aardvark has been running continuously in internal and external early partner environments and has identified 92% of known and synthetic vulnerabilities in the "Golden Repository" benchmark. It has also responsibly disclosed multiple issues in open-source projects, ten of which have been assigned CVE numbers. OpenAI also stated that it will provide free scanning services for some non-commercial open-source repositories and update its external coordination and disclosure policies; the scope of access will be gradually expanded during the private testing phase.

Frequently Asked Questions

Q: How can Aardvark be integrated currently?

A: It is currently in private testing phase and is open to invited partners and some open-source projects. The official website provides an application portal.

Q: How is it different from traditional tools (such as fuzz/SCA)?

A: Centered on LLM inference and tool usage, it follows a multi-stage process of "analysis - submission scan - sandbox verification - patching", which is closer to the methodology of human security researchers.

Q: Does it support automatic repair?

A: It will come with a patch generated by Codex and reviewed by Aardvark, but it still needs to be manually reviewed and merged with one click to avoid unsupervised changes.

Q: Is there data on the actual results?

A: The official statement claims a 92% recognition rate in the benchmark repository and has facilitated 10 CVE-level disclosures; external reproduction and cross-sectional evaluation are still underway.

Q: What are the plans for the open source community?

A: The plan is to provide free scanning for some non-commercial open source repositories and promote sustainable collaboration with an updated disclosure policy.

Aardvark Agent Security Researcher AardvarkGPT5 driver Aardvark Continuous Code Analysis Aardvark Threat Modeling Aardvark submitted a change scan. Aardvark vulnerability location Aardvark Availability Assessment Aardvark Sandbox Recreation AardvarkCodex Patch Aardvark step-by-step explanation Aardvark code annotations Aardvark manual review Aardvark One-Click Merge Aardvark does not interrupt research and development Aardvark Development Security Integration AardvarkDevSecOps Integration Aardvark CICD Integration Aardvark Private Beta Application Portal Aardvark Invited Partner Aardvark Public Welfare Scan Aardvark is open source friendly. Aardvark Disclosure Policy Aardvark's Disclosure of Responsibilities AardvarkCVE's record Aardvark Gold Warehouse 92% Aardvark benchmark recognition rate Aardvark vs. fuzzSCA AardvarkLLM Reasoning Aardvark tool call Aardvark multi-stage process Aardvark patch generation Aardvark false alarm control Aardvark vulnerability priority Aardvark risk convergence Aardvark codebase continuously monitored Aardvark dependencies and SCA Aardvark submits diff protection AardvarkPR Auto Comments Aardvark Knowledge Report Aardvark Security Metrics Aardvark Extended Ecosystem Aardvark Enterprise Landing Aardvark External Reviews Aardvark model update Aardvark Multilingual Repository Aardvark repeatable experiments Aardvark's repair suggestions are executable. Aardvark team collaboration Aardvark scan range configuration Aardvark speed and credit limit

Recommended Tools

More