Back to AI information
Perplexity Releases BrowseSafe and BrowseSafe-Bench to Enhance AI Browsing Security

Perplexity Releases BrowseSafe and BrowseSafe-Bench to Enhance AI Browsing Security

AI information Admin 104 views

Perplexity announced the launch of the BrowseSafe system and its companion benchmark, BrowseSafe-Bench, to improve the security of AI browsers in real-world web environments. The solution is aimed at its Comet browser scenario, and at its core, it is a model that specifically detects malicious natural language instructions in web pages, which can scan full-page HTML in real time without significantly increasing latency to identify prompt injection attacks against agents.

According to the

article, BrowseSafe-Bench contains more than 14,000 production-friendly web page samples, covering 11 types of attack targets, 9 injection locations, and multiple languages and expression styles, to evaluate the performance of different defense strategies on complex, noisy-rich pages. Perplexity treats the browser as a "working environment that proxies tasks", treats all content from web pages, emails, and files as untrusted input, and reduces the risk of the model being hijacked by hidden instructions through a "defense in depth" strategy, combined with content scanning, least privilege tool calls, and secondary confirmation of sensitive operations.

The company said that BrowseSafe and the benchmark are provided in an open-source manner, allowing developers to run detection models locally to stress test and secure security hardening of self-built browsing agents without building a protection framework from scratch. The evaluation results show that direct and explicit attacks are relatively easy to intercept, and multilingual or hidden instructions in an indirect, hypothetical tone are more confusing, suggesting that continuous training and iteration for these weaknesses are still needed in the future.

FAQs

Q: What is BrowseSafe?

A: BrowseSafe is a model that focuses on detecting malicious instructions in web pages and is used to identify prompt injection attacks in real-time in AI browsers.

Q: What does BrowseSafe-Bench do?

A: It is a public benchmark of more than 14,000 web page samples to evaluate and improve the effectiveness of prompt injection defenses.

Q: What types of security threats does the program mainly address?

A: It mainly targets malicious text instructions hidden in comments, templates, footers, and other places on web pages to prevent them from hijacking AI agents.

Q: How does Perplexity implement "depth of defense" in the browser?

A: It takes effect by pre-scanning all untrusted content, restricting tool permissions, and requiring users to confirm sensitive operations.

Q: How can developers use BrowseSafe?

A: Developers can directly call open source detection models and benchmarks, integrate them locally into their own proxy systems, and automatically scan and evaluate page content.

BrowseSafe Browser prompts for injection protection PerplexityCometAI browsing security mechanisms BrowseSafeBench prompts the injection benchmark AI browser web malicious instruction detection Defend against hidden cue injection attacks on web pages Multilingual prompts are injected into offensive and defensive security practices Security risks of AI agents in real networks HTML full-page scan detects malicious natural language The browser agent enforces environmental protection policies Tips inject defense-deep multi-layered architectures Malicious instruction identification scheme in the comment area of the web page Hidden attack statement detection in the footer template AI browsing agent least privilege tool call Sensitive operation secondary confirmation to prevent overreach BrowseSafe open-source prompt injection detection model BrowseSafeBench 14,000 web samples Production-friendly prompt injection test sets Multiple injection locations and expression style assessments AI browsing security benchmarks to evaluate defense strategies The browser treats the webmail file as untrustworthy A practical guide to building a secure AI browsing agent Multilingual implicit prompt injection detection difficulty analysis Browser prompts are injected into the offensive and defensive open source toolchain Run the BrowseSafe model stress test locally Integrated browsing security protection for self-built AI agents Malicious instruction identification in complex noise web pages Comparison of direct and indirect attack interception effects How AI browsers can avoid being hijacked by hidden instructions Impact of prompt injection attacks on AI agent tasks Best practices for LLM security in browser scenarios PerplexityComet browses the proxy security architecture analysis Impact of prompt injection protection on AI search experience See industry security trends from BrowseSafe open source Set the least permission on tool calls in the browser The safety design of sensitive operations requires secondary confirmation by the user BrowseSafeBench is used to evaluate a variety of defense strategies Tip injection defense requires continuous training and iteration Build a safe and reliable AI web browsing assistant How developers can integrate BrowseSafe detection locally Filtering malicious statements for comment sections and footers AI Browser maintains zero trust principles for unknown web pages The BrowseSafe model has minimal impact on latency Prompt injection attack samples cover multiple languages and scenarios A baseline of security when AI agents browse the real web Use BrowseSafe for internal knowledge base linking Security challenges for agents browsing mailboxes and file contents Prompt injection protection and privacy compliance are considered together AI browser security assessment and red teaming process Combined with BrowseSafe and Sandbox isolation, multiple protections Prompt injection attack protection in product design

Recommended Tools

More