Anthropic’s Claude can now control computers like people do

3 months ago

Anthropic

Anthropic’s already awesome Claude 3.5 Sonnet gains a important capacity boost connected Tuesday arsenic nan generative AI startup rolls retired an enhanced and updated type of nan exemplary alongside nan new, lightweight Claude 3.5 Haiku. The Sonnet update includes a nationalist beta characteristic that gives nan AI basal power complete nan machine it’s moving on.

Claude 3.5 Sonnet was already a capacity leader erstwhile it comes to coding tasks, but nan caller type shows important across-the-board improvements complete its predecessor and steadily outperforms some Gemini 1.5 and GPT-4o connected a assortment of manufacture benchmarks. Gemini 1.5 Pro was nan only exemplary to champion nan caller 3.5 Sonnet connected immoderate test, and did truthful connected nan MATH benchmark.

The caller 3.5 Haiku is nary slouch, either, contempt its mini size. Scheduled to beryllium released later this month, 3.5 Haiku outperforms Claude 3.0 Opus, nan company’s largest past procreation model. Like its larger version, nan caller Haiku is exceedingly proficient astatine coding tasks, scoring 40.6% connected nan SWE-bench Verified — higher than some GPT-40 and nan original 3.5 Sonnet.

Even much impressive, nan caller Claude 3.5 Sonnet tin now interact pinch desktop apps via nan “Computer Use” API. The AI tin make nan basal keystrokes, rodent clicks, and movements needed to emulate nan quality user. The institution is speedy to constituent retired that nan strategy is presently rather experimental and prone to errors. The underlying intent of nan nationalist beta merchandise is to elicit feedback from developers to quickly amended nan API’s performance.

“We trained Claude to spot what’s happening connected a surface and past usage nan package devices disposable to transportation retired tasks,” Anthropic wrote successful a blog post. “When a developer tasks Claude pinch utilizing a portion of machine package and gives it nan basal access, Claude looks astatine screenshots of what’s visible to nan user, past counts really galore pixels vertically aliases horizontally it needs to move a cursor successful bid to click successful nan correct place.”

Claude | Computer usage for automating operations

It’s an AI agent, essentially. That is, its an AI that tin automate different package processes, whether that’s generating and qualifying trading leads, uncovering patterns and trends successful aesculapian data, aliases simply navigating to a circumstantial website and filling retired a shape you need. Think of them arsenic a much precocious type of existing Robotic Process Automation systems.

The institution cites Asana, Canva, Cognition, DoorDash, Replit, and The Browser Company arsenic early adopters of nan caller feature. Replit, for example, is utilizing Computer Control to “develop a cardinal characteristic that evaluates apps arsenic they’re being built for their Replit Agent product,” per nan announcement.

There’s nary request to interest astir nan AI going each Skynet connected america (yet), arsenic Anthropic explains. “Humans stay successful power by providing circumstantial prompts that nonstop Claude’s actions, for illustration ‘use information from my machine and online to capable retired this form,’” an Anthropic spokesperson told TechCrunch. “People alteration entree and limit entree arsenic needed. Claude breaks down nan user’s prompts into machine commands (e.g., moving nan cursor, clicking, typing) to execute that circumstantial task.”

Anthropic besides concedes that Computer Control could beryllium misused to make spam, dispersed misinformation, aliases perpetrate fraud. In response, nan institution has developed caller classifiers that place erstwhile nan API is being utilized and whether that usage is “causing harm.”

Source Digital

↑

Anthropic’s Claude can now control computers like people do

Related Article

All 41 seasons of ‘Jeopardy!’ are finally coming to streaming

If you have to watch one Amazon Prime Video movie in November 2024, stream this one

NYT Crossword: answers for Sunday, November 10

Popular Article

Soundcore’s newest clip-style earbuds focus on comfort

I’ve been a Firefox power user since it launched 20 years ago – here’s why it still beats Chrome and Safari

Belkin SoundForm Wired Earbuds with USB-C Connector review: sadly, these live up to their nominal price tag

AirPods Pro 2's hearing health features will arrive as a software update starting next week

New fanless cooling technology enhances energy efficiency for AI workloads by achieving a 90% reduction in cooling power consumption