Databricks companions with OpenAI on GPT-5.5


Databricks is worked up to associate with OpenAI on GPT-5.5, their newest frontier mannequin. GPT-5.5 is OpenAI’s strongest frontier mannequin for agentic work in enterprise, advanced doc reasoning, and long-horizon coding brokers. GPT-5.5 additionally now powers Codex, OpenAI’s coding agent.

GPT-5.5 Options and Advantages

GPT-5.5 is the neatest frontier mannequin but and the subsequent step towards a brand new method of getting work carried out. It understands what you’re making an attempt to do extra shortly and may tackle extra of the work itself. Codex, OpenAI’s coding agent, is now powered by GPT-5.5, with stronger reasoning and execution capabilities for developer workflows. 

The identical strengths that make GPT-5.5 nice at coding additionally make it highly effective for on a regular basis work on a pc. As a result of the mannequin is healthier at understanding intent, it might transfer extra naturally by the total loop of data work: discovering info, understanding what issues, utilizing instruments, checking the output, and turning uncooked materials into one thing helpful.

It may well write and debug code, analysis on-line, analyze knowledge, create paperwork and spreadsheets, function software program, and transfer throughout instruments till a process is completed. As an alternative of rigorously managing each step, you can provide GPT-5.5 a messy, multi-part process and belief it to plan, use instruments, examine its work, get well from ambiguity, and preserve going.

GPT-5.5 units the state-of-the-art efficiency

To know how these enhancements translate into actual enterprise workloads, we evaluated GPT-5.5 on OfficeQA, Databricks’ benchmark for document-heavy, multi-step analytical duties clients carry out each day. OfficeQA, constructed from 89,000 pages of U.S. Treasury Bulletins, measures a mannequin’s skill to retrieve info throughout paperwork, interpret advanced tables, and carry out exact calculations grounded in actual enterprise knowledge.

When given the suitable paperwork (OfficeQA Professional LLM with Oracle PDF + Net Search), GPT-5.5 scored 64.66%, an honest bounce from GPT-5.4’s 57.14%, representing a ~13% enchancment and a brand new state-of-the-art on this benchmark. This checks the ceiling of what the mannequin can do when retrieval is already dealt with.
In a full-agent workflow eval (OfficeQA Professional Agent Harness), the place the mannequin should discover the suitable paperwork, parse them, and compute solutions by itself utilizing the Codex agent harness, GPT-5.5 scored 52.63%, up from GPT-5.4’s 36.10%. That is a 46% discount in errors, exhibiting that GPT-5.5’s positive factors aren’t simply theoretical; they maintain up in sensible, end-to-end enterprise workflows.

GPT-5.5 is coming quickly to Databricks. Deliver frontier reasoning to your enterprise knowledge, securely, and at scale.

Related Articles

LEAVE A REPLY

Please enter your comment!
Please enter your name here

Latest Articles