Be part of our day by day and weekly newsletters for the most recent updates and unique content material on industry-leading AI protection. Study Extra
When authorized analysis firm LexisNexis created its AI assistant Protégé, it needed to determine the easiest way to leverage its experience with out deploying a big mannequin.
Protégé goals to assist legal professionals, associates and paralegals write and proof authorized paperwork and make sure that something they cite in complaints and briefs is correct. Nevertheless, LexisNexis didn’t need a normal authorized AI assistant; they needed to construct one which learns a agency’s workflow and is extra customizable.
LexisNexis noticed the chance to deliver the facility of enormous language fashions (LLMs) from Anthropic and Mistral and discover the most effective fashions that reply person questions the most effective, Jeff Riehl, CTO of LexisNexis Authorized and Skilled, instructed VentureBeat.
“We use the best model for the specific use case as part of our multi-model approach. We use the model that provides the best result with the fastest response time,” Riehl stated. “For some use cases, that will be a small language model like Mistral or we perform distillation to improve performance and reduce cost.”
Whereas LLMs nonetheless present worth in constructing AI functions, some organizations flip to utilizing small language fashions (SLMs) or distilling LLMs to grow to be small variations of the identical mannequin.
Distillation, the place an LLM “teaches” a smaller mannequin, has grow to be a well-liked methodology for a lot of organizations.
Small fashions typically work greatest for apps like chatbots or easy code completion, which is what LexisNexis needed to make use of for Protégé.
This isn’t the primary time LexisNexis constructed AI functions, even earlier than launching its authorized analysis hub LexisNexis + AI in July 2024.
“We have used a lot of AI in the past, which was more around natural language processing, some deep learning and machine learning,” Riehl stated. “That really changed in November 2022 when ChatGPT was launched, because prior to that, a lot of the AI capabilities were kind of behind the scenes. But once ChatGPT came out, the generative capabilities, the conversational capabilities of it was very, very intriguing to us.”
Small, fine-tuned fashions and mannequin routing
Riehl stated LexisNexis makes use of completely different fashions from many of the main mannequin suppliers when constructing its AI platforms. LexisNexis + AI used Claude fashions from Anthropic, OpenAI’s GPT fashions and a mannequin from Mistral.
This multimodal method helped break down every activity customers needed to carry out on the platform. To do that, LexisNexis needed to architect its platform to change between fashions.
“We would break down whatever task was being performed into individual components, and then we would identify the best large language model to support that component. One example of that is we will use Mistral to assess the query that the user entered in,” Riehl stated.
For Protégé, the corporate needed sooner response instances and fashions extra fine-tuned for authorized use instances. So it turned to what Riehl calls “fine-tuned” variations of fashions, basically smaller weight variations of LLMs or distilled fashions.
“You don’t need GPT-4o to do the assessment of a query, so we use it for more sophisticated work, and we switch models out,” he stated.
When a person asks Protégé a query a couple of particular case, the primary mannequin it pings is a fine-tuned Mistral “for assessing the query, then determining what the purpose and intent of that query is” earlier than switching to the mannequin greatest suited to finish the duty. Riehl stated the following mannequin might be an LLM that generates new queries for the search engine or one other mannequin that summarizes outcomes.
Proper now, LexisNexis largely depends on a fine-tuned Mistral mannequin although Riehl stated it used a fine-tuned model of Claude “when it first came out; we are not using it in the product today but in other ways.” LexisNexis can also be occupied with utilizing different OpenAI fashions particularly for the reason that firm got here out with new reinforcement fine-tuning capabilities final 12 months. LexisNexis is within the means of evaluating OpenAI’s reasoning fashions together with o3 for its platforms.
Riehl added that it could additionally have a look at utilizing Gemini fashions from Google.
LexisNexis backs all of its AI platforms with its personal information graph to carry out retrieval augmented technology (RAG) capabilities, particularly as Protégé may assist launch agentic processes later.
The AI authorized suite
Even earlier than the appearance of generative AI, LexisNexis examined the opportunity of placing chatbots to work within the authorized {industry}. In 2017, the firm examined an AI assistant that might compete with IBM’s Watson-powered Ross and Protégé sits within the firm’s LexisNexis + AI platform, which brings collectively the AI companies of LexisNexis.
Protégé helps regulation companies with duties that paralegals or associates are inclined to do. It helps write authorized briefs and complaints which are grounded in companies’ paperwork and information, recommend authorized workflow subsequent steps, recommend new prompts to refine searches, draft questions for depositions and discovery, hyperlink quotes in filings for accuracy, generate timelines and, in fact, summarize complicated authorized paperwork.
“We see Protégé as the initial step in personalization and agentic capabilities,” Riehl stated. “Think about the different types of lawyers: M&A, litigators, real estate. It’s going to continue to get more and more personalized based on the specific task you do. Our vision is that every legal professional will have a personal assistant to help them do their job based on what they do, not what other lawyers do.”
Protégé now competes in opposition to different authorized analysis and know-how platforms. Thomson Reuters custom-made OpenAI’s o1-mini-model for its CoCounsel authorized assistant. Harvey, which raised $300 million from traders together with LexisNexis, additionally has a authorized AI assistant.