SAP and DataRobot: Elevating Bill Processing with Anomaly Detection and Generative AI


SAP and DataRobot are taking their partnership to new heights by strengthening their collaboration by means of the mixing of predictive and generative AI capabilities. We’ve got developed a cutting-edge partnership that can empower clients to generate worth with AI by seamlessly connecting core SAP BTP with DataRobot AI capabilities.  

For example, let’s discover how organizations can harness the facility of predictive and generative AI to streamline bill processing providing a sooner, extra correct and cost-effective different to guide evaluate and validation.

The Enterprise Downside

Proper now firms of all sizes grapple with a standard problem:  the relentless inflow of invoices.  The substantial quantity of monetary documentation may be overwhelming, usually necessitating a military of workers devoted to guide evaluate and validation.  Nevertheless this strategy shouldn’t be solely time-consuming and expensive, but additionally susceptible to human error, making it a fragile hyperlink within the monetary chain.  

Harnessing the potential of AI is extra necessary than ever earlier than.  Companies can make use of predictive AI fashions to be taught from historic bill information, acknowledge patterns, and mechanically flag potential anomalies in real-time.  This not solely accelerates the validation course of but additionally considerably reduces the margin of error, stopping expensive errors. Moreover, the mixing of generative AI permits for the concise summarization of detected anomalies, bettering communication and making it simpler for groups to take swift and knowledgeable actions.

SAP and DataRobot Built-in AI Answer

This AI software enhances bill processing by means of a mixture of a predictive and generative AI to determine irregularities amongst invoices and to speak the problems across the invoices.

  • Leverage Predictive AI mannequin for anomaly detection.
    • Enterprise perspective: Anomaly detection may also help determine irregularities, reminiscent of incorrect quantities, lacking data or uncommon patterns, earlier than processing funds.
    • Implementation: Practice the mannequin utilizing historic bill information to acknowledge patterns and typical bill traits.  When processing new invoices, the AI mannequin can flag potential anomalies for evaluate, lowering the chance of errors and fraud.
  • Generative AI Summarization:
    • Enterprise perspective: After figuring out anomalies, it is very important talk the problems to the related crew members.  Conventional reporting strategies could also be wordy and time-consuming.  Generative AI may also help interpret and summarize the detected anomalies in a concise and human-readable format.
    • Implementation: Leverage a LLM to generate an explanatory abstract of the detected anomalies.  The AI mannequin can extract key data from the anomaly detection outcomes and supply a transparent and structured narrative that summarizes the detected anomalies and the explanations to be thought-about anomalies, making it simpler for analysts and managers to grasp the problems. 

Structure and Implementation Overview

To attain these aims, our platforms make use of assorted integration factors, as illustrated within the structure graph beneath:

Graph 1. Architecture overview for the SAP - DataRobot Integrated Solution
Graph 1. Structure overview for the SAP – DataRobot Built-in Answer
1. Information preparation and ingestion 

Bill information is ready and parsed in SAP Datasphere / HANA Cloud.  DataRobot accesses and ingest this information from HANA Cloud by means of a JDBC connector.

Graph 2. DataRobot access to create a JDBC connector with SAP HANA.
Graph 2. DataRobot entry to create a JDBC connector with SAP HANA.
2. Function engineering and predictive mannequin coaching

DataRobot  engineers options and conducts experiments with the bill information set, permitting you to coach anomaly detection fashions that excel at recognizing invoices with irregular or irregular data.  The strategy you select may be tailor-made to your particular information situation—whether or not you’ve gotten labeled information or not.  You’ve got choices to deal with this problem successfully, both with a supervised or an unsupervised strategy.

On this case, we utilized historic data that had been categorized as anomalies and non-anomalies.  After information ingestion, DataRobot runs an in depth information exploratory evaluation, identifies any information high quality points, and mechanically generates new options and related function lists.   With that prepared, we had been in a position to conduct a complete evaluation by means of 64 distinct experiments in a brief time period.  In consequence, we had been in a position to pinpoint the top-performing mannequin on the forefront of the leaderboard.  This strategy allowed us to pick out the best predictive mannequin for the duty at hand.  

Graph 3. DataRobot Leaderboard highlighting the best performing model.
Graph 3. DataRobot Leaderboard highlighting one of the best performing mannequin.

Inside every of those experiments, you’ve gotten the chance to completely assess and gauge their efficiency.  This evaluation offers invaluable insights into how every predictive mannequin leverages the options inside your bill to make correct predictions.  To facilitate this course of, you’ve gotten entry to an array of instruments, together with elevate charts, ROC curve, and SHAP prediction explanations, which estimate how a lot every function contributes to a given prediction. These insights supply an intuitive means to realize a deeper understanding of the mannequin’s habits and their affect of the bill information, guaranteeing you make well-informed selections.

Graph 4. This Lift Chart depicts how well the model segments the target population and how capable it is to predict the target, letting you visualize the model’s effectiveness.
Graph 4. This Raise Chart depicts how nicely the mannequin segments the goal inhabitants and the way succesful it’s to foretell the goal, letting you visualize the mannequin’s effectiveness.
Graph 5. SHAP Prediction Explanations estimate how much a feature contributes to a given prediction, reported as its difference from the average. In this example how the delivery Date, shipping and gross amount had an impact.
Graph 5. SHAP Prediction Explanations estimate how a lot a function contributes to a given prediction, reported as its distinction from the common. On this instance how the supply Date, transport and gross quantity had an affect.
3. Mannequin deployment

As soon as we determine the optimum predictive mannequin, we transfer ahead to transition the answer into manufacturing.  This part seamlessly merges our predictive and generative AI strategy by orchestrating the deployment of an unstructured mannequin inside DataRobot.  This deployment harmonizes the predictive AI mannequin for anomaly detection with a Massive Language Mannequin (LLM), which excels in producing textual content to speak the predictive insights.  Alternatively, you’ve gotten the pliability to deploy predictive AI fashions straight inside SAP AI Core, providing a further route for operationalizing your resolution.

The LLM summarizes the rationales linked to every prediction, making it readily digestible on your monetary evaluation wants. This versatile deployment technique ensures that the insights generated are accessible and actionable in a fashion that fits your distinctive enterprise necessities. 

Two easy python recordsdata simply orchestrate this integration by means of easy features and hooks that might be executed every time an bill requires a prediction and its consecutive evaluation.  The primary file named helper.py, has the credentials to attach with GPT 3.5 by means of Azure and incorporates the immediate to summarize the reasons and insights derived from the predictive mannequin.  The second file, named customized.py, simply orchestrates the entire predictive and generative pipeline by means of a number of easy hooks.   You will discover an instance of the right way to assemble customized python recordsdata for unstructured fashions in our github repository.  

You’ve got the potential to check and validate this unstructured mannequin prior its deployment, assuring that it persistently produces the supposed outcomes, freed from any operational hitches.  

Graph 6. Validation of the unstructured model before deployment.
Graph 6. Validation of the unstructured mannequin earlier than deployment.
4. Enterprise Software

As soon as the deployment is formally in manufacturing, an accessible API endpoint turns into your bridge to attach with the deployment, seamlessly producing the exact outcomes you search in SAP Construct. 

Graph 7. SAP Build Workflow that includes a module to connect with the deployment of DataRobot via API.
Graph 7. SAP Construct Workflow that features a module to attach with the deployment of DataRobot by way of API.

Subsequent, we craft a enterprise software for bill anomaly detection inside SAP Construct.  This software retrieves the predictive and generative output by way of API integration and affords a user-friendly interface.  It presents the ends in a sensible and intuitive method, guaranteeing that monetary analysts can effortlessly add invoices in PDF format, simplifying their workflow and enhancing the general consumer expertise.  

Graph 8. SAP Build Workflow for the invoice approval business application.
Graph 8. SAP Construct Workflow for the bill approval enterprise software.
Graph 9 - Final output generated in the business application for financial analysts to approve or reject an invoice based on the anomaly prediction and the corresponding LLM summary.
Graph 9. Last output generated within the enterprise software for monetary analysts to approve or reject an bill based mostly on the anomaly prediction and the corresponding LLM abstract.
5. Manufacturing Monitoring

DataRobot maintains an oversight over the generative AI pipeline by means of the utilization of customized efficiency metrics and predictive fashions.  This rigorous monitoring course of ensures the continual reliability and effectivity of our resolution, providing you a seamlessly reliable expertise.   

Graph 10. DataRobot deployment containing the predictive and generative pipeline properly monitored over time with relevant custom metrics.
Graph 10. DataRobot deployment containing the predictive and generative pipeline correctly monitored over time with related customized metrics.

Conclusion

In abstract, the partnership between SAP and DataRobot continues to permit organizations to rapidly drive worth from their AI investments, and now much more by leveraging generative AI.  Predictive anomaly detection and generative AI can rework the challenges and dangers related to bill processing.  Effectivity and accuracy soar, whereas communication turns into clearer and extra streamlined.  Companies can now modernize their operations, save time and cut back errors.  It’s time to unlock the potential of this transformative expertise and take your operations to the subsequent stage. 

Free trial

Expertise the DataRobot AI Platform

Much less Friction, Extra AI. Get Began As we speak With a Free 30-Day Trial.


Signal Up for Free

In regards to the writer

Belén Sánchez Hidalgo
Belén Sánchez Hidalgo

Senior Information Scientist, Crew Lead and WaiCAMP Lead, DataRobot

Belén works on accelerating AI adoption in enterprises in the US and in Latin America. She has contributed to the design and improvement of AI options within the retail, training, and healthcare industries. She is a pacesetter of WaiCAMP by DataRobot College, an initiative that contributes to the discount of the AI Trade gender hole in Latin America by means of pragmatic training on AI. She was additionally a part of the AI for Good: Powered by DataRobot program, which companions with non-profit organizations to make use of information to create sustainable and lasting impacts.


Meet Belén Sánchez Hidalgo