Artificial intelligence (AI) is a branch of computer science that focuses on creating systems capable of performing tasks that normally require human intelligence. These tasks include learning, reasoning, perception, natural language understanding and decision-making. AI is divided into two main categories: weak AI, designed for specific tasks such as virtual assistants and search engines, and strong AI, which has the potential to perform any cognitive task that a human being can do.
The development of AI has been driven by advances in algorithms, increased processing power and the availability of large amounts of data. Technologies such as machine learning and deep learning allow AI systems to improve their performance over time, adapting to new data and situations. AI has applications in diverse fields, from medicine and education to industry and entertainment, transforming how we live and work.
Oracle offers a comprehensive AI portfolio integrated into its cloud applications on best-in-class AI infrastructure and with next-generation AI-generating innovations. These enhancements cover both the technical and application areas, introducing innovation that automates, optimizes, and transforms business processes. Now, let’s dive into these exciting application enhancements.
In addition, Oracle has developed a Generative AI equipped with advanced language understanding to create the next generation of enterprise applications. Oracle Cloud Infrastructure (OCI) Generative AI is a fully managed service available through APIs to seamlessly integrate these versatile language models into a wide range of use cases, including script support, summarization, and chat. Now it’s only working well in English, we tested the Spanish version but Oracle has to keep training.
For this article, we are going to focus in the Oracle AI services and later on we will do a monograph on generative AI exclusively & Oracle Digital Assistant. A good development on the logistics of the sales order side, introduced in the latest JD Edwards release, is the calculation of order preparation date and delivery date preference based on the workday calendar.
OCI language
OCI Language analyses unstructured text for you. It provides models trained on industry data to perform language analysis with no data science experience needed. It has five main capabilities.
First, it detects the language of the text. It recognizes 75 languages, from Afrikaans to Welsh. It identifies entities, things like names, places, dates, emails, currency, organizations, phone numbers- 14 types in all. It identifies the sentiment of the text, and not just one sentiment for the entire block of text, but the different sentiments for different aspects.
So, let's say you read a restaurant review that said the food was great, but the service sucked. You'll get food with a positive sentiment and service with a negative sentiment. And it also analyses the sentiment of every sentence. It identifies key phrases in the text that represent important ideas or subjects. It classifies the general topic of the text from a list of 600 categories and subcategories.This will allow you to predict, yield and trace products and processes for better decision-making.
OCI Speech
OCI Speech is very straightforward. It unlocks the data in audio tracks by converting speech to text. Developers can use Oracle's time-tested acoustic language models to provide highly accurate transcription for audio or video files across multiple languages. OCI Speech automatically transcribes audio and video files into text using advanced deep learning techniques. There's no data science experience required. It processes data directly in object storage and it generates timestamped, grammatically accurate transcriptions.
OCI Speech supports multiple languages, specifically English, Spanish, and Portuguese, with more coming in the future. It has batching support where multiple files can be submitted with a single call. It has blazing fast processing. It can transcribe hours of audio in less than 10 minutes. It does this by chunking up your audio into smaller segments and transcribing each segment, and then joining them all back together into a single file.
OCI Speech makes transcribed text more readable to resemble how humans write. This is called normalization. The service normalizes addresses, times, numbers, URLs, and other data. For example: Blend management users will be pleased to know that audit reports will become easier to access, available just in time upon the auditor’s demand. These reports allow you to track and trace your product throughout the entire process.
OCI Vision
Vision is a computer vision service that works on images. It provides two main capabilities, image analysis and Document AI. Image analysis analyses photographic images. Object detection is the feature that detects objects inside an image using a bounding box and assigning a label to each object with an accuracy percentage. Object detection also locates and extracts text that appears in the scene like on a sign.
Image classification will assign classification labels to the image by identifying the major features in the scene. One of the most powerful capabilities of image analysis is that, in addition to pre-trained models, users can retrain the models with their own unique data to fit their specific needs. For Argentina, there is now the option to include or exclude ICMS in PIS/COFINS.
Now, the second major capability of Vision is called Document IA or Document Understanding.
OCI Document Understanding
It's used for working with document images. You can use it to understand PDFs or document image types like JPEG, PNG, and TIFF, or photographs containing textual information. The features of Document AI are text recognition, also known as OCR or Optical Character Recognition. And this extracts text from images, including non-trivial scenarios like handwritten texts, plus tilted, shaded, or rotated documents.
Document classification classifies documents into 10 different types based on visual appearance, high-level features, and extracted keywords. This is useful when you need to process a document based on its classification, like an invoice, a receipt, or a resume. Language detection analyses the visual features of text to determine the language, rather than relying on the text itself. Table extraction identifies tables in docs and extracts their content in tabular form. Key value extraction finds values for 13 common fields and line items in receipts, things like merchant name and transaction date. Let’s see an example how the OCI console works:
The default image here is a receipt. And it detects it is English. And it extracts a lot of raw text from this receipt. You can see it. Everything that's highlighted here, which is the entire contents of the receipt, has been extracted. And we see what it has extracted here, both in line format and in individual word format.
Now let's look and see if it has any key values. And yes, it did. Remember that for receipts, it has several specific keys to look for. And then if it finds a matching value, it will assign that value to the key. So, we get the merchant’s name, Example Cafe, the merchant address, merchant phone number, transaction time, transaction date, a lot of information on this receipt that's useful in processing it as an expense.
And then we also see the line-item data down here. And we can also see tabular data. It's picked out both two items here, an Americano and a water.
So, this gives us an idea of how OCI Vision & Document works.
OCI Anomaly Detection
Oracle Cloud Infrastructure Anomaly Detection identifies anomalies in time series data. Equipment sensors generate time series data, but all kinds of business metrics are also time-based. The unique feature of this service is that it finds anomalies not just in a single signal, but across many signals at once. That's important because machines often generate multiple signals at once, and the signals are often related.
Think of a pump that has an output pressure, a flow rate, an RPM, and an electrical current draw. When a pump is going to fail, anomalies may appear across several of those signals, but at different times. OCI Anomaly Detection helps you to identify anomalies in a multivariate data set by taking advantage of the interrelationship among signals.
The service contains algorithms for both multi-signal, in multivariate. The single signal is in univariate anomaly detection, and it automatically determines which algorithm to use based on the training data provided. The multivariate algorithm is called MSET-2, which stands for Multivariate State Estimation Technique, and it's unique to Oracle.
The 2 in the name refers to the patented enhancements by Oracle labs that automatically identify and fix data quality issues resulting in fewer false alarms and more accurate results.
OCI REST API
All these AI services we have seen can be used through the OCI stack and can also be accessed through the REST API.
It can write code against the REST API or use any of the various language SDKs shown here. But for data scientists working in OCI Data Science, it makes sense to use Python.
We will not in this article go down to the technical level. But we will talk more in detail about Oracle's Generative AI and its digital assistant and I expect it will be sooner rather than later. In the meantime, don't hesitate to contact Quistor for more information. Stay tuned!
Before you go
Feel free to ask us any question, ask for more information or simply say hello in this contact form.