No NLP Keywords
True Semantic Meaning of Text
Sophisticated Natural Language Understanding (NLU)
All our AI microservices are reinforced by proprietary neural network architectures, to extend and improve advanced Transformers (e.g. mT5).
This empowers sophisticated Natural Language Understanding (NLU), the advanced area of Natural Language Processing (NLP), to extract the true meaning and context of text in a human-like way, in an unparalleled way.
NLU vs. NLP
NLU is a significantly more accurate (and far more complex) true identification of the meaning of text vs. deriving value and insights through ’simple’ NLP keyword extractions.
The NLU functionality will examine the parsed elements of the text and allow an application to interpret what was meant or intended based on its underlying model.
NLU plays a fundamental and vital role in all our sophisticated no-code AI pipeline applications around text categorisation & classification, content moderation and advanced sentiment analysis.
Want +35% Accuracy, With -65% Cost in Your Text Data Analytics?
We have dubbed our proprietary web text data extraction and labeling technology, Blind Vision. It is now, arguably, the best in class globally and superior to the established and trusted Computer Vision and other technologies.
Test studies* have been conducted versus a US based web data integration platform that uses Computer Vision, and the results are conclusive. They showed a +35% increase in accuracy and -65% lower running costs using Blind Vision technology.
We like to remain a little secretive about the ‘sauce’, but we can say that Blind Vision combines sophisticated Raw Code Processing algorithms and our own deep learning network architecture.
Advanced Sentiment Analysis
Sentiment analysis is a very powerful tool with many commercial applications. However, it is very difficult to do well. Many claim to have a sentiment solution, but upon analysis, few in the market really do. Current solutions suffer from poor consistency, limited accuracy and lack of advanced deep learning techniques.
Humans can easily judge the polarity of text, unlike machines. We have developed an apex sentiment tool, using our proprietary neural network architecture, which enables real Natural Language Understanding (NLU) and emulates how humans judge the content and context of text.
Our solution provides consistent sentiment analysis of high or low-frequency content, in long or short format, across 100+ languages. And, we can do all of this with an impressive F1 score of 0.9443!
Named Entity Recognition
Valuable (business) information is buried in a largely text-based (79%) data explosion, most of which also resides in unstructured data on the web. The ability to extract, organize, analyze and connect large amounts of unstructured text data has become of paramount importance.
Extracting, classifying, and connecting entities via Named Entity Recognition (NER) technology plays an important role in sorting unstructured data and identifying valuable information. NER is a key foundational block for any information discovery pipeline and the basis for most Natural Language Processing (NLP) solutions.
We have built the new industry standard: Multilingual NER (in 100+ languages) that is unrivaled in accuracy vs. current ‘open source’ solutions and performs with an F1 score of 0.95.
Web Scraping may sound easy, but it’s not! We have solved the 5 most prevalent issues faced by standard web scraping methods.
One key issue involves constant website layout changes. SEO improvements and UX/UI changes are delivered through HTML layout amendments. As a result, element locators that web scrapers are configured to in order to extract data, change and break the scraping process by extracting incorrect or no data. It takes a lot of manual effort to update these configurations and maintaining thousands of sources becomes near impossible.
We have fully automated the process of source reconfiguration to present you with a truly scalable, leading-edge web scraping solution. One that operates in 100+ languages and can scrape any text data from any web source in real-time.
Automated Text Classification
We have built an industry leading, multilingual, automated text classification capability that demonstrates superior accuracy (underpinned by Natural Language Understanding), uses no language translation layer (which significantly distorts the meaning of content) and is able to proces all length content, via our single, one stop shop, platform.
These accuracy levels now mimic human understanding of any text.
BERT Semantic Similarities
Research into the possibilities of using the multilingual BERT model for determining semantic similarities of news content.
Semantic Similarity of
Arbitrary-length Text Content
Research on the specific features of determining the semantic similarity of arbitrary–length text content using multilingual Transformer based models.