The most widely used APIs in machine learning, apart from Google, IBM or Amazon

4 min reading

APIs , Developers / 14 April 2016

At BBVAOpen4U we have already seen on more than one occasion the importance of machine learning for the business development of companies and its huge impact on large technology companies like Google, IBM, Amazon or Microsoft. But they are not the only market players in the creation of predictive models or natural language processing. Some companies successfully struggle to find a place in this highly competitive field with a bright future.

PredictionIO, AT&T Speech, Wit.ai and Diffbot are four practical examples that prove that it is possible to emerge and grow within machine learning and natural language processing only, although this later leads to accepting offers for integrating with big companies. The success of these projects is explained by APIs (application programming interfaces). Without them, doing business would be impossible.

PredictionIO

PredictionIO is an open source machine learning server that enables development and data science teams to build fully scalable prediction engines, a major consideration when working with data in real time. These are some of the most interesting features of PredictionIO:

● Simplified data infrastructure management.

● Support for such well-known machine learning and data processing libraries as Spark MLlib (the tool offered by Apache Spark, the open source distributed computing platform, that contains algorihtms for logistic regression, support vector machines (SVM), Bayesian regression tree models, least square techniques, analysis of average K conglomerates…) or OpenNLP (machine learning library based on natural language processing).

● Incorporate proprietary predictive models into the PredictionIO engine.

● Response to dynamic queries in real time.

● Unification of data from different platforms, both in batches and in real time sources, to make predictive analysis fully comprehensive.

PredictionIO has several SDKs for languages such as Java, Ruby, Python or PHP. The tool is basically based on three components:

● PredictionIO platform: An open source development stack that enables clients to build, evaluate and implement engines using machine learning algorithms in an easy and scalable way.

● Event server: This PredictionIO tool enables applications to send events to the server through an API.

● Template gallery: There is no need to download templates for the different engines based on each machine learning application.

AT&T Speech

AT&T Speech APIs enable developers to include voice recognition functionality in both web applications and mobile apps. It has three application programming interfaces that transform voice into text and text into voice, in a general or customized way.

● The voice-to-text API: It only accepts single-channel audio formats and it uses a grammar dictionary to complete transcriptions in both English and Spanish and a contextualization system to optimize accuracy. The API transcribes voice in batches in four minutes. The different batches would later need to be joined to obtained the full transcription.

● The voice-to-customized text API: In this case, the interface creates transcriptions from the terms (grammar and suggestions) in a database generated by the developers themselves. More accuracy is sought.

● The text-to-voice API: It accepts plain text or text in XML format with a maximum limit of 500 bytes (equivalent to a text containing around 100 words) and supports both male and female voices in two languages, English and Spanish.

The different SDKs can be downloaded here.

Wit.ai

Wit.ai is a natural language processing platform for developers, specifically, a community with more than 20,000 professionals. Why do they use Wit.ai? To include new functionality in web and mobile applications in fields such as robotics, messaging services or wearables. The Wit.ai API has the ability to learn human language on its own with each interaction.

Some of its key features are:

● Wit.ai is completely free, even for commercial use. The platform’s applications are open because, according to its creators, “only private applications are used when there are privacy restrictions”. “An open application is capable of using the data provided by the community to be even smarter”.

● The users or developers using Wit.ai own their data, but they must be aware that it will be used to enhance the platform.

● It now supports many languages: English, Spanish, French, German, Italian, Dutch, Polish, Swedish, Portuguese or Russian.

● Wit.ai has tutorials for mobile operating systems such as iOS, Android and Windows Phone, for all web browsers and for programming languages such as Python, Ruby, C or Rust.

To test the platform’s benefits, Wit.ai is a web app that can be used to try its functionality through microphone access.

Diffbot

Diffbot is a platform that uses artificial intelligence (a combination of machine learning and natural language recognition) to automatically extract data from websites, such as text, pictures, videos or comments. It is therefore a tool that can be used for scraping anything retrievable from a website. This is possible thanks to the repertoire of APIs provided by the platform. However, Diffbot is not an open source tool.

Some of key features of Diffbot are:

● The Diffbot APIs are run in JavaScript.

● It works on websites in English and in other languages.

● Automatic tagging of scraped information.

● Extraction of data in JSON or CSV formats. Bulk API enables developers to scrape hundreds of websites simultaneously.

● Libraries in PHP, Python, JavaScript, Objective-C or Perl.

More information on APIs here.

Follow us on @BBVAAPIMarket

It may interest you

Main methods of payment to suppliers and the advantages of Reverse Factoring

The cash flow and payment management departments must consider the way in which payments to suppliers are made, because each type of payment has its own advantages or is better suited to the company’s strategy, the relationship with the supplier, or the respective needs of both. Far from being a trivial matter, choosing the payment […]

APIs , ERP , Treasury / 17 June 2024
What is leasing and how does it work?

Businesses, from self-employed to SMEs and large companies, need financing solutions that suit their needs. Leasing is a method that can optimize the use of resources and which combines business liquidity (or lack of) with the use of assets. What is leasing and how does it work? Leasing is a financing method by which a […]

APIs , Banking as a service , Funding / 30 January 2024
What is an API, types of APIs and how they work

An API is a very useful mechanism that connects two pieces of software equipment to exchange messages or data in a standard format such as XML or JSON. Thus, it becomes an instrument that can be used to search for revenue, open the doors to talent or innovate and automate processes.

APIs , Banking as a service , Desarrollo de negocio , Digital transformation / 18 December 2023

Name	Owner	Duration	Description
gobp.lang	BBVA	1 month	Language preference
aceptarCookies	BBVA	1 year	Configuration Accepted Cookies
_abck	BBVA	1 year	Helps protect against malicious website attacks
bm_sz	BBVA	4 hours	Helps protect against malicious website attacks
ADRUM_BTs	Salesforce Marketing Cloud	Session	Required for monitoring of the service, inherent to SFMC
ADRUM_BT1	Salesforce Marketing Cloud	Session	Required for monitoring of the service, inherent to SFMC
ADRUM_BTa	Salesforce Marketing Cloud	Session	Required for monitoring of the service, inherent to SFMC
ADRUM_BT	Salesforce Marketing Cloud	Session	Required for monitoring of the service, inherent to SFMC
xt_0d95e	Salesforce Marketing Cloud	Session	Remember user preferences (if any)
__s9744cdb192d044faa1bf201d29fafd1e	Salesforce Marketing Cloud	Session	Remember user preferences (if any)
wpml_browser_redirect_test	WPML	Session	Text translation in the portal
wp-wpml_current_language	WPML	24 hours	Text translation in the portal

Name	Owner	Duration	Description
AMCV_***	Adobe Analytics	Session	Unique Visitor IDs used in Cloud Marketing solutions
AMCVS_***	Adobe Analytics	2 years	Unique Visitor IDs used in Cloud Marketing solutions
demdex (safari)	Adobe Analytics	180 days	Create and store unique and persistent identifiers
sessionID	Adobe Analytics	Session	Launch's internal cookie used to identify the user
gpv_URL	Adobe Analytics	Session	Adobe Analytics plugin: getPreviousValue Capture the value of a certain variable in the following page view, in this case the prop1
gpv_level1	Adobe Analytics	Session	Cookie used to store the DataLayer levl1 of the previous page.
gpv_pageIntent	Adobe Analytics	Session	Cookie used to store the pageIntent of the previous page.
gpv_pageName	Adobe Analytics	Session	Cookie used to store the pagename of the previous page.
aocs	Adobe Analytics	Session	Cookie that stores the first values collected at the beginning of a process.
TTC	Adobe Analytics	Session	Cookie used to store the time between the App Page Visit event and the App Completed event.
TTCL	Adobe Analytics	Session	Cookie used to store the time between the LogIn event and App Completed.
s_cc	Adobe Analytics	Session	Determine if cookies are active
s_hc	Adobe Analytics	Session	Cookie used by Adobe for analytical purposes
s_ht	Adobe Analytics	Session	Cookie used by Adobe for analytical purposes
s_nr	Adobe Analytics	2 years	Determine the number of user visits
s_ppv	Adobe Analytics	Permanent	Adobe Analytics plugin: getPercentPageViewed Determine what percentage of the page a user views
s_sq	Adobe Analytics	Session	ClickMap/ActivityMap features
s_tp	Adobe Analytics	Session	Cookie used by Adobe for analytical purposes
s_visit	Adobe Analytics	2 years	Cookie used by Adobe to know when a session has been started.

Name	Owner	Duration	Description
OT2	VersaTag	90 days	VersaTag Cookie used to store a user id and the number of user visits.
u2	VersaTag	90 days	VersaTag Cookie where the user ID is stored
TargetingInfo 2	MediaMind	1 year	Cookie that serves to assign a unique random number that generates MediaMind.

Name	Owner	Duration	Description
mbox	Adobe Target	9 days	Cookie used by Adobe Target to test user experience customization.

The most widely used APIs in machine learning, apart from Google, IBM or Amazon

It may interest you

Main methods of payment to suppliers and the advantages of Reverse Factoring

What is leasing and how does it work?

What is an API, types of APIs and how they work