8 minutes reading
Everybody has been talking about Machine Learning, and everybody wants to get the benefits of Artificial Intelligence. It’s a new thing for IT managers to grab that old problem from inside the old locker and think: “Hey! Maybe new Watson can solve it for me!”. But every time I hear someone asking about how to solve a problem with AI, the problem looks like something new, never seen before. Every day a new solution is researched to solve a new problem. Therefore, how can we identify what are the borders for AI? Since AI stands for “Artificial Intelligence” what is the “intelligence” border? What can and cannot be solved with what we have today?
How to identify how hard it will be to find an AI to your scenario
Machine Learning projects can be split into three groups:
Cool! What are the ready-to-use APIs?
My suggestions are inside this table below. But the options are not limited to it. You can find many others. They are maintained by the biggest cloud and AI players. It means you can trust it, and probably for what they focus, they are the best you will find.
Feature | IBM | Microsoft | Amazon | |
Chatbot related | DialogFlow | Watson Assistant and Virtual Agent | Bot | Lex |
Video Analysis | Video Intelligence | Intelligent Video Analytics | Video Indexer | Rekognition |
Image Analysis | Vision | Visual Recognition | Computer Vision API | Rekognition |
Speech to Text | Speech | Speech to Text | Bing Speech | Transcribe |
Text to Speech | Text to Speech | Bing Speech | Text to Speech | |
Natural Language Classifier | Natural Language | Natural Language Classifier, Natural Language Understanding, Personality Insights, and Tone Analyzer | Language Understanding | Comprehend |
Translation | Translate | Translator | Translator | Translate |
Trends search and analysis | Trends | Discovery and IBM’s Discovery News | ||
Find patterns over unstructured text | Knowledge Studio | |||
Content moderator* | Anomaly Detection | |||
Jobs discovery | Job discovery** |
* Google, IBM, and Amazon have content moderator built-in their products. Microsoft has this specific product looking for anomalies only.
** Job Discovery is a private tool available for only a few partners.
Two examples to talk about the borders again
Image recognition
Just like the example above: a customer came to me asking about a solution to identify people on images. Great! Let’s use Google’s Vision! Vision identifies people on photos and gives a lot more information about the colors on that image, about places the image may contain, and etc. But then the customer asked me: I want to recognize if it is a woman. I said ok! And then: I want to recognize the woman’s hair color. Ok, all open APIs are off the game. Let’s find a model, train it and then get hair colors. For you to be able to answer those questions there is no shortcut. You will have to read the documentation of each open API you find and run tests on it.
Language defect recognition
Another customer came to me asking if they could give a microphone to their employees in order to operate a system just by voice commands. Ok, it’s not new. We could use a mix of speech-to-text and natural language processing APIs, let’s move ahead! But then the customer said the system should recognize internal terms like acronyms and words they invented to communicate with each other. Erm… it’s not possible. You can’t train ready-to-use APIs to understand your very own specific terms. The easiest way to suggest the operators was to change the words for some others so the system would recognize it. Otherwise, they would have to grab models, configure and train them to understand the new words.
Then, why don’t you give your first AI step over ready-to-use APIs evaluation? The sooner you start, the sooner you will understand how to approach that old problem.