Machine Learning Algorithms – DataScientist may be the sexiest job of today but the understanding, implementation, applied ML experience is missing. Having the top algorithms on your fingertips in real business is missing big time. The real job for any data scientist is the ability to clarify, demonstrate, extract real values out of data and reap rewards. “Machine Learning” as a basic skill sounds like teleportation tool to many businesses especially for the companies which are actually data factories i.e social media platforms. Describing and picturising the top few machine learning algorithms is the main idea of this post.
Data Scientists Basic Tool Kit
Data scientists with his/her basic skill should able to communicate insights of data, impact on business and able to work with non-analytics resources. Answering questions from business people at ground level is more important rather answering from PhD scholar books material perspective. In this post, the main idea is to introduce you to the main algorithms that should be in the data scientist’s toolbox. Some of the popular machine learning algorithms.
The data scientist should be bilingual in terms of day to day business in order to provide an overview of
- The data organisation collect & how to conceptualise the ideas behind data that can turn data into actionable knowledge.
- Answer the questions business have
- Facilitate the tools that data analysts, data scientists and business team should work with.
Let the business know what’s in the data and learn about the goals and objectives of the business to hunt down related information in the collected data. Data scientists tool kit should be able to picturise the great primer for what Data Science is about. Point to note here is that AI is much more than ML. I particularly think that getting to know the types of MLAlgos actually helps to see a somewhat clear picture of AI. The answer to the question “What machine learning algorithm should I use?” is always “It depends.” It depends on the size, quality, and nature of the data. It depends on what you want to do with the answer.
To us, at AILabPage we say machine learning is crystal clear and ice cream eating task. It is not only for PhDs aspirants but it’s for you, us and everyone.
Machine Learning and AI
Artificial intelligence and machine learning are used interchangeably often but they are not the same. Machine learning is one of the most active areas and a way to achieve AI. Why ML is so good today; for this, there are a couple of reasons like below but not limited to though.
- The explosion of big data
- Hunger for new business and revenue streams in this business shrinking times
- Advancements in machine learning algorithms
- Development of extremely powerful machine with high capacity & faster computing ability
- Storage capacity
Today’s machines are learning and performing tasks; that was only be done by humans in the past like making a better judgement, decisions, playing games etc. This is possible because machines can now analyse and read through patterns and remember learnings for future use. Today the major problem is to find resources that are skilled enough to demonstrate & differentiate their learning from university & PhD books in real business rather than just arguing on social media with others.
Machine learning should be treated as a culture in an organisation where business teams, managers and executives should have some basic knowledge of this technology. In order to achieve this as a culture, there have to be continuous programs and road shows for them. There are many courses which are designed for students, employees with little or no experience, managers, professionals and executive to give them a better understanding of how to harness this magnificent technology in their business.
Top Machine Learning Algorithms
Data science is neither magic nor rocket science, it does not create or invent any new information or facts. Data science helps us to make some sense of what’s already in front of us hidden in our data. Machine learning and its algorithms are either supervised or unsupervised as of today but the future really lies in reinforcement learning.
In deep learning, the concept of using many layers of nonlinear processing units for feature extraction and transformation extend machine learning boundaries. The artificial neural network structure is what enables artificial intelligence, machine learning and supercomputing to flourish. Neural networks are powering language translation, face recognition, picture captioning, text summarization and lot more.
In short, the success of an algorithm depends upon how the math of the algorithm was translated into instructions for the computer you are using. And it depends on how much time you have.
- Linear Regression – Simple Linear Regression- there is only an independent variable. Multiple Linear Regression- refers to defining a relationship between independent and dependent variables
- Logistic Regression – A super simple form of regression analysis in which the outcome variable is binary or dichotomous. Helps to estimate adjusted prevalence rates, adjusted for potential confounders (socio-demographic or clinical characteristics)
Classification and regression trees are an important type of algorithm for predictive modelling machine learning. A greedy algorithm based on divide and conquer rule. Split the records based on an attribute test that optimizes certain criterion. The real value is in determining how to split the records.
Naive Bayes classifiers are a family of simple probabilistic classifiers based on applying Bayes’ theorem with strong (naive) independence assumptions between the features.
The laziest algorithm which is also a very simple algorithm that stores all available cases and predicts the numerical target based on a similarity measure. In the beginning of 1970s as a non-parametric technique, KNN has been used in statistical estimation and pattern recognition already.
Recursive Neural Networks
Call it as a deep tree-like structure. When the need is to parse a whole sentence we use a recursive neural network. Tree-like topology allows branching connections and hierarchical structure. Arguments here can be how recursive neural networks are different than recurrent neural networks?
Questions – How recursive neural networks are different than recurrent neural networks?
Answer – Recurrent neural networks are in fact recursive neural networks with a particular structure: that of a linear chain.
RNNs are hierarchical kind of network with no time aspect to the input sequence but the input has to be processed hierarchically in a tree fashion.
Recurrent Neural Networks – Call it as a deep tree-like structure. These neural networks are used to understand the context in speech, text or music. The RNN allows information to loop through the network. Tree-like topology allows branching connections and hierarchical structure. In RNNs data flow is in multiple directions. These networks are employed for highly complex tasks i.e voice recognition, handwriting and language recognition etc.
RNNs abilities are quite a limitless. Don’t get lost between Recursive and Recurrent NNs. ANN’s structure is what enables artificial intelligence, machine learning and supercomputing to flourish. Neural networks are used for language translation, face recognition, picture captioning, text summarization and lot more tasks.
Convolutional Neural Networks (CNNs) is an excellent tool and one of the most advanced achievements in deep learning. CNNs got too much attention and focus from all major business players because of the hype of AI. The two core concepts of convolutional neural networks are convolution (hence the name) and pooling. It does this job at the backend with many layers transferring information in a sequence from one to another.
The human brain detects any image in fractions of seconds without much of efforts but computer vision the image is really just an array of numbers. In that array, each cell value represents the brightness of the pixel from black to white for a black and white image. Why do we need CNN’s and not just use feed-forward neural networks? How capsule networks can be used to overcome the shortcoming of CNN’s? etc. I guess if you read this post on “Convolutional Neural Networks“; you will find out the answer.
A very young family member of Deep Neural Networks Architecture. Introduce by Ian Goodfellow and his team at the University of Montreal in 2014. GANs are class of unsupervised machine learning algorithm. So as the name suggests it is called as Adversarial Networks because this is made up of two neural networks. Both neural networks are assigned different job role i.e. contesting with each other.
Neural Network one is called as the Generator because it generates new data instances.
Other neural net is called the Discriminator, evaluates work for the first neural net for authenticity.
The cycle continues to obtain accuracy or near perfection results. Still confused, it’s ok to read this post on “Generative Adversarial Networks“; you will find more details and understanding.
Books Referred & Other material referred
- Open Internet reading and research work
- AILabPage (group of self-taught engineers) members hands-on lab work.
Points to Note:
When to use artificial neural networks as oppose to traditional machine learning algorithms is a complex one to answer. It entirely depends upon on the problem in hand to solve. One needs to be patient and experienced enough to have the correct answer. All credits if any remains on the original contributor only. In the next upcoming post will talk about Recursive Neural Networks in detail.
Feedback & Further Question
Do you have any questions about Deep Learning or Machine Learning? Leave a comment or ask your question via an email. Will try my best to answer it.
Conclusion – For any effective machine learning model requirement is only one which is reliable data pipelines. I particularly think that getting to know the types of machine learning algorithm actually helps to see a somewhat clear picture. The answer to the question “What machine learning algorithm should I use?” is always “It depends.” It depends on the size, quality, and nature of the data. Also, what is the objective/motive data torturing? As more we torture data more useful information comes out. It depends on how the math of the algorithm was translated into instructions for the computer you are using. And it depends on how much time you have. Only recently machine learning got spotlight and attention from the industry. Machine learning use cases like face recognition, image captioning, voice & text processing and self-driving cars now everyone talks about.
============================ About the Author =======================
Read about Author at : About Me
Thank you all, for spending your time reading this post. Please share your opinion / comments / critics / agreements or disagreement. Remark for more details about posts, subjects and relevance please read the disclaimer.