Research says that AI will automate most of the functions of a Data Scientist by 2020. The explosion of Big Data led to the hottest job of the past five years – Data Scientist. Now experts are speculating 2017 to be the year of Artificial Intelligence. What does it say about the future of jobs?...
Read More >>Overfitting happens mostly because the model becomes too complex. Such a model will give poor accuracies, as it memorizes the noise in the training data. A model is usually fit by achieving the highest accuracy on the training data set. However, its efficiency is judged by its its performance on tes...
Read More >>A machine learning model consists of various parameters that need to be learned from the data. The crux of Machine learning is fitting a model to the data. This process of training a model with existing data to fit the model parameters, is called model training. Hyperparameters refer to anoth...
Read More >>One of the key differentiators in any data science problem is the quality of feature selection and importance. When we have a lot of data available to be used by our model, the task of feature selection becomes inevitable due to computational constraints and the elimination of noisy variables...
Read More >>The hype is real. But let’s get past it. What exactly is Data Science? And why is it the next big thing. Massive amounts of data are being generated every sec. The total amount of data in the world is 4.4 zetabytes. And this is not just the internet data. We are talking about finance, the...
Read More >>With the onset of digital world, everyday a number of new websites is being created trying to sell wide range of products and services. And sustaining in this highly competitive market is itself an art. Companies are hiring tech geeks to customize their website and services according to the r...
Read More >>Artificial Intelligence is tipped to be the next big thing. A world where computers think and respond like human beings. Inducing intelligence to a machine – artificially. But where exactly are we right now? What were the milestones so far? If we compare Artificial Intelligence to the grow...
Read More >>Data is growing faster than ever before and by the year 2020, about 1.7 megabytes of new information will be created every second for every human being on the planet. With the advent of Twitter and YouTube, this prediction has only been cemented further. 300 hours of video are uploaded eve...
Read More >>It all begins with an idea. When that idea starts shaping up into a product or service, there comes the need to #HireTheRightPeople for the development of the product/service. Founders make sure that they recruit the best people but it can be called a hit & trial method at best. As wor...
Read More >>Startup as a whole is based on the idea of right people working for an amazing idea. You have the amazing idea. You have the funding to execute it. But where do you get the right people from? You hire an HR and you trust that person to bring the right people on-board for you. But does that g...
Read More >>If you could describe your company culture in five words, what would those be? Or rather can you manage to describe it in five words? Often, when we think about company culture, we think about fun team activities, fun after work, fun during work, freedom at workspace, freeness at workspace. But is t...
Read More >>Databricks ran Apache Spark Survey 2016 this summer to identify how organizations are using Apache Spark. The survey results suggest that Spark’s growth continues across various industries, building sophisticated data solutions by people in various functional roles. Databricks 2016 survey...
Read More >>GraphX is Apache Spark’s API for graphs and graph-parallel computation. This includes transformation, exploration, and graph computation. Data can be viewed both as graph & collections. This use case discusses friend follower analysis using Apache Spark GraphX’s PageRank operator. PageR...
Read More >>Data science is the study of where information comes from, what it represents and how it can be turned into a valuable resource in the creation of business and IT strategies. Mining large amounts of structured and unstructured data to identify patterns can help an organization rein in costs,...
Read More >>Many a times, the term internship is ill-defined to both the company and the candidate. Company perceives an intern as a trainee who works without or on less pay, in order to gain some extremely confined work experience. On the other hand, candidate takes an internship to be an opportunity w...
Read More >>In programming language design, a first-class citizen (also type, object, entity, or value) in a given programming language is an entity which supports all the operations generally available to other entities. These operations typically include being passed as an argument, returned from a fu...
Read More >>In functional programming we use functions to produce the required output instead of objects. The primary difference between the two is that the state of objects changes continuously whereas functions have no state and make no changes to variables which Is not visible. The itertools functions...
Read More >>You might wonder why do we need cross validation in the first place itself. Let’s explain that first. Normally, the generalization performance of a machine learning algorithm depends on its prediction capability on an independent test data. This assessment is of utmost importance to us. Cro...
Read More >>Ever have a question that, “Why is Linear Regression giving me such good accuracy on the training set but a low accuracy on the test set in spite of adding all the available dependent features to the model?” The question above seems inexplicable to many people but is answered by a...
Read More >>Big dead fishes found in the ocean. New fishes to take over. In the next decade, over half the Fortune 500 will no longer exist. Statistics from U.S. Bureau of Economic Analysis Most of the largest companies in the world will be replaced in the next decade. What can save them are reinventions an...
Read More >>User interaction with a website can be used to predict buy event. A user’s behavior can be used to predict user intentionality towards different products. This can be used to show ads for retargeting. In e-commerce a lot of data of user is available like searches, page views, time spent pe...
Read More >>