With the onset of digital world, everyday a number of new websites is being created trying to sell wide range of products and services. And sustaining in this highly competitive market is itself an art. Companies are hiring tech geeks to customize their website and services according to the response rate of customers....
Data Science – Let the Data Sing
Oct 21, 2016 DATA SCIENCE, DATA VISUALIZATION
The hype is real. But let’s get past it. What exactly is Data Science? And why is it the next big thing. Massive amounts of data are being generated every sec. The total amount of data in the world is 4.4 zetabytes. And this is not just the internet data. We are talking...
Feature selection using Decision Tree
Oct 21, 2016 DATA SCIENCE, DATA SCIENCE
One of the key differentiators in any data science problem is the quality of feature selection and importance. When we have a lot of data available to be used by our model, the task of feature selection becomes inevitable due to computational constraints and the elimination of noisy variables for better prediction. Also,...
User interaction with a website can be used to predict buy event. A user’s behavior can be used to predict user intentionality towards different products. This can be used to show ads for retargeting. In e-commerce a lot of data of user is available like searches, page views, time spent per item, basket...
A machine learning model consists of various parameters that need to be learned from the data. The crux of Machine learning is fitting a model to the data. This process of training a model with existing data to fit the model parameters, is called model training. Hyperparameters refer to another kind of parameters...
Overfitting happens mostly because the model becomes too complex. Such a model will give poor accuracies, as it memorizes the noise in the training data. A model is usually fit by achieving the highest accuracy on the training data set. However, its efficiency is judged by its its performance on test data. Overfitting occurs...