A Data Science Central Community
Curse of Dimensionality:
One of the most commonly faced problems while dealing with data analytics problem such as recommendation engines, text analytics is high-dimensional and sparse data. At many times, we face a situation where we have a large set of features and fewer data points, or we have data with very high feature vectors. In such scenarios, fitting a model to the dataset, results in lower predictive power of the model. This scenario is often termed as…Continue
Added by suresh kumar gorakala on February 28, 2016 at 9:30pm — No Comments
Gets Tweets from Twitter:
Added by suresh kumar gorakala on January 11, 2016 at 6:00am — No Comments
As a part of Twitter Data Analysis, So far I have completed Movie review using R& Document Classification using R. Today we will be dealing with discovering topics in Tweets, i.e. to mine the tweets data to discover underlying topics– approach known as Topic Modeling.
Added by suresh kumar gorakala on December 23, 2015 at 8:30pm — No Comments
Originally posted here.
Added by suresh kumar gorakala on October 13, 2015 at 6:30am — No Comments
In my previous blog I have explained about linear regression. In today’s post I will explain about logistic regression.
Consider a scenario where we need to predict a medical condition of a patient (HBP) ,HAVE HIGH BP or NO HIGH BP, based on some observed symptoms – Age, weight, Issmoking, Systolic value, Diastolic value, RACE, etc.. In this…
Added by suresh kumar gorakala on October 9, 2015 at 9:13am — No Comments