Regression analysis is a collection of statistical techniques for modeling and predicting one or multiple continuous variables based on data. It's a cornerstone of machine learning, and various algorithms and software packages can be used to fit and use regression models.
24
votes
4answers
6k views
Screening (multi)collinearity in a regression model
I hope that this one is not going to be "ask-and-answer" question... here goes:
(multi)collinearity refers to extremely high correlations between predictors in the regression model. How to cure ...
23
votes
4answers
54k views
What is the difference between Multiple R-squared and Adjusted R-squared in a single-variate least squares regression?
Could someone explain to the statistically naive what the difference between Multiple R-squared and Adjusted R-squared is? I am doing a single-variate regression analysis as follows:
v.lm <- ...
19
votes
5answers
55k views
Handling browser pop-up windows with Selenium
We are running Selenium regression tests against our existing code base, and certain screens in our web app use pop-ups for intermediate steps.
Currently we use the commands in the test:
// force ...
15
votes
3answers
438 views
Partial Least Squares Library
There was already a question like this, but it was not answered, so I try to post it again. Does anyone know of an open-source implementation of a partial least squares algorithm in C++ (or C)?
Or ...
13
votes
2answers
969 views
How to change points and add a regression to a cloudplot (using R)?
To make clear what I'm asking I've created an easy example. Step one is to create some data:
gender <- factor(rep(c(1, 2), c(43, 41)), levels = c(1, 2),labels = c("male", "female"))
numberofdrugs ...
13
votes
6answers
7k views
Simple multidimensional curve fitting
I have a bunch of data, generally in the form
a, b, c, ..., y
where y = f(a, b, c...)
Most of them are three and four variables, and have 10k - 10M records. My general assumption is that they are ...
12
votes
4answers
8k views
Java-R integration?
I have a Java app which needs to perform partial least squares regression. It would appear there are no Java implementations of PLSR out there. Weka might have had something like it at some point, but ...
9
votes
9answers
894 views
Best approach to what I think is a machine learning problem
I am wanting some expert guidance here on what the best approach is for me to solve a problem. I have investigated some machine learning, neural networks, and stuff like that. I've investigated ...
8
votes
3answers
303 views
How to put a complicated equation into a R formula?
We have the diameter of trees as the predictor and tree height as the dependent variable. A number of different equations exist for this kind of data and we try to model some of them and compare the ...
8
votes
3answers
2k views
Regression tree in R
I am having trouble making a regression tree in R. I have a data frame with 17 attributes
library(rpart)
rt.model <- rpart(razlika ~ ., learn)
I get an error:
Error in `[.data.frame`(frame, ...
8
votes
2answers
2k views
Calculation of R^2 value for a non-linear regression
I would first like to say, that I understand that calculating an R^2 value for a non-linear regression isn't exactly correct or a valid thing to do.
However, I'm in a transition period of performing ...
7
votes
1answer
6k views
Extract regression coefficient values in R
I have a regression model for some time series data investigating drug utilisation. The purpose is to fit a spline to a time series and work out 95% CI etc. The model goes as follows:
...
7
votes
1answer
502 views
Distinguishing overfitting vs good prediction
These are questions on how to calculate & reduce overfitting in machine learning. I think many new to machine learning will have the same questions, so I tried to be clear with my examples and ...
7
votes
3answers
625 views
Tools for sparse least squares regression
I want to do sparse high dimensional (a few thousand features) least squares regression with a few hundred thousands of examples. I'm happy to use non fancy optimisation - stochastic gradient descent ...
7
votes
3answers
718 views
Visual Comparison of Regression & PCA
I'm trying to perfect a method for comparing regression and PCA, inspired by the blog Cerebral Mastication which has also has been discussed from a different angle on SO. Before I forget, many thanks ...