# regression's questions - Chinese 1answer

16.598 regression questions.

### 2 Is setting lambda equal to zero the same thing as not applying regularization at all?

If I set the regularization parameter to 0, does it essentially mean I'm not applying regularization (I've boxed the regularization bits in red)? Also, what is this type of regularization called?

### 1 Is there any relationship between step-wise multiple Regression analysis results and ANOVA results?

1 answers, 11 views regression anova
I performed two separate regression analyses by dividing my total sample into 'Male' and 'Female' samples, that resulted into different % of variances of variables (e.g for male the percent variance ...

### R: logistic regression residual deviance higher and null deviance but predictors all significant interpretation

1 answers, 14 views regression logistic binary-data
I am running a logistic regression, but for a group of predictors I tried, all of then are highly significant but the residual deviance is much higher than the null deviance. ...

### 2 standardising independent and dependent variables within blocks?

I have data from a randomised blocked experiment testing the effect of different crops on weeds. Each treatment was replicated in one plot in each of three blocks, on two farms, in two years (but NOT ...

### 4 CNN architectures for regression?

I've been working on a regression problem where the input is an image, and the label is a continuous value between 80 and 350. The images are of some chemicals after a reaction takes place. The color ...

### Regression and ARIMA transfer functions in R

0 answers, 9 views r regression time-series
I'm currently following the "Forecasting Product Demand in R" (https://www.datacamp.com/courses/visualizing-time-series-data-in-r) course on Datacamp, and I've been stuck trying to understand transfer ...

### 1 Why doesn't deep learning work as well in regression as in classification?

1 answers, 630 views regression deep-learning
there is a lot of research where deep learning works so well with classification but not in regression field SVR, tree-based approach is still good and I couldn't find good architecture about ...

### Dealing with new Factor Levels in a Regression in R

1 answers, 619 views r regression machine-learning
I originally posted this in stackoverflow (as given here) but was told to try here since it might be more relevant here. I am very new to statistics and R in general so my question might be a bit dumb,...

### 12 Comparison between Newey-West (1987) and Hansen-Hodrick (1980)

Question: What are the main differences and similarities between using Newey-West (1987) and Hansen-Hodrick (1980) standard errors? In which situations should one of these be preferred over the other? ...

### How do I formally test outliers from a linear regression?

Hello (and thanks for reading this). I have a set of data that looks at the area of remaining retina against age. I have three points of data per patient. We know that over time, the area of ...

### 2 How is canonical correlation analysis related to multivariate regression? [duplicate]

Given a $m\times p$ matrix $Y$ on the left, and a $m\times q$ matrix $X$ on the right, CCA tries to find 2 sets of mapping coefficients such that $Y\beta_{l}$ and $X\beta_{r}$ have the highest ...

### How do I formally test outliers from a linear regression?

Hello (and thanks for reading this). I have a set of data that looks at the area of remaining retina against age. I have three points of data per patient. We know that over time, the area of ...

### 1 How can we get the weights of ridge regression if there is bias term?

2 answers, 20 views regression mathematical-statistics
For ridge regression I learned before, $\hat{w} = argmin_{\theta}||y-Xw||_2^2 + \lambda||w||_2^2$. Thinking about if the bias is added, so the new $X$ become $[1,X]$, and we have a new weight $\theta$...

### Is this method for computing regression is correct or wrong?

0 answers, 42 views regression ardl
i am using a model in which i have 3 dependent variables. i) Co2 emission ii) Nox emission iii) deforestation. and 6 independent variables i)Governance ii)Foreign Direct Investment iii)GDP Per ...

### 3 Model to predict Residuals of another model

I am using a random forest for a 2 class classification problem. But eventually using probability of class "1" returned by the model for my task and not the label. I get AUC of about 70% Then I ...

### 2 Keep eliminating data points until good correlation coefficient is obtained-using Python

1 answers, 29 views regression correlation python outliers
I have been trying to find out a way in order to eliminate outliers from a dataset. The outliers are removed the following way: Any value which results into a 10% reduction in R2 value needs to be ...

### 30 Interpretation of R's output for binomial regression

1 answers, 64.303 views r regression logistic binomial interpretation
I'm quite new on this with binomial data tests, but needed to do one and now I´m not sure how to interpret the outcome. The y-variable, the response variable, is binomial and the explanatory factors ...

### Generalized Additive Model (k value)

I am trying to have result with GAM using R. In R, I am using mgcv and the code is following. However, I do not understand what k value for? If I want to see the seasonal effect on time then ...

### 1 I am looking for an explanation of inference of results from weighted regression for a ratio of two continous variables.

I am performing a weighted regression for a ratio $R = \frac{Y}{X}$ using $X$ as weights. Y and X are normal and they are not independent. Is it valid to use $X$ as weights? What is the interpretation ...

### 6 Prior for the coefficients of a linear regression model

I have a linear regression model $\bf Y=\bf{X}\bf{\beta}+\epsilon$. I want to assign a prior on $\bf\beta$ in order to derive the posterior predictive model $p(y_{predictive}|\bf{y},\bf{X},\beta)$. ...

### Forcing regression coefficients to be certain values based on population estimates

0 answers, 15 views regression cox-model confounding
I'm working with a researcher who found this paper and suggested we do something similar rather than the proportional hazards model I suggested. The model used in the paper is what the authors call ...

### Predict likelihood of deaths which distribution and which code [on hold]

In R: I have the following dataframe: ...

### GINI coefficient-whose data is more reliable? [on hold]

0 answers, 53 views regression correlation
I would like to ask you a super-basic question. I need to use Gini coefficient as moderator to examine well-being, resilience and other variables in Immigrants. Which Gini index is more reliable or ...

### Linear regression with a dependent variable that is mathematically related to an independent variable

0 answers, 43 views regression correlation medicine
Background I have clinical study data from 500 subjects. Measured variables (measured once in each subject), among others: $P_s$: Systolic blood pressure $P_d$: Diastolic blood pressure $d_s$: ...

### Using regression analysis for forecast - dummy encoding

I have took a look in a video that the guy was doing a regression analysis for forecast sales. He has a dataset with two columns (date and sales, for the past). He made a transformation on the ...

### Difference between regression between groups vs across all subjects (continuum)?

1 answers, 450 views regression group-differences
I'd like to understand this better in terms of drawbacks and suitability. For example, if my data includes investigating differences between 2 patient groups and a control group (3 groups in total), ...

### 1 Correlation Coefficient vs Kills/Death Ratio

0 answers, 51 views regression correlation
First off, I'm unfamiliar to statistics so bear with me. I am creating a web app that does some basic analysis on player stats. Right now I am trying to determine correlations (Pearson's correlation ...

### regression accuracy in matlab

0 answers, 20 views regression machine-learning svm
I have dataset that kind of climate features (11 attributes) hourly and electricity consumption in non-residental building, I test Regression (Tree, svm (guassian), Random forest and deep learning), ...

### 1 Can regression R-squared be smaller than sum of squared semi-partial correlations?

I have a linear regression equation where Y ~ X1 + X2 + X3 + X4. The intercept, X2, and X4 are statistically significant. The multiple r-squared is 0.04696, and adj R-sq is 0.0300. I am trying to ...

### How to carry out correlation/significance analysis?

0 answers, 203 views regression correlation
I am very new to statistics, and have searched around the net and stack exchange for an answer without luck. Therefore this post. I hope someone can help... I am carrying out an analysis of the ...

### Notational issues for point estimates

0 answers, 22 views regression random-variable notation
In the most basic form, (as I recall), consider a random variable $X$ which is defined over a probability space $\Omega$. Now, let us call a realization of $X$ as $x$ . As such, we can define ...

### Help interpreting regression [duplicate]

0 answers, 36 views r regression
these are the results I get using R on my database , can someone help me interpret the results ...

### 1 Help with R: Need to import data from a .csv file. Turn into time series and plot the time series as well as the linear regression [on hold]

0 answers, 17 views r regression time-series
I need to import two columns (time and temp) worth of data from a .csv file into R and convert it to a time-series and the plot it with a linear regression line. So far this is my approach: ...

### 1 Deriving predicted probabilities from gologit2 (proportional odds models) output

I am trying to understand the output from Richard Williams's amazing gologit2 STATA package. The software is used for ordinal logistic regression and circumvents violations of the proportional odds ...

### Panel regression with multiple fixed effects and heterogeneity

For a research project I am supposed to estimate a panel regression model on a dataset with user data over observation time (the sample is assumed to represent general population). The supervisor is ...

### How do I analyse the relationship between teachers' leadership and students' achievement? [on hold]

0 answers, 30 views regression correlation
I would like to know the relationship between teachers' leadership and students' achievement. There is a set of data on 60 respondents and two years (2015-2016) of student achievement data. Do I use ...

### 5 Autocorrelation and GLS

If autocorrelation in a model is detected by the Breusch-Godfrey test for r-th order autocorrelation, what is the GLS procedure for "fixing" the autocorrelation problem? And is Cochrane-Orcutt ...

### Transform (triangular?) data for linear regression

0 answers, 24 views regression data-transformation
I'd like to transform data (if possible) to perform linear regression. The data appears to be triangular. Are there specific/recommended transforms which might be suitable for this? This is the ...

### Report GLM and Posthoc with emmeans in APA format

0 answers, 10 views r regression logistic post-hoc reporting
This is the results of my anova(glm()) and the post-hoc analyses emmeans() :       ...

### 1 How to select hyperparameters for SVM regression after grid search?

1 answers, 2.155 views regression machine-learning svm scikit-learn
I have a small data set of $150$ points each with four features. I plan to fit a SVM regression for the reason that the $\varepsilon$ value gives me the possibility of define a tolerance value, ...

### 2 How to penalize a regression loss function to account for correctness on the sign of the prediction?

I am dealing with a regression problem (my targets could potentially take values between -inf to +inf). To optimise my model, I have two objectives: 1) Predictions should be close to the targets. 2)...

### 5 Is $R^2$ useless? [duplicate]

0 answers, 80 views regression r-squared
I stumbled on a discussion regarding the usefulness of $R^2$ as a metric. Where $R^2$ is defined as: $$\frac{\sum (\hat{y} - \bar{\hat{y}})^2 } {\sum(y - \bar{y})^2 }.$$ The criticism is backed by ...

### 1 Matlab ROC curve calculation question

1 answers, 331 views regression matlab roc
I'm working through the example code given by Matlab, but I can't seem to exactly reproduce the ROC curve that is plotted. I want to make sure I am understanding the thresholding concept properly. ...

### 2 Suitable function to choose the best split in a regression tree/oblivious tree

1 answers, 349 views regression cart
My main objective is to construct a regression (decision) tree. It is a part of a boosting algorithm using additive regression trees. The first question is what other functions (other than least ...

### Using regression parameter as mean in rnorm [on hold]

I want to test a model where the distribution of a random variable, assumed normal, is conditional on the regime of another random variable, that switches state according to a Markov chain. The first ...

### 3 Proof that centering will yield multiple linear regression model with same slopes but different intercept

1 answers, 77 views regression mathematical-statistics
When we perform multiple linear regression with centered predictors (that is, $x_{ij}^c = x_{ij} - \bar{x}_j$) we get the same coefficients as with the original predictors but a different intercept. I'...

### 1 what method can be used to test the correlation between multiple ratios and their cumulative correlation to a single binary value?

0 answers, 16 views regression logistic correlation
Let's say I have a certain number of rows, the first 3 columns of which are ratios, the last 1 is binary These ratios interact with each other in various ways, but all contribute in some manner to a ...

### 1 How to identify crucial features that lead to a strong sports performance

0 answers, 26 views regression correlation eda
I have a dataset (500 samples) that contains information about sportsmen. It contains about 30 features that describe: age body composition like weight and size amount of daily training ...

### 3 How to use a mathematical model for data analysis in R

1 answers, 412 views r regression ecology
I am looking to use a mathematical model developed by Firbank & Watkinson (1985) J. App. Ecol. 22:503-517 for the analysis of competition between plants grown in mixture. The model is as follows: ...

### 1 What are the assumptions of linear regression (both simple and multiple) [on hold]

I know this is a basic question, but I get slightly different answers everywhere I look, a quick example is that I learnt at Uni that the errors of the dependent variable have to be normally ...