regression's questions - Chinese 1answer

16.598 regression questions.

If I set the regularization parameter to 0, does it essentially mean I'm not applying regularization (I've boxed the regularization bits in red)? Also, what is this type of regularization called?

I performed two separate regression analyses by dividing my total sample into 'Male' and 'Female' samples, that resulted into different % of variances of variables (e.g for male the percent variance ...

I am running a logistic regression, but for a group of predictors I tried, all of then are highly significant but the residual deviance is much higher than the null deviance. ...

I have data from a randomised blocked experiment testing the effect of different crops on weeds. Each treatment was replicated in one plot in each of three blocks, on two farms, in two years (but NOT ...

I've been working on a regression problem where the input is an image, and the label is a continuous value between 80 and 350. The images are of some chemicals after a reaction takes place. The color ...

I'm currently following the "Forecasting Product Demand in R" (https://www.datacamp.com/courses/visualizing-time-series-data-in-r) course on Datacamp, and I've been stuck trying to understand transfer ...

there is a lot of research where deep learning works so well with classification but not in regression field SVR, tree-based approach is still good and I couldn't find good architecture about ...

I originally posted this in stackoverflow (as given here) but was told to try here since it might be more relevant here. I am very new to statistics and R in general so my question might be a bit dumb,...

Question: What are the main differences and similarities between using Newey-West (1987) and Hansen-Hodrick (1980) standard errors? In which situations should one of these be preferred over the other? ...

Hello (and thanks for reading this). I have a set of data that looks at the area of remaining retina against age. I have three points of data per patient. We know that over time, the area of ...

Given a $m\times p$ matrix $Y$ on the left, and a $m\times q$ matrix $X$ on the right, CCA tries to find 2 sets of mapping coefficients such that $Y\beta_{l}$ and $X\beta_{r}$ have the highest ...

Hello (and thanks for reading this). I have a set of data that looks at the area of remaining retina against age. I have three points of data per patient. We know that over time, the area of ...

For ridge regression I learned before, $\hat{w} = argmin_{\theta}||y-Xw||_2^2 + \lambda||w||_2^2$. Thinking about if the bias is added, so the new $X$ become $[1,X]$, and we have a new weight $\theta$...

i am using a model in which i have 3 dependent variables. i) Co2 emission ii) Nox emission iii) deforestation. and 6 independent variables i)Governance ii)Foreign Direct Investment iii)GDP Per ...

I am using a random forest for a 2 class classification problem. But eventually using probability of class "1" returned by the model for my task and not the label. I get AUC of about 70% Then I ...

I have been trying to find out a way in order to eliminate outliers from a dataset. The outliers are removed the following way: Any value which results into a 10% reduction in R2 value needs to be ...

I'm quite new on this with binomial data tests, but needed to do one and now I´m not sure how to interpret the outcome. The y-variable, the response variable, is binomial and the explanatory factors ...

I am trying to have result with GAM using R. In R, I am using mgcv and the code is following. However, I do not understand what k value for? If I want to see the seasonal effect on time then ...

I am performing a weighted regression for a ratio $R = \frac{Y}{X}$ using $X$ as weights. Y and X are normal and they are not independent. Is it valid to use $X$ as weights? What is the interpretation ...

I have a linear regression model $\bf Y=\bf{X}\bf{\beta}+\epsilon$. I want to assign a prior on $\bf\beta$ in order to derive the posterior predictive model $p(y_{predictive}|\bf{y},\bf{X},\beta)$. ...

I'm working with a researcher who found this paper and suggested we do something similar rather than the proportional hazards model I suggested. The model used in the paper is what the authors call ...

In R: I have the following dataframe: ...

I would like to ask you a super-basic question. I need to use Gini coefficient as moderator to examine well-being, resilience and other variables in Immigrants. Which Gini index is more reliable or ...

Background I have clinical study data from 500 subjects. Measured variables (measured once in each subject), among others: $P_s$: Systolic blood pressure $P_d$: Diastolic blood pressure $d_s$: ...

I have took a look in a video that the guy was doing a regression analysis for forecast sales. He has a dataset with two columns (date and sales, for the past). He made a transformation on the ...

I'd like to understand this better in terms of drawbacks and suitability. For example, if my data includes investigating differences between 2 patient groups and a control group (3 groups in total), ...

First off, I'm unfamiliar to statistics so bear with me. I am creating a web app that does some basic analysis on player stats. Right now I am trying to determine correlations (Pearson's correlation ...

I have dataset that kind of climate features (11 attributes) hourly and electricity consumption in non-residental building, I test Regression (Tree, svm (guassian), Random forest and deep learning), ...

I have a linear regression equation where Y ~ X1 + X2 + X3 + X4. The intercept, X2, and X4 are statistically significant. The multiple r-squared is 0.04696, and adj R-sq is 0.0300. I am trying to ...

I am very new to statistics, and have searched around the net and stack exchange for an answer without luck. Therefore this post. I hope someone can help... I am carrying out an analysis of the ...

In the most basic form, (as I recall), consider a random variable $X$ which is defined over a probability space $\Omega$. Now, let us call a realization of $X$ as $x$ . As such, we can define ...

these are the results I get using R on my database , can someone help me interpret the results ...

I need to import two columns (time and temp) worth of data from a .csv file into R and convert it to a time-series and the plot it with a linear regression line. So far this is my approach: ...

I am trying to understand the output from Richard Williams's amazing gologit2 STATA package. The software is used for ordinal logistic regression and circumvents violations of the proportional odds ...

For a research project I am supposed to estimate a panel regression model on a dataset with user data over observation time (the sample is assumed to represent general population). The supervisor is ...

I would like to know the relationship between teachers' leadership and students' achievement. There is a set of data on 60 respondents and two years (2015-2016) of student achievement data. Do I use ...

If autocorrelation in a model is detected by the Breusch-Godfrey test for r-th order autocorrelation, what is the GLS procedure for "fixing" the autocorrelation problem? And is Cochrane-Orcutt ...

I'd like to transform data (if possible) to perform linear regression. The data appears to be triangular. Are there specific/recommended transforms which might be suitable for this? This is the ...

This is the results of my anova(glm()) and the post-hoc analyses emmeans() :       ...

I have a small data set of $150$ points each with four features. I plan to fit a SVM regression for the reason that the $\varepsilon$ value gives me the possibility of define a tolerance value, ...

I am dealing with a regression problem (my targets could potentially take values between -inf to +inf). To optimise my model, I have two objectives: 1) Predictions should be close to the targets. 2)...

I stumbled on a discussion regarding the usefulness of $R^2$ as a metric. Where $R^2$ is defined as: $$ \frac{\sum (\hat{y} - \bar{\hat{y}})^2 } {\sum(y - \bar{y})^2 }.$$ The criticism is backed by ...

I'm working through the example code given by Matlab, but I can't seem to exactly reproduce the ROC curve that is plotted. I want to make sure I am understanding the thresholding concept properly. ...

My main objective is to construct a regression (decision) tree. It is a part of a boosting algorithm using additive regression trees. The first question is what other functions (other than least ...

I want to test a model where the distribution of a random variable, assumed normal, is conditional on the regime of another random variable, that switches state according to a Markov chain. The first ...

When we perform multiple linear regression with centered predictors (that is, $x_{ij}^c = x_{ij} - \bar{x}_j$) we get the same coefficients as with the original predictors but a different intercept. I'...

Let's say I have a certain number of rows, the first 3 columns of which are ratios, the last 1 is binary These ratios interact with each other in various ways, but all contribute in some manner to a ...

I have a dataset (500 samples) that contains information about sportsmen. It contains about 30 features that describe: age body composition like weight and size amount of daily training ...

I am looking to use a mathematical model developed by Firbank & Watkinson (1985) J. App. Ecol. 22:503-517 for the analysis of competition between plants grown in mixture. The model is as follows: ...

I know this is a basic question, but I get slightly different answers everywhere I look, a quick example is that I learnt at Uni that the errors of the dependent variable have to be normally ...

Related tags

Hot questions

Language

Popular Tags