All Questions
Tagged with statistics linear-regression
393
questions
0
votes
0
answers
28
views
How to compare variance of distance between two treatments over time?
I am interested in understanding how my variable (distance) changes over time between two treatment groups. I now want to compare measures of variance for distance between the two treatment groups, ...
0
votes
0
answers
16
views
Difference between least squares regression line and population regression line
What is the difference between the least squares regression line and the population regression line? For example, what is the difference between the predicted values b0 and b1 and the actual values B0 and B1? Does actual ...
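For reference, the two lines the question contrasts can be written side by side in standard textbook notation. The symbols b0, b1, B0, and B1 follow the excerpt; the error term and the hat on y are assumptions added here.

```latex
% Population regression line: B_0 and B_1 are fixed, unknown parameters
y = B_0 + B_1 x + \varepsilon, \qquad \mathrm{E}[\varepsilon \mid x] = 0

% Least squares regression line: b_0 and b_1 are estimates computed from one sample
\hat{y} = b_0 + b_1 x, \qquad
b_1 = \frac{\sum_i (x_i - \bar{x})(y_i - \bar{y})}{\sum_i (x_i - \bar{x})^2}, \qquad
b_0 = \bar{y} - b_1 \bar{x}
```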
0
votes
0
answers
26
views
leverage of new predicted value
I would like to find the leverage of the new predicted values in my regression. Below is some example code.
How would I be able to find the leverage for the 2 new predicted values?
data("mtcars...
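The question's code is cut off, so the following is only a minimal sketch of the idea for the simplest case, a regression with one predictor and an intercept. It assumes "leverage of a new point" means the usual quantity x0ᵀ(XᵀX)⁻¹x0 evaluated with the training design matrix, which for one predictor reduces to 1/n + (x0 - x̄)² / Σ(xi - x̄)². The function name and the data are hypothetical and are not taken from mtcars.

```python
from statistics import fmean


def leverage_simple(x_train, x_new):
    """Leverage of each new x value under a simple linear regression.

    Uses h0 = 1/n + (x0 - mean(x))^2 / sum((x_i - mean(x))^2), which
    depends only on the training predictor values, not on the response.
    """
    n = len(x_train)
    x_bar = fmean(x_train)
    sxx = sum((x - x_bar) ** 2 for x in x_train)
    return [1.0 / n + (x0 - x_bar) ** 2 / sxx for x0 in x_new]


# Hypothetical predictor values used to fit the model
x_train = [2.0, 3.0, 5.0, 7.0, 11.0]
# Two hypothetical new points: one inside the training range, one far outside
x_new = [4.0, 20.0]

new_leverages = leverage_simple(x_train, x_new)
print(new_leverages)
```

The point far outside the training range gets a much larger value, which is the sense in which leverage measures extrapolation. With several predictors the same quantity needs the matrix form rather than this closed-form shortcut.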
-1
votes
1
answer
55
views
How to represent a model fitted using spline transformer + linear regression as a mathematical equation when the data is high dimensional
I have a dataset consisting of 5 features and 1 target. I am using a spline transformer to transform the features and then fit it using linear regression. I am fitting the data with 1st degree and 5 ...
0
votes
0
answers
42
views
Logistic regression resulting in "eval(family$initialize) : y values must be 0 <= y <= 1"?
Keep in mind that I'm very new to regression (not necessarily R as a language, but I'm no expert). I am using a data set called MidCity and I am supposed to determine whether or not brick homes are ...
0
votes
0
answers
28
views
Why does the TukeyHSD test keep returning NA for a linear model in R?
I have a dataframe with four columns: BMI, AGE, SEX and Geno. Geno is a factor variable with 4 levels (A|A, A|G, G|A, ref: G|G). I would like to fit a linear regression model using BMI ~ AGE + SEX + Geno, and ...
0
votes
0
answers
14
views
Find the optimal model for a dataset
Hi everyone, I need to find a model to answer the question of whether young people are going to the theater less than before. Could you help me find which model best fits this dataset in ...
0
votes
0
answers
30
views
Sampling Variance of OLS Estimators of Regression Coefficients
I am confused about whether the value of the sampling variance of the OLS estimator of a regression coefficient (e.g. slope) differs from sample to sample.
Assume we have the following simple linear ...
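A short worked statement of the distinction may help here. The excerpt is truncated, so the model below is an assumption: a simple linear regression $y_i = \beta_0 + \beta_1 x_i + \varepsilon_i$ with homoskedastic errors of variance $\sigma^2$.

```latex
% True sampling variance of the slope estimator, conditional on the x values
\mathrm{Var}(\hat{\beta}_1 \mid x) = \frac{\sigma^2}{\sum_{i=1}^{n} (x_i - \bar{x})^2}

% Estimated sampling variance, with s^2 substituted for the unknown sigma^2
\widehat{\mathrm{Var}}(\hat{\beta}_1) = \frac{s^2}{\sum_{i=1}^{n} (x_i - \bar{x})^2},
\qquad
s^2 = \frac{1}{n - 2} \sum_{i=1}^{n} \hat{\varepsilon}_i^{\,2}
```

For given x values the first quantity is a fixed number. The second depends on the residuals, so it changes from sample to sample.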
1
vote
0
answers
45
views
Why are my VIF scores so low when my OLS Regression Results show a warning that the condition number is large (1.98 * 10^3), possible multicollinearity?
import pandas as pd import numpy as np import matplotlib.pyplot as plt import statsmodels.api as sm import seaborn as sns from statsmodels.stats.outliers_influence import variance_inflation_factor from ...
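One thing that can produce this combination is predictors that sit on very different scales or far from zero. The condition number of the design matrix is sensitive to scaling and to the intercept column, while VIF is not. Whether that is the cause in this question is a guess, since the code is truncated. The NumPy sketch below uses made-up data, and `x1`, `x2`, and the helper `vif` are hypothetical names, not taken from the question.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 200

# Hypothetical predictors: independent of each other, but on very different scales
x1 = rng.normal(loc=1000.0, scale=10.0, size=n)
x2 = rng.normal(loc=0.0, scale=1.0, size=n)


def vif(target, others):
    """VIF = 1 / (1 - R^2), regressing one predictor on the others plus an intercept."""
    A = np.column_stack([np.ones(len(target)), others])
    coef, *_ = np.linalg.lstsq(A, target, rcond=None)
    resid = target - A @ coef
    r2 = 1.0 - resid.var() / target.var()
    return 1.0 / (1.0 - r2)


vif_x1 = vif(x1, x2)
vif_x2 = vif(x2, x1)

# Condition number of the raw design matrix, intercept column included
X = np.column_stack([np.ones(n), x1, x2])
cond_raw = np.linalg.cond(X)

# Condition number after standardizing the predictors
z1 = (x1 - x1.mean()) / x1.std()
z2 = (x2 - x2.mean()) / x2.std()
Z = np.column_stack([np.ones(n), z1, z2])
cond_std = np.linalg.cond(Z)

print(f"VIF x1: {vif_x1:.3f}, VIF x2: {vif_x2:.3f}")
print(f"condition number raw: {cond_raw:.1f}, standardized: {cond_std:.2f}")
```

In this sketch the VIFs stay close to 1 because the predictors are unrelated, yet the raw condition number is large because `x1` is nearly proportional to the intercept column. Standardizing the predictors brings it down.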
0
votes
0
answers
16
views
How to account for variable group sizes in linear model?
Example: My linear model shows that plant richness increases with latitude. However, the number of individuals surveyed in each sampling site also increases with latitude. Can I include the number of ...
0
votes
0
answers
33
views
OLS linear regression and causality - does adding control variables create identical subgroups within which treatment is estimated?
I'm trying to understand the difference between an OLS estimation model and a Propensity Score Matching (PSM) technique.
I understand that, essentially, what PSM does is compare treatment and ...
0
votes
0
answers
24
views
Disentangling 2 unknowns using linear regression
I am figuring out how to disentangle two unknown values using regressions, and I want to ask whether it is possible.
Let's say I have a measured value (y-axis) that is a sum of two unknowns (A and B).
...
0
votes
0
answers
38
views
Issue of multicollinearity in R for glm analysis
I was wondering if someone could help me with a statistical problem I have run into. Any help would be greatly appreciated.
Please note that for clarity, I have simplified the below description. It ...
1
vote
1
answer
457
views
"Full rank" error when estimating OLS with statsmodels
I have historical data for crop yield, annual temperature and annual precipitation for a given region. My goal is to estimate the following linear model:
In which y is the crop annual yield, t ...
1
vote
1
answer
87
views
Getting a different kurtosis from numpy array method than from Summary
I have to extract information from the statsmodels OLS summary. While doing so, I found that the kurtosis in the summary is different from the value returned by the array method kurtosis().
Here is the code:
from sklearn....
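The excerpt is cut off, so the cause here is a guess. A common reason for this kind of mismatch is that two different kurtosis conventions are being compared. The statsmodels OLS summary appears to report Pearson kurtosis (about 3 for normal residuals), while functions such as `scipy.stats.kurtosis` and pandas `Series.kurtosis` default to excess (Fisher) kurtosis, and pandas also applies a bias correction. The sketch below shows the two conventions in plain Python. The function names and the residual values are hypothetical.

```python
from statistics import fmean


def pearson_kurtosis(values):
    """Fourth standardized moment m4 / m2^2 (equals 3 for a normal distribution)."""
    mu = fmean(values)
    m2 = fmean([(v - mu) ** 2 for v in values])
    m4 = fmean([(v - mu) ** 4 for v in values])
    return m4 / m2 ** 2


def excess_kurtosis(values):
    """Fisher (excess) kurtosis: Pearson kurtosis minus 3."""
    return pearson_kurtosis(values) - 3.0


# Hypothetical residuals, not taken from the question
resid = [-2.0, -1.0, 0.0, 1.0, 2.0]

print(pearson_kurtosis(resid), excess_kurtosis(resid))
```

If the two numbers being compared differ by roughly 3, the convention is the likely explanation. A smaller leftover gap could come from a bias correction.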