Skip to main content

All Questions

0 votes
0 answers
28 views

How to compare variance of distance between two treatments over time?

I am interested in understanding how my variable (distance) changes over time between two treatment groups. I now want to compare measures of variance for distance between the two treatment groups, ...
twilightecologist's user avatar
0 votes
0 answers
16 views

Difference between least square regression line and population regression line

What is difference between least square regression line and population regression line ? For example, what is difference between predicted values b0 and b1 vs actual values B0 and B1 ? Does actual ...
Krupa Khandor's user avatar
0 votes
0 answers
26 views

leverage of new predicted value

I would like to find the leverage of the new predicted values in my regression. Below is an example code how would I be able to find the leverage for the 2 new predicted values ? data("mtcars&...
AshleyCrasto's user avatar
-1 votes
1 answer
55 views

How to represent a model fitted using spline transformer + linear regression in an mathematical equation. The data is high dimensional

I have a dataset consisting of 5 features and 1 target. I am using a spline transformer to transform the features and then fit it using linear regression. I am fitting the data with 1st degree and 5 ...
Yash Pisat's user avatar
0 votes
0 answers
42 views

Logistic regression resulting in "eval(family$initialize) : y values must be 0 <= y <= 1"?

Keep in mind that I'm very new to regression (not necessarily R as a language, but I'm no expert). I am using a data set called MidCity and I am supposed to determine whether or not brick homes are ...
bay's user avatar
  • 13
0 votes
0 answers
28 views

Why TukeyHSD test keeps returning NA for a linear model in R?

I have a dataframe with four columns: BMI, AGE, SEX and Geno. Geno is a factor variable has 4 levels(A|A,A|G,G|A,ref:G|G). I would like to fit a linear regression model use BMI~AGE + SEX + Geno, and ...
Timon's user avatar
  • 327
0 votes
0 answers
14 views

find the optimal model for dataset

dataset Hi everyone, i need to find the model to answer the question that young people are going to theater less than before. Could you guy help me to find which model fits the best to this dataset in ...
Thang Ho's user avatar
0 votes
0 answers
30 views

Sampling Variance of OLS Estimators of Regression Coefficients

I am confused about whether the value of the sampling variance of the OLS estimator of a regression coefficient (e.g. slope) differs from sample to sample. Assume we have the following simple linear ...
Jingyang Zhang's user avatar
1 vote
0 answers
45 views

Why are my VIF scores so low, but in my OLS Regression Results, I get a warning: the condition number is large 1.98 * 10^3, possible multicollinearity

mport pandas as pd import numpy as np import matplotlib.pyplot as plt import statsmodels.api as sm import seaborn as sns from statsmodels.stats.outliers_influence import variance_inflation_factor from ...
Adonistats's user avatar
0 votes
0 answers
16 views

How to account for variable group sizes in linear model?

Example: My linear model shows that plant richness increases with latitude. However, the number of individuals surveyed in each sampling site also increases with latitude. Can I include the number of ...
stb's user avatar
  • 47
0 votes
0 answers
33 views

OLS linear regression and causality - does adding control variables create identical subgroups within which treatment is estimated?

Im trying to understand the difference between an OLS estimation model and a Propensity Score Matching (PSM) Technique. I understand that, essentially, what the PSM does is comparing treatment and ...
Giulio Cavallari's user avatar
0 votes
0 answers
24 views

Disentangling 2 unknowns using linear regression

I am figuring out how to disentangle two unknown values using regressions, and I want to ask whether it is possible. Let's say I have a measured value (y-axis) that is a sum of two unknowns (A and B). ...
Joji's user avatar
  • 1
0 votes
0 answers
38 views

Issue of multicollinearity in R for glm analysis

I was wondering if someone could help me with a statistical problem I have run into. Any help would be incredibly helpful. Please note that for clarity, I have simplified the below description. It ...
IanH's user avatar
  • 161
1 vote
1 answer
457 views

"Full rank" error when estimating OLS with statsmodel

I have historical data for crop yield, annual temperature and annual precipitation for a given region. My goal is to estimate the following linear model: In which y is the crop annual yield, t ...
Oalvinegro's user avatar
1 vote
1 answer
87 views

Getting a different kurtosis from numpy array method than from Summary

I have to extract information from the OLS statsmodel summary. While doing it, the Kurtosis results from the summary is different than the array method kurtosis(). Here is the code: from sklearn....
Heitor Nunes Rosa's user avatar

15 30 50 per page
1
2 3 4 5
27