Questions tagged [statistics]

Statistics is the mathematical study of using probability to infer characteristics of a population from a limited number of samples or observations. Non-programming statistics questions are off-topic here, and they should be posted at https://stats.stackexchange.com instead.

16,447 questions

0 votes

0 answers

3 views

How to deal with word counts of zero when calculating Pointwise Mutual Information (PMI) for word cooccurrences in Natural Language Processing

I have a co-occurrence matrix of words in a text (two words x and y are considered co-occurring, if they both occur in a context window of w words). I want to calculate the Pointwise Mutual ...

AlinaOs

asked 18 mins ago
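
One common convention for the zero-count case raised in this question is positive PMI (PPMI), which maps undefined or negative values to 0. The following is a minimal sketch under stated assumptions, not the asker's method: the function name `pmi_matrix`, the dict-of-pairs input format, and the choice of log base 2 are all hypothetical choices made for illustration.

```python
import math

def pmi_matrix(cooc, positive=True):
    """Compute PMI for each (x, y) pair in a co-occurrence count dict.

    cooc maps (x, y) tuples to co-occurrence counts (hypothetical format).
    A count of zero would require log(0), so it is handled explicitly:
    0.0 under the PPMI convention, -inf for plain PMI.
    """
    total = sum(cooc.values())
    row, col = {}, {}
    for (x, y), count in cooc.items():
        row[x] = row.get(x, 0) + count  # marginal count for x
        col[y] = col.get(y, 0) + count  # marginal count for y

    result = {}
    for (x, y), count in cooc.items():
        if count == 0:
            result[(x, y)] = 0.0 if positive else float("-inf")
            continue
        # PMI = log2( p(x, y) / (p(x) * p(y)) ), written in terms of counts
        value = math.log2(count * total / (row[x] * col[y]))
        result[(x, y)] = max(value, 0.0) if positive else value
    return result

# Made-up counts for illustration only
scores = pmi_matrix({("a", "b"): 2, ("a", "c"): 0, ("b", "c"): 2})
```

Other options, such as additive smoothing of the counts before taking the logarithm, trade the hard cutoff for a small bias in every cell.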

0 votes

0 answers

13 views

How to code an algorithm for Linda Index calculation in SQL (Apache Superset)?

I ran into a real obstacle trying to implement the calculation of the Linda index in the Apache Superset SQL Notebook. Let's assume we have a table for 6 firms in one region for one year with their market ...

Serggio_K

asked yesterday

1 vote

2 answers

39 views

Python - Check if the last value in a sequence is relatively higher than the rest

For a list of percentage data, I need to check whether the last value (90.2) is noticeably higher than the rest of the data and therefore somewhat "abnormal". Clearly it is in this sequence. delivery_pct = [...

SIGHUP

modified yesterday
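
One simple way to approach the check described in this question is a z-score of the last value against the mean and standard deviation of the preceding values. This is only a sketch under assumptions: the function name `last_is_outlier`, the threshold of 3 standard deviations, and the sample data are hypothetical (the original list is truncated in the excerpt, so only the final value 90.2 comes from the question).

```python
import statistics

def last_is_outlier(values, threshold=3.0):
    """Return True if the last value lies more than `threshold` sample
    standard deviations above the mean of the preceding values."""
    *rest, last = values
    if len(rest) < 2:
        raise ValueError("need at least two preceding values")
    mean = statistics.mean(rest)
    sd = statistics.stdev(rest)  # sample standard deviation of the rest
    if sd == 0:
        # All preceding values are identical: any increase counts as unusual
        return last > mean
    return (last - mean) / sd > threshold

# Hypothetical data; only the final 90.2 is taken from the question
delivery_pct = [40.1, 42.3, 39.8, 41.5, 90.2]
is_abnormal = last_is_outlier(delivery_pct)
```

The last value is excluded from the mean and standard deviation so that a large spike does not inflate the baseline it is being compared against. For short or skewed series, a median and MAD based rule may be more robust than this z-score.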

0 votes

0 answers

7 views

Hausman test report [closed]

I've been working on a research project using a multi-level regression model, and I'm currently figuring out how to present the Hausman test results. I've seen some papers where authors mention doing ...

Sabrina

asked yesterday

55 votes

6 answers

86k views

How to perform a two-sample one-tailed t-test with numpy/scipy

In R, it is possible to perform a two-sample one-tailed t-test simply by using > A = c(0.19826790, 1.36836629, 1.37950911, 1.46951540, 1.48197798, 0.07532846) > B = c(0.6383447, 0.5271385, 1....

thalassa

modified yesterday

0 votes

0 answers

10 views

How to prove expected residual time formula

Can anyone help me to show that for a general random variable X with a fixed distribution to model the time interval between the independent events. The expected residual time, which is the time one ...

Pique

asked 2 days ago

0 votes

0 answers

14 views

Variation in the total number of samples when performing multiple Fisher exact tests [migrated]

I have a dataset with 2 time points. At each time point I have to compare the counts of different components at each position (number of positions = 1000). I used the Fisher exact test to check whether there is a ...

Marwah Al-kaabi

asked Jul 15 at 1:49

2 votes

1 answer

6k views

Statsmodels OLS Regression: Log-likelihood, uses and interpretation

I'm using python's statsmodels package to do linear regressions. Among the output of R^2, p, etc there is also "log-likelihood". In the docs this is described as "The value of the likelihood function ...

JeeyCi

modified Jul 14 at 10:03

0 votes

0 answers

27 views

How to compare variance of distance between two treatments over time?

I am interested in understanding how my variable (distance) changes over time between two treatment groups. I now want to compare measures of variance for distance between the two treatment groups, ...

twilightecologist

asked Jul 13 at 20:56

0 votes

0 answers

64 views

Error "obj is not defined" with PyInstaller when creating an EXE

I've created a script that calculates the p-value using the Sum Rank method. When I run the script everything works, but when I run the .exe from the command prompt, I get the following error: D:\...

marc_s

modified Jul 13 at 18:36

1 vote

0 answers

19 views

How to check if a variable varies across timestamps or across samples

Let's assume we have some time-series data where every person (sample) has measurements for 2 variables (Var1, Var2) for three timepoints. Following is a dummy representation of the data. Is there a ...

Kowshika

modified Jul 13 at 7:17

0 votes

0 answers

149 views

Using R, how do I perform a Bonferroni-type test to decide which areas have more coronavirus cases than expected after performing a chi-square test

Using R, I have to test whether the number of coronavirus cases is evenly distributed across some areas with respect to those areas' populations and then if they are not evenly distributed with ...

JayPeerachai

modified Jul 12 at 14:57

0 votes

0 answers

9 views

Hovmoller Plot in Python not showing correctly

I am trying to create a Hovmoller plot in Python on Google Colab, but the plot is not rendering as it should: it shows vertical boxes of different colors. How can I fix this? Here is the code that I ...

Zohair

asked Jul 12 at 11:51

-1 votes

0 answers

28 views

What distribution should I use? [closed]

I'm new to statistics and need to create a probability distribution with the following study: *Note: Due to proprietary reasons the example given below is the best representation of the real data. ...

Rookie_Programmer

asked Jul 12 at 1:59

3 votes

4 answers

90 views

Rename function in Python

I'm trying to learn the pandas package in Python, but I have a problem using the rename function. mushroom = pd.read_csv('./mushroom/agaricus-lepiota.data', header=None) mushroom ...

M--

modified Jul 11 at 14:39
