Skip to main content

Questions tagged [statistics]

Statistics is the mathematical study of using probability to infer characteristics of a population from a limited number of samples or observations. Non-programming statistics questions are off-topic here, and they should be posted at https://stats.stackexchange.com instead.

0 votes
0 answers
5 views

How to deal with word counts of zero when calculating Pointwise Mutual Information (PMI) for word cooccurrences in Natural Language Processing

I have a co-occurrence matrix of words in a text (two words x and y are considered co-occurring, if they both occur in a context window of w words). I want to calculate the Pointwise Mutual ...
AlinaOs's user avatar
  • 15
0 votes
0 answers
13 views

How to code an algorithm for Linda Index calculation in SQL (Apache Superset)?

I collided with a real obstacle trying to bring the calculation of Linda index in Apache Superset SQL Notebook. Let's assume we have a table for 6 firms in one region for one year with their market ...
Serggio_K's user avatar
1 vote
2 answers
39 views

Python - Check if the last value in a sequence is relatively higher than the rest

For a list of percentage data, I need to check if the last value (90.2) is somehow higher and somewhat "abnormal" than the rest of the data. Clearly it is in this sequence. delivery_pct = [...
RSW's user avatar
  • 1,152
0 votes
0 answers
7 views

Hausman test report [closed]

I've been working on a research project using a multi-level regression model, and I'm currently figuring out how to present the Hausman test results. I've seen some papers where authors mention doing ...
Sabrina's user avatar
0 votes
0 answers
10 views

How to prove expected residual time formula

Can anyone help me to show that for a general random variable X with a fixed distribution to model the time interval between the independent events. The expected residual time, which is the time one ...
Pique's user avatar
  • 11
0 votes
0 answers
14 views

Variation in the total number of samples when performing multiple fisher exact tests [migrated]

I have dataset of 2 time points. In each time point I have to compare the counts of different components at each position (number of positions =1000). I used fisher exact test to compare if there is a ...
Marwah Al-kaabi's user avatar
0 votes
0 answers
28 views

How to compare variance of distance between two treatments over time?

I am interested in understanding how my variable (distance) changes over time between two treatment groups. I now want to compare measures of variance for distance between the two treatment groups, ...
twilightecologist's user avatar
1 vote
0 answers
19 views

How to check if a variable varies across timestamps or across samples

Let's assume we have some time-series data where every person (sample) has measurements for 2 variables (Var1, Var2) for three timepoints. Following is a dummy representation of the data. Is there a ...
Kowshika's user avatar
0 votes
0 answers
9 views

Hovmoller Plot in Python not showing correctly

I am trying to create a Hovmoller plot in Python on Google Colab, but Hovmoller is not showing as it should show. It's showing vertical different color boxes how to fix this? Here is the code that I ...
Zohair's user avatar
  • 195
-1 votes
0 answers
28 views

What distribution should I use? [closed]

I'm new to statistics and need to create a probability distribution with the following study: *Note: Due to proprietary reasons the example given below is the best representation of the real data. ...
Rookie_Programmer's user avatar
0 votes
0 answers
10 views

What is the best test to run to compare presence/absence of something between sampling locations?

I have data on the presence/absence of four different pathogens found in edible crabs in 7 locations over 2 seasons. For this study, 30 crabs were collected from each site and for each crab the ...
Kiran B's user avatar
  • 21
1 vote
0 answers
30 views

What does the attribute x_loadings_ represent?

I'm doing a canonical correlation analysis using scikit-learn's CCA. After doing the usual steps and calling ca.x_loadings_, I see that I get values bigger than 1. How I intepret the explanation of ...
Hendrik's user avatar
  • 21
0 votes
0 answers
20 views

How do I correctly specify nesting in the sommer R package?

I accustomed to lme4 where it would be simply: (1|genotype/root_ID) but it's unclear to me looking through the documentation the appropriate way to do this in sommer. mod1 <- mmer(Y ~ 1, ...
bigoof99's user avatar
0 votes
0 answers
18 views

Remove altitude "outliers" from a GPS track

I've recorded a bike track with a very noisy result for altitude (barometric sensor). I enhanced the altitude data using a digital elevation model. See example here: However, crossing bridges or ...
DJJazz's user avatar
  • 55
0 votes
0 answers
10 views

I need to take inputs of confidence levels and orders so far in a day and use it to predict how many more orders will come in, using past data

Using replaceable sets of data about past orders and the times in which they were placed, and inputs for the confidence level and the amount of orders already placed, estimate the amount of orders ...
Shreyas's user avatar

15 30 50 per page
1
2 3 4 5
1097