Questions tagged [data-modeling]
Data modeling questions refer to the techniques used to gather and analyze data requirements needed to support data operations in programs and systems.
3,102 questions
0 votes, 1 answer, 16 views
Is there a way to relate a Calendar lookup table and a 24-hour Time lookup table to 2 different tables on more than 1 column in Power BI?
I have 2 tables, an encounters table and a procedures table, that each have 2 columns with a date data type and 2 columns with a time data type. I have created a Calendar lookup and a Time ...
-1 votes, 2 answers, 56 views
Need help with a pivot table or formatting the data
I have a list of promotions for each job level by month. In December, the promotion receivership % is 6.6% and in June it is 2.5%; however, when I include both, I was expecting 9.1%. ...
0 votes, 0 answers, 30 views
Optimizing Data Model for Frequent IoT Sensor Data Updates in Data Warehouse
I am working on a data engineering project involving IoT sensor data. In this project, I have a use case where:
We have a table in the data warehouse to store sensor data generated every second by 10 ...
0 votes, 0 answers, 26 views
Is there a way to model a partial predictor in a classification problem with an unbalanced target?
I would like to share a classification issue I faced during the modelling process. I have to create a model for an unbalanced binary target using 4 predictors, where one of them has 45% of wrong ...
0 votes, 1 answer, 19 views
Getting the same aggregation value for all categories
I have three tables named Content, Reaction and ReactionTypes; the Content and ReactionTypes tables are connected to the Reaction table using a one-to-many relationship. Now, Category and Score are the ...
1 vote, 1 answer, 43 views
Power BI file size not making sense
So I'm working on a Power BI file, I want to add more data to it, and I can't understand how the file size is bigger than the sum of the data...
The existing file is 1.2 GB.
I want to add some "...
0 votes, 0 answers, 10 views
WRF rain output time series ideas?
To obtain a rain time series, I computed RAINC + RAINCC for each time step of my simulation. To get hourly data from the simulation, I subtract RAINC and RAINCC from the subsequent time (rain time 2 - rain time 1 ...
0 votes, 1 answer, 35 views
Does duplicated data from denormalization affect the performance of vector searches?
In Cassandra, data are usually denormalized to match the query pattern. However, with vector columns, this means duplicating the same vectors. I know vector similarity search and index is very ...
0 votes, 0 answers, 25 views
GraphQL Python Graphene - How do resolvers work? If there is a resolver function for each field, how does GraphQL know how to order them?
I am a beginner at GraphQL and am having trouble grasping how resolvers really work. If there is a resolver function for each field and there are multiple rows of data, how does GraphQL know how to put ...
-3 votes, 0 answers, 26 views
How is the params parameter specified in statsmodels.tsa.statespace.varmax.VARMAX.simulate?
I am trying to use the statsmodels.tsa.statespace.varmax.VARMAX.simulate method. I am familiar with the lag polynomial representation of VARMA models, but I am not sure how the params argument should ...
1 vote, 0 answers, 15 views
Degenerate Dimension/Keys and Reference tables
What are the best practices for handling degenerate keys/dimensions in a Raw Data Vault?
Also, how should Xref, lookup, or static tables (one-time load) be handled when designing a Raw Data Vault, they should be ...
-1 votes, 1 answer, 48 views
A schema with 10 Fact Tables and 1 Dimension Table
Is it normal or best practice to have multiple fact tables but only a minimal number of dimension tables?
I am new to data modelling and trying to create the best / most efficient data model and it always seems ...
0 votes, 0 answers, 35 views
DWH Model - Relationship between Fact tables
I have been given a new requirement which, in Power BI, would probably be fixed in Power Query, so each data set might not have an issue with multiple fact tables, but with how I store the data I am ...
1 vote, 1 answer, 45 views
Modeling Point Processes with Covariates: Use of quadscheme in R
I am currently attempting to model fire outbreaks using precipitation as a covariate. Upon reading various suggestions, I came across the quadscheme function in R. According to the documentation, the ...
0 votes, 1 answer, 38 views
Fact Table Type 2 History to show changes
I have 2 source tables, asset and asset attributes, which I’ve modelled into a fact and 3-4 dimensions.
I created a dimension for some of the attributes to show change, so if an asset is a 2 bedroom but ...