Questions tagged [dataset]
A dataset is a collection of data, generally represented in tabular form, with columns signifying different variables and rows signify different members of the set. If you are looking for a freely available dataset for any purpose, please consider asking your question on https://opendata.stackexchange.com.
dataset
11,450
questions
0
votes
0
answers
23
views
Adding Average at the very last part of the row and column in a dataset
I have a sample dataset of total outbound calls, it comprises rows (dates) and columns (hourly), my goal is to show totals and average of outbound calls per day (rows), and the totals and average of ...
0
votes
0
answers
5
views
How to use an Iterable dataloader for training in pytorch
This is the code i use for creating my dataloaders:
import torch
import os
import pickle
import numpy as np
from torch.utils.data import DataLoader, IterableDataset
from datasets import Dataset, ...
-2
votes
0
answers
10
views
How to annotate VQA dataset? [closed]
Can't find a useful annotation platform or software? How did everyone solve this problem. And how to organize the JSON of the dataset.
I used label me and label studio, but the results were not good, ...
-1
votes
0
answers
11
views
Seeking advice on creating datasets and tools for real-time Indian Sign Language conversion project [closed]
I am currently working on a project aimed at real-time conversion between audio/text and Indian Sign Language (ISL), utilizing 3D character animation. However, I am encountering challenges in two key ...
0
votes
0
answers
4
views
Text clustering and classification model
What I want to do - I have a dataset of different phrases, I want to train a model which would set a few clusters depending on the dataset and then cluster the data for that clusters. i have an ...
0
votes
0
answers
13
views
lmdb.InvalidParameterError: /data/project/hsi_foundation/HyperSIGMA/ImageDenoising/utility/WDC/wdc.db: Invalid argument
I am working on hypersigma github below. Please refer to the link below. By the way I have a problem.
HyperSIGMA github Link: https://github.com/WHU-Sigma/HyperSIGMA
The input is a dc.tif file ...
0
votes
0
answers
22
views
When using Google Colab, Python package 'datasets' just disappeared from virtualenv directory 'site-packages'
I'm using Google Colab and trying make a virtual environment to work.
My code is:
from google.colab import drive
drive.mount('/content/drive')
!pip install virtualenv
myenv_dir = '/content/...
1
vote
2
answers
38
views
How to keep the first appearance of a value while filtering everything else out in R?
This is the appearance of my dataset currently. I want to include patient 1 data until the first '1' occurs in 'test.result' then remove any information about patient 1 after that.
current dataset
...
0
votes
0
answers
10
views
Knowing the format of dataset a pretrained model was trained on
i am working on a Multilingual TTS project , and developing a TTS for my regional language by using a pretrained model from Hugging Face hub , the model i am trying to fine tune is facebook-mms-tts ...
0
votes
0
answers
11
views
Predicting new data with a model - preprocessing
I have studied a dataset and built a model. During the exploration I made several transformations and before training the model I applied a pandas pipe to my train dataset. This pipe includes ...
-1
votes
0
answers
14
views
OTBI Data Model Dataset Join [closed]
I am newly introduced to OTBI due to work (about 2 months ago). I have some SQL knowledge, but I am at a loss on how to join to datasets I have.
I want to list my budget and expenses (see screenshot) ...
0
votes
0
answers
14
views
Subset from NetCDF4 dataset - How
I have a netCDF4 dataset in python with 3 variables (height, lat, long).
I want all the lat and long values when height is 1.
I understand that is something like generate a new dataset when height is ...
0
votes
0
answers
26
views
Turn the Year columns into a one column with the years a feature using Pandas [duplicate]
I have a file with the area burned due to wildfires by month (rows) and Years as columns. The numbers in the dataset represent the area burned in hectares.
It looks like this:
How I can convert the ...
0
votes
0
answers
17
views
How to acquire real time data in python for ev charging data and from that how threshold values can be setup using Python dynamically changing [closed]
I am developing a program using python/ Pycharm environment, in which I am importing an Ontology with SWRl rules. And I have to use ev charging data in it but I donot have access to live/real-time ev ...
1
vote
1
answer
33
views
plotly isosurface problem based on data size: it doesn't plot but also doesn't give an error message
I am a beginner in python and was playing around with data visualization for a task at my job. Simply, I need to visualize a cloud of 3D data points and I decided Isosurface function meets my ...