Questions tagged [computer-vision]
Use this tag for questions related to Computer Vision -- any aspect of software that enables computers to perceive, understand and react to their environment using cameras. For questions related to image filtering and quantification, use the tag [image-processing] instead.
computer-vision
15,784
questions
0
votes
0
answers
3
views
Why we use concatenation in U net
Unet Architecture
why we use concatenation in unet and how did we get this grey lines. pls can someone elaborate this I'm new to this
I tried U net architecture but I'm unable to understand how we get ...
-1
votes
0
answers
14
views
How do I compare two big Temperature-Matrix-Datasets and classify the difference/similarities? How do I proceed finding a good method? [closed]
I have the following issue: I want to compare thermal pictures (data is present as a big Matrix/ each Pixel one value) in python and understand weather they are different. What statistical approaches ...
-3
votes
0
answers
47
views
How to add horizontal lines to the table in image opencv
I want to add horizontal lines to the below image
resultant output should be like:
Any methods on how to work it out, even if there are diff images which don't have horizontal lines.
I've tried ...
0
votes
0
answers
15
views
How to segment an image based on the spacing between objects?
I am a beginner in object detection and currently working on a project to segment engineering design drawings and export those segments as PNG files.
For this drawing, I need to create three bounding ...
-2
votes
0
answers
11
views
How to annotate VQA dataset? [closed]
Can't find a useful annotation platform or software? How did everyone solve this problem. And how to organize the JSON of the dataset.
I used label me and label studio, but the results were not good, ...
-2
votes
0
answers
13
views
Seeking advice on creating datasets and tools for real-time Indian Sign Language conversion project [closed]
I am currently working on a project aimed at real-time conversion between audio/text and Indian Sign Language (ISL), utilizing 3D character animation. However, I am encountering challenges in two key ...
-3
votes
0
answers
28
views
Why the Intel Iris performs better than RTX 3050 laptop GPU in training a Yolov8 model [closed]
I have Dell G-15 Gaming laptop with Core i5 12500h and Rtx 3050. I am new to deep learning and AI. I am training a model with Yolo v8 for Automated Number Plate Detection with a custom dataset of 28k ...
-1
votes
0
answers
37
views
Improve signature quality extracted using OpenCV from scanned sheet paper
I extracted a signature from a scanned sheet of paper. Users know there should be only the signature on a white sheet of paper.
I implemented the signature cropping from the uploaded image using the ...
1
vote
0
answers
32
views
Improving VTK Virtual Camera Setup for Accurate Mesh Visualization
I am working on a project where I need to visualize a 3D object mesh using a virtual camera in VTK, aiming to match the real-world camera setup. I use SolvePNP to obtain the rotation vector (rvec) and ...
-1
votes
0
answers
17
views
What type of camera or computer vision hardware/software be needed to track a human looking for a specific movement [closed]
I need to find a camera and a corresponding computer vision technology that can track a human until a specific action is made, in which case it will trigger another system event. For example, this ...
-2
votes
0
answers
29
views
OpenCV program cannot detect more than one frame [closed]
I wanted to try using opencv for some simple face recognition (not specific faces, just if it sees a human face or not). I was using this model, installed both opencv and opencv-contrib, and it does ...
-1
votes
0
answers
36
views
What input for KLDivLoss
I have a CNN architecture and wish to use the Kullback-Leibler loss (KLDivLoss from pytorch) to compare output tensors and target tensors (grayscale images).
I am a bit confused as to what the format ...
3
votes
1
answer
59
views
How can I optimize the process to find the best 4-sided shape that contains my mask?
I am currently working on a project where I need to detect pool table and pool balls from a video frame.
I then need to recreate the state of the game in a 2D mini-map, and in order to do so I need ...
0
votes
0
answers
14
views
Video Object Segmentation Accuracy
I have a question for measuring accuracy of video segmentation models. I see that these models use 2 measures of success generaly: Jaccard Index and Contour Accuracy. However, I don't really ever see ...
0
votes
0
answers
26
views
Generate graph edges based on region boundaries on image
I want to analyze experiment data of particle packing, and specifically calculate things like coordination number. I've managed to autodetect the centroids of each particle using OpenCV thresholding ...