Skip to main content

Questions tagged [computer-vision]

Use this tag for questions related to Computer Vision -- any aspect of software that enables computers to perceive, understand and react to their environment using cameras. For questions related to image filtering and quantification, use the tag [image-processing] instead.

0 votes
0 answers
3 views

Why we use concatenation in U net

Unet Architecture why we use concatenation in unet and how did we get this grey lines. pls can someone elaborate this I'm new to this I tried U net architecture but I'm unable to understand how we get ...
Manoj Nayak's user avatar
-1 votes
0 answers
14 views

How do I compare two big Temperature-Matrix-Datasets and classify the difference/similarities? How do I proceed finding a good method? [closed]

I have the following issue: I want to compare thermal pictures (data is present as a big Matrix/ each Pixel one value) in python and understand weather they are different. What statistical approaches ...
Luiza's user avatar
  • 1
-3 votes
0 answers
47 views

How to add horizontal lines to the table in image opencv

I want to add horizontal lines to the below image resultant output should be like: Any methods on how to work it out, even if there are diff images which don't have horizontal lines. I've tried ...
Elysian's user avatar
  • 19
0 votes
0 answers
15 views

How to segment an image based on the spacing between objects?

I am a beginner in object detection and currently working on a project to segment engineering design drawings and export those segments as PNG files. For this drawing, I need to create three bounding ...
Kiki's user avatar
  • 13
-2 votes
0 answers
11 views

How to annotate VQA dataset? [closed]

Can't find a useful annotation platform or software? How did everyone solve this problem. And how to organize the JSON of the dataset. I used label me and label studio, but the results were not good, ...
user25097261's user avatar
-2 votes
0 answers
13 views

Seeking advice on creating datasets and tools for real-time Indian Sign Language conversion project [closed]

I am currently working on a project aimed at real-time conversion between audio/text and Indian Sign Language (ISL), utilizing 3D character animation. However, I am encountering challenges in two key ...
Devi anugraga's user avatar
-3 votes
0 answers
28 views

Why the Intel Iris performs better than RTX 3050 laptop GPU in training a Yolov8 model [closed]

I have Dell G-15 Gaming laptop with Core i5 12500h and Rtx 3050. I am new to deep learning and AI. I am training a model with Yolo v8 for Automated Number Plate Detection with a custom dataset of 28k ...
Ahsan Irfan's user avatar
-1 votes
0 answers
37 views

Improve signature quality extracted using OpenCV from scanned sheet paper

I extracted a signature from a scanned sheet of paper. Users know there should be only the signature on a white sheet of paper. I implemented the signature cropping from the uploaded image using the ...
Paolo's user avatar
  • 553
1 vote
0 answers
32 views

Improving VTK Virtual Camera Setup for Accurate Mesh Visualization

I am working on a project where I need to visualize a 3D object mesh using a virtual camera in VTK, aiming to match the real-world camera setup. I use SolvePNP to obtain the rotation vector (rvec) and ...
CupidONO's user avatar
  • 182
-1 votes
0 answers
17 views

What type of camera or computer vision hardware/software be needed to track a human looking for a specific movement [closed]

I need to find a camera and a corresponding computer vision technology that can track a human until a specific action is made, in which case it will trigger another system event. For example, this ...
Daniel Jakubowski's user avatar
-2 votes
0 answers
29 views

OpenCV program cannot detect more than one frame [closed]

I wanted to try using opencv for some simple face recognition (not specific faces, just if it sees a human face or not). I was using this model, installed both opencv and opencv-contrib, and it does ...
Costi2m's user avatar
-1 votes
0 answers
36 views

What input for KLDivLoss

I have a CNN architecture and wish to use the Kullback-Leibler loss (KLDivLoss from pytorch) to compare output tensors and target tensors (grayscale images). I am a bit confused as to what the format ...
Witaek's user avatar
  • 43
3 votes
1 answer
59 views

How can I optimize the process to find the best 4-sided shape that contains my mask?

I am currently working on a project where I need to detect pool table and pool balls from a video frame. I then need to recreate the state of the game in a 2D mini-map, and in order to do so I need ...
LordGrim's user avatar
0 votes
0 answers
14 views

Video Object Segmentation Accuracy

I have a question for measuring accuracy of video segmentation models. I see that these models use 2 measures of success generaly: Jaccard Index and Contour Accuracy. However, I don't really ever see ...
Joe Tsai's user avatar
0 votes
0 answers
26 views

Generate graph edges based on region boundaries on image

I want to analyze experiment data of particle packing, and specifically calculate things like coordination number. I've managed to autodetect the centroids of each particle using OpenCV thresholding ...
Viviana Palacio Betancur's user avatar

15 30 50 per page
1
2 3 4 5
1053