I have minimal experience with Deep Learning and Computer Vision. My only exposure was a few months at my job developing some basic CNNs and working on some image deconvolution tasks. Despite this limited experience, I added a bullet point to my resume about using CNNs.
I've had a few interviews recently in which I was asked about my experience with DL/CNNs, and I was honest that I only have a little. But those roles seemed to have DL as just one component of the job.
I have an upcoming interview for a role in which DL seems to be the primary component. Although it sounds very interesting, I'm concerned I won't be qualified, given my minimal experience with DL.
What are some interview questions I should expect to know about DL?
I am also looking for a cheap stereo camera system that does essentially what an NDI Polaris optical tracker does: it emits IR light, the retroreflective (RRT) markers reflect that light, and the reflected points are tracked by a single stereo camera. This is very common in medical tool tracking.
I've found systems that do SLAM or stereo vision (e.g., MYNT EYE (with IR), ZED, etc., from this previous post) and that work great for VR applications. When I asked MYNT EYE about tracking RRT markers, they said their system doesn't do that very well. I wonder whether they understood what I was asking.
I already have a rig with these markers. The reason I don't just use the NDI system above is, of course, expense. I also want a more mobile system, so I can't buy a set of Vicon trackers, etc., to set up.
Additionally, I could perhaps code some kind of sphere detection algorithm and combine it with the IR input to get accurate estimates from a MYNT EYE system, but I would of course prefer software that works out of the box. This problem has been solved, and we shouldn't have to reinvent the wheel. Also #timeconstraints.
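For reference, here is a rough sketch of what that do-it-yourself route might look like. It is only an illustration under several assumptions that are not from any particular camera SDK: the two IR images are already rectified, each region of interest contains exactly one bright marker, and the focal length, baseline, and principal point are known from calibration. The function names and parameters are hypothetical.

```python
def marker_centroid(image, threshold):
    """Intensity-weighted centroid (x, y) of all pixels brighter than threshold.

    `image` is a list of rows of grayscale values. Assumes a single marker
    is visible; returns None if no pixel exceeds the threshold.
    """
    total = sum_x = sum_y = 0.0
    for row_idx, row in enumerate(image):
        for col_idx, value in enumerate(row):
            if value > threshold:
                total += value
                sum_x += value * col_idx
                sum_y += value * row_idx
    if total == 0:
        return None
    return sum_x / total, sum_y / total


def triangulate_rectified(left_xy, right_xy, focal_px, baseline_m, cx, cy):
    """3D point (in the left camera frame, metres) from one matched marker.

    Assumes rectified images, so the marker lies on the same row in both
    views and depth follows from the horizontal disparity: Z = f * B / d.
    """
    disparity = left_xy[0] - right_xy[0]
    if disparity <= 0:
        raise ValueError("disparity must be positive for a point in front of the rig")
    z = focal_px * baseline_m / disparity
    x = (left_xy[0] - cx) * z / focal_px
    y = (left_xy[1] - cy) * z / focal_px
    return x, y, z


# Toy example: a two-pixel blob in a tiny 4x5 IR image.
ir_patch = [
    [0, 0, 0, 0, 0],
    [0, 0, 200, 200, 0],
    [0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0],
]
centroid = marker_centroid(ir_patch, threshold=128)

# Hypothetical calibration values: 800 px focal length, 10 cm baseline.
point = triangulate_rectified((420.0, 240.0), (400.0, 240.0),
                              focal_px=800.0, baseline_m=0.1, cx=320.0, cy=240.0)
```

A real setup would still need multi-marker segmentation, left/right matching, and a proper calibration, which is exactly the work I would rather not redo.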
I would appreciate thoughts on camera systems you have used for this type of thing, or code that you might know of that already exists or any other thoughts. Thanks.
Hello guys, I'm a newbie in computer vision. I'm working on detecting products on supermarket shelves, and I'm having trouble with stacked products. For example, when one bottle of milk is behind another, we can only see its cap. How should I draw the bounding box of the blocked bottle? Around the cap only? Or around its expected full position? I'm using Faster R-CNN. Any help will be appreciated!
Hello, for OCR processing of documents, I'd like to be able to define text boundaries on a single template and then map those boundaries onto any copy of that document after it has been filled in and scanned.
For this purpose, I need to match a scanned document with its template and retrieve a transformation matrix. I'm using OpenCV and Python.
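To make the goal concrete, here is a minimal sketch of the kind of result I'm after. It assumes matched point pairs between template and scan are already available (for example, corresponding line intersections) and that the scan differs from the template only by scale, rotation, and translation, which may not hold for photographed or warped pages. It is pure Python rather than OpenCV, and the function names are hypothetical; OpenCV's own estimators such as `cv2.estimateAffinePartial2D` or `cv2.findHomography` would be the more robust choice when there are outliers.

```python
def fit_similarity(template_pts, scan_pts):
    """Least-squares similarity transform mapping template points to scan points.

    Each 2D point is treated as a complex number, so the model is q = a * p + b
    with complex a (scale and rotation) and b (translation).
    Returns a 2x3 matrix as nested lists.
    """
    if len(template_pts) != len(scan_pts) or len(template_pts) < 2:
        raise ValueError("need at least two matched point pairs")
    p = [complex(x, y) for x, y in template_pts]
    q = [complex(x, y) for x, y in scan_pts]
    p_mean = sum(p) / len(p)
    q_mean = sum(q) / len(q)
    denom = sum(abs(pi - p_mean) ** 2 for pi in p)
    if denom == 0:
        raise ValueError("template points must not all coincide")
    a = sum((qi - q_mean) * (pi - p_mean).conjugate()
            for pi, qi in zip(p, q)) / denom
    b = q_mean - a * p_mean
    return [[a.real, -a.imag, b.real],
            [a.imag, a.real, b.imag]]


def apply_transform(matrix, pts):
    """Apply a 2x3 affine matrix to a list of (x, y) points."""
    (m00, m01, m02), (m10, m11, m12) = matrix
    return [(m00 * x + m01 * y + m02, m10 * x + m11 * y + m12) for x, y in pts]


# Toy correspondences: the scan is the template rotated 90 degrees,
# scaled by 2, and shifted by (10, 20).
template_pts = [(0, 0), (100, 0), (100, 50), (0, 50)]
scan_pts = [(10, 20), (10, 220), (-90, 220), (-90, 20)]
matrix = fit_similarity(template_pts, scan_pts)

# Map a text boundary defined on the template (two opposite corners) into the scan.
field_box = [(20, 10), (60, 30)]
field_box_in_scan = apply_transform(matrix, field_box)
```

The open part of my problem is getting reliable correspondences in the first place, since the template has no obvious landmarks.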
The template already contains text and has no recognizable logo or fixation cross that I could use for pattern matching. I had to extract the layout of the documents using a line segment detector, which worked pretty well. The two images displayed below are the line reconstructions:
I want to learn computer vision from scratch in C++ rather than Python. I've already started reading Multiple View Geometry in Computer Vision by Richard Hartley, and I'm looking for a proper pathway to a career in CV using C++. Any suggestions on the best pathway from a theoretical perspective are also welcome. Thanks!!