Data Annotation and What Data Annotation Companies do
The Data Blog
by igor
2y ago
Data annotation is one of the core functions of machine learning. The more data an ML model is trained with, the more accurate it will become. Just like humans learn through training and practice, machine learning models are also trained by feeding them with huge volumes of data. One of the reasons Google is still the best search engine is because it has a lot of data compared to its competitors, including Yahoo and Bing (Microsoft’s search engine). With this data, Google is able to give users the best search results that match their search queries. Several other web apps also rely on data ann ..read more
Visit website
Rotoscoping: Hollywood’s video data segmentation?
The Data Blog
by igor
3y ago
In Hollywood, video data segmentation has been done for decades. Simple tricks such as color keying with green screens can reduce work significantly. In late 2018 we worked on a video segmentation toolbox. One of the common problems in video editing is oversaturated or too bright sky when shooting a scene. Most skies in movies have been replaced by VFX specialists. The task is called “sky replacement”. We thought this is the perfect starting point for introducing automatic segmentation to mask the sky for further replacement. Based on the gathered experience I will explain similarities in VFX ..read more
Visit website
Thank you for reading this blog
The Data Blog
by igor
3y ago
Since I started this blog at the beginning of February over 200 interested people visited the website. I’ll use this post to express my thanks to everyone. Since this is a data-driven blog and I like to be transparent let’s have a look from where the interested readers are coming from. Among the top 5 countries, we have the US, Switzerland (where I’m from), Germany, India, and France. Visitor statistics based on the country Now, let’s have a look at how they found out about this blog. A big chunk of visitors came from social networks or other news sites such as Hacker News, Reddit, Twitter. Mo ..read more
Visit website
List of Data Annotation Companies
The Data Blog
by igor
3y ago
An up to date and manually curated list of top data annotation companies from all over the world. Grouped by annotation type. I’ve been looking for data labeling for computer vision data. There are a lot of good companies offering services. Most of them follow similar principles such as outsourcing to countries with a cheaper labor cost. From my experience, there are mostly differences in the data type they focus on (images vs audio vs text) as well as the way they work. Unfortunately, the pricing isn’t always transparent. Usually, they charge per hour. For computer vision data this is usuall ..read more
Visit website
Humans Powering the Machines
The Data Blog
by igor
3y ago
There is a hype around AI built on top of recent success with deep learning. But there is one unsolved piece in the equation. AI needs to learn from humans. Small robots learning from humans. (Matan Segev, pexels.com) When I heard about machine learning for the first time, I thought it would just simply work like this. Let’s say I want to classify pictures of dogs or cats. I would show the model a picture of a dog and a picture of a cat and it would learn to separate the two classes. Unfortunately, that’s not the case. To get high accuracy with deep learning models trained from scratch we need ..read more
Visit website
Tools and Frameworks
The Data Blog
by igor
3y ago
Find the best tool for you out of a list of fast tools and frameworks for data annotation or labeling for images, videos, text (NLP) or audio. I had trouble getting a good overview of all the tools and frameworks around for data annotation so I created this list. I will try to keep it up to date. Computer Vision NLP Audio Others Open Source tools and frameworks Images Video LiDAR3D Text Audio Time Series MultiDomain Open Source Tools and Frameworks Here you find a list of open-source projects grouped by datatypes! Computer Vision Images Alturos.ImageAnnotation – A collaborativ ..read more
Visit website
A random forest image classifier in a day
The Data Blog
by igor
3y ago
Learn about how we did collect data and trained a random forest image classifier within a single day for Hack Zurich 2016. One of my first projects using my newly gathered know-how of machine learning was during HackZurich 2016. We built a sign digitizer which turned handwritten signs into word-like masterpieces. The final result can be seen below. The text detection used a cloud API, the sign recognition, however, used a custom model built using a random forest image classifier. Demo of our Hack Zurich 2016 project Data Collection First, we needed a dataset of signs. A quick Google search m ..read more
Visit website
Welcome to my blog about ML and data
The Data Blog
by igor
3y ago
My background and motivation for starting a blog about my personal experience and projects around data annotation and machine learning. Around 2014 my interests in machine learning started to grow. Just the idea of teaching a machine to do certain things instead of hard coding it fascinated me. To feed my hunger for more information I took the ever-growing (the number of students was almost doubling every second year) lecture “machine learning” at ETH Zurich held by Prof. Buhmann. We started with statistics, regression, moved on to support vector machines, bagging, boosting, random forest and ..read more
Visit website
Rotoscoping: Hollywood’s video data segmentation?
The Data Blog
by igor
4y ago
In Hollywood, video data segmentation has been done for decades. Simple tricks such as color keying with green screens can reduce work significantly. In late 2018 we worked on a video segmentation toolbox. One of the common problems in video editing is oversaturated or too bright sky when shooting a scene. Most skies in movies have been replaced by VFX specialists. The task is called “sky replacement”. We thought this is the perfect starting point for introducing automatic segmentation to mask the sky for further replacement. Based on the gathered experience I will explain similarities in VFX ..read more
Visit website
Protected: Hollywood’s way for video data segmentation?
The Data Blog
by igor
4y ago
This content is password protected. To view it please enter your password below: Password: The post Protected: Hollywood’s way for video data segmentation? appeared first on The Data Blog ..read more
Visit website

Follow The Data Blog on FeedSpot

Continue with Google
Continue with Apple
OR