Understanding Stable Diffusion and ControlNet for a Bar Conversation
the Serious Computer Vision Blog
by Gooly
4M ago
(By Li Yang Ku) In my last post I talked about how generative diffusion models (such as DALLE, Imagen, and Stable Diffusion) work. I also mentioned that I would talk about specific models and tools like Stable Diffusion and ControlNet. I admit that this second post took a bit longer than I expected, mostly due to my laziness ..read more
Visit website
Generative Diffusion Models: Explain to me like I am 35
the Serious Computer Vision Blog
by Gooly
1y ago
(By Li Yang Ku) It’s interesting times to be in the field of Computer Vision. In the past I judge the quality of a Computer Vision publication based on it’s accuracy on benchmarks and the number of citations. Now I also consider how popular it is on Reddit and Youtube. With all the Computer Vision ..read more
Visit website
Vicarious Publications
the Serious Computer Vision Blog
by Gooly
1y ago
by Li Yang Ku I worked at Vicarious, a robotics AI startup, from mid 2018 till it was acquired by Alphabet in 2022. Vicarious was a startup founded before the deep learning boom and it had been approaching AI through a more neuroscience based graphical model path. Nowadays it is definitely rare for AI startups ..read more
Visit website
Consciousness and Intelligence
the Serious Computer Vision Blog
by Gooly
1y ago
By Li Yang Ku In the past I’ve always avoided to make comments about consciousness. My view was that due to consciousness being internal to ourselves it is extremely difficult if not impossible to evaluate scientifically. Also, why talk about consciousness when we couldn’t even understand intelligence? However, some recent readings have changed my view ..read more
Visit website
Visual Loop Machine
the Serious Computer Vision Blog
by Gooly
2y ago
by Li Yang Ku Visual Loop Machine is my new side project since the Rap Machine I made that completes rap sentences. It is a tool that plays visual loops generated by StyleGAN2 along music in real-time. One of the reasons I started this project was because I’ve been waiting for visual effect/mixing software like ..read more
Visit website
The Quest to Finding “The” Object Representation for Robot Manipulation
the Serious Computer Vision Blog
by Gooly
2y ago
By Li Yang Ku For many researchers in the field of Computer Vision, coming up with “the” object representation is a lifetime goal. An object representation is the result of mapping an Image to a feature space such that an agent can recognize or interact with these object. The field came a long way from ..read more
Visit website
BARS 2021 Paper Picks
the Serious Computer Vision Blog
by Gooly
2y ago
I was at the Bay Area Robotics Symposium (BARS) at Stanford in person last week. It’s nice to see real person even though there is a mask mandate (which could be a good thing since the audience won’t be biased by the speaker’s look.) Faculty talks can be found in the video below. My recommended ..read more
Visit website
Transformer for Vision
the Serious Computer Vision Blog
by Gooly
2y ago
By Li Yang Ku In my previous post I talked about this web app I made that can generate rap lyrics using the transformer network. Transformer is currently the most popular approach for natural language related tasks (I am counting OpenAI’s GPT-3 as a transformer extension.) In this post I am going to talk about ..read more
Visit website
Task and Motion Planning
the Serious Computer Vision Blog
by Gooly
3y ago
By Li Yang Ku In this post I’ll briefly go through the problem of Task and Motion Planning (TAMP) and talk about some recent works that try to tackle it. One of the main motivation of solving the TAMP problem is to allow robots to solve household tasks like the robot Rosey in the cartoon ..read more
Visit website
Paper Picks: RSS 2020
the Serious Computer Vision Blog
by Gooly
3y ago
by Li Yang Ku Just like CVPR, RSS (Robotics: Science and Systems) is virtual this year and all the videos are free of charge. You can find all the papers here and corresponding videos on the RSS youtube page once you finished bingeing Netflix, Hulu, Amazon Prime, and Disney+. In this post, I am going ..read more
Visit website

Follow the Serious Computer Vision Blog on FeedSpot

Continue with Google
Continue with Apple
OR