CV-Tricks.com on Feedspot

Technical overview of Image Synthesis: Stable Diffusion

CV-Tricks.com

by Ankit Sachan

1y ago

Text-to-image models like DALL-E, Imagen, and Stable Diffusion have recently attracted a lot of attention to image synthesis models. These models can generate impressive-looking images from benign-looking prompts. Here are a few typical examples of images from Stable Diffusion: Looking under the hood ...

MOTR: End-to-End Multi-Object Tracking with Transformers

CV-Tricks.com

by Ankit Sachan

1y ago

MOTR is a state-of-the-art, end-to-end multiple-object tracker that does not require any temporal association between objects of adjacent frames. It directly outputs the tracks of objects in a sequence of input images (video). MOTR uses Deformable DETR for object detection on a single image. To understand the architecture of MOTR it ...

GhostNetV2: Enhance Cheap Operation with Long-Range Attention

CV-Tricks.com

by Ankit Sachan

1y ago

GhostNetV2 is a recent SOTA architecture that brings long-range attention to the deep CNN frameworks used in ML tasks such as image classification, object detection, and video analysis. GhostNetV2 proposes a new attention mechanism called DFC attention to capture long-range spatial information, and it does so while keeping the implementation ...

Understanding CLIP by OpenAI

CV-Tricks.com

by Ankit Sachan

2y ago

CLIP by OpenAI. Introduction: Nearly all state-of-the-art visual perception algorithms rely on the same formula: (1) pretrain a convolutional network on a large, manually annotated image classification dataset; (2) fine-tune the network on a smaller, task-specific dataset. This technique has been widely used for several years and has led to impressive improvements on numerous tasks. ...

Using Active Learning to Improve your Machine Learning Models

CV-Tricks.com

by Ankit Sachan

2y ago

Machine Learning Reality Check: In the machine learning world, or more broadly the AI universe, the colonists, such as data scientists, machine learning engineers, and deep learning specialists, are coached toward one belief: "More training data means a highly accurate production model". This is true to some extent, but it is also a fact that ...

Understanding and improving Image to Image Translation Pix2PixHD

CV-Tricks.com

by Ankit Sachan

2y ago

Introduction: Photo-realistic image rendering using standard graphics techniques requires realistic simulation of geometry and light. The algorithms we currently use for the task are effective but expensive. If we were able to render photo-realistic images using a model learned from data, we could turn the process of graphics rendering into a model learning and ...

Understanding StyleGAN for Image Generation using Deep Learning

CV-Tricks.com

by Ankit Sachan

2y ago

Introduction: Images produced by generative methods have been improving lately. Most recent generative algorithms use generative networks that are trained with a discriminator network as their adversary. Generative Adversarial Networks (GANs), or generators in other words, are a relatively new concept in the field of computer vision. Their aim is ...

State-Of-The-Art Text to Image Generation using DALL-E

CV-Tricks.com

by Ankit Sachan

2y ago

DALL-E: Creating images from text. Code: openai/DALL-E, a PyTorch package for the discrete VAE used for DALL·E. Paper: https://arxiv.org/pdf/2102.12092.pdf. What is DALL-E? On 5 January 2021, OpenAI unveiled its novel text-to-image generation model, DALL-E. This model is capable of generating various types of images from textual descriptions. A humongous 12 ...

Training Image Classification 8x Faster With NFNet

CV-Tricks.com

by Ankit Sachan

2y ago

Introduction: Anyone who has deployed a neural network in production knows that deploying a network is easy, but making sure that it stays updated as new user data flows in is a harder task. It involves frequently retraining the network on new incoming data, and in such a case being able to train faster is ...

Training Object Detectors using TensorFlow Object Detection API

CV-Tricks.com

by Ankit Sachan

2y ago

Machine learning algorithms are everywhere around you. The recommendations you receive on YouTube, commute-time estimates, face detection in Google Photos, and many other features that make our lives easier would not have been possible without advances in machine learning algorithms. Machine learning can be further classified into various fields; many ...
