Paper Piecing Pattern Making Tutorial

A Leader’s Framework for Decision Making

In January 1993, a gunman murdered seven people in a fast-food restaurant in Palatine, a suburb of Chicago. In his dual roles as an administrative executive and spokesperson for the police department, ...

GitHub

SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference

In vision-language models (VLMs), visual tokens usually incur significant computational overhead, despite their lower information density compared to text tokens. To address this, ...

IEEE

PVNet: Pixel-Wise Voting Network for 6DoF Object Pose Estimation

Abstract: This paper addresses the problem of instance-level 6DoF object pose estimation from a single RGB image. Many recent works have shown that a two-stage approach, which first detects keypoints ...

IEEE

A Survey on Vision Transformer

Abstract: Transformer, first applied to the field of natural language processing, is a type of deep neural network mainly based on the self-attention mechanism. Thanks to its strong representation ...
