양고네 연구노트

학술

[11.27] Mutual information-based 3D object tracking

2009.11.22 by 양고
[11.26] A comparison of iterative 2D-3D pose estimation methods for real-time applications

2009.11.22 by 양고
[11.25] Relevance of interest points for eye position prediction on videos

2009.11.18 by 양고
[11.24] FaceL: Facile face labeling

2009.11.18 by 양고
[11.23] Recognizing gestures for virtual and real world interaction

2009.11.18 by 양고
[11.20] Video suggestion and discovery for YouTube: Taking random walks through the view graph

2009.11.18 by 양고
[11.18-19] Unsupervised natural experience rapidly alters invariant object representation in visual cortex

2009.11.18 by 양고
[11.16-17] Object detection by color histogram-based fuzzy classifier with support vector learning

2009.11.16 by 양고

[11.27] Mutual information-based 3D object tracking

IJCV 2008 Giorgio Panin and Alois Knoll 내용 기본적으로는 L-M 기반의 multi-level LSE tracker. SSD 또는 NCC 대신 Mutual Information (MI) 를 사용하여 정확성을 높이고 계산시간을 단축함. 계산시간 단축은 derivatives를 제공함으로써 별도 estimation할 필요가 없다는 점으로 설명이 되나, robustness의 증가는 이해할 수 없음. MI maximization이라는 것을 더 조사해야 알 수 있을 듯. SIFT 등의 keypoint 기반 방식과도 비교하였으나, 제안 방식은 initialization에 크게 좌우된다는 점에서 정확한 비교는 불가능할 듯. 추적 성능은 1~2 fps.

학술 2009. 11. 22. 19:28

[11.26] A comparison of iterative 2D-3D pose estimation methods for real-time applications

SCIA 2009 (LNCS) Denmark 애들 내용 comparison이라고 하지만 겨우 POSIT과 L-M (BIAS 라이브러리의 CamPoseCalib (CPC)) 비교. 제목에는 iterative가 들어가 있지만 겨우 두 개로는 민망했는지 Direct Linear Transform (DLT) 를 넣어서 세 가지 방식을 비교함. planar points를 위해 planar version of POSIT을 제안...이 아니고 가져옴. 결론: 대체로 CPC가 정확하다. POSIT은 starting pose가 필요 없다. 제목만 보고 제대로 낚임. 파닥파닥

학술 2009. 11. 22. 15:57

[11.25] Relevance of interest points for eye position prediction on videos

ICVS 2009 Gipsa-lab, France 내용 Spatial Interest Points (SIPs): Harris detector Spce-Time Interest Points (STIPs): Laptev detector 15명의 실험자가 53개의 비디오로부터 만들어진 305개의 비디오 조각을 관람. SR Research의 Eyelink II eye tracker(500Hz)를 사용. eye position density maps와 interest point detectors에 의한 interest maps의 비교를 위한 comparison metric: Normalized Scanpath Saliency (NSS).

학술 2009. 11. 18. 23:53

[11.24] FaceL: Facile face labeling

ICVS 2009 Colorado State University 내용 Face Detection → Eye Localization → Face Labeling. Face detection은 OpenCV cascade face detector를 튜닝해서 사용. Eye localization은 저자들이 CVPR 2009에 발표한 내용을 OSMU. Face labeling은 SVC (libsvm 사용). open source, 10 fps.

학술 2009. 11. 18. 23:09

[11.23] Recognizing gestures for virtual and real world interaction

ICVS 2009 MIT CSAIL, Toyota Research Institute 내용 Stereo → Articulated ICP → Recognition. model을 ICP로 3D data에 fitting할 때 multi-hypothesis framework 사용. N = 4 hypothesis is enough. recognition은 bag of features (BOF) 기반이며, temporal descriptor로 codewords의 histogram을 사용. histogram을 SVM으로 training. 물론 classes는 different gestures. 비교는 HMM과 Conditional Random Fields (CRF). 실시간으로 동작한다고.

학술 2009. 11. 18. 22:38

[11.20] Video suggestion and discovery for YouTube: Taking random walks through the view graph

WWW 2008 Google guys 내용 video co-view graph를 사용, YouTube의 recommendation video를 자동 생성한다. 실험은 92일 간의 YouTube 데이터를 사용 (46 days for training, 46 days for test - 무엇을 training하는지는 잘 모르겠다 - Lv, 아마도 user 간 선호도 유사성일 듯). 비교 알고리듬: Global Popularity (그냥 most popular videos), Local Popularity (co-views 고려. = Amazon.com). node 간에 label을 전파하는 (label propagation) Adsorption algorithm이란 것을 제안하여, precision(추천 비디..

학술 2009. 11. 18. 21:41

[11.18-19] Unsupervised natural experience rapidly alters invariant object representation in visual cortex

Science 2008 Nuo Li and James J. DiCarlo 내용 Responses of neurons from the inferior temporal cortex (IT) are selective to different objects, yet tolerant (invariant) to changes in object position, scale, and pose. How does the brain construct this neuronal tolerance? 원숭이 두 마리를 놓고 그림이 바뀌는 것을 반복적으로 보여줘서 position tolerance를 붕괴시키는 내용 (사람 가지고 실험하면... 안 되겠지?). 시간이 부족해서 제대로 읽지는 못했지만 reference로 넣으면 간지 +5..

학술 2009. 11. 18. 21:27

[11.16-17] Object detection by color histogram-based fuzzy classifier with support vector learning

Neurocomputing 2009 National Chung-Hsing University, Taiwan 내용 기본적으로 histogram + template matching. Contributions: The proposed non-uniform partition (in histogram) The use of Self-organizing Takagi-Sugeno-type fuzzy network with support vector learning (SOTFN-SV) classifier A splitting K-means clustering algorithm is proposed (to eliminate false alarms and determine object location). 비교: templa..

학술 2009. 11. 16. 13:08