Real-time three-dimensional head position tracker system with stereo cameras using a face recognition neural network

Real-time three-dimensional head position tracker system with stereo cameras using a face recognition neural network(2004)

引用 25|浏览19
暂无评分
摘要
Creating credible virtual reality (VR) computer-generated worlds requires constant updating of the images in all displays to have the correct perspective for the user. To achieve this, the computer must know the exact position and orientation of the user's head. Examples of current techniques for addressing head tracking include magnetic and acousto-inertial trackers, both requiring the user to wear clumsy head-mounted sensors with transmitters and/or wires. This thesis describes a two-camera, video-based, tetherless 3D head position tracker system specifically targeted for both autostereoscopic displays and projection-based virtual reality systems. The user does not need to wear any sensors or markers. The head position technique is implemented using Artificial Neural Networks (ANN), allowing the detection and recognition of upright, tilted, frontal and non-frontal faces in the midst of visually cluttered environments. In developing a video-based object detector using machine learning, three main sub-problems arise: first, images of objects such as faces vary considerably with lighting, occlusion, pose, facial expression, and identity. Second, the system has to deal with all the variations in distinguishing objects (faces) from non-objects (non-faces). Third, the system has to recognize a target face from other possible faces so it can identify the correct user to track. This thesis introduces some solutions to problems in the face detection/recognition domain. For example, it discusses several Neural Networks (NN) per left and right channel, one for recognition, one for detection and one for tracking; real-time NN face and background training (new users have to spend only 2 minutes training before being able to use the system); infrared (IR) illumination (to further reduce image dependency cause by room lighting variation) and global image equalization (in place of local); algorithms highly tuned for the Intel Pentium IV vector processor; and a prediction module to achieve faster frame rates once a face has been recognized. The goal is to reach real-time tracking, in our case 30 frames per second (fps), at 640 by 480 video-image resolution. The system has been evaluated on an ongoing autostereoscopic Varrier™ display project achieving 30 frames per second (fps) at 320 x 240 video-image resolution and 90% tracking position rate.
更多
查看译文
关键词
real-time three-dimensional head position,face detection,projection-based virtual reality system,exact position,face recognition neural network,head tracking,head position technique,head position tracker system,new user,correct user,stereo camera,position rate,video-image resolution
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要