Mobile Animatronics Telepresence System
The Previous System
Inhabitor Station
In the inhabitor station, the user's head pose (especially its orientation) must be tracked continuously to control the avatar head on the mobile avatar side. Currently, the user has to wear a helmet carrying optical trackers, which provide the inhabitor's position and orientation, and a camera that captures frontal face imagery, as shown in Fig. 1.
Figure 1: User in the inhabitor station wearing a helmet with optical trackers and a camera.
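For illustration, the control path from inhabitor to avatar can be thought of as a continuous stream of timestamped head poses sent to the avatar side. The minimal sketch below (Python) shows one way to pack and stream a tracked pose over UDP; the packet layout, address, and port are assumptions, not the system's actual protocol.

    import socket
    import struct
    import time

    # Hypothetical endpoint for the mobile-avatar side; address and port are assumptions.
    AVATAR_ADDR = ("192.168.1.50", 9000)

    def send_head_pose(sock, position, quaternion, timestamp):
        """Pack a tracked head pose (3D position in metres, orientation as a
        unit quaternion x, y, z, w) and stream it to the avatar over UDP."""
        payload = struct.pack("<d3f4f", timestamp, *position, *quaternion)
        sock.sendto(payload, AVATAR_ADDR)

    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
    # In practice the pose values would come from the optical trackers on the helmet.
    send_head_pose(sock, (0.0, 1.6, 0.0), (0.0, 0.0, 0.0, 1.0), time.time())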
Mobile Avatar
Compared with the previous avatar, the current avatar adopts rear projection instead of front projection, and the projector is rigidly fixed relative to the face-shaped projection surface. Currently, the alignment of the projected image with the face-shaped surface (its size, position, and rotation) is accomplished manually, which is not user-friendly. Some misalignment still remains between the projected image and the face-shaped surface, distorting the appearance of the avatar, as shown in Fig. 2(a) and (b). When the inhabitor speaks or changes expression, this misalignment becomes more pronounced. Moreover, due to inter-reflection and specular reflection, the appearance of the projected avatar face is not homogeneous; Fig. 2(b) shows some of these errors, especially the bright specular spot in the eye region.
Figure 2: Some imperfections of the projected avatar face: (a) & (b) misalignment.
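The manual alignment described above (size, position, and rotation of the projected image) amounts to a 2D similarity transform applied to the rendered face before it is sent to the projector. A minimal sketch using OpenCV is shown below; the function name and the example parameter values are hypothetical, not the system's actual calibration.

    import cv2
    import numpy as np

    def align_projection(face_img, scale, angle_deg, tx, ty, out_size):
        """Apply manual alignment parameters (uniform scale, in-plane rotation,
        and translation in projector pixels) to the rendered face image."""
        h, w = face_img.shape[:2]
        M = cv2.getRotationMatrix2D((w / 2.0, h / 2.0), angle_deg, scale)
        M[:, 2] += (tx, ty)  # shift after rotating/scaling about the image centre
        return cv2.warpAffine(face_img, M, out_size)

    # Example: parameters found by hand for one projector/surface setup (assumed values).
    aligned = align_projection(cv2.imread("face_render.png"), 0.98, 1.5, 12, -8, (1024, 768))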
Unencumbered Head Pose and Body Posture Estimation
The emergence of low-cost 3D sensors (e.g., the Kinect) makes it possible to achieve acceptable-quality 3D capture and pose estimation of a human head and body for many applications, without encumbering the user with sensors or markers. It would be useful to explore the use of such devices for real-time head pose estimation for local head control of the NTU prototype Physical-Virtual Avatar (PVA), without regard for the appearance (imagery) of the head. In addition, the same technology could be used for remote body control of the UNC RoboThespian RT-3; this will require exploration of both the body capture and the RT-3 control. Finally, real-time head pose information, combined with camera imagery, could be used for face modeling and deformation.
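As a rough illustration of marker-free head control, head orientation can be approximated from a few skeleton joints reported by a depth sensor. The sketch below (Python/NumPy) is a deliberately coarse simplification: the joint names, example coordinates, and orthonormalisation scheme are illustrative assumptions, and a dedicated face/head tracker would give a more accurate estimate.

    import numpy as np

    def head_orientation_from_joints(head, neck, shoulder_l, shoulder_r):
        """Rough head orientation (3x3 rotation matrix) from skeleton joint
        positions reported by a depth sensor such as the Kinect.
        All joints are 3D points in the sensor's coordinate frame."""
        up = head - neck
        up /= np.linalg.norm(up)
        right = shoulder_r - shoulder_l
        right /= np.linalg.norm(right)
        forward = np.cross(right, up)          # roughly the facing direction
        forward /= np.linalg.norm(forward)
        right = np.cross(up, forward)          # re-orthogonalise the frame
        return np.column_stack((right, up, forward))

    # Example joint positions in metres (assumed values for illustration).
    R = head_orientation_from_joints(np.array([0.0, 0.55, 2.0]),
                                     np.array([0.0, 0.40, 2.0]),
                                     np.array([-0.2, 0.30, 2.05]),
                                     np.array([0.2, 0.30, 2.05]))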
Dynamic Face Modeling and Expression Deformation
While modern depth sensors are noteworthy in many ways, obtaining an accurate real-time dynamic 3D face model remains a challenging problem. In general, the quality of a single frame is not sufficient to generate a reasonable 3D face model, and there is little or no temporal coherence (filtering or fusion). It would be useful to explore the use of the Kinect or other sensors to build up a parametric model of the human head, with evolving dynamic textures, that could be rendered onto the PVA using the head pose information. One important factor is temporal coherence: the geometry of the model (its parameters) should evolve in a way that simultaneously affords a stable base head model and yet allows for shape changes due to facial expressions. This might be accomplished by using repeated poses and depth information, accumulating and refining the model over time. A general model of the human head, perhaps a parametric model, may be employed as prior knowledge to simplify the problem.
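One simple way to obtain temporal coherence, assuming incoming depth frames have already been registered into a common view using the head pose, is per-pixel exponential fusion with a confidence weight that down-weights noisy or expression-driven changes. The sketch below is illustrative only; the blending factor and the definition of the confidence map are assumptions, not part of the proposed method.

    import numpy as np

    def fuse_depth(base_depth, new_depth, confidence, alpha=0.05):
        """Temporal fusion of registered depth frames into a stable base model.
        base_depth: accumulated per-pixel depth (metres); new_depth: the current
        frame registered into the same view; confidence: per-pixel weight in [0, 1]
        (e.g. low where the sensor reported no data or where the residual is
        large, as with transient expression changes)."""
        w = alpha * confidence
        valid = new_depth > 0                      # ignore pixels with no depth
        fused = base_depth.copy()
        fused[valid] = (1 - w[valid]) * base_depth[valid] + w[valid] * new_depth[valid]
        return fused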
Direct Face Mapping to Avatar Head |
The current approach used to dynamically map a real human face to the face of the PVA depends on a full 3D model of the real human head, a full 3D model of the PVA head, and very precise (in space and time) dynamic head tracking via a head-worn marker system. One of the dominant goals of the project is to un-encumber the user, while
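For reference, the mapping chain implied by this approach can be written as two transforms: from the tracker/world frame into the tracked human-head frame, and from the human head model into the PVA head frame. The sketch below is a simplified rigid version with hypothetical names; the actual mapping also involves the full 3D models of both heads rather than a single rigid registration.

    import numpy as np

    def map_to_avatar(p_world, T_head_world, T_avatar_head):
        """Map a 3D point measured in the world/tracker frame onto the PVA head
        frame: first into the tracked human-head frame, then through a fixed
        registration between the human head and the PVA head.
        T_head_world and T_avatar_head are 4x4 homogeneous transforms."""
        p = np.append(p_world, 1.0)
        p_head = np.linalg.inv(T_head_world) @ p   # world -> human head frame
        p_avatar = T_avatar_head @ p_head          # human head -> PVA head frame
        return p_avatar[:3]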
Photometric Issues
Errors in appearance (color and/or luminance) arise as a result of light being projected onto an opaque surface or through a translucent head material. The sources of error include inter-reflection, specular highlights, and interior (within the head material) diffusion and scattering of light. It should be possible to model and calibrate for some of these error sources, potentially using a camera, and then add a post-rendering correction that adapts the luminance and color throughout the image.
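A minimal form of such a correction is a per-pixel linear model fitted from camera captures of the surface under full-white and black projection, registered into projector space. The sketch below illustrates only that idea; it ignores inter-reflection and subsurface scattering, which would need a more elaborate model, and the function and variable names are hypothetical.

    import numpy as np

    def photometric_correction(target, captured_white, captured_black):
        """Per-pixel post-rendering correction: given camera captures of the
        surface under full-white and black projection (registered to projector
        space), compute the compensation image whose projection should make the
        observed result approximate the target. Values are floats in [0, 1]."""
        gain = np.clip(captured_white - captured_black, 1e-3, None)  # avoid divide-by-zero
        comp = (target - captured_black) / gain
        return np.clip(comp, 0.0, 1.0)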