Research interest

 

Overview Multiview Video Communication System

My research interests lie in all technical aspects of end-to-end 3D video communication services related to 3DTV and FTV, including multi-camera acquisition, 3D video data representation (multiview video, video-plus-depth, layered depth video, etc.), video transmission/coding (MPEG-2, H.264/AVC, H.264/MVC, JMVM, etc.), point cloud compression, and rendering on 3D displays (pre/post-processing, DIBR, autostereoscopic displays, etc.). Prior work has been published in journals and conference proceedings.

Overview Point Cloud Communication System

Recent publications

 
  • [journal] I. Daribo, R. Furukawa, R. Sagawa, H. Kawasaki, S. Hiura, N. Asada, "Efficient Rate-Distortion Compression of Dynamic Point Cloud for Grid-Pattern-Based 3D Scanning Systems", in 3D Research Journal, Vol. 3, Issue 1, Springer, 2012
  • [book] I. Daribo, H. Saito, R. Furukawa, S. Hiura, N. Asada, "Hole-Filling for View Synthesis", to appear in "3D-TV System with Depth-Image-Based Rendering" published by Springer, 2012
  • [book] I. Daribo, H. Saito, R. Furukawa, S. Hiura, N. Asada, "Effects of Wavelet-Based Depth Video Compression", to appear in "3D-TV System with Depth-Image-Based Rendering" published by Springer, 2012
  • [conf] I. Daribo, T. Maugey, G. Cheung, P. Frossard, "R-D Optimized Auxiliary Information for Inpainting-based View Synthesis", IEEE 3DTV-Conference, Zurich, Switzerland, Oct. 2012

Current research achievements

 
  • Compact 3D imaging data representation
  • Inpainting-based hole-filling method
  • Lossless edge codec (patent filing in process)
  • Compression of point cloud for 3D scanning systems
  • BRDF re-sampling

Context

 

3D context

Three-dimensional technologies, as the next revolution in visual technology, promise to bring customers a new generation of services. Improvements in multiview technologies have raised interest in 3D television (3DTV) and in free viewpoint video (FVV). While 3DTV offers depth perception of entertainment programs without requiring special glasses, FVV allows the user to freely change viewpoint position and viewing direction around a reconstructed 3D scene. Other target fields can be expected, such as digital cinema, IMAX theaters, medicine, dentistry, air-traffic control, military technologies, computer games, and more.

Meanwhile, digital TV and 3D displays have improved considerably in recent years, creating wide interest in multiview applications. Sharp, Sony and Sanyo, three Japanese companies, formed the 3D Consortium in March 2003 to support the development of 3D technologies. Japan again seems to be among the first countries in the world to bring 3DTV to market and to develop the FVV applications discussed above. Japan plans to make this a commercial reality by 2020.

Video context

In particular, multi-camera capture and the processing and coding of the acquired multiview video have become active research topics. The huge amount of data that multiview applications must process raises the problem of efficiently encoding a multiview video sequence.

High compression efficiency is achieved by exploiting both spatial and temporal redundancies. Temporally adjacent frames are often highly correlated. In the spatial domain, neighboring pixels are very similar, especially in homogeneous areas. A video encoder comprises three main functional units that together produce a compressed binary stream: temporal prediction, spatial transformation and entropy coding.

The temporal prediction estimates the motion between temporally adjacent frames, so that only the prediction residual needs to be coded. The spatial transformation maps the data into a transform domain, which removes spatial redundancy and provides a more compact representation of the data with a small number of values. The elements produced by the temporal prediction and the spatial transformation, denoted as symbols, are converted into a binary code and compressed by the entropy coder, which removes the statistical redundancy in the data.
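The three units described above can be illustrated with a toy example. The following is a minimal Python sketch under stated assumptions, not the pipeline of any particular standard: temporal prediction is done by full-search block matching, the spatial transformation is an orthonormal 2D DCT followed by uniform quantization, and the entropy coder is replaced by a Shannon entropy estimate of the resulting symbols. All function names, the block size, the search radius, the quantization step and the synthetic frames are hypothetical choices made for illustration.

```python
import math
from collections import Counter

def motion_search(ref, cur, top, left, size, radius):
    """Temporal prediction: full-search block matching minimising the SAD."""
    h, w = len(ref), len(ref[0])
    best_sad, best_mv = None, (0, 0)
    for dy in range(-radius, radius + 1):
        for dx in range(-radius, radius + 1):
            y0, x0 = top + dy, left + dx
            if y0 < 0 or x0 < 0 or y0 + size > h or x0 + size > w:
                continue  # candidate block falls outside the reference frame
            sad = sum(abs(cur[top + i][left + j] - ref[y0 + i][x0 + j])
                      for i in range(size) for j in range(size))
            if best_sad is None or sad < best_sad:
                best_sad, best_mv = sad, (dy, dx)
    return best_mv, best_sad

def dct_2d(block):
    """Spatial transformation: orthonormal 2D DCT-II of a square block."""
    n = len(block)
    def alpha(k):
        return math.sqrt(1 / n) if k == 0 else math.sqrt(2 / n)
    def basis(i, k):
        return math.cos((2 * i + 1) * k * math.pi / (2 * n))
    return [[alpha(u) * alpha(v) * sum(block[i][j] * basis(i, u) * basis(j, v)
                                       for i in range(n) for j in range(n))
             for v in range(n)] for u in range(n)]

def quantize(coeffs, step):
    """Uniform quantization of the transform coefficients into integer symbols."""
    return [[round(c / step) for c in row] for row in coeffs]

def entropy_bits(symbols):
    """Stand-in for the entropy coder: Shannon entropy in bits per symbol."""
    total = len(symbols)
    return sum((c / total) * math.log2(total / c)
               for c in Counter(symbols).values())

def encode_block(ref, cur, top, left, size=8, radius=2, step=4):
    """Predict a block from the reference frame, then transform and quantize the residual."""
    (dy, dx), _ = motion_search(ref, cur, top, left, size, radius)
    residual = [[cur[top + i][left + j] - ref[top + dy + i][left + dx + j]
                 for j in range(size)] for i in range(size)]
    symbols = [q for row in quantize(dct_2d(residual), step) for q in row]
    return (dy, dx), symbols

# Synthetic frames (assumption): the current frame is the reference frame
# shifted down by 1 pixel and right by 2 pixels.
H = W = 16
ref = [[(7 * x * x + 13 * y * y + 3 * x * y) % 251 for x in range(W)] for y in range(H)]
cur = [[ref[(y - 1) % H][(x - 2) % W] for x in range(W)] for y in range(H)]

mv, symbols = encode_block(ref, cur, top=4, left=4)
raw = [cur[4 + i][4 + j] for i in range(8) for j in range(8)]
print("motion vector:", mv)
print("bits/symbol, predicted and transformed:", entropy_bits(symbols))
print("bits/symbol, raw pixels:", entropy_bits(raw))
```

Because the synthetic motion is a pure translation, the prediction residual is exactly zero here and the symbols cost nothing to code. Real video leaves a non-zero residual, and the transform and quantization then concentrate it into a few significant coefficients before entropy coding.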

Keywords: video-plus-depth, MVC, DIBR, FVV, 3DTV, MMA Team, TSI, Telecom ParisTech