What makes realistic video?

3D? Nice! High dyanmic range (HDR) imaging? Great! Any thoughts on 3D HDR?

Stereoscopic 3D technology.

Aiming at real-life experience.

High dynamic range imaging.

Struggling to determine a correct exposure? It is time to consider HDR.

Living in a digitized world.

The "Digital Orca" (by Douglas Coupland) at Convention Centre, Vancouver, BC, Canada.

Research Interests:

My research interests are in the broad field of digital image and video processing. I am currently focusing on 3D video capturing and processing, 3D quality of experience, and high dynamic range imaging.

On-going Research Projects:

Quality of Experience for the Horizontal Pixel Parallax Adjustment of Stereoscopic 3D Videos

When capturing stereoscopic 3D video, video distortions can be avoided by setting up the cameras in parallel. Unfortunately, the resulting 3D scene appears to be in front of the screen, which is uncomfortable for the viewers. This problem can be solved by modifying the horizontal pixel parallax, which brings the picture on and behind the screen. We employ an algorithm that extracts and matches features from both left and right video sequences to automatically shift the video frames so that the closest object to the screen has zero pixel parallax. We then perform subjective tests to show that this parallax-adjusting process significantly increases the 3D quality of experience by 19.86%.

Top of page
Quality of Experience of Stereoscopic Content on Displays of Different Sizes: A Comprehensive Subjective Evaluation

3D-capable devices have entered the consumer market. The depth that is perceived with stereoscopic content is strongly linked to the size of the screen in which it is displayed. We tested the effect that different display sizes have on the quality of experience of 3D content and we offer some recommendations.

Top of page
Guidelines for Capturing High Quality Stereoscopic Content Based on a Systematic Subjective Evaluation

Although 3D TVs have already been introduced to the consumer market, the availability of stereoscopic (3D) content remains a serious challenge. Another challenge is that while some manufacturers are introducing 3D cameras and Hollywood is using proprietary solutions, there are no guidelines for consistently capturing high quality stereoscopic content. In this paper, we present a comprehensive stereoscopic image database with content captured at various distances from the camera lenses. We conducted subjective tests to assess the perceived 3D quality of these images which were shown on displays of different sizes. Finally, we provide guidelines of acquisition distances between the cameras and the real scene.

Top of page
Developing 3D visual attention model
3D HDR imaging

Past Research Projects:

Correction of clipped pixels in color images

Conventional images store a very limited dynamic range of brightness. The true luma in the bright area of such images is often lost due to clipping. When clipping changes the R, G, B color ratios of a pixel, color distortion also occurs. In this paper, we propose an algorithm to enhance both the luma and chroma of the clipped pixels. Our method is based on the strong chroma spatial correlation between clipped pixels and their surrounding unclipped area. After identifying the clipped areas in the image, we partition the clipped areas into regions with similar chroma, and estimate the chroma of each clipped region based on the chroma of its surrounding unclipped region. We correct the clipped R, G, or B color channels based on the estimated chroma and the unclipped color channel(s) of the current pixel. The last step involves smoothing of the boundaries between regions of different clipping scenarios. Both objective and subjective experimental results show that our algorithm is very effective in restoring the color of clipped pixels.

Top of page
Logo insertion transcoding for H.264/AVC compressed video

H.264/AVC quickly gains ground in many aspects of video applications, due to its superior coding performance. Inserting a company logo into H.264/AVC compressed video streams has been a highly desirable application in the TV telecasting industry. In this paper, we propose a novel and efficient logo-insertion scheme for H.264/AVC compressed videos. Our proposed scheme overcomes the numerous coding dependencies, and minimizes the changes to the original compressed videos. Experimental results show that our proposed transcoding scheme achieves extraordinary video quality and significantly reduces the bit rate and computational cost. Compared with the cascaded transcoding scheme, our proposed logo-insertion method achieves an average of 1.16 dB PSNR increase, or a 68.6% bit-rate reduction. Our scheme also dramatically reduces the total transcoding time and the motion-estimation time by at least 67.2% and 97.4%, respectively.

Top of page
Adaptive multiscale image representations using normal meshes

The normal-mesh-based image coder of Jansen, Baraniuk, and Lavu (JBL) is studied. First, the JBL coder is introduced, and several of its shortcomings are identiﬁed. Then, to address these shortcomings, three modiﬁcations to this coder are proposed, namely, the use of a data-dependent base mesh, an integer representation for normal/vertical offsets, and a different scan-conversion scheme based on bicubic interpolation. Experimental results show that these proposed changes lead to improved coding performance in terms of both objective and subjective image-quality measures. In particular, the use of a data-dependent base mesh helps to locate horizons more quickly and preserve image edges better. The number of bits required to encode the normal/vertical offsets is reduced by representing this information with integers (as opposed to real numbers). Lastly, bicubic interpolation is found to yield higher-quality image reconstructions, while still maintaining sharp edges.

Top of page
An optimization-based method for the design of high-performance filter banks for image coding

H.264/AVC quickly gains ground in many aspects of video applications, due to its superior coding performance. Inserting a company logo into H.264/AVC compressed video streams has been a highly desirable application in the TV telecasting industry. In this paper, we propose a novel and efficient logo-insertion scheme for H.264/AVC compressed videos. Our proposed scheme overcomes the numerous coding dependencies, and minimizes the changes to the original compressed videos. Experimental results show that our proposed transcoding scheme achieves extraordinary video quality and significantly reduces the bit rate and computational cost. Compared with the cascaded transcoding scheme, our proposed logo-insertion method achieves an average of 1.16 dB PSNR increase, or a 68.6% bit-rate reduction. Our scheme also dramatically reduces the total transcoding time and the motion-estimation time by at least 67.2% and 97.4%, respectively.

Top of page
Different optimization approaches for equiripple filter design
Sparse geometric signal representations with bandelets
Overcomplete-basis optimization for sparse signal representations
Finite-field wavelet tranforms and their applications
Image denoising using 2D non-separable wavelets and 2D dual-tree complex wavelets

Contact Information

E-Mail:	`dixu at ece.ubc.ca`
Phone:	+1 (604) 827 4878
Lab:	x310, ICICS/CS Building, 2366 Main Mall University of British Columbia Vancouver, BC, V6T 1Z4, CANADA

Quick Navigation

Home	Research
Resume	Publications
Personal	Teaching

Di Xu

University of British Columbia

What makes realistic video?

Stereoscopic 3D technology.

High dynamic range imaging.

Living in a digitized world.

Research Interests:

On-going Research Projects:

Quality of Experience for the Horizontal Pixel Parallax Adjustment of Stereoscopic 3D Videos

Quality of Experience of Stereoscopic Content on Displays of Different Sizes: A Comprehensive Subjective Evaluation

Guidelines for Capturing High Quality Stereoscopic Content Based on a Systematic Subjective Evaluation

Past Research Projects:

Correction of clipped pixels in color images

Logo insertion transcoding for H.264/AVC compressed video

Adaptive multiscale image representations using normal meshes

An optimization-based method for the design of high-performance filter banks for image coding

Contact Information

Links

Quick Navigation