InterDigital logo which acts as a link to the home page
  • Research & Innovation  
    • Overview
    • WIRELESS LAB
    • Video Lab
    • Emerging Technologies Lab
    • Talent
  • Thought Leadership  
    • Vault
    • Sustainability
    • Blog
    • Creators
  • About Us  
    • Overview
      • Leadership
      • History
      • Licensing
      • Contact
    • Government Solutions
    • Careers
    • Media
  • Investors  
    • Investor Relations
    • ESG / SUSTAINABILITY
  • Search

Results for Image Process




Results for Image Process

Metagrating solutions for full color single-plate waveguide combiner
RESEARCH PAPER / Feb 2022 / Immersive/AR/VR/MR, Image Processing
In this work we propose several full-color metagrating solutions for single waveguide-based Augmented and Virtual Reality near-eye display systems. The presented solutions are based on a combination of reflective and/or transmissive diffraction gratings inside or outside a waveguide. The proposed designs have high intensity across a wide angular range. Applying...
An Annotated Video Dataset for Computing Video Memorability
RESEARCH PAPER / Dec 2021 / Image processing
Using a collection of publicly available links to short form video clips of an average of 6 seconds duration each, 1,275 users manually annotated each video multiple times to indicate both longterm and short-term memorability of the videos. The annotations were gathered as part of an online memory game and...
Compact and Adaptive Multiplane Images for View Synthesis
RESEARCH PAPER / Sep 2021 / Video coding, Machine learning/ Deep learning /Artificial Intelligence, Image processing, Computer Graphics
Recently, learning methods have been designed to create Multiplane Images (MPIs) for view synthesis. While MPIs are extremely powerful and facilitate high quality renderings, a great amount of memory is required, making them impractical for many applications. In this paper, we propose a learning method that optimizes the available memory...
Deep learning applied to quad pixel plenoptic sensor
RESEARCH PAPER / Sep 2021 / Optics, Machine learning/ Deep learning /Artificial Intelligence, Image processing
In recent years, we have seen the development of integrated plenoptic sensors, where multiple pixels are placed under one microlens. It is mainly used by cameras and smartphones to drive the autofocus of the main lens, and it often takes the form of dual-pixels with 2 rectangular sub-pixels. We study...
Compressive 4D Light Field Reconstruction Using Orthogonal Frequency Selection
RESEARCH PAPER / Oct 2018 / Image Processing, Light Field, Computer Vision
We present a new method for reconstructing a 4D light field from a random set of measurements. A 4D light field block can be represented by a sparse model in the Fourier domain. As such, the proposed algorithm reconstructs the light field, block by block, by selecting frequencies of the...
Efficient Implementation of Enhanced Multiple Transforms For Video Coding
RESEARCH PAPER / Jun 2018 / Image Processing, Video Coding
Recently, the advances in transform coding have contributed to significant bitrate saving for the next generation of video coding. In particular, the combination of different discrete trigonometric transforms (DTT’s) was adopted in the Joint Video Exploration Team (JVET) solution, as well as the Bench-Mark Set (BMS) of the future video...
Color gamut compression for multiple production color gamuts
RESEARCH PAPER / Feb 2018 / Image Processing, Computer Vision, Color Management
A wide color gamut (WCG) display has great color rendering capability and offers the opportunity to achieve a pleasing and realistic appearance in terms of image quality. To take full advantage of the large display gamut, a new gamut extension algorithm (GEA) is proposed based on a new color appearance...
Region-based Prediction for Image Compression in the Cloud
RESEARCH PAPER / Dec 2017 / Image Processing, Video Coding
Thanks to the increasing number of images stored in the cloud, external image similarities can be leveraged to efficiently compress images by exploiting inter-images correlations. In this paper, we propose a novel image prediction scheme for cloud storage. Unlike current state-of-the-art methods, we use a semi-local approach to exploit inter-image...
Light-Field Surface Color Segmentation with an Application to Intrinsic Decomposition
RESEARCH PAPER / Nov 2017 / Image Processing, Light Field
To enable light fields of large environments to be captured, they would have to be sparse, i.e. with a relatively large distance between views. Such sparseness, however, causes subsequent processing to be much more difficult than would be the case with dense light fields. This includes segmentation. In this paper,...
Optical Center Estimation for Lenslet-Based Plenoptic Cameras
RESEARCH PAPER / Oct 2017 / Optics, Image Processing
Plenoptic cameras enable a variety of novel post-processing applications, including refocusing and single-shot 3D imaging. To achieve high accuracy, such applications typically require knowledge of intrinsic camera parameters. One such parameter is the location of the main lens' optical center relative to the sensor, which is required for modeling radially...
MoFA: Model-based deep convolutional face autoencoder for unsupervised monocular reconstruction
RESEARCH PAPER / Oct 2017 / Machine/Deep Learning/AI, Image Processing Computer Vision
In this work we propose a novel model-based deep convolutional autoencoder that addresses the highly challenging problem of reconstructing a 3D human face from a single in-the-wild color image. To this end, we combine a convolutional encoder network with an expert-designed generative model that serves as decoder. The core innovation...
A Single-Layer HDR Video Coding with SDR Backward Compatibility
RESEARCH PAPER / Oct 2017 / Image Processing, Video Coding, HDR
The proposed Single Layer SDR backward compatible HDR video distribution solution detailed in this paper, named SL-HDR1, and standardized in ETSI TS 103 433 specification, aims at addressing these issues. SL-HDR1 leverages SDR distribution networks and services already in place. It enables both high quality HDR rendering on HDR-enabled CE...
Super-Rays for Efficient Light-Field Processing
RESEARCH PAPER / Oct 2017 / Image processing, Light Field, Computer Vision, Volumetric Imaging
Light field acquisition devices allow capturing scenes with unmatched postprocessing possibilities. However, the huge amount of high-dimensional data poses challenging problems to light field processing in interactive time. In order to enable light field processing with a tractable complexity, in this paper, we address the problem of light field oversegmentation....
Scalable Light Field Compression Scheme Using Sparse Reconstruction and Restoration
RESEARCH PAPER / Sep 2017 / Image Processing, Light Field, Compression
This paper describes a light field scalable compression scheme based on the sparsity of the angular Fourier transform of the light field. A subset of sub-aperture images (or views) is compressed using HEVC as a base layer and transmitted to the decoder. An entire light field is reconstructed from this...
Learn to unify local and non-local signal processings with graph CNN
RESEARCH PAPER / Sep 2017 / Machine/Deep Learning/AI, Computer Vision, Image Process
This paper deals with the unification of local and non-local signal processing on graphs within a single convolutional neural network (CNN) framework. Building upon recent works on graph CNNs, we propose to use convolutional layers that take as inputs two variables, a signal and a graph, allowing the network to...
Optics on the go: Wearable optical technologies
RESEARCH PAPER / Sep 2017 / Optic, IOT, Image Processing
Wearable optical technologies are emerging to keep users safe, powered-up and entertained
CNN-BASED TRANSFORM SYNTAX PREDICTION IN ADAPTIVE MULTIPLE TRANSFORMS FRAMEWORK TO ASSIST ENTROPY CODING IN HEVC
RESEARCH PAPER / Aug 2017 / Machine/Deep Learning /AI, Image Processing, Video Coding
Recent work in video compression has shown that using multiple 2D transforms instead of a single transform in order to de-correlate residuals provides better compression efficiency. These transforms are tested competitively inside a video encoder and the optimal transform is selected based on the Rate Distortion Optimization (RDO) cost. However,...
Mixed Illumination Analysis in Single Image for Color Grading
RESEARCH PAPER / Jul 2017 / Image Processing, Computer Vision, Production Workflow
Rotoscoping, the detailed delineation of scene elements through a video shot, is a painstaking task of tremendous importance in professional post-production pipelines. While pixel-wise segmentation techniques can help for this task, professional rotoscoping tools rely on parametric curves that offer the artists a much better interactive control on the definition,...
ROAM: a Rich Object Appearance Model with Application to Rotoscoping
RESEARCH PAPER / Jul 2017 / Image Processing; Computer Vision, Production Workflow
Rotoscoping, the detailed delineation of scene elements through a video shot, is a painstaking task of tremendous importance in professional post-production pipelines. While pixel-wise segmentation techniques can help for this task, professional rotoscoping tools rely on parametric curves that offer the artists a much better interactive control on the definition,...
DEEP LEARNING FOR MULTIMODAL-BASED VIDEO INTERESTINGNESS PREDICTION
RESEARCH PAPER / Jul 2017 / Machine/Deep Learning/AI, Image Processing
Predicting interestingness of media content remains an important, but challenging research subject. The difficulty comes first from the fact that, besides being a high-level semantic concept, interestingness is highly subjective and its global definition has not been agreed yet. This paper presents the use of up-to-date deep learning techniques for...
CAMERA-AGNOSTIC FORMAT AND PROCESSING FOR LIGHT-FIELD
RESEARCH PAPER / Jul 2017 / Light Field, Image Processing, Volumetric Imaging
Light-field (LF) is foreseen as an enabler for the next generation of 3D/AR/VR experiences. However, lack of unified representation, storage and processing formats, variant LF acquisition systems and capture-specific LF processing algorithms prevent cross-platform approaches and constrain the advancement and standardization process of the LF information. In this work we...
Video Style Transfer by Adaptive Patch Sampling
RESEARCH PAPER / Jun 2017 / Image Processing, Computer Vision, Machine/Deep Learning/ AI
This paper addresses the example-based stylization of videos. Style transfer aims at editing an image so that it matches the style of an example. This topic has recently been investigated massively, both in the industry and academia. The difficulty lies in how to capture the style of an image. For...
Which saliency weighting for omni directional image quality assessment?
RESEARCH PAPER / May 2017 / Image processing, Human Machine Interface
With the explosion of Virtual Reality technologies, the production and usage of omni directional images (a.k.a 360 images) is presenting new challenges in the domains of compression, transmission and rendering. The evaluation of the quality of images generated by these technologies is therefore paramount. As the exploration of 360 images...
Dataset and Pipeline for Multi-View Light Field Video
RESEARCH PAPER / May 2017 / Image processing, Light Field, Computer Vision, Volumetric Imaging
The quantity and diversity of data in Light-Field videos makes this content valuable for many applications such as mixed and augmented reality or post-production in the movie industry. Some of such applications require a large parallax between the different views of the Light-Field, making the multi-view capture a better option...
An Image Rendering Pipeline for Focused Plenoptic Cameras
RESEARCH PAPER / May 2017 / Image processing, Light Field, Computer Vision, Volumetric Imaging
In this paper, we present a complete processing pipeline for focused plenoptic cameras. In particular, we propose 1) a new algorithm for microlens center calibration fully in the Fourier domain, 2) a novel algorithm for depth map computation using a stereo focal stack, and 3) a depth-based rendering algorithm that...
Perceptual Lightness Modeling for High Dynamic Range Imaging
RESEARCH PAPER / Apr 2017 / Image Processing, HDR
The human visual system (HVS) non-linearly processes light from the real world, allowing us to perceive detail over a wide range of illumination. Although models that describe this non-linearity are constructed based on psycho-visual experiments, they generally apply to a limited range of illumination and therefore may not fully explain...
Optimization of Sample Adaptive Band Offset in HEVC
RESEARCH PAPER / Apr 2017 / Image Processing, Video Coding
Summary form only given. This paper presents two sets of modifications to band offset type of the Sample Adaptive Offset technique in HEVC. First, some constraints on the SAO semantics are added to solve sub-optimal syntax issue and to exploit the actual range information of reconstructed samples. Next, the classification...
Adaptive Clipping in JEM
RESEARCH PAPER / Apr 2017 / Image Processing, Video Coding
This paper presents an adaptive clipping technique with optimized syntax in the video coding Joint Exploratory Model (JEM), which exploits the signal characteristics of the video sequence. The component-wise clipping bounds are coded for each slice. Two encoding methods leveraging the efficiency of the proposed technique are then described. The...
Scalable image coding based on epitomes
RESEARCH PAPER / Mar 2017 / Image Processing, Video Coding
In this paper, we propose a novel scheme for scalable image coding based on the concept of epitome. An epitome can be seen as a factorized representation of an image. Focusing on spatial scalability, the enhancement layer of the proposed scheme contains only the epitome of the input image. The...
Noisy Tensor Completion for Tensors With a Sparse Canonical Polyadic Factor
RESEARCH PAPER / Feb 2017 / Image Processing, Machine/Deep Learning/AI
“To be considered for the 2017 IEEE Jack Keil Wolf ISIT Student Paper Award.” In this paper we study the problem of noisy tensor completion for tensors that admit a canonical polyadic or CANDE-COMP/PARAFAC (CP) decomposition with one of the factors being sparse. We present general theoretical error bounds for...
A SINGLE-LAYER HDR VIDEO CODING FRAMEWORK WITH SDR COMPATIBILITY
RESEARCH PAPER / Feb 2017 / Image Processing; Video Coding, HRD
The migration from high-definition TV to ultrahigh definition (UHD) is already underway. In addition to an increase of picture spatial resolution, UHD potentially provides more color by introducing a wider color gamut, and better contrast by moving from standard dynamic range (SDR) to high dynamic range (HDR). The transition from...
Structured sampling and fast reconstruction of smooth graph signals
RESEARCH PAPER / Feb 2017 / Image Processing, Computer Vision, Machine/Deep Learning/AI
This work concerns sampling of smooth signals on arbitrary graphs. We first study a structured sampling strategy for such smooth graph signals that consists of a random selection of few pre-defined groups of nodes. The number of groups to sample to stably embed the set of $k$-bandlimited signals is driven...
Towards a Perceptually-Motivated Color Space for High Dynamic Range Imaging
RESEARCH PAPER / Dec 2016 / Image Processing, Computer Vision
To reproduce the appearance of real world scenes, a number of color appearance models have been proposed thanks to adapted psycho-visual experiments. Most of them were designed and intended for a limited dynamic range, or address only dynamic range compression applications. However, given the increasing availability of displays with higher...
Technicolor - Philips HDR solution
RESEARCH PAPER / Oct 2016 / Image Processing, Video Coding, HDR
HDR Solution presentation
Light Field Segmentation Using a Ray Based Graph Structure
RESEARCH PAPER / Oct 2016 / Image processing, Light Field, Computer Vision, Volumetric Imaging
In this paper, we introduce a novel graph representation forinteractive light field segmentation using Markov Random Field (MRF).The greatest barrier to the adoption of MRF for light field processing isthe large volume of input data. The proposed graph structure exploits theredundancy in the ray space in order to reduce the...
Supervised Learning Of Low-Rank Transforms For Image Retrieval
RESEARCH PAPER / Sep 2016 / Image Processing, Computer Vision, Machine/Deep Learning/AI
In this paper we propose a new method to automatically select the rank of linear transforms during supervised learning. Our approach relies on a sparsity-enforcing element-wise soft-thresholding operation applied after the linear transform. This novel approach to supervised rank learning has the important advantage that it is very simple to...
Clustering-Based Linear Mappings Learning For Quantization Noise Removal
RESEARCH PAPER / Sep 2016 / Image Processing, Video Coding
This paper describes a novel scheme to reduce the quantization noise of compressed videos and improve the overall coding performances. The proposed scheme first consists in clustering noisy patches of the compressed sequence. Then, at the encoder side, linear mappings are learned for each cluster between the noisy patches and...
On plenoptic sub-aperture view recovery
RESEARCH PAPER / Sep 2016 / Optics, Light Field, Image Processing, Computer Vision
Light field imaging is recently made available tothe mass market by Lytro and Raytrix commercial cameras.Thanks to a grid of microlenses put in front of the sensor, aplenoptic camera simultaneously captures several images of thescene under different viewing angles, providing an enormousadvantage for post-capture applications,e.g., depth estimationand image refocusing. In...
Multi-reference combinatorial strategy towards longer long-term dense motion estimation
RESEARCH PAPER / Sep 2016 / Image processing, Video Coding
This paper addresses the estimation of accurate long-term dense motion fields from videos of complex scenes. With computer vision applications such as video editing in mind, we exploit optical flows estimated with various inter-frame distances and combine them through multi-step integration and statistical selection (MISS). In this context, managing numerous...
HDR tutorial (EUSIPCO)
RESEARCH PAPER / Aug 2016 / Image Processing, Video Coding, HDR
HDR Tutorial
Overview of Color Gamut Scalability
RESEARCH PAPER / Aug 2016 / Image Processing, Video coding, Color Management
Displays' new rendering capabilities combined with the ever-growing number of video applications have fueled the emergence of new video formats addressing wider color gamut and larger frame size. Thus, the need in scalable compression technology to provide backward compatibility with legacy devices and capitalize on the superior compression performance of...
HDR video distribution with SDR backward compatibility
RESEARCH PAPER / Jun 2016 / Image Processing, Video Coding, HDR
The movie industry has been using Unmanned Aerial Vehicles as a new tool to produce more and more complex and aesthetic camera shots. However, the shooting process currently rely on manual control of the drones which makes it difficult and sometimes inconvenient to work with. In this paper we address...
High Dynamic Range and Wide Color Gamut video standardization - status and perspectives
RESEARCH PAPER / Apr 2016 / Image Processing, Video Coding, HDR
With the advent of ultra-high-definition TV services, high dynamic range (HDR) and wide color gamut (WCG) have become two highly desired image quality improvements for delivering immersive video experiences to the consumer mass market. Capture and rendering technologies have reached a level of maturity that now allows HDR and WCG...
HDR and WCG Video Coding in HEVC: Status and Potential Future Enhancements
RESEARCH PAPER / Dec 2015 / Image Processing, Video Coding, HDR
As the video industry begins deployment of ultrahigh-definition TV in both professional and consumer markets, including support for higher dynamic range and wider color gamut services is considered essential within the industry. Higher dynamic range and wider color gamut offer end users a significantly enhanced viewing experience by supporting intensity...
InterDigital Engineer Appointed as Associate Editor for IEEE Transactions on Image Processing
BLOG / Jul 2015 / IEEE, HEVC, SHVC, Image Processing / Posted By: Kelly Capizzi
The Institute of Electrical and Electronics Engineers (IEEE) Signal Processing Society’s publication, IEEE Transactions on Image Processing, is considered the flagship peer-reviewed publication in image processing… and has recently appointed our own Dr. Rahul Vanam, a staff engineer in InterDigital Labs, as Associate Editor for the 2015 – 2018 term. IEEE...

ready to
learn more?

CONTACT INVESTORS
RESEARCH & INNOVATION
  • Overview
  • Wireless Lab
  • Video Lab
  • Emerging Technologies Lab
  • Talent
THOUGHT LEADERSHIP
  • Vault
  • Blog
  • Creators
ABOUT US
  • Overview
  • Leadership
  • History
  • Licensing
  • Contact
  • Government Solutions
  • Careers
  • Media
  • LinkedIn
  • Twitter
InterDigital logo in white

   © COPYRIGHT 2023 INTERDIGITAL, INC. ALL RIGHTS RESERVED.

  • Privacy Policy
  • Forward Looking Statements
  • Legal Notices
  • Research & Innovation
    • Overview
    • Wireless Lab
    • Video Lab
    • Emerging Technologies Lab
    • Talent
  • Thought Leadership
    • Vault
    • Sustainability
    • Blog
    • Creators
  • About Us
    • Overview
    • Leadership
    • History
    • Licensing
    • Contact
    • Government Solutions
    • Careers
    • Media
  • Investors
    • Investor Relations
    • ESG / SUSTAINABILITY
  • Search