What defines a good versus a bad sensory experience –particularly if you want to charge a premium for it? That’s a difficult question when physical, digital and virtual worlds merge. It depends on a human’s subjective perceptions as well as a network’s capabilities. Human-centered quality metrics are urgently needed to...
"5G new radio (NR) sidelink (SL) is envisioned as a key enabler of high speed, low latency applications like automated driving. To meet the high data rate requirements for such applications, SL support at mmWave and sub-THz frequencies with large bandwidths will be essential. Several enhancements and optimizations will be...
"In a free space optics (FSO) based drone assisted mobile network, a drone mounted base station (DBS) can be rapidly deployed over a place of interest to relay traffic between mobile users and a macro base station (MBS), and FSO is applied as the fronthaul solution between the DBS and...
This paper presents experimentation results related to adaptive video content mapping used as compression tool of HDR-PQ content. The purpose of adaptive video content mapping is to adapt the video signal dynamically depending on its statistical properties in order to better exploit the signal codewords range. Adaptive video content mapping...
Video services traffic over the internet has drastically grown these past years, currently representing more than 80% of the internet bandwidth [1]. The massive usage of unicast delivery leads to network congestion that can result in poor quality of experience for the viewer, high delivery cost for operators and increased...
We report a new neural backdoor attack, named Hibernated Backdoor, which is stealthy, aggressive and devastating. The backdoor is planted in a hibernated mode to avoid being detected. Once deployed and fine-tuned on end devices, the hibernated backdoor turns into the active state that can be exploited by the attacker....
"The increased mobile connectivity, the range and number of services available in various computing environments in the network, demand mobile applications to be highly dynamic to be able to efficiently incorporate those services into applications, along with other local capabilities on mobile devices. However, the monolithic structure and mostly static...
The future of manufacturing industry highly depends on digital systems that convert existing production and monitoring systems to autonomous systems with stringent requirements in terms of availability, reliability, security, low latency, positioning with high-accuracy and so on. In order to provide such requirements, a private 5G network deployment is considered...
In this work we propose several full-color metagrating solutions for single waveguide-based Augmented and Virtual Reality near-eye display systems. The presented solutions are based on a combination of reflective and/or transmissive diffraction gratings inside or outside a waveguide. The proposed designs have high intensity across a wide angular range. Applying...
Cellular technology evolution across multiple generations has consistently sustained the global economic growth. Each of these generations has uniquely contributed to boosting the world Gross Domestic Product (GDP) by addressing key industry pain-points and enabling new business opportunities not possible with previous generation networks. 6G will not be an exception...
We propose XRC, an explicit rate control algorithm that overcomes the poor performance of commonly used TCP variants in cellular networks. XRC exploits an explicit feedback from the radio access network that is aware of the physical, network and transport layer information of all UEs as well as resource distribution...
RESEARCH PAPER / Jan 2022
/
Immersive / AR/VR/MR,
Light Field,
Volumetric Imaging,
Machine learning/ Deep learning /Artificial Intelligence
Recently, learning methods have been designed to create Multiplane Images (MPIs) for view synthesis. While MPIs are extremely powerful and facilitate high quality renderings, a great amount of memory is required, making them impractical for many applications. In this paper, we propose a learning method that optimizes the available memory...
Facial caricature is the art of drawing faces in an exaggerated way to convey emotions such as humor or sarcasm. Automatic caricaturization has been explored both in the 2D and 3D domain. In this paper, we present the first study of facial mesh caricaturization techniques. In addition to a user...
Waveguide based optical combiners for Augmented Reality (AR) glasses are integrating several Surface Relief Gratings (SRG) whose pitch sizes can be as small as 200 nm for the blue wavelength. All SRG components exploit the first diffraction order to couple in and out or to deviate the light. We present...
Traditional authentication mechanisms use passwords, PINs and biometrics, but these only authenticate at the point of entry. Continuous authentication schemes instead allow systems to verify identity and mitigate unauthorised access continuously. However, recent developments in generative modelling can significantly threaten continuous authentication systems, allowing attackers to craft adversarial examples to...
RESEARCH PAPER / Dec 2021
/
Computer Vision,
Machine learning/ Deep learning /Artificial Intelligence
This paper describes the MediaEval 2021 Predicting Media Memorability task. After first being proposed at MediaEval 2018, the Predicting Media Memorability task is in its 4th edition this year, as the prediction of short-term and long-term video memorability remains a challenging task. This year, two datasets of videos are used:...
Techniques which leverage channel state information (CSI) at a transmitter to adapt wireless signals to changing propagation conditions have been shown to improve the reliability of modern multiple input multiple output (MIMO) communication systems. Due to the difficulty of estimating downlink CSI at the transmitter in many wireless systems, CSI...
AI will become an essential part of our lives in the next few years, with the promise of delivering super-intelligent computers that exceed human analytical abilities. This is, however, several years away; indeed, the industry has only just embarked upon understanding what’s possible. Arguably the hype surrounding AI thus far...
Using a collection of publicly available links to short form video clips of an average of 6 seconds duration each, 1,275 users manually annotated each video multiple times to indicate both longterm and short-term memorability of the videos. The annotations were gathered as part of an online memory game and...
In this work, we propose a novel deep learning-based framework for the adaptive configuration of DMRS used for MIMO CCE. Specifically, a neural network architecture is proposed at the terminal side which based on statistical learning indicates to the network the preferred configuration of DMRS in time, frequency, and code...
RESEARCH PAPER / Oct 2021
/
Computer Graphics,
Machine learning/ Deep learning /Artificial Intelligence
Human character animation is often critical in entertainment content production, including video games, virtual reality or fiction films. To this end, deep neural networks drive most recent advances through deep learning (DL) and deep reinforcement learning (DRL). In this article, we propose a comprehensive survey on the state-of-the-art approaches based...
Open radio access network (Open RAN) technology has emerged as an important focus for the mobile industry over the past 18 months. Its rapid rise to prominence reflects strong interest from many mobile carriers seeking to reduce their reliance on a very small number of traditional technology suppliers and to...
RESEARCH PAPER / Oct 2021
/
Audio processing,
Neural network,
Machine learning/ Deep learning /Artificial Intelligence
Music source separation is the task of isolating individual instruments which are mixed in a musical piece. This task is particularly challenging, and even state-of-the-art models can hardly generalize to unseen test data. Nevertheless, prior knowledge about individual sources can be used to better adapt a generic source separation model...
The backdoor attack raises a serious security concern to deep neural networks, by fooling a model to misclassify certain inputs designed by an attacker. In particular, the trigger-free backdoor attack is a great challenge to be detected and mitigated. It targets one or a few specific samples, called target samples,...
High-level understanding of stories in video such as movies and TV shows from raw data is extremely challenging. Modern video question answering (VideoQA) systems often use additional human-made sources like plot synopses, scripts, video descriptions or knowledge bases. In this work, we present a new approach to understand the whole...
RESEARCH PAPER / Oct 2021
/
Computer Vision,
Neural network,
Machine learning/ Deep learning /Artificial Intelligence
High quality facial attribute editing in videos is a challenging problem as it requires the modifications to be realistic and consistent throughout the video frames. Previous works address the problem with auto-encoder architectures and rely on adversarial training to ensure the attribute editing and the temporal consistency of the results....
Next generation Terabit per second (Tbps) wireless communication systems aim for sub-THz and THz bands where noise and hardware impairments are known to be dominantly non-linear and lack accurate closed form analytical models. The physical layer (PHY) blocks designed under linearity and Gaussian noise assumptions will fall short of satisfying...
RESEARCH PAPER / Sep 2021
/
Video coding,
Compression,
Machine learning/ Deep learning /Artificial Intelligence Neural network
Despite many modern applications of Deep Neural Networks (DNNs), the large number of parameters in the hidden layers makes them unattractive for deployment on devices with storage capacity constraints. In this paper we propose a Data-Driven Low-rank (DDLR) method to reduce the number of parameters of pretrained DNNs and expedite...
RESEARCH PAPER / Sep 2021
/
Video coding,
Machine learning/ Deep learning /Artificial Intelligence,
Image processing,
Computer Graphics
Recently, learning methods have been designed to create Multiplane Images (MPIs) for view synthesis. While MPIs are extremely powerful and facilitate high quality renderings, a great amount of memory is required, making them impractical for many applications. In this paper, we propose a learning method that optimizes the available memory...
RESEARCH PAPER / Sep 2021
/
Optics,
Machine learning/ Deep learning /Artificial Intelligence,
Image processing
In recent years, we have seen the development of integrated plenoptic sensors, where multiple pixels are placed under one microlens. It is mainly used by cameras and smartphones to drive the autofocus of the main lens, and it often takes the form of dual-pixels with 2 rectangular sub-pixels. We study...
RESEARCH PAPER / Aug 2021
/
Neural network,
Machine learning/ Deep learning /Artificial Intelligence,
Computer Vision
Deep neural networks (DNNs) have recently achieved great success in many machine learning tasks including computer vision and speech recognition. However, existing DNN models are computationally expensive and memory demanding, hindering their deployment in devices with low memory and computational resources or in applications with strict latency requirements. In addition,...
Reconfigurable intelligent surfaces (RIS) have the ability to steer the electromagnetic waves to a desired direction. This enables the improvement of the wireless link performance by allowing the illumination of receivers otherwise shadowed by buildings or hills. In this paper, a link level simulations are used to study the performance...
RESEARCH PAPER / Aug 2021
/
Video coding,
Compression,
Machine learning/ Deep learning /Artificial Intelligence
Deep bi-prediction blending. This paper presents a learning-based method to improve bi-prediction in video coding. In conventional video coding solutions, block-based motion compensation blocks from already decoded reference pictures stand out as the main tool used to predict the current frame. Especially, bi-predicted blocks, i.e. blocks that combine two different...
This article proposes multiple multi-domain solutions to deploy private SG networks for vertical industries. Different models of deploying SG private networks are presented, covering deployment on local premises as well as their interconnection with public networks of mobile network operators. Building on a set of industry verticals (comprising Industry 4.0,...
RESEARCH PAPER / Jul 2021
/
5G,
Wireless communication,
Machine learning/ Deep learning /Artificial Intelligence
Building upon on a digital transformation, Industry 4.0 (I4.0) aims to build the factories of the future, which feature additional flexibility, increasingly connected infrastructures and automated processes. 5G is playing a paramount role in this transformation, as it can offer high bandwidth, reliable and low latency wireless connectivity to meet...
Intelligent radio surface (IRS) is a programmable structure that can be used to control the propagation of electromagnetic waves by changing the electric and magnetic properties of the surface. By placing these surfaces in an environment, the properties of radio channels can be, at least partially, controlled. This opens new...
Joint communication and sensing allows the utilization of common spectral resources for communication and localization, reducing the cost of deployment. By using 5G New Radio (NR) (i.e., the 3rd Generation Partnership Project (3GPP) Radio Access Network for fifth generation) reference signals, conventionally used for communication, sub-meter precision localization is possible...
New Radio (NR) supports operations at high frequency bands (e.g., millimeter wave frequencies) by using narrow beam based directional transmissions in order to compensate high propagation losses at such frequencies. Due to the limited spatial coverage with each beam, the broadcast transmission of paging in NR is performed using beam...
In this paper, we propose a novel approach for real-time radar-based human activities detection and classification. In this approach, first, the radar transceiver is mounted on the room’s ceiling leading to considerable variations of the relative received power as the subject perform the different activities. Second, to exploit the different...
RESEARCH PAPER / Dec 2020
/
["Volumetric Imaging",
"Machine learning/ Deep learning /Artificial Intelligence"]
We present a novel learning-based approach to synthesize new views of a light field image. In particular, given the four corner views of a light field, the presented method estimates any in-between view. We use three sequential convolutional neural networks for feature extraction, scene geometry estimation and view selection. Compared...
The traditional role of a communication engineer is to address the technical problem of transporting bits reliably overa noisy channel. With the emergence of 5G, and the availability of a variety of competing and coexisting wireless systems, wireless connectivity is becoming a commodity. This article argues that communication engineers in...
RESEARCH PAPER / Dec 2020
/
5G,
Machine learning/ Deep learning /Artificial Intelligence,
Network and Communications
This document describes the winning solution to the GNN Challenge 2020 organized by the Barcelona Neural Networking Center for the ITU Artificial Intelligence/Machine Learning in 5G Challenge. We first describe our methodology, then give the set of hyper-parameters that allowed us to achieve the best score with an average relative...
From its Nobel prize winning discovery by Denis Gabor over half a century ago, Holography has been alternately put in the spotlight as a promising technique for its capacity of displaying 3D scenes, to be later forgotten regarding the complexity of performing holographic recording outside from optical laboratories. The later...
Blockchain technology is a technology that jointly leverages and builds on top of a few existing techniques such as cryptography, hashing, Peer-to-Peer (P2P) networking, smart contract, and consensus protocols. Fully distributed control and management capability of the 5G/6G wireless systems may be enabled by blockchain technology, especially in the untrusted...
RESEARCH PAPER / Dec 2020
/
Network and Communications,
5G,
Wireless communication,
Computing and Optimization
For Internet operators, on-line service providers and end-users, representative operational measurements are crucial to monitor and diagnose the performance of networks and on-line services. While numerous approaches have been proposed to measure performance, only a few works fully adopt an end-user perspective by taking measurements from within web browsers. In...
RESEARCH PAPER / Nov 2020
/
5G,
Wireless communication,
Network and Communications,
Computing and Optimization
This paper investigates the task management for cooperative mobile edge computing (MEC), where a set of geographically distributed heterogeneous edge nodes not only cooperate with remote cloud data centers but also help each other to jointly process tasks and support real-time IoT applications at the edge of the network. Especially,...
In this presentation we’ll talk about the development of Beam Management and describe the system attributes related to performing the initial beam establishment. We will also review the supported 3GPP requirements and limitations of the current implementation. We’ll then share some details about the beam management implementation done on the...
Representation of 3D scenes is gaining popularity in industry, notably for Virtual Reality, Augmented Reality, and 360° Video. The point cloud format is well suited for such representations. Indeed, point clouds can be created with a simple capture process and modest processing, enabling a real-time, end-to-end point cloud distribution chain....
We describe a multi-user system enabling instant messaging in Augmented Reality. A user can get in contact with another one without requiring his/her identification number and can easily localize the person initiating the contact. It is also possible to exchange various types of personal information in a private manner. This...
The emergence of edge computing introduces new complexity in the creation of distributed mobile applications. Application designers can now deploy application functionality in three or more tiers of compute infrastructure to optimize bandwidth, latency, cost, user experience and privacy for their users and their own operations. However, the diversity of...
We introduce a system to assign navigation tasks to a self-moving robot using an Augmented Reality (AR) application running on a smartphone. The system relies on a robot controller and a central server hosted on a PC. The user points at a target location in the phone camera view and...
The world has witnessed significant change since the dawn of the industrial revolution. Life expectancy has more than doubled; travel across the planet can happen in less than a day; loved ones can be reached via a video screen and vast quantities of information can be accessed at the touch...
Representation of 3D scenes is gaining popularity in industry, notably for Virtual Reality, Augmented Reality, and 360° Video. The point cloud format is well suited for such representations. Indeed, point clouds can be created with a simple capture process and modest processing, enabling a real-time, end-to-end point cloud distribution chain....
In this paper we address the problem of view synthesis from large baseline light fields, by turning a sparse set of input views into a Multi-plane Image (MPI). Because available datasets are scarce, we propose a lightweight network that does not require extensive training. Unlike latest approaches, our model does...
Face age editing has become a crucial task in film post-production, and is also becoming popular for general purpose photography. Recently, adversarial training has produced some of the most visually impressive results for image manipulation, including the face aging/de-aging task. In spite of considerable progress, current methods often present visual...
This paper presents CompressAI, an open-source library that provides custom operations, layers, models and tools to research, develop, and evaluate end-to-end image and video codecs. In particular, CompressAI includes pre-trained models and evaluation tools to compare learned methods with traditional codecs. Multiple models from the state-of-the-art on learned end-to-end image...
arxiv version of https://interdigital.sharepoint.com/sites/RI/Lists/ID Publications/DispForm.aspx?ID=794
RESEARCH PAPER / Sep 2020
/
Wireless communication,
Network and Communications,
Computing and Optimization
Mobile edge computing (MEC) is an emerging paradigm that integrates computing resources in wireless access networks to process computational tasks in close proximity to mobile users with low latency. In this paper, we propose an online double deep Q networks (DDQN) based learning scheme for task assignment in dynamic MEC...
In this paper, a simple compact radiation pattern reconfigurable patch antenna is presented. The proposed design is composed of two back-to-back identical patches surrounded by a conductive strip, etched on both sides of a dielectric substrate. It has the same dimensions as a classical patch. Thanks to the use of...
Human Pose Estimation is a low-level task useful for surveillance, human action recognition, and scene understanding at large. It also offers promising perspectives for the animation of synthetic characters. For all these applications, and especially the latter, estimating the positions of many joints is desirable for improved performance and realism....
5G promises to bring back value to the network operator and not only serve as a data pipe solution anymore. Operators have started to adopt cloud-native paradigms for the deployment of the 5G core architecture. With it goes the introduction of the service-based architecture as a key design pattern for...
The development of tactile screens opens new perspectives for co-located images and haptic rendering, leading to the concept of “haptic images.” They emerge from the combination of image data, rendering hardware, and haptic perception. This enables one to perceive haptic feedback while manually exploring an image. This raises nevertheless two...
In visual effects it is usually required to process multiple film plates in order to reproduce the performance of real actors using a digital 3D character. One of the most challenging aspects of this process is the complexity of facial animation. Motivated by the work required to capture and digitally...
Zero-forcing precoding is a commonly-used multiple-user, multiple-input multiple-output beamforming technique. Applying such precoding, the signal-to-interference-plus-noise ratio (SINR) statistics with outdated channel state information, which involve various projections related to the multi-dimensional channel vectors and precoding vectors, have never been explicitly derived. In this paper, for the multiple-input single-output scenario with...
The upcoming MPEG Immersive Video (MIV) standard will enable storage and distribution of immersive video content over existing and future networks, for playback with 6 full or partial degrees of freedom of view position and orientation. The demo showcases a VOD server streaming MIV encoded immersive video contents up to...
Efficient Nearest Neighbor (NN) search in high-dimensional spaces is a foundation of many multimedia retrieval systems. A common approach is to rely on Product Quantization, which allows the storage of large vector databases in memory and efficient distance computations. Yet, implementations of nearest neighbor search with Product Quantization have their...
This paper presents a novel planar compact integrated four-sector antenna for sub-6GHz 5G indoor access and content distribution over 2.45GHz/ 5.5GHz dual-band WiFi in 2x2 MIMO configuration. The four sectors, for 5G applications, are obtained via Vivaldi antennas with ground reuse for achieving very compact dimensions. A compact two-port slot...
The global economy is experiencing rapid changes during the first years of the 21st century, with new technologies, concepts, and behaviors now changing the way we live and work. Blockchain, self-driving cars, Artificial Intelligence, and machine learning, Three-Dimensional (3D) printing, and Augmented Reality (AR) are just a few examples of...
View the slides from our Volumetric Photobooth demonstration at MWC19!
View the slides from our demonstration on Core Network Testbeds from MWC19!
Discover InterDigital's role in Beyond 5G in this MWC19 presentation.
View the slides from our demonstration of 5G Edge Mission Critical Automation from MWC19!
View the slides from our demonstration of 360 Video Standards from MWC19!
"A recently celebrated kind of deep neural networks is Generative Adversarial Networks. GANs are generators of samples from a distribution that has been learned; they are up to now centrally trained from local data on a single location. We question the performance of training GANs using a spread dataset over...
Scene-agnostic visual inpainting remains very challenging despite progress in patch-based methods. Recently, Pathak et al. [26] have introduced convolutional "context encoders'' (CEs) for unsupervised feature learning through image completion tasks. With the additional help of adversarial training, CEs turned out to be a promising tool to complete complex structures in...
We present a diminished reality application running live on consumer mobile devices. In our pre-observation-based approach, the clean 3D scene, free of undesired objects, is scanned beforehand and reconstructed as a high resolution textured 3D model. At runtime, objects added in a region of interest are efficiently removed by projecting...
We present a new method for reconstructing a 4D light field from a random set of measurements. A 4D light field block can be represented by a sparse model in the Fourier domain. As such, the proposed algorithm reconstructs the light field, block by block, by selecting frequencies of the...
Which parts or objects are interesting in a content? In this paper we first propose three computational models to automatically predict interestingness rankings of areas/objects inside a 2D picture. We based our modeling on previous experimental findings to ensure reliability of the prediction when compared to the human assessement of...
Augmented Reality (AR) is a concept and a set of technologies for merging of real and virtual elements to produce new visualizations – typically a video – where physical and digital objects co-exist and interact in real time. Most AR applications support real-time interaction with content (AR scene with virtual...
This paper presents a novel compact hybrid monopole- Annular Slot Antenna (ASA). The proposed antenna combines the advantages of an ASA with the broadband feature of a wideband monopole. The antenna simulated performance is thoroughly analyzed and validated via measurements.
This paper presents a novel compact, wideband, four directional radiation patterns, antenna. The radiating element topology is inspired from the Annular Slot Antenna (ASA) and printed wideband monopole. The antenna is designed for North American ATSC-TV indoor reception. For an S11<;-6dB, it covers both High-VHF and UHF bands. It presents...
Recently, the advances in transform coding have contributed to significant bitrate saving for the next generation of video coding. In particular, the combination of different discrete trigonometric transforms (DTT’s) was adopted in the Joint Video Exploration Team (JVET) solution, as well as the Bench-Mark Set (BMS) of the future video...
In this paper, we propose a new format for haptic texture mapping which is not dependent on the haptic rendering setup hardware. Our “haptic material” format encodes ten elementary haptic features in dedicated maps, similarly to “materials” used in computer graphics. These ten different features enable the expression of compliance,...
With the growth of virtual reality setups, digital sculpting tools become more and more immersive. It is now possible to create a piece of art within a virtual environment, directly with the controllers. However, these devices do not allow to touch the virtual material as a sculptor would do. To...
A Preliminary Investigation into the Impact of Training for Example-Based Facial Blendshape Creation
Our work is a perceptual study into the effects of training poses on the Example-Based Facial Rigging (EBFR) method. We analyse the output of EBFR given a set of training poses to see how well the results reproduced our ground truth actor scans compared to a Deformation Transfer approach. While...
Over the last few years, we watched head-worn virtual and augmented reality devices move from research laboratories to store shelves as the newest platform enabling novel virtual experiences. The big promise of emerging mixed reality (MR) technology is a far deeper immersion into imaginary worlds than is possible with the...
Inductive matrix completion (IMC) is a model for incorporating side information in form of “features” of the row and column entities of an unknown matrix in the matrix completion problem. As side information, features can substantially reduce the number of observed entries required for reconstructing an unknown matrix from its...
Memorability of media content such as images and videos has recently become an important research subject in computer vision. This paper presents our computation model for predicting image memorability, which is based on a deep learning architecture designed for a classification task. We exploit the use of both convolutional neural...
Style transfer' among images has recently emerged as a very active research topic, fuelled by the power of convolution neural networks (CNNs), and has become fast a very popular technology in social media. This paper investigates the analogous problem in the audio domain: How to transfer the style of a...
Photometric registration consists in blending real and virtual scenes in a visually coherent way. To achieve this goal, both reflectance and illumination properties must be estimated. These estimates are then used, within a rendering pipeline, to virtually simulate the real lighting's interaction with the scene. In this paper, we are...
Quadrotor drones equipped with high quality cameras have rapidly raised as novel, cheap and stable devices for filmmakers. While professional drone pilots can create aesthetically pleasing videos in short time, the smooth -- and cinematographic -- control of a camera drone remains challenging for most users, despite recent tools that...
A wide color gamut (WCG) display has great color rendering capability and offers the opportunity to achieve a pleasing and realistic appearance in terms of image quality. To take full advantage of the large display gamut, a new gamut extension algorithm (GEA) is proposed based on a new color appearance...
Everything we do, from wireless platforms to IoT and beyond, is designed around partnership. Find out how InterDigital is Creating the Living Network. Together. in this MWC18 presentation.
To work at scale, a complete image indexing system comprises two components: An inverted file index to restrict the actual search to only a subset that should contain most of the items relevant to the query; An approximate distance computation mechanism to rapidly scan these lists. While supervised deep learning...
Check out this report detailing the State of Art (SoA) of optical hardware currently used in the field of 3D display technology, as well as emerging optical hardware technologies that feature Light Field (LF) and Holographic techniques.
Thanks to the increasing number of images stored in the cloud, external image similarities can be leveraged to efficiently compress images by exploiting inter-images correlations. In this paper, we propose a novel image prediction scheme for cloud storage. Unlike current state-of-the-art methods, we use a semi-local approach to exploit inter-image...
A professor of Electrical Engineering at L’École de Technologie Supérieure (ÉTS) in Montreal and the chair holder of the Richard J. Marceau Chair on Wireless IP Technology for Developing Countries, François Gagnon has begun a project to bring the Internet to the most sparsely populated areas of the developing world. InterDigital...
In order for network slicing to deliver on its promise as an effective tool to generate new revenues and profitability, new business models must match the dynamic technical and operational changes that it brings. Significant progress is being made in demonstrations and trials, as highlighted in this report and the...
Color grading is a crucial step in film post-production for defining the creative intent and giving a particular look and feel to the content. In the context of VR, no adapted solutions exist yet for color grading 360° imagery, leading to cumbersome work-arounds and costing precious production time. To enable...
Virtual reality offers a new way of telling stories and engaging audiences. To create appealing content for this new imaging modality, content production and post-production workflows need to be adapted or completely rethought. One key aspect for ensuring image quality and preservation of the director's intent throughout the production workflow...
Joe Giersch, a biologist from USGS, sat down with InterDigital to discuss technology’s role in the study of stream ecology, and explain why we should all know about the insect, Lednia tumana, that he has spent his life studying.
Learning parameters from voluminous data can be prohibitive in terms of memory and computational requirements. We propose a ‘compressive learning’ framework, where we estimate model parameters from a sketch of the training data. This sketch is a collection of generalized moments of the underlying probability distribution of the data. It...
The success of Google’s PageRank algorithm popularized graphs as a tool to model the web’s navigability. At that time, the web topology was resulting from human edition of hyper-links. Nowadays, that topology is mostly resulting from algorithms. In this paper, we propose to study the topology realized by a class...
To enable light fields of large environments to be captured, they would have to be sparse, i.e. with a relatively large distance between views. Such sparseness, however, causes subsequent processing to be much more difficult than would be the case with dense light fields. This includes segmentation. In this paper,...
We consider the problem of identifying people on the basis of their walk (gait) pattern. Classical approaches to tackle this problem are based on, e.g., video recordings or piezoelectric sensors embedded in the floor. In this work, we rely on acoustic and vibration measurements, obtained from a microphone and a...
Smart city technologies could save enterprises, governments and citizens globally over US $5 trillion annually by 2022. This is the conclusion reached by ABI Research in this new white paper, which analyzes the scope for cost savings and efficiency as a driver for smart city deployments, smart technologies and the...
A multi-user experience extending a standard TV content with AR elements is presented. It runs with both a standard tablet and a premium MR headset, the Microsoft HoloLens. A virtual TV mosaic is displayed around the TV screen and used as a GUI to control both TV and MR content....
Presentation of Light Fields pipeline for Immersive Video Experiences
Plenoptic cameras enable a variety of novel post-processing applications, including refocusing and single-shot 3D imaging. To achieve high accuracy, such applications typically require knowledge of intrinsic camera parameters. One such parameter is the location of the main lens' optical center relative to the sensor, which is required for modeling radially...
A large portion of data mining and analytic services use modern machine learning techniques, such as deep learning. The state-of-the-art results by deep learning come at the price of an intensive use of computing resources. The leading frameworks (e.g., TensorFlow) are executed on GPUs or on high-end servers in datacenters....
Recently, the Qualcomm Tricorder XPRIZE® announced its winners. We caught up with Ed Hepler, ex-InterDigital engineer and hardware development lead of the winning bootstrap operation, Final Frontier Medical Devices, after the winning announcement in this CREATORS sit-down.
It takes someone special to look at the island of Manhattan – possibly the most transformed, altered landscape on earth – and see the beaver dams, streams, marshes, and rolling hills that it once was... and want to bring that back to life. That someone special is Dr. Eric Sanderson, the...
Discover how Ph.D. candidate at the University of Bristol, Dulce Rodriguez, turned her interest in how technology impacts society into a full-blown research project that aims to create a blended intergenerational learning environment.
For large-scale visual search, highly compressed yet meaningful representations of images are essential. Structured vector quantizers based on product quantization and its variants are usually employed to achieve such compression while minimizing the loss of accuracy. Yet, unlike binary hashing schemes, these unsupervised methods have not yet benefited from the...
In this work we propose a novel model-based deep convolutional autoencoder that addresses the highly challenging problem of reconstructing a 3D human face from a single in-the-wild color image. To this end, we combine a convolutional encoder network with an expert-designed generative model that serves as decoder. The core innovation...
Electroencephalography (EEG)-based emotion recognition is currently a hot issue in the affective computing community. Numerous studies have been published on this topic, following generally the same schema: 1) presentation of emotional stimuli to a number of subjects during the recording of their EEG, 2) application of machine learning techniques to...
The proposed Single Layer SDR backward compatible HDR video distribution solution detailed in this paper, named SL-HDR1, and standardized in ETSI TS 103 433 specification, aims at addressing these issues. SL-HDR1 leverages SDR distribution networks and services already in place. It enables both high quality HDR rendering on HDR-enabled CE...
Technicolor has been investigating how Mixed Reality technology could impact the future of home entertainment. We have designed and implemented a system to extend a standard TV experience with AR content, using a consumer tablet or a headset. A virtual TV mosaic is displayed around the TV screen and used...
Augmented Reality (AR) scenarios aim to provide realistic blending between real world and virtual objects. A key factor for realistic AR is thus a correct illumination simulation. This consists in estimating the characteristics of real light sources and use them to model virtual lighting. In this paper, we briefly introduce...
Light field acquisition devices allow capturing scenes with unmatched postprocessing possibilities. However, the huge amount of high-dimensional data poses challenging problems to light field processing in interactive time. In order to enable light field processing with a tractable complexity, in this paper, we address the problem of light field oversegmentation....
Some structural characteristics of online discussions have been successfully modeled in the recent years. When parameters of these models are properly estimated, the models are able to generate synthetic discussions that are structurally similar to the real discussions. A common aspect of these models is that they consider that all...
This chapter presents the Integrated Lens Antenna (ILA) technology as it evolved since its introduction aiming to respond to the needs of emerging applications such as high-data-rate communication, intelligent transport, and mm-wave imaging. The topics covered include the ILA design concepts as well as the electromagnetic phenomena intrinsic to dielectric...
We consider example-guided audio source separation approaches, where the audio mixture to be separated is supplied with source examples that are assumed matching the sources in the mixture both in frequency and time. These approaches were successfully applied to the tasks such as source separation by humming, score-informed music source...
This paper describes a light field scalable compression scheme based on the sparsity of the angular Fourier transform of the light field. A subset of sub-aperture images (or views) is compressed using HEVC as a base layer and transmitted to the decoder. An entire light field is reconstructed from this...
This paper summarizes the computational models that Technicolor proposes to predict interestingness of images and videos within the MediaEval 2017 PredictingMedia Interestingness Task. Our systems are based on deep learning architectures and exploit the use of both semantic and multimodal features. Based on the obtained results, we discuss our findings...
In this paper, the Predicting Media Interestingness task which is running for the second year as part of the MediaEval 2017 Benchmarking Initiative for Multimedia Evaluation, is presented. For the task, participants are expected to create systems that automatically select images and video segments that are considered to be the...
This paper deals with the unification of local and non-local signal processing on graphs within a single convolutional neural network (CNN) framework. Building upon recent works on graph CNNs, we propose to use convolutional layers that take as inputs two variables, a signal and a graph, allowing the network to...
Lighting is a key element in photography. Professional photographers often work with complex lighting setups to directly capture an image close to the targeted one. Some photographers reversed this traditional workflow. Indeed, they capture the scene under several lighting conditions, then combine the captured images to get the expected one....
In this work, we propose a framework, dubbed Union-of-Subspaces SVM (US-SVM), to learn linear classifiers as sparse codes over a learned dictionary. In contrast to discriminative sparse coding with a learned dictionary, it is not the data but the classifiers that are sparsely encoded. Experiments in visual categorization demonstrate that,...
Wearable optical technologies are emerging to keep users safe, powered-up and entertained
Recent work in video compression has shown that using multiple 2D transforms instead of a single transform in order to de-correlate residuals provides better compression efficiency. These transforms are tested competitively inside a video encoder and the optimal transform is selected based on the Rate Distortion Optimization (RDO) cost. However,...
Estimating the inverse covariance matrix of p variables from n observations is challenging when n p, since the sample covariance matrix is singular and cannot be inverted. A popular solution is to optimize for the `1 penalized estimator; however, this does not incorporate structure domain knowledge and can be...
Matrix completion (MC) with additional information has found wide applicability in several machine learning applications. Among algorithms for solving such problems, Inductive Matrix Completion(IMC) has drawn a considerable amount of attention, not only for its well established theoretical guarantees but also for its superior performance in various real-world applications. However,...
Rotoscoping, the detailed delineation of scene elements through a video shot, is a painstaking task of tremendous importance in professional post-production pipelines. While pixel-wise segmentation techniques can help for this task, professional rotoscoping tools rely on parametric curves that offer the artists a much better interactive control on the definition,...
To ensure that all important moments of an event are represented and that challenging scenes are correctly captured, both amateur and professional photographers often opt for taking large quantities of photographs. As such, they are faced with the tedious task of organizing large collections and selecting the best images among...
Rotoscoping, the detailed delineation of scene elements through a video shot, is a painstaking task of tremendous importance in professional post-production pipelines. While pixel-wise segmentation techniques can help for this task, professional rotoscoping tools rely on parametric curves that offer the artists a much better interactive control on the definition,...
Zepeda and Perez [41] have recently demonstrated the promise of the exemplar SVM (ESVM) as a feature encoder for image retrieval. This paper extends this approach in several directions: We first show that replacing the hinge loss by the square loss in the ESVM cost function significantly reduces encoding time...
360° video, supporting the ability to present views consistent with the rotation of the viewer's head along three axes (roll, pitch, yaw) is the current approach for creation of immersive video experiences. Nevertheless, a more fully natural, photorealistic experience - with support of visual cues that facilitate coherent psycho-visual sensory...
This paper investigates the role of the embodiment in an immersive video experience. A system allowing to play back omnidirectional videos enhanced with real-time 3D content is presented. It enables the user to be embodied in an avatar and to interact with 3D objects added to the video. A user...
Predicting interestingness of media content remains an important, but challenging research subject. The difficulty comes first from the fact that, besides being a high-level semantic concept, interestingness is highly subjective and its global definition has not been agreed yet. This paper presents the use of up-to-date deep learning techniques for...
Light-field (LF) is foreseen as an enabler for the next generation of 3D/AR/VR experiences. However, lack of unified representation, storage and processing formats, variant LF acquisition systems and capture-specific LF processing algorithms prevent cross-platform approaches and constrain the advancement and standardization process of the LF information. In this work we...
Regularly, hackers steal data sets containing user identifiers and passwords. Often these data sets become publicly available. The most prominent and important leaks use bad password protection mechanisms, e.g. rely on unsalted password hashes, despite longtime known recommendations. The accumulation of leaked password data sets allows the research community to...
This paper addresses the example-based stylization of videos. Style transfer aims at editing an image so that it matches the style of an example. This topic has recently been investigated massively, both in the industry and academia. The difficulty lies in how to capture the style of an image. For...
With virtual reality (VR), the goal is to create experiences in which the user can be completely immersed; an alternative reality produced by a computer simulation and displayed to the user as a completely synthetic view generated by computer graphics. In both VR and AR, the reality that the physical...
Recent years have shown significant advances in immersive media experiences. Three-dimensional representation formats allow for new forms of entertainment and communication. In this context, point cloud data has emerged as a promising enabler for such experiences. Because efficient enough point cloud compression technologies are still to be found, the Moving...
Despite the relatively recent commercial availability of consumer devices, various market forecasts are expecting a rapid increase in the number of users and the market value of augmented reality (AR) content in the coming years. When it comes to media consumption, the quality of the experience is the major driving factor...
Efficient Nearest Neighbor (NN) search in high-dimensional spaces is a foundation of many multimedia retrieval systems. Because it offers low responses times, Product Quantization (PQ) is a popular solution. PQ compresses high-dimensional vectors into short codes using several sub-quantizers, which enables in-RAM storage of large databases. This allows fast answers...
With the explosion of Virtual Reality technologies, the production and usage of omni directional images (a.k.a 360 images) is presenting new challenges in the domains of compression, transmission and rendering. The evaluation of the quality of images generated by these technologies is therefore paramount. As the exploration of 360 images...
Les liens DSL peuvent subir des pannes sporadiques entraînant des deconnexions ou un acces Internet dégradé. Ces pannes sont a l’origine d’une expérience utilisateur négative et générent des coûts pour les fournisseurs d’accès Internet (FAI) via des appels d’assistance technique. La prediction de pannes permet aux FAI de mettre en...
La recommandation joue un rôle central dans le e-commerce et dans l'industrie du divertissement. L'intérêt croissant pour la transparence algorithmique nous motive dans cet article à observer les résultats de recommandations sous la forme d'un graphe capturant les navigations proposées dans l'espace des items. Nous argumentons qu'une telle approche en...
RESEARCH PAPER / May 2017
/
Machine learning/ Deep learning /Artificial Intelligence,
Computing and Optimization
Learning from multi-label data in an interactive framework is a challenging problem as algorithms must withstand some additional constraints: in particular, learning from few training examples in a limited time. A recent study of multi-label classifier behaviors in this context has identified the potential of the ensemble method “Random Forest...
The quantity and diversity of data in Light-Field videos makes this content valuable for many applications such as mixed and augmented reality or post-production in the movie industry. Some of such applications require a large parallax between the different views of the Light-Field, making the multi-view capture a better option...
In this paper, we present a complete processing pipeline for focused plenoptic cameras. In particular, we propose 1) a new algorithm for microlens center calibration fully in the Fourier domain, 2) a novel algorithm for depth map computation using a stereo focal stack, and 3) a depth-based rendering algorithm that...
While the IoT market in general was slow to capture the public's imagination prior to 2017, one area where IoT is blooming is in the smart city and smart building industry; mass rollouts of IoT in an industrial setting, including urban environments and business hubs, are beginning to garner success....
Generating complex discrete distributions remains as one of the challenging problems in machine learning. Existing techniques for generating complex distributions with high degrees of freedom depend on standard generative models like Generative Adversarial Networks (GAN), Wasserstein GAN, and associated variations. Such models are based on an optimization involving the distance...
Focalization and viewpoint are important aspects of narrative movie-making that need to be taken into account by cinematography and editing. In this paper, we argue that viewpoint can be determined from the first principles of focalization in the screenplay and adherence to a slightly modified version of Hitchcock's rule in...
This paper presents an open database of annotated film clips together with an analysis of elements of film style related to how the shots are composed, how the transitions are performed between shots and how the shots are sequenced to compose a film unit. The purpose is to initiate a...
The human visual system (HVS) non-linearly processes light from the real world, allowing us to perceive detail over a wide range of illumination. Although models that describe this non-linearity are constructed based on psycho-visual experiments, they generally apply to a limited range of illumination and therefore may not fully explain...
In winter 2016, a global survey with nearly 1,000 respondents uncovered significant understanding of the issues facing the wider deployment and monetization of connected cars, along with significant enthusiasm for connected car technologies and services in general and autonomous driving. This white paper outlines the results of the survey, as well...
Summary form only given. This paper presents two sets of modifications to band offset type of the Sample Adaptive Offset technique in HEVC. First, some constraints on the SAO semantics are added to solve sub-optimal syntax issue and to exploit the actual range information of reconstructed samples. Next, the classification...
This paper presents an adaptive clipping technique with optimized syntax in the video coding Joint Exploratory Model (JEM), which exploits the signal characteristics of the video sequence. The component-wise clipping bounds are coded for each slice. Two encoding methods leveraging the efficiency of the proposed technique are then described. The...
Uncovering Influence Cookbooks : Reverse Engineering the Topological Impact in Peer Ranking Services
Ensuring the early detection of important social network users is a challenging task. Some peer ranking services are now well established, such as PeerIndex, Klout, or Kred. Their function is to rank users according to their influence. This notion of influence is however abstract, and the algorithms achieving this ranking...
Immersive videos allow users to freely explore 4 π steradian scenes within head-mounted displays (HMD), leading to a strong feeling of immersion. However users may miss important elements of the narrative if not facing them. Hence, we propose four visual effects to guide the user's attention. After an informal pilot...
InterDigital's Mobile World Congress 2017 presentation on Contextual Driving Platform.
In this paper, we propose a novel scheme for scalable image coding based on the concept of epitome. An epitome can be seen as a factorized representation of an image. Focusing on spatial scalability, the enhancement layer of the proposed scheme contains only the epitome of the input image. The...
Several high-dimensional learning applications require the parameters to satisfy a “group sparsity” constraint, where clusters of coefficients are required to be simultaneously selected or rejected. The group lasso and its variants are common methods to solve problems of this form. Recently, in the standard sparse setting, it has been noted...
In this paper we tackle the problem of single channel audio source separation driven by descriptors of the sounding object's motion. As opposed to previous approaches, motion is included as a soft-coupling constraint within the nonnegative matrix factorization framework. The proposed method is applied to a multimodal dataset of instruments...
An infrastructure-less indoor localization system is proposed based on fingerprints of light signals acquired at high frequencies. In contrast to other systems that modulate lights, the proposed system distinguishes lights by learning from training samples. Due to slight differences in the electronic components used in the construction of compact fluorescent...
We propose a novel informed source separation method for audio object coding based on a recent sampling theory for smooth signals on graphs. Assuming that only one source is active at each time-frequency point, we compute an ideal map indicating which source is active at each time-frequency point at the...
Neural node embedding has been recently developed as a powerful representation for supervised tasks with graph data. We leverage this recent advance and propose a novel approach for unsupervised community discovery in graphs. Through extensive experimental studies on simulated and real-world data, we demonstrate consistent improvement of the proposed approach...
“To be considered for the 2017 IEEE Jack Keil Wolf ISIT Student Paper Award.” In this paper we study the problem of noisy tensor completion for tensors that admit a canonical polyadic or CANDE-COMP/PARAFAC (CP) decomposition with one of the factors being sparse. We present general theoretical error bounds for...
The migration from high-definition TV to ultrahigh definition (UHD) is already underway. In addition to an increase of picture spatial resolution, UHD potentially provides more color by introducing a wider color gamut, and better contrast by moving from standard dynamic range (SDR) to high dynamic range (HDR). The transition from...
Some recent smartphones have offered the so-called audio zoom feature which allows to focus sound capture in the front direction while attenuating progressively surrounding sounds along with video zoom. This paper proposes a complete implementation of such function involving two major steps. First, targeted sound source is extracted by a novel approach...
With decreasing costs for DNA synthesis and sequencing, ultra-dense DNA storage is an emerging, viable technology. The original proof of concept [1]-[3] has yielded several experiments of larger scale demonstrating archival storage in DNA molecules [4]-[7]. In particular, a recent collaboration by Harvard and Technicolor announced the storage of 22...
This work concerns sampling of smooth signals on arbitrary graphs. We first study a structured sampling strategy for such smooth graph signals that consists of a random selection of few pre-defined groups of nodes. The number of groups to sample to stably embed the set of $k$-bandlimited signals is driven...
The ability of multimedia data to attract and keep people’s interest for longer periods of time is gaining more and more importance in the fields of information retrieval and recommendation, especially in the context of the ever growing market value of social media and advertising. In this chapter we introduce...
We introduce analytic approximations for accurate real-time rendering of surfaces lit by non-occluded area light sources. Our solution leverages the Irradiance Tensors developed by Arvo for the shading of Phong surfaces lit by a polygonal light source. Using a reformulation of the 1D boundary edge integral, we develop a general...
To reproduce the appearance of real world scenes, a number of color appearance models have been proposed thanks to adapted psycho-visual experiments. Most of them were designed and intended for a limited dynamic range, or address only dynamic range compression applications. However, given the increasing availability of displays with higher...
Accounts are often shared by multiple users, each of them having different item consumption and temporal habits. Identifying of the active user can lead to improvements in a variety of services by switching from account personalized services to user personalized services. To do so, we develop a topic model extending...
We consider inverse covariance estimation with group sparsity. The groups areoverlapping principal submatrices, which may correspond to structural similarity(e.g. pixels in adjacent regions) or categories (e.g. voter party loyalties). Wepropose a scalable method that makes use of chordal decomposition and appliesthe Frank-Wolfe algorithm. For small simulated problems with block sparsity,...
Adaptive transform learning schemes have been extensively studied in the literature with a goal to achieve better compression efficiency compared to extensively used Discrete Cosine Transforms (DCT) inside a video codec. These transforms are learned offline on a large training set and are tested either in competition with or in...
Time series prediction problems are becoming increasingly high-dimensional in modern applications, such as climatology and demand forecasting. For example, in the latter problem, the number of items for which demand needs to be forecast might be as large as 50,000. In addition, the data is generally noisy and full of...
Several learning applications require solving high-dimensional regression problems where the relevant features belong to a small number of (overlapping) groups. For very large datasets and under standard sparsity constraints, hard thresholding methods have proven to be extremely efficient, but such methods require NP hard projections when dealing with overlapping groups....
HDR Solution presentation
This paper presents the work done at Technicolor regardingthe MediaEval 2016 Predicting Media Interestingness Task,which aims at predicting the interestingness of individual im-ages and video segments extracted from Hollywood movies.We participated in both the image and video subtasks.
This paper provides an overview of the Predicting MediaInterestingness task that is organized as part of the Media-Eval 2016 Benchmarking Initiative for Multimedia Evalua-tion. The task, which is running for the first year, expectsparticipants to create systems that automatically select images and video segments that are considered to be the...
This paper tackles the task of storing a large collection of vectors, such as visual descriptors, and of searching in it. To this end, we propose to approximate database vectors by constrained sparse coding, where possible atom weights are restricted to belong to a finite subset. This formulation encompasses, as...
The aggregation of image statistics – the so-called pooling step of image classification algorithms – as well as the construction of part-based models, are two distinct and well-studied topics in the literature. The former aims at leveraging a whole set of local descriptors that an image can contain (through spatial...
In this paper, we introduce a novel graph representation forinteractive light field segmentation using Markov Random Field (MRF).The greatest barrier to the adoption of MRF for light field processing isthe large volume of input data. The proposed graph structure exploits theredundancy in the ray space in order to reduce the...
With the recent finish of the 2016 Olympics in Rio de Janeiro, there is huge interest in how the 2018 Winter Olympics and 2020 Summer Games will be the launchpad for the latest generation of mobile technology – 5G. A new whitepaper from InterDigital and Mobile World Live cuts through...
The polarization of the BSC(γ 1 ) with a BSC(γ 2 ) is characterized explicitly for γ 1 , γ 2 ∈ [0, 1/2]. The polarization yields a channel W - which is a BSC(λ), and a channel W + which is composed of a BSC(ξ) with probability 1 -...
In this paper we propose a new method to automatically select the rank of linear transforms during supervised learning. Our approach relies on a sparsity-enforcing element-wise soft-thresholding operation applied after the linear transform. This novel approach to supervised rank learning has the important advantage that it is very simple to...
This paper describes a novel scheme to reduce the quantization noise of compressed videos and improve the overall coding performances. The proposed scheme first consists in clustering noisy patches of the compressed sequence. Then, at the encoder side, linear mappings are learned for each cluster between the noisy patches and...
The acquisition of surface material properties and lighting conditions is a fundamental step for photo-realistic Augmented Reality (AR). In this paper, we present a new method for the estimation of diffuse and specular reflectance properties of indoor real static scenes. Using an RGB-D sensor, we further estimate the 3D position...
As a new generation of smartwatches enters the market, one common use is for displaying information such as notifications. While some content might warrant immediately interrupting a user, there is also information that might be important to display yet less urgent. It would be useful to show this content on...
Virtual Reality enclosures are inexpensive devices that create a virtual reality experience using a mobile phone. For example, Google Cardboard lets a user put their phone inside the device and is made from cardboard, some simple lens and a magnet. Because the phone is encased, there is no way to...
Despite the rapid growth of wearable and mobile devices and associated availability of vibrotactile (VT) actuators, the design space of VT applications has remained primarily limited to event-based VT notifications. While useful in some cases, these notifications can be disruptive and cognitively-demanding. Further, the lack of perceptually salient VT sensations...
Easy accessibility can often lead to over-consumption, as seen in food and alcohol habits. On video on-demand (VOD) services, this has recently been referred to as binge watching, where potentially entire seasons of TV shows are consumed in a single viewing session. While a user viewership model may reveal this...
Single Index Models (SIMs) are simple yet flexible semi-parametric models for machine learning, where the response variable is modeled as a monotonic function of a linear combination of features. Estimation in this context requires learning both the feature weights and the nonlinear function that relates features to observations. While methods...
Light field imaging is recently made available tothe mass market by Lytro and Raytrix commercial cameras.Thanks to a grid of microlenses put in front of the sensor, aplenoptic camera simultaneously captures several images of thescene under different viewing angles, providing an enormousadvantage for post-capture applications,e.g., depth estimationand image refocusing. In...
This paper addresses the estimation of accurate long-term dense motion fields from videos of complex scenes. With computer vision applications such as video editing in mind, we exploit optical flows estimated with various inter-frame distances and combine them through multi-step integration and statistical selection (MISS). In this context, managing numerous...
Image retrieval in large image databases is an important problem that drives a number of applications. Yet the use of supervised approaches that address this problem has been limited due to the lack of large labeled datasets for training. Hence, in this paper we introduce two new datasets composed of...
Wi-Fi is the preferred way of accessing the Internet for many devices at home, but it is vulnerable to performance problems. The analysis of Wi-Fi quality metrics such as RSSI or PHY rate may indicate a number of problems, but users may not notice many of these problems if they...
Camcorder piracy refers to the process of using a camcorder to record a screen that displays copyrighted content. In contrast to the previous works that aimed at detecting the occurrence of camcorder piracy, this paper conducts an in-depth study of the luminance flicker that is naturally present in camcorded videos...
Spatio-temporal desynchronization remains a major challenge for watermarking system as it could impair the detection of the hidden payload. Over the years, several (non-blind) registration techniques have been proposed to realign the analyzed content prior to watermark detection and thereby achieve robustness against severe attacks such as display-and-camcord. Such techniques...
Displays' new rendering capabilities combined with the ever-growing number of video applications have fueled the emergence of new video formats addressing wider color gamut and larger frame size. Thus, the need in scalable compression technology to provide backward compatibility with legacy devices and capitalize on the superior compression performance of...
Monte-Carlo integration techniques for global illumination are popular on GPUs thanks to their massive parallel architecture, but efficient implementation remains challenging. The use of randomly decorrelated low-discrepancy sequences in the path-tracing algorithm allows faster visual convergence. However, the parallel tracing of incoherent rays often results in poor memory cache utilization,...
Smartening up the City - New technologies promise a breakthrough for efforts to improve urban living
In practice, making existing cities smart is proving hard. Many different stakeholders need to be involved, while sensors, controls and connectivity can be difficult to install in dense urban environments. In response, some cities are now experimenting with low cost, low power Internet of Things technologies that could usher in...
Adding the sense of touch to hearing and seeing would be necessary for a true immersive experience. This is the promise of the growing "4D-cinema" based on motion platforms and others sensory effects (water spray, wind, scent, etc.). Touch provides a new dimension for filmmakers and leads to a new...
We present a dual-view mixture model to cluster users based on their features and latent behavioral functions. Every component of the mixture model represents a probability density over a feature view for observed user attributes and a behavior view for latent behavioral functions that are indirectly observed through user actions...
Horizontal IIoT platforms will proliferate once organizations implement multiple IIoT applications across different business units within a single corporation, for example, and in communal environments, such as smart cities and multi-modal transportation systems. Two other factors that will fuel this trend are the favorable economics of shared platforms and the...
The movie industry has been using Unmanned Aerial Vehicles as a new tool to produce more and more complex and aesthetic camera shots. However, the shooting process currently rely on manual control of the drones which makes it difficult and sometimes inconvenient to work with. In this paper we address...
The movie industry has been using Unmanned Aerial Vehicles as a new tool to produce more and more complex and aesthetic camera shots. However, the shooting process currently rely on manual control of the drones which makes it difficult and sometimes inconvenient to work with. In this paper we address...
Détecter au plus tôt les utilisateurs importants dans les réseaux sociaux est un problème majeur. Les services de classement d'utilisateurs (peer ranking) sont maintenant des outils bien établis, par des sociétés comme PeerIndex, Klout ou Kred. Leur fonction est de ``classer'' les utilisateurs selon leur influence. Cette notion est néanmoins...
We present in this paper a production-oriented technique designed to visualize contact in real-time between 3D objects. The motivation of this work is to provide integrated tools in the production workflow that help artists setting-up scenes and assets without undesired floating objects or inter-penetrations. Such issues can occur easily and...
The rise of Unmanned Aerial Vehicles and their increasing use in the cinema industry calls for the creation of dedicated tools. Though there is a range of techniques to automatically control drones for a variety of applications, none have considered the problem of producing cinematographic camera motion in real-time for...
The InterDigital Message Dissector Tool, powered by Wireshark® for oneM2M™ standard-based messages analyzes and displays oneM2M™ requests and responses. It is developed to display the packet information according to oneM2M™ structure as defined in the standard. The current version supports the oneM2M™ Release 1 bindings for both HTTP and CoAP....
Discover how using non-standardized versus standards-based solutions for IoT will increase the cost of deployment, hinder mass scale and adoption, and stifle technology innovation for smart city initiatives worldwide in this Machina Research white paper.
With the advent of ultra-high-definition TV services, high dynamic range (HDR) and wide color gamut (WCG) have become two highly desired image quality improvements for delivering immersive video experiences to the consumer mass market. Capture and rendering technologies have reached a level of maturity that now allows HDR and WCG...
This article presents an empirical study that investigated and compared two “big data” text analysis methods: dictionary-based analysis, perhaps the most popular automated analysis approach in social science research, and unsupervised topic modeling (i.e., Latent Dirichlet Allocation [LDA] analysis), one of the most widely used algorithms in the field of...
Between the recent popularity of virtual reality (VR) and the development of 3D, immersion has become an integral part of entertainment concepts. Head-mounted Display (HMD) devices are often used to afford users a feeling of immersion in the environment. Another technique is to project additional material surrounding the viewer, as...
High speed and low latency are expected to be cornerstone 5G requirements, particularly for the delivery of virtual reality and augmented reality. Learn how InterDigital used its EdgeHaulTM millimeter-wave mesh backhaul technology to deliver a live, functioning virtual reality telepresence use case at Mobile World Congress 2016.
Interestingness is the quantification of the ability of an imageto induce interest in a user. Because defining and interpretinginterestingness remain unclear in the literature, we introduce inthis paper two new notions, intra- and inter-interestingness, andinvestigate a novel set of dedicated experiments.More specifically, we propose four experimental protocols:1/ object ranking with...
Focusing on error-correction methods and codes, a systems level design is presented for encoding movies and digital information in DNA storage. A source of data (e.g., movies, audio) is compressed, efficiently encoded with redundant information, modulated, and stored in multiple DNA oligonucleotide strands. The goal is to decode the source...
As the video industry begins deployment of ultrahigh-definition TV in both professional and consumer markets, including support for higher dynamic range and wider color gamut services is considered essential within the industry. Higher dynamic range and wider color gamut offer end users a significantly enhanced viewing experience by supporting intensity...
Aggregation of streamed data is key to the expansion of the Internet of Things. This paper addresses the problem of designing a topology for reliably aggregating data flows from many devices arriving at a datacenter. Reliability here means ensuring operation without data loss. We seek a frugal solution that prevents...
Discover InterDigital's secure and scabable horizontal platform that helps businesses launch and manage IoT applications, oneMPOWER IoT platform in this recent IoT Week Korea 2015 presentation.
This IoT Week Korea 2015 presentation highlights advanced features that it is implementing that include interworking of the oneMPOWER IoT platform with underlying 3GPP and LWM2M network technologies as well as scaling down the oneMPOWER IoT solution to run on resource constrained devices such as the Atmel SAM D21 Xplained...
This IoT Week Korea 2015 presentation demonstrates the key role that the oneMPOWER IoT platform is serving in the EU funded project.
This IoT Week Korea 2015 presentation demonstrates how the oneMPOWER platform can be used to develop horizontal mash-up applications.
Discover how the oneMPOWER IoT solution App Developers Kit (ADK) powered by ThingWorx supports a powerful "drag and drop" Mashup Builder that can be used to rapidly create rich, interactive IoT applications, real-time dashboards, collaborative workspaces and mobile interfaces with minimal coding.
Discover how the oneMPOWER IoT platform is used to enable a Continua Alliance certified personal connected health solution from Lamprey Networks Inc. with oneM2M services such as device management capabilities.
Learn how the oneMPOWER IoT platform can be interworked with existing networking technologies such as Systech's Z-Wave home and building automation gateway solution from this IoT Week Korea 2015 demo.
Discover the benefits of a policy driven intelligent connection management solution, such as InterDigital's Smart Access Manager, from the perspective of subscribers and service providers.
To truly enable the next generation of applications, the IoT should start to look more like an operating environment stated InterDigital's Steve Burr repeatedly in his presentation at LinuxCon 2015. Check out his slides here to learn more about InterDigital's wot.io.
Learn how the mechanics of vision can be leveraged to identify and remove details that cannot be perceived by a viewer in a specific viewing situation and much more in this Colin Dixon whitepaper.
Within the next two decades, it is predicted that every person, industry, and service provider will be using 5G systems. The fifth generation wireless standard is expected to underpin new technology deployments as well as future technologies that at this time can only be imagined. This paper provides valuable insight...
Nowadays, agile operators view unlicensed spectrum as a tool that can be used to help build a heterogeneous network that offers users a homogenous experience. Wi-Fi is no longer considered the competition, but instead a potential partner. To learn more about carrier-grade Wi-Fi, how operators are integrating Wi-Fi into their...
The Smart City concept is predicted to provide safer, more efficient and environmentally conscious living corridors for a large and growing urban population. The migration of a city’s energy grid, transportation system, and more to efficient platforms that are interconnected is a tremendously complex task that will require many partnerships...
With the first commerical deployments of 5G predicted to begin in just five years, learn more about the requirements, use cases and technologies of 5G.
In wireless networks such as those based on IEEE 802.11, packet losses due to fading and interference are often misinterpreted as indications of congestion, causing unnecessary decrease in the data sending rate due to congestion control at higher layer protocols. For delay-constrained applications such as video teleconferencing, packet losses may...
5G will deliver the next level of experience and enable a diverse array of 5G services. Learn about eHealth services, pervasive video and high quality content, and enabling tactile internet in this 2015 Mobile World Congress presentation.
Advanced waveforms and multiple access, advanced antenna and multi-site technologies, novel duplexing schemes, new and flexible spectrum usage. These 5G Air Interface technology trends and more are detailed in this 2015 Mobile World Congress presentation.
As the initial 5G requirements highlight the need for spectrum, industry and governments are actively promoting spectrum sharing. Learn more about sharing in licensed, unlicensed and government controlled spectrum.
An examination of the value chain that drives video is necessary to understand and examine the future of video distribution. Operators, broadcasters or platforms that either produce or aggregate content, pay for video distribution with revenues generated directly by consumers or via advertising. In the past five years, consumer expectation...
Check out this 2014 IBC presentation on User Aware Video!
InterDigital's presentation for the 2014 Geoweb Summit, "The Next-Generation Internet of Things: The Role of Horizontal, Standards-Based Platforms in the IoT."
MWC 2013 presentation for 5G Advanced Waveforms and RF that covers key advantages, use cases, digital pre-distortions, crest factor reduction, envelope tracking and more.
BLOG / Jul 2014
/
Bandwidth Crunch,
IEEE,
Spectrum Harvesting,
TVWS,
Webinar
/ Posted By: Meghan Carney
BLOG / Jul 2014
/
Internet of Things,
IoT,
M2M,
M2M security,
security
/ Posted By: Stephanie Stocker
BLOG / Sep 2014
/
InterDigital,
Smart Cloud,
Third Industrial Revolution,
TIR
/ Posted By: Stephanie Stocker
BLOG / Nov 2014
/
Network of Networks,
Small Cells,
Network Operators,
Narayan Menon
/ Posted By: meghan.carney
BLOG / Sep 2015
/
IoT,
Mediatek,
LinkIt ONE,
shipiot,
bipio,
wotio,
accelerometer,
demo,
tutorial
/ Posted By: wotio team
BLOG / Oct 2015
/
ship iot,
columbia,
atmel,
iot,
columbia university,
wot.io
/ Posted By: wotio team
BLOG / Oct 2015
/
wot.io,
shipiot,
ti,
texas instruments,
sensortag,
cc2541,
cc2650,
beagleboard,
beaglebone black,
node.js,
python,
devicehive,
device management,
bluetooth,
bluetooth low energy,
nest,
demo,
tutorial
/ Posted By: wotio team
BLOG / Oct 2015
/
wot.io,
shipiot,
ti,
texas instruments,
sensortag,
cc2541,
cc2650,
beagleboard,
beaglebone black,
node.js,
python,
devicehive,
device management,
bluetooth,
bluetooth low energy,
nest,
demo,
tutorial,
circonus,
scriptr,
bip.io,
javascript
/ Posted By: wotio team
BLOG / Oct 2015
/
techinmotion,
nyc,
iot,
tom gilley,
wot.io,
internet of things,
Thingworx
/ Posted By: wotio team
BLOG / Oct 2015
/
iot,
interoperability,
wotio,
data,
internet of things,
connected devices
/ Posted By: wotio team
BLOG / Nov 2015
/
wot.io,
node red,
data service exchange,
protocol adapters
/ Posted By: wotio team
BLOG / Nov 2015
/
Freescale,
bipio,
ARM,
mbed,
wot.io,
data service exchange
/ Posted By: wotio team
BLOG / Nov 2015
/
AT&T,
M2X,
Elasticsearch,
Kibana,
WebSocket,
MTA,
GTFS,
Digital Signage,
Metro,
riak
/ Posted By: wotio team
BLOG / Dec 2015
/
protocol adapters,
ARM,
ARMmbed,
WebSocket,
bridge,
MQTT,
CoAP,
virtual device,
bip.io,
B+B SmartWorx
/ Posted By: wotio team
BLOG / Dec 2015
/
Smart City,
ARM,
ARMmbed,
scriptr,
bip.io,
thingworx,
Elasticsearch
/ Posted By: wotio team
BLOG / Dec 2015
/
MQTT,
protocol adapters,
Elasticsearch,
thingworx,
bip.io,
scriptr,
ARM
/ Posted By: wotio team
BLOG / Dec 2015
/
bip.io,
data service exchange,
protocol adapters,
browser,
chrome,
extension
/ Posted By: wotio team
BLOG / Jan 2019
/
MWC19,
IDCCatMWC19,
IDCC,
InterDigital,
MWC,
Mobile World Congress,
MWL
/ Posted By: The InterDigital Communications Team
BLOG / Feb 2019
/
MWC19,
IDCCatMWC19,
IDCC,
InterDigital,
MWC,
Mobile World Congress,
MWL
/ Posted By: The InterDigital Communications Team
BLOG / Apr 2020
/
coronavirus,
Rel-16 ASN.1,
3GPP,
RAN2,
Release 16,
5G,
Rel-17,
Diana Pani
/ Posted By: Roya Stephens