Mpeg 4 Beyond Conventional Video Coding

Mpeg 4 Beyond Conventional Video Coding Book in PDF, ePub and Kindle version is available to download in english. Read online anytime anywhere directly from your device. Click on the download button below to get a free pdf file of Mpeg 4 Beyond Conventional Video Coding book. This book definitely worth reading, it is an incredibly well-written.

MPEG-4 Beyond Conventional Video Coding

Author : Mihaela van der Schaar,Deepak S Turaga,Thomas Stockhammer
Publisher : Springer Nature
Page : 80 pages
File Size : 54,6 Mb
Release : 2022-05-31
Category : Technology & Engineering
ISBN : 9783031022395

Get Book

MPEG-4 Beyond Conventional Video Coding by Mihaela van der Schaar,Deepak S Turaga,Thomas Stockhammer Pdf

An important merit of the MPEG-4 video standard is that it not only provided tools and algorithms for enhancing the compression efficiency of existing MPEG-2 and H.263 standards but also contributed key innovative solutions for new multimedia applications such as real-time video streaming to PCs and cell phones over Internet and wireless networks, interactive services, and multimedia access. Many of these solutions are currently used in practice or have been important stepping-stones for new standards and technologies. In this book, we do not aim at providing a complete reference for MPEG-4 video as many excellent references on the topic already exist. Instead, we focus on three topics that we believe formed key innovations of MPEG-4 video and that will continue to serve as an inspiration and basis for new, emerging standards, products, and technologies. The three topics highlighted in this book are object-based coding and scalability, Fine Granularity Scalability, and error resilience tools. This book is aimed at engineering students as well as professionals interested in learning about these MPEG-4 technologies for multimedia streaming and interaction. Finally, it is not aimed as a substitute or manual for the MPEG-4 standard, but rather as a tutorial focused on the principles and algorithms underlying it.

MPEG-4 Beyond Conventional Video Coding

Author : Mihaela van der Schaar,Deepak S. Turaga,Thomas Stockhammer
Publisher : Morgan & Claypool Publishers
Page : 87 pages
File Size : 48,7 Mb
Release : 2006
Category : Computers
ISBN : 9781598290424

Get Book

MPEG-4 Beyond Conventional Video Coding by Mihaela van der Schaar,Deepak S. Turaga,Thomas Stockhammer Pdf

An important merit of the MPEG-4 video standard is that it not only provided tools and algorithms for enhancing the compression efficiency of existing MPEG-2 and H.263 standards, but also contributed key innovative solutions for new multimedia applications such as: real-time video streaming to PCs and cell-phones over Internet and wireless networks, interactive services, and multimedia access. Many of these solutions are currently used in practice or have been important step-stones for new standards and technologies.In this lecture, the authors focus on three key innovations of MPEG-4 video that will continue to serve as an inspiration and basis for emerging standards, products, and technologies. The three topics highlighted in this lecture are object based coding and scalability, Fine Granularity Scalability (FGS), and error resilience tools. This lecture is aimed at engineering students as well as professionals interested in learning about these MPEG-4 technologies for multimedia streaming and interaction. Finally, this lecture is not aimed as a substitute or manual for the MPEG-4 standard, but rather as a tutorial focused on the principles and algorithms underlying it.

Advanced Video Coding for Next-Generation Multimedia Services

Author : Yo-Sung Ho
Publisher : BoD – Books on Demand
Page : 214 pages
File Size : 46,7 Mb
Release : 2013-01-09
Category : Computers
ISBN : 9789535109297

Get Book

Advanced Video Coding for Next-Generation Multimedia Services by Yo-Sung Ho Pdf

This book aims to bring together recent advances and applications of video coding. All chapters can be useful for researchers, engineers, graduate and postgraduate students, experts in this area, and hopefully also for people who are generally interested in video coding. The book includes nine carefully selected chapters. The chapters deal with advanced compression techniques for multimedia applications, concerning recent video coding standards, high efficiency video coding (HEVC), multiple description coding, region of interest (ROI) coding, shape compensation, error resilient algorithms for H.264/AVC, wavelet-based coding, facial video coding, and hardware implementations. This book provides several useful ideas for your own research and helps to bridge the gap between the basic video coding techniques and practical multimedia applications. We hope this book is enjoyable to read and will further contribute to video coding.

Understanding MPEG 4

Author : Sebastian Moeritz,Klaus Diepold
Publisher : Taylor & Francis
Page : 328 pages
File Size : 44,6 Mb
Release : 2012-09-10
Category : Language Arts & Disciplines
ISBN : 9781136036972

Get Book

Understanding MPEG 4 by Sebastian Moeritz,Klaus Diepold Pdf

The Practical Guide to MPEG 4 offers an up to date introduction to this important interactive and multimedia compression standard (including MPEG-4 Part 10), with real examples and information as to how and where this new technology should be used. All aspects of MPEG-4 that are relevant in today's technical landscape are described in this book, including video and audio creation, production, distribution, reception and consumption environment. This book explains everything you really need to know in jargon-free language: interactive systems, content management, deployment, licensing and business models.

Versatile Video Coding

Author : Humberto Ochoa Dominguez,K.R. Rao
Publisher : CRC Press
Page : 458 pages
File Size : 44,8 Mb
Release : 2022-09-01
Category : Technology & Engineering
ISBN : 9781000795059

Get Book

Versatile Video Coding by Humberto Ochoa Dominguez,K.R. Rao Pdf

Video is the main driver of bandwidth use, accounting for over 80 per cent of consumer Internet traffic. Video compression is a critical component of many of the available multimedia applications, it is necessary for storage or transmission of digital video over today's band-limited networks. The majority of this video is coded using international standards developed in collaboration with ITU-T Study Group and MPEG. The MPEG family of video coding standards begun on the early 1990s with MPEG-1, developed for video and audio storage on CD-ROMs, with support for progressive video. MPEG-2 was standardized in 1995 for applications of video on DVD, standard and high definition television, with support for interlaced and progressive video. MPEG-4 part 2, also known as MPEG-2 video, was standardized in 1999 for applications of low- bit rate multimedia on mobile platforms and the Internet, with the support of object-based or content based coding by modeling the scene as background and foreground. Since MPEG-1, the main video coding standards were based on the so-called macroblocks. However, research groups continued the work beyond the traditional video coding architectures and found that macroblocks could limit the performance of the compression when using high-resolution video. Therefore, in 2013 the high efficiency video coding (HEVC) also known and H.265, was released, with a structure similar to H.264/AVC but using coding units with more flexible partitions than the traditional macroblocks. HEVC has greater flexibility in prediction modes and transform block sizes, also it has a more sophisticated interpolation and de blocking filters. In 2006 the VC-1 was released. VC-1 is a video codec implemented by Microsoft and the Microsoft Windows Media Video (VMW) 9 and standardized by the Society of Motion Picture and Television Engineers (SMPTE). In 2017 the Joint Video Experts Team (JVET) released a call for proposals for a new video coding standard initially called Beyond the HEVC, Future Video Coding (FVC) or known as Versatile Video Coding (VVC). VVC is being built on top of HEVC for application on Standard Dynamic Range (SDR), High Dynamic Range (HDR) and 360° Video. The VVC is planned to be finalized by 2020. This book presents the new VVC, and updates on the HEVC. The book discusses the advances in lossless coding and covers the topic of screen content coding. Technical topics discussed include: Beyond the High Efficiency Video CodingHigh Efficiency Video Coding encoderScreen contentLossless and visually lossless coding algorithmsFast coding algorithmsVisual quality assessmentOther screen content coding algorithmsOverview of JPEG Series

Wavelet Image Compression

Author : William Pearlman
Publisher : Springer Nature
Page : 78 pages
File Size : 40,8 Mb
Release : 2022-05-31
Category : Technology & Engineering
ISBN : 9783031022487

Get Book

Wavelet Image Compression by William Pearlman Pdf

This book explains the stages necessary to create a wavelet compression system for images and describes state-of-the-art systems used in image compression standards and current research. It starts with a high level discussion of the properties of the wavelet transform, especially the decomposition into multi-resolution subbands. It continues with an exposition of the null-zone, uniform quantization used in most subband coding systems and the optimal allocation of bitrate to the different subbands. Then the image compression systems of the FBI Fingerprint Compression Standard and the JPEG2000 Standard are described in detail. Following that, the set partitioning coders SPECK and SPIHT, and EZW are explained in detail and compared via a fictitious wavelet transform in actions and number of bits coded in a single pass in the top bit plane. The presentation teaches that, besides producing efficient compression, these coding systems, except for the FBI Standard, are capable of writing bit streams that have attributes of rate scalability, resolution scalability, and random access decoding. Many diagrams and tables accompany the text to aid understanding. The book is generous in pointing out references and resources to help the reader who wishes to expand his knowledge, know the origins of the methods, or find resources for running the various algorithms or building his own coding system. Table of Contents: Introduction / Characteristics of the Wavelet Transform / Generic Wavelet-based Coding Systems / The FBI Fingerprint Image Compression Standard / Set Partition Embedded Block (SPECK) Coding / Tree-based Wavelet Transform Coding Systems / Rate Control for Embedded Block Coders / Conclusion

Contextual Analysis of Videos

Author : Myo Thida,How-lung Eng,Dorothy Monekosso,Paolo Remagnino
Publisher : Springer Nature
Page : 8 pages
File Size : 43,9 Mb
Release : 2022-06-01
Category : Technology & Engineering
ISBN : 9783031022494

Get Book

Contextual Analysis of Videos by Myo Thida,How-lung Eng,Dorothy Monekosso,Paolo Remagnino Pdf

Video context analysis is an active and vibrant research area, which provides means for extracting, analyzing and understanding behavior of a single target and multiple targets. Over the last few decades, computer vision researchers have been working to improve the accuracy and robustness of algorithms to analyse the context of a video automatically. In general, the research work in this area can be categorized into three major topics: 1) counting number of people in the scene 2) tracking individuals in a crowd and 3) understanding behavior of a single target or multiple targets in the scene. This book focusses on tracking individual targets and detecting abnormal behavior of a crowd in a complex scene. Firstly, this book surveys the state-of-the-art methods for tracking multiple targets in a complex scene and describes the authors' approach for tracking multiple targets. The proposed approach is to formulate the problem of multi-target tracking as an optimization problem of finding dynamic optima (pedestrians) where these optima interact frequently. A novel particle swarm optimization (PSO) algorithm that uses a set of multiple swarms is presented. Through particles and swarms diversification, motion prediction is introduced into the standard PSO, constraining swarm members to the most likely region in the search space. The social interaction among swarm and the output from pedestrians-detector are also incorporated into the velocity-updating equation. This allows the proposed approach to track multiple targets in a crowded scene with severe occlusion and heavy interactions among targets. The second part of this book discusses the problem of detecting and localising abnormal activities in crowded scenes. We present a spatio-temporal Laplacian Eigenmap method for extracting different crowd activities from videos. This method learns the spatial and temporal variations of local motions in an embedded space and employs representatives of different activities to construct the model which characterises the regular behavior of a crowd. This model of regular crowd behavior allows for the detection of abnormal crowd activities both in local and global context and the localization of regions which show abnormal behavior.

Multimodal Learning toward Micro-Video Understanding

Author : Liqiang Nie,Meng Liu,Xuemeng Song
Publisher : Springer Nature
Page : 170 pages
File Size : 44,6 Mb
Release : 2022-05-31
Category : Technology & Engineering
ISBN : 9783031022555

Get Book

Multimodal Learning toward Micro-Video Understanding by Liqiang Nie,Meng Liu,Xuemeng Song Pdf

Micro-videos, a new form of user-generated contents, have been spreading widely across various social platforms, such as Vine, Kuaishou, and Tik Tok. Different from traditional long videos, micro-videos are usually recorded by smart mobile devices at any place within a few seconds. Due to its brevity and low bandwidth cost, micro-videos are gaining increasing user enthusiasm. The blossoming of micro-videos opens the door to the possibility of many promising applications, ranging from network content caching to online advertising. Thus, it is highly desirable to develop an effective scheme for the high-order micro-video understanding. Micro-video understanding is, however, non-trivial due to the following challenges: (1) how to represent micro-videos that only convey one or few high-level themes or concepts; (2) how to utilize the hierarchical structure of the venue categories to guide the micro-video analysis; (3) how to alleviate the influence of low-quality caused by complex surrounding environments and the camera shake; (4) how to model the multimodal sequential data, {i.e.}, textual, acoustic, visual, and social modalities, to enhance the micro-video understanding; and (5) how to construct large-scale benchmark datasets for the analysis? These challenges have been largely unexplored to date. In this book, we focus on addressing the challenges presented above by proposing some state-of-the-art multimodal learning theories. To demonstrate the effectiveness of these models, we apply them to three practical tasks of micro-video understanding: popularity prediction, venue category estimation, and micro-video routing. Particularly, we first build three large-scale real-world micro-video datasets for these practical tasks. We then present a multimodal transductive learning framework for micro-video popularity prediction. Furthermore, we introduce several multimodal cooperative learning approaches and a multimodal transfer learning scheme for micro-video venue category estimation. Meanwhile, we develop a multimodal sequential learning approach for micro-video recommendation. Finally, we conclude the book and figure out the future research directions in multimodal learning toward micro-video understanding.

Image Fusion in Remote Sensing

Author : Arian Azarang,Nasser Kehtarnavaz
Publisher : Springer Nature
Page : 89 pages
File Size : 48,9 Mb
Release : 2022-05-31
Category : Technology & Engineering
ISBN : 9783031022562

Get Book

Image Fusion in Remote Sensing by Arian Azarang,Nasser Kehtarnavaz Pdf

Image fusion in remote sensing or pansharpening involves fusing spatial (panchromatic) and spectral (multispectral) images that are captured by different sensors on satellites. This book addresses image fusion approaches for remote sensing applications. Both conventional and deep learning approaches are covered. First, the conventional approaches to image fusion in remote sensing are discussed. These approaches include component substitution, multi-resolution, and model-based algorithms. Then, the recently developed deep learning approaches involving single-objective and multi-objective loss functions are discussed. Experimental results are provided comparing conventional and deep learning approaches in terms of both low-resolution and full-resolution objective metrics that are commonly used in remote sensing. The book is concluded by stating anticipated future trends in pansharpening or image fusion in remote sensing.

Combating Bad Weather Part I

Author : Sudipta Mukhopadhyay,Abhishek Kumar Tripathi
Publisher : Springer Nature
Page : 79 pages
File Size : 43,6 Mb
Release : 2022-06-01
Category : Technology & Engineering
ISBN : 9783031022517

Get Book

Combating Bad Weather Part I by Sudipta Mukhopadhyay,Abhishek Kumar Tripathi Pdf

Current vision systems are designed to perform in normal weather condition. However, no one can escape from severe weather conditions. Bad weather reduces scene contrast and visibility, which results in degradation in the performance of various computer vision algorithms such as object tracking, segmentation and recognition. Thus, current vision systems must include some mechanisms that enable them to perform up to the mark in bad weather conditions such as rain and fog. Rain causes the spatial and temporal intensity variations in images or video frames. These intensity changes are due to the random distribution and high velocities of the raindrops. Fog causes low contrast and whiteness in the image and leads to a shift in the color. This book has studied rain and fog from the perspective of vision. The book has two main goals: 1) removal of rain from videos captured by a moving and static camera, 2) removal of the fog from images and videos captured by a moving single uncalibrated camera system. The book begins with a literature survey. Pros and cons of the selected prior art algorithms are described, and a general framework for the development of an efficient rain removal algorithm is explored. Temporal and spatiotemporal properties of rain pixels are analyzed and using these properties, two rain removal algorithms for the videos captured by a static camera are developed. For the removal of rain, temporal and spatiotemporal algorithms require fewer numbers of consecutive frames which reduces buffer size and delay. These algorithms do not assume the shape, size and velocity of raindrops which make it robust to different rain conditions (i.e., heavy rain, light rain and moderate rain). In a practical situation, there is no ground truth available for rain video. Thus, no reference quality metric is very useful in measuring the efficacy of the rain removal algorithms. Temporal variance and spatiotemporal variance are presented in this book as no reference quality metrics. An efficient rain removal algorithm using meteorological properties of rain is developed. The relation among the orientation of the raindrops, wind velocity and terminal velocity is established. This relation is used in the estimation of shape-based features of the raindrop. Meteorological property-based features helped to discriminate the rain and non-rain pixels. Most of the prior art algorithms are designed for the videos captured by a static camera. The use of global motion compensation with all rain removal algorithms designed for videos captured by static camera results in better accuracy for videos captured by moving camera. Qualitative and quantitative results confirm that probabilistic temporal, spatiotemporal and meteorological algorithms outperformed other prior art algorithms in terms of the perceptual quality, buffer size, execution delay and system cost. The work presented in this book can find wide application in entertainment industries, transportation, tracking and consumer electronics. Table of Contents: Acknowledgments / Introduction / Analysis of Rain / Dataset and Performance Metrics / Important Rain Detection Algorithms / Probabilistic Approach for Detection and Removal of Rain / Impact of Camera Motion on Detection of Rain / Meteorological Approach for Detection and Removal of Rain from Videos / Conclusion and Scope of Future Work / Bibliography / Authors' Biographies

Dictionary Learning in Visual Computing

Author : Qiang Zhang,Baoxin Li
Publisher : Springer Nature
Page : 133 pages
File Size : 40,8 Mb
Release : 2022-05-31
Category : Technology & Engineering
ISBN : 9783031022531

Get Book

Dictionary Learning in Visual Computing by Qiang Zhang,Baoxin Li Pdf

The last few years have witnessed fast development on dictionary learning approaches for a set of visual computing tasks, largely due to their utilization in developing new techniques based on sparse representation. Compared with conventional techniques employing manually defined dictionaries, such as Fourier Transform and Wavelet Transform, dictionary learning aims at obtaining a dictionary adaptively from the data so as to support optimal sparse representation of the data. In contrast to conventional clustering algorithms like K-means, where a data point is associated with only one cluster center, in a dictionary-based representation, a data point can be associated with a small set of dictionary atoms. Thus, dictionary learning provides a more flexible representation of data and may have the potential to capture more relevant features from the original feature space of the data. One of the early algorithms for dictionary learning is K-SVD. In recent years, many variations/extensions of K-SVD and other new algorithms have been proposed, with some aiming at adding discriminative capability to the dictionary, and some attempting to model the relationship of multiple dictionaries. One prominent application of dictionary learning is in the general field of visual computing, where long-standing challenges have seen promising new solutions based on sparse representation with learned dictionaries. With a timely review of recent advances of dictionary learning in visual computing, covering the most recent literature with an emphasis on papers after 2008, this book provides a systematic presentation of the general methodologies, specific algorithms, and examples of applications for those who wish to have a quick start on this subject.

Combating Bad Weather Part II

Author : Sudipta Mukhopadhyay,Abhishek Kumar Tripathi
Publisher : Springer Nature
Page : 70 pages
File Size : 43,8 Mb
Release : 2022-05-31
Category : Technology & Engineering
ISBN : 9783031022524

Get Book

Combating Bad Weather Part II by Sudipta Mukhopadhyay,Abhishek Kumar Tripathi Pdf

Every year lives and properties are lost in road accidents. About one-fourth of these accidents are due to low vision in foggy weather. At present, there is no algorithm that is specifically designed for the removal of fog from videos. Application of a single-image fog removal algorithm over each video frame is a time-consuming and costly affair. It is demonstrated that with the intelligent use of temporal redundancy, fog removal algorithms designed for a single image can be extended to the real-time video application. Results confirm that the presented framework used for the extension of the fog removal algorithms for images to videos can reduce the complexity to a great extent with no loss of perceptual quality. This paves the way for the real-life application of the video fog removal algorithm. In order to remove fog, an efficient fog removal algorithm using anisotropic diffusion is developed. The presented fog removal algorithm uses new dark channel assumption and anisotropic diffusion for the initialization and refinement of the airlight map, respectively. Use of anisotropic diffusion helps to estimate the better airlight map estimation. The said fog removal algorithm requires a single image captured by uncalibrated camera system. The anisotropic diffusion-based fog removal algorithm can be applied in both RGB and HSI color space. This book shows that the use of HSI color space reduces the complexity further. The said fog removal algorithm requires pre- and post-processing steps for the better restoration of the foggy image. These pre- and post-processing steps have either data-driven or constant parameters that avoid the user intervention. Presented fog removal algorithm is independent of the intensity of the fog, thus even in the case of the heavy fog presented algorithm performs well. Qualitative and quantitative results confirm that the presented fog removal algorithm outperformed previous algorithms in terms of perceptual quality, color fidelity and execution time. The work presented in this book can find wide application in entertainment industries, transportation, tracking and consumer electronics.

Versatile Video Coding: Latest Advances in Video Coding Standards

Author : Ochoa Dominguez, Humberto,Rao, Kamisetty R.
Publisher : River Publishers
Page : 460 pages
File Size : 48,8 Mb
Release : 2019-03-08
Category : Technology & Engineering
ISBN : 9788770220477

Get Book

Versatile Video Coding: Latest Advances in Video Coding Standards by Ochoa Dominguez, Humberto,Rao, Kamisetty R. Pdf

Video is the main driver of bandwidth use, accounting for over 80 per cent of consumer Internet traffic. Video compression is a critical component of many of the available multimedia applications, it is necessary for storage or transmission of digital video over today’s band-limited networks. The majority of this video is coded using international standards developed in collaboration with ITU-T Study Group and MPEG. The MPEG family of video coding standards begun on the early 1990s with MPEG-1, developed for video and audio storage on CD-ROMs, with support for progressive video. MPEG-2 was standardized in 1995 for applications of video on DVD, standard and high definition television, with support for interlaced and progressive video. MPEG-4 part 2, also known as MPEG-2 video, was standardized in 1999 for applications of low- bit rate multimedia on mobile platforms and the Internet, with the support of object-based or content based coding by modeling the scene as background and foreground. Since MPEG-1, the main video coding standards were based on the so-called macroblocks. However, research groups continued the work beyond the traditional video coding architectures and found that macroblocks could limit the performance of the compression when using high-resolution video. Therefore, in 2013 the high efficiency video coding (HEVC) also known and H.265, was released, with a structure similar to H.264/AVC but using coding units with more flexible partitions than the traditional macroblocks. HEVC has greater flexibility in prediction modes and transform block sizes, also it has a more sophisticated interpolation and de blocking filters. In 2006 the VC-1 was released. VC-1 is a video codec implemented by Microsoft and the Microsoft Windows Media Video (VMW) 9 and standardized by the Society of Motion Picture and Television Engineers (SMPTE). In 2017 the Joint Video Experts Team (JVET) released a call for proposals for a new video coding standard initially called Beyond the HEVC, Future Video Coding (FVC) or known as Versatile Video Coding (VVC). VVC is being built on top of HEVC for application on Standard Dynamic Range (SDR), High Dynamic Range (HDR) and 360° Video. The VVC is planned to be finalized by 2020. This book presents the new VVC, and updates on the HEVC. The book discusses the advances in lossless coding and covers the topic of screen content coding. Technical topics discussed include: Beyond the High Efficiency Video CodingHigh Efficiency Video Coding encoderScreen contentLossless and visually lossless coding algorithmsFast coding algorithmsVisual quality assessmentOther screen content coding algorithmsOverview of JPEG Series

3D Future Internet Media

Author : Ahmet Kondoz,Tasos Dagiuklas
Publisher : Springer Science & Business Media
Page : 302 pages
File Size : 46,9 Mb
Release : 2013-11-12
Category : Technology & Engineering
ISBN : 9781461483731

Get Book

3D Future Internet Media by Ahmet Kondoz,Tasos Dagiuklas Pdf

This book describes recent innovations in 3D media and technologies, with coverage of 3D media capturing, processing, encoding, and adaptation, networking aspects for 3D Media, and quality of user experience (QoE). The main contributions are based on the results of the FP7 European Projects ROMEO, which focus on new methods for the compression and delivery of 3D multi-view video and spatial audio, as well as the optimization of networking and compression jointly across the Future Internet (www.ict-romeo.eu). The delivery of 3D media to individual users remains a highly challenging problem due to the large amount of data involved, diverse network characteristics and user terminal requirements, as well as the user’s context such as their preferences and location. As the number of visual views increases, current systems will struggle to meet the demanding requirements in terms of delivery of constant video quality to both fixed and mobile users. ROMEO will design and develop hybrid-networking solutions that combine the DVB-T2 and DVB-NGH broadcast access network technologies together with a QoE aware Peer-to-Peer (P2P) distribution system that operates over wired and wireless links. Live streaming 3D media needs to be received by collaborating users at the same time or with imperceptible delay to enable them to watch together while exchanging comments as if they were all in the same location. The volume provides state-of-the-art information on 3D multi-view video, spatial audio networking protocols for 3D media, P2P 3D media streaming, and 3D Media delivery across heterogeneous wireless networks among other topics. Graduate students and professionals in electrical engineering and computer science with an interest in 3D Future Internet Media will find this volume to be essential reading.

Image Understanding using Sparse Representations

Author : Jayaraman J. Thiagarajan,Karthikeyan Natesan Ramamurthy,Pavan Turaga,Andreas Spanias
Publisher : Springer Nature
Page : 115 pages
File Size : 45,7 Mb
Release : 2022-06-01
Category : Technology & Engineering
ISBN : 9783031022500

Get Book

Image Understanding using Sparse Representations by Jayaraman J. Thiagarajan,Karthikeyan Natesan Ramamurthy,Pavan Turaga,Andreas Spanias Pdf

Image understanding has been playing an increasingly crucial role in several inverse problems and computer vision. Sparse models form an important component in image understanding, since they emulate the activity of neural receptors in the primary visual cortex of the human brain. Sparse methods have been utilized in several learning problems because of their ability to provide parsimonious, interpretable, and efficient models. Exploiting the sparsity of natural signals has led to advances in several application areas including image compression, denoising, inpainting, compressed sensing, blind source separation, super-resolution, and classification. The primary goal of this book is to present the theory and algorithmic considerations in using sparse models for image understanding and computer vision applications. To this end, algorithms for obtaining sparse representations and their performance guarantees are discussed in the initial chapters. Furthermore, approaches for designing overcomplete, data-adapted dictionaries to model natural images are described. The development of theory behind dictionary learning involves exploring its connection to unsupervised clustering and analyzing its generalization characteristics using principles from statistical learning theory. An exciting application area that has benefited extensively from the theory of sparse representations is compressed sensing of image and video data. Theory and algorithms pertinent to measurement design, recovery, and model-based compressed sensing are presented. The paradigm of sparse models, when suitably integrated with powerful machine learning frameworks, can lead to advances in computer vision applications such as object recognition, clustering, segmentation, and activity recognition. Frameworks that enhance the performance of sparse models in such applications by imposing constraints based on the prior discriminatory information and the underlying geometrical structure, and kernelizing the sparse coding and dictionary learning methods are presented. In addition to presenting theoretical fundamentals in sparse learning, this book provides a platform for interested readers to explore the vastly growing application domains of sparse representations.