Multi-Modal Human Modeling, Analysis and Synthesis

£55.99

Multi-Modal Human Modeling, Analysis and Synthesis

Hospitality and service industries Digital and information technologies: social and ethical aspects Digital and information technologies: Legal aspects Internet guides and online services Software Engineering Computer security Computer networking and communications Computer architecture and logic design Artificial intelligence

Dinosaur mascot

Language: English

Published by: CRC Press

Published on: 21st October 2025

Format: LCP-protected ePub

ISBN: 9781040409817


Introduction

In today's world, where intelligent technologies are deeply transforming human-computer interaction and virtual reality, multi-modal human modeling, analysis and synthesis have become central topics in computer vision. As application scenarios grow increasingly complex, new technologies continue to emerge to address these challenges. These techniques demand systematic summarization and practical guidance.

Purpose and Approach

To meet this need, Multi-Modal Human Modeling, Analysis and Synthesis aims to adopt a structured perspective, building a comprehensive technical framework for multi-modal human modeling, analysis and synthesis—progressing from local details to holistic perspectives, and from face features to body dynamics.

Content Overview

This book begins by examining the anatomy structures and characteristics of human faces and bodies, then analyzes how traditional methods and deep learning approaches provide robust optimization solutions for modeling. For example, it explores how to address challenges in face recognition caused by lighting changes, occlusions, face expressions and aging, as well as methods for body localization, reconstruction, recognition and anomaly detection in multi-modal scenarios. It also explains how multi-modal data can drive realistic face and body synthesis.

Features and Framework

A standout feature is its focus on Huawei's MindSpore framework, bridging the gap between algorithms and engineering through practical case studies. From building face detection and recognition pipelines with the MindSpore toolkit to accelerating model training via automatic parallel computing, and solving large language model (LLM) training challenges, each step is supported by reproducible code and design logic.

Intended Audience and Applications

Designed for researchers and engineers in computer vision and AI, this book balances theoretical foundations with industry-ready technical details. Whether you aim to enhance the reliability of biometric recognition, explore creative possibilities in virtual-real interactions or optimize the deployment of deep learning frameworks, this guide serves as an essential link between academic advancements and real-world applications.

Show moreShow less