Multimodal machine learning for generating three-dimensional audio

Ismael Faro Sertage, Juan Cruz Benito, Francisco Jose Martin Fernandez

May 2024

PDF Source Document

Abstract

Methods and systems use one or more machine learning models to automatically generate three-dimensional sound. A multimodal content item is accessed by a computing device. Three-dimensional sound is automatically generated by the computing device using the one or more machine learning models based on the multimodal content item.

Type

Patent

Publication

World Patent App PCT/CN2023/126718

Multimodal machine learning for generating three-dimensional audio

Abstract

Related