Ping An Insurance (Group) Company of China. has filed a patent for a system and method for multimodal video segmentation in a multi-speaker scenario. The technology segments video transcripts into sentences, detects speaker changes based on audio or visual content, and segments the video into clips accordingly. GlobalData’s report on Ping An Insurance (Group) Company of China gives a 360-degree view of the company including its patenting strategy. Buy the report here.

According to GlobalData’s company profile on Ping An Insurance (Group) Company of China, Digital lending was a key innovation area identified from patents. Ping An Insurance (Group) Company of China's grant share as of January 2024 was 19%. Grant share is based on the ratio of number of grants to total number of patents.

Multimodal video segmentation system for multi-speaker scenario

Source: United States Patent and Trademark Office (USPTO). Credit: Ping An Insurance (Group) Company of China Ltd

The patent application (Publication Number: US20240020977A1) describes a system for multimodal video segmentation in a multi-speaker scenario. The system includes a memory to store instructions and a processor to execute these instructions. The process involves segmenting a video transcript with multiple speakers into sentences, detecting speaker changes based on audio or visual content, and segmenting the video into clips accordingly. The processor predicts punctuations, timestamps, and speaker change probabilities using acoustic and visual features, neural networks, and face identification techniques.

Furthermore, the system utilizes a convolutional neural network for binary classification, cross-scene face re-identification, and speech probability calculations to determine speaker change probabilities accurately. The processor tokenizes the transcript, combines speaker change probabilities, and determines clip boundaries based on segmentation probabilities. The method and computer-readable storage medium also outline the process of segmenting the video based on speaker change information. Overall, the system offers a comprehensive approach to segmenting videos with multiple speakers by leveraging both audio and visual cues, enhancing the accuracy and efficiency of the segmentation process.

To know more about GlobalData’s detailed insights on Ping An Insurance (Group) Company of China, buy the report here.

Premium Insights

From

The gold standard of business intelligence.

Blending expert knowledge with cutting-edge technology, GlobalData’s unrivalled proprietary data will enable you to decode what’s happening in your market. You can make better informed decisions and gain a future-proof advantage over your competitors.

GlobalData

GlobalData, the leading provider of industry intelligence, provided the underlying data, research, and analysis used to produce this article.

GlobalData Patent Analytics tracks bibliographic data, legal events data, point in time patent ownerships, and backward and forward citations from global patenting offices. Textual analysis and official patent classifications are used to group patents into key thematic areas and link them to specific companies across the world’s largest industries.