[go: up one dir, main page]

WO2026018464A1 - Dispositif de traitement vocal, procédé de traitement vocal et programme de traitement vocal - Google Patents

Dispositif de traitement vocal, procédé de traitement vocal et programme de traitement vocal

Info

Publication number
WO2026018464A1
WO2026018464A1 PCT/JP2024/037025 JP2024037025W WO2026018464A1 WO 2026018464 A1 WO2026018464 A1 WO 2026018464A1 JP 2024037025 W JP2024037025 W JP 2024037025W WO 2026018464 A1 WO2026018464 A1 WO 2026018464A1
Authority
WO
WIPO (PCT)
Prior art keywords
audio
section
separated
mixed
speech
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
PCT/JP2024/037025
Other languages
English (en)
Japanese (ja)
Inventor
博昭 諸橋
龍 相原
祥幹 三井
勝人 伊佐野
進也 田口
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Mitsubishi Electric Corp
Original Assignee
Mitsubishi Electric Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Mitsubishi Electric Corp filed Critical Mitsubishi Electric Corp
Publication of WO2026018464A1 publication Critical patent/WO2026018464A1/fr
Pending legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Circuit For Audible Band Transducer (AREA)

Abstract

L'invention concerne une unité de détection de section (102) qui détecte, à partir de données vocales en série chronologique, une section vocale mélangée qui est une plage temporelle dans laquelle une voix mélangée dans laquelle une pluralité de voix sont mélangées est présente. L'invention concerne également une unité de séparation vocale (103) qui sépare la voix mélangée dans la section vocale mélangée en la pluralité de voix.
PCT/JP2024/037025 2024-07-18 2024-10-17 Dispositif de traitement vocal, procédé de traitement vocal et programme de traitement vocal Pending WO2026018464A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2024-114403 2024-07-18
JP2024114403 2024-07-18

Publications (1)

Publication Number Publication Date
WO2026018464A1 true WO2026018464A1 (fr) 2026-01-22

Family

ID=98437007

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2024/037025 Pending WO2026018464A1 (fr) 2024-07-18 2024-10-17 Dispositif de traitement vocal, procédé de traitement vocal et programme de traitement vocal

Country Status (1)

Country Link
WO (1) WO2026018464A1 (fr)

Similar Documents

Publication Publication Date Title
Zmolikova et al. Neural target speech extraction: An overview
Luo et al. Speaker-independent speech separation with deep attractor network
Adeel et al. Lip-reading driven deep learning approach for speech enhancement
JP7525648B2 (ja) エンドツーエンドの複数話者重複音声認識
CN107146624B (zh) 一种说话人确认方法及装置
US11823685B2 (en) Speech recognition
JP2019522810A (ja) ニューラルネットワークベースの声紋情報抽出方法及び装置
Gogate et al. Visual Speech In Real Noisy Environments (VISION): A Novel Benchmark Dataset and Deep Learning-Based Baseline System.
EP3951777A1 (fr) Dispositif, procédé et programme de traitement de signaux
Nasib et al. A real time speech to text conversion technique for bengali language
KR20200083685A (ko) 실시간 화자 판단 방법
JP2017003622A (ja) 声質変換方法および声質変換装置
US20220198140A1 (en) Live audio adjustment based on speaker attributes
Shao et al. Stream weight estimation for multistream audio–visual speech recognition in a multispeaker environment
Soboleva et al. Replacing human audio with synthetic audio for on-device unspoken punctuation prediction
JP7160095B2 (ja) 属性識別装置、属性識別方法、およびプログラム
Wang et al. Disentangling the impacts of language and channel variability on speech separation networks
CN118369713A (zh) 与语言无关的多语言端到端流式传输设备上的asr系统
WO2026018464A1 (fr) Dispositif de traitement vocal, procédé de traitement vocal et programme de traitement vocal
JP2008145988A (ja) 雑音検出装置および雑音検出方法
Li et al. A visual-pilot deep fusion for target speech separation in multitalker noisy environment
US20240296826A1 (en) System and Method for Multi-Channel Speech Privacy Processing
JP7613587B2 (ja) 信号処理装置、信号処理方法及び信号処理プログラム
JP6526602B2 (ja) 音声認識装置、その方法、及びプログラム
Matsuda et al. Acoustic discriminability of unconscious laughter and scream during game-play