TR201810148T4

TR201810148T4 - CALCULATOR FOR A SOUND SIGNAL AND METHOD FOR DETERMINING PHASE CORRECTION DATA.

Info

Publication number: TR201810148T4
Application number: TR2018/10148T
Authority: TR
Inventors: Disch Sascha; Laitinen Mikko-Ville; Pulkki Ville
Original assignee: Fraunhofer Ges Forschung
Priority date: 2014-07-01
Filing date: 2015-06-25
Publication date: 2018-08-27
Also published as: AU2015282747B2; MX2016016758A; AR101083A1; MY182904A; MX356672B; WO2016001066A1; JP2017525994A; RU2675151C2; CN106663439A; CN106663439B; CA2999327A1; BR112016030343A2; PT3164873T; TR201809988T4; SG11201610837XA; AU2018203475B2; JP6553657B2; ES2683870T3; KR101978671B1; US20190108849A1

Abstract

Bir ses sinyali (55) için faz düzeltme verisinin (295) belirlenmesi için bir hesaplayıcı (270) sunulmaktadır. Hesaplayıcı, birinci ve ikinci varyasyon modunda bir ses sinyalinin (55) bir fazının varyasyonunun belirlenmesi için bir varyasyon belirleyiciyi (275), ikinci varyasyon modunu kullanarak belirlenen ikinci varyasyonu (290b) ve birinci varyasyon modunu kullanarak belirlenen birinci varyasyonun (290a) karşılaştırılması için bir varyasyon karşılaştırıcıyı (280) ve karşılaştırmanın sonucuna dayanarak birinci varyasyon modu veya ikinci varyasyon moduna göre faz düzeltme verilerinin (295) hesaplanması için bir düzeltme verileri hesaplayıcısını (285) ihtiva eder.A calculator 270 is provided for determining phase correction data 295 for an audio signal 55. The calculator provides a variation identifier 275 for determining the variation of a phase of an audio signal 55 in the first and second variation mode, a variation for comparing the second variation 290b determined using the second variation mode and the first variation 290a identified using the first variation mode. the comparator 280 and a correction data calculator 285 for calculating phase correction data 295 according to the first variation mode or the second variation mode based on the result of the comparison.

Description

Tarifname BIR SES siNYALINE YÖNELIK HESAPLAYICI VE PAZ DÜZELTME VERisiNIN BELIRLENMESINE YÖNELIK YÖNTEM Mevcut bulus, bir ses islemcisine ve bir ses sinyalinin islenmesi yöntemine, bir dekodere ve bir ses sinyalinin kodunu çözmek için bir yönteme ve bir kodlayiciya ve bir ses sinyalini kodlamak için bir yönteme iliskindir. Ayrica, bir hesaplayici ve faz düzeltme verilerinin, bir ses sinyalinin ve daha önceki söz konusu yöntemlerin birini gerçeklestirmek için bir bilgisayar programinin belirlenmesine yönelik bir yöntem de açiklanmaktadir. Diger bir deyisle mevcut bulus, algilsal öneme dayanan QMF alaninda bant genisligi uzatilmis sinyallerin faz spektrumunu düzelten veya algisal ses kodekleri için bir faz türevi düzeltmesini ve bant genisligi (BWE) uzatmasini göstermektedir. Algisal ses kodlama Simdiye kadar görülen algisal ses kodlamasi, zaman/frekans-alani isleme, artiklik azaltma olmak üzere birçok ortak temayi izlemektedir [1]. Tipik olarak, giris sinyali, zaman alani sinyalini bir Spektral (zaman/frekans) gösterime dönüstüren bir analiz filtresi bankasi ile analiz edilir. Spektral katsayilara dönüsme, frekans bilesenlerine bagli olarak sinyal bilesenlerinin selektifbir sekilde islenmesini saglar (örnegin, ayri ayri yapilara sahip farkli Paralel olarak giris sinyali, algisal özelliklerine göre analiz edilir, yani, özellikle zamana ve frekansa bagli maskeleme esigi hesaplanir. Zaman/frekans bagimli maskeleme esigi, her bir frekans bandi ve kodlama zaman çerçevesi için mutlak bir enerji degeri veya Maske- Sinyal-Orani [MSR] biçiminde bir hedef kodlama esigi araciligiyla nicemleme ünitesine Analiz filtre bankasi tarafindan verilen Spektral katsayilar, sinyalin temsil edilmesi için gereken veri oranini azaltmak için ölçülür. Bu asama, bilgi kaybi anlamina gelir ve sinyale bir kodlama bozuklugu (hata, gürültü] getirir. Bu kodlama gürültüsünün duyulabilir etkisini en aza indirmek için, nicemleyici asamasinin büyüklükleri, her frekans bandi ve çerçeve için hedef kodlama esiklerine göre kontrol edilir. Ideal olarak, her bir frekans bandina enjekte edilen kodlama gürültüsü, kodlama [maskeleme) esiginden daha düsüktür ve dolayisiyla, öznel seste bozulma algilanamaz (konu disiligin ortadan kaldirilmasi). Psiko-akustik gerekliliklere göre frekans ve zamana göre nicemleme gürültüsünün bu kontrolü, karmasik bir gürültü sekillendirme etkisine yol açar ve bu, kodlayiciyi algisal bir ses kodlayicisi yapan seydir. Daha sonra, modern ses kodlayicilari, nicemlenmis spektral veriler üzerinde entropi kodlamasi (örnegin Huffman kodlama, aritmetik kodlama] yapar. Entropi kodlamasi, bit hizinda da tasarruf saglayan kayipsiz bir kodlama asamasidir. Son olarak, tüm kodlanmis spektral veriler ve ilgili ek parametreler (örnegin, her bir frekans bandi için nicemleyici ayarlari gibi ek bilgiler), dosya depolama veya iletim için amaçlanan son kodlanmis gösterim olan bir bit akisi içerisinde birlikte paketlenir. Bant genisligi uzatimi Filtre bankalarina dayali algisal ses kodlamasinda, tüketilen bit hizinin temel kismi genellikle nicemlenmis spektral katsayilar üzerinde harcanir. Bu nedenle, çok düsük bit hizlarinda, algisal olarak bozulmamis çogalma elde etmek için gerekli olan hassasiyetteki tüm katsayilari temsil etmek için yeterli bit bulunmayabilir. Bu nedenle, düsük bit hizi gereksinimleri, algisal ses kodlamasiyla elde edilebilen ses bant genisligine bir sinirlama getirmektedir. Bant genisligi uzantisi [2], bu uzun süreli temel sinirlamayi kaldirir. Bant genisligi uzantisinin temel fikri, eksik yüksek frekansli içerigi kompakt parametrik bir sekilde ileten ve geri yükleyen ek bir yüksek frekansli islemci tarafindan bir bant-sinirli algisal kodeki tamamlamaktir. Yüksek frekans içerigi, taban bant sinyalinin tek yan bant modülasyonuna, Spektral Band Replikasyonunda (SBR) [3] veya örnegin ses kodlayici [4] gibi yükseklik kaydirma tekniklerinin uygulanmasinda kullanilan kopyalama teknikleri ile üretilebilir. Dijital ses etkileri Zaman-uzatma veya yükseklik kaymasi etkileri genellikle senkronize örtüsme-ekleme elde edilir. Ayrica, alt bantlarda bir SOLA islemi uygulayan hibrit sistemler önerilmistir. Ses kodlayicilar ve hibrid sistemler genellikle dikey faz uyumunun kaybina atfedilebilen duraganlik [8] adi verilen bir durumdan yakinmaktadirlar. Bazi yayinlar, zamana bagli algoritmalarin ses kalitesi üzerindeki gelismelerini, önemli oldugu yerlerde dikey faz tutarliligini koruyarak iliskilendirmektedir [6] [7]. Son teknoloji ses kodlayicilari [1] genellikle kodlanacak sinyalin önemli faz özelliklerini göz ardi ederek algilanan ses sinyallerinin kalitesini riske atmaktadirlar. Algisal ses kodlayicilarinda faz uyumunu düzeltmeye yönelik genel bir öneri, [9]'da ele alinmistir. Bununla birlikte, her türlü faz uyumluluk hatasi ayni anda düzeltilemez ve tüm faz uyumluluk hatalari algisal olarak önemli degildir. Örnegin, ses bant genisligi uzantisinda, en yeni teknolojiden açikça anlasilamamakla birlikte, hangi faz uyumu ile ilgili hatalarin en yüksek öncelikle düzeltilmesi gerektigi ve hangi hatalarin sadece kismen düzeltilmis olabilecegi veya önemsiz algisal etkilerine istinaden, tamamen ihmal edilebilecegi belirtilmektedir. Özellikle ses bant genisligi uzantisinin [2] [3] [4] uygulanmasi nedeniyle, frekans üzerinde ve zaman içerisinde faz uyumu çogu zaman bozulmaktadir. Sonuç, isitsel pürüzlülük gösteren ve orijinal sinyaldeki isitsel nesnelerden ayrilan ve ayrica orijinal sinyale ek olarak isitsel bir nesne olarak algilanan tonlar ihtiva edebilen donuk bir sestir. Üstelik, ses ayni zamanda daha az "vizildayan" sekilde ve dolayisiyla daha az dinleyici katilimini ortaya çikarak uzak bir mesafeden gelebilir [5]. Bu nedenle gelistirilmis bir yaklasima ihtiyaç vardir. Mevcut bulusun bir amaci, bir ses sinyalinin islenmesi için gelistirilmis bir kavram saglanmasidir. Bu amaç, bagimsiz istemlerin konusu ile çözülür. türetilen fazin kullanildigi bir kodlama teknigini açiklar. Spektral büyüklük ve fazin hesaplamalari, Modifiye Ayrik Kosinüs Dönüsümü gibi bir analiz filtresi bankalarindan alinan spektral bilgiler kullanilarak bir tahmin prosesi ile elde edilir. Tahmin prosesi, dürtü yanitlari ile evrisim benzeri islemleri ile uygulanir. Dürtü yanitlarinin parçalari, hesaplama karmasikligi ile tahmin dogrulugu arasinda degis tokus yapmak üzere evrisim benzeri islemlerde kullanim için seçilir. Filtre yapilari ve dürtü yanitlarina yönelik analitik ifadeler içerisindeki matematiksel türevler açiklanir. Mevcut bulus, bir ses sinyalinin fazmin, bir ses islemcisi veya bir dekoder tarafindan hesaplanan bir hedef faza göre düzeltilebilecegi bulgusuna dayanmaktadir. Hedef faz, islenmemis bir ses sinyalinin bir fazinin bir temsili olarak görülebilir. Bu nedenle, islenmis ses sinyalinin fazi, islenmemis ses sinyalinin fazina daha iyi uyacak sekilde ayarlanir. Örnegin, ses sinyalinin zaman frekansi gösterimi, ses sinyalinin fazi bir alt banttaki sonraki zaman çerçeveleri için ayarlanabilir veya faz sonraki frekans alt bantlari için bir zaman çerçevesinde ayarlanabilir. Bu nedenle, en uygun düzeltme yöntemini otomatik olarak belirlemek ve seçmek için bir hesaplayici bulundu. Tarif edilen bulgular farkli düzenlemelerde uygulanabilir veya bir dekoder ve/veya kodlayicida ortak olarak uygulanabilir. Düzenlemeler, bir zaman çerçevesine yönelik bir ses sinyalinin bir faz ölçümünün hesaplanmasina yönelik konfigüre edilen bir ses sinyali faz ölçüm hesaplayicisi ihtiva eden bir ses sinyalinin islenmesine yönelik bir ses islemcisini gösterir. Ayrica ses sinyali, söz konusu zaman çerçevesine yönelik bir hedef faz ölçümünün belirlenmesine yönelik bir hedef faz ölçümü belirleyicisi ve islenmis bir ses sinyali elde etmek üzere hesaplanan faz ölçümü ve hedef faz ölçümü kullanilarak zaman çerçevesine yönelik ses sinyalinin fazlarinin düzeltilmesine yönelik konfigüre edilen bir faz düzelticisi ihtiva eder. Ilave düzenlemelere göre ses sinyali, zaman çerçevesine yönelik birçok alt bant sinyali ihtiva edebilir. Hedef faz ölçümü belirleyicisi, bir birinci alt bant sinyali için bir birinci hedef faz ölçümünün ve ikinci bir alt bant sinyali için ikinci bir hedef faz ölçümünün belirlenmesine yönelik konfigüre edilir. Ek olarak ses sinyali faz ölçümü hesaplayicisi, birinci alt bant sinyali için bir birinci faz ölçümünü ve ikinci alt bant sinyali için ikinci bir faz ölçümünü belirler. Faz düzelticisi, ses sinyalinin birinci faz ölçümü ve birinci hedef faz ölçümü kullanilarak birinci alt bant sinyalinin birinci fazinin düzeltilmesine ve ses sinyalinin ikinci faz ölçümü ve ikinci hedef faz ölçümü kullanilarak ikinci alt bant sinyalinin ikinci bir fazinin düzeltilmesine yönelik konfigüre edilir. Bu nedenle ses islemcisi, düzeltilen birinci alt bant sinyali ve düzeltilen ikinci alt bant sinyali kullanilarak düzeltilen bir ses sinyalinin sentezlenmesine yönelik bir ses sinyali sentezleyicisi ihtiva edebilir. Mevcut bulusa göre ses islemcisi, yatay yönde ses sinyalinin fazinin düzeltilmesine yönelik, diger bir ifadeyle zaman içerisinde bir düzeltme için konfigüre edilir. Bu nedenle ses sinyali, bir dizi zaman çerçevesi halinde bölünebilir, burada her bir zaman çerçevesi fazi hedef faza göre ayarlanabilir. Hedef faz, orijinal bir ses sinyalinin bir temsili olabilir, burada ses islemcisi orijinal ses sinyalinin kodlanmis bir temsili olan ses sinyalinin kodunun çözülmesine yönelik bir dekoder parçasi olabilir. Istege bagli olarak yatay faz düzeltmesi, ses sinyalinin bir zaman-frekans temsilinde bulunmasi halinde ses sinyalinin birçok alt bandi için ayri olarak uygulanabilir. Ses sinyali fazinin düzeltilmesi, ses sinyali fazi ve hedef fazin zaman içerisindeki faz türevinin sapmasinin ses sinyali fazindan çikartilmasi ile gerçeklestirilebilir. Bu nedenle zaman içerisindeki faz türevinin asagidaki gibi bir frekans [bir faz olan çb ile rîî w ` '] olmasi nedeniyle açiklanan faz düzeltmesi, ses sinyalinin her bir alt bandi için bir frekans ayarlamasini gerçeklestirir. Diger ifadelerle ses sinyalinin her bir bandinin bir hedef frekansa farki, ses sinyaline yönelik daha iyi bir kalite elde etmek üzere azaltilabilir. Hedef fazi belirlemek üzere hedef faz belirleyici, mevcut bir zaman çerçevesine yönelik temel bir frekans tahmininin elde edilmesine yönelik ve zaman çerçevesine yönelik temel frekans tahmini kullanilarak birçok zaman çerçevesinin alt bandinin her bir alt bandi için bir frekans tahmininin hesaplanmasina yönelik konfigüre edilir. Frekans tahmini, toplam alt bant ve ses sinyalinin örnekleme frekansi sayisi kullanilarak bir zaman içerisindeki faz türevine dönüstürülebilir. Ilave bir düzenlemede ses islemcisi, bir zaman çerçevesinde ses sinyaline yönelik bir hedef faz ölçümünün belirlenmesine yönelik bir hedef faz ölçüm belirleyicisi, hedef faz ölçümünün zaman çerçevesi ve ses sinyali fazi kullanilarak bir faz hatasinin hesaplanmasina yönelik bir faz hata hesaplayicisi ve faz hatasi kullanilarak zaman çerçevesi ve ses sinyali fazinin düzeltilmesine yönelik konfigüre edilen bir faz düzeltici ihtiva eder. Ilave düzenlemelere göre ses sinyali, bir zaman frekans temsilinde mevcuttur, burada ses sinyali zaman çerçevesine yönelik birçok alt bant ihtiva eder. Hedef faz ölçümü belirleyici, bir birinci alt bant sinyaline yönelik bir birinci hedef faz ölçümü ve ikinci bir alt bant sinyali için ikinci bir hedef faz ölçümünü belirler. Ayrica faz hata hesaplayicisi, faz hatalarinin bir vektörünü olusturur, burada vektörün bir birinci elemani, birinci alt bant sinyali ve birinci hedef faz ölçümü fazinin bir birinci sapmasina refere eder ve burada vektörün ikinci bir elemani, ikinci alt bant sinyali ve ikinci hedef faz ölçümü fazinin ikinci bir sapmasina refere eder. Ek olarak bu düzenlemenin ses islemcisi, düzeltilmis birinci alt bant sinyali ve düzeltilmis ikinci alt bant sinyali kullanilarak düzeltilmis bir ses sinyalinin ortalama düzeltilmis faz degerlerini üretir. Ek olarak veya alternatif olarak birçok alt bant, bir taban bandi ve bir set frekans yamasi halinde gruplandirilir, burada taban bandi ses sinyalinin bir alt bandini ihtiva eder ve frekans yamalari seti, taban bandinda en az bir alt bandin frekansindan daha yüksek bir frekansta, taban bandinin en az bir alt bandini ihtiva eder. Ilave düzenlemeler, ortalama bir faz hatasini elde etmek üzere ikinci frekans yamalarinin birinci yamasina refere eden faz hatalarinin bir vektörünün elemanlarinin bir ortalamasinin hesaplanmasina yönelik konfigüre edilen faz hata hesaplayicisini gösterir. Faz düzelticisi, agirlik ortalamali bir faz hatasi kullanilarak yamanin sinyalinin frekans yamalari setinin birinci ve sonraki frekans yamalarinda alt bant sinyalinin bir fazinin düzeltilmesine yönelik konfigüre edilir, burada ortalama faz hatasi, modifiye dilmis bir yama sinyalini elde etmek üzere frekans yamasinin bir indeksine göre bölünür. Bu faz düzeltmesi, geçis frekanslarinda yeterli kalite saglar, bu sonraki iki frekans yamasi arasindaki sinir frekanslaridir. Diger bir düzenlemeye göre daha önceden açiklanan iki düzenleme, ortalamada ve geçis frekanslarinda yeterli olan faz ile düzeltilmis degerleri ihtiva eden düzeltilmis bir sinyalini elde etmek üzere kombine edilebilir. Bu nedenle ses sinyali faz türev hesaplayicisi, bir taban bandina yönelik frekans üzerinde faz türevlerinin ortalamasinin hesaplanmasina yönelik konfigüre edilir. Faz düzeltici, ses sinyalinin bir taban bandinda en yüksek alt bant indeksi ile alt bant sinyalinin fazina mevcut bir alt bant indeksi ile agirligi belirlenen frekans üzerinde faz türevlerinin ortalamasinin eklenmesi ile optimize bir birinci frekans yamasi ile ilave modifiye edilmis bir yama sinyalini hesaplar. Ayrica faz düzeltici, kombine modifiye edilmis yama sinyalini elde etmek üzere modifiye edilmis yama sinyali ve ilave modifiye edilmis yama sinyalinin agirlikli bir ortalamasinin hesaplanmasina yönelik ve kombine modifiye edilmis yama sinyalinin önceki frekans yamasinda en yüksek alt bant Indeksi ile alt bant sinyali fazina mevcut alt bandin alt bant indeksi ile agirligi belirlenen, frekans üzerinde faz türevlerinin ortalamasinin eklenmesi ile frekans yamalarina dayanarak kombine modifiye edilmis yama sinyalinin tekrar tekrar güncellenmesine yönelik konfigüre edilebilir. Hedef fazi belirlemek üzere hedef faz ölçümü belirleyici, bir pik konumu ve ses sinyalinin mevcut bir zaman çerçevesindeki pik konumlarinin temel frekansinin bir veri akisindan ekstrakte edilmesine yönelik konfigüre edilen bir veri akisi ekstraktörü ihtiva edebilir. Alternatif olarak hedef faz ölçüm belirleyici, bir pik pozisyonu ve mevcut zaman çerçevesinde pik pozisyonlarinin temel bir frekansini hesaplamak üzere mevcut zaman çerçevesinin analiz edilmesine yönelik konfigüre edilen bir ses sinyali analizörü ihtiva edebilir. Ayrica hedef faz ölçüm belirleyici, pik konumu ve pik konumlarinin temel frekansi kullanilarak mevcut zaman çerçevesinde ilave pik konumlarinin tahmin edilmesine yönelik bir hedef spektrum üreteci ihtiva eder. Detayli olarak hedef spektrum üreteci, bir zaman puls katarinin üretilmesine yönelik bir pik detektörü, pik konumlarinin temel frekansina göre puls katarinin bir frekansini ayarlamak üzere bir sinyal Olusturucu, konuma göre puls katarinin fazini ayarlamak üzere bir puls konumlayici ve ayarlanan puls katarinin bir faz spektrumunu üretmek üzere bir spektrum analizörü ihtiva edebilir, burada zaman alani sinyalinin faz spektrumu hedef faz ölçümüdür. Hedef faz ölçüm belirleyicisinin açiklanan düzenlemesi, pikler ile bir dalga formuna sahip bir ses sinyali için bir hedef spektrum olusturulmasi için avantajlidir. Ikinci ses islemcisinin düzenlemeleri, dikey bir faz düzeltmesini açiklar. Dikey faz düzeltmesi, tüm alt bantlar üzerinde bir zaman çerçevesinde ses sinyali fazini ayarlar. Her bir alt bant için bagimsiz olarak uygulanan, ses sinyali fazinin ayarlanmasi, ses sinyali alt bantlarinin sentezlenmesinden sonra düzeltilmemis ses sinyalinden farkli ses sinyalinin bir dalga formu ile sonuçlanir. Bu nedenle örnegin lekelenmis bir pik veya bir geçisin yeniden sekillendirilmesi mümkündür. Ilave bir düzenlemeye göre bir hesaplayici, birinci ve ikinci varyasyon modunda ses sinyali fazinin bir varyasyonunun belirlenmesine yönelik bir varyasyon belirleyici, faz varyasyon modu kullanilarak belirlenen bir birinci varyasyon ve ikinci varyasyon modu kullanilarak belirlenen ikinci bir varyasyonun karsilastirilmasina yönelik bir varyasyon karsilastirici ve karsilastirma sonucuna dayanarak birinci varyasyon modu veya ikinci varyasyon moduna göre faz düzeltmesinin hesaplanmasina yönelik bir düzeltme verisi hesaplayicisi ile bir ses sinyali için faz düzeltme verisinin belirlenmesine yönelik gösterilir. Ilave bir düzenleme, birinci varyasyon modunda faz varyasyonu olarak ses sinyalinin birçok zaman çerçevesine yönelik bir zaman içerisindeki faz türevinin [PDT] standart bir sapma ölçümünün veya ikinci varyasyon modunda faz varyasyonu olarak birçok alt banda yönelik bir frekans üzerinde faz türevinin (PDF) bir standart sapma ölçümünün belirlenmesine yönelik varyasyon belirleyicisini gösterir. Varyasyon karsilastiricisi, birinci varyasyon modu olarak zaman içerisindeki faz türevinin ölçümünü ve ses sinyalinin zaman çerçevelerine yönelik ikinci varyasyon modu olarak frekans üzerinde faz türevinin ölçümünü karsilastirir. Ilave bir düzenlemeye göre varyasyon belirleyicisi, üçüncü bir varyasyon modunda ses sinyali fazinin bir varyasyonunun belirlenmesine yönelik konfigüre edilir, burada üçüncü varyasyon modu bir geçis tespit modudur. Bu nedenle varyasyon karsilastiricisi, üç varyasyon modunu karsilastirir ve düzeltme verisi hesaplayicisi, karsilastirma sonucuna dayanarak birinci varyasyon modu, ikinci varyasyon veya üçüncü varyasyon moduna göre faz düzeltmesini hesaplar. Düzeltme verisi hesaplayicisinin kararlastirma kurallari, asagidaki gibi açiklanabilir. Bir geçisin tespit edilmesi halinde faz, geçis seklini restore etmek üzere geçislere yönelik faz düzeltmesine göre düzeltilir. Aksi durumda birinci varyasyonun ikinci varyasyondan daha küçük veya buna esit olmasi halinde birinci varyasyon modunun faz düzeltmesi uygulanir veya ikinci varyasyonun birinci varyasyondan daha büyük olmasi halinde ikinci varyasyon moduna göre faz düzeltmesi uygulanir. Bir geçis yoklugunun tespit edilmesi halinde ve birinci ve ikinci varyasyonun bir esik degerini asmasi halinde faz düzeltim modlarinin hiçbiri uygulanmaz. Hesaplayici, ses sinyalinin analiz edilmesine yönelik, örnegin bir ses kodlama asamasinda, en iyi faz düzeltim modunu belirlemek üzere ve belirlenen faz düzeltim modu için ilgili parametreleri hesaplamak üzere konfigüre edilebilir. Kod çözme asamasinda parametreler, mevcut teknige ait kodekler kullanilarak kodu çözülen ses sinyallerine kiyasla daha iyi bir kaliteye sahip kodu çözülmüs bir ses sinyali elde etmek üzere kullanilabilir. Hesaplayicinin ses sinyalinin her bir zaman çerçevesi için dogru düzeltme modunu otonom olarak tespit ettigi not edilecektir. Düzenlemeler, bir faz düzeltme algoritmasi ile belirlenen ses sinyalinin birinci zaman çerçevesinde alt bant sinyalinin bir fazinin düzeltilmesine yönelik birinci faz düzeltici ve birinci düzeltme verileri kullanilarak ses sinyalinin ikinci bir sinyalinin birinci zaman çerçevesine yönelik bir hedef spektrumunun üretilmesine yönelik bir birinci hedef spektrum üreteci ile bir ses sinyalinin kodunun çözülmesine yönelik bir dekoderi gösterir, burada düzeltme ses sinyalinin birinci zaman çerçevesinde alt bant sinyalinin ve hedef spektrum ölçümü arasindaki farkin azaltilmasi ile gerçeklestirilir. Ek olarak dekoder, zaman çerçevesine yönelik düzeltilmis bir faz kullanilarak birinci zaman çerçevesine yönelik ses alt bant sinyalinin hesaplanmasina yönelik ve faz düzeltme algoritmasindan farkli ilave bir faz düzeltme algoritmasina göre düzeltilmis bir faz hesaplama islemi kullanilarak veya ikinci zaman çerçevesinde alt bant sinyalinin ölçümü kullanilarak birinci zaman çerçevesinden farkli ikinci bir zaman çerçevesine yönelik ses alt bant sinyalinin hesaplanmasina yönelik bir ses alt bant sinyali hesaplayicisi ihtiva eder. Ilave düzenlemelere göre dekoder, birinci hedef spektrum üretecine es deger ikinci ve üçüncü bir spektrum üreteci ve birinci faz düzelticisine es deger ikinci ve üçüncü faz düzelticisi ihtiva eder. Bu nedenle birinci faz düzelticisi, yatay bir faz düzeltmesi gerçeklestirebilir, ikinci faz düzeltici dikey bir faz düzeltmesi gerçeklestirebilir ve üçüncü faz düzelticisi faz düzeltme geçislerini gerçeklestirebilir. Ilave bir düzenlemeye göre dekoder, ses sinyaline göre az sayida alt bantlar ile bir zaman çerçevesinde ses sinyalinin kodunun çözülmesine yönelik konfigüre edilen bir çekirdek dekoder ihtiva eder. Ayrica dekoder, az sayida alt bant ile çekirdek kodu çözülen ses sinyalinin alt bant setinin yamanmasina yönelik bir yama Olusturucu ihtiva edebilir, burada alt bantlar seti, bir ses sinyalini düzenli sayida alt bant ile elde etmek üzere az sayida alt banda bitisik, zaman çerçevesinde ilave alt bantlara birinci yamayi olusturur. Ayrica dekoder, zaman çerçevesinde ses alt bant sinyalinin büyüklük degerlerinin islenmesine yönelik bir büyüklük islemcisi ve sentezlenmis kodu çözülen bir ses sinyali elde etmek üzere ses alt bant sinyallerinin veya islenmis ses alt bant sinyallerinin büyüklügünün sentezlenmesine yönelik bir ses sinyali sentezleyicisi ihtiva edebilir. Bu düzenleme, kodu çözülen ses sinyalinin bir faz düzeltmesini ihtiva eden bant genisligi uzantisina yönelik bir dekoderi belirleyebilir. Buna uygun olarak bir ses sinyalinin kodlanmasina yönelik bir kodlayici, ses sinyali fazinin belirlenmesine yönelik bir faz belirleyici, ses sinyalinin belirlenen fazina dayanarak bir ses sinyaline yönelik faz düzeltme verisinin belirlenmesine yönelik bir hesaplayici, ses sinyaline göre az sayida alt banda sahip çekirdek kodlanmis bir ses sinyalini elde etmek üzere ses sinyalinin çekirdek kodlanmasina yönelik konfigüre edilen bir çekirdek kodlayici ve çekirdek kodlanan ses sinyaline dahil edilmeyen ikinci alt bant setine yönelik düsük çözünürlükte bir parametrenin elde edilmesine yönelik ses sinyali parametrelerinin ekstrakte edilmesi için konfigüre edilen bir parametre ekstraktörü ve parametreleri, çekirdek kodlanan ses sinyali ve faz düzeltme verisini ihtiva eden bir çikis sinyalinin olusturulmasina yönelik bir ses sinyali olusturucusu, bant genisligi uzantisina yönelik bir kodlaylci olusturabilir. Daha önce açiklanan düzenlemelerin tümü, örnegin kodu çözülen ses sinyalinin bir faz düzeltmesi ile bant genisligi uzantisi için bir kodlayici ve/veya dekoderde toplamda veya kombinasyon halinde görülebilir. Alternatif olarak açiklanan düzenlemelerin tümünün birbirine göre olmadan bagimsiz olarak görüntülenmesi mümkündür. Mevcut bulusun düzenlemeleri, eslik eden çizimlere istinaden açiklanacak olup burada: Sekil la Sekil lb Sekil lc Sekil 1d Sekil 3a Sekil 3b Sekil 3c Sekil 4a Sekil 4b Sekil 4c Sekil 4d bir zaman frekansi gösteriminde bir viyolon sinyalinin büyüklük spektrumunu gösterir; Sekil 1a'nin büyüklük spektrumuna iliskin olan faz spektrumunu gösterir; bir zaman frekansi gösteriminde QMF alanindaki bir trombon sinyalinin Sekil 1C'nin büyüklük spektrumuna iliskin olan faz spektrumunu gösterir; bir zaman çerçevesi ve bir alt bant ile tanimlanan sekilde zaman frekans desenlerini (örnegin QMF kaplari, Kareli Yansima Filtre bankasi kaplari) ihtiva eden bir zaman frekans diyagramini gösterir; bir ses sinyalinin örnek bir frekans diyagramlnl göstermekte olup burada frekansin büyüklügü, ondan fazla farkli alt bant üzerinde gösterilmistir; alim sonrasinda, örnegin bir ara asamada bir kod çözme islemi esnasinda, ses sinyalinin örnek bir frekans gösterimini göstermektedir; yeniden yapilandirilan ses sinyalinin Z[k,ri) örnek bir frekans gösterimini göstermektedir; bir zaman-frekans gösteriminde dogrudan kopyalama SBR'yi kullanarak QMF alanindaki bir viyolon sinyalinin büyüklük spektrumunu gösterir; Sekil 4a'nln büyüklük spektrumuna iliskin olan bir faz spektrumunu gösterir; bir zaman frekansi gösteriminde dogrudan kopyalama SBR'yi kullanarak QMF alanindaki bir trombon sinyalinin büyüklük spektrumunu gösterir; Sekil 4c'nin büyüklük spektrumuna iliskin olan faz spektrumunu gösterir; farkli faz degerleri ile tek bir QMF kabinin zaman-alan gösterimini göstermektedir; Sekil 12a Sekil 12b Sekil 12c Sekil 12a Sekil 13a Sekil 13b Sekil 13c Sekil 13d Sekil 14a 11/4 (üst) ve Sit/4 (alt) bir sabit degeri ile degistirilen faz ve bir sifir olmayan frekans bandi olan tek bir zaman-alani ve frekans-alani gösterimini göstermektedir; bir sifir olmayan frekans bandi olan ve fazin rastgele olarak degistigi bir sinyalin zaman-alani ve frekans-alani temsilini göstermektedir; Sekil 6'da açiklanan etkiyi, dört zaman çerçevesinin ve dört frekans alt- bandmln bir zaman frekansi temsilinde göstermektedir burada sadece üçüncü alt-bant, sifirdan farkli bir frekansi ihtiva etmektedir; Tr/4 (üst) ve 311/4 [alt] sabir degeri ile degistirilen faz ve bir sifir olmayan temporal çerçevesi olan bir sinyalin tek bir zaman-alani ve frekans-alani gösterimini göstermektedir; bir sifir olmayan temporal çerçevesi olan ve fazin rastgele olarak degistigi bir sinyalin zaman-alani ve frekans-alani temsilini gösterir; Sekil 8'de gösterilen zaman frekans diyagramina benzer olan bir zaman frekans diyagramlni göstermektedir, burada sadece üçüncü zaman çerçevesi, sifirdan farkli bir frekansi ihtiva etmektedir; bir zaman frekansi gösteriminde bir QMF alanindaki Viyolon sinyalinin zaman içerisindeki faz türevini gösterir; Sekil 12a'da gösterilen sekilde zaman içerisindeki faz türevine iliskin faz türevi frekansini gösterir; bir zaman frekansi gösteriminde bir QMF alanindaki trombon sinyalinin zaman içerisindeki faz türevini gösterir; Sekil 12c'nin zaman içerisindeki faz türevine iliskin frekans üzerindeki faz türevini gösterir; bir zaman-frekans gösteriminde dogrudan kopyalama SBR'yi kullanarak QMF alanindaki bir Viyolon sinyalinin zaman içerisindeki faz türevini gösterir; Sekil 13a'da gösterilen sekilde zaman içerisindeki faz türevine iliskin frekans içerisindeki faz türevini gösterir; bir zaman frekansi gösteriminde dogrudan kopyalama SBR'yi kullanarak bir QMF alanindaki trombon sinyalinin zaman içerisindeki faz türevini gösterir; Sekil 13c'de gösterilen sekilde zaman içerisindeki faz türevine iliskin frekans üzerindeki faz türevini gösterir; sematik olarak bir birim dairenin dört fazini, örnegin, müteakip zaman Sekil 14b Sekil 18a Sekil 18b Sekil 28a Sekil 28b çerçeveleri veya frekans alt bantlarini gösterir; SBR isleminden sonra Sekil 14a'da gösterilen fazlari ve kesikli çizgiler halinde, düzeltilmis fazlari gösterir; bir ses islemcisinin (50) sematik bir blok diyagramini gösterir; diger bir düzenlemeye göre bir sematik blok diyagramindaki ses islemcisini gösterir; bir zaman-frekans gösteriminde dogrudan kopyalama SBR'yi kullanarak QMF alanindaki bir viyolon sinyalinin PDT'sinde pürüzsüzlestirilmis bir hatayi gösterir; bir zaman-frekans gösteriminde düzeltilmis SBR için QMF alanindaki viyolon sinyalinin PDT'sindeki bir hatayi gösterir; Sekil 18a'da gösterilen hataya iliskin zaman içerisindeki faz türevini gösterir; bir dekoderin sematik bir blok diyagramini gösterir; bir kodlaylcinln sematik bir blok diyagramini gösterir; bir ses sinyali olabilecek bir veri akisinin sematik bir blok diyagramini gösterir; diger bir düzenlemeye göre Sekil 21'in veri akisini gösterir; bir ses sinyalinin islenmesi için bir yöntemin sematik bir blok diyagramini gösterir; bir ses sinyalinin kodunun çözülmesi için bir yöntemin sematik bir blok diyagramini gösterir; bir ses sinyalinin kodlanmasi için bir yöntemin sematik bir blok diyagramini gösterir; diger bir düzenlemeye göre bir ses islemcisinin sematik bir blok diyagramini gösterir; tercih edilen bir düzenlemeye göre ses islemcisinin sematik bir blok diyagramini gösterir; sinyal akisini daha detayli olarak gösteren ses islemcisindeki bir faz düzelticisinin sematik bir blok diyagramini gösterir; Sekiller 26-28a'ya kiyasla diger bir bakis açisindan faz düzeltmesinin asamalarini gösterir; hedef faz ölçüm belirleyicini daha detayli olarak gösteren ses islemcisindeki bir hedef faz ölçüm belirleyicisinin sematik bir blok diyagramini gösterir; Sekil 38a Sekil 38b Sekil 43a Sekil 43b Sekil 43c Sekil 43d hedef spektrum üretecini daha detayli olarak gösteren ses islemcisindeki bir hedef Spektrum üretecinin sematik bir blok diyagramini gösterir; bir dekoderin sematik bir blok diyagramini gösterir; bir kodlaylcinin sematik bir blok diyagramini gösterir; bir ses sinyali olabilecek br veri akisinin sematik bir blok diyagramini gösterir; bir ses sinyalinin islenmesi için bir yöntemin sematik bir blok diyagraminl gösterir; bir ses sinyalinin kodunun çözülmesi için bir yöntemin sematik bir blok diyagramini gösterir; bir ses sinyalinin kodunun çözülmesi için bir yöntemin sematik bir blok diyagramini gösterir; bir zaman frekansi gösteriminde dogrudan kopyalama SBR'yi kullanarak QMF alanindaki trombon sinyalinin faz spektrumundaki bir hatayi gösterir; bir zaman-frekans gösteriminde düzeltilmis SBR'yi kullanarak QMF alanindaki trombon sinyalinin faz spektrumundaki bir hatayi gösterir; Sekil 38a'da gösterilen hataya iliskin frekans üzerindeki faz türevini gösterir; bir hesaplaylcinin sematik bir blok diyagramini gösterir; varyasyon belirleyicide sinyal akisini daha detayli olarak gösteren hesaplayicinin sematik blok diyagramini gösterir; diger bir düzenlemeye göre hesaplayicinin sematik bir blok diyagramini gösterir; bir ses sinyali için faz düzeltme verilerinin belirlenmesi için bir yöntemin sematik bir blok diyagramini gösterir; bir zaman frekansi gösteriminde bir QMF alanindaki viyolon sinyalinin zaman içerisindeki faz türevinin standart sapmasini gösterir; Sekil 4-3a'ya göre gösterilen zaman içerisindeki faz türevinin standart sapmasina karsilik gelen frekans üzerinden faz türevinin standart sapmasini gösterir; bir zaman frekansi gösteriminde bir OMF alanindaki trombon sinyalinin zaman içerisindeki faz türevinin standart sapmasini gösterir; Sekil 43c'de gösterilen sekilde zaman içerisindeki faz türevinin standart sapmasina iliskin frekans üzerindeki faz türevinin standart sapmasini Sekil 44a Sekil 45a Sekil 45b Sekil 46a Sekil 46b Sekil 48a Sekil 48b Sekil 50a Sekil 51a Sekil 51b Sekil 52b gösterir; bir zaman frekansi gösteriminde QMF alanindaki bir viyolon + alkis sinyalinin büyüklügünü gösterir; Sekil 44a'da gösterilen büyüklük spektrumuna iliskin olan faz spektrumunu gösterir; bir zaman frekansi gösteriminde bir QMF alanindaki viyolon + alkis sinyalinin zaman içerisindeki faz türevini gösterir; Sekil 45a'da gösterilen sekilde zaman içerisindeki faz türevine iliskin frekans üzerindeki faz türevini gösterir; bir zaman frekansi gösteriminde düzeltilmis SBR'yi kullanarak bir QMF alanindaki viyolon + alkis sinyalinin zaman içerisindeki faz türevini gösterir; Sekil 46a'da gösterilen sekilde zaman içerisindeki faz türevine iliskin frekans üzerindeki faz türevini gösterir; bir zaman frekansi gösteriminde QMF bantlarlnin frekanslarini gösterir; QMF bantlarinin dogrudan kopyalama SBR'sinin frekanslarlni, orijinal frekanslara kiyasla bir zaman-frekans gösteriminde gösterir; bir zaman-frekans gösterimindeki orijinal frekanslara kiyasla düzeltilmis SBR'yi kullanarak QMF bandinin frekanslarini gösterir; bir zaman-frekans gösteriminde orijinal sinyalin QMF bantlarinin frekanslarina kiyasla harmoniklerin tahmini frekanslarinl gösterir; bir zaman-frekans gösteriminde sikistirilmis düzeltme verileri ile düzeltilmis SBR'yi kullanarak QMF alanindaki viyolon sinyalinin zaman içerisindeki faz türevindeki hatayi gösterir; Sekil 50a'da gösterilen sekilde zaman içerisindeki faz türevinin hatasina iliskin zaman içerisindeki faz türevini gösterir; bir zaman diyagraminda trombon sinyalinin dalga seklini gösterir; yalniz tahmin edilen pikleri ihtiva eden Sekil 51a'daki trombon sinyaline iliskin olan zaman alani sinyalini göstermekte olup buada piklerin konumlari, iletilen meta veriler kullanilarak elde edilir; bir zaman-frekans gösteriminde sikistirilmis düzeltme verileri ile düzeltilmis SBR'yi kullanarak QMF alanindaki trombon sinyalinin faz spektrumundaki hatayi gösterir; Sekil 52a'da gösterilen faz spektrumundaki hataya iliskin frekans üzerindeki faz türevini gösterir; Sekil 53 bir dekoderin sematik bir blok diyagramini gösterir; Sekil 54 tercih edilen bir düzenlemeye göre sematik bir blok diyagrami gösterir; Sekil 55 diger bir düzenlemeye göre dekoderin sematik bir blok diyagramini gösterir; Sekil 56 bir kodlayicinin sematik bir blok diyagramini gösterir; Sekil 57 Sekil 56'da gösterilen kodlayicida kullanilabilecek olan bir hesaplayicinin blok diyagramini gösterir; Sekil 58 bir ses sinyalinin kodunun çözülmesi için bir yöntemin sematik bir blok diyagramini gösterir; ve Sekil 59 bir ses sinyalinin kodlanmasi için bir yöntemin sematik bir blok diyagramini gösterir. Asagida, bulusun düzenlemeleri daha detayli olarak açiklanacaktir. Ayni veya benzer islevsellige sahip olan ilgili sekillerde gösterilen elemanlar, ayni referans isaretleriyle iliskili olacaktir. Mevcut bulusun düzenlemeleri, spesifik bir sinyal islemine istinaden açiklanacaktir. Bu nedenle Sekiller 1-14, ses sinyaline uygulanan sinyal islemini açiklamaktadir. Düzenlemelerin bu spesifik sinyal islemine göre tarif edilmis olmasina ragmen, mevcut bulus bu islemle sinirli degildir ve diger birçok isleme semasina da uygulanabilir. Ayrica Sekiller 15-25, ses sinyalinin yatay faz düzeltmesi için kullanilabilecek olan bir ses islemcisinin düzenlemelerini göstermektedir. Sekiller 26-38, ses sinyalinin dikey faz düzeltmesi için kullanilabilecek olan bir ses islemcisinin düzenlemelerini göstermektedir. Ayrica Sekiller 39-52, bir ses sinyali için faz düzeltme verilerinin belirlenmesi için bir hesaplayicinin düzenlemelerini göstermektedir. Hesaplayici ses sinyalini analiz edebilir ve daha önce söz konusu olan ses islemcilerinin hangisinin uygulandigini veya ses sinyali için ses islemcilerini, ses sinyaline hiçbir sekilde uygulanmamasi için ses islemcisinin uygun olup olmadigini belirleyebilir. Sekiller 53-59, ikinci islemciyi ve hesaplayiciyi ihtiva edebilecek bir dekoder ve bir kodlayicinin düzenlemelerini göstermektedir. 1 Giris Algilanan ses kodlamasi, sinirli kapasiteye sahip iletim veya depolama kanallarini kullanan tüketicilere ses ve çoklu ortam saglayan her türlü uygulama için dijital teknoloji saglayan bir ana akis olarak çogalmistir. Modern algisal ses kodekleri, artan biçimde düsük olan bit hizlarinda tatmin edici ses kalitesi sunmak için gereklidir. Sirayla, dinleyicilerin çogunlugu tarafindan en fazla tolere edilebilen bazi kodlama yapilarina basvurmak gerekir. Ses Bant Genisligi Uzatimi (BWE), bir ses kodlayicinin frekans araligini Spektral ötelemesi veya iletilen düsük bantli sinyal parçalarinin belirli yapilari sunma pahasina yüksek bantta aktarilmasiyla yapay olarak genisletmek için kullanilan bir tekniktir. Bulgu, bu yapilarin bir kisminin, yapay olarak uzatilan yüksek bant içerisinde faz türevinin degismesi ile iliskilidir. Bu yapilardan biri, frekans üzerinde faz türevinin degisimidir dalga biçimi gibi bir puls katari ve oldukça düsük bir temel frekansa sahip olan tonal sinyaller için algisal olarak önemlidir. Dikey faz türevinin degisimi ile ilgili yapilar zaman içinde enerjinin lokal dagilimina denk gelir ve genellikle BWE teknikleri ile islenmis olan ses sinyallerinde bulunur. Bir baska yapi ise, herhangi bir temel frekansin armonik ses açisindan zengin tonal sinyalleri için algisal olarak önemli olan sekilde zaman içinde faz türevinin [bakiniz ayrica "yatay" faz uyumu) degismesidir. Yatay faz türevinin bir degisimi ile ilgili yapilar, alandaki lokal frekans kaymasina karsilik gelir ve genellikle BWE teknikleri ile islenmis olan ses sinyallerinde bulunur. Mevcut bulus, söz konusu ses bant genisligi uzantisinin [BWE] uygulanmasiyla bu özellik riske girdiginde bu tür sinyallerin dikey veya yatay faz türevini yeniden ayarlamak için araçlar sunar. Faz türevinin bir restorasyonunun algisal olarak faydali olup olmadigina ve dikey veya yatay faz türevinin ayarlanmasinin algisal olarak tercih edilip edilmeyecegine karar vermek için baska yollar saglanir. Spektral bant replikasyonu (SBR) [9] gibi bant genisligi uzatma yöntemleri siklikla düsük- bit hizli kodeklerde kullanilir. Daha yüksek bantlar hakkinda parametrik bilgi ile birlikte sadece nispeten dar olan düsük frekansli bir alanin iletilmesine izin verirler. Parametrik bilginin bit hizi küçük oldugundan, kodlama etkinliginde kayda deger bir iyilesme saglanabilir. Tipik olarak, daha yüksek bantlar için sinyal, sadece iletilen düsük frekans bölgesinden kopyalanarak elde edilir. Islem genellikle, asagida da belirtildigi üzere kompleks-modüle edilmis kare-yansima-filtre-bankasi (QMF) [10] alaninda gerçeklestirilir. Kopyalanan sinyal, büyüklük spektrumunu iletilen parametrelere dayanarak uygun edinimler ile çarparak islenir. Amaç, orijinal sinyalinki ile benzer bir büyüklük spektrumu elde etmektir. Aksine, kopyalanan sinyalin faz spektrumu tipik olarak hiç islenmez, bunun yerine k0pyalanmis faz spektrumu dogrudan kullanilir. Dogrudan kopyalanmis faz spektrumunun kullaniminin algisal sonuçlari asagida incelenmistir. Gözlenen etkilere dayanarak, algisal olarak en önemli etkilerin tespit edilmesi için iki ölçüm önerilmektedir. Dahasi, bunlara dayanarak faz spektrumunun nasil düzeltilecegi de önerilmektedir. Son olarak, düzeltmeyi gerçeklestirmek için iletilen parametre degerlerinin en aza indirilmesi için stratejiler de önerilmektedir. Bu bulus, faz türevinin korunmasinin veya restorasyonunun, ses bant genisligi uzamasi (BWE) teknikleriyle indüklenen önemli yapilari çözebildigi bulgusuyla ilgilidir. Örnegin, faz türevinin korunmasinin önemli oldugu tipik sinyaller, seslendirilmis konusma, pirinç aletler veya yayli çalgilar gibi zengin harmonik ses içerigi olan tonlardir. Mevcut bulus ayni zamanda faz türevinin bir restorasyonunun algisal olarak faydali olup olmadigina ve dikey veya yatay faz türevinin ayarlanmasinin algisal olarak tercih edilip edilmeyecegine karar vermek için - verilen bir sinyal çerçevesi için - baska araçlar da Bulus, asagidaki açilarla birlikte BWE teknikleri kullanilarak ses kodeklerinde faz türev düzeltmesi için bir cihaz ve bir yöntem ögretmektedir: 1. Faz türev düzeltmesinin "önemi"nin nicemlemesi 2. Dikey ["frekans"] faz türev düzeltmesi veya yatay ("zaman") faz türev düzeltmesinin sinyale bagli önceliklendirme 3. Düzeltme yönünün ["frekans" veya "zaman") sinyale bagli olarak degistirilmesi 4. Geçis durumlari için özel dikey faz türevi düzeltme modu . Pürüzsüz bir düzeltme için stabil parametrelerin elde edilmesi 6. Düzeltme parametrelerinin kompakt yan bilgi iletim formati 2 QMF alaninda sinyallerin gösterimi Bir zaman-alani sinyali x(m], burada m, ayrik zamandir, örnegin bir kompleks-modüle edilmis Kare Yansima Filtre bankasi (QMF) kullanilarak, bir zaman-frekans alaninda gösterilebilir. Ortaya çikan sinyal X[k,n) olup burada k, frekans bandi indeksidir ve n, temporal çerçeve indeksidir. Görüntülemeler ve düzenlemeler için 64 bantlik QMF ve 48 kHz'lik örnekleme frekansi fs varsayilmistir. Bu nedenle her bir frekans bandinin bant genisligi fBw, 375 Hz'dir ve temporal siçrama boyutu thop (Sekil 2'de 17] 1.33 ms'dir. Ancak islem, bu tür bir dönüsüm ile sinirli degildir. Alternatif olarak, bunun yerine bir MDCT kullanilabilir. Ortaya çikan sinyal X[k,n] olup burada k, frekans bandi indeksidir ve n, temporal çerçeve indeksidir.X[k,n], kompleks bir sinyaldir. Dolayisiyla, kompleks sayi olan j ile büyüklük Xmag(k,n) ve faz bilesenleri XPha(k,n] kullanarak da gösterilebilir Mk. n) 2 X magÜi'. n)eiK"""Umii (1) Ses sinyalleri, en sik XmagUçn] ve XPhaUgn) kullanilarak gösterilmistir (bakiniz Iki örnek için Sekil 1, bir viyolon sinyalinin büyüklük spektrumunu XmagUgn] göstermekte olup burada Sekil 1b, her ikisi de QMF alaninda olmak üzere ilgili faz spektrumunu XPha(k,n) gösterir. Ayrica Sekil 1c, bir trombon sinyalinin büyüklük spektrumunu Xmag[k,n) göstermekte olup burada Sekil 1d, ilgili QMF alaninda olmak üzere yeniden ilgili faz spektrumunu gösterir. Sekiller la ve lc'deki büyüklük spektrumlarina istinaden renk gradyeni, kirmizi = 0 dB'den mavi = -80 dB'ye kadar olan bir büyüklügü belirtir. Ayrica Sekiller 1b ve 1d'deki faz spektrumlari için renk gradyeni, kirmizi = Tr'den mavi = -7T'ye kadar olan fazlari belirtir 3 Ses verileri Tanimlanan bir ses isleminin etkisini göstermek için kullanilan ses verisi, bir trombonun bir ses sinyali için 'trombon', bir viyolonun bir ses sinyali için "viyolon" ve ortasinda bir alkis olan bir viyolon sinyali için "viyolon + alkis" olarak adlandirilir. 4 SBR'nin temel isletimi Sekil 2, bir zaman çerçevesi (15] ve bir alt bant (20] ile tanimlanan sekilde zaman frekans desenlerini (10] [örnegin QMF kaplari, Kareli Yansima Filtre bankasi kaplari] ihtiva eden bir zaman frekans diyagramini (5) gösterir. Bir ses sinyali, bir QMF [Kare Yansima Filtre bankasi] dönüsümü, bir MDCT (Modifiye Ayrik Kosinüs Dönüsümü] veya bir DFT (Ayrik Fourier Dönüsümü) kullanilarak böyle bir zaman frekans gösterimine dönüstürülebilir. Ses sinyalinin zaman çerçevelerinde bölünmesi, ses sinyalinin üst üste gelen kisimlarini ihtiva edebilir. Sekil 1'in alt kisminda, maksimum iki zaman çerçevesinin ayni anda üst üste bindigi tekli zaman çerçeveleri (15) örtüsmesi gösterilmektedir. Ayrica, diger bir deyisle daha fazla artiklik gerekirse, ses sinyali de çoklu örtüsme kullanilarak bölünebilir. Çoklu örtüsme algoritmasinda, üç veya daha fazla zaman çerçevesi, belirli bir zaman noktasinda ses sinyalinin ayni bölümünü ihtiva edebilir. Örtüsmenin süresi, siçrama boyutu thop 17'dir. Bir sinyal X(k, n) varsayildiginda, bant genisligi genisletilmis [BWE] sinyal Z(k, n), iletilen düsük frekansli frekans bandinin belirli bölümlerini kopyalayarak X(k, ri) giris sinyalinden elde edilir. Bir SBR algoritmasi, iletilecek olan frekans bölgesinin seçilmesi ile baslar. Bu örnekte 1 ila 7 arasindaki bantlar seçilmistir: V1 S I( 5 7 : XtmnSUg 71) = X(k, n) . (2) iletilecek olan frekans bantlarinin miktari, istenen bit hizina baglidir. Sekiller ve denklemler, 7 bant kullanilarak üretilir ve karsilik gelen ses verileri için 5 ila 11 bant kullanilir. Böylece, iletilen frekans bölgesi ve daha yüksek bantlar arasindaki çapraz frekanslar, sirasiyla 1875 ila 4125 Hz arasindadir. Bu alanin üzerindeki frekans bantlari hiç iletilmez, ancak bunun yerine bunlari açiklamak için parametrik meta veriler 01usturulur.Xtmns(k,n] kodlanir ve iletilir. Basitlik saglamak açisindan daha sonraki islemin varsayilan durum ile sinirli olmadiginin görülmesine ragmen, kodlamanin herhangi bir sekilde sinyali modifiye etmedigi varsayilir. Alici uçta, iletilen frekans bölgesi dogrudan karsilik gelen frekanslar için kullanilir. Daha yüksek bantlar için sinyal, iletilen sinyal kullanilarak olusturulabilir. Bir yaklasim ise, iletilen sinyali daha yüksek frekanslara kopyalamaktir. Biraz degistirilmis bir versiyonu burada kullanilmaktadir. Birincisi, bir taban bant sinyali seçilir. Iletiler) bütün bir sinyal olabilir, ancak bu düzenlemede birinci frekans bandi göz ardi edilir. Bunun nedeni, çogu durumda birinci bant için faz spektrumunun düzensiz oldugunun fark edilmesidir. Bu nedenle kopyalanacak taban bant, asagida belirtilen sekilde tanimlanir Vl i; I( '5 o : &3.33023 n) r: Xhtmsfk *l* '1, si) g (3) Diger bant genislikleri de iletilen ve taban bant sinyalleri için kullanilabilir. Taban bant sinyalini kullanarak daha yüksek frekanslar için ham sinyaller olusturulur burada Yraw(lgn,i'], frekans yamasi iiçin kompleks QMF sinyalidir. Ham frekans-yama sinyalleri, edinimlerg(k,n,i') ile çarpilarak meta verilerin iletilmesine göre manipüle edilir Y(k,7i, i) = meûc, n, i)g(k,n, i). (5) Edinimlerin gerçek degerde oldugu ve dolayisiyla sadece büyüklük spektrumunun etkilendigi ve böylece istenen bir hedef degere adapte edildigi belirtilmelidir. Bilinen yaklasimlar, edinimlerin nasil elde edildigini göstermektedir. Hedef faz, söz konusu bilinen yaklasimlarda düzeltilmeden kalir. Yeniden üretilecek olan son sinyal, istenen bant genisliginin bir BWE sinyalini elde etmek için bant genisliginin kesintisiz olarak uzatilmasi için iletilen ve yama sinyallerinin birlestirilmesiyle elde edilir. Bu düzenlemede 1' = 7 varsayilmistir. ZÜC› 71) = Xtrans(ki n)i Z(k+6i +l,n)== Y(k,n,i). (6) Sekil 3, grafik bir gösterimde tanimlanan sinyalleri göstermektedir. Sekil 3a, bir ses sinyalinin örnek bir frekans diyagramini göstermekte olup burada frekansin büyüklügü, ondan fazla farkli alt bant üzerinde göstermektedir. Ilk yedi alt bant, iletilen frekans bantlari Xtrans(k,n] (25) yansitir. Taban bant XbaseUçn] (30), ikinci ila yedinci alt bantlari seçerek türevlenir. Sekil 3a, orijinal ses sinyalini, yani iletim veya kodlama öncesindeki ses sinyalini, gösterir. Sekil 3b, alim sonrasinda, örnegin bir ara asamada bir kod çözme islemi esnasinda, ses sinyalinin örnek bir frekans gösterimini göstermektedir. Ses sinyalinin frekans spektrumu, iletilen frekans bantlarini (25) ve frekans bandinin daha yüksek alt bantlarina kopyalanan yedi taban bant sinyalini [30), taban banttaki frekanslardan daha yüksek frekanslar ihtiva eden bir ses sinyali (32] olusturur. Tam taban bant sinyali ayni zamanda bir frekans yamasi olarak da belirtilir. Sekil 3c, yeniden yapilandirilan ses sinyalini Z(k,n] (35] gösterir. Sekil 3b'ye kiyasla taban bant sinyallerinin yamalari, bir edinim faktörü ile ayri ayri çarpilir. Bu nedenle ses sinyalinin frekans spektrumu, ana frekans spektrumunu [25] ve bir dizi büyüklügü düzeltilmis yamalari Y(k,ri,1] (40] ihtiva eder. Bu yama yöntemi, dogrudan kopyalama yamasi olarak ifade edilir. Bulusun bu tür bir yama algoritmasi ile sinirli olmamasina ragmen, dogrudan kopyalama yamasi, mevcut bulusu tarif etmek için örnek olarak kullanilmaktadir. Kullanilabilecek diger bir yama algoritmasi da, örnegin harmonik bir yama algoritmasidir. Yüksek bantlarin parametrik gösteriminin sorunsuz oldugu, yani, yeniden olusturulmus sinyalin büyüklük spektrumunun orijinal sinyalinki ile ayni oldugu varsayilmaktadir. ZmagUc, n) : XmagUc, n). (7) Ancak, faz spektrumunun algoritma tarafindan herhangi bir sekilde düzeltilmedigine dikkat edilmelidir, bu nedenle algoritma mükemmel çalissa bile dogru degildir. Bu nedenle, düzenlemeler, Z(k, n) faz spektrumunun, algilanan kalitenin bir iyilesmesinin elde edilecegi sekilde bir hedef degere ek olarak nasil adapte edilecegini ve düzeltilecegini göstermektedir. Düzenlemelerde düzeltme, "yatay , dikey" ve "geçis" olmak üzere üç farkli islem modu kullanilarak gerçeklestirilebilir. Bu modlar, asagida ayri ayri açiklanmistir. ZmâgUçn] ve ZPha(k,n], viyolon ve trombon sinyalleri için Sekil 4'te gösterilmistir. Sekil 4, dogrudan kopyalanmis yama ile spektral bant genisligi replikasyonunu [SBR] kullanan yeniden yapilandirilmis ses sinyalinin (35) örnek spektrumlarini göstermektedir. Bir viyolon sinyalinin büyüklük spektrumu Zmag(k,n], Sekil 4a'da gösterilmis olup burada Sekil 4b, ilgili faz spektrumunu ZPhaUgn) göstermektedir. Sekiller 4c ve 4d, bir trombon sinyali için ilgili spektrumlari göstermektedir. Sinyallerin tümü, QMF alaninda gösterilmektedir. Sekil 1'de görüldügü üzere renk gradyeni, kirmizi = 0 dB'den mavi = -80 dB'ye bir büyüklügü ve kirmizi = n'den mavi = -n'ye bir fazi belirtir. Faz spektrumlarinin, orijinal sinyallerin spektrumlarindan farkli oldugu görülebilir [bakiniz Sekil 1). SBR'ye bagli olarak viyolonun, çapraz frekanslarda modülasyon seslerini ihtiva edecek sekilde, uyumsuzlugu ve trombonu ihtiva ettigi düsünülmektedir. Bununla birlikte, faz grafikleri oldukça rastgeledir ve farkli olduklarininin ve farkliliklarin algisal etkilerinin ne oldugunu söylemek gerçekten zordur. Ayrica, bu tür rastgele veriler için düzeltme verileri göndermek, düsük bit hizi gerektiren uygulamalarin kodlanmasi için uygun degildir. Bu nedenle, faz Spektrumunun algisal etkilerini anlamak ve bunlari açiklamak için ölçümler bulmak gereklidir. Bu konular, asagidaki bölümlerde açiklanmistir. QMF alaninda faz spektrumunun anlami Siklikla, frekans bandi indeksinin, tek bir tonal bilesenin sikligini tanimladigi, büyüklügünün ise seviyeyi tanimladigi ve fazin ise "zamanlamayi" tanimladigi düsünülmektedir. Ancak, bir QMF bandinin bant genisligi nispeten büyüktür ve veriler, yüksek hizda örneklenir. Bu nedenle, zaman-frekans karolari (yani QMF kutulari] arasindaki etkilesim aslinda bu özelliklerin tümünü tanimlar. kutusunun zaman-alani gösterimi, Sekil S'te verilmistir. Sonuç, 13.3 ms uzunlugunda sinc benzeri bir fonksiyondur. Fonksiyonun tam sekli, faz parametresi ile tanimlanir. Yalniz tek bir frekans bandinin tüm temporal çerçeveler için sifir olmadigi bir durumu göz önüne alarak, yani, VnEiNszag(3,n)=1. (8) Sabit bir deger a, yani, ile temporal çerçeveler arasindaki fazin degistirilmesi ile bir sinüzoid olusur. Ortaya çikan sinyal (yani, ters QMF dönüsümünden sonra zaman-alani sinyali), 01 = 71/4 (üst) ve 37[alt] degerleri ile Sekil 6'da gösterilmistir. Sinüzoidin frekansinin faz degisikliginden etkilendigi görülebilir. Frekans alani sag tarafta gösterilmis olup burada sinyalin zaman alani, Sekil 6'nin sol tarafinda gösterilmistir. Buna karsilik, eger faz rastgele seçilirse sonuç, bir dar bant gürültüsüdür (bakiniz Sekil 7). Dolayisiyla, bir QMF kutusunun fazinin, karsilik gelen frekans bandinin içindeki frekans Içerigini kontrol ettigi söylenebilir. Sekil 8, Sekil 6'da açiklanan etkiyi, dört zaman çerçevesinin ve dört frekans alt bandinin bir zaman frekansi temsilinde göstermekte olup, burada sadece üçüncü alt bant, sifirdan farkli bir frekansi ihtiva eder. Bu, Sekil 8'in sag tarafinda sematik olarak ve Sekil 8'in alt kisminda sematik olarak sunulan Sekil 6'nin zaman alani temsilinde gösterilen Sekil 6'dan gelen frekans alani sinyali ile sonuçlanir. Yalniz bir temporal çerçevenin tüm frekans bantlari için sifir olmadigi bir durum göz önüne alindiginda, yani, Sabit bir deger a, yani, ile frekans bantlar arasindaki fazi degistirerek bir geçis olusturulur. Ortaya çikan sinyal degerleri ile Sekil 9'da gösterilmistir. Geçisin temporal konumunun faz degisikliginden etkilendigi görülebilir. Frekans alani, Sekil 9'un sag tarafinda gösterilmis olup burada sinyalin zaman alani, Sekil 9'nin sol tarafinda gösterilmistir. Buna karsilik, eger faz rastgele seçilirse sonuç, bir kisa gürültü patlamasidir (bakiniz Sekil ). Böylece, bir QMF kutusunun fazinin, ayni zamanda, ilgili temporal çerçeve içindeki harmoniklerin temporal konumlarini kontrol ettigi söylenebilir. Sekil 11, Sekil 8`de gösterilen zaman frekans semasina benzer bir zaman frekans semasini göstermektedir. Sekil 11'de, sadece üçüncü zaman çerçevesi, sifirdan farkli olan bir alt banttan digerine n/4'lük bir zaman kaymasina sahip olan degerler mevcuttur. Bir frekans alanina dönüstürüldügünde, Sekil 9'un sag tarafindaki sematik olarak gösterilen frekans alani sinyali, Sekil 11'in sag tarafinda sematik olarak gösterilir. Sekil 9'un sol kisminin bir zaman alani temsilinin bir semasi, Sekil 11'in altinda gösterilmistir. Bu sinyal, zaman frekansi alanini bir zaman alani sinyaline dönüsmesi ile sonuçlanir. 6 Faz spektrumunun algisal olarak ilgili özelliklerinin açiklanmasi için ölçümler Bölüm 4'te tartisildigi gibi, faz spektrumu kendi içinde oldukça karmasiktir ve algi üzerindeki etkisinin ne oldugunu dogrudan görmek zordur. Bölüm 5, QMF alanindaki faz spektrumunun manipüle edilmesiyle olusabilecek iki etkiyi gösterir: [a] zaman içinde sabit faz degisimi, bir sinüzoidi üretir ve faz degisim miktari, sinüzoidin frekansini kontrol eder ve (b) frekans üzerinden sabit faz degisimi, bir geçisi meydana getirir ve faz degisimi miktari, geçis akisinin temporal konumunu kontrol eder. Bir kismin frekans ve temporal konumu, insan algisi için açikça önemlidir, bu nedenle bu özelliklerin saptanmasi potansiyel olarak yararlidir. Zaman içerisinde faz türevinin (PDT) hesaplanmasi .ÃiL'îLi-lç, n) 2 ,17" &En -r 1.) - Bi› "Çr, n) (12) ve frekans üzerinde faz türevinin (PDF) hesaplanmasi ile tahmin edilebilir xvdf(k,in) z xwe( + m) r xphaçm). (13) Xpdî[k,n), frekansa iliskindir ve XPdf[k,n], bir parçanin temporal konumuna iliskindir. QMF analizinin özelliklerine bagli olarak (bitisik temporal çerçevelerin modülatörlerinin fazlarinin bir geçisin konumuyla nasil eslestikleri], düzgün egriler olusturmak için XPdf(lçn)'nin esit geçis karelerine görsellestirme amaciyla sekillerde temporal çerçevelere 7r eklenir. Daha sonra, bu ölçümlerin örnek sinyallerimiz için nasil göründügü incelenir. Sekil 12, Viyolon ve trombon sinyalleri için türevleri göstermektedir. Daha spesifik olarak, Sekil 12a, QMF alanindaki orijinal, yani islenmemis, Viyolon ses sinyalinin XPdt(k,n] zamani boyunca bir faz türevini göstermektedir. Sekil 12b, frekans XpdfUçn) üzerinde ilgili bir faz türevini gösterir. Sekiller 12c ve 12d, zaman içinde faz türevini ve bir trombon sinyali için frekans üzerinden faz türevini göstermektedir. Renk gradyeni, kirmizi = n'den mavi = -Tr'ye kadar olan faz degerlerini belirtir. Viyolon için, büyüklük spektrumu temel olarak yaklasik 0.13 saniyeye kadar gürültüdür [bakiniz Sekil 1) ve dolayisiyla türevleride gürültülüdür. Yaklasik 0.13 saniyeden baslayarak XPdt'nin, zaman içerisinde nispeten stabil degerler elde etmeye basladigi görülür. Bu, sinyalin güçlü, nispeten stabil sinüzoidler ihtiva ettigi anlamina gelir. Bu sinüzoidlerin frekanslari, XPdt degerleri ile belirlenir. Aksine Xpdf çizimi, riISpeten gürültülü olarak görünür, bu nedenle bunu kullanan viyolon için ilgili verileri bulunamaz. Trombon için XPdti nispeten gürültülüdür. Aksine Xpdf'nin, tüm frekanslarda yaklasik olarak ayni degere sahip oldugu görülür. Uygulamada bu, tüm harmonik bilesenlerin geçici bir sinyal üreten zamanda hizalanmis oldugu anlamina gelir. Bu geçislerin temporal lokasyonlari, Xvdf degerleri ile belirlenir. Ayni türevler ayni zamanda SBR-islenmis sinyalleri Z[k,n) için de hesaplanabilir (bakiniz Sekil 13). Sekiller 13a ila 13d, daha önce açiklanan dogrudan kopyalama SBR algoritmasini kullanarak türevlenen Sekiller 12a ila 12d'ye iliskindir. Faz spektrumu taban bandindan daha yüksek yamalara basitçe kopyalandigindan, frekans yamalarinin PDT'leri taban bandininki ile aynidir. Bu nedenle, viyolon için, PDT, orijinal sinyalin durumunda oldugu gibi, zaman içinde stabil sinüzoidler üreten sekilde nispeten pürüzsüzdür. Ancak, Zivdt degerleri orijinal sinyalin Xpdt'sinden farklidir; bu da üretilen sinüzoidlerin orijinal sinyalden farkli frekanslara sahip olmasina neden olur. Bunun algisal etkisi, Bölüm 7'de tartisilmistir. Buna karsilik, frekans yamalarinin PDF'i taban bandi ile aynidir, fakat çapraz-geçis frekanslarinda PDF, uygulamada, rastgeledir. Çapraz geçiste, PDF aslinda frekans bandinin son ve birinci faz degeri arasinda hesaplanir, yani, ZpdtÜ, n) : zpimw, n) ~ ZPMÜ, n) : wma, n, i) - www, n, i) (14) Bu degerler gerçek PDF'ye ve çapraz frekansa baglidir ve orijinal sinyalin degerleri ile uyusmaz. Trombon için, kopyalanan sinyalin PDF degerleri, çapraz frekanslardan ayridir. Bu nedenle, harmoniklerin çogunun temporal konumlari dogru yerdedir, ancak çapraz frekanslardaki harmonikler uygulamada rasgele konumlardadirlar. Bunun algisal etkisi, Bölüm 7'de tartisilmistir. Sesler kabaca iki kategoriye ayrilabilir: harmonik ve gürültü benzeri sinyaller. Gürültü benzeri sinyaller zaten tanim geregi gürültülü faz özelliklerine sahiptir. Bu nedenle, SBR'nin neden oldugu faz hatalarinin kendileriyle algisal olarak anlamli olmadigi varsayilmaktadir. Bunun yerine harmonik sinyaller üzerinde yogunlasmistir. Müzik aletlerinin çogu ve ayrica konusma, sinyale harmonik bir yapi üretir, yani, ton frekansta frekansa göre ayarlanmis güçlü sinüzoidal bilesenler ihtiva eder. Insan isitmesi genellikle, isitsel filtreler olarak adlandirilan, örtüsen bir bant geçiren filtrelerden olusan bir bankayi içeriyormus gibi davranir. Bu nedenle, isitme cihazinin, karmasik sesleri isledigi varsayilabilir, böylece isitsel filtrenin içindeki kismi sesler tek bir varlik olarak analiz edilir. Bu filtrelerin genisligi, asagida belirtilen formüle göre belirlenebilen sekilde esdeger dikdörtgen bant genisligini (ERB) [11] takip edecek sekilde yaklastirilabilir burada fc, bandin merkezi frekansidir (kHz cinsinden). Bölüm 4'te tartisildigi gibi, taban bant ve SBR yamalari arasindaki çapraz frekans yaklasik 3 kHz`dir. Bu frekanslarda ERB, yaklasik olarak 350 l-lz'dir. Bir QMF frekans bandinin bant genisligi aslinda buna nispeten yakin olarak 375 I-lz'dir. Bu nedenle, QMF frekans bantlarinin bant genisliginin, ilgili frekanslarda ERB'yi takip ettigi varsayilabilir. Yanlis faz spektrumu nedeniyle yanlis gidebilecek bir sesin iki özelligi, Bölüm 6: Kismi bir bilesenin sikligi ve zamanlamasi içerisinde gözlendi. Frekans üzerinde yogunlasin, buradaki soru sudur, insanin duymasi bireysel harmoniklerin frekanslarini algilayabilir mi? Eger yapabilirse, SBR'nin neden oldugu frekans kaymasi düzeltilmelidir ve degilse, düzeltme gerekli degildir. Çözülmüs ve çözülmemis harmonikler için [12] bu konuyu açiklamak için kullanilabilir. ERB içinde sadece bir harmonik varsa, harmonik, çözümlenmis olarak adlandirilir. Tipik olarak insan isitme islemlerinin harmonikleri tek tek çözdügü ve bu nedenle bunlarin frekanslarina duyarli oldugu varsayilir. Uygulamada, çözümlenmis harmoniklerin frekansinin degistirilmesinin, uyumsuzluga neden oldugu sekilde algilanmaktadir. Buna karsilik, ERB içinde çoklu harmonikler varsa, harmonikler, çözülmez olarak adlandirilir. Insan duyularinin bu harmonikleri bireysel olarak islemedikleri varsayilmakta, bunun yerine eklem etkileri isitme sistemi tarafindan görülmektedir. Sonuç, periyodik bir sinyaldir ve sürenin uzunlugu harmoniklerin araligina göre belirlenir. Yükseklik algisi, sürenin uzunluguna baglidir, bu nedenle insanin isitme duyusuna duyarli oldugu varsayilir. Yine de, SBR'deki frekans bandi içindeki tüm harmonikler ayni miktarda kaydirilirsa, harmonikler ve dolayisiyla algilanan yükseklik arasindaki bosluk ayni kalir. Bu nedenle, çözümlenmemis harmonikler söz konusu oldugunda, insan isitme, frekans kaymalarini uyumsuzluk olarak algilamaz. SBR'nin neden oldugu zamanlama ile ilgili hatalar daha sonra ele alinacaktir. Harmonik bir bilesenin temporal konumunun veya fazin zamanlamasi kastedilmektedir. Bu bir QMF kutunun faziyla karistirilmamalidir. Zamanlama ile ilgili hatalarin algilanmasi detayli olarak incelenmistir [13]. Insan sinyallerinin çogunun, harmonik bilesenlerin zamanlamasina veya fazina duyarli olmadigi gözlemlenmistir. Bununla birlikte, insan isitme duyusunun, kismi zamanlamaya karsi çok hassas oldugu belirli sinyaller vardir. Sinyaller, örnegin, trombon ve trompet seslerini ve konusmayi ihtiva eder. Bu sinyallerle, belirli bir faz açisi, tüm harmonikler ile ayni anda gerçeklesir. Farkli isitsel bantlarin nöral ateslenme orani simüle edildi [13]. Bu faza duyarli sinyaller ile üretilen sinirsel atesleme oraninin tüm isitsel bantlarda kesik oldugu ve doruklarin zaman içinde hizalandigi görülmüstür. Tek bir harmonigin bile fazini degistirmek, nöral atesleme hizinin bu sinyallerle doruklugunu degistirebilir. Resmi dinleme testinin sonuçlarina göre, insan isitmesi buna duyarlidir [13]. Üretilen etkiler, fazin degistirildigi frekanslarda eklenen sinüzoidal bir bilesenin veya dar bant gürültüsünün algilanmasidir. Ek olarak, zamanlama ile iliskili etkilere olan duyarliligin harmonik tonun temel frekansina bagli oldugu bulundu [13]. Temel frekans ne kadar düsükse, algilanan etkiler de 0 kadar büyüktür. Temel frekans yaklasik 800 Hz'nin üzerindeyse, isitsel sistem, zamanlama ile ilgili etkilere hiç duyarli degildir. Bu nedenle, eger temel frekans düsükse ve harmoniklerin fazi frekansa göre hizalanmissa harmoniklerin zamanlamasindaki veya baska bir deyisle fazindaki, degisiklikler, insanin isitmesi tarafindan algilanabilir. Temel frekans yüksekse ve/veya harmoniklerin fazi frekansa göre hizalanmamissa, insan duyumu harmoniklerin zamanlamasindaki degisikliklere duyarli degildir. 8 Düzeltme yöntemleri Bölüm 7'de, insanlarin çözülmüs harmoniklerin frekanslarindaki hatalara duyarli olduklari kaydedilmistir. Ek olarak, insanlar, temel frekans düsükse ve harmonikler frekans üzerinden hizalanmissa, harmoniklerin temporal konumlarindaki hatalara duyarlidir. SBR, Bölüm 6'da tartisildigi gibi, bu hatalarin her ikisine de neden olabilir, böylece algilanan kalite, düzeltilerek iyilestirilebilir. Bunu yapmak için yöntemler bu bölümde önerilmektedir. Sekil 14, düzeltme yöntemlerinin temel fikrini sematik olarak göstermektedir. Sekil 14a, sematik olarak, örnegin, bir birim dairedeki müteakip zaman çerçevelerinin veya frekans alt bantlarinin dört fazini (45a-d) göstermektedir. Fazlar (45a-d), esit olarak 90°`lik araliklarla yerlestirilmistir. Sekil 14b, SBR isleminden sonraki asamalari ve kesikli çizgiler halinde düzeltilmis fazlari göstermektedir. islemden önceki faz (45a), faz açisina (45a') kaydirilabilir. Aynisi fazlar (45b ila 45d) için de geçerlidir. lslemden sonraki fazlar, yani faz türevi arasindaki farkin, SBR isleminden sonra bozulabilecegi gösterilmistir. Örnegin, fazlar [45a' ve 45b') arasinda, islemden önce 90°olan fark, SBR isleminden sonra 110° idi. Düzeltme yöntemleri 90°'lik eski faz türevini almak için faz degerlerini (45b') yeni faz degerine [45b") degistirecektir. Ayni düzeltme, fazlar [45d' ila 45d") için de geçerlidir. 8.1 Frekans hatalarinin düzeltilmesi - Yatay faz türevi düzeltmesi Bölüm 7'de tartisildigi üzere, insanlar çogunlukla bir ERB içinde sadece bir harmonik oldugunda bir harmonik frekansindaki bir hatayi algilayabilirler. Ayrica bir QMF frekans bandinin bant genisligi, birinci çaprazda ERB'yi tahmin etmek için kullanilabilir. Bu nedenle, frekans sadece bir frekans bandinda bir harmonik oldugunda düzeltilmelidir. Bu oldukça uygundur, çünkü Bölüm 5, eger bant basina bir harmonik varsa, üretilen PDT degerlerinin stabil oldugunu veya zaman içinde yavasça degistigini ve düsük bit hizini kullanilarak düzeltilebilecegini gösterdi. Sekil 15, bir ses sinyalini (55) islemek için bir ses islemcisini (50) göstermektedir. Ses islemcisi (50) bir ses sinyali fazi ölçüm hesaplayicisi (60), bir hedef faz Ölçüm belirleyicisi (65) ve bir faz düzelticisini (70) ihtiva eder. Ses sinyali fazi hesap hesaplayicisi (60), bir zaman çerçevesi (75) için ses sinyalinin (55) bir faz ölçümünü (80) hesaplamak üzere konfigüre edilmistir. Hedef faz ölçüm belirleyicisi (65), söz konusu zaman çerçevesi (75) için bir hedef faz ölçümünü (85) belirlemek için konfigüre edilmistir. Ayrica, faz düzeltici, islenmis bir ses sinyali (90) elde etmek için hesaplanan faz ölçümü [80) ve hedef faz ölçümü (85) kullanilarak zaman çerçevesi (75) için ses sinyalinin (55) fazlarini (45) düzeltmek üzere konfigüre edilmistir. Istege bagli olarak, ses sinyali (55) zaman çerçevesi düzenlemeleri, Sekil 16'ya göre açiklanmaktadir. Bir düzenlemeye göre, hedef faz ölçüm belirleyicisi (65), ikinci bir alt bant sinyali (95b) için bir birinci hedef faz ölçümü (85a) ve bir ikinci hedef faz ölçümü (85b) belirlemek için konfigüre edilir. Buna göre, ses sinyali fazi hesap hesaplayicisi (60), birinci alt-bant sinyali (95a) için bir birinci faz ölçümü (80a) ve ikinci alt-bant sinyali (95b) için bir ikinci faz ölçümü (80b) belirlemek için konfigüre edilmistir. Faz düzelticisi, ses sinyalinin (55) birinci faz ölçümü (80a) ve birinci hedef faz ölçümü (85a) kullanilarak birinci alt-bant sinyalinin (95a) bir fazini (45a) düzeltmek ve ses sinyalinin (55) ikinci faz ölçümünü (80b) ve ikinci hedef faz ölçümünü (85b) kullanarak ikinci alt bant sinyalinin (95b) ikinci fazini [45b] düzeltmek için konfigüre edilir. Ayrica, ses islemcisi (50) islenmis birinci alt bant sinyalini (95a) ve islenmis ikinci alt bant sinyalini (95b) kullanarak islenmis ses sinyalini (90) sentezlemek için bir ses sinyali sentezleyicisi (100) ihtiva eder. Diger düzenlemelere göre, faz ölçümü (80), zaman içindeki bir faz türevidir. Bu nedenle, ses sinyali fazi hesap hesaplayicisi (60), birden fazla alt bandin her bir alt bandi (95) için, bir mevcut zaman çerçevesinin (75b) bir faz degerinin (45) faz türevini ve bir gelecek zaman çerçevesinin (75c) bir faz degerini hesaplayabilir. Buna göre, faz düzelticisi (70), mevcut zaman çerçevesinin (75b) çok sayidaki alt bandin (95) her bir alt bandi (95) için, hedef faz türevi (85) ile zaman içinde faz türevini (80) arasindaki bir sapmayi hesaplayabilecek olup burada, faz düzelticisi (70) tarafindan gerçeklestirilen bir düzeltme, sapma kullanilarak gerçeklestirilir. Düzenlemeler, zaman çerçevesi (75) içindeki ses sinyalinin (55) farkli alt bantlarinin alt bant sinyallerini (95) düzeltmek için yapilandirilmis olan faz düzelticisini (70) gösterir, böylece düzeltilmis alt bant sinyallerinin (95) frekanslari, ses sinyalinin (55) temel frekansina harmonik olarak atanmis frekans degerlerine sahiptir. Temel frekans, ses sinyalinde (55) veya baska bir deyisle, ses sinyalinin (55) birinci harmoniklerinde meydana gelen en düsük frekanstir. Bundan baska, faz düzelticisi (70), bir önceki zaman çerçevesi, mevcut zaman çerçevesi ve bir sonraki zaman çerçevesi (75a ila 75C) boyunca çok sayida alt-bandin her bir alt bandi (95) için sapmayi (105) pürüzsüzlestirmek üzere konfigüre edilmistir ve bir alt bant (95) içindeki sapmanin (105) hizli degisikliklerinin azaltilmasi için konfigüre edilmistir. Diger düzenlemelere göre, pürüzsüzlük agirlikli bir ortalamadir, burada faz düzelticisi (70), ses sinyalinin (55) büyüklügü ile agirliklidir, önceki, mevcut ve gelecekteki zaman çerçevelerine (75a ila 75C) göre önceki, simdiki ve gelecek zaman çerçevesi (75a ila 75C) içerisinde agirlikli ortalamanin hesaplanmasi için konfigüre edilir. Düzenlemeler, daha önce tarif edilen islem asamalarini vektör bazinda göstermektedir. Bu nedenle, faz düzelticisi (70), sapmalarin (105) bir vektörünü olusturmak için konfigüre edilmis olup, burada vektörün bir birinci elemani, çok sayida alt-bandin birinci alt bandi (95a) için bir birinci sapma (105a) ile ilgilidir ve vektörün ikinci elemani, bir önceki zaman çerçevesinden (75a) bir mevcut zaman çerçevesine (75b) birçok alt bantin ikinci alt bandi (95b) için ikinci bir sapmayi (105b) ifade eder. Dahasi, faz düzelticisi (70), sapma vektörünü (105) ses sinyalinin (55) fazlarina (45) uygulayabilir, burada vektörün birinci elemani, ses sinyalinin (55) bir fazina (45a) çok sayida bir birinci alt banda (953) uygulanir ve ses sinyalinin (55) alt bantlari ve vektörün ikinci elemani, ses sinyalinin (55) çok sayida alt bandinin ikinci bir alt bandindaki (95b) ses sinyalinin (55b) bir fazina (45b) uygulanir. Baska bir bakis açisina göre, ses islemcisindeki (50) tüm islemin vektör tabanli oldugu, burada her vektörün bir zaman çerçevesini (75) temsil ettigi belirtilebilmekte olup burada birçok alt bandin her bir alt bandi (95) vektörün bir elemanini ihtiva eder. Diger düzenlemeler, mevcut zaman çerçevesi (75b) için temel bir frekans tahminini (85b) elde etmek için yapilandirilan hedef faz ölçüm belirleyicisine odaklanmakta olup burada hedef faz ölçüm belirleyicisi (65), zaman çerçevesi (75) için temel frekans tahminini (85) kullanarak zaman çerçevesi (75) için alt bantlarin çogunlugunun her bir alt bandi için bir frekans tahminini (85) hesaplamak için konfigüre edilir. Ayrica, hedef faz ölçüm belirleyicisi (65), birden fazla alt bandin her bir alt bandi (95) için frekans tahminlerini (85) toplam alt bant (95) ve ses sinyalinin (55) örnekleme frekansi sayisini kullanilarak zaman içinde bir faz türevine dönüstürebilir. Açiklik saglamak için, hedef faz ölçüm belirleyicisinin (65) çikisinin (85), düzenlemeye bagli olarak zaman içinde frekans tahmini veya faz türevi olabilecegine dikkat edilmelidir. Bu nedenle, bir düzenlemede, frekans tahmini faz düzelticisinde (70) daha fazla islem için dogru formati ihtiva etmekte olup, burada baska bir düzenlemede, frekans tahmini, zaman içinde bir faz türevi olabilen uygun Buna göre, hedef faz ölçüm belirleyicisi (65), vektör bazinda da görülebilir. Bu nedenle, hedef faz ölçüm belirleyicisi (65), çok sayida alt-bandin her bir alt bandi (95) için bir frekans tahminleri (85) vektörünü olusturabilmekte olup burada vektörün birinci elemani, bir birinci alt-bant (95a) için bir frekans tahmine (85a) ve vektörün ikinci elemani, ikinci bir alt bant (95b) için bir frekans tahmini (85b) anlamina gelir. Ek olarak, hedef faz ölçüm belirleyicisi (65), frekansi (85) temel frekansin katlarini kullanarak hesaplayabilmekte olup burada mevcut alt bandin (95) frekans tahmini (85), alt bandin (95) merkezine en yakin olan temel frekansin katlaridir veya burada, mevcut alt-bandin frekans tahmini (85), temel frekansin katlari mevcut alt bant (95) içinde degilse, mevcut alt-bandin (95) bir sinir frekansidir. Baska bir deyisle, ses islemcisini (50) kullanan harmoniklerin frekanslarindaki hatalari düzeltmek için önerilen algoritma asagidaki gibi çalisir. Birincisi PDT hesaplanir ve SBR, ZPdt sinyalini isler. ZPdt(k,ri) :ZPha(k,n+1) -ZPha(k,n). Yatay düzeltme için hedef PDT ile arasindaki fark hesaplanir: uvdtikm) = zpdtuç, n) - 231%, 71). (iöa) Bu noktada hedef PDT'nin, giris sinyalinin girisinin PDT'sine esit oldugu varsayilabilir thdtac, n) = Xpdwm), (16b) Daha sonra hedef PDT'nin düsük bir bit hizi ile nasil elde edilebilecegi gösterilecektir. Bu deger (yani hata degeri (105)), bir Hann penceresini W(1) kullanarak zaman içerisinde pürüzsüzlestirilir. Uygun uzunluk, örnegin QMF alanindaki 41 örnektir (55 ms'lik bir araliga karsilik gelir). Pürüzsüzlestirme, ilgili zaman-frekans karolarinin büyüklügü ile agirliklandirilir UpdîUr, n) : Circinean'ZDpdt(k,n -l i), izi/'(1)Zmaß(k,n + 5)), ---20 S. 2 'S 20, (17) lll " burada circmean {a, b}, b degerleri ile agirlikli açisal degerler için dairesel ortalama hesaplamayi gösterir. PDT'deki &giri-(hu) pürüzsüzlestirilmis hata dogrudan kopyalama SBR'yi kullanarak QMF alanindaki viyolon sinyali için Sekil 17'de gösterilmistir. Renk gradyeni, kirmizi = n'den mavi = -n'ye kadar olan faz degerlerini belirtir. Ardindan, istenilen PDT'yi elde etmek için faz spektrumunu modifiye etmek amaciyla bir modülatör matrisi olusturulur ..yha L › ndGpm" Au . &asçi; 3 ,4 Sekil 18a, düzeltilmis SBR için QMF alanindaki viyolon sinyalinin zaman içerisindeki faz türevindeki (PDT) Üsm UNO hatayi göstermektedir. Sekil 18b, zaman içerisindeki ilgili faz türevini göstermekte olup Ãfh (1971)' burada Sekil 18a'da gösterilen PDT'deki hata, Sekil 18b'de gösterilen sonuçlar ile Sekil 1221'da gösterilen sonuçlarin kiyaslanmasi ile türevlendi. Yeniden renk gradyeni, kirmizi = n'den mavi = -Jt'ye kadar olan faz degerlerini belirtir. PDT, düzeltilmis faz spektrumu Zch (kmiçin hesaplanir (bakiniz Sekil 18b). Düzeltilmis faz spektrumunun PDT'sinin orijinal sinyal kanalinin PDT'sini hatirlattigi görülebilir (bakiniz Sekil 12] ve hata, önemli enerji ihtiva eden zaman frekansli karolar için küçüktür (bakiniz Sekil 18a). Düzeltilmemis SBR verilerinin uyumsuzlugunun büyük ölçüde giderildigi fark edilebilir. Dahasi, algoritmanin önemli yapilar olusturdugu görülmemektedir. Bir hedef PDT olarak XpdtUçn] kullanildiginda, her bir zaman-frekans karosu için PDT-hata 9:11 (kin) degerlerini iletmesi muhtemel görünmektedir. Hedef PDT'yi hesaplamak için iletim için bant genisligi azaltilacak bir baska yaklasim, bölüm 9'da gösterilmistir. Diger düzenlemelerde ses islemcisi (50), bir dekoderin (110] bir parçasi olabilir. Bu nedenle, bir ses sinyalini (55) çözmek için dekoder (110) ses islemcisini (50), bir çekirdek dekoderi (115) ve bir yama Olusturucuyu (120] ihtiva edebilir. Çekirdek dekoder (115), ses sinyaline (55) göre azaltilmis sayida alt bant ile bir zaman çerçevesindeki (75) bir ses sinyalini (25) kod çözmek için konfigüre etmistir. Yama Olusturucu, azaltilmis sayida alt bant ile çekirdek kodu çözülmüs ses sinyalinin (25) bazi alt bantlarini [95) yaymakta olup burada alt bant seti, normal sayida alt bant ile bir ses sinyali (55) elde etmek için, azalan alt bant sayisina bitisik zaman çerçevesindeki [75) alt bantlarda birinci yamayi (30a) olusturur. Ek olarak, ses islemcisi (50), bir hedef fonksiyona (85) göre birinci yamanin ve 16'ya göre açiklanmistir. Düzenlemelere göre ses islemcisi, faz düzeltmesini gerçeklestirir. Düzenlemelere bagli olarak, ses islemcisi ayrica, yamaya BWE veya SBR parametrelerini uygulayan bir bant genisligi uzatma parametresi uygulayicisi (125) tarafindan ses sinyalinin bir büyüklük düzeltmesini ihtiva edebilir. Ayrica, ses islemcisi, bir sentezleyiciyi (100), örnegin düzgün bir ses dosyasi elde etmek için ses sinyalinin alt bantlarini birlestirmek, yani sentezlemek için bir sentez filtre bankasi, ihtiva edebilir. Diger düzenlemelere göre, yama Olusturucu (120), ses sinyalinin (25) bir alt bant setini (95) yamasi için konfigüre edilmis olup burada alt bantlar takimi, birinci yamaya bitisik olan zaman çerçevesinin alt bantlarina, ikinci bir yama olusturur ve burada ses islemcisi (50), ikinci yamanin alt bantlari içindeki fazin (45) düzeltilmesi için konfigüre edilmistir. Alternatif olarak, yama Olusturucu (120), düzeltilmis birinci yamanin birinci yamaya bitisik olan zaman çerçevesinin diger alt bantlarina yamasi için konfigüre edilmistir. Baska bir deyisle, birinci seçenekte, yama Olusturucu, ses sinyalinin iletilen kismindan düzenli bir alt bant sayisina sahip bir ses sinyali olusturur ve daha sonra ses sinyalinin her bir yamasinin fazlari düzeltilir. Ikinci seçenek, ilk olarak ses bandinin iletilen kismina göre birinci yamanin birinci asamalarini düzeltir ve daha sonra ses sinyalini daha önce düzeltilmis birinci yama ile normal alt bantlarin sayisi ile olusturur. Diger düzenlemeler, ses sinyalinin (55) mevcut zaman çerçevesinin (75) bir temel frekansini (114) bir veri akisindan (135) çikartmak için yapilandirilmis bir veri akisi ekstraktörünü (130) ihtiva eden dekoder [110) göstermekte olup burada veri akisi ayrica azaltilmis alt bant sayisi ile kodlanmis ses sinyalini (145) ihtiva eder. Alternatif olarak, dekoder, temel frekansi (140) hesaplamak için çekirdek kodu çözülmüs ses sinyalini (25) analiz etmek için konfigüre edilmis bir temel frekans analizörü (150) ihtiva edebilir. Baska bir deyisle, temel frekansin [140) türevlenmesi için seçenekler, örnegin, dekoderdeki veya kodlayicidaki ses sinyalinin bir analizidir, bu durumda, degerin kodlayicidan dekodere iletilmesi gerektiginden ötürü ikinci durumda, temel frekans, daha yüksek bir bit hizi açisindan göre daha dogru olabilir. Sekil 20, bir ses sinyalini (55] kodlamak için bir kodlayiciyi (155] göstermektedir. Kodlayici, ses sinyalini (55] kodlayan bir çekirdek kodlanmis ses sinyalini (145] elde etmek için ses sinyalini (55] kodlayan çekirdek için bir çekirdek kodlayiciyi (160] ihtiva eder ve ses sinyali 55'i analiz etmek için bir temel frekans analizörünü (175] veya ses sinyalinin temel bir frekans tahmini elde etmek için ses sinyalinin (55] düsük geçisli bir filtrelenmis versiyonunu ihtiva eder. Ayrica kodlayici, çekirdek kodlanmis ses sinyalinde (145] bulunmayan ses sinyalinin (55] alt bantlarinin parametrelerini ayiklamak için bir parametre ekstraktörü (165] ihtiva eder ve kodlayici, çekirdek kodlanmis ses sinyalini (145], parametreleri ve temel frekans tahminini ihtiva eden bir çikis sinyalini (135) olusturmak için bir çikis sinyali Olusturucu (170] ihtiva eder. Bu düzenlemede, kodlayici (155], çekirdek dekoderinin (160] önünde bir düsük geçis filtresi ve parametre ekstraktörünün (165] önünde bir yüksek geçis filtresini (185] ihtiva edebilir. Diger düzenlemelere göre, çikis sinyali Olusturucu (170), bir dizi çerçeveye sinyal (135] çikisini olusturmak için konfigüre edilmis olup, burada her bir çerçeve, çekirdek kodlanmis bir sinyali (145] parametreleri (190] ve burada temel frekans tahminini (140] ihtiva eden her bir n. çerçeveyi ihtiva eder ve burada n 2 2*dir. Düzenlemelerde, çekirdek kodlayici (160], örnegin bir AAC (Gelismis Ses Kodlama] kodlayicisi olabilir. Alternatif bir düzenlemede, ses sinyalini (55) kodlamak için bir akilli bosluk doldurma kodlayicisi kullanilabilir. Bu nedenle, çekirdek kodlayici tam bir bant genisligi ses sinyalini kodlar, burada ses sinyalinin en az bir alt bandi disarida birakilir. Bu nedenle, parametre ekstraktörü (165), çekirdek kodlayicinin (160] kodlama isleminden disarida birakilan alt bantlarin yeniden konfigüre edilmesi için parametreleri ekstrakte eder. Sekil 21, çikis sinyalinin (135] sematik bir gösterimini göstermektedir. Çikis sinyali, orijinal ses sinyaline (55] göre azaltilmis sayida alt bant ihtiva eden bir çekirdek kodlanmis ses sinyali (145], çekirdek kodlanmis ses sinyaline (145] dahil edilmeyen ses sinyalinin alt bantlarini temsil eden bir parametre (190] ve bir ses sinyalinin (135] temel frekans tahmini (140] veya orijinal ses sinyali (55] ihtiva eden bir ses sinyalidir. Sekil 22 ses sinyalinin (135] bir uygulamasini göstermektedir, burada ses sinyali bir çerçeve (195) dizisi halinde olusturulmakta olup burada her bir çerçeve (195) çekirdek kodlanmis ses sinyalini (145), parametrelerini (190) ihtiva etmektedir ve burada sadece her bir n. çerçeve (195) temel frekans tahminini (140] ihtiva eder ve burada n 2 2'dir. Bu; örnegin her 20. çerçeve için esit aralikli temel frekans tahmini iletimini açiklayabilir veya burada temel frekans tahmini, örnegin talep veya amaç üzerine düzensiz olarak iletilir. Sekil 23, "bir ses sinyali fazi ölçüm hesaplayici ile bir zaman çerçevesi için bir ses sinyalinin bir faz ölçümünün hesaplanmasi" asamasi (2305), "bir hedef faz ölçüm belirleyici ile bahsedilen zaman çerçevesi için bir hedef faz ölçümünün belirlenmesi" asamasi (2310) ve "islenmis bir ses sinyali elde etmek için hesaplanan faz ölçümü ve hedef faz ölçümü kullanilarak bir faz düzelticisi ile zaman çerçevesi için ses sinyalinin fazlarinin düzeltilmesi" asamasini (2315) ihtiva eden bir ses sinyalinin islenmesi için bir yöntemi (2 3 00) göstermektedir. Sekil 24, "ses sinyaline göre azalan alt bant sayisi ile bir zaman çerçevesinde bir ses sinyalinin kodunun çözülmesi" asamasini (2405), "kodu çözülmüs bir ses sinyalinin bir alt bant setine azaltilmis bir alt bant sayisi ile yama yapma, burada alt bantlar seti, alt bantlarin sayisinin azaltilmasi için bitisik, alt bantlarin düzenli bir sayisi ile bir ses sinyalinin elde edilmesi için, zaman çerçevesindeki alt bantlarin dahasina bir birinci yamayi olusturur" asamasini (2410) ve "ses islemcisiyle bir hedef fonksiyona göre birinci yamanin alt bantlari içindeki fazlarin düzeltilmesi" asamasini (2415) ihtiva eden sekilde bir ses sinyalinin kodunun çözülmesi için bir yöntemi (2400) göstermektedir. Sekil 25, "ses sinyaline göre azaltilmis sayida alt banta sahip olan bir çekirdek kodlanmis ses sinyali elde etmek için bir çekirdek kodlayici ile ses sinyalinin çekirdek kodlamasi" asamasi (2505), "ses sinyalinin temel frekans tahmininin elde edilmesi için temel bir frekans analizörü ile ses sinyalinin veya düsük sinyaldeki filtrelenmis versiyonun analiz edilmesi" asamasi (2510) ve "çekirdek kodlanmis ses sinyaline dahil edilmeyen ses sinyalinin alt bantlarinin parametrelerinin bir parametre ekstraktörü ile ekstre edilmesi" asamasi (2515) ve "çekirdek kodlanmis ses sinyalini, parametreleri ve bir çikis sinyali Olusturucu ile temel frekans tahminini ihtiva eden bir çikis sinyali olusturma" asamasi (2520) ile bir ses sinyalinin kodlanmasi için bir yöntemi (2500) göstermektedir. çalistirilmasi durumunda yöntemleri gerçeklestirmek için bir bilgisayar programinin bir 8.2 Temporal hatalarinin düzeltilmesi - Dikey faz türevi düzeltmesi Daha önce tartisildigi üzere insanlar, harmonikler frekans üzerinden senkronize edilirse ve temel frekans düsükse, bir harmonigin temporal konumundaki bir hatayi algilayabilirler. Bölüm S'te, QMF alaninda frekans üzerinden faz türevi sabit ise harmoniklerin senkronize edildigi gösterilmistir. Bu nedenle, her frekans bandinda en az bir harmonik olmasi avantajlidir. Aksi takdirde, 'bos' frekans bantlari rastgele fazlara sahip olacak ve bu da ölçümü bozacaktir. Neyse ki, insanlar sadece temel frekans düsük oldugunda harmoniklerin temporal konumuna duyarlidir (bakiniz Bölüm 7). Bu nedenle, faz türevini asiri frekans, harmoniklerin temporal hareketlerinden kaynaklanan algisal olarak anlamli etkilerin saptanmasi için bir ölçü olarak kullanilabilir. Sekil 26, bir ses sinyalini (55) islemek için bir ses islemcisinin (50') sematik bir blok diyagramini göstermekte olup burada ses islemcisi (50') bir hedef faz ölçüm belirleyicisi (65'), bir faz hatasi hesaplayicisi (200] ve bir faz düzelticisini [70'] ihtiva eder. Hedef faz ölçüm belirleyicisi (65'), zaman çerçevesindeki [75] ses sinyali (55) için bir hedef faz ölçümünü [85'] belirler. Faz hatasi hesaplayicisi [200], zaman çerçevesindeki [75] ses sinyalinin (55) bir fazini ve hedef faz ölçümünü (85') kullanarak bir faz hatasini (105') hesaplar. Faz düzelticisi (70'), islenen ses sinyalini (90') olusturan faz hatasini [105') kullanarak zaman çerçevesindeki ses sinyalinin [55) fazini düzeltir. Sekil 27, diger bir tercih edilen düzenlemeye göre ses islemcisinin (50') sematik bir blok diyagramini göstermektedir. Bu nedenle ses sinyali [55) zaman çerçevesi [75) için çok sayida alt bant sinyallerini [95) ihtiva eder. Buna göre, hedef faz belirleyicisi (65'), birinci alt-bant sinyali [95a) için bir birinci hedef faz ölçümü (8551') ve ikinci alt-bant sinyali [95b) için bir ikinci faz ölçümü [85b') belirlemek için konfigüre edilmistir. Faz hatasi hesaplayicisi [200), faz hatasinin (105'] bir vektörünü olusturmakta olup burada vektörün bir birinci elemani, birinci alt-bant sinyalinin (95] ve birinci hedef faz ölçümü [85a') fazinin bir birinci sapmasi [105a'] ile ilgilidir ve burada vektörün ikinci elemani, ikinci alt- bant sinyalinin (95b) ve ikinci hedef faz ölçer [85b`] fazinin ikinci bir sapmasi (105b') anlamina gelir. Ayrica, ses islemcisi (50') düzeltilmis birinci alt bant sinyalini (90a') ve düzeltilmis ikinci alt bant sinyalini (90b') kullanarak düzeltilmis ses sinyalini (90') sentezlemek için bir ses sinyali sentezleyicisi (100) ihtiva eder. Diger düzenlemelere iliskin olarak, çok sayida alt-bant (95), bir taban bandi (30) ve bir dizi frekans yamasi (40) olarak gruplandirilir, bu taban bant (30), ses sinyalinin (55) bir alt bantini (95) ihtiva eder ve frekans yamalari seti [40), taban bandin en azindan bir alt bandinin frekansindan daha yüksek bir frekansta, taban bandinin (30) en azindan bir alt bandini (95) ihtiva etmektedir. Ses sinyalinin yamasinin Sekil 3'e göre daha önce tarif edilmis oldugu ve bu nedenle tarifnamenin bu bölümünde detayli olarak tarif edilemeyecegi belirtilmelidir. Frekans yamalarinin (40), bir edinim faktörü ile çarpilan daha yüksek frekanslara kopyalanan ham taban bant sinyali olabilecegi belirtilmeli olup burada faz düzeltmesi uygulanabilir. Ayrica, tercih edilen bir düzenlemeye göre, edinimin çogalmasi ve faz düzeltmesi, ham taban bant sinyalinin fazlarinin, edinim faktörü ile çarpilmadan önce daha yüksek frekanslara kopyalanacagi sekilde degistirilebilir. Yapilanma ayrica faz hata hesaplayicisini (200), bir ortalama faz hatasi (105 ") elde etmek Için frekans yamalari setinin (40) bir birinci yamasina (40a) atifta bulunan faz hatalarinin (105') bir vektörünün bir elemanlarinin hesaplanmasini göstermektedir. Ayrica, bir ses sinyali fazi türev hesaplayicisi (210), taban bandi (30) için frekans (215) üzerinde bir ortalama faz türevlerinin hesaplanmasi için gösterilmistir. Sekil 28a, bir blok diyagraminda faz düzelticisinin (70') daha detayli bir açiklamasini göstermektedir. Sekil 28a'nin üstündeki faz düzelticisi (70'), alt bant sinyallerinin (95) bir fazini, frekans yamalari setinin birinci ve sonraki frekans yamalarinda (40) düzeltmek için konfigüre edilmistir. Sekil 28a'daki düzenlemede, alt-bantlarin (95c ve 95d) yamaya (40a) ve alt bantlara (95e ve 95f) frekans yamasina (40b) ait oldugu gösterilmektedir. Fazlar, agirlikli bir ortalama faz hatasi kullanilarak düzeltilmekte olup burada ortalama faz hatasi (105), degistirilmis bir yama sinyali (40') elde etmek için frekans yamasinin (40) bir endeksine göre agirliklandirilir. Diger bir düzenleme, Sekil 28a'nin alt kisminda gösterilmistir. Faz düzelticisinin (70') sol üst kösesinde, yamalardan [40) ve ortalama faz hatasindan (105") modifiye edilmis yama sinyalinin (40`) elde edilmesi için halihazirda açiklanan düzenleme gösterilmistir. Ayrica faz düzeltici (70'), ses sinyalinin (55) taban bandindan (30) en yüksek alt bant indeksi ile alt bant sinyalinin fazina, mevcut bir alt bant indeksi ile agirlikli hale gelen sekilde frekans üzerinde faz türevlerinin (215) ortalamasini ekleyerek optimize edilen birinci frekans yamasi ile diger bir modifiye edilmis yama sinyali (40") ihtiva eden bir baslatma asamasinda hesaplama yapar. Bu baslangiç asamasi için anahtar (220a), soldaki konumdadir. Herhangi bir baska islem asamasi için anahtar, dikey olarak yönlendirilmis bir baglanti olusturan diger konumda olacaktir. Bir baska düzenlemede, ses sinyali fazi hesaplayicisi (210), alt bant sinyalindeki (95) geçis durumlari tespit etmek için taban bant sinyalinden (30) daha yüksek frekanslar ihtiva eden çok sayida alt bant sinyali için frekans (215) üzerinde bir ortalama faz türevlerinin hesaplanmasi için konfigüre edilmistir. Geçici düzeltmenin, ses islemcisinin (50') dikey faz düzeltmesine benzer oldugu ve taban bandindaki (30) frekanslarin bir geçisin daha yüksek frekanslarini yansitmadigi farkina dikkat edilmelidir. Bu nedenle, bir geçici durumun faz düzeltmesi için bu frekanslar dikkate alinmalidir. Baslatma asamasindan sonra, faz düzeltmesi (70'), önceki frekans yamasinda en yüksek alt bant indeksi ile alt bant sinyalinin fazina, mevcut alt bandin (95) alt bant indeksi tarafindan agirlikli hale getirilen sekilde frekans üzerinde faz türevlerinin (215) ortalamasini ekleyerek baska bir modifiye edilmis yama sinyalinin (40"), frekans yamalarina (40) dayanarak güncellenmesi için konfigüre edilir. Tercih edilen düzenleme, faz düzelticinin (70'), modifiye edilmis yama sinyalinin (40') bir agirlikli ortalamasini ve birlestirilmis degistirilmis bir yama sinyali (40") elde etmek için daha sonra degistirilmis yama sinyalini (40") hesapladigi daha önce tarif edilen düzenlemelerin bir kombinasyonudur. Bu nedenle faz düzeltici (70'), tekrarli olarak frekans yamalarina (40) alt bant indeksi ile alt bant sinyalinin fazina mevcut alt bandinin (95) alt bant indeksi tarafindan agirlikli hale getirilen, frekans üzerinde faz türevlerinin (215) ortalamasinin eklenmesi ile kombine bir modifiye edilmis yama sinyalini (40 ) günceller. Kombine modifiye edilmis yamalari (40a"', 40b"', vb.) elde etmek için, anahtar (220b), birinci tekrardan sonra ve müteakiben, kombine modifiye yamaya (40b"') geçerek baslangiç asamasi için kombine degistirilmis (48") asamada baslayarak her bir yineleme sonrasinda Ayrica, faz düzelticisi (70'), ikinci spesifik agirliklandirma fonksiyonu ile agirlikli hale getirilen mevcut frekans yamasinda modifiye edilmis yama sinyali (40") ve birinci spesifik agirliklandirma fonksiyonu ile agirlikli hale getirilen mevcut frekans yamada yama sinyalinin (40') dairesel ortalamasini kullanarak bir yama sinyalinin (40') ve modifiye edilmis yama sinyalinin (40") agirlikli ortalamasini hesaplayabilir. Ses islemcisi (50) ve ses islemcisi (50') arasinda bir birlikte çalisabilirlik saglamak için, faz düzelticisi (70') faz sapmalarinin bir vektörünü olusturmakta olup burada, faz sapmalari, birlestirilmis bir degistirilmis yama sinyali (40") ve ses sinyali (55) kullanilarak hesaplanir. Sekil 28b, baska bir bakis açisindan faz düzeltmesinin asamalarini göstermektedir. Bir birinci zaman çerçevesi (75a) için, yama sinyali (40'), ses sinyalinin (55) yamalari üzerinde birinci faz düzeltme modunun uygulanmasiyla elde edilir. Yama sinyali (40'), modifiye edilmis yama sinyali (40") elde etmek için ikinci düzeltme modunun baslangiç asamasinda kullanilir. Yama sinyali (40') ve degistirilmis yama sinyali (40") bir kombinasyonu, birlestirilmis ve degistirilmis bir yama sinyali (40 ) ile sonuçlanir. Bu nedenle ikinci düzeltme modu, ikinci zaman çerçevesi (75b) için modifiye edilmis yama sinyali (40 ") elde etmek üzere kombine modifiye yama sinyali (40 ) üzerine uygulanir. Ek olarak, birinci düzeltme modu, yama sinyalinin (40') elde edilmesi için ikinci zaman çerçevesindeki (75b) ses sinyalinin (55) yamalari üzerine uygulanir. Yeniden bir yama sinyali (40') ve degistirilmis yama sinyali (40") bir kombinasyonu, birlestirilmis ve degistirilmis bir yama sinyali (40 ) ile sonuçlanir. Ikinci zaman çerçevesi için açiklanan islem semasi, üçüncü zaman çerçevesine (75c) ve buna göre ses sinyalinin (55) herhangi bir baska zaman çerçevesine uygulanir. Sekil 29, hedef faz ölçüm belirleyicisinin (65') detayli bir blok diyagramini göstermektedir. Bir düzenlemeye göre, hedef faz ölçüm belirleyicisi (65'), bir pik konumunu (230) ve ses sinyalinin (55) bir veri akisindan (135) bir mevcut zaman çerçevesinde pik konumlarin (235) temel frekansini çikarmak için bir veri akisi ekstraktörünü (130') ihtiva eder. Alternatif olarak, hedef faz ölçüm belirleyicisi (65'), mevcut zaman çerçevesindeki ses sinyalini (55) analiz etmek için bir ses sinyali analizörünü (225) ve bir pik konumu (230) ve mevcut zaman çerçevesinde pik noktasi konumlarinin (235) temel bir frekansini hesaplamak için ihtiva eder. Ek olarak, hedef faz ölçüm belirleyicisi, pik noktasi (230) ve pik konumlarin (235) temel frekansi kullanilarak mevcut zaman çerçevesinde daha fazla pik konumlari tahmin etmek için bir hedef spektrum üretecini [240) ihtiva eder. Sekil 30, Sekil 29'da tarif edilen hedef spektrum üretecinin (240) detayli bir blok diyagramini göstermektedir. Hedef spektrum üreteci (240), zaman içinde bir puls katari (265) üretmek için bir pik Olusturucuyu (245) ihtiva eder. Bir sinyal Olusturucu (250), puls katarinin frekansini, pik konumlarinin (235) temel frekansina göre ayarlar. Ayrica, bir puls konumlandirici (255), puls katarinin (265) fazini, pik konuma (230) göre ayarlar. Baska bir deyisle, sinyal Olusturucu (250), puls katarinin frekansi, ses sinyalinin (55) pik konumlarinin temel frekansina esit olacak sekilde, puls katarinin (265) rastgele bir frekans seklini degistirir. Dahasi, puls konumlandirici (255) puls katarinin fazini, puls katarinin pik noktalarindan birinin pik konumuna (230) esit olacak sekilde degistirir. Bundan sonra, bir spektrum analizör (260) ayarlanmis puls katarinin bir faz spektrumunu üretmekte olup burada, zaman alani sinyalinin faz spektrumu hedef faz ölçümü (85') olmaktadir. Sekil 31, bir ses sinyalinin (55) kodunun çözülmesi için bir dekoderin (110') sematik bir blok diyagramini göstermektedir. Dekoder (110), taban bandin bir zaman çerçevesinde bir ses sinyalinin (25) kodunu çözmek için konfigüre edilmis bir çekirdek kod çözmeyi (115) ve kodu çözülmüs taban bandin bir alt bant setini (95) yamamaya yönelik bir yama Olusturucuyu (120) ihtiva etmekte olup burada alt bant seti, taban bandindaki frekanslardan daha yüksek frekanslar ihtiva eden bir ses sinyali (3 2) elde etmek için taban çerçevesine bitisik zaman çerçevesindeki alt-bantlara bir yama olusturur. Ayrica, dekoder (110'), bir hedef faz ölçümüne göre yamanin alt bantlarinin fazlarini düzeltmek için bir ses islemcisi (50') ihtiva eder. Bir baska düzenlemeye göre, yama Olusturucu (120) ses sinyalinin (25) alt bantlarini (95) yayacak sekilde konfigüre edilmekte olup burada alt bantlarin bir seti, yamaya bitisik olan zaman çerçevesinin diger alt bantlarinda bir baska yama olusturur ve burada ses islemcisi (50'), diger yamanin alt bantlari içindeki fazlari düzeltmek için konfigüre edilmistir. Alternatif olarak, yama Olusturucu (120), düzeltilmis yamanin, yamaya bitisik olan zaman çerçevesinin diger alt bantlarina yamanmasi için konfigüre edilmistir. Bir baska düzenleme, bir geçis sinyalini ihtiva eden bir ses sinyalinin kodunu çözmek için bir dekoder ile ilgili olup burada ses islemcisi (50'), geçis fazini düzeltmek için konfigüre edilmistir. Geçici isleme, bölüm 8.4'teki diger bir kelime ile açiklanmistir. Bu nedenle, dekoder (110), bir frekansin bir baska faz türeVini almak ve alinan faz türevini veya frekansini kullanarak ses sinyalindeki (32) geçisleri düzeltmek için bir baska ses islemcisini (50') ihtiva eder. Ayrica, Sekil 31'deki dekoderin (110'), Sekil 19'daki dekodere (110) benzer oldugu, öyle ki, ana elemanlara iliskin açiklamanin, ses islemcilerindeki (50 ve 50'] farkla iliskili olmayan durumlarda karsilikli olarak degistirilebilecegi belirtilmelidir. Sekil 32, bir ses sinyalini (55] kodlamak için bir kodlayiciyi (155') göstermektedir. Kodlayici (155') bir çekirdek kodlayici (160), bir temel frekans analizörü (175'), bir parametre ekstraktörü (165] ve bir çikis sinyali tertibati (170) ihtiva eder. Çekirdek kodlayici (160], ses sinyaline (55) göre azaltilmis sayida alt bant ihtiva eden bir çekirdek kodlanmis ses sinyali (145) elde etmek için ses sinyalini (55) kodlayan çekirdek için konfigüre edilmistir. Temel frekans analizörü (175'), ses sinyali (55) içindeki pik konumlari (230] veya ses sinyalinde pik konumlarinin (235) temel bir frekans tahminini elde etmek için ses sinyalinin düsük geçisli bir filtrelenmis versiyonunu analiz eder. Ayrica parametre ekstraktörü (165), çekirdek kodlanmis ses sinyaline (145) dahil edilmemis olan ses sinyalinin (55) alt bantlarinin parametrelerini (190] ekstrakte eder ve çikis sinyali olusturucusu (170); pik konumlardan (230) biri, pik konumlarin (235) temel frekansi, parametreler (190] ve çekirdek kodlanmis ses sinyalini (145] ihtiva eden bir çikis sinyalini (135) ihtiva eder. Düzenlemelere göre çikis sinyali olusturucusu (170); bir çerçeve dizisine bir çikis sinyalini (13 5) olusturmak için konfigüre edilmis olup burada her bir çerçeve, çekirdek kodlanmis ses sinyalini (145), parametreleri [190] ihtiva eder ve burada her bir n. çerçeve, pik konumlarin (235) temel frekans tahminini ve pik konumu (230) ihtiva eder ve burada n 2 2'dir. Sekil 33, orijinal ses sinyaline (55) iliskin olarak alt hantlarin azalan sayisini ihtiva eden bir çekirdek kodlanmis ses sinyalini (145], çekirdek kodlanmis ses sinyaline dahil olmayan ses sinyalinin alt bantlarini temsil eden parametreyi (190], pik konumlarin (2 35) temel frekans tahminini ve ses sinyalinin (55) bir pik konum tahminini (230) ihtiva eden bir ses sinyali (135] düzenlemesini göstermektedir. Alternatif olarak ses sinyali (135); bir çerçeve dizisi içerisinde olusturulmus olup burada her bir çerçeve, çekirdek kodlanmis ses sinyalini (145], parametreleri (190] ihtiva eder ve burada her bir n. çerçeve, pik konumlarin (235] temel frekans tahminini ve pik konumu (230) ihtiva eder ve burada n 2 2'dir. Bu fikir, Sekil 22'ye göre daha önce tarif edilmistir. Sekil 34, bir ses islemcisiyle bir ses sinyalini islemek için bir yöntemi (3400] göstermektedir. Yöntem (3400); "hedef faz ölçümü olan bir zaman çerçevesi içinde ses sinyali için bir hedef faz ölçümünün belirlenmesi" asamasi (3405), "zaman çerçevesindeki ses sinyalinin fazini ve hedef faz ölçümünü kullanarak bir faz hatasi hesaplayicisi ile bir faz hatasi hesaplamasi" asamasini (3410] ve "zaman çerçevesindeki ses sinyalinin fazinin faz hatasi kullanilarak düzeltilen bir faz ile düzeltilmesi" asamasini (3415] ihtiva eder. Sekil 35, bir ses dekoder, bir ses sinyalini islemek için bir yöntemi (3500) göstermektedir. Yöntem (3500]: "bir çekirdek dekodere sahip bir taban bandin zaman çerçevesinde bir ses sinyalinin kodunu çözme" asamasini (3505), "kodu çözülmüs taban bandin bir yama Olusturucu ile bir alt bant setinin yama yapilmasi, burada alt bantlar seti, taban bandindaki frekanslardan daha yüksek frekanslar ihtiva eden bir ses sinyali elde etmek için taban bandina bitisik zaman çerçevesindeki alt bantlarda ayrica bir yama olusturur" asamasini (3510] ve "bir hedef faz ölçümüne göre bir ses islemcisi ile birinci yamanin alt bantlari ile fazlarin düzeltilmesi" asamasini (3515) ihtiva eder. Sekil 36, bir ses kodlayici ile bir ses sinyalini kodlamak için bir yöntemi (3600) göstermektedir. Yöntem (3600], "ses sinyaline göre azaltilmis sayida alt bant ihtiva eden bir çekirdek kodlanmis ses sinyali elde etmek için bir çekirdek kodlayici ile ses sinyalinin çekirdek kodlamasi" asamasini (3605), "ses sinyalindeki pik konumlarinin temel bir frekans tahminini elde etmek için ses sinyalinin veya düsük sinyalden filtrelenmis bir ses sinyali versiyonunun temel bir frekans analizörü ile analiz edilmesi" asamasini (3610), parametrelerinin bir parametre ekstraktörü ile ekstrakte edilmesi" asamasini (3615] ve konumunu ihtiva eden bir çikis sinyali Olusturucusu ile bir çikis sinyali olusturma" asamasini (3620) ihtiva eder. Baska bir deyisle, harmoniklerin temporal konumlarindaki hatalari düzeltmek için önerilen algoritma asagida belirtilen sekilde çalisir. Birincisi, hedef sinyalin ve SBR islenmis sinyalin faz spektrumlari arasindaki fark (Aw W **7 ve 29"] hesaplanir DQ'IJiiwi) -: Ü"(k, n; w 1.; (ip, 12;, (20a) bu da Sekil 37'da gösterilmistir. Sekil 37, dogrudan kopyalama SBR kullanilarak QMF alanindaki trombon sinyalinin faz spektrumunda DPhaUçn) fazindaki hatayi göstermektedir. Bu noktada, hedef faz spektrumu, giris sinyalinin girisininkine esit oldugu varsayilabilir ZâhaUßn) = xphauan) (ZOb) Daha sonra hedef faz spektrumunun düsük bir bit hizi ile nasil elde edilebilecegi gösterilecektir. Dikey faz türev düzeltmesi, iki yöntem kullanilarak gerçeklestirilir ve son düzeltilmis faz spektrumu da bunlarin bir karisimi olarak elde edilir. Birincisi, hatanin frekans bandinda nispeten sabit oldugu ve yeni bir frekans bandina girilirken hatanin yeni bir degere siçradigi görülebilir. Bu, anlamli bir durumdur çünkü faz, orijinal sinyaldeki tüm frekanslarda sabit bir frekans üzerinden degismektedir. Hata, çaprazlamada olusturulur ve hata, yama içinde sabit kalir. Böylece, tüm frekans yamasi Için faz hatasini düzeltmek için tek bir deger yeterlidir. Ayrica, daha yüksek frekans yamalarinin faz hatasi, frekans yamasinin indeks numarasi ile çarpimdan sonra ayni hata degeri kullanilarak düzeltilebilir. Bu nedenle, birinci frekans yamasi için faz hatasinin dairesel ortalamasi hesaplanir Uggs; in.) -: C'il'(`,mif'anî17pi` 'iz'mijlßs : r: li (21) Faz spektrumu, bunu kullanarak düzeltilebilir Bu ham düzeltme; eger hedef PDF, örnegin frekans üzerindeki faz türevi XPdf(k,n], tüm frekanslarda tam olarak sabitse, dogru bir sonuç üretir. Bununla birlikte, Sekil 12'de görülebilecegi gibi, siklikla, frekans üzerinde hafif bir dalgalanma söz konusudur. Böylelikle, üretilen PDF'deki herhangi bir süreksizligin önlenmesi için çapraz geçislerde gelistirilmis islem kullanilarak daha iyi sonuçlar elde edilebilir. Baska bir deyisle, bu düzeltme ortalama olarak PDF için dogru degerler üretir, ancak frekans yamalarinin çapraz frekanslarinda küçük süreksizlikler olabilir. Bunu Önlemek için düzeltme yöntemi uygulanir. Son düzeltilmis faz spektrumu, 1'c "in" i) iki düzeltme yönteminin bir karisimi olarak elde edilir. Diger düzeltme yöntemi, taban bandin PDF'sinin bir ortalamasini hesaplayarak baslar Xgâgüi) = circmean{1Yg::e(k,n)}. (23) Faz spektrumu, fazin bu ortalama deger ile degistigi, yani, W"(k, n, 1) -_- xp" (6, 31) + k -xgýgfçio, ch base ypgaom, i) = Ygýmmm - 1) + k -xW'foû, (24) varsayilarak bu ölçüt kullanilarak düzeltilebilir, buradalrpv iki düzeltme yönteminin kombine yama sinyalidir. Bu düzeltme çaprazlarinda iyi kalite saglar, ancak PDF'de daha yüksek frekanslara dogru bir sapmaya neden olabilir. Bunu önlemek için iki düzeltme yöntemi, bunlarin agirlikli dairesel ortalamasini hesaplayarak birlestirilir Kîhawc, 11,1) ::i (tii'Ltinc-an %_l'cîlîkh n, 1', C), WR (ir, 6); (25) burada c,duzeltme yontemini belirtir [ CV'i veya :va ve Wfc(k,c), agirliklandirma fonksiyonudur Ortaya Çikan faz spektrumu l'îîihçxkinil) ne süreksizliklerden ne de kaymalardan zarar görür. Orijinal spektruma ve düzeltilmis faz spektrumunun PDF'sine kiyasla hata, Sekil 38'de gösterilmektedir. Sekil 38a, faz düzeltmeli SBR sinyalini kullanarak QMF alanindaki Pt 5 . ,\ trombon sinyalinin faz spektrumundaki "cv" (fln ?Uhatayi göstermektedir, burada Sekil 38b, frekans üzerinden karsilik gelen faz türevini Zgîlfçk* 12)* göstermektedir. Hatanin, düzeltme olmadan önemli ölçüde daha küçük oldugu ve PDF'in büyük süreksizliklerden zarar görmedigi görülebilir. Belirli temporal çerçevelerde önemli hatalar vardir, ancak bu çerçeveler düsük enerjiye sahiptir [bakiniz Sekil 4), bu yüzden önemsiz algisal etkileri vardir. Kayda deger enerjiye sahip temporal çerçeveler nispeten iyi sekilde düzeltilmistir. Düzeltilmemis SBR'nin yapilarinin önemli ölçüde hafifletildigi fark edilebilir. Düzeltilmis faz spektrumu 23, (19") düzeltilmis frekans yamalarina konsantre olarak elde edilir Yâ"(k,n,i)_ Yatay düzeltme modu ile uyumlu olmasi için, dikey faz düzeltmesi bir modülatör matrisi kullanilarak da gösterilebilir [bakiniz Denklem 18) thmm) : zfgack, 71) ~ zphauc, n). (2%) 8.3 Farkli faz düzeltme yöntemleri arasinda geçis Bölümler 8.1 ve 8.2, SBR ile indüklenen faz hatalarinin, Viyolonun PDT düzeltmesini ve trombonun PDF düzeltmesini uygulayarak düzeltilebilecegini göstermistir. Bununla birlikte, düzeltmelerden hangisinin bilinmeyen bir sinyale uygulanacagini veya bunlardan herhangi birinin uygulanmasinin nasil gerektigini bilmek göz önüne alinmadi. Bu bölüm, düzeltme yönünü otomatik olarak seçmek için bir yöntem önermektedir. Düzeltme yönü Bu nedenle Sekil 39'da bir ses sinyali için faz düzeltme verilerinin belirlenmesi için bir hesaplayicinin düzenlemeleri gösterilmektedir. Varyasyon belirleyicisi (275], birinci ve ikinci bir varyasyon modunda ses sinyalinin [55) bir fazinin [45] varyasyonunu belirler. Varyasyon karsilastiricisi (280], birinci varyasyon modunu kullanarak belirlenen bir birinci varyasyonu [290a] ve ikinci varyasyon modunu kullanarak belirlenen bir ikinci varyasyonu (290b) karsilastirir ve bir düzeltme veri hesaplayicisi, bir kiyaslayicinin sonucuna dayanarak birinci varyasyon moduna veya ikinci varyasyon moduna dayali olarak faz düzeltme verisini (295] hesaplar. Ayrica, varyasyon belirleyicisi [275), birinci varyasyon modunda fazin varyasyonu [290a) olarak ses sinyalinin [55) birçok zaman çerçevesi için zamana göre bir faz türevinin (PDT) standart sapma ölçümünü belirlemek için ve ikinci varyasyon modunda fazin varyasyonu (29%) olarak ses sinyalinin (55) çok sayida alt bandi için frekans (PDF) üzerinden bir faz türevinin standart sapma ölçümünün belirlenmesi için konfigüre edilebilir. Bu nedenle, varyasyon karsilastiricisi (280), birinci degisme (290a) olarak zamana göre faz türevinin ölçümünü ve ses sinyalinin zaman çerçeveleri için ikinci bir varyasyon (290b) olarak frekans üzerinden faz türevinin ölçümünü karsilastirir. Düzenlemeler, standart sapma ölçümü olarak ses sinyalinin (55) bir önceki ve daha fazla sayida önceki zaman çerçeveleri boyunca bir faz türevinin dairesel bir standart sapmasinin saptanmasi için ve standart sapma ölçümü olarak mevcut zaman çerçevesi için ses sinyalinin (55) bir akiminin ve birden fazla gelecek çerçevesinin zaman içindeki bir faz türevinin dairesel bir standart sapmasinin belirlenmesi için varyasyon belirleyicisini (275) göstermektedir. Ayrica, varyasyon belirleyicisi (275), birinci varyasyonu (290a) belirlerken, her iki dairesel standart sapmanin minimumunu hesaplar. Bir baska düzenlemede, varyasyon belirleyicisi (275), bir frekansin ortalama bir standart sapma ölçümünü olusturmak için bir zaman çerçevesindeki (75) bir çok alt bant (95) için bir standart sapma ölçümünün bir kombinasyonu olarak birinci varyasyon modunda varyasyonu (290a) hesaplar. Varyasyon karsilastiricisi (280), bir enerji ölçümü olarak mevcut zaman çerçevesindeki (75) alt bant sinyalinin (95) büyüklük degerlerini kullanarak çok sayida alt-bandin standart sapma ölçümlerinin bir enerji agirlikli ortalamasini hesaplayarak standart sapma ölçümlerinin kombinasyonunu gerçeklestirmek için konfigüre edilmistir. Tercih edilen bir düzenlemede, varyasyon belirleyicisi (275), birinci varyasyonu (290a), zaman içerisinde, bir çok önceki ve çok sayida gelecekteki zaman çerçevesini belirlerken, ortalama standart sapma ölçümünü pürüzsüzlestirir. Pürüzsüzlestirme, bir pencereleme fonksiyonu ve ilgili zaman çerçevelerini kullanarak hesaplanan enerjiye göre agirlikli hale getirilir. Ayrica varyasyon belirleyicisi (275), ikinci varsasyon (290b) akis üzerinden, bir önceki çoklugu ve çok sayida gelecekteki zaman çerçevesini (75) belirlediginde, standart sapma ölçümünü yumusatmak üzere konfigüre edilmis olup burada pürüzsüzlestirme, bir pencereleme fonksiyonu ve ilgili zaman çerçevelerini (75) kullanarak hesaplanan enerjiye göre agirlikli hale getirilir. Bu nedenle, varyasyon karsilastiricisi (280), birinci varyasyon modu kullanilarak belirlenen birinci varyasyon (290a) olarak düzeltilmis ortalama standart sapma ölçümünü karsilastirmakta ve ikinci varyasyon modu kullanilarak belirlenen ikinci varyasyon (29%) olarak düzeltilmis standart sapma ölçümünü karsilastirmaktadir. Tercih edilen bir düzenleme, Sekil 40'ta gösterilmistir. Bu düzenlemeye göre, varyasyon belirleyicisi (275), birinci ve ikinci varyasyonu hesaplamak için iki islem yolunu ihtiva eder. Birinci islem yamasi, ses sinyalinden (55) veya ses sinyalinin fazindan (305a) zamana kadar faz türevinin standart sapma ölçümünün hesaplanmasi için bir PDT hesaplayicisini (300a) ihtiva eder. Bir dairesel standart sapma hesaplayicisi (310a), zamana (305a) göre bir faz türevinin standart sapma ölçümünden bir birinci dairesel standart sapma (3153) ve bir ikinci dairesel standart sapma (315b) belirler. Birinci ve ikinci dairesel standart sapmalar (315a ve 315b), bir karsilastirici (320) ile karsilastirilir. Karsilastirici (320), iki dairesel standart sapma ölçütünün (315a ve 315b) minimumunu (325) hesaplar. Bir birlestirici, ortalama bir standart sapma ölçütü [335a) olusturmak için frekans üzerinde minimumu (325) birlestirir. Bir pürüzsüzlestirici (340a), ortalama bir standart sapma ölçümü (345a) olusturmak için ortalama standart sapma ölçütünü (335a) pürüzsüzlestirir. Ikinci islem yolu, ses sinyalinden (55) frekansa (305b) göre bir faz türevini veya ses sinyalinin bir fazini hesaplamak için bir PDF hesaplayicisini (300b) ihtiva eder. Bir dairesel standart sapma hesaplayicisi (310b), frekans (305) üzerinde faz türevinin (335b) bir standart sapma ölçümü olusturur. Standart sapma ölçümü [305), düzgün bir standart sapma ölçümü (345b) olusturmak için daha düzgün bir pürüzsüzlestirici (340b) ile pürüzsüzlestirilir. Pürüzsüzlestirilmis ortalama standart sapma ölçümü (345a) ve pürüzsüzlestirilmis standart sapma ölçümü (345b), sirasiyla birinci ve ikinci varyasyonlardir. Varyasyon karsilastiricisi (280), birinci ve ikinci varyasyonu karsilastirir ve düzeltme verisi hesaplayicisi (285), birinci ve ikinci varyasyonun karsilastirilmasina dayanarak faz düzeltme verisini (295) hesaplar. Diger düzenlemeler, üç farkli faz düzeltme modunu ele alan hesaplayiciyi (270) göstermektedir. Sekil 41'de temsili bir blok diyagram gösterilmistir. Sekil 41, üçüncü varyasyon modunun bir geçis tespit modu oldugu üçüncü bir varyasyon modunda ses sinyalinin (55) fazinin bir üçüncü varyasyonu [290c) daha belirleyen varyasyon belirleyicisini (275) göstermektedir. Varyasyon karsilastiricisi (280), birinci varyasyon modunu, ikinci varyasyon modunu kullanarak belirlenen ikinci varyasyonu (29%) ve üçüncü varyasyonu kullanarak belirlenen üçüncü varyasyonu (290c) kullanarak belirlenen birinci varyasyonu (290a) karsilastirir. Bu nedenle, düzeltme verisi hesaplayicisi (285), karsilastirma sonucuna göre birinci düzeltme modu, ikinci düzeltme modu veya üçüncü düzeltme moduna göre faz düzeltme verilerini (295) hesaplar. Üçüncü varyasyon modundaki üçüncü varyasyonu [290c) hesaplamak için, varyasyon karsilastiricisi [280), mevcut zaman çerçevesinin anlik bir enerji tahminini ve çok sayida zaman çerçevesinin varyasyon karsilastiricisi [280), anlik enerji tahmini ile zaman-ortalama enerji tahmininin bir oranini hesaplamak için konügüre edilmistir ve bir zaman çerçevesi (75) içindeki geçici durumlari tespit etmek için tanimli bir esik ile orani karsilastirmak üzere konfigüre edilmistir. Varyasyon karsilastiricisi [280), üç varyasyonu temel alan uygun bir düzeltme modunu belirlemelidir. Bu karara dayanarak, düzeltme verisi hesaplayicisi [285), bir geçisin tespit edilmesi durumunda üçüncü bir varyasyon moduna uygun olarak faz düzeltme verisini faz düzeltme verilerini [295) hesaplar, eger bir geçisin olmadigi belirlenirse, birinci varyasyon modunda belirlenen birinci varyasyon [290a), ikinci varyasyon modundaki ikinci varyasyondan [290b) küçük veya esittir. Ayrica faz düzeltme verileri (295), ikinci varyasyon moduna göre hesaplanir, eger bir geçisin olmadigi belirlenirse, ikinci varyasyon modunda belirlenen ikinci varyasyon [290b), birinci varyasyon modundaki birinci varyasyondan [290a] küçük veya esittir. Düzeltme verileri hesaplayicisi, bir akim, bir veya daha fazla geçmis ve bir veya daha fazla gelecek zaman çerçevesi için üçüncü varyasyon (290c) için faz düzeltme verisinin [295) hesaplanmasi için konfigüre edilmistir. Buna göre, düzeltme verileri hesaplayicisi [285), bir akim, bir veya daha fazla geçmis ve bir veya daha fazla gelecek zaman çerçevesi için ikinci varyasyon modu (290b) için faz düzeltme verisinin [295) hesaplanmasi için konfigüre edilir. Ayrica, düzeltme verisi hesaplayicisi [285), bir yatay faz düzeltmesi ve birinci varyasyon modu için düzeltme verilerini [295) hesaplamak üzere yapilandirilir, Ikinci varyasyon modunda dikey faz düzeltmesi için düzeltme verisi [295) hesaplanir ve üçüncü varyasyon modunda bir geçici düzeltme için düzeltme verisi (295) hesaplanir. Sekil 42, bir ses sinyalinden faz düzeltme verisini belirlemek için bir yöntem [4200) göstermektedir. Yöntem (4200), "bir birinci ve bir ikinci varyasyon modunda bir varyasyon belirleyicisiyle ses sinyalinin bir fazinin bir varyasyonunu belirleme" asamasini belirlenen varyasyonu karsilastirma" asamasini [4210) ve "karsilastirma sonucunda birinci varyasyon moduna veya ikinci varyasyon moduna göre bir düzeltme verisi hesaplayicisi ile faz düzeltmesinin hesaplanmasi" asamasini (4215] ihtiva eder. Baska bir deyisle, viyolonun PDT'si zaman içerisinde pürüzsüz hale gelir öyle ki trombonun PDF'i frekans üzerinde pürüzsüzdür. Bu nedenle, bu ölçümlerin varyasyon ölçütü olarak standart sapmasi (STD) uygun düzeltme yöntemini seçmek için kullanilabilir. Zaman içerisindeki faz türevinin STD'si, XStdUUQn) : circstd{XPdt(k,n + l)},-23 S 1 S 0, XStdtz (Imi) = Circstd{Xpdt(k, n + i)},0 5 i 5 23, Xswtütm) : inin{X5tdn(k, 21.),X5tdt7 (lc, n)}, ve frekans üzerinde faz türevinin STD'si asagidaki sekilde hesaplanabilir X"dfm) : circstd{XPdr(k,n)}, 2 S 1( S '13, (28) burada circstd{}, dairesel STD'nin hesaplanmasini belirtir (gürültülü düsük enerjili kutulara bagli yüksek STD'yi önlemek için açi degerleri potansiyel olarak enerji ile agirliklandirilabilir veya STD hesaplamasi yeterli enerjiye sahip olan sekilde kutular ile sinirlandirilabilir]. Viyolon ve trombon için STD`1er, sirasiyla Sekiller 43a, 43b ve Sekiller 43c, 43d içinde gösterilmistir. Sekiller 43a ve c, QMF alaninda zaman içerisinde faz türevinin standart sapmasini XstdtUgn) göstermekte olup burada Sekiller 43b ve 43d, faz düzeltmesi olmadan frekans üzerinde ilgili standart sapmayi Xstdf(n] göstermektedir. Renk gradyeni, kirmizi =1'den mavi = O'a kadar olan degerleri belirtir. PDT'nin STD'sinin Viyolon Için daha düsük oldugu, Öyle ki PDF'nin STD'sinin trombon için daha düsük oldugu görülebilir (özellikle yüksek enerjiye sahip zaman-frekans karolari için). Her bir temporal çerçeve için kullanilan düzeltme yöntemi, STD'lerin hangisinin daha düsük olduguna göre seçilir. Bu nedenle XsîdtUçn) degerleri, frekans üzerinde birlestirilmelidir. Birlestirme, önceden tanimlanmis bir frekans araligi için enerji agirlikli bir ortalama hesaplanarak gerçeklestirilir (1.29) Sapma tahminleri, düzgün bir geçise sahip olmak ve böylece olasi yapilari önlemek için zaman içinde pürüzsüzlestirilir. Pürüzsüzlestirme, bir Hann penceresi kullanilarak gerçeklestirilir ve temporal çerçevenin enerjisi ile agirlikli hale getirilir burada WH), pencere fonksiyonudur ve Xmagûl) " ZI=1X rragUcl n) frekans üzerinde XmagUçn] toplamidir. Ilgili bir denklem, Xstdf(n] pürüzsüzlestirilmesi için kullanilir. Faz düzeltme yöntemi XssrlistÇÜ-.l ve Xsm (lt) ile kiyaslanmasi ile belirlenir. Varsayilan yontem, PDT (yatay) düzeltmesidir ve eger Ã-îsm (71) < ÂSêâTTI), [n - 5, n + 5] araligi için PDF (dikey) düzeltmesi uygulanir. Her iki sapma büyükse, örnegin önceden tanimlanmis bir esik degerinden daha büyük olan, düzeltme yöntemlerinin hiçbiri uygulanmaz ve bit hizi kurtarmalari yapilabilir. 8.4 Geçici isleme - Geçisler için faz türevi düzeltmesi Ortasinda eklenen bir alkis ile viyolon sinyali, Sekil 44'te gösterilmistir. QMF alaninda viyolon + alkis sinyalinin büyüklügü XmagUçn), Sekil 44a'da gösterilmistir ve ilgili faz spektrumu XPhaUçn), Sekil 44b'de gösterilmistir. Sekil 44a'ya istinaden renk gradyeni, kirmizi = 0 dB'den mavi = -80 dB'ye kadar olan büyüklük degerlerini belirtir. Ardindan Sekil 44b için faz gradyeni, kirmizi = n'den mavi = -n'ye kadar olan faz degerlerini belirtir. Zaman içerisindeki ve frekans üzerindeki faz türevleri, Sekil 45'te gösterilmistir. QMF alaninda viyolon + alkis sinyalinin zaman içerisindeki faz türevi Xpdt[k,n), Sekil 45alda gösterilmistir ve frekans üzerindeki ilgili faz türevi Xpdf[k,n), Sekil 45b'de gösterilmistir. Renk gradyeni, kirmizi = n'den mavi = -n'ye kadar olan faz degerlerini belirtir. PDT'nin alkis için gürültülü oldugu görülebilir, ancak PDF yüksek frekanslarda en azindan biraz pürüzsüzdür. Bu nedenle, keskinligini korumak için alkis için bir PDF düzeltmesi uygulanmalidir. Bununla birlikte, Bölüm 8.2'de önerilen düzeltme yöntemi bu sinyalle birlikte düzgün sekilde çalismayabilir, çünkü viyolon sesi düsük frekanslarda türevleri bozmaktadir. Sonuç olarak, taban bandin faz spektrumu yüksek frekanslari yansitmaz ve böylece tek bir deger kullanan frekans yamalarinin faz düzeltmesi çalismayabilir. Ayrica, düsük frekanslarda gürültülü PDF degerleri nedeniyle, PDF degerinin varyasyonuna bagli olarak geçislerin tespit edilmesi [bakiniz Bölüm 8.3] zor olacaktir. Problemin çözümü hasittir. Ilk olarak, geçisler basit bir enerji tabanli yöntem kullanilarak tespit edilir. Orta/yüksek frekanslarin anlik enerjisi, pürüzsüzlestirilmis bir enerji tahmini ile karsilastirilir. Orta/yüksek frekanslarin anlik enerjisi, asagida belirtilen sekilde hesaplanir eriginlihz) : Z Xiiiagußn) , Pürüzsüzlestirme, birinci sinif bir llR filtresi kullanilarak gerçeklestirilir x""i5"**(n) z 0.1 ixinüenhrii) + 0.? urggîg'mrn -- 1). (32) Eger Xmgmnmyxgijîgmh(n) M9' bu durumda bir geçis belirlenir. Esik degeri 9, istenilen geçis miktarini belirlemek için ince ayarlanabilir. Örnegin 9 = 2 kullanilabilir. Belirlenen çerçeve, dogrudan geçici çerçeve olarak seçilmemistir. Bunun yerine, yere] enerji maksimumu, bunun çevresinden arastirilir. Mevcut uygulamada seçilen aralik, [n - 2, n + 7]'dir. Bu araliktaki maksimum enerjiye sahip temporal çerçeve, geçis olarak seçilir. Teorik olarak, dikey düzeltme modu geçis durumlari için de uygulanabilir. Ancak, geçici durumlarda, taban bandin faz spektrumu çogu zaman yüksek frekanslari yansitmaz. Bu da islenmis sinyaldeki pre- ve post-ekolara yol açabilir. Dolayisiyla geçis durumlari için hafifçe degistirilmis olan isleme önerilmektedir. Yüksek frekanslardaki geçici ortalama PDF hesaplanir rarstzii'wiarz::'i'mt.1:, ":1 ~ l. i ;:- 36 :2 :H: &58'i Geçici çerçeve için faz spektrumu, denklem 24'te oldugu gibi bu sabit faz degisimi kullanilarak sentezlenir ancak "al/Hi", ***1*** ` ) ile deglStirilir. AYm duzeltme, [n - 2, 11 + 2] araligi içindeki temporal çerçevelere uygulanir [n: QMF'nin özelliklerinden ötürü n-1 ve n+1 çerçevelerinin PDF'ine eklenir, bakiniz Bölüm 6). Bu düzeltme halihazirda bulunan uygun bir konuma bir geçisi meydana getirmektedir, ancak geçisin sekli zorunlu olarak istenildigi gibi degildir ve QMF çerçevelerinin önemli temporal örtüsmesi nedeniyle önemli yan kulaklar [yani, ilave geçisler) mevcut olabilir. Bu nedenle mutlak faz açisi da dogru olmalidir. Mutlak açi, sentezlenen ve orijinal faz spektrumu arasindaki ortalama hata hesaplanarak düzeltilir. Düzeltme, geçicinin her bir temporal çerçevesi için ayri ayri gerçeklestirilir. Geçici düzeltmesinin sonucu, Sekil 46'da gösterilmistir. Fazi düzeltilmis SBR kullanilarak QMF alanindaki viyolon + alkis sinyalinin zaman içerisindeki faz türevi X1dt(k,n] gösterilmistir. Sekil 47b, frekans Xldf[k,n) üzerinde ilgili bir faz türevinin gösterir. Yeniden renk gradyeni, kirmizi = n'den mavi = -n'ye kadar olan faz degerlerini belirtir. Dogrudan kopyalamaya kiyasla farkin büyük olmamasina ragmen, fazi düzeltilmis alkisin orijinal sinyalle ayni keskinlige sahip oldugu algilanabilir. Bu nedenle, tüm durumlarda degil sadece dogrudan kopyalama etkin oldugunda geçici düzeltme gereklidir. Aksine, eger PDT düzeltmesi etkinlestirilmisse, PDT düzeltmesi aksi takdirde geçici olarak sürtünmeye yol açacagi için geçici islemin yapilmasi önemlidir. 9 Düzeltme verilerinin sikistirilmasi Bölüm 8, faz hatalarinin düzeltilebilecegini gösterdi, ancak düzeltme için yeterli bit hizi hiç dikkate alinmadi. Bu bölüm, yöntemlerin düzeltme verisinin düsük bit hiziyla nasil temsil edilecegini göstermektedir. 9.1 PDT düzeltme verilerinin sikistirilmasi - yatay düzeltme için hedef spektrumun olusturulmasi PDT düzeltmesini saglamak için iletilebilecek birçok olasi parametre vardir. Ancak gm Ü" "1 zaman içinde puruzsuzlestirildiginden oturu dusuk bit hizi iletimi için potansiyel bir adaydir. Ilk olarak, parametreler için yeterli bir güncelleme hizi tartisilmaktadir. Deger sadece her N çerçevesi için güncellendi ve aralarinda dogrusal olarak enterpolasyon yapildi. Iyi kalite Için güncelleme araligi yaklasik 40 ms'dir. Bazi sinyaller için bir bit, biraz daha az avantajlidir ve digerleri için bir bit biraz daha fazladir. Resmi dinleme testleri, optimal bir güncelleme oranini degerlendirmek için yararli olacaktir. Bununla birlikte, nispeten uzun bir güncelleme araligi kabul edilebilir görünmektedir. "dm U" 71) için uygun bir açisal kesinlik de çalisildi. Algisal olarak iyi kalite için 6 hit (64 olasi açi degeri) yeterlidir. Ayrica sadece degerdeki degisikligin iletimi test edildi. Çogu zaman degerler sadece biraz degismis gibi gözükür, küçük degisimler için daha fazla dogruluk elde etmek için esit olmayan nicemleme uygulanabilir. Bu yaklasimi kullanarak, 4 bitin [16 olasi açi degeri), iyi kalite sagladigi bulundu. Dikkate alinacak son sey, uygun spektral dogruluktur. Sekil 17'de görülebilecegi gibi, birçok frekans bandi, kabaca ayni degeri paylasiyor gibi görünmektedir. Bu nedenle, birkaç frekans bandini temsil etmek için muhtemelen bir deger kullanilabilir. Ek olarak, yüksek frekanslardaki bir frekans bandinda çoklu harmonikler vardir, bu nedenle daha az dogruluk gerekebilir. Yine de, baska, potansiyel olarak daha iyi bir yaklasim bulundu, bu yüzden bu seçenekler iyice arastirilmadi. Önerilen, daha etkili bir yaklasim, asagida tartisilmistir. 9.1.1 PDT düzeltme verilerinin sikistirilmasi için frekans tahmininin kullanilmasi Bölüm S'te ele alindigi üzere, zaman içinde faz türevi temel olarak üretilen sinüzoidin frekansi anlamina gelir. Uygulanan 64-bant kompleks QMF'nin PDT'leri asagidaki denklem kullanilarak frekanslara dönüstürülebilir . . 'v.. î` ;ill (HA *k XIWUC, n) :::i g: iii-;HI + ({ ?izah-band Il) + Ligi: i» %1 mod !N (34) Üretilen frekanslar, fmter(k) = [fc[k) -fBw,fc(k] +fBw] araligi içinde olup buradachQ, frekans bandi k'nin merkez frekansidir ve fBw, 375 Hz'dir. Sonuç, viyolon sinyali için QMF bantlarinin XfreciUçn) frekanslarinin bir zaman-frekans gösteriminde Sekil 47'de gösterilmektedir. Frekanslarin, tonusun temel frekansinin katlarini takip ettigi görülmekte ve harmonikler, temel frekansta, frekansa göre ayrilir. Ek olarak, titretmenin, frekans modülasyonuna neden oldugu görülmektedir. Ayni grafik, dogrudan kopyalama Zfret1[k,n) ve düzeltilmis fm ÜVYZJSBR'ye uygulanabilir (bakiniz sirasiyla Sekil 48a ve Sekil 48b). Sekil 48a, Sekil 47'de gösterilen orijinal sinyal XWJ(k,n] ile karsilastirildiginda dogrudan kopyalama SBR sinyali ZfretIUgn] QMF bantlarinin frekanslarinin bir zaman-frekans temsilini göstermektedir. Sekil 48b, düzeltilmis SBR sinyali 'adi lk› m orijinal sinyal, mavi renkte çizilmis olup burada dogrudan kopyalama SBR'si ve düzeltilmis SBR sinyalleri, kirmizi ile çizilmistir. Dogrudan kopyalama SBR'nin uyumsuzlugu, Özellikle örnegin baslangicinda ve sonunda, görülebilir. Ek olarak, frekans modülasyon derinliginin orijinal sinyalin açikligindan daha küçük oldugu görülebilir. Aksine, düzeltilmis SBR durumunda, harmoniklerin frekanslari orijinal sinyalin frekanslarini izledigi görünmektedir. Ayrica modülasyon derinliginin dogru oldugu görülmektedir. Böylece, bu grafik, önerilen düzeltme yönteminin geçerliligini dogrulamaktadir. Bu nedenle, sonraki düzeltme verilerinin gerçek sikistirilmasi üzerinde yogunlasmistir. Xfreq(k,n] frekanslari ayni uzaklikta yerlestirildiginden, frekanslar arasindaki mesafe tahmin edilir ve iletilirse tüm frekans bantlarinin frekanslari yaklastirilabilir. Harmonik sinyaller olmasi durumunda araliklar, tonun temel frekansina esit olmalidir. Böylece, tüm frekans bantlarini temsil etmek için sadece tek bir sinyal degeri iletilmelidir. Daha düzensiz sinyaller olmasi durumunda, harmonik davranisi tanimlamak için daha fazla degere ihtiyaç vardir. Örnegin, bir piyano tonu olmasi durumunda harmoniklerin mesafesi biraz artar [14]. Basitlik açisindan, asagidakilerin harmoniklerin ayni mesafede yerlestirildigi varsayilmaktadir. Bununla birlikte, bu tarif edilen ses islemenin genelligini sinirlandirmamaktadir. Dolayisiyla harmoniklerin frekanslarini tahmin etmek için tonun temel frekansi tahmin edilir. Temel frekansin tahmini, yaygin olarak incelenen bir konudur (örnegin bakiniz bir tahmin yöntemi uygulanmistir. Yöntem, temel olarak harmoniklerin araliklarini hesaplar ve sonucu, bazi bulussal yöntemlere göre birlestirir (ne kadar enerji, frekans ve zaman içerisinde ne kadar kararli, vb.]. Her durumda sonuç, her bir temporal çerçeve Xf0(n) için temel frekans tahminidir. Baska bir deyisle, zaman içinde faz türevi, karsilik gelen QMF kutusunun frekansi ile ilgilidir. Ayrica, PDT'deki hatalarla ilgili yapilar çogunlukla harmonik sinyallerle algilanabilir. Böylece, hedef PDT'nin [bakiniz Denklem 16a], fo temel frekansinin tahminini kullanarak tahmin edilebilecegi önerilmistir. Temel bir frekansin tahmini, yaygin olarak incelenen bir konudur ve temel frekansin güvenilir tahminlerini elde etmek için birçok uygun yöntem mevcuttur. Burada, BWE'yi gerçeklestirmeden önce dekoder tarafindan bilinen ve BWE içinde bulusa ait faz düzeltmesinin kullanildigi temel frekans Xf°[n) varsayilmaktadir. Bu nedenle, kodlama asamasinin tahmini temel frekansi XfÜÜi] iletmesinden ötürü avantajlidir. Ilaveten, gelistirilmis kodlama verimliligi için, deger sadece, örnegin, her 20. temporal çerçeve (-27 ms'lik bir araliga karsilik gelen] için güncellenebilir ve aralarinda enterpolasyona tabi tutulabilir. Alternatif olarak, temel frekans kod çözme asamasinda tahmin edilebilir ve hiçbir bilgi iletilmemelidir. Bununla birlikte, tahmin, kodlama asamasinda orijinal sinyal ile gerçeklestirilirse daha iyi tahminler beklenebilir. Dekoder islemi, her bir temporal çerçeve için bir temel frekans tahminini X1`°[n) elde ederek baslar. Harmoniklerin frekanslari, bir indeks vektörü ile çarpilarak elde edilebilir V K 3 ?ll : Xha'lmût, n) : K ~ X" ('71) (35) Sonuç, Sekil 49'ta gösterilmistir. Sekil 49, orijinal sinyalin XFFBEI(k,n) QMF bantlarinin frekanslarina kiyasla Xharm(K,n] tahmini harmonik frekanslarinin bir zaman frekansi temsilini göstermektedir. Yine mavi renk, orijinal sinyali gösterir ve kirmizi ise tahmini sinyaldir. Tahmin edilen harmoniklerin frekanslari, orijinal sinyale oldukça iyi uymaktadir. Bu frekanslar, 'izin verilen' frekanslar olarak düsünülebilir. Eger algoritma bu frekanslari üretiyorsa, uyumsuzlukla ilgili yapilardan kaçinilmalidir. Algoritmanin iletilen parametresi, temel frekans Xf°[n)'dir. Gelismis kodlama verimliligi için, deger sadece her 20. temporal çerçeve için güncellenir [yani her 27 ms'de bir). Bu degerin, resmi olmayan dinlemeye dayanan iyi algisal kalite sagladigi görülmektedir. Bununla birlikte, resmi dinleme testleri, güncelleme orani için daha uygun bir degerin degerlendirilmesi için kullanislidir. Algoritmanin bir sonraki asamasi, her bir frekans bandi için uygun bir deger bulmaktir. Bu, bandi yansitmak için her bir fc(k] bandinin merkez frekansina en yakin olan Xharlnûçn] degeri seçerek gerçeklestirilir. Eger en yakin deger, frekans bandinin (flnter[k]) olasi degerlerinin disindaysa, bandin sinir degeri kullanilir. Ortaya çikan matris eh l , } her bir zaman-frekans karosu için bir frekansi ihtiva eder. Düzeltme veri sikistirma algoritmasinin son asamasi, frekans verisini PDT verilerine geri dönüstürmektir › ,freti y Xâigiiwn) : 2". (îgsgmßig mod 1), (36) burada modÜ, modülo operatörünün belirtir. Dogru düzeltme algoritmasi, Bölüm 8.1'de gösterilen sekilde çalisir. Denklem 16a'da Zül 09") hedef PDT olarak Xeh (km.) ile degistirilir ve Denklemler 17-19, Bölüm 8.1'de belirtilen sekilde kullanilir. Düzeltme algoritmasinin sikistirilmis düzeltme verileriyle sonucu Sekil SÜ'de gösterilmektedir. Sekil 50, sikistirilmis düzeltme verileri ile düzeltilmis SBR'nin QMF alanindaki viyolon sinyalinin PDT'sindeki Dsm (kul) hatayi gösterir. Sekil SOb, zaman içerisinde ilgili faz türevini gösterir ich (k' 21)' Renk gradyenleri, kirmizi :ir'den mavi :-71'ye kadar olan degerleri belirtir. PDT degerleri, orijinal sinyalin PDT degerlerini, veri sikistirmasi olmadan düzeltme yöntemi ile benzer dogrulukta takip eder [bakiniz Sekil 18). Dolayisiyla sikistirma algoritmasi geçerlidir. Algilanan kalite, düzeltme verilerinin sikistirilmasi ile ve bu olmadan benzerdir. Düzenlemeler, her bir deger için toplam 12 bit kullanarak düsük frekanslar için daha fazla ve yüksek frekanslar için daha düsük dogrulugu kullanirlar. Elde edilen bit hizi yaklasik 0.5 kbps'dir (entropi kodlamasi gibi herhangi bir sikistirma olmadan). Bu dogruluk, nicemleme olmadan esit algilanan kalite üretir. Bununla birlikte, önemli ölçüde daha düsük bit hizi, yeteri kadar iyi algilanan kalite üreten birçok durumda kullanilabilir. Düsük bit hizi semalari için bir seçenek, iletilen sinyali kullanarak kod çözme asamasinda temel frekansi tahmin etmektir. Bu durumda hiçbir deger iletilmemelidir. Diger bir seçenek, iletilen sinyali kullanarak temel frekansi tahmin etmek, genis bant sinyali kullanilarak elde edilen tahmine göre karsilastirmak ve sadece farki iletmektir. Bu farkin çok düsük bit hizi kullanilarak temsil edilebilecegi varsayilabilir. 9.2 PDF düzeltme verilerinin sikistirilmasi Bölüm 8.2'de tartisildigi üzere PDF düzeltmesi için uygun veriler, birinci frekans yamasinin ortalama faz hatasidir java ah): Düzeltme, bu degerin bilgisi ile tüm frekans yamalari için yapilabilir, böylece her bir temporal çerçeve için sadece bir degerin iletilmesi gerekir. Ancak, her bir temporal çerçeve için tek bir degerin bile iletilmesi, çok yüksek bir bit hizina neden olabilir. Trombon için Sekil 12'nin incelenmesinde, PDF'in frekans üzerinden nispeten sabit bir degere sahip oldugu ve birkaç temporal çerçeve için ayni degerin mevcut oldugu görülebilir. Deger, ayni geçisin QMF analiz penceresinin enerjisine hakim oldugu sürece zaman içinde sabittir. Yeni bir geçis baskin olmaya basladiginda, yeni bir deger bulunur. Bu PDF degerleri arasindaki açi degisimi, bir geçisten digerine ayni gibi görünmektedir. Bu, PDF'in geçisin temporal lokasyonunu kontrol ettiginden ve sinyal sabit bir temel frekansa sahipse, geçisler arasindaki araligin sabit olmasi gerektiginden anlamlidir. Bu nedenle, PDF [veya bir geçis konumu), zaman içinde sadece seyrek olarak iletilebilir ve bu zaman anlari arasindaki PDF davranisi, temel frekans bilgisi kullanilarak tahmin edilebilir. PDF düzeltmesi, bu bilgiler kullanilarak gerçeklestirilebilir. Bu fikir aslinda harmoniklerin frekanslarinin esit aralikli oldugu farz edilen PDT düzeltmesi için esleniktir. Burada da ayni fikir kullanilir, ancak bunun yerine geçislerin temporal konumlarinin esit aralikli oldugu varsayilir. Asagida, dalga seklindeki piklerin konumlarinin saptanmasina dayanan ve bu bilgiyi kullanarak, faz düzeltmesi için bir referans spektrumu olusturulmaktadir. 9.2.1 PDF düzeltme verilerinin sikistirilmasi için pik belirlemenin kullanimi - Dikey düzeltme için hedef spektrumun olusturulmasi Piklerin konumlari, basarili bir PDF düzeltmesi gerçeklestirmek için tahmin edilmelidir. Bir çözüm, Denklem 34'te oldugu gibi, ayni sekilde PDF degerini kullanarak piklerin konumlarini hesaplamak ve tahmini temel frekansi kullanarak piklerin konumlarini tahmin etmek olacaktir. Ancak bu yaklasim nispeten stabil bir temel frekans tahminini gerektirecektir. Düzenlemeler, önerilen sikistirma yaklasiminin mümkün oldugunu gösteren basit, hizli uygulanabilen bir alternatif yöntem göstermektedir. Trombon sinyalinin bir zaman-alani temsili, Sekil 51`de gösterilmektedir. Sekil 51a, bir zaman alani gösteriminde trombon sinyalinin dalga seklini gösterir. Sekil 51b, sadece tahmini piklerini ihtiva eden karsilik gelen zaman alani sinyalini gösterir, burada piklerin konumlari iletilen meta veriler kullanilarak elde edilmistir. Sekil 51b'deki sinyal, örnegin Sekil 30'a istinaden açiklanan puls kataridir (265). Algoritma, dalga seklindeki piklerin konumlarini analiz ederek baslar. Bu, yerel maksimumlar aranarak gerçeklestirilir. Her 27 ms için (yani, her 20 QMF çerçevesi için), çerçevenin merkez noktasina en yakin olan pikin konumu iletilir. Iletilen pik konumlari arasinda, pik noktalarinin zaman içinde esit aralikli oldugu varsayilmaktadir. Böylece, temel frekansi bilerek, piklerin konumlari tahmin edilebilir. Bu düzenlemede, saptanan piklerin sayisi iletilir [bunun, tüm piklerin basarili bir sekilde tespit edilmesini gerektirdigi unutulmamalidir; temel frekansa dayali tahmin, muhtemelen daha dogru sonuçlar verecektir). Elde edilen bit hizi yaklasik 0.5 kbps'dir konumunu 9 bit kullanarak iletmeyi ve 4 bit kullanarak aralarindaki geçis sayisini iletmeyi ihtiva eder. Bu dogruluk, nicemleme olmadan esit algilanan kalitede üretildi. Bununla birlikte, önemli ölçüde daha düsük bit hizi, yeteri kadar iyi algilanan kalite üreten birçok durumda kullanilabilir. lletilen meta verileri kullanarak, tahmin edilen pik noktalarinin konumlarindaki impulslari ihtiva eden bir zaman-alani sinyali olusturulur [bakiniz Sekil 51 b). QMF analizi, bu sinyal için olusturulur ve faz spektrumu )hesaplanir. Gerçek PDF düzeltmesi, Bölüm 8.2'de önerildigi gibi gerçeklestirilir ancak Zî'h (lwl) Denklem 20a'da, asagida belirtilen sekilde degistirilir XEV 0%' n). Dikey faz uyumluluguna sahip sinyallerin dalga sekli tipik olarak bir puls katarini andirir. Böylece, dikey düzeltme Için hedef faz spektrumunun, karsilik gelen konumlarda piklere ve karsilik gelen bir temel frekansa sahip bir puls katarinin faz spektrumu olarakseklinde modellenmesi ile tahmin edilebilecegi öne sürülmüstür. Temporal çerçevenin merkezine en yakin konum, örnegin, her 20. temporal çerçeve için iletilir (-27 ms'lik bir araliga karsilik gelir). Esit oran ile iletilen tahmini temel frekans, iletilen konumlar arasindaki pik konumlarini enterpole etmek için kullanilir. Alternatif olarak, temel frekans ve pik konumlari, kod çözme asamasinda tahmin edilebilir ve hiçbir bilgi iletilmemelidir. Bununla birlikte, tahmin, kodlama asamasinda orijinal sinyal ile gerçeklestirilirse daha iyi tahminler beklenebilir. Dekoder islemi, her bir temporal çerçeve için bir temel frekans tahmini Xm[n] elde etmekle baslar ve ek olarak, dalga seklindeki pik konumlari tahmin edilir. Pik konumlari, bu konumlarda impulslardan olusan bir zaman-alani sinyali olusturmak için kullanilir. QMF analizi, ilgili faz spektrumunu olusturmak için kullanilir Xév 0631)' Bu tahmini faz spektrumu, hedef faz spektrumu olarak Denklem 20a'da kullanilabilir zaman = xgjmrk, n). (37) Önerilen yöntem, sadece tahmin edilen pik konumlarini ve temel frekanslari, örnegin, 27 ms'lik güncelleme orani ile iletmek üzere kodlama asamasini kullanir. Ek olarak, dikey faz türevindeki hatalarin sadece temel frekansin nispeten düsük oldugunda algilanabilecegine dikkat edilmelidir. Böylece, temel frekans nispeten düsük bir bit hizi ile iletilebilir. Düzeltme algoritmasinin sikistirilmis düzeltme verileriyle sonucu Sekil 52'de gösterilmektedir. Sekil 52a, sikistirilmis düzeltme verileri ile düzeltilmis SBR ile QMF alanindaki trombon sinyalinin faz spektrumundaki ÜÇ., *AJÜ hatayi gösterir. Ayrica Sekil 52b, frekans üzerindeki ilgili faz türevini gösterir ZCV (km. Renk gradyeiii, kirmizi degerlerini, veri sikistirmasi olmadan düzeltme yöntemi ile benzer dogrulukta takip eder verilerinin sikistirilmasi ile ve bu olmadan henzerdir. 9.3 Geçici isleme verilerinin sikistirilmasi Geçislerin nispeten seyrek oldugu varsayilirsa, bu verinin dogrudan aktarilabilecegi göz önüne alinabilir. Düzenlemeler, geçis basina alti degerin iletilmesini gösterir: ortalama PDF için bir deger ve mutlak faz açisindaki hatalar için bes deger [[n - 2, n + 2]) aralik içindeki her bir temporal çerçeve için bir deger). Bir alternatif, geçici konumun (yani bir deger] konumunu iletmek ve dikey düzeltme durumunda oldugu gibi hedef faz spektrumunu "cz ' Eger bit hizinin, geçisler için sikistirilmasi gerekiyorsa PDF düzeltmesi için de benzer bir yaklasim kullanilabilir (bakiniz Bölüm 9.2). Basit bir sekilde bir geçisin konumu iletilebilir, yani tek bir deger. Hedef faz spektrumu ve hedef PDF, Bölüm 9.2'deki gibi bu konum degeri kullanilarak elde edilebilir. Alternatif olarak, geçis konumu, kod çözme asamasinda tahmin edilebilir ve hiçbir bilgi iletilmemelidir. Bununla birlikte, tahmin, kodlama asamasinda orijinal sinyal ile gerçeklestirilirse daha iyi tahminler beklenebilir. Önceden açiklanan düzenlemelerin tümü, diger düzenlemelerden veya uygulamalarin bir kombinasyonundan ayri olarak görülebilir. Bu nedenle Sekiller 53 ila 57, daha önce tarif edilen bazi düzenlemeleri birlestiren bir kodlayiciyi ve bir dekoderi göstermektedir. Sekil 53, bir ses sinyalinin kodunu çözmek için bir dekoderi [110"] göstermektedir. Dekoder (110"); bir birinci hedef spektrum üretecini (65a), bir birinci faz düzelticiyi (70a) ve bir ses alt bant sinyal hesaplayicisini [350) ihtiva eder. Hedef faz ölçüm belirleyicisi olarak da anilan birinci hedef spektrum üreteci (653], birinci düzeltme verisini (295a) kullanilarak ses sinyalinin (32) bir alt bant sinyalinin bir birinci zaman çerçevesi için bir hedef spektrum (85a"] olusturur. Birinci faz düzelticisi [70a), alt bant sinyalinin bir fazi çerçevesinde düzeltilmekte olup burada düzeltme, ses sinyalinin [32) birinci zaman çerçevesindeki alt bant sinyalinin Ölçütü ve hedef spektrum [85"] arasindaki bir farkin azaltilmasi ile gerçeklestirilir. Ses alt bant sinyal hesaplayicisi (350], zaman çerçevesi için düzeltilmis bir faz [91a) kullanarak birinci zaman çerçevesi için ses alt bant sinyalini [355) hesaplar. Alternatif olarak, ses alt bant sinyal hesaplayicisi (350) ikinci zaman çerçevesinde alt bant sinyalin [85a") ölçümünü kullanarak veya bir baska faza göre bir düzeltilmis faz hesaplamasi kullanarak faz düzeltme algoritmasindan farkli düzeltme algoritmasina uygun olarak birinci zaman çerçevesinden farkli bir ikinci zaman çerçevesi için ses alt bant sinyalini (355) hesaplar. Sekil 53 ayrica istege bagli olarak ses sinyalini göstermektedir. Diger faz düzeltme algoritmasi, bir ikinci faz düzeltici (7%) veya bir üçüncü faz düzelticisi [70c) içinde gerçeklestirilebilir. Diger faz düzelticiler, Sekil 54'e istinaden gösterilecektir. Ses alt bant sinyal hesaplayicisi [250), birinci zaman çerçevesi için düzeltilmis faz (91] ve birinci zaman çerçevesinin ses alt bant sinyalinin büyüklügü degeri (47] kullanilarak birinci zaman çerçevesi için ses alt bant sinyalini hesaplamakta olup burada büyüklük degeri [47), birinci zaman çerçevesinde ses sinyalinin [32) büyüklügü veya birinci zaman çerçevesindeki ses sinyalinin (35] islenmis büyüklügüdür. Sekil 54, dekoderin [110") diger bir düzenlemesini göstermektedir. Bu nedenle, dekoder üreteci (65b), ikinci düzeltme verilerini (295b) kullanilarak ses sinyalinin (32) alt bandinin ikinci zaman çerçevesi için bir hedef spektrum [85b"] üretir. Detektör [110"] ek olarak, ikinci bir faz düzeltme algoritmasi ile belirlenen ses sinyalinin (32] zaman çerçevesinde alt bandin bir fazini (45] düzeltmek için bir ikinci faz düzelticisini [70b) ihtiva etmekte olup burada düzeltme, ses sinyalinin alt banti ve hedef spektrumun (85b") zaman çerçevesi ölçümü arasindaki farkin azaltilmasi ile gerçeklestirilir. Dolayisiyla, dekoder (110"], üçüncü hedef spektrum üretecini (65c) ihtiva eder, burada üçüncü hedef spektrum üreteci [6503), üçüncü düzeltme verilerini [295c] kullanilarak ses sinyalinin (32] alt bandinin üçüncü zaman çerçevesi için bir hedef spektrum üretir. Ayrica, dekoder (110") alt bant sinyalinin bir fazini [45) ve üçüncü faz düzeltme algoritmasiyla belirlenen ses sinyalinin (32] zaman çerçevesini düzeltmek için bir üçüncü faz düzelticisini [70c] ihtiva etmekte olup burada düzeltme, ses sinyalinin alt bandi ve hedef spektrumun [85c] zaman çerçevesi ölçümü arasindaki bir farki azaltarak gerçeklestirilir. Ses alt bant sinyal hesaplayicisi [350], üçüncü faz düzelticisinin faz düzeltmesini kullanarak birinci ve ikinci zaman çerçevelerinden farkli bir üçüncü zaman çerçevesi için ses alt bant sinyalini hesaplayabilir. Bir düzenlemeye göre, birinci faz düzelticisi [70a), ses sinyalinin bir önceki zaman çerçevesinin bir faz düzeltilmis alt bant sinyalini (91a) depolamak için veya üçüncü faz düzelticisinin (70c) ikinci faz düzelticisinden [70b) önceki ses sinyalinin bir önceki zaman çerçevesinin (375] bir faz düzeltilmis alt bant sinyalini almak için konfigüre edilmistir. Ayrica, birinci faz düzelticisi (70a), ses sinyalinin (32] fazi (45) önceki zaman çerçevesinin sinyalini mevcut zaman çerçevesinde düzeltilir. Diger düzenlemeler, bir birinci faz düzelticisini [70a) bir yatay faz düzeltmesi, ikinci faz düzelticisinin [70b) bir dikey faz düzeltmesi gerçeklestirdigini ve üçüncü faz düzelticisinin Diger bir bakis açisindan Sekil 54, faz düzeltme algoritmasindaki kod çözme asamasinin bir blok diyagramini göstermektedir. lslemin girdisi, zaman-frekans alanindaki ve meta verilerdeki BWE sinyalidir. Yine, pratik uygulamalarda, filtre bankasinin birlikte kullanilmasi veya mevcut bir BWE semasinin dönüstürülmesi için bulusa ait faz-türev düzeltmesinin uygulanmasi tercih edilir. Mevcut örnekte bu, SBR'de kullanilan sekilde bir QMF alanidir. Bir birinci çogullama çözücü (gösterilmemistir), bulusun düzeltmesi ile arttirilan BWE ile donanmis algisal kodekin bit akisindan faz türev düzeltme verilerini ekstrakte eder. Ikinci bir çogullama çözücüsü [ farkli düzeltme modlari için aktivasyon verisine [365] ve daha sonra düzeltme verisine [295a-c) böler. Aktivasyon verilerine dayanarak, dogru düzeltme modu için hedef spektrumun hesaplanmasi aktive edilir [digerleri bosta olabilir). Hedef spektrumu kullanarak, istenen düzeltme modu kullanilarak alinan BWE sinyaline faz düzeltmesi yapilir. Yatay düzeltmenin (7051) ardisik olarak gerçeklestirildigi (diger bir deyisle: önceki sinyal çerçevelerine bagli olarak), diger düzeltme modlari [70b, c] da önceki düzeltme matrislerini aldigi dikkate alinmalidir. Son olarak, düzeltilmis sinyal veya islenmemis olanlar, aktivasyon verilerine dayanarak çiktiya ayarlanir. Faz verisini düzelttikten sonra, mevcut örnek içerisinde SBR sentezi durumunda altta yatan BWE sentezi asagi yönde devam eder. BWE sentez sinyali akisina tam olarak faz düzeltmesinin eklendigi yerlerde varyasyonlar mevcut olabilir. Tercih edildigi üzere, faz türevi düzeltme, faz ZPhaUgn] olan ham spektral yamalar üzerinde bir baslangiç ayarlamasi olarak yapilir ve tüm ilave BWE isleme veya ayarlama asamalari [SBR'de bu, gürültü eklenmesi, ters filtreleme, kayip sinuzoitler, vb. olabilir), duzeltilmis fazlar 3:' ""11" üzerinde asagi akista gerçeklestirilir. Sekil 55, dekoderin (110") diger bir düzenlemesini göstermektedir. Bu düzenlemeye göre, dekoderi (110"], Sekil 54'te gösterilen önceki uygulamalara göre bir çekirdek dekoder (A) ihtiva eder. Çekirdek dekoder (115), ses sinyaline (55) göre azaltilmis sayida alt bant ile bir zaman çerçevesindeki bir ses sinyalini (25) kod çözmek için konfigüre etmistir. Yama Olusturucu (120), azalan sayidaki alt bantlar ile çekirdek kodu çözülmüs ses sinyalinin (25) alt bant setine yama uygulamakta olup burada alt bantlar seti, bir alt bant sayisi ile bir ses sinyali (32) elde etmek için, azalan alt bant sayisina bitisik zaman çerçevesindeki daha fazla sayidaki alt bant için bir birinci yamayi olusturur. Büyüklük islemcisi (125') zaman çerçevesindeki ses alt bant sinyalinin (355) büyüklük degerlerini isler. Önceki dekoderlere (110 ve 110') göre, büyüklük islemcisi, bant genisligi uzatma parametresi uygulayicisi (125) olabilir. Sinyal islemci bloklarinin nerede degistirildigine dair baska birçok düzenleme düsünülebilir. Örnegin, büyüklük islemcisi (125') ve blok (A) degistirilebilir. Bu nedenle, blok (A), yamalarin büyüklük degerlerinin daha önceden düzeltilmis oldugu sekilde yeniden yapilandirilmis ses sinyali (35) üzerinde çalisir. Alternatif olarak, ses alt bant sinyal hesaplayicisi (350), düzeltilmis ses sinyalini (355) düzeltilmis fazdan ve ses sinyalinin büyüklügünü düzeltilmis kismindan olusturmak için büyüklük islemcisinden (1 2 5 ') sonra yerlestirilebilir. Ayrica, dekoder (110"), frekansla birlestirilen islenmis ses sinyalini (90) elde etmek için faz ve büyüklügü düzeltilmis ses sinyalini sentezlemek için bir sentezleyiciyi (100) ihtiva eder. Istege bagli olarak, çekirdek kodu çözülmüs ses sinyali (25) üzerinde ne büyüklük ne de faz düzeltmesi uygulandigindan, söz konusu ses sinyali dogrudan sentezleyiciye (100) iletilebilir. Daha önce tarif edilen dekoderlerden (110 veya 110') birinde uygulanan herhangi bir istege bagli islem blogu, dekodere (110") de uygulanabilir. Sekil 56, bir ses sinyalini (55) kodlamak için bir kodlayiciyi (155") göstermektedir. Kodlayici (155"), bir hesaplayiciya (270), bir çekirdek kodlayicisina (160), bir parametre ekstraktörüne (165) ve bir çikis sinyali Olusturucuya (170) bagli bir faz belirleyiciyi (380) ihtiva eder. Faz belirleyici (380) ses sinyalinin (55) bir fazini (45) belirlemekte olup burada hesaplayici (270), ses sinyalinin (55) belirlenen fazina (45) dayanan ses sinyali (55) için faz düzeltme verisini (295) belirler. Çekirdek kodlayici (160) çekirdegi, ses sinyaline [55) göre azaltilmis sayida alt bant ihtiva eden bir çekirdek kodlanmis ses sinyali (145) elde etmek için ses sinyalini (55) kodlar. Parametre ekstraktörü (165), çekirdek kodlanmis ses sinyaline dahil edilmeyen ikinci bir alt bant seti için düsük çözünürlükte bir parametre gösterimi elde etmek için ses sinyalinden (55) parametreleri (190) çikartir. Çikis sinyali (170) olusturucusu; parametreleri (190), çekirdek kodlanmis ses sinyalini bagli olarak kodlayici (155"); ses sinyalinin (55) çekirdek kodlamasindan önce bir düsük geçisli filtreyi (180) ve ses sinyalinden (55) parametrelerin (190) ekstrakte edilmesinden önce bir yüksek geçis filtresini (185) ihtiva eder. Alternatif olarak, ses sinyalinin (55) düsük veya yüksek geçisli filtrelenmesi yerine, bir bosluk doldurma algoritmasi kullanilabilir, burada çekirdek kodlayici (160) çekirdegi, alt bant setindeki en az bir alt bandin çekirdek kodlanmis olmadigi azalan bir alt bant sayisini kodlar. Ayrica, parametre ekstraktörü, çekirdek kodlayiciyla (160) kodlanmamis en az bir alt banttan parametreleri (190) ekstrakte eder. Düzenlemelere göre, hesaplayici (2 70), bir birinci varyasyon moduna, bir ikinci varyasyon moduna veya bir üçüncü varyasyon moduna uygun olarak faz düzeltmesini düzeltmek için bir dizi düzeltme verisi hesaplayicisini (285a-c) ihtiva eder. Ayrica, hesaplayici (270), düzeltme verileri hesaplayicilari (285a-c) grubunun bir düzeltme verisi hesaplayicisini aktive etmek için aktivasyon verilerini (365) belirler. Çikis sinyali Olusturucu (170), aktivasyon verilerini, parametreleri, çekirdek kodlanmis ses sinyalini ve faz düzeltme verisini ihtiva eden çikis sinyalini olusturur. Sekil 57, Sekil 56'da gösterilen kodlayicida (155") kullanilabilecek olan bir hesaplayicinin (270) alternatif uygulamasini gösterir. Düzeltme modu hesaplayicisi (385), varyasyon belirleyicisini (275) ve varyasyon karsilastiricisini (280) ihtiva eder. Aktivasyon verileri (365), farkli varyasyonlari karsilastirmanin bir sonucudur. Ayrica, aktivasyon verileri (365), belirlenen varyasyona göre düzeltme verisi hesaplayicilarindan (185a-c) birini çikis sinyali olusturucunun (170) girisi ve dolayisiyla çikis sinyalinin (135) bir parçasi olabilir. Düzenlemeler, hesaplanmis düzeltme verilerini (295a, 295b veya 295c) ve aktivasyon verilerini (365) ihtiva eden bir meta veri akisini (295') olusturan bir meta veri olusturucuyu (390) ihtiva eden hesaplayiciyi (270) göstermektedir. Aktivasyon verileri (365), eger düzeltme verisinin kendisi mevcut düzeltme modunun yeterli bilgilerini ihtiva etmiyorsa, dekodere iletilebilir. Yeterli bilgi, örnegin, düzeltme verileri (295a], düzeltme verileri (295b) ve düzeltme verileri (295c] için farkli olan düzeltme verisini temsil etmek için kullanilan çok sayida veri olabilir. Ayrica, çikis sinyali Olusturucusu (170) ek olarak aktivasyon verisini (365] de kullanabilir, böylece meta veri Olusturucu (390) göz ardi edilebilir. Diger bir bakis açisindan Sekil 57'deki blok diyagram, faz düzeltme algoritmasmdaki kodlama asamasini göstermektedir. Islem girdisi, orijinal ses sinyali [55] ve zaman- frekans alanidir. Pratik uygulamalarda, filtre bankasinin birlikte kullanilmasi veya mevcut bir BWE semasinin dönüstürülmesi için bulusa ait faz-türev düzeltmesinin uygulanmasi tercih edilir. Mevcut örnekte bu, SBR'de kullanilan bir QMF alanidir. Düzeltme modu hesaplama blogu ilk önce her bir temporal çerçeve için uygulanan düzeltme modunu hesaplar. Aktivasyon verilerine (365] dayanarak, düzeltme verilerinin (295a-c] hesaplanmasi, dogru düzeltme modunda aktive edilir (digerleri bosta olabilir). Son olarak, çogullayici (MUX), aktivasyon verilerini ve düzeltme verisini farkli düzeltme modlarindan birlestirir. Baska bir çogullayici (gösterilmemistir) faz-türevi düzeltme verisini, BWE'nin bit akisina ve bulus düzeltmesi ile gelistirilen algisal kodlayiciya birlestirir. Sekil 58, bir ses sinyalinin kodunu çözmek için bir yöntemi (5800] göstermektedir. Yöntem (5800); "birinci düzeltme verisini kullanarak bir birinci hedefspektrum üreteci ile ses sinyalinin bir alt bant sinyalinin bir birinci zaman çerçevesi için bir hedef spektrumun üretilmesi" asamasini (5805), ses sinyalinin birinci zaman çerçevesindeki alt bant sinyalinin bir fazini, bir faz düzeltme algoritmasi ile belirlenen bir birinci faz düzelticisi ile düzeltilmesi olup burada düzeltme, ses sinyalinin birinci zaman çerçevesinde ve hedef spektrumda alt bant sinyalinin bir ölçümü arasindaki bir farkin azaltilmasiyla gerçeklestirilir" asamasini (5810) ve "birinci zaman çerçevesi için ses alt bant sinyalinin zaman çerçevesinin düzeltilmis bir fazini kullanarak ve ikinci zaman çerçevesindeki alt bant sinyalinin ölçümünü kullanarak birinci zaman çerçevesinden farkli bir ikinci zaman çerçevesi için ses alt bant sinyallerini hesaplamak için bir ses alt bandi sinyal hesaplayicisi ile hesaplanmasi veya faz düzeltme algoritmasindan farkli bir baska faz düzeltme algoritmasina göre düzeltilmis bir faz hesaplamasinin kullanilmasi" asamasini (5815) ihtiva eder. Sekil 59, bir ses sinyalinin kodlanmasi için bir yöntemi (5900) göstermektedir. Yöntem sinyalinin belirlenen fazina dayanan bir hesaplayici ile bir ses sinyali için faz düzeltme verilerinin belirlenmesi" asamasini (5910), "ses sinyaline göre azaltilmis sayida alt bant ihtiva eden bir çekirdek kodlanmis ses sinyali elde etmek için bir çekirdek kodlayici ile ses sinyalinin çekirdek kodlamasi" asamasini (5915], "çekirdek kodlanmis ses sinyaline dahil edilmeyen ikinci bir alt bant seti için düsük çözünürlüklü bir parametre temsili elde etmek için bir parametre ekstraktörü ile ses sinyalinden parametrelerin ekstrakte edilmesi" asamasini (5920] ve "parametreler, çekirdek kodlanmis ses sinyali ve faz düzeltme verisini ihtiva eden bir çikis sinyali Olusturucu ile bir çikis sinyali olusturma" asamasini yöntemler (5800 ve 5900), bir bilgisayarda yürütülebilecek olan bir bilgisayar programinda uygulanabilir. Ses sinyalinin (55), bir ses sinyali için, özellikle orijinal, yani islenmemis ses sinyali, ses sinyalinin iletilen kismi Xtrans(k,n] (25), taban bant sinyali XbaseUgn] [30], orijinal ses sinyaline kiyasla daha yüksek frekanslari (32] ihtiva eden islenmis ses sinyali, yeniden yapilandirilmis ses sinyali (35), büyüklügü düzeltilmis frekans yamasi Y[k,n,i) [40), ses sinyalinin fazi (45] veya ses sinyalinin büyüklügü (47] için genel bir terim olarak kullanildigi belirtilmelidir. Bu nedenle, farkli ses sinyalleri, uygulamanin içerigi nedeniyle karsilikli olarak degistirilebilir. Alternatif düzenlemeler, örnegin, kisa zaman Fourier dönüsümü (STFT) bir Kompleks Modifiye Ayrik Kosinüs Dönüsümü [CMDCT] veya bir Ayrik Fourier Dönüsümü (DFT) alani Için, bulusun zaman-frekans islemesi için kullanilan farkli filtre bankasi veya dönüsüm alanlariyla ilgilidir. Bu nedenle, dönüsüme iliskin spesifik faz özellikleri dikkate alinabilir. Detayli olarak, örnegin kopyalama katsayilari çift sayidan tek bir sayiya veya tam tersi olarak kopyalanir, yani orijinal ses sinyalinin ikinci alt bandi, düzenlemelerde anlatildigi gibi sekizinci alt bant yerine dokuzuncu alt banda kopyalanir, yamanin konjügat kompleksligi de da isleme için kullanilabilir. Ayni durum, bir yama içerisinde faz açilarinin tersine çevrilmis sirasinin üstesinden gelmek için örnegin kopyalama algoritmasini kullanmak yerine yamalarin yansitilmasina uygulanir. Diger düzenlemeler, ek bilgileri kodlayicidan çekebilir ve dekoder alanindaki bazi veya tüm gerekli düzeltme parametrelerini tahmin edebilir. Diger düzenlemeler, örnegin farkli taban bant kisimlarini, farkli bir sayida veya büyüklükteki veya farkli transpozisyon tekniklerini kullanan, örnegin spektral yansitma veya tek yan bant modülasyonu (SSB) gibi baska altta yatan BWE yama semalarini da ihtiva edebilir. Varyasyonlar ayni zamanda BWE sentez sinyali akisina tam olarak faz düzeltmesinin birlikte tasarlandigi yerlerde de mevcut olabilir. Ayrica, pürüzsüzlestirme, örnegin, daha iyi hesaplama verimliligi için degistirilebilen kayan bir Hann penceresi, örnegin birinci sinif llR, kullanilarak gerçeklestirilir. Teknigin bilinen algisal ses kodeklerinin kullanimi, bir ses sinyalinin spektral bilesenlerinin, özellikle bant genisligi uzatma gibi parametrik kodlama tekniklerinin uygulandigi, özellikle düsük bit hizlarinda, faz uyumunu bozar. Bu da ses sinyalinin faz türevinin degismesine yol açar. Bununla birlikte, belirli sinyal tiplerinde faz türevinin korunmasi önemlidir. Sonuç olarak, bu seslerin algisal kalitesi bozulur. Mevcut bulus, faz türevinin bir restorasyonunun algisal olarak faydali olmasi halinde faz türevini ya asiri frekans ("dikey"] veya bu sinyallerin zaman ["yatay"] üzerinde yeniden düzenlemektedir. Ayrica, dikey veya yatay faz türevinin ayarlanmasinin algisal olarak tercih edilip edilmedigi konusunda da karar verilir. Faz türevi düzeltme isleminin kontrol edilmesi için sadece çok kompakt ek bilginin iletilmesi gerekir. Bu nedenle bulus, orta ek bilgi maliyetlerinde algisal ses kodlayicilarinin ses kalitesini gelistirir. Baska bir deyisle, spektral bant replikasyonu (SBR), faz spektrumunda hatalara neden olabilir. Bu hatalarin insan algisi, algisal olarak iki önemli etkiyi ortaya çikarmistir: harmoniklerin frekanslari ve temporal konumlarindaki farkliliklar. Frekans hatalari, sadece temel frekans, bir ERB bandinda sadece bir harmonik oldugundan yeterince yüksek oldugunda algilanabilir gibi görünmektedir. Buna paralel olarak, temporal konum hatalari sadece temel frekans düsük oldugunda ve harmoniklerin fazlari frekansa göre hizalanmissa algilanabilir gibi görünmektedir. Frekans hatalari, zaman içinde faz türevinin (PDT) hesaplanmasiyla tespit edilebilir. PDT degerleri zaman içinde sabit ise, SBR-islenmis ve orijinal sinyaller arasindaki farkliliklar düzeltilmelidir. Bu, harmoniklerin frekanslarini etkili bir sekilde düzeltir ve bu nedenle, uyumsuzluk algisi önlenir. Temporal-konum hatalari, frekans üzerinde içinde faz türevinin (PDF) hesaplanmasiyla tespit edilebilir. PDF degerleri frekans üzerinde sabit ise, SBR-islenmis ve orijinal sinyaller arasindaki farkliliklar düzeltilmelidir. Bu, harmoniklerin temporal konumlarini etkili bir sekilde düzeltir ve böylece, çapraz frekanslarda modülasyon seslerinin algilanmasi önlenir. Mevcut bulusun, bloklarin gerçek veya mantiksal donanim bilesenlerini temsil ettigi blok diyagramlari baglaminda tarif edilmis olmasina ragmen, mevcut bulus, bilgisayar tarafindan uygulanan bir yöntemle de uygulanabilir. Ikinci durumda, bloklar, ilgili yöntem asamalarini temsil etmekte olup bu asamalar, ilgili mantiksal veya fiziksel donanim bloklari tarafindan gerçeklestirilen fonksiyonellikler için geçerlidir. Bazi özelliklerin bir cihaz kapsaminda açiklanmis olmasina ragmen, bir blok veya cihazin bir yöntem asamasina veya bir yöntem asamasinin bir özelligine karsilik geldigi durumlarda, bu özelliklerin ilgili yönteminin bir görünümünü temsil ettigi görülmektedir. Benzer sekilde, bir yöntem asamasi kapsaminda açiklanan özellikler, karsilik gelen bir cihazin ilgili bir blogunun veya ögesinin veya özelliginin bir gösterimini de temsil eder. Yöntem asamalarinin bazilari veya tümü, örnegin, bir mikroislemci, programlanabilir bir bilgisayar veya bir elektronik devre gibi bir donanim cihazi tarafindan [veya kullanilarak) yürütülebilir. Bazi düzenlemelerde bazi bir veya bir çok önemli yöntem asamalarinin biri veya daha fazlasi, bu tür bir cihaz yoluyla yürütülebilir. Bulusa ait iletilmis veya kodlanmis sinyal, bir dijital depolama ortaminda saklanabilir veya bir kablosuz iletim ortami veya Internet gibi bir kablolu iletim ortami gibi bir iletim ortaminda iletilebilir. Belirli uygulama gerekliliklerine dayanarak bulusun düzenlemeleri, donanim veya yazilim olarak uygulanabilir. Uygulama; örnegin, ilgili yöntemin gerçeklestirilebilecegi programlanabilir bir bilgisayar sistemi ile çalisan (veya çalisma kapasitesine sahip) olarak ve elektronik olarak okunabilir kontrol sinyallerine sahip bir flopi disk, bir DVD, bir Blu- Ray, bir CD, bir ROM, bir PROM, bir EPROM, bir EEPROM veya bir FLASH hafiza gibi dijital bir saklama ortami kullanilarak gerçeklestirilebilir. Dolayisiyla dijital saklama ortami, bilgisayar tarafindan okunabilir. Bulusa göre bazi düzenlemeler, burada açiklanan yöntemlerden birinin gerçeklestirilecegi sekilde, programlanabilir bir bilgisayar sistemi ile birlikte çalisabilen, elektronik olarak okunabilir kontrol sinyallerine sahip bir veri tasiyiciyi ihtiva eder. Genellikle, bu bulusun düzenlemeleri, bir bilgisayar programi ürünü ile bir program kodu yoluyla gerçeklestirilebilir; bu program kodu, bilgisayar programi ürünü, bir bilgisayarda çalistirildiginda yöntemlerden birini gerçeklestirmek üzere çalisir. Program kodu, Örnegin, makine tarafindan okunabilir bir tasiyicida saklanabilir. Diger düzenlemeler, makine tarafindan okunabilir bir tasiyicida saklanan, burada açiklanan yöntemlerden birinin gerçeklestirilmesi için bilgisayar programini ihtiva edebilir. Diger bir deyisle, bulus yönteminin bir düzenlemesinde, dolayisiyla, bilgisayar programi bir bilgisayarda çalistiginda, burada açiklanan yöntemlerden birini gerçeklestirmek için bir program koduna sahip olan bir bilgisayar programidir. Bulus yönteminin bir baska düzenlemesi, burada tarif edilen yöntemlerden birini gerçeklestirmek için buna kaydedilen bilgisayar programini ihtiva eden bir veri tasiyicisidir (veya bir dijital depolama ortami veya bir bilgisayar tarafindan okunabilen ortam gibi geçici olmayan bir saklama ortami). Veri tasiyicisi, dijital saklama ortami veya kaydedilen ortam, tipik olarak somut ve /veya geçici degildir. Bulus yönteminin bir baska düzenlemesi, burada tarif edilen yöntemlerden birini gerçeklestirmek için bilgisayar programini temsil eden sinyal dizisi veya bir veri akisidir. Veri akisi veya sinyal dizisi, örnegin, internet yoluyla, örnegin, veri iletisim baglantisi yoluyla aktarilmak üzere konfigüre edilmis olabilir. Baska bir düzenleme, burada tarif edilen yöntemlerden birini gerçeklestirmek üzere yapilandirilmis veya adapte edilmis bir bilgisayar veya programlanabilir bir mantik cihazi gibi bir islem aracini ihtiva etmektedir. Baska bir düzenleme, burada açiklanan yöntemlerden birini gerçeklestirmek için yukarida belirtilen bilgisayar programinin kuruldugu bir bilgisayari ihtiva etmektedir. Bulusa göre bir baska düzenleme, burada tarif edilen yöntemlerden birini bir aliciya aktarmak için bir bilgisayar programini aktarmak (örnegin, elektronik veya optik olarak) için yapilandirilmis bir cihazi veya bir sistemi ihtiva eder. Alici, örnegin, bir bilgisayar, bir mobil cihaz, bir hafiza cihazi veya benzeri olabilir. Cihaz veya sistem, örnegin, bilgisayar programini aliciya aktarmak için bir dosya sunucusunu ihtiva edebilir. Bazi düzenlemelerde, burada açiklanan yöntemlerin fonksiyonlarinin bir kismini veya tamamini gerçeklestirmek için programlanabilir bir mantik cihazi (örnegin, bir alanda programlanabilir geçit dizilimi) kullanilabilir. Bazi düzenlemelerde alanda programlanabilir geçit dizilimi, burada tarif edilen yöntemlerden birini gerçeklestirmek için bir mikroislemci ile birlikte çalisabilir. Genel olarak yöntemler, tercihen herhangi bir donanim cihazi ile gerçeklestirilir. Yukarida tarif edilen düzenlemeler, yalnizca bu bulusun ilkeleri için örnek teskil etmektedir. Düzenlemelerin modifikasyonlari ve varyasyonlarinin ve burada açiklanan detaylarin teknikte uzman olan kisiler tarafindan bilinecegi anlasilmaktadir. Dolayisiyla, sadece buradaki düzenlemelerin açiklamalari ve tanimlari yoluyla sunulan özel ayrintilarla degil, yaklasmakta olan patent taleplerinin kapsamiyla da sinirlandirilmasidir. Referanslar processing and loudspeaker design, John Wiley and Sons Ltd, 2004, Chapters 5, 6. Method with Novel Transient Handling for Audio Codecs, 126th AES Convention, 2009. Perceive Pitch, Timbre, Azimuth and Envelopment of Multiple Sources' Tonmeister subband/time domain approach," IEEE International Conference on Acoustics, Speech and of Signal Processing to Audio and Acoustics, 1997. 1997 IEEE ASSP Workshop on, vol., no., approach in audio coding," in AES 112th Convention, [Munich, Germany), May 2002. Belgium), November 2002. bandwidths and excitation patterns," ]. Acoust. Soc. Am., vol. 74, pp. 750-753, September 1983. pitch perception and frequency modulation discrimination," J. Acoust. Soc. Am., vol. 95, pp. spectral smoothness," IEEE Transactions on Speech and Audio Processing, vol. 11, November 2003. orjinal: büyüklük spektrumu (dB) 0.34 0.16 018 orjinalr faz spektrumu (radyanlar) frekans ('kHZ) orjinal: büyüklük spektrumu (dB) SEKIL 1C orjinal: faz spektrumu (radyanlar) frekans (kHZ) SEKIL 1D 1i _ _ _ _ . . 4IJ///V_ _ _ _ . _ Ãu _ _ nU\I/V _ . _ 1. _ _ 1. _ _ . _ mW\J/IT " o o o o o 0 n n n n 4 3 4' 4' Xiuans(kv n) SEKIL 3A Xîiaiis `7-` 6.4` : : : C i\ SEKIL 3c kopyalama: büyüklük spektrumu (dB) frekans WH?) 0.14 0.16 SEKIL 4A kopyalama: faz spektrumu (radyanlar) 8" T: 9._ i i li." in:: i: SEKIL 4B kopyalama: büyüklük spektrumu (dB) frekans (kHZJ SEKIL 4c kopyalama: faz spektrumu (radyanlar) frekans (kHz) SEKIL 4D /67 Et 98me ?Ev @8% - ..... W ............... W ............. W;;I;:;M ........... 'im .............. W-oq _ 5.›.w;.;.z.w..s.;.ß 11/67 zaman alani frekans alani 60 .I 1 9 ! 3 ! frekans (HZ) 12/67 EGEEN V M N P 1 mcmxmc 13/67 8:; :95& 14/67 zaman alani 0.4 -~ frekans alani 7-; ------- -i frekans (H7) /67 frekans (kHZ) frekans (kHZ) 16/67 orijinal: zaman Içerisindeki faz türevi (radyanlar) SEKIL 12A orijinalj frekans üzerindeki faz türevi (radyanlar) 1 ;11) . .s 2 =-' ii 1522." !:'-'J 'iiim . "i g'n!. .â i'i'i:!rII ii. frekans (kHZ) frekans ('kHZ) 17/67 orijinal: zaman içerisindeki faz türevi (radyaniar) SEKIL 12c orijinal: frekans üzerindeki faz türevi (radyanlar) SEKIL 12D 18/67 kopyalama: zaman içerisindeki faz türevi (radyanlar) frekans (kHZJ SEKIL 13A kopyalama: frekans üzerindeki faz türevi (radyanlar) 19/67 kopyalama: zaman içerisindeki faz türevi (radyanlar) e G'ÄEI. ':'îfiüi'it'î'5:'152':-,I.':=': I: 2 [2.. Ed. m' N maar... IÜ.." W W I I." "-.U F I II frekans (kHz) kopyalama: frekans üzerindeki faz türevi (radyanlar) frekans iikHZ) /67 450 45b' 21/67 2 ..ima 503.958 22/67 2 .:me 2 5.2& &3.0 8% Nmtmuoc F. gsb _gusmammmc . 4 ...N 2.01» :m /m 23/67 A3 :95% 24/67 düzeltilmis: PDT'deki hata (radyanlar) düzeltilmis: zaman içerisindeki faz türevi (radyanlar) /67 2 ..ima (58_ " m m E .5 _ 32296 " mcmxoc " %928 m _226595 " m m: - -4---L 26/67 of 9: of övoxmm " Ecußcm _ n n 953520 ?058% __aga 258: L_ 02 09 mm_ 27/67 03 09 mi o? m.: 9; 2: 2: 28/67 Bir ses sinyali fazi ölçüm hesaplayici ile bir zaman-_çerçevesi için bir ses sinyalinin N2305 Bir hedefiaz' olçum belirleyiei ile hahsedilen zaman @2310 çerçevesi için bir hedef faz olçusunun belirlenmesi Islenmis bir ses sinyali elde etmek için hesaplanan faz ölçümü ve hedef faz ölçümü kullanilarak bir faz düzelticisi ile zaman çerçevesi için ses sinyalinin fazlarinin düzeltilmesi N2315 29/67 Ses sinyaline göre azalan alt bant sayisi ile bir zaman çerçevesinde bir ses sinyalinin kodunun çözülmesi Kodu çözülmüs bir ses sinyalinin bir alt bant setine azaltilmis bir alt bant sayisi ile yama yapma. burada alt bantlar seti. alt bantlarin sayisinin azaltilmasi için bitisik alt bantlarin düzenli bir sayisi ile bir ses sinyalinin elde edilmesi için zaman çerçevesindeki alt bantlarin dahasina bir birinci yamayi olusturur Ses islemcisiyle bir hedef fonksiyona göre birinci yamanin alt bantlari içindeki fazlarin düzeltilmesi /67 Ses sinyaline göre azaltilmis sayida alt banta sahip olan bir çekirdek kodlanmis ses sinyali elde etmek için bir çekirdek kodlayici ile ses sinyalinin çekirdek kodlamasi Ses sinyalinin temel frekans tahmininin elde edilmesi için temel bir frekans analizörü ile ses sinyalinin veya düsük sinyaldeki filtrelenmis versiyonun analiz edilmesi Çekirdek kodlanmis ses sinyaline dahil edilmeyen ses sinyalinin alt bantlarinin parametrelerinin bir parametre ekstraktörü ile ekstre edilmesi Çekirdek kodlanmis ses sinyalini parametreleri ve bir çikis sinyali Olusturucu ile temel frekans tahminini ihtiva eden bir çikis sinyali olusturma 31/67 düzetticisi' 38/67 82296 205.98 208:me 39/67 Çekirdek konumlarinin pik kodlanmis parametre temel frekans konum ses sinyali tahmini tahmini Hedef faz ölçüsü olan bir zaman çerçevesi içinde ses sinyali için bir hedef faz ölçüsünün belirlenmesi N3405 Zaman çerçevesindeki ses sinyalinin fazini ve hedef faz ölçüsünü kullanarak bir faz hatasi hesaplayicisi Zaman çerçevesindeki ses sinyalinin fazinin faz hatasi kullanilarak düzeltilen bir faz ile düzeltilmesi 40/67 Bir çekirdek dekodere sahip bir taban bandin zaman çerçevesinde bir ses sinyalinin kodunu @3505 Kodu çözülmüs taban bandin bir alt bant setini yama yapma. burada alt bantlar seti. taban bandindaki frekanslardan daha yüksek frekanslar ihtiva eden bir ses &3510 sinyali elde etmek için taban bandina bitisik zaman çerçevesindeki alt bantlarda ayrica bir yama olusturur Bir hedef faz ölçüsüne göre bir ses islemcisi ile birinci yamanin alt bantlari @3515 ile fazlarin düzeltilmesi 41/67 Ses sinyaline göre azaltilmis sayida alt bant ihtiva eden bir çekirdek kodlanmis ses sinyali elde etmek için bir çekirdek kodlayici ile ses sinyalinin çekirdek kodlamasi Ses sinyalindeki pik konumlarinin temel bir frekans tahminini elde etmek için ses sinyalinin veya düsük sinyalden filtrelenmis bir ses sinyali versiyonunun temel bir frekans analizörü ile analiz edilmesi Çekirdek kodlanmis ses sinyaline dahil edilmeyen ses sinyalinin alt bantlarinin parametrelerinin bir parametre ekstraktörü ile ekstrakte edilmesi Çekirdek kodlanmis ses sinyalini parametreleri pik konumlarin temel frekansini ve pik konumunu ihtiva eden bir çikis sinyali olusturucusu ile bir çikis sinyali olusturma 42/67 î cmEaN 43/67 kopyalama: faz spektrumunda hata (radyanlar) SEKIL 38A düzeltilmis: frekans üzerindekt faz türevi (radyanlar) -' .- 8' I I f 9 I "I 6 ; I I : i ii . I i i 4: 'i i 1 4-. . i ' I; i : â I 2-: I i 'i I .. s: ~ I 44/67 55› 95680 05%ch 45/67 8 ..tüm › _253_ _ mEamm , .980: I .amwwz _oczwm_ 6.5› ..93. . acma, . www: :932 E Gîx ::wa Nawwîsa ^._._.VV,.__HX men ?55an ;wgmvh 8.5.2..7X tmbcmwmî EÃX FO& 46/67 atm› @EGNDU h coâmbm› &m 65› _859.99_ wEHENso 859%› :2959 w coâwbse 47/67 Bir birinci ve bir ikinci varyasyon modundaki bir varyasyon belirleyicisi ile ses sinyalinin bir fazinin bir varyasyonunu belirleme Bir varyasyon karsilastiricisi ile birinci ve ikinci varyasyon modu kullanilarak belirlenen varyasyonu karsilastirma Karsilastirma sonucuna dayali birinci varyasyon modu veya ikinci varyasyon moduna göre bir düzetlme veri hesaplayicisi ile faz düzeltmesinin hesaplanmasi 48/67 orijinal: PDT'nin standart sapmasi (radyanlar) SEKIL 43A orijinal: PDF'nin standan sapmasi (radyanlar) 49/67 orijinal: PDT'nin standan sapmasi (radyanlar) zaman {SJ SEKIL 43c orijinal: PDF'nin standarî sapmasi (radyaniar) 50/67 orjinal: büyüklük spektrumu (dB) SEKIL 44A orjinal: faz spektrumu (radyanlar) i..»-3:.':U'w" .van-3 51/67 orijinai :zaman içindeki faz türevi (radyanlar) SEKIL 45A orijinal :frekans üzerindeki faz türevi (radyanlar) 52/67 düzeltilmis: zaman içerisindeki faz türevi (radyanlar) frekans (kH7) SEKIL 46A düzettilmis: frekans üzeri ndeki faz türevi (radyanlar) SEKIL 4GB 53/67 A3 :95& (ZHH) SUBXSJJ 54/67 4.? ..Euw frekans (kHz) 55/67 (2H›i) 56/67 3 :yün (ZHH) suean; 57/67 düzemlmis: PDT'dek'i hata (radyanlar) 0.116 038 0?2 SEKIL 50A düzehilmis: zaman içerisindeki faz türevi (radyanlar) 58/67 59/67 1!iéiue 60/67 düzeltilmis: faz spektrumu içindeki hata (radyanlar) SEKIL 52A düzeltilmis: frekans üzerindeki faz türevi (radyanlar) 61/67 m 5 38 Sam Hamstmmmm Ac x:.N Nahöccxg Ac.±w%x Naomc _m:m .V _uccß mcsowav :.xxsrN _ _0N=mcm " ^:_xvN :wztm www 62/67 o ;MN/mn I IDE I I I | Imdmslcgmýgml | I Immoml I J cam _ " V: __g g gm 2.2" n 22 mmm Nm _ 28.21 En.: â_ mgmw: _ xnsm c -ammwc _ C__m....:m_ Ar_ Q: ...IN .szDU ll. .amvwi Nürg_ _ _mim :D EEÜE . _ .::X @EH-@NSUJ I I L _ EKI( _ -.anam A1 .Eâd _m:m _. I I_ .::N v/__oz 4 63/67 USQNoEow 64/67 N _gamammwc 633%& 32230 . 209 m 958: . i: " 65/67 aktivasyon '- X" `(k. n) düzeltme verisi I 45 365 I 285a l düzeltme I yala-V verisi I için veri I hesaplama | 295a I 285b ' . düzeltme | dlkey verisi i için veri hesaplama 2 I 295b I 285c | gecici düzelme I duâeltme verisi I hesaplama I metaveri Olusturucu i metaveri 66/67 Birinci düzeltme verisini kullanarak birinci hedef spektrum üreticisi ile ses sinyalinin bir altbant sinyalinin bir birinci zaman çerçevesi için bir hedef spektrumu üretilmesi Ses sinyalinin birinci zaman çerçevesindeki altbantsinyalinin bir fazinin, bir faz düzeltme algoritmasi ile belirlenen bir birinci faz düzenleyicisi ile düzelemesi` burada düzeltme, ses sinyalinin birinci zaman çerçevesindeki altbant sinyalinin bir ölçüsü ile hedef spektrumu arasindaki bir farkin azaltilmasiyla gerçeklestirilir. Birinci zaman çerçevesi için ses altbant sinyalinin zaman çerçevesinin düzeltilmis bir fazini kullanarak ve ikinci zaman çerçevesindeki altbant sinyalinin ölçüsünü kullanarak birinci zaman çerçevesinden farkli bir ikinci zaman çerçevesi için ses altbant sinyallerini hesaplamak için bir ses altbandi sinyal hesaplayicisi ile hesaplanmasi veya faz düzeltme algoritmasindan farkli bir baska faz düzeltme algoritmasina göre düzeltilmis bir faz hesaplamasinin kullanilmasi 67/67 Bir faz belirleyici ile ses sinyalinin fazinin belirlenmesi Ses sinyalinin belirlenen fazina dayanan bir hesaplayici ile bir ses sinyali için faz düzeltme verilerinin belirlenmesi N59l0 Ses sinyaline göre azaltilmis sayida alt bant ihtiva eden bir çekirdek kodlanmis ses sinyali elde etmek için bir çekirdek kodlayici ile ses sinyalinin çekirdek kodlamasi Çekirdek kodlanmis ses sinyaline dahil edilmeyen ikinci bir alt bant seti için düsük çözünürlüklü bir parametre temsili elde etmek için bir parametre ekstraktörü ile ses sinyalinden parametrelerin ekstrakte edilmesi Parametreler, çekirdek kodlanmis ses sinyali ve faz düzeltme verisini ihtiva eden bir çikis sinyali Olusturucu ile bir çikis sinyali olusturma TR TR TR TR TR TR TR TR TRDescription OF A CALCULATOR AND METHOD FOR DETERMINING PASS CORRECTION DATA FOR AN AUDIO SIGNAL The present invention relates to an audio processor and a method for processing an audio signal, a decoder and a method for decoding an audio signal, and an encoder and a method for encoding an audio signal. Additionally, a calculator and a method for determining phase correction data, an audio signal, and a computer program to perform one of the aforementioned methods are also described. In other words, the present invention demonstrates a phase derivative correction and bandwidth (BWE) extension for bandwidth-extended signals in the QMF domain, based on perceptual significance, or for perceptual audio codecs. Perceptual audio coding observed so far follows several common themes including time/frequency-domain processing and redundancy reduction [1]. Typically, the input signal is analyzed by a bank of analysis filters that convert the time-domain signal into a spectral (time/frequency) representation. The conversion to spectral coefficients enables selective processing of signal components based on frequency components (e.g., different input signals with distinct structures are analyzed in parallel according to their perceptual characteristics, i.e., the masking threshold is calculated, particularly as a function of time and frequency). The time/frequency dependent masking threshold is a target encoding threshold given to the quantization unit by the analysis filter bank, in the form of an absolute energy value or Mask-to-Signal-Ratio [MSR] for each frequency band and encoding time frame. Spectral coefficients are measured to reduce the data rate required to represent the signal. This stage signifies a loss of information and introduces a coding disorder (error, noise) to the signal. To minimize the audible effect of this coding noise, the magnitudes of the quantizer stages are controlled according to the target coding thresholds for each frequency band and frame. Ideally, the encoding noise injected into each frequency band is lower than the encoding [masking] threshold, and therefore, distortion in the subjective voice is imperceptible (elimination of subjectivity). This control of quantization noise according to frequency and time, based on psycho-acoustic requirements, leads to a complex noise shaping effect, and this is what makes the encoder a perceptual audio encoder. Later, modern audio encoders performed entropy encoding (e.g., Huffman coding, arithmetic coding) on quantized spectral data. Entropy coding is a lossless coding method that also saves on bit rate. Finally, all the encoded spectral data and associated additional parameters (such as quantizer settings for each frequency band) are packaged together in a bitstream, which is the final encoded representation intended for file storage or transmission. In bandwidth-extended perceptual audio coding based on filter banks, the fundamental portion of the consumed bit rate is typically spent on quantized spectral coefficients. Therefore, at very low bit rates, there may not be enough bits to represent all the coefficients of precision required to obtain perceptually undistorted reproduction. Therefore, low bit rate requirements impose a limitation on the audio bandwidth achievable with perceptual audio encoding. Bandwidth extension [2] removes this long-term basic limitation. The basic idea behind bandwidth extension is to complement a band-limited perceptual codec with an additional high-frequency processor that transmits and restores missing high-frequency content in a compact, parametric manner. High frequency content can be produced by single sideband modulation of the baseband signal, by copying techniques used in Spectral Band Replication (SBR) [3] or in the application of amplitude shifting techniques such as audio encoder [4]. Digital audio effects, such as time-extension or pitch shift effects, are generally achieved through synchronized overlay-addition. In addition, hybrid systems that implement a SOLA operation in the lower bands have been proposed. Audio encoders and hybrid systems often suffer from a condition called stagnation [8] which can be attributed to the loss of vertical phase matching. Some publications associate the improvements in sound quality of time-dependent algorithms with maintaining vertical phase consistency where it is important [6] [7]. State-of-the-art audio encoders [1] often disregard important phase characteristics of the signal to be encoded, thereby risking the quality of the perceived audio signals. A general proposal for correcting phase matching in perceptual audio encoders is discussed in [9]. However, not all phase mismatch errors can be corrected simultaneously, and not all phase mismatch errors are perceptually significant. For example, in audio bandwidth extension, although not explicitly understood from the latest technology, it is indicated which phase-matching errors require the highest priority correction and which errors may only be partially corrected or completely ignored due to their insignificant perceptual effects. Phase matching is often disrupted over frequency and over time, especially due to the application of audio bandwidth extension [2] [3] [4]. The result is a dull sound that exhibits auditory roughness and may contain tones that are detached from the auditory objects in the original signal and may also be perceived as an auditory object in addition to the original signal. Moreover, the sound can come from a distance, resulting in less "buzzing" and therefore less listener engagement [5]. Therefore, an improved approach is needed. One aim of the present invention is to provide a concept for processing an audio signal. This objective is resolved by the subject matter of independent claims. It describes a coding technique that uses a derived phase. Spectral magnitude and phase calculations are obtained through an estimation process using spectral information from analysis filter banks such as the Modified Discrete Cosine Transform. The prediction process is implemented using impulse responses and evolutionary-like operations. Components of impulse responses are selected for use in evolutionary-like operations, trading off computational complexity for predictive accuracy. Filter structures and mathematical derivatives within analytical expressions for impulse responses are explained. The present finding is based on the observation that the phase of an audio signal can be corrected according to a target phase calculated by an audio processor or a decoder. The target phase can be viewed as a representation of one phase of an unprocessed audio signal. Therefore, the phase of the processed audio signal is adjusted to better match the phase of the unprocessed audio signal. For example, the time-frequency representation of an audio signal can be adjusted so that the phase of the audio signal is set for subsequent time frames in a sub-band, or the phase is set within a time frame for subsequent frequency sub-bands. Therefore, a calculator was found to automatically determine and select the most appropriate correction method. The findings described may be applicable in different configurations or may be commonly applied in a single decoder and/or encoder. The specifications describe an audio processor for processing an audio signal, including an audio signal phase measurement calculator configured to calculate a phase measurement of an audio signal over a time frame. Additionally, the audio signal contains a target phase gauge determiner for determining a target phase gauge for the given time frame, and a phase corrector configured to correct the phases of the audio signal for the time frame using the calculated phase gauge and target phase gauge to obtain a processed audio signal. According to additional regulations, the audio signal may contain multiple subband signals for the time frame. The target phase measurement determiner is configured to determine a first target phase measurement for a first subband signal and a second target phase measurement for a second subband signal. Additionally, the audio signal phase measurement calculator determines a first phase measurement for the first subband signal and a second phase measurement for the second subband signal. The phase corrector is configured to correct the first phase of the first subband signal using the first phase measurement and the first target phase measurement of the audio signal, and to correct the second phase of the second subband signal using the second phase measurement and the second target phase measurement of the audio signal. Therefore, the audio processor may include an audio signal synthesizer for synthesizing a corrected audio signal using the corrected first subband signal and the corrected second subband signal. According to the current design, the audio processor is configured to correct the phase of the audio signal in the horizontal direction, in other words, for correction over time. Therefore, the audio signal can be divided into a series of time frames, where the phase of each time frame can be adjusted relative to the target phase. The target phase could be a representation of an original audio signal, where the audio processor could be a decoder component designed to decode the audio signal, which is an encoded representation of the original audio signal. Optionally, horizontal phase correction can be applied separately to multiple subbands of the audio signal if the audio signal exists in a time-frequency representation. Audio signal phase correction can be achieved by subtracting the deviation of the phase derivative of the audio signal phase and the target phase over time from the audio signal phase. Therefore, the phase correction explained below, since the phase derivative over time is a frequency [with a phase being cb and rîî w ` '], performs a frequency adjustment for each subband of the audio signal. In other words, the difference between each band of the audio signal and a target frequency can be reduced to achieve better quality for the audio signal. To determine the target phase, the target phase determiner is configured to obtain a fundamental frequency estimate for a given time frame, and to calculate a frequency estimate for each subband of multiple time frame subbands using the fundamental frequency estimate for that time frame. Frequency estimation can be converted into a phase derivative over time using the total subband and the number of sampling frequencies of the audio signal. In an additional configuration, the audio processor includes a target phase measure determiner for determining a target phase measurement of the audio signal within a time frame, a phase error calculator for calculating a phase error using the time frame of the target phase measurement and the audio signal phase, and a configured phase corrector for correcting the time frame and audio signal phase using the phase error. According to additional regulations, the audio signal exists in a time-frequency representation, where the audio signal contains many subbands pertaining to the time frame. The target phase measurement determiner specifies a first target phase measurement for a first subband signal and a second target phase measurement for a second subband signal. Additionally, the phase error calculator generates a vector of phase errors, where the first element of the vector refers to the first subband signal and a first deviation of the first target phase measurement phase, and the second element of the vector refers to the second subband signal and a second deviation of the second target phase measurement phase. Additionally, this arrangement's audio processor generates the average corrected phase values of a corrected audio signal using the corrected first subband signal and the corrected second subband signal. Alternatively, multiple subbands are grouped together into a baseband and a set of frequency patches, where the baseband contains one subband of the audio signal and the set of frequency patches contains at least one subband of the baseband at a frequency higher than at least one subband of the baseband. Additional arrangements describe a phase error calculator configured to calculate an average of the elements of a vector of phase errors referencing the first patch of the second frequency patches in order to obtain an average phase error. The phase corrector is configured to correct a phase of the subband signal in the first and subsequent frequency patches of the frequency patch set using a weighted average phase error, where the average phase error is divided by an index of the frequency patch to obtain a modified patch signal. This phase correction ensures adequate quality at the transition frequencies, which are the boundary frequencies between the next two frequency patches. According to another arrangement, the two arrangements described earlier can be combined to obtain a corrected signal containing phase-corrected values that are sufficient in the average and transition frequencies. Therefore, the audio signal phase derivative calculator is configured to calculate the average of phase derivatives over a given baseband frequency. The phase corrector calculates an optimized first frequency patch by adding the average of the phase derivatives over a specified frequency, weighted by the highest subband index in the baseband of the audio signal, to the phase of the subband signal, and then generating an additional modified patch signal. Additionally, the phase corrector can be configured to repeatedly update the combined modified patch signal based on frequency patches by calculating a weighted average of the modified patch signal and an additional modified patch signal to obtain the combined modified patch signal, and by adding the average of the phase derivatives over the frequency, weighted by the subband index of the current subband to the subband signal phase with the highest subband index in the previous frequency patch. To determine the target phase, a target phase meter may include a data stream extractor configured to extract a peak location and the fundamental frequency of the peak locations of the audio signal within a given time frame from a data stream. Alternatively, the target phase meter may include an audio signal analyzer configured to analyze the current time frame in order to calculate a peak position and a fundamental frequency of peak positions within the current time frame. It also includes a target spectrum generator for estimating additional peak locations within the current timeframe using the target phase meter, peak location, and fundamental frequency of peak locations. Specifically, a target spectrum generator might include a peak detector for generating a time-domain pulse sequence, a signal generator for tuning a frequency of the pulse sequence according to the fundamental frequency of the peak locations, a pulse locator for tuning the phase of the pulse sequence according to its location, and a spectrum analyzer for generating a phase spectrum of the tuned pulse sequence, where the phase spectrum of the time-domain signal is the target phase measurement. The described arrangement of the target phase gauge is advantageous for generating a target spectrum for an audio signal with a waveform that includes peaks. The second audio processor's adjustments explain a vertical phase correction. Vertical phase correction adjusts the phase of the audio signal over a time frame across all subbands. Adjusting the phase of the audio signal, applied independently for each subband, results in a waveform of the audio signal that differs from the uncorrected audio signal after synthesis of the audio signal subbands. Therefore, it is possible, for example, to reshape a stained peak or a transition. According to an additional arrangement, a calculator is shown to determine the phase correction data for an audio signal, including a variation determiner for determining a variation of the audio signal phase in first and second variation modes, a variation comparator for comparing a first variation determined using phase variation mode and a second variation determined using second variation mode, and a correction data calculator for calculating the phase correction according to either the first or second variation mode based on the comparison result. An additional arrangement specifies the variation determinant for determining the phase variation of the audio signal over time as a standard deviation measurement of the phase derivative [PDT] over many time frames, or the phase variation of the audio signal over frequency as a standard deviation measurement of the phase derivative (PDF) over many subbands, either in the first variation mode. The variation comparator compares the measurement of the phase derivative over time as the first variation mode, and the measurement of the phase derivative over frequency as the second variation mode for time frames of the audio signal. According to an additional configuration, the variation identifier is set to detect a variation in the phase of the audio signal in a third variation mode, where this third variation mode is a transition detection mode. Therefore, the variation comparator compares the three variation modes, and the correction data calculator calculates the phase correction based on the comparison result, according to the first, second, or third variation mode. The determination rules of the correction data calculator can be described as follows. If a transition is detected, the phase is corrected according to the phase correction for transitions in order to restore the transition shape. Otherwise, if the first variation is smaller than or equal to the second variation, phase correction is applied according to the mode of the first variation; or if the second variation is larger than the first variation, phase correction is applied according to the mode of the second variation. If a transition absence is detected and both the first and second variations exceed a threshold value, none of the phase correction modes are applied. The calculator can be configured to analyze an audio signal, for example, to determine the best phase correction mode during an audio encoding stage, and to calculate the relevant parameters for the determined phase correction mode. During the decoding phase, parameters can be used to obtain a decoded audio signal with better quality compared to audio signals decoded using existing codecs. It should be noted that the calculator autonomously detects the correct correction mode for each time frame of the audio signal. The diagram illustrates a decoder for decoding an audio signal, with a first phase corrector for correcting a phase of the subband signal in the first time frame of the audio signal, determined by a phase correction algorithm, and a first target spectrum generator for generating a target spectrum of a second signal in the first time frame of the audio signal using the first correction data, where the correction is achieved by reducing the difference between the subband signal and the target spectrum measurement in the first time frame of the audio signal. Additionally, the decoder includes an audio subband signal calculator for calculating the audio subband signal for the first time frame using a phase-corrected algorithm, and for calculating the audio subband signal for a second time frame different from the first using a phase correction algorithm, or by measuring the subband signal in the second time frame. According to additional regulations, the decoder includes a second and third spectrum generator equivalent to the first target spectrum generator, and a second and third phase corrector equivalent to the first phase corrector. Therefore, the first phase corrector can perform a horizontal phase correction, the second phase corrector can perform a vertical phase correction, and the third phase corrector can perform phase correction transitions. According to an additional arrangement, the decoder contains a core decoder configured to decode the audio signal within a time frame using a small number of subbands relative to the audio signal. Additionally, the decoder may include a patch generator for patching the subband set of the audio signal decoded with a small number of subbands, where the set of subbands forms the first patch with additional subbands in a time frame adjacent to a small number of subbands to obtain an audio signal with a regular number of subbands. Additionally, the decoder may include a magnitude processor for processing the magnitude values of the audio subband signal over time, and an audio signal synthesizer for synthesizing the magnitudes of the audio subband signals or processed audio subband signals to obtain a synthesized decoded audio signal. This configuration can specify a decoder for a bandwidth extension that includes a phase correction of the decoded audio signal. Accordingly, an encoder for encoding an audio signal, a phase determiner for determining the phase of an audio signal, a calculator for determining phase correction data for an audio signal based on the determined phase of the audio signal, a core encoder configured for core encoding of an audio signal to obtain a core-encoded audio signal with a small number of subbands relative to the audio signal, and a parameter extractor configured for extracting audio signal parameters to obtain a low-resolution parameter for the second set of subbands not included in the core-encoded audio signal, an audio signal generator for generating an output signal containing the core-encoded audio signal and phase correction data, and an encoder for bandwidth extension can be created. All of the previously described arrangements can be seen, for example, in an encoder and/or decoder for bandwidth extension with a phase correction of the decoded audio signal, either in total or in combination. Alternatively, all of the explained regulations can be viewed independently, without being correlated with each other. The arrangements of the present invention are explained in reference to the accompanying drawings, where: Figure 1a shows the magnitude spectrum of a violin signal in a time-frequency representation; Figure 1a shows the phase spectrum related to the magnitude spectrum; Figure 1C shows the phase spectrum related to the magnitude spectrum of a trombone signal in the QMF domain in a time-frequency representation; Figure 4d shows a time-frequency diagram containing time-frequency patterns (e.g., QMF containers, Quadratic Reflection Filter bank containers) defined by a time frame and a subband; Figure 1c shows a sample frequency diagram of an audio signal where the magnitude of the frequency is shown over more than ten different subbands; After acquisition, for example during a decoding process at an intermediate stage, it shows a sample frequency representation of the audio signal; it shows a sample frequency representation of the reconstructed audio signal Z[k,ri); it shows the magnitude spectrum of a violin signal in the QMF domain using direct copying SBR in a time-frequency representation; it shows a phase spectrum related to the magnitude spectrum of Figure 4a; it shows the magnitude spectrum of a trombone signal in the QMF domain using direct copying SBR in a time-frequency representation; it shows the phase spectrum related to the magnitude spectrum of Figure 4c; it shows the time-domain representation of a single QMF cabinet with different phase values; Figure 12a, Figure 12b, Figure 12c, Figure 13a, Figure 13b, Figure 13c, Figure 13d, Figure 14a show a single time-domain and frequency-domain representation with a phase changed by a constant value of 11/4 (top) and Sit/4 (bottom) and a non-zero frequency band; it shows a time-domain and frequency-domain representation of a signal with a non-zero frequency band and randomly changing phase; it shows the effect explained in Figure 6 in a time-frequency representation of four time frames and four frequency sub-bands where only the third sub-band contains a non-zero frequency; It shows a single time-domain and frequency-domain representation of a signal with a phase changed by the constant values Tr/4 (top) and 311/4 [bottom] and a non-zero temporal frame; it shows a time-domain and frequency-domain representation of a signal with a non-zero temporal frame and randomly changing phase; it shows a time-frequency diagram similar to the time-frequency diagram shown in Figure 8, where only the third time frame contains a non-zero frequency; it shows the phase derivative over time of a violin signal in a QMF field in a time-frequency representation; it shows the phase derivative frequency over time in the figure shown in Figure 12a; it shows the phase derivative over time of a trombone signal in a QMF field in a time-frequency representation; it shows the phase derivative over time in Figure 12c; Shows the phase derivative over time of a violin signal in the QMF domain using direct copy SBR in a time-frequency representation; shows the phase derivative over frequency in relation to the phase derivative over time as shown in Figure 13a; shows the phase derivative over time of a trombone signal in the QMF domain using direct copy SBR in a time-frequency representation; shows the phase derivative over frequency in relation to the phase derivative over time as shown in Figure 13c; schematically shows the four phases of a unit circle, e.g., subsequent time frames or frequency subbands as shown in Figure 14b, Figure 18a, Figure 18b, Figure 28a, Figure 28b; shows the phases after SBR processing as shown in Figure 14a and the corrected phases as dashed lines; shows a schematic block diagram of an audio processor (50); According to another arrangement, it shows the audio processor in a schematic block diagram; it shows a smoothed error in the PDT of a violin signal in the QMF domain using direct copying SBR in a time-frequency representation; it shows an error in the PDT of a violin signal in the QMF domain for corrected SBR in a time-frequency representation; it shows the phase derivative over time of the error shown in Figure 18a; it shows a schematic block diagram of a decoder; it shows a schematic block diagram of an encoder; it shows a schematic block diagram of a data stream that could be an audio signal; according to another arrangement, it shows the data stream in Figure 21; it shows a schematic block diagram of a method for processing an audio signal; it shows a schematic block diagram of a method for decoding an audio signal; it shows a schematic block diagram of a method for encoding an audio signal; Shows a schematic block diagram of an audio processor according to another arrangement; shows a schematic block diagram of an audio processor according to a preferred arrangement; shows a schematic block diagram of a phase corrector in an audio processor showing the signal flow in more detail; shows the stages of phase correction from another perspective compared to Figures 26-28a; shows a schematic block diagram of a target phase measurer in an audio processor showing the target phase measurer in more detail; Figure 38a Figure 38b Figure 43a Figure 43b Figure 43c Figure 43d shows a schematic block diagram of a target spectrum generator in an audio processor showing the target spectrum generator in more detail; shows a schematic block diagram of a decoder; shows a schematic block diagram of an encoder; shows a schematic block diagram of a data flow that could be an audio signal; It shows a schematic block diagram of a method for processing an audio signal; it shows a schematic block diagram of a method for decoding an audio signal; it shows a schematic block diagram of a method for decoding an audio signal; it shows an error in the phase spectrum of a trombone signal in the QMF domain using direct copying SBR in a time-frequency representation; it shows an error in the phase spectrum of a trombone signal in the QMF domain using corrected SBR in a time-frequency representation; it shows the phase derivative over frequency related to the error shown in Figure 38a; it shows a schematic block diagram of a calculator; it shows a schematic block diagram of the calculator showing the signal flow in more detail in the variational predictor; it shows a schematic block diagram of the calculator according to another arrangement; it shows a schematic block diagram of a method for determining phase correction data for an audio signal; It shows the standard deviation of the phase derivative of a violin signal in a QMF field over time in a time-frequency representation; it shows the standard deviation of the phase derivative over frequency corresponding to the standard deviation of the phase derivative over time as shown in Figure 4-3a; it shows the standard deviation of the phase derivative of a trombone signal in an OMF field over time in a time-frequency representation; it shows the standard deviation of the phase derivative over frequency related to the standard deviation of the phase derivative over time as shown in Figure 43c; Figures 44a, 45a, 45b, 46a, 46b, 48a, 48b, 50a, 51a, 51b, and 52b show the magnitude of a violin + clap signal in a QMF field in a time-frequency representation; it shows the phase spectrum related to the magnitude spectrum shown in Figure 44a. It shows the phase derivative of a violon + clap signal over time in a QMF field in a time-frequency representation; it shows the phase derivative over frequency relative to the phase derivative over time as shown in Figure 45a; it shows the phase derivative of a violon + clap signal over time in a QMF field using the corrected SBR in a time-frequency representation; it shows the phase derivative over frequency relative to the phase derivative over time as shown in Figure 46a; it shows the frequencies of the QMF bands in a time-frequency representation; it shows the frequencies of the direct copy SBR of the QMF bands in a time-frequency representation compared to the original frequencies; it shows the frequencies of the QMF band using the corrected SBR in a time-frequency representation compared to the original frequencies; It shows the estimated frequencies of the harmonics in comparison to the frequencies of the QMF bands of the original signal in a time-frequency representation; it shows the error in the phase derivative of the violin signal over time in the QMF domain using the corrected SBR with compressed correction data in a time-frequency representation; it shows the phase derivative over time with respect to the error in the phase derivative over time as shown in Figure 50a; it shows the waveform of the trombone signal in a time diagram; it shows the time-domain signal of the trombone signal in Figure 51a containing only the estimated peaks, where the positions of the peaks are obtained using the transmitted metadata; it shows the error in the phase spectrum of the trombone signal in the QMF domain using the corrected SBR with compressed correction data in a time-frequency representation; Figure 52a shows the phase derivative over frequency relating to the error in the phase spectrum; Figure 53 shows a schematic block diagram of a decoder; Figure 54 shows a schematic block diagram according to a preferred arrangement; Figure 55 shows a schematic block diagram of the decoder according to another arrangement; Figure 56 shows a schematic block diagram of an encoder; Figure 57 shows a block diagram of a calculator that can be used in the encoder shown in Figure 56; Figure 58 shows a schematic block diagram of a method for decoding an audio signal; and Figure 59 shows a schematic block diagram of a method for encoding an audio signal. Below, the arrangements of the invention will be explained in more detail. Elements shown in related figures that have the same or similar functionality shall be associated with the same reference symbols. The details of the current invention will be explained in relation to a specific signal processing method. Therefore, Figures 1-14 illustrate the signal processing applied to the audio signal. Although the regulations are described according to this specific signal processing, the present invention is not limited to this process and can be applied to many other processing schemes. Additionally, Figures 15-25 illustrate the configurations of an audio processor that can be used for horizontal phase correction of the audio signal. Figures 26–38 illustrate the configurations of an audio processor that can be used for vertical phase correction of an audio signal. Additionally, Figures 39-52 illustrate the configurations of a calculator for determining phase correction data for an audio signal. The calculator can analyze the audio signal and determine which of the previously mentioned audio processors is being used, or whether any audio processor is suitable for the audio signal, or whether any audio processor should not be applied to the audio signal at all. Figures 53–59 show the configurations of a decoder and an encoder that may contain a second processor and a calculator. 1 Introduction Perceived audio encoding has proliferated as a mainstream digital technology providing audio and multimedia to all sorts of applications using transmission or storage channels with limited capacity. Modern perceptual audio codecs are essential for delivering satisfactory audio quality at increasingly low bit rates. In order, it is necessary to resort to some coding structures that are most tolerated by the majority of listeners. Audio Bandwidth Extension (BWE) is a technique used to artificially widen the frequency range of an audio encoder by spectral shifting or transmitting low-band signal segments in the high band at the expense of presenting certain structures. The finding suggests that some of these structures are associated with a change in phase derivative within an artificially extended high band. One of these structures, the change in phase derivative over frequency, is perceptually important for waveforms such as pulse trains and tonal signals that have a relatively low fundamental frequency. Patterns related to the change in the vertical phase derivative correspond to the local distribution of energy over time and are commonly found in audio signals processed using BWE techniques. Another structure is the change in phase derivative over time [see also "horizontal" phase coherence], which is perceptually significant for harmonic-rich tonal signals of any fundamental frequency. The structures related to a change in the horizontal phase derivative correspond to a local frequency shift in the field and are commonly found in audio signals processed with BWE techniques. The present invention provides tools for readjusting the vertical or horizontal phase derivative of such signals when this feature is compromised by the implementation of the audio bandwidth extension [BWE]. Further avenues are provided to determine whether a restoration of the phase derivative is perceptually beneficial, and whether adjustment of the vertical or horizontal phase derivative is perceptually preferable. Bandwidth extension methods such as spectral band replication (SBR) [9] are frequently used in low-bit-rate codecs. They allow the transmission of only a relatively narrow low-frequency range, along with parametric information about higher bands. Since the bit rate of the parametric information is small, a significant improvement in encoding efficiency can be achieved. Typically, for higher bands, the signal is obtained by copying only the transmitted low-frequency region. The process is usually carried out in the complex-modulated square-reflection-filter-bank (QMF) [10] domain, as also stated below. The copied signal is processed by multiplying its magnitude spectrum with appropriate acquisitions based on the transmitted parameters. The goal is to obtain a magnitude spectrum similar to that of the original signal. Conversely, the phase spectrum of the copied signal is typically not processed at all; instead, the copied phase spectrum is used directly. The perceptual consequences of using directly copied phase spectra are examined below. Based on the observed effects, two measurements are proposed to identify the most significant perceptual effects. Furthermore, based on these, suggestions are also made on how to correct the phase spectrum. Finally, strategies are also suggested to minimize the parameter values transmitted in order to perform the correction. This finding relates to the observation that preserving or restoring the phase derivative can resolve significant structures induced by audio bandwidth elongation (BWE) techniques. For example, typical signals where preserving the phase derivative is important are tones with rich harmonic sound content, such as voiced speech, brass instruments, or string instruments. The present invention also describes a device and a method for phase derivative correction in audio codecs using BWE techniques, along with other tools for deciding whether a restoration of the phase derivative is perceptually beneficial and whether vertical or horizontal phase derivative adjustment is perceptually preferable – for a given signal frame. Quantization of the "importance" of phase derivative correction 2. Vertical ["frequency"] phase derivative correction or horizontal ("time") phase derivative correction prioritization based on the signal. 3. Changing the direction of correction ["frequency" or "time") depending on the signal 4. Special vertical phase derivative correction mode for transition states. Obtaining stable parameters for smooth correction is step 6. Correction parameters in a compact side information transmission format 2 Representation of signals in the QMF domain A time-domain signal x(m], where m is discrete time, can be represented in a time-frequency domain, for example, using a complex-modulated Square Reflection Filter bank (QMF). The resulting signal is X[k,n), where k is the frequency band index and n is the temporal frame index. For visualizations and adjustments, a 64-band QMF and a sampling frequency of 48 kHz (fs) are assumed. Therefore, the bandwidth of each frequency band fBw is 375 Hz and the temporal jump dimension is thop (17 in Figure 2) 1. It is 33 ms. However, the process is not limited to this type of transformation. Alternatively, an MDCT could be used instead. The resulting signal is X[k,n], where k is the frequency band index and n is the temporal frame index. X[k,n] is a complex signal. Therefore, the magnitude can also be represented using the complex number j, Xmag(k,n), and the phase components XPha(k,n] Mk. n) 2 X magÜi'. (1) Sound signals are most often represented using XmagUgn and XPhaUgn) (see Figure 1 for two examples, XmagUgn shows the magnitude spectrum of a violin signal, where Figure 1b shows the corresponding phase spectrum XPha(k,n), both in the QMF domain. Additionally, Figure 1c shows the magnitude spectrum of a trombone signal (Xmag[k,n]), where Figure 1d shows the corresponding phase spectrum in the relevant QMF domain. Based on the magnitude spectra in figures 1a and 1c, the color gradient indicates a magnitude ranging from red = 0 dB to blue = -80 dB. Additionally, the color gradient for the phase spectra in Figures 1b and 1d indicates the phases from red = Tr to blue = -7T. 3. Sound Data The sound data used to show the effect of a defined sound operation is called 'trombone' for a trombone sound signal, 'violon' for a violin sound signal, and 'violon + clap' for a violin signal with a clap in the middle. The basic operation of 4 SBRs is shown in Figure 2, a time-frequency diagram (5) containing time-frequency patterns (10) [e.g. QMF containers, Quadratic Reflection Filter bank containers] defined by a time frame (15] and a subband (20). An audio signal can be converted into such a time-frequency representation using a QMF [Quarterial Reflection Filter Bank] transform, an MDCT [Modified Discrete Cosine Transform], or a DFT (Discrete Fourier Transform). The division of an audio signal into time frames may include overlapping segments of the audio signal. The lower part of Figure 1 shows the overlap of single time frames (15) where a maximum of two time frames overlap simultaneously. Additionally, if more redundancy is needed, the audio signal can also be split using multiple overlap. In the multiple overlap algorithm, three or more time frames can contain the same segment of the audio signal at a given point in time. The duration of the overlap is thop 17, the jump dimension. Assuming a signal X(k, n), the bandwidth-expanded [BWE] signal Z(k, n) is obtained from the input signal X(k, ri) by copying specific portions of the transmitted low-frequency band. An SBR algorithm begins with selecting the frequency range to be transmitted. In this example, bands 1 to 7 are selected: V1 S I( 5 7 : XtmnSUg 71) = X(k, n) . (2) The number of frequency bands to be transmitted depends on the desired bit rate. The shapes and equations are generated using 7 bands, and 5 to 11 bands are used for the corresponding audio data. Thus, the crossover frequencies between the transmitted frequency range and the higher bands are between 1875 and 4125 Hz, respectively. Frequency bands above this area are not transmitted at all; instead, parametric metadata is generated to describe them. Xtmns(k,n] is encoded and transmitted. For simplicity, it is assumed that the encoding does not modify the signal in any way, even though the subsequent operation is not limited to the default state. At the receiver end, the transmitted frequency range is used for the directly corresponding frequencies. For higher bandwidths, the signal can be generated using the transmitted signal. One approach is to copy the transmitted signal to higher frequencies. A slightly modified version is used here. First, a baseband signal is selected. The messages can be a whole signal, but in this arrangement the first frequency band is ignored. This is because, in most cases, it has been observed that the phase spectrum for the first band is irregular. Therefore, the baseband to be copied is defined as follows: Vl i; I( '5 o : &3. 33023 n) r: Xhtmsfk *l* '1, si) g (3) Other bandwidths can also be used for transmitted and baseband signals. Raw signals for higher frequencies are generated using the baseband signal, where Yraw(lgn,i'] is the complex QMF signal for the frequency patch. Raw frequency-patch signals are manipulated according to the transmission of metadata by multiplying acquisitionsg(k,n,i') Y(k,7i, i) = meûc, n, i)g(k,n, i). (5) It should be stated that the acquisitions are in real value and therefore only the magnitude spectrum is affected and thus adapted to a desired target value. Known approaches demonstrate how acquisitions are made. The target phase remains uncorrected in the known approaches in question. The final signal to be reproduced is obtained by combining the transmitted and patched signals to continuously extend the bandwidth to obtain a BWE signal of the desired bandwidth. This configuration assumes 1' = 7. ZÜC› 71) = Xtrans(ki n)i Z(k+6i +l,n)== Y(k,n,i). (6) Figure 3 shows the signals defined in a graphical representation. Figure 3a shows a sample frequency diagram of an audio signal, where the magnitude of the frequency is shown across more than ten different subbands. The first seven subbands reflect the transmitted frequency bands Xtrans(k,n] (25). Baseband XbaseUçn] (30) is derived by selecting the second to seventh subbands. Figure 3a shows the original audio signal, i.e., the audio signal before transmission or encoding. Figure 3b shows a sample frequency representation of an audio signal after acquisition, for example, during a decoding process at an intermediate stage. The frequency spectrum of the audio signal consists of the transmitted frequency bands (25) and the seven baseband signals that are copied to the higher subbands of the frequency band [30], forming an audio signal (32) that contains frequencies higher than the frequencies in the baseband. The full baseband signal is also described as a frequency patch. Figure 3c shows the reconstructed audio signal Z(k,n] (35]. Compared to Figure 3b, the baseband signal patches are multiplied individually by an acquisition factor. Therefore, the frequency spectrum of the audio signal contains the main frequency spectrum [25] and a series of magnitude corrected patches Y(k,ri,1] (40]. This patching method is referred to as direct copy patching. Although the invention is not limited to this type of patching algorithm, the direct copy patch is used as an example to describe the current invention. Another patching algorithm that could be used is, for example, a harmonic patching algorithm. It is assumed that the parametric representation of the high bands is problem-free, i.e., the magnitude spectrum of the reconstructed signal is identical to that of the original signal. ZmagUc, n) : XmagUc, n). (7) However, it should be noted that the phase spectrum is not corrected in any way by the algorithm, therefore it is not correct even if the algorithm works perfectly. Therefore, the regulations show how the Z(k, n) phase spectrum can be adapted and corrected in addition to a target value such that an improvement in perceived quality is achieved. Corrections in the regulations can be performed using three different processing modes: "horizontal," "vertical," and "transition." These modes are explained individually below. ZmâgUçn] and ZPha(k,n] are shown in Figure 4 for violin and trombone signals. Figure 4 shows sample spectra of the reconstructed audio signal (35) using spectral bandwidth replication [SBR] with direct copied patch. The magnitude spectrum of a violin signal, Zmag(k,n], is shown in Figure 4a, where Figure 4b shows the corresponding phase spectrum, ZPhaUgn). Figures 4c and 4d show the corresponding spectra for a trombone signal. All signals are displayed in the QMF domain. As shown in Figure 1, the color gradient indicates a magnitude from red = 0 dB to blue = -80 dB and a phase from red = n to blue = -n. It can be seen that the phase spectra differ from the spectra of the original signals [see Figure 1]. In relation to SBR, it is thought that the violin incorporates dissonance and trombone, containing modulation sounds in the cross-frequency range. However, phase plots are quite random, and it's really difficult to tell if they are different and what the perceptual effects of those differences are. Furthermore, sending correction data for this type of random data is not suitable for coding applications that require low bit rates. Therefore, it is necessary to understand the perceptual effects of the phase spectrum and to find measurements to explain them. These topics are explained in the following sections. In the QMF field, the meaning of the phase spectrum is often thought to be that the frequency band index defines the frequency of a single tonal component, its magnitude defines the level, and the phase defines the "timing". However, the bandwidth of a QMF band is relatively large, and data is sampled at a high rate. Therefore, the interaction between time-frequency tiles (i.e., QMF boxes) actually defines all of these properties. The time-domain representation of the box is given in Figure S. The result is 13. It is a sinc-like function with a length of 3 ms. The exact form of the function is defined by the phase parameter. Considering only one case where a single frequency band is not zero for all temporal frames, i.e., VnEiNszag(3,n)=1. (8) A sinusoid is formed by changing the phase between the temporal frames with a constant value a, i.e., . The resulting signal (i.e., the time-domain signal after the inverse QMF transformation) is shown in Figure 6 with the values 01 = 71/4 (upper) and 37[lower]. It can be seen that the frequency of the sinusoid is affected by the phase change. The frequency domain is shown on the right, while the time domain of the signal is shown on the left side of Figure 6. In contrast, if the phase is chosen randomly, the result is narrowband noise (see Figure 7). Therefore, it can be said that the phase of a QMF box controls the frequency content within the corresponding frequency band. Figure 8 illustrates the effect described in Figure 6 in a time-frequency representation of four time frames and four frequency subbands, where only the third subband contains a non-zero frequency. This results in the frequency domain signal from Figure 6, shown schematically on the right side of Figure 8 and in the time-domain representation of Figure 6, which is also presented schematically at the bottom of Figure 8. In a case where a temporal frame is not zero for all frequency bands, i.e., a constant value α, i.e., , creates a transition by changing the phase between the frequency bands. The resulting signal values are shown in Figure 9. The temporal position of the transition appears to be affected by the phase change. The frequency domain is shown on the right side of Figure 9, and the time domain of the signal is shown on the left side of Figure 9. In contrast, if the phase is chosen randomly, the result is a short burst of noise (see Figure). Thus, it can be said that the phase of a QMF box also controls the temporal positions of the harmonics within the corresponding temporal frame. Figure 11 shows a time-frequency diagram similar to the one shown in Figure 8. In Figure 11, only the third time frame contains values that have a time shift of n/4 from one non-zero sub-band to another. When converted to a frequency domain, the frequency domain signal, schematically shown on the right side of Figure 9, is schematically shown on the right side of Figure 11. A schematic representation of the time domain in the left portion of Figure 9 is shown below Figure 11. This signal results in the conversion of the time-frequency domain into a time-domain signal. Measurements to elucidate the perceptually relevant features of the phase spectrum, as discussed in Chapter 4, are necessary because the phase spectrum itself is quite complex, and it is difficult to directly see what its effect on perception is. Section 5 illustrates two effects that can occur by manipulating the phase spectrum in the QMF domain: [a] a constant phase change over time produces a sinusoid, and the amount of phase change controls the frequency of the sinusoid; and (b) a constant phase change over frequency produces a transient, and the amount of phase change controls the temporal position of the transient flux. The frequency and temporal location of a particular element are clearly important to human perception, therefore identifying these features is potentially useful. Calculation of the phase derivative (PDT) over time. ÃiL'îLi-lç, n) 2 ,17" &En -r 1. ) - Bi› "Çr, n) (12) and can be estimated by calculating the phase derivative (PDF) over the frequency xvdf(k,in) z xwe( + m) r xphaçm). (13) Xpdî[k,n] is related to frequency and XPdf[k,n] is related to the temporal position of a part. Depending on the characteristics of QMF analysis (how the phases of the modulators of adjacent temporal frames match the position of a transition), 7r are added to the temporal frames in the figures for visualization purposes to create smooth curves from the equal transition squares of XPdf(lçn). Next, we examine how these measurements look for our sample signals. Figure 12 shows the derivatives for violin and trombone signals. More specifically, Figure 12a shows a phase derivative of the original, i.e., unprocessed, Violon sound signal in the QMF domain over time XPdt(k,n]. Figure 12b shows a relevant phase derivative over the frequency XpdfUçn). Figures 12c and 12d illustrate the phase derivative over time and the phase derivative over frequency for a trombone signal. The color gradient indicates the phase values from red = n to blue = -Tr. For the violin, the magnitude spectrum is basically approximately 0. It is noise up to 13 seconds [see Figure 1] and consequently its derivatives are also noise. Approximately 0. Starting from 13 seconds, XPdt is observed to begin obtaining relatively stable values over time. This means the signal contains strong, relatively stable sinusoids. The frequencies of these sinusoids are determined by the XPdt values. Conversely, the Xpdf diagram appears relatively noisy, therefore the relevant data for the violin using it cannot be found. XPdti is relatively noisy for trombone. On the contrary, Xpdf appears to have approximately the same value across all frequencies. In practice, this means that all harmonic components are aligned in time, producing a transient signal. The temporal locations of these transitions are determined by the Xvdf values. The same derivatives can also be calculated for SBR-processed signals Z[k,n) (see Figure 13). Figures 13a to 13d relate to Figures 12a to 12d, which were derived using the direct copy SBR algorithm described earlier. Since the phase spectrum is simply copied to higher-frequency patches than the baseband, the PDTs of the frequency patches are the same as those of the baseband. Therefore, for the violin, the PDT is relatively smooth, producing stable sinusoids over time, just as it was in the case of the original signal. However, the Zivdt values differ from the Xpdt of the original signal; this causes the generated sinusoids to have different frequencies than the original signal. The perceptual impact of this is discussed in Chapter 7. In contrast, the PDF of the frequency patches is the same as the baseband, but the PDF of the crossover frequencies is, in practice, random. In the crossover, the PDF is actually calculated between the final and first phase values of the frequency band, i.e., ZpdtÜ, n) : zpimw, n) ~ ZPMÜ, n) : wma, n, i) - www, n, i) (14) These values depend on the actual PDF and the crossover frequency and do not match the values of the original signal. For trombones, the PDF values of the copied signal are separate from the cross-frequencies. Therefore, while most harmonics are in the correct temporal positions, harmonics at cross-frequencies are in random positions in practice. The perceptual impact of this is discussed in Chapter 7. Sounds can be broadly divided into two categories: harmonic and noise-like signals. Noise-like signals, by definition, possess noisy phase characteristics. Therefore, it is assumed that the phase errors caused by SBR are not perceptually significant. Instead, it has focused on harmonic signals. Most musical instruments, and also speech, produce a harmonic structure to the signal, meaning that the tone contains strong sinusoidal components tuned to frequency. Human hearing generally behaves as if it contains a bank of overlapping band-pass filters, called auditory filters. Therefore, it can be assumed that the hearing aid processes complex sounds, so that the partial sounds within the auditory filter are analyzed as a single entity. The width of these filters can be approximated to follow the equivalent rectangular bandwidth (ERB) [11], which can be determined according to the formula below, where fc is the center frequency of the band (in kHz). As discussed in Chapter 4, the crossover frequency between the baseband and SBR patches is approximately 3 kHz. At these frequencies, the ERB is approximately 350 l-lz. The bandwidth of a QMF frequency band is actually relatively close to this, at 375 I-lz. Therefore, it can be assumed that the bandwidth of the QMF frequency bands follows the ERB at the respective frequencies. Two characteristics of an audio track that can go wrong due to an incorrect phase spectrum were observed in Section 6: Frequency and Timing of a Partial Component. Focus on the frequency; the question here is, can the human ear perceive the frequencies of individual harmonics? If it can, the frequency shift caused by the SBR should be corrected, and if not, correction is not necessary. For resolved and unresolved harmonics [12], this can be used to explain this issue. If there is only one harmonic in the ERB, the harmonic is called resolved. It is typically assumed that human auditory processes resolve harmonics individually and are therefore sensitive to their frequencies. In practice, it is perceived that changing the frequency of the resolved harmonics causes dissonance. Conversely, if there are multiple harmonics within the ERB, the harmonics are referred to as insoluble. It is assumed that human senses do not process these harmonics individually; instead, the articulation effects are perceived by the auditory system. The result is a periodic signal, and the length of its duration is determined by the spacing between the harmonics. The perception of height depends on the length of time, which is why it is assumed that humans are sensitive to the sense of hearing. However, if all harmonics within the frequency band in the SBR are shifted by the same amount, the gap between the harmonics and therefore the perceived amplitude remains the same. Therefore, when dealing with unresolved harmonics, human hearing does not perceive frequency shifts as dissonance. Timing errors caused by SBR will be discussed later. This refers to the temporal position or phase timing of a harmonic component. This should not be confused with the phase of a QMF box. The detection of timing-related errors has been examined in detail [13]. It has been observed that most human signals are not sensitive to the timing or phase of harmonic components. However, there are certain signals to which the human sense of hearing is very sensitive to partial timing. Signals include, for example, trombone and trumpet sounds, and speech. With these signals, a given phase angle occurs simultaneously with all harmonics. Neural firing rates of different auditory bands were simulated [13]. It was observed that the neural firing rate produced by these phase-sensitive signals was discontinuous across all auditory bands, and the peaks were aligned over time. Changing the phase of even a single harmonic can alter the peak of neural firing rate with these signals. According to the results of the official listening test, human hearing is sensitive to this [13]. The effects produced are the perception of an added sinusoidal component or narrowband noise at the frequencies where the phase is shifted. In addition, it was found that sensitivity to timing-related effects depends on the fundamental frequency of the harmonic tone [13]. The lower the fundamental frequency, the greater the perceived effects. If the fundamental frequency is above approximately 800 Hz, the auditory system is completely insensitive to timing-related effects. Therefore, if the fundamental frequency is low and the harmonics are phase-aligned with the frequency, changes in the timing, or in other words, the phase, of the harmonics can be perceived by human hearing. If the fundamental frequency is high and/or the phase of the harmonics is not aligned with the frequency, human perception is not sensitive to changes in the timing of the harmonics. 8 Correction methods In Chapter 7, it was noted that humans are sensitive to errors in the frequencies of resolved harmonics. Additionally, humans are sensitive to errors in the temporal positions of harmonics if the fundamental frequency is low and the harmonics are aligned across the frequency. SBR, as discussed in Section 6, can cause both of these errors, so the perceived quality can be improved by correcting them. Methods for doing this are suggested in this section. Figure 14 schematically illustrates the basic idea of correction methods. Figure 14a schematically shows, for example, the four phases (45a-d) of successive time frames or frequency subbands in a unit circle. The phases (45a-d) are equally spaced at 90° intervals. Figure 14b shows the stages after the SBR treatment and the corrected phases as dashed lines. The phase before the process (45a) can be shifted to the phase angle (45a'). The same applies to phases (45b to 45d). It has been shown that the difference between the phases after the process, i.e., the phase derivative, can be distorted after the SBR process. For example, the difference between phases [45a' and 45b') was 90° before the treatment, and 110° after the SBR treatment. Correction methods will change the phase values (45b') to the new phase value [45b") in order to obtain the old phase derivative of 90°. The same correction applies to phases [45d' to 45d"). 8. 1. Correction of Frequency Errors - Horizontal Phase Derivative Correction As discussed in Chapter 7, humans can often only detect an error at a harmonic frequency when there is only one harmonic in an ERB. Additionally, the bandwidth of a QMF frequency band can be used to estimate the ERB at the first crossover. Therefore, frequency correction is only necessary when there is a harmonic in a frequency band. This is quite convenient because Section 5 showed that if there is one harmonic per band, the generated PDT values are stable or change slowly over time and can be corrected using a low bit rate. Figure 15 shows an audio processor (50) for processing an audio signal (55). The audio processor (50) includes an audio signal phase measurement calculator (60), a target phase measurement determiner (65) and a phase corrector (70). The audio signal phase calculator (60) is configured to calculate a phase measurement (80) of the audio signal (55) for a time frame (75). The target phase measurement identifier (65) is configured to determine a target phase measurement (85) for the said time frame (75). In addition, the phase corrector is configured to correct the phases (45) of the audio signal (55) for the time frame (75) using the calculated phase measurement [80) and the target phase measurement (85) to obtain a processed audio signal (90). Optionally, the time frame arrangements of the audio signal (55) are explained in Figure 16. According to one configuration, the target phase measurement determinant (65) is configured to determine a first target phase measurement (85a) and a second target phase measurement (85b) for a second subband signal (95b). Accordingly, the audio signal phase calculator (60) is configured to determine a first phase measurement (80a) for the first sub-band signal (95a) and a second phase measurement (80b) for the second sub-band signal (95b). The phase corrector is configured to correct one phase (45a) of the first sub-band signal (95a) using the first phase measurement (80a) and first target phase measurement (85a) of the audio signal (55), and to correct the second phase [45b] of the second sub-band signal (95b) using the second phase measurement (80b) and second target phase measurement (85b) of the audio signal (55). In addition, the audio processor (50) includes an audio signal synthesizer (100) to synthesize the processed audio signal (90) using the processed first subband signal (95a) and the processed second subband signal (95b). According to other regulations, phase measurement (80) is a phase derivative over time. Therefore, the audio signal phase calculator (60) can calculate the phase derivative of a current time frame (75b) and a future time frame (75c) for each sub-band (95) of multiple sub-bands. Accordingly, the phase corrector (70) can calculate a deviation between the target phase derivative (85) and the phase derivative (80) in time for each of the many sub-bands (95) of the current time frame (75b), where a correction performed by the phase corrector (70) is performed using the deviation. The arrangements show the phase corrector (70) configured to correct the subband signals (95) of the different subbands of the audio signal (55) within the time frame (75), so that the frequencies of the corrected subband signals (95) have frequency values assigned harmonic to the fundamental frequency of the audio signal (55). The fundamental frequency is the lowest frequency in the audio signal (55) or, in other words, the lowest frequency occurring in the first harmonics of the audio signal (55). Furthermore, the phase corrector (70) is configured to smooth the deviation (105) for each sub-band (95) of multiple sub-bands throughout the previous time frame, the current time frame and the next time frame (75a to 75C) and is configured to reduce the rapid changes in the deviation (105) within a sub-band (95). According to other arrangements, smoothness is a weighted average where the phase corrector (70) is weighted by the magnitude of the audio signal (55), configured to calculate the weighted average within the previous, present and future time frame (75a to 75C) according to the previous, present and future time frames (75a to 75C). The arrangements illustrate the previously described process steps in vector form. Therefore, the phase corrector (70) is configured to form a vector of deviations (105), where the first element of the vector relates to a first deviation (105a) for the first subband (95a) of many sub-bands, and the second element of the vector represents a second deviation (105b) for the second subband (95b) of many sub-bands from a previous time frame (75a) to a current time frame (75b). Furthermore, the phase corrector (70) can apply the deviation vector (105) to the phases (45) of the audio signal (55), where the first element of the vector is applied to a first subband (953) of the audio signal (55) and the second element of the vector is applied to a second subband (95b) of the audio signal (55) and the second element of the vector is applied to a phase (45b) of the audio signal (55) and the second subband (95b) of the audio signal (55) From another point of view, it can be stated that all the processing in the sound processor (50) is vector-based, where each vector represents a time frame (75) and each subband of many subbands contains an element of the vector (95). Other arrangements focus on the target phase measurement determinant configured to obtain a basic frequency estimate (85b) for the current time frame (75b), where the target phase measurement determinant (65) is configured to calculate a frequency estimate (85) for each subband of the majority of subbands for the time frame (75) using the basic frequency estimate (85) for the time frame (75). In addition, the target phase measurement determinant (65) can convert the frequency estimates (85) for each subband (95) of multiple subbands into a phase derivative over time using the total subband (95) and the number of sampling frequencies of the audio signal (55). For clarity, it should be noted that the output (85) of the target phase measurement determinant (65) can be either a frequency estimate or a phase derivative over time, depending on the arrangement. Therefore, in one arrangement, the frequency estimate contains the correct format for further processing in the phase corrector (70), where in another arrangement, the frequency estimate is a suitable phase derivative over time. Accordingly, the target phase measurement determinant (65) can also be seen on a vector basis. Therefore, the target phase measurement determinant (65) can generate a frequency estimate vector (85) for each sub-band (95) of a number of sub-bands, where the first element of the vector means a frequency estimate (85a) for a first sub-band (95a) and the second element of the vector means a frequency estimate (85b) for a second sub-band (95b). In addition, the target phase measurement determinant (65) can calculate the frequency (85) using multiples of the fundamental frequency, where the frequency estimate (85) of the current sub-band (95) is the multiples of the fundamental frequency closest to the center of the sub-band (95), or where the frequency estimate (85) of the current sub-band is a boundary frequency of the current sub-band (95) if the multiples of the fundamental frequency are not within the current sub-band (95). In other words, the proposed algorithm for correcting errors in the frequencies of harmonics using the sound processor (50) works as follows. First, the PDT is calculated, and the SBR processes the ZPdt signal. ZPdt(k,ri):ZPha(k,n+1) -ZPha(k,n). The difference between the target PDT and the horizontal correction is calculated as: uvdtikm) = zpdtuç, n) - 231%, 71). (iöa) At this point, it can be assumed that the target PDT is equal to the input PDT of the input signal thdtac, n) = Xpdwm), (16b) Then it will be shown how the target PDT can be obtained with a low bit rate. This value (i.e., the error value (105)) is smoothed over time using a Hann window W(1). The appropriate length is, for example, 41 samples in the QMF domain (corresponding to an interval of 55 ms). Smoothing is weighted by the size of the relevant time-frequency tiles UpdîUr, n) : Circinean'ZDpdt(k,n -l i), izi/'(1)Zmaß(k,n + 5)), ---20 S. 2 'S 20, (17) lll " where circmean {a, b} shows the calculation of the circular mean for weighted angular values with b values. The smoothed error direct copy SBR in PDT for the violin signal in the QMF domain is shown in Figure 17. The color gradient indicates the phase values from red = n to blue = -n. Next, a modulator matrix is created to modify the phase spectrum in order to obtain the desired PDT. . yha L › ndGpm" Au . Figure 18a shows the UNO error in the time phase derivative (PDT) of the violin signal in the QMF domain for corrected SBR. Figure 18b shows the relevant phase derivative over time, where the error in the PDT shown in Figure 18a was derived by comparing the results shown in Figure 18b with the results shown in Figure 1221. The color gradient specifies the phase values from red = n to blue = -Jt. PDT is calculated for the corrected phase spectrum Zch (km) (see Figure 18b). It can be seen that the PDT of the corrected phase spectrum is reminiscent of the PDT of the original signal channel (see Figure 12], and the error is small for time-frequency tiles containing significant energy (see Figure 18a). It is noticeable that the discrepancy in the uncorrected SBR data has been largely resolved. Furthermore, the algorithm does not appear to generate any significant structures. When XpdtUçn is used as a target PDT, it appears likely to transmit PDT-error 9:11 (kin) values for each time-frequency tile. Another approach to calculating the target PDT, which involves reducing bandwidth for transmission, is shown in section 9. In other configurations, the sound processor (50) may be part of a decoder (110]. Therefore, to decode an audio signal (55), the decoder (110) may include an audio processor (50), a core decoder (115) and a patch generator (120). The core decoder (115) is configured to decode an audio signal (25) in a time frame (75) with a reduced number of subbands relative to the audio signal (55). The Patch Generator emits some subbands [95) of the kernel decoded audio signal (25) with a reduced number of subbands, where the subband set creates the first patch (30a) in the subbands in the time frame [75) adjacent to the reduced number of subbands in order to obtain an audio signal (55) with a normal number of subbands. In addition, the sound processor (50) is explained according to the first patch and 16 according to a target function (85). According to the regulations, the audio processor performs phase correction. Depending on the regulations, the audio processor may also include an amplitude correction of the audio signal by a bandwidth extension parameter applicator (125) which applies BWE or SBR parameters to the patch. In addition, the audio processor may contain a synthesizer (100), a bank of synthesis filters, for example to combine the subbands of the audio signal to obtain a smooth audio file. According to other arrangements, the Patch Generator (120) is configured to patch a set of subbands (95) of the audio signal (25), where the set of subbands forms a second patch with subbands of the time frame adjacent to the first patch, and the audio processor (50) is configured to correct the phase (45) within the subbands of the second patch. Alternatively, the Patch Generator (120) is configured to patch the corrected first patch to other subbands of the time frame adjacent to the first patch. In other words, in the first option, the Patch Generator creates an audio signal with a regular number of subbands from the transmitted portion of the audio signal, and then the phases of each patch of the audio signal are corrected. The second option first corrects the initial stages of the first patch according to the transmitted portion of the audio tape, and then reconstructs the audio signal with the previously corrected first patch and the normal number of subbands. Other arrangements show a decoder [110] which includes a data stream extractor (130) configured to extract a fundamental frequency (114) of the current time frame (75) of the audio signal (55) from a data stream (135) where the data stream also contains the audio signal (145) encoded with a reduced number of subbands. Alternatively, the decoder may include a fundamental frequency analyzer (150) configured to analyze the core decoded audio signal (25) to calculate the fundamental frequency (140). In other words, options for deriving the fundamental frequency [140] are, for example, an analysis of the audio signal in the decoder or encoder, in which case the fundamental frequency may be more accurate in the second case, in terms of a higher bit rate, since the value needs to be transmitted from the encoder to the decoder. Figure 20 shows an encoder (155] for encoding an audio signal (55]. The encoder contains a kernel encoder (160) for the kernel encoding the audio signal (55) to obtain a kernel encoded audio signal (145) which encodes the audio signal (55), and a fundamental frequency analyzer (175) to analyze the audio signal (55) or a low-pass filtered version of the audio signal (55) to obtain a fundamental frequency estimate of the audio signal. Additionally, the encoder includes a parameter extractor (165) to extract the parameters of the subbands of the audio signal (55) that are not present in the core encoded audio signal (145), and the encoder includes an output signal generator (170) to generate an output signal (135) containing the core encoded audio signal (145), its parameters and fundamental frequency estimate. In this configuration, the encoder (155] may contain a low-pass filter in front of the kernel decoder (160] and a high-pass filter (185] in front of the parameter extractor (165]. According to other arrangements, the output signal generator (170) is configured to generate the output signal (135] into a series of frames, where each frame contains a kernel-encoded signal (145], parameters (190] and the basic frequency estimate (140) for each n. It contains the frame, where n is 2 2*. In the regulations, the core encoder (160] can be, for example, an AAC (Advanced Audio Coding) encoder. In an alternative arrangement, a smart gap-fill encoder can be used to encode the audio signal (55). Therefore, the core encoder encodes the full bandwidth audio signal, where at least one subband of the audio signal is left out. Therefore, the parameter extractor (165) extracts the parameters for reconfiguring the subbands that are left out of the encoding process of the kernel encoder (160]. Figure 21 shows a schematic representation of the output signal (135]. The output signal is a kernel-encoded audio signal (145] containing a reduced number of subbands compared to the original audio signal (55], a parameter (190) representing the subbands of the audio signal not included in the kernel-encoded audio signal (145], and an audio signal containing a fundamental frequency estimate (140) of an audio signal (135] or the original audio signal (55]. Figure 22 shows an application of the audio signal (135), where the audio signal is formed in a sequence of frames (195), where each frame (195) contains the kernel encoded audio signal (145), its parameters (190), and where only each n. The frame (195) contains the basic frequency estimate (140] where n 2 2. This is, for example, every 20th This could describe the transmission of equally spaced fundamental frequency estimation for the frame, or where the fundamental frequency estimation is transmitted irregularly, for example, on demand or for purpose. Figure 23 shows a method (2300) for processing an audio signal, which includes the steps "calculating a phase measurement of an audio signal for a time frame with an audio signal phase measurement calculator" (2305), "determining a target phase measurement for the mentioned time frame with a target phase measurement determiner" (2310), and "correcting the phases of the audio signal for the time frame with a phase corrector using the calculated phase measurement and the target phase measurement to obtain a processed audio signal" (2315). Figure 24 shows a method for decoding an audio signal (2400) which includes the steps "decoding an audio signal in a time frame with a decreasing number of subbands according to the audio signal" (2405), "patching a decoded audio signal with a set of subbands with a reduced number of subbands, where the set of subbands forms a first patch to obtain an audio signal with a regular number of subbands, adjacent to the subbands, to reduce the number of subbands" (2410), and "correcting the phases within the subbands of the first patch according to a target function with the audio processor" (2415). Figure 25 shows a method (2500) for encoding an audio signal with the following steps: "core encoding of the audio signal with a core encoder to obtain a core encoded audio signal with a reduced number of subbands relative to the audio signal" (2505), "analysis of the audio signal or its filtered version at the low signal with a basic frequency analyzer to obtain the fundamental frequency estimate of the audio signal" (2510), "extraction of the parameters of the subbands of the audio signal not included in the core encoded audio signal with a parameter extractor" (2515), and "generating an output signal containing the core encoded audio signal, its parameters and the fundamental frequency estimate with an output signal generator" (2520). If executed, a computer program is required to perform the methods. 2. Correction of Temporal Errors - Vertical Phase Derivative Correction As discussed earlier, humans can detect an error in the temporal position of a harmonic if the harmonics are synchronized over frequency and the fundamental frequency is low. In Section S, it is shown that harmonics are synchronized in the QMF domain if the phase derivative over frequency is constant. Therefore, it is advantageous to have at least one harmonic in each frequency band. Otherwise, the 'empty' frequency bands will have random phases, which will distort the measurement. Fortunately, humans are only sensitive to the temporal position of harmonics when the fundamental frequency is low (see Chapter 7). Therefore, the phase derivative can be used as a measure for detecting perceptually significant effects arising from the temporal movement of harmonics and excessive frequency. Figure 26 shows a schematic block diagram of an audio processor (50') for processing an audio signal (55), where the audio processor (50') includes a target phase measurement determinant (65'), a phase error calculator (200] and a phase corrector [70']. Target phase measurement determinant (65') determines a target phase measurement [85'] for the audio signal (55) in the time frame [75]. The phase error calculator [200] calculates a phase error (105') using a phase of the audio signal (55) in the time frame [75] and the target phase measurement (85'). The phase corrector (70') corrects the phase of the audio signal [55) in the time frame by using the phase error [105') that makes up the processed audio signal (90'). Figure 27 shows a schematic block diagram of the sound processor (50') according to another preferred arrangement. Therefore, the audio signal [55) contains a large number of subband signals [95) for the time frame [75). Accordingly, the target phase determiner (65') is configured to determine a first target phase measurement (8551') for the first sub-band signal [95a) and a second phase measurement [85b') for the second sub-band signal [95b). The phase error calculator [200] forms a vector of the phase error (105'] where the first element of the vector relates to a first deviation [105a'] of the phase of the first sub-band signal (95] and the first target phase measurement [85a'], where the second element of the vector means a second deviation (105b') of the phase of the second sub-band signal (95b) and the second target phase meter [85b`]. In addition, the audio processor (50') contains an audio signal synthesizer (100) to synthesize the corrected audio signal (90') using the corrected first subband signal (90a') and the corrected second subband signal (90b'). Regarding other arrangements, numerous sub-bands (95) are grouped as a baseband (30) and a set of frequency patches (40), the baseband (30) containing a sub-band (95) of the audio signal (55) and the set of frequency patches [40) containing at least one sub-band (95) of the baseband (30) at a frequency higher than the frequency of at least one sub-band of the baseband. It should be noted that the patch of the audio signal has been previously described according to Figure 3 and therefore cannot be described in detail in this section of the specification. It should be noted that frequency patches (40) can be raw baseband signals copied to higher frequencies multiplied by an acquisition factor where phase correction can be applied. Additionally, according to a preferred arrangement, the acquisition amplification and phase correction can be modified such that the phases of the raw baseband signal are copied to higher frequencies before being multiplied by the acquisition factor. The configuration also shows the calculation of the elements of a vector of phase errors (105') referring to a first patch (40a) of the set of frequency patches (40) to obtain an average phase error (105") using the phase error calculator (200). In addition, an audio signal phase derivative calculator (210) is shown for calculating the average phase derivatives over frequency (215) for the baseband (30). Figure 28a shows a more detailed explanation of the phase corrector (70') in a block diagram. The phase corrector (70') above Figure 28a is configured to correct one phase of the lowerband signals (95) in the first and subsequent frequency patches (40) of the set of frequency patches. In the arrangement in Figure 28a, it is shown that sub-bands (95c and 95d) belong to the patch (40a) and sub-bands (95e and 95f) belong to the frequency patch (40b). The phases are corrected using a weighted average phase error, where the average phase error (105) is weighted according to an index of the frequency patch (40) to obtain a modified patch signal (40'). Another arrangement is shown in the lower part of Figure 28a. In the upper left corner of the phase corrector (70'), the arrangement described above is shown for obtaining the modified patch signal (40`) from the patches [40] and the average phase error (105"). In addition, the phase corrector (70') performs a calculation in an initialization phase containing another modified patch signal (40") with the first frequency patch optimized by adding the average of the phase derivatives (215) over the frequency weighted by an existing subband index to the phase of the subband signal with the highest subband index (30) from the baseband of the audio signal (55). For this initial stage, the switch (220a) is located in the left position. For any further processing steps, the key will be located at the other position, forming a vertically oriented connection. In another configuration, the audio signal phase calculator (210) is configured to calculate an average phase derivative over frequency (215) for a number of subband signals containing frequencies higher than the baseband signal (30) in order to detect transition states in the subband signal (95). It should be noted that the transient correction is similar to the vertical phase correction of the audio processor (50') and the difference is that the frequencies in the baseband (30) do not reflect the higher frequencies of a transient. Therefore, these frequencies must be considered for phase correction of a transient condition. After the initialization phase, the phase correction (70') is configured to update another modified patch signal (40") based on the frequency patches (40) by adding the average of the phase derivatives (215) over the frequency, weighted by the subband index (95) of the current subband, to the phase of the subband signal with the highest subband index in the previous frequency patch. The preferred arrangement is a combination of the previously described arrangements where the phase corrector (70') calculates a weighted average of the modified patch signal (40') and then calculates the modified patch signal (40") to obtain a combined modified patch signal (40"). Therefore, the phase corrector (70') updates a modified patch signal (40) by repeatedly adding the average of the phase derivatives (215) over the frequency, weighted by the subband index (95) of the current subband to the phase of the subband signal with the subband index (40). Combined modified patches (40a"', 40b"', etc.) To obtain this, the key (220b) starts with the combined modified (48") phase for the initial phase after each iteration, and then after the first iteration, it switches to the combined modified patch (40b''). Additionally, the phase corrector (70') can calculate the weighted average of a patch signal (40') and the modified patch signal (40") using the circular average of the modified patch signal (40") in the current frequency patch weighted by the second specific weighting function and the patch signal (40') in the current frequency patch weighted by the first specific weighting function. To ensure interoperability between the audio processor (50) and the audio processor (50'), the phase corrector (70') generates a vector of phase deviations, where the phase deviations are calculated using a combined modified patch signal (40") and the audio signal (55). Figure 28b shows the stages of phase correction from another perspective. For a first time frame (75a), the patch signal (40') is obtained by applying the first phase correction mode on the patches of the audio signal (55). The patch signal (40') is used in the initial stage of the second correction mode to obtain the modified patch signal (40"). A combination of the patch signal (40') and the modified patch signal (40") results in a combined and modified patch signal (40 ). Therefore, the second correction mode is applied to the combined modified patch signal (40") to obtain the modified patch signal (40") for the second time frame (75b). In addition, the first correction mode is applied to the patches of the audio signal (55) in the second time frame (75b) to obtain the patch signal (40'). A combination of a patch signal (40') and a modified patch signal (40") results in a combined and modified patch signal (40"). The processing scheme described for the second time frame is applied to the third time frame (75c) and accordingly to any other time frame of the audio signal (55). Figure 29 shows a detailed block diagram of the target phase measurement designator (65'). According to one arrangement, the target phase measurement determinant (65') includes a peak location (230) and a data stream extractor (130') to extract the fundamental frequency of peak locations (235) from a data stream (135) of the audio signal (55) in a current time frame. Alternatively, the target phase measurement determinant (65') includes an audio signal analyzer (225) to analyze the audio signal (55) in the current time frame and a peak location (230) and a fundamental frequency of the peak point locations (235) in the current time frame. In addition, the target phase measurement determinant includes a target spectrum generator [240] to estimate more peak locations in the current time frame using the fundamental frequency of the peak point (230) and peak locations (235). Figure 30 shows a detailed block diagram of the target spectrum generator (240) described in Figure 29. The target spectrum generator (240) contains a peak generator (245) to generate a pulse catarithm (265) over time. A signal generator (250) adjusts the frequency of the pulse train according to the fundamental frequency of its peak positions (235). In addition, a pulse positioner (255) adjusts the phase of the pulse train (265) according to the peak position (230). In other words, the Signal Generator (250) modifies the pulse sequence (265) into a random frequency pattern such that the frequency of the pulse sequence is equal to the fundamental frequency of the peak positions of the audio signal (55). Furthermore, the pulse locator (255) changes the phase of the pulse train so that it is equal to the peak position of one of the peak points of the pulse train (230). After this, a spectrum analyzer (260) produces a phase spectrum of the tuned pulse train, where the phase spectrum of the time domain signal is the target phase measurement (85'). Figure 31 shows a schematic block diagram of a decoder (110') for decoding an audio signal (55). The Decoder (110) comprises a kernel decoder (115) configured to decode an audio signal (25) in a time frame of the baseband and a Patch Generator (120) to patch a sub-band set (95) of the decoded baseband, where the sub-band set creates a patch to sub-bands in the time frame adjacent to the baseband to obtain an audio signal (3 2) containing frequencies higher than the frequencies in the baseband. In addition, the decoder (110') contains an audio processor (50') to correct the phases of the subbands of the patch according to a target phase measurement. According to another arrangement, the Patch Generator (120) is configured to spread the subbands (95) of the audio signal (25), where one set of subbands forms another patch in the other subbands of the time frame adjacent to the patch, and the audio processor (50') is configured to correct the phases within the subbands of the other patch. Alternatively, the Patch Generator (120) is configured to patch the corrected patch to other subbands of the time frame adjacent to the patch. Another arrangement involves a decoder for decoding an audio signal containing a crossover signal, where the audio processor (50') is configured to correct the crossover phase. Temporary processing, section 8. This is explained by another word in number 4. Therefore, the decoder (110) contains another audio processor (50') to take another phase derivative of a frequency and correct the transitions in the audio signal (32) using the taken phase derivative or frequency. It should also be noted that the decoder in Figure 31 (110') is similar to the decoder in Figure 19 (110), such that the description of the main elements can be mutually exchanged in cases unrelated to the difference in the audio processors (50 and 50']. Figure 32 shows an encoder (155') for encoding an audio signal (55]. The encoder (155') contains a core encoder (160), a fundamental frequency analyzer (175'), a parameter extractor (165] and an output signal assembly (170). The kernel encoder (160) is configured for the kernel encoding the audio signal (55) in order to obtain a kernel encoded audio signal (145) containing a reduced number of subbands compared to the audio signal (55). The basic frequency analyzer (175') analyzes the peak locations (230] in the audio signal (55) or a low-pass filtered version of the audio signal to obtain a basic frequency estimate of the peak locations (235) in the audio signal. Additionally, the parameter extractor (165) extracts the parameters (190] of the subbands of the audio signal (55) that are not included in the core encoded audio signal (145), and the output signal generator (170) contains an output signal (135) which includes one of the peak positions (230), the fundamental frequency of the peak positions (235), the parameters (190] and the core encoded audio signal (145). According to the regulations, the output signal generator (170) is configured to generate an output signal (13 5) into a frame sequence where each frame contains the kernel encoded audio signal (145), parameters [190] where each n. The frame contains the basic frequency estimate of the peak locations (235) and the peak location (230), where n 2 2. Figure 33 shows an arrangement of an audio signal (135) containing a kernel-encoded audio signal (145) with a decreasing number of sub-bands in relation to the original audio signal (55), a parameter (190) representing the sub-bands of the audio signal not included in the kernel-encoded audio signal, a fundamental frequency estimate of peak positions (2 35), and a peak position estimate (230) of the audio signal (55). Alternatively, the audio signal (135) is generated within a sequence of frames, where each frame contains the kernel-encoded audio signal (145], parameters (190], where each n. The frame contains the basic frequency estimate of the peak locations (235] and the peak location (230), where n 2 2. This idea has been previously described according to Figure 22. Figure 34 shows a method for processing an audio signal with an audio processor (3400]. The method (3400) includes the step (3405) "determining a target phase measurement for the audio signal within a time frame with a target phase measurement", the step (3410) "calculating a phase error with a phase error calculator using the phase of the audio signal in the time frame and the target phase measurement" and the step (3415) "correcting the phase of the audio signal in the time frame with a phase corrected using the phase error". Figure 35 shows a method (3500) for processing an audio signal using an audio decoder. Method (3500) includes the steps (3505) "decoding an audio signal in the time frame of a baseband with a core decoder", "patching a set of subbands of the decoded baseband with a patch generator, where the set of subbands creates another patch in the subbands in the time frame adjacent to the baseband to obtain an audio signal containing frequencies higher than the frequencies in the baseband", and "correcting the phases with the subbands of the first patch with an audio processor according to a target phase measurement" (3515). Figure 36 shows a method (3600) for encoding an audio signal with an audio encoder. Method (3600) includes the following steps: "core encoding of the audio signal with a core encoder to obtain a core encoded audio signal containing a reduced number of subbands relative to the audio signal" (3605), "analysis of the audio signal or a filtered version of the audio signal with a basic frequency analyzer to obtain a basic frequency estimate of the peak positions in the audio signal" (3610), "extraction of its parameters with a parameter extractor" (3615), and "generating an output signal with an output signal generator containing its position" (3620). In other words, the proposed algorithm for correcting errors in the temporal positions of harmonics works as described below. First, the difference between the phase spectra of the target signal and the SBR-processed signal (Aw W **7 and 29"] is calculated as DQ'IJiiwi) -: Ü"(k, n; w 1. ; (ip, 12;, (20a) this is shown in Figure 37. Figure 37 shows the phase error (DPhaUçn) in the phase spectrum of the trombone signal in the QMF domain using direct copying SBR. At this point, it can be assumed that the target phase spectrum is equal to that of the input signal (ZâhaUßn) = xphauan) (ZOb). Then it will be shown how the target phase spectrum can be obtained with a low bit rate. Vertical phase derivative correction is performed using two methods, and the final corrected phase spectrum is obtained as a combination of these. Firstly, it can be observed that the error is relatively constant within the frequency band, and when entering a new frequency band, the error jumps to a new value. This makes sense because the phase changes over a constant frequency across all frequencies in the original signal. The error is created in the crossover, and the error remains fixed within the patch. Thus, a single value is sufficient to correct the phase error for the entire frequency patch. Additionally, the phase error of higher frequency patches can be corrected by using the same error value after multiplying by the index number of the frequency patch. Therefore, the circular average of the phase error for the first frequency patch is calculated using Uggs; in. ) -: C'il'(`,mif'anî17pi` 'iz'mijlßs : r: li (21) The phase spectrum can be corrected using this. This raw correction produces an accurate result if the target PDF, e.g. the phase derivative XPdf(k,n] over the frequency, is exactly constant at all frequencies. However, as can be seen in Figure 12, there is often a slight fluctuation in frequency. Thus, by using improved processing in cross-processing to prevent any discontinuities in the generated PDF, better results can be obtained. In other words, this correction produces accurate values for the PDF on average, but there may be small discontinuities in the cross-frequencies of the frequency patches. A corrective measure is applied to prevent this. The final corrected phase spectrum is obtained as a combination of two correction methods (1'c "in" i). The other correction method starts by calculating an average of the baseband PDFs (Xgâgüi) = circmean{1Yg::e(k,n)}. (23) The phase spectrum can be corrected using this criterion assuming that the phase changes with this average value, i.e., W"(k, n, 1) -_- xp" (6, 31) + k -xgýgfçio, ch base ypgaom, i) = Ygýmmm - 1) + k -xW'foû, (24) wherelrpv is the combined patch signal of the two correction methods. This correction provides good quality in cross-references, but may cause a deviation towards higher frequencies in the PDF. To avoid this, two correction methods are combined by calculating their weighted circular average: Kîhawc, 11,1) ::i (tii'Ltinc-an %_l'cîlîkh n, 1', C), WR (ir, 6); (25) where c denotes the correction method [ CV'i or :va and Wfc(k,c) is the weighting function. The resulting phase spectrum l'îîihçxkinil) is not damaged by discontinuities or shifts. The error compared to the original spectrum and the PDF of the corrected phase spectrum is shown in Figure 38. Figure 38a shows Pt 5 in the QMF domain using a phase-corrected SBR signal. The "cv" (fln ?Uhatayi) in the phase spectrum of the trombone signal shows the corresponding phase derivative over frequency, shown in Figure 38b. It can be seen that the error is significantly smaller without correction, and the PDF is not harmed by large discontinuities. There are significant errors in certain temporal frames, but these frames have low energy [see Figure 4], so they have insignificant perceptual effects. Temporal frames with significant energy are relatively well corrected. It is noticeable that the structures of uncorrected SBR are significantly lightened. The corrected phase spectrum 23, (19") is obtained by concentrating on the corrected frequency patches Yâ"(k,n,i)_ To be compatible with the horizontal correction mode, the vertical phase correction can also be shown using a modulator matrix [see Equation 18) thmm) : zfgack, 71) ~ zphauc, n). (2%) 8. 3. Switching between different phase correction methods. Chapter 8. 1 and 8. 2. It has been shown that phase errors induced by SBR can be corrected by applying PDT correction to the violin and PDF correction to the trombone. However, no consideration was given to knowing which of the corrections should be applied to an unknown signal, or how any of them should be applied. This section proposes a method for automatically selecting the correction direction. Correction direction. Therefore, Figure 39 shows the configurations of a calculator for determining phase correction data for an audio signal. The variation determinant (275] determines the variation of one phase [45] of the audio signal [55] in a first and second variation mode. A variation comparator (280) compares a first variation determined using the first variation mode [290a] and a second variation determined using the second variation mode (290b), and a correction data calculator calculates the phase correction data (295) based on either the first variation mode or the second variation mode, based on the result of a comparator. In addition, the variation determinant [275) can be configured to determine the standard deviation of a phase derivative (PDT) over time for many time frames of the audio signal [55) as phase variation [290a) in the first variation mode and to determine the standard deviation of a phase derivative over frequency (PDF) for many subbands of the audio signal (55) as phase variation (29%) in the second variation mode. Therefore, the variation comparator (280) compares the measurement of the phase derivative with respect to time as the first variation (290a) and the measurement of the phase derivative with respect to frequency as a second variation (290b) for the time frames of the audio signal. The regulations show the variation determinant (275) for determining the circular standard deviation of a phase derivative of the audio signal (55) over the previous and more previous time frames as a standard deviation measure and for determining the circular standard deviation of a phase derivative of the audio signal (55) over a current and more than one future frame as a standard deviation measure. In addition, the variation determinant (275) calculates the minimum of both circular standard deviations when determining the first variation (290a). In another arrangement, the variation determinant (275) calculates the variation in the first variation mode (290a) as a combination of a standard deviation measurement for many subbands (95) in a time frame (75) to form an average standard deviation measurement of a frequency. The variation comparator (280) is configured to perform a combination of standard deviation measurements by calculating an energy-weighted average of standard deviation measurements of multiple sub-bands using the magnitude values of the sub-band signal (95) in the current time frame (75) as an energy measurement. In a preferred arrangement, the variation determinant (275) smooths the mean standard deviation measurement while determining the first variation (290a) over time, across many previous and many future time frames. Smoothing is weighted according to energy, calculated using a windowing function and its associated time frames. In addition, the variation determinant (275) is configured to smooth the standard deviation measurement when the second assumption (290b) determines the previous multiplicity and the multiple future time frames (75) over the flow, where the smoothing is weighted according to the energy calculated using a windowing function and the relevant time frames (75). Therefore, the variation comparator (280) compares the first variation (290a) measurement, corrected as the mean standard deviation, determined using the first variation mode, and the second variation (29%) measurement, corrected as the standard deviation, determined using the second variation mode. A preferred arrangement is shown in Figure 40. According to this arrangement, the variation determinant (275) includes two methods for calculating the first and second variation. The first processing patch contains a PDT calculator (300a) for calculating the standard deviation measurement of the phase derivative from the audio signal (55) or from the phase (305a) of the audio signal to time. A circular standard deviation calculator (310a) determines a first circular standard deviation (3153) and a second circular standard deviation (315b) from a measurement of the standard deviation of a phase derivative with respect to time (305a). The first and second circular standard deviations (315a and 315b) are compared with a comparator (320). The comparator (320) calculates the minimum of the two circular standard deviation measures (315a and 315b) (325). A combiner combines the minimum (325) on the frequency to form a measure of average standard deviation [335a). A smoother (340a) smooths the mean standard deviation measure (335a) to create a mean standard deviation measure (345a). The second method involves a PDF calculator (300b) to calculate a phase derivative of the audio signal (55) with respect to frequency (305b) or a phase of the audio signal. A circular standard deviation calculator (310b) produces a standard deviation measurement of the phase derivative (335b) over the frequency (305). The standard deviation measurement [305] is smoothed with a smoother smoother (340b) to create a smoother standard deviation measurement (345b). The smoothed mean standard deviation measurement (345a) and the smoothed standard deviation measurement (345b) are the first and second variations, respectively. The variation comparator (280) compares the first and second variations, and the correction data calculator (285) calculates the phase correction data (295) based on the comparison of the first and second variations. Other arrangements show the calculator (270) which handles three different phase correction modes. Figure 41 shows a representative block diagram. Figure 41 shows the variation determinant (275) which determines a third variation of the phase of the audio signal (55) in a third variation mode where the third variation mode is a transition detection mode [290c). The variation comparator (280) compares the first variation (290a) determined using the first variation mode, the second variation (29%) determined using the second variation mode, and the third variation (290c) determined using the third variation mode. Therefore, the correction data calculator (285) calculates the phase correction data (295) according to the first correction mode, second correction mode or third correction mode based on the comparison result. To calculate the third variation in the third variation mode [290c), the variation comparator [280) is configured to calculate an instantaneous energy estimate of the current time frame and a ratio of the time-average energy estimate of multiple time frames [280], and to compare the ratio with a defined threshold to detect transient states within a time frame (75). The variation comparator [280) should determine an appropriate correction mode based on the three variations. Based on this decision, the correction data calculator [285) calculates the phase correction data [295] in accordance with a third variation mode if a transition is detected, if it is determined that there is no transition, the first variation [290a) determined in the first variation mode is less than or equal to the second variation [290b) in the second variation mode. In addition, phase correction data (295) are calculated according to the second variation mode, if it is determined that there is no transition, the second variation determined in the second variation mode [290b] is less than or equal to the first variation in the first variation mode [290a]. The phase correction data calculator is configured to calculate phase correction data [295] for the third variation (290c) for a current, one or more past and one or more future time frames. Accordingly, the correction data calculator [285) is configured to calculate the phase correction data [295) for the second variation mode (290b) for a current, one or more past and one or more future time frames. In addition, the correction data calculator [285) is configured to calculate the correction data for a horizontal phase correction and the first variation mode [295), the correction data for a vertical phase correction in the second variation mode [295), and the correction data for a temporary correction in the third variation mode (295). Figure 42 shows a method for determining phase correction data from an audio signal [4200]. Method (4200) includes the step of "determining a variation of a phase of the audio signal with a variation determiner in a first and a second variation mode", the step of "comparing the determined variation" [4210] and the step of "calculating the phase correction with a correction data calculator according to the first variation mode or the second variation mode as a result of the comparison" (4215). In other words, the PDT of the violin becomes smoother over time, just as the PDT of the trombone is smooth across the frequency range. Therefore, the standard deviation (STD) of these measurements can be used as a measure of variation to select the appropriate correction method. STD of the phase derivative over time, XStdUUQn) : circstd{XPdt(k,n + l)},-23 S 1 S 0, The STD of the phase derivative over the frequency X"dfm) : circstd{XPdr(k,n)}, 2 S 1( S '13, (28) where circstd{} denotes the calculation of the circular STD (angle values can be potentially weighted with energy to avoid high STD associated with noisy low-energy boxes or the STD calculation can be limited to boxes with sufficient energy]. STDs for violin and trombone are shown in Figures 43a, 43b and Figures 43c, 43d, respectively. Figures 43a and c show the standard deviation of the phase derivative over time in the QMF domain XstdtUgn], where Figures 43b and 43d show the corresponding standard deviation over frequency Xstdf(n] without phase correction. The color gradient specifies values from red = 1 to blue = 0. It can be seen that the STD of PDT is lower for the violin, and similarly, the STD of PDF is lower for the trombone (especially for high-energy time-frequency tiles). The correction method used for each temporal frame is chosen based on which STD is less severe. Therefore, the XsîdtUçn) values should be combined based on frequency. Combining is performed by calculating an energy-weighted average for a predefined frequency range (1. 29) Deviation estimates are smoothed over time to ensure a smooth transition and thus avoid potential structures. Smoothing is performed using a Hann window, weighted by the energy of the temporal frame, where WH is the window function and Xmagûl) is the sum of XmagUçn] over the frequency ZI=1X rragUcl n). A related equation is used for smoothing Xstdf(n]. Phase correction method XssrlistÇÜ-. It is determined by comparison with l and Xsm (lt). The default method is PDT (horizontal) correction and if Ã-îsm (71) For the interval [n - 5, n + 5], a PDF (vertical) correction is applied. If both deviations are large, e.g., larger than a predefined threshold value, none of the correction methods are applied, and bit rate recovery can be performed. 8.4 Transient processing - Phase derivative correction for transitions A violon signal with an added clap in the middle is shown in Figure 44. The magnitude of the violon + clap signal in the QMF domain (XmagUçn) is shown in Figure 44a, and the corresponding phase spectrum (XphaUçn) is shown in Figure 44b. Referring to Figure 44a, the color gradient indicates magnitude values from red = 0 dB to blue = -80 dB. Next, the phase gradient for Figure 44b specifies the phase values from red = n to blue = -n. The phase derivatives over time and over frequency are shown in Figure 45. The phase derivative over time of the violon + clap signal in the QMF domain, Xpdt[k,n), is shown in Figure 45al, and the corresponding phase derivative over frequency, Xpdf[k,n), is shown in Figure 45b. The color gradient specifies the phase values from red = n to blue = -n. It can be seen that the PDT is noisy for the clap, but the PDF is at least somewhat smooth at high frequencies. Therefore, a PDF correction should be applied for the clap to maintain its sharpness. However, the correction method proposed in Section 8.2 may not work properly with this signal, as the violon sound distorts the derivatives at low frequencies. Consequently, the baseband phase spectrum does not reflect high frequencies, and thus phase correction of frequency patches using a single value may not work. Furthermore, due to noisy PDF values at low frequencies, detecting transitions will be difficult depending on the variation of the PDF value [see Section 8.3]. The solution to the problem is simple. First, transitions are detected using a simple energy-based method. The instantaneous energy of the mid/high frequencies is compared with a smoothed energy estimate. The instantaneous energy of the mid/high frequencies is calculated as follows: Z Xiiiagußn), Smoothing is performed using a first-class llR filter x""i5"**(n) z 0.1 ixinüenhrii) + 0.? urggîg'mrn -- 1). (32) If Xmgmnmyxgijîgmh(n) M9' then a transition is determined. The threshold value 9 can be fine-tuned to determine the desired transition amount. For example, 9 = 2 can be used. The determined frame is not directly selected as the temporal frame. Instead, the energy maximum is sought around it. In the current application, the selected range is [n - 2, n + 7]. The temporal frame with the maximum energy in this range is selected as the transition. Theoretically, the vertical correction mode is also for transition cases. It is applicable. However, in transient cases, the baseband phase spectrum often does not reflect high frequencies. This can lead to pre- and post-echos in the processed signal. Therefore, slightly modified processing is recommended for transient cases. The average PDF of the transient at high frequencies is calculated as follows: The phase spectrum for the transient frame is synthesized using this constant phase shift as in equation 24, but "al/Hi" is replaced with ***1*** ` ). The AYm correction is applied to temporal frames within the range [n - 2, 11 + 2] [n: added to the PDF of n-1 and n+1 frames due to the properties of QMF, see Section 6). This correction creates a transition to a suitable position already present, but the shape of the transition is not necessarily as desired, and significant side-ears [i.e., additional transitions] may be present due to significant temporal overlap of the QMF frames. Therefore, the absolute phase angle must also be correct. The absolute angle is corrected by calculating the average error between the synthesized and original phase spectrum. The correction is performed separately for each temporal frame of the transient. The result of the transient correction is shown in Figure 46. The phase derivative X1dt(k,n] of the violon + clap signal in the QMF domain over time is shown using the phase-corrected SBR. Figure 47b shows a corresponding phase derivative over the frequency Xldf[k,n). The color gradient indicates the phase values from red = n to blue = -n. Although the difference is not large compared to direct copying, it is perceptible that the phase-corrected applause has the same sharpness as the original signal. Therefore, temporary correction is necessary only when direct copying is enabled, not in all cases. Conversely, if PDT correction is enabled, it is important to perform temporary processing because PDT correction would otherwise lead to temporary friction. 9 Compressing Correction Data Section 8 showed that phase errors can be corrected, but sufficient bit rate for correction was never considered. This section demonstrates how methods can represent correction data with a low bit rate. 9.1 Compressing PDT Correction Data - Creating the Target Spectrum for Horizontal Correction There are many possible parameters that can be transmitted to enable PDT correction. However, gm Ü" "1 is a potential candidate for low bit rate transmission because it is smoothed over time. First, a sufficient update rate for the parameters is discussed. The value was updated only for each of the N frames and interpolated linearly between them. For good quality, the update interval is approximately 40 ms. For some signals, one bit is slightly less advantageous, and for others, one bit is slightly more advantageous. Formal listening tests will be useful to evaluate an optimal update rate. However, a relatively long update interval seems acceptable. A suitable angular accuracy for "dm U" (71) was also studied. For perceptually good quality, 6 hits (64 possible angular values) are sufficient. In addition, only the transmission of the change in value was tested. Often the values appear to have changed only slightly; for small changes, unequal quantization can be applied to obtain more accuracy. Using this approach, it was found that 4 bits [16 possible angle values] provided good quality. The last thing to consider is adequate spectral accuracy. As can be seen in Figure 17, many frequency bands appear to share roughly the same value. Therefore, one value could possibly be used to represent several frequency bands. In addition, there are multiple harmonics in a frequency band at high frequencies, so less accuracy may be required. However, another, potentially better approach has been found, so these options have not been thoroughly investigated. A proposed, more efficient approach is discussed below. 9.1.1 Using frequency estimation for compressing PDT correction data As discussed in Section S, the phase derivative over time basically means the frequency of the generated sinusoid. The PDTs of the implemented 64-band complex QMF can be converted into frequencies using the following equation: 'v.. î` ;ill (HA *k XIWUC, n) :::i g: iii-;HI + ({ ?izah-band Il) + Ligi: i» %1 mod !N (34) The generated frequencies are in the range fmter(k) = [fc[k) -fBw,fc(k] +fBw] where Q is the center frequency of the frequency band k and fBw is 375 Hz. The result is shown in Figure 47 in a time-frequency representation of the frequencies of the QMF bands for the violin signal. It is seen that the frequencies follow multiples of the fundamental frequency of the tone and the harmonics are separated by frequency at the fundamental frequency. Additionally, it appears that vibration causes frequency modulation. The same graph can be applied to the direct copy Zfret1[k,n) and corrected fm ÜVYZJSBR (see Figure 48a and Figure 48b, respectively). Figure 48a shows a time-frequency representation of the frequencies of the QMF bands of the direct copy SBR signal [ZfretIUgn] when compared with the original signal XWJ(k,n] shown in Figure 47. Figure 48b shows the corrected SBR signal, where the original signal is plotted in blue, and the direct copy SBR and corrected SBR signals are plotted in red. The mismatch of the direct copy SBR is visible, especially at the beginning and end of the sample. In addition, it can be seen that the frequency modulation depth is smaller than the aperture of the original signal. In contrast, in the case of the corrected SBR, the frequencies of the harmonics appear to follow the frequencies of the original signal. Furthermore, the modulation depth appears to be correct. Thus, this graph confirms the validity of the proposed correction method. Therefore, the focus is on the actual compression of the subsequent correction data. Since the frequencies Xfreq(k,n] are placed at the same distance, the frequencies of all frequency bands can be approximated if the distance between the frequencies is estimated and transmitted. In the case of harmonic signals, the intervals must be equal to the fundamental frequency of the tone. Thus, only a single signal value needs to be transmitted to represent all frequency bands. In the case of more irregular signals, more values are needed to describe the harmonic behavior. For example, in the case of a piano tone, the distance between the harmonics increases slightly [14]. For simplicity, it is assumed that the harmonics are placed at the same distance. However, this does not limit the generality of the sound processing described. Therefore, to estimate the frequencies of the harmonics, the fundamental frequency of the tone is estimated. Estimating the fundamental frequency is a widely studied topic (see, for example, an estimation method has been applied. The method basically estimates the intervals of the harmonics). The calculations and the result are combined according to some heuristic methods (how much energy, frequency and how stable over time, etc.). In each case, the result is the fundamental frequency estimate for each temporal frame Xf0(n). In other words, the phase derivative over time is related to the frequency of the corresponding QMF box. Also, structures related to errors in the PDT can often be detected with harmonic signals. Thus, it has been suggested that the target PDT [see Equation 16a] can be estimated using the estimate of the fundamental frequency fo. Estimating a fundamental frequency is a widely studied topic, and many suitable methods exist to obtain reliable estimates of the fundamental frequency. Here, the fundamental frequency Xf°[n], known by the decoder before implementing the BWE and used in the heuristic phase correction within the BWE, is assumed. Therefore, it is advantageous because the coding phase transmits the estimated fundamental frequency XfÜÜi]. In addition, For improved encoding efficiency, the value can be updated and interpolated, for example, every 20th temporal frame (corresponding to an interval of -27 ms). Alternatively, the fundamental frequency can be estimated during the decoding phase and no information should be transmitted. However, better estimates can be expected if the estimation is performed with the original signal during the encoding phase. The decoder operation starts by obtaining a fundamental frequency estimate X1`°[n) for each temporal frame. The frequencies of the harmonics can be obtained by multiplying with an index vector V K 3 ?ll : Xha'lmût, n) : K ~ X" ('71) (35) The result is shown in Figure 49. Figure 49 shows a time frequency of the estimated harmonic frequencies Xharm(K,n) compared to the frequencies of the QMF bands XFFBEI(k,n) of the original signal. It shows the representation. Again, blue represents the original signal and red is the predicted signal. The frequencies of the predicted harmonics match the original signal quite well. These frequencies can be considered as 'allowed' frequencies. If the algorithm produces these frequencies, mismatched structures should be avoided. The transmitted parameter of the algorithm is the fundamental frequency Xf°[n). For improved coding efficiency, the value is updated only every 20th temporal frame [i.e., every 27 ms]. This value appears to provide good perceptual quality based on unofficial listening. However, formal listening tests are useful for evaluating a more suitable value for the update rate. The next step of the algorithm is to find a suitable value for each frequency band. This is done by selecting the Xharlnûçn] value that is closest to the center frequency of each fc(k] band to reflect the band. If the closest If the value is outside the possible values of the frequency band (flnter[k]), the band's boundary value is used. The resulting matrix eh l , } contains a frequency for each time-frequency tile. The final stage of the correction data compression algorithm is to convert the frequency data back to PDT data › ,freti y Xâigiiwn) : 2". (îgsgmßig mod 1), (36) where modÜ denotes the modulo operator. The correct correction algorithm works as shown in Section 8.1. In Equation 16a, Zül 09") is replaced with Xeh (km.) as the target PDT and Equations 17-19 are used as specified in Section 8.1. The result of the correction algorithm with compressed correction data is shown in Figure 50. The Dsm (kul) error in the PDT of the violin signal in the QMF domain of the corrected SBR with compressed correction data indicates the error. Figure SOb shows the corresponding phase derivative in time ich (k' 21)'. Color gradients indicate values from red :ir to blue :-71. The PDT values follow the PDT values of the original signal with similar accuracy to the correction method without data compression [see Figure 18]. Therefore, the compression algorithm is valid. The perceived quality is similar with and without compression of the correction data. The corrections use more accuracy for lower frequencies and lower accuracy for higher frequencies, using a total of 12 bits for each value. The resulting bit rate is approximately 0.5 kbps (without any compression such as entropy coding). This accuracy is equivalent to the perceived quality without quantization. It produces. However, a significantly lower bit rate can be used in many cases that produce sufficiently good perceived quality. One option for low bit rate schemes is to estimate the fundamental frequency during the decoding phase using the transmitted signal. In this case, no value should be transmitted. Another option is to estimate the fundamental frequency using the transmitted signal, compare it to the estimate obtained using the broadband signal, and transmit only the difference. It can be assumed that this difference can be represented using a very low bit rate. 9.2 Compression of PDF Correction Data As discussed in Section 8.2, suitable data for PDF correction is the average phase error of the first frequency patch (java ah). Correction can be performed for all frequency patches with knowledge of this value, so that only one value needs to be transmitted for each temporal frame. However, transmitting even a single value for each temporal frame can result in a very high bit rate. Figure for Trombone In the examination of 12, it can be seen that the PDF has a relatively constant value over frequency, and the same value is present for several temporal frames. The value is constant over time as long as the same transition dominates the energy of the QMF analysis window. When a new transition begins to dominate, a new value is found. The angular change between these PDF values appears to be the same from one transition to another. This is significant because the PDF controls the temporal location of the transition, and if the signal has a fixed fundamental frequency, the interval between transitions should be constant. Therefore, the PDF [or a transition location] can only be transmitted sparsely over time, and the behavior of the PDF between these time moments can be predicted using fundamental frequency information. PDF correction can be performed using this information. This idea is actually equivalent to PDT correction, where the frequencies of the harmonics are assumed to be equally spaced. The same idea is used here as well. However, instead, it is assumed that the temporal positions of the transitions are equally spaced. Below, a reference spectrum for phase correction is created based on the determination of the positions of the peaks in the waveform and using this information. 9.2.1 Use of peak determination for compressing PDF correction data - Creating the target spectrum for vertical correction The positions of the peaks must be estimated to perform a successful PDF correction. One solution would be to calculate the positions of the peaks using the PDF value in the same way as in Equation 34 and estimate the positions of the peaks using the estimated fundamental frequency. However, this approach would require a relatively stable fundamental frequency estimate. The arrangements show a simple, quickly implementable alternative method demonstrating that the proposed compression approach is possible. A time-domain representation of the trombone signal is shown in Figure 51. Figure 51a, in a time-domain representation Figure 51b shows the waveform of the trombone signal. Figure 51b shows the corresponding time-domain signal containing only the estimated peaks, where the positions of the peaks were obtained using the transmitted metadata. The signal in Figure 51b is, for example, the pulse train described in reference to Figure 30 (265). The algorithm starts by analyzing the positions of the peaks in the waveform. This is done by searching for local maxima. For every 27 ms (i.e., for every 20 QMF frames), the position of the peak closest to the center point of the frame is transmitted. Among the transmitted peak positions, it is assumed that the peak points are equally spaced in time. Thus, by knowing the fundamental frequency, the positions of the peaks can be estimated. In this arrangement, the number of detected peaks is transmitted [this requires that all peaks be successfully detected]. It should be noted that estimation based on the fundamental frequency will likely yield more accurate results. The resulting bit rate is approximately 0.5 kbps, involving transmitting the position using 9 bits and transmitting the number of transitions between them using 4 bits. This accuracy was achieved with equally perceived quality without quantization. However, a significantly lower bit rate can be used in many cases producing sufficiently good perceived quality. Using the transmitted metadata, a time-domain signal containing the impulses at the positions of the predicted peak points is generated [see Figure 51 b]. QMF analysis is performed for this signal and the phase spectrum is calculated. The actual PDF correction is performed as suggested in Section 8.2, but Zî'h (lwl) in Equation 20a is modified as follows: XEV 0%' n). Vertical phase The waveform of signals with matching characteristics typically resembles a pulsed cataract. Thus, it has been suggested that the target phase spectrum for orthogonal correction can be estimated by modeling it as the phase spectrum of a pulsed cataract with peaks at corresponding positions and a corresponding fundamental frequency. The position closest to the center of the temporal frame is transmitted, for example, for every 20th temporal frame (corresponding to an interval of -27 ms). The estimated fundamental frequency, transmitted at an equal rate, is used to interpolate the peak positions between the transmitted positions. Alternatively, the fundamental frequency and peak positions can be estimated during the decoding phase, and no information should be transmitted. However, better estimates can be expected if the estimation is performed with the original signal during the encoding phase. The decoder operation starts by obtaining an estimated fundamental frequency Xm[n] for each temporal frame, and additionally, the peak positions in the waveform are estimated. The positions are used to generate a time-domain signal consisting of impulses at these positions. QMF analysis is used to generate the relevant phase spectrum (Xév 0631). This estimated phase spectrum can be used as the target phase spectrum in Equation 20a (time = xgjmrk, n). (37) The proposed method uses the encoding step to transmit only the estimated peak positions and fundamental frequencies, e.g., with an update rate of 27 ms. In addition, it should be noted that errors in the orthogonal phase derivative can only be detected when the fundamental frequency is relatively low. Thus, the fundamental frequency can be transmitted with a relatively low bit rate. The result of the correction algorithm with compressed correction data is shown in Figure 52. Figure 52a shows the error in the phase spectrum of the trombone signal in the QMF domain with corrected SBR with compressed correction data. Figure 52b also shows the corresponding phase derivative on the frequency ZCV (km. Color gradientiii, red values follow with similar accuracy with the correction method without data compression and with and without data compression. 9.3 Compression of Temporal Processing Data Assuming that the transitions are relatively infrequent, it can be considered that this data can be transmitted directly. Arrangements show the transmission of six values per transition: one value for the average PDF and five values for absolute phase angle errors [[n - 2, n + 2]) one value for each temporal frame within the interval). An alternative is to transmit the position of the temporal position (i.e., one value) and the target phase spectrum "cz" as in the case of vertical correction. A similar approach can be used for PDF correction if the bit rate needs to be compressed for transitions (see Section 9.2). A simple The position of a transition can be transmitted as a single value. The target phase spectrum and target PDF can be obtained using this position value, as in Section 9.2. Alternatively, the transition position can be estimated during the decoding phase, and no information should be transmitted. However, better estimates can be expected if the estimation is performed with the original signal during the encoding phase. All of the previously described arrangements can be viewed separately from other arrangements or a combination of applications. Therefore, Figures 53 to 57 show an encoder and a decoder that combine some of the arrangements described earlier. Figure 53 shows a decoder [110"] for decoding an audio signal. The decoder (110") contains a first target spectrum generator (65a), a first phase corrector (70a), and an audio subband signal calculator [350]. The first target spectrum generator (653), also referred to as the target phase measurement determinant, generates a target spectrum (85a"] for a first time frame of a subband signal of the audio signal (32) using the first correction data (295a). The first phase corrector [70a] corrects the subband signal in a phase frame, where the correction is performed by reducing the difference between the subband signal of the audio signal [32] in the first time frame and the target spectrum [85"]. The audio subband signal calculator (350] calculates the audio subband signal [355] for the first time frame using a corrected phase [91a] for the time frame. Alternatively, the audio subband signal calculator (350) calculates the audio subband signal (355) for a second time frame different from the first time frame, using the measurement of the subband signal in the second time frame [85a] or using a corrected phase calculation according to another phase correction algorithm. Figure 53 also shows the audio signal optionally. The other phase correction algorithm can be implemented in a second phase corrector (7%) or a third phase corrector [70c). Other phase correctors will be shown in reference to Figure 54. The audio subband signal calculator [250) calculates the audio subband signal for the first time frame using the corrected phase (91) for the first time frame and the magnitude value (47) of the audio subband signal of the first time frame, where the magnitude value [47] is the magnitude of the audio signal in the first time frame [32] or the magnitude of the audio signal in the first time frame. (35] is the processed size. Figure 54 shows another arrangement of the decoder [110]. Therefore, the decoder generator (65b) produces a target spectrum [85b"] for the second time frame of the subband of the audio signal (32) using the second correction data (295b). Detector [110"] additionally contains a second phase corrector [70b] to correct a phase (45) of the subband of the audio signal (32) in the time frame determined by a second phase correction algorithm, where the correction is performed by reducing the difference between the subband of the audio signal and the time frame measurement of the target spectrum (85b). Thus, decoder (110"] contains a third target spectrum generator (65c), where the third target spectrum generator [6503) produces a target spectrum for the third time frame of the subband of the audio signal (32) using the third correction data [295c]. In addition, decoder (110") corrects a phase [45) of the subband signal and the third phase correction The algorithm includes a third phase corrector [70c] to correct the time frame of the audio signal (32] determined by the algorithm, where the correction is performed by reducing the difference between the subband of the audio signal and the time frame measurement of the target spectrum [85c]. The audio subband signal calculator [350] can calculate the audio subband signal for a third time frame different from the first and second time frames using the phase correction of the third phase corrector. According to one arrangement, the first phase corrector [70a) is configured to store a phase-corrected subband signal (91a) of the previous time frame of the audio signal, or the third phase corrector (70c) is configured to retrieve a phase-corrected subband signal (375) of the previous time frame of the audio signal from the second phase corrector [70b]. In addition, the first phase corrector (70a) corrects the phase (45) of the audio signal (32) of the previous time frame in the current time frame. Other arrangements From another perspective, Figure 54 shows a block diagram of the decoding step in the phase correction algorithm. The input to the process is the BWE signal in the time-frequency domain and metadata. Again, in practical applications, it is preferable to apply the phase-derivative correction of the discovery to use the filter bank together or to transform an existing BWE scheme. In the present example, this is a QMF domain as used in SBR. A first multiplexing decoder (not shown) extracts the phase-derivative correction data from the bit stream of the perceptual codec equipped with BWE enhanced by the discovery correction. A second multiplexing decoder [ 365 ] performs the activation data for different correction modes and more. Then it divides into correction data [295a-c). Based on the activation data, the calculation of the target spectrum for the correct correction mode is activated [others may be left unattended]. Using the target spectrum, phase correction is applied to the acquired BWE signal using the desired correction mode. It should be noted that horizontal correction (7051) is performed sequentially (in other words: depending on the previous signal frames), and other correction modes [70b, c] also take the previous correction matrices. Finally, the corrected signal or the unprocessed one is adjusted to the output based on the activation data. After correcting the phase data, the underlying BWE synthesis continues downwards in the case of SBR synthesis within the current sample. Variations may exist where the phase correction is fully added to the BWE synthesis signal stream. Preferably, phase derivative correction is performed as an initial adjustment on the raw spectral patches with phase ZPhaUgn]. All additional BWE processing or adjustment steps [in SBR this might be noise addition, inverse filtering, lost sinusoids, etc.) are performed downstream on the corrected phases 3:11. Figure 55 shows another configuration of the decoder (110"). According to this configuration, the decoder (110") contains a core decoder (A) according to the previous implementations shown in Figure 54. The core decoder (115) is configured to decode an audio signal (25) in a time frame with a reduced number of subbands relative to the audio signal (55). The Patch Generator (120) applies a patch to the subband set of the core decoded audio signal (25) with a reduced number of subbands, where the subband set forms a first patch for a greater number of subbands in the time frame adjacent to the reduced number of subbands to obtain an audio signal (32) with a reduced number of subbands. The Magnitude Processor (125') processes the magnitude values of the audio subband signal (355) in the time frame. Compared to the previous decoders (110 and According to 110'), the magnitude processor can be the bandwidth extension parameter implementer (125). Several other arrangements can be considered regarding where the signal processor blocks are swapped. For example, the magnitude processor (125') and block (A) can be swapped. Therefore, block (A) works on the reconstructed audio signal (35) with the magnitude values of the patches already corrected. Alternatively, the audio subband signal calculator (350) can be placed after the magnitude processor (1 2 5 ') to create the corrected audio signal (355) from the corrected phase and magnitude corrected part of the audio signal. Also, the decoder (110") can be used to synthesize the phase and magnitude corrected audio signal to obtain the processed audio signal (90) combined with the frequency. It contains the synthesizer (100). Optionally, the audio signal (25) can be transmitted directly to the synthesizer (100) as no magnitude or phase correction is applied to the decoded core signal (25). Any optional processing block applied to one of the previously described decoders (110 or 110') can also be applied to the decoder (110"). Figure 56 shows an encoder (155") for encoding an audio signal (55). The encoder (155") contains a calculator (270), a core encoder (160), a parameter extractor (165) and a phase determiner (380) connected to an output signal generator (170). The phase determiner (380) determines one phase (45) of the audio signal (55), where the calculator (270) determines the phase correction data (295) for the audio signal (55) based on the determined phase (45) of the audio signal (55). The core encoder (160) core encodes the audio signal (55) to obtain a core encoded audio signal (145) containing a reduced number of subbands compared to the audio signal (55). The parameter extractor (165) extracts parameters (190) from the audio signal (55) to obtain a low-resolution parameter representation for a second set of subbands not included in the core encoded audio signal. The output signal (170) generator; The parameters (190) are encoded by the encoder (155) depending on the core encoded audio signal; a low-pass filter (180) before the core encoding of the audio signal (55) and a high-pass filter (185) before the extraction of parameters (190) from the audio signal (55). Alternatively, instead of low-pass or high-pass filtering of the audio signal (55), a gap-filling algorithm can be used, where the core encoder (160) encodes a decreasing number of subbands in the subband set where at least one subband is not core encoded. In addition, the parameter extractor extracts parameters (190) from at least one subband not encoded by the core encoder (160). According to the regulations, the calculator (270) uses a set of correction data to correct the phase correction according to a first variation mode, a second variation mode, or a third variation mode. It contains the calculator (285a-c). In addition, calculator (270) determines the activation data (365) to activate a correction data calculator of the group of correction data calculators (285a-c). The output signal generator (170) generates the output signal containing the activation data, parameters, core encoded audio signal and phase correction data. Figure 57 shows an alternative implementation of a calculator (270) that can be used in the encoder (155") shown in Figure 56. The correction mode calculator (385) contains the variation determiner (275) and the variation comparator (280). The activation data (365) is the result of comparing different variations. Additionally, the activation data (365) can be input to the output signal generator (170) and thus part of the output signal (135), depending on the specified variation, from one of the correction data calculators (185a-c). The arrangements show the calculator (270) containing a metadata generator (390) that creates a metadata stream (295') containing the calculated correction data (295a, 295b or 295c) and the activation data (365). The activation data (365) can be transmitted to the decoder if the correction data itself does not contain sufficient information about the current correction mode. Sufficient information may be multiple data points used to represent the correction data, for example, correction data (295a), correction data (295b), and correction data (295c), which are different for each. Additionally, the output signal generator (170) can also use the activation data (365), so the metadata generator (390) can be ignored. From another perspective, the block diagram in Figure 57 shows the encoding step in the phase correction algorithm. The processing input is the original audio signal [55] and the time-frequency domain. In practical applications, it is preferable to apply the phase-derivative correction of the budding to use the filter bank together or to transform an existing BWE scheme. In the present example, this is a QMF domain used in the SBR. The correction mode calculation block first calculates the correction mode applied for each temporal frame. Based on the activation data (365), the correction data is calculated. [295a-c] calculation is activated in the correct correction mode (others may be idle). Finally, the multiplexer (MUX) combines the activation data and correction data from different correction modes. Another multiplexer (not shown) combines the phase-derivative correction data into the BWE's bitstream and the perceptual encoder enhanced with discovery correction. Figure 58 shows a method (5800) for decoding an audio signal. Method (5800) is "generating a target spectrum for a first time frame of a subband signal of the audio signal with a first target spectrum generator using first correction data" step (5805), where the phase of the subband signal in the first time frame of the audio signal is corrected with a first phase corrector determined by a phase correction algorithm, where the correction is between a measurement of the subband signal in the first time frame of the audio signal and in the target spectrum. Figure 59 illustrates a method for encoding an audio signal (5900). The method includes the stage of "determining the phase correction data for an audio signal with a calculator based on the determined phase of the signal" (5910), the stage of "core encoding of the audio signal with a core encoder to obtain a core encoded audio signal containing a reduced number of subbands compared to the audio signal" (5915), and the stage of "core encoding of the audio signal" (5810) "calculating the audio subband signals for a second time frame different from the first time frame using a corrected phase of the audio subband signal for the first time frame and using the measurement of the subband signal in the second time frame or using a phase calculation corrected according to another phase correction algorithm different from the phase correction algorithm". The steps "extracting parameters from the audio signal with a parameter extractor to obtain a low-resolution parameter representation for a second set of subbands not included in the signal" (5920) and "generating an output signal with a parameter generator containing parameters, the core encoded audio signal and phase correction data" (5800 and 5900) can be implemented in a computer program that can be run on a computer. For an audio signal (55), specifically the original, i.e., unprocessed audio signal, the transmitted portion of the audio signal Xtrans(k,n] (25), the baseband signal XbaseUgn] [30], the processed audio signal containing higher frequencies (32) compared to the original audio signal, the reconstructed audio signal (35), the magnitude-corrected frequency patch Y[k,n,i) [40), the phase of the audio signal (45] or audio It should be noted that the magnitude of the signal is used as a general term for (47). Therefore, different audio signals can be mutually modified due to the context of the application. Alternative arrangements, for example, a Short Time Fourier Transform (STFT) for a Complex Modified Discrete Cosine Transform [CMDCT] or a Discrete Fourier Transform (DFT) domain, relate to the different filter banks or transformation domains used for the time-frequency processing of the invention. Therefore, specific phase characteristics related to the transformation can be considered. In detail, for example, the copying coefficients are copied from an even number to an odd number or vice versa, i.e., the second subband of the original audio signal is copied to the ninth subband instead of the eighth subband as described in the arrangements, the conjugate complexity of the patch can also be used for processing. The same situation applies to overcoming the inverted order of phase angles within a patch. For example, instead of using a copying algorithm, it is applied to the reflection of patches. Other arrangements can pull additional information from the encoder and estimate some or all of the necessary correction parameters in the decoder domain. Other arrangements may also include other underlying BWE patch schemes, for example, using different baseband portions, different numbers or magnitudes, or different transposition techniques, such as spectral reflection or single sideband modulation (SSB). Variations may also exist where the phase correction is co-designed into the BWE synthesis signal stream. Furthermore, smoothing is achieved, for example, using a sliding Hann window, e.g., first-class llR, which can be modified for better computational efficiency. The use of known perceptual audio codecs of the technique allows for the spectral components of an audio signal to be encoded, especially in parametric encoding such as bandwidth extension. The techniques applied, especially at low bit rates, disrupt phase harmony. This leads to a change in the phase derivative of the audio signal. However, preserving the phase derivative is important for certain signal types. As a result, the perceptual quality of these sounds is degraded. The present invention rearranges the phase derivative either over the overfrequency ("vertical") or over the time ("horizontal") of these signals if a restoration of the phase derivative is perceptually beneficial. It also determines whether adjusting the vertical or horizontal phase derivative is perceptually preferable. Only very compact additional information needs to be transmitted to control the phase derivative correction process. Therefore, the invention improves the sound quality of perceptual audio encoders at moderate additional information costs. In other words, spectral band replication (SBR) can cause errors in the phase spectrum. Human perception of these errors has two significant perceptual effects: harmonics Differences in frequencies and temporal positions. Frequency errors appear to be detectable only when the fundamental frequency is sufficiently high, as there is only one harmonic in an ERB band. Similarly, temporal position errors appear to be detectable only when the fundamental frequency is low and the phases of the harmonics are aligned with the frequency. Frequency errors can be detected by calculating the phase derivative (PDT) over time. If the PDT values are constant over time, the differences between the SBR-processed and original signals should be corrected. This effectively corrects the frequencies of the harmonics and thus prevents the perception of mismatch. Temporal-position errors can be detected by calculating the phase derivative (PDF) over the frequency. If the PDF values are constant over the frequency, the differences between the SBR-processed and original signals should be corrected. This effectively corrects the temporal positions of the harmonics and thus prevents the perception of modulation noises at cross-frequencies. Detection is prevented. Although the present invention is described in the context of block diagrams where the blocks represent the real or logical hardware components, the present invention can also be implemented by a computer-implemented method. In the latter case, the blocks represent the respective method stages, and these stages apply to the functionalities performed by the corresponding logical or physical hardware blocks. Where some features are described within the scope of a device, but a block or device corresponds to a method stage or a feature of a method stage, these features are seen to represent an aspect of the corresponding method. Similarly, features described within the scope of a method stage also represent a representation of a corresponding block, element, or feature of a corresponding device. Some or all of the method stages may be represented by a hardware such as a microprocessor, a programmable computer, or an electronic circuit. The invention may be executed by (or using) a device. In some arrangements, one or more of the key procedural steps may be executed via such a device. The transmitted or encoded signal of the invention may be stored on a digital storage medium or transmitted on a transmission medium such as a wireless transmission medium or a wired transmission medium such as the Internet. Depending on specific application requirements, arrangements of the invention may be implemented as hardware or software. The implementation may be carried out, for example, by a programmable computer system that can perform the relevant method and by using a digital storage medium such as a floppy disk, a DVD, a Blu-ray, a CD, a ROM, a PROM, an EPROM, an EEPROM or a flash memory with electronically readable control signals. Therefore, the digital storage medium must be computer-readable. Depending on the invention, some arrangements may be executed by one of the methods described herein. It contains a data carrier with electronically readable control signals that can work with a programmable computer system, as to be implemented. Typically, arrangements of this invention can be implemented by means of a computer program product and a program code; this program code runs to perform one of the methods when the computer program product is executed on a computer. The program code can be stored, for example, in a machine-readable carrier. Other arrangements may contain a computer program stored in a machine-readable carrier for the implementation of one of the methods described herein. In other words, one arrangement of the invention method is a computer program that, therefore, has a program code to perform one of the methods described herein when the computer program is executed on a computer. Another arrangement of the invention method is a data carrier containing the computer program stored in it to perform one of the methods described herein. The data carrier (or a non-volatile storage medium such as a digital storage medium or a computer-readable medium) is typically intangible and/or non-volatile. Another arrangement of the invention method is a sequence of signals or a data stream representing a computer program to perform one of the methods described herein. The data stream or sequence of signals may be configured to be transmitted, for example, via the internet, for example, via a data communication link. Another arrangement involves a processing device, such as a computer or a programmable logic device, configured or adapted to perform one of the methods described herein. Another arrangement involves a computer on which the aforementioned computer program is installed to perform one of the methods described herein. Another arrangement according to the invention involves transmitting a computer program to transmit one of the methods described herein to a receiver. It includes a device or system configured (e.g., electronically or optically) for receiving. The receiver may be, for example, a computer, a mobile device, a memory device, or similar. The device or system may include, for example, a file server to transmit the computer program to the receiver. In some configurations, a programmable logic device (e.g., a field-programmable gate array) may be used to perform some or all of the functions of the methods described herein. In some configurations, the field-programmable gate array may work in conjunction with a microprocessor to perform one of the methods described herein. In general, the methods are implemented preferably with any hardware device. The configurations described above are only examples of the principles of this invention. It is understood that modifications and variations of the configurations and the details described herein will be known to those who are experts in the field. It is thus limited not only by the specific details presented through the disclosures and descriptions of the embodiments herein, but also by the scope of the upcoming patent claims. References processing and loudspeaker design, John Wiley and Sons Ltd, 2004, Chapters 5, 6. Method with Novel Transient Handling for Audio Codecs, 126th AES Convention, 2009. Perceive Pitch, Timbre, Azimuth and Envelopment of Multiple Sources' Tonmeister subband/time domain approach," IEEE International Conference on Acoustics, Speech and of Signal Processing to Audio and Acoustics, 1997. 1997 IEEE ASSP Workshop on, vol., no., approach in audio coding," in AES 112th Convention, [Munich, Germany), May 2002. Belgium), November 2002. Acoust Am., vol. 74, pp. 750-753, September 1983. "pitch perception and frequency modulation discrimination," J. Acoust. Soc. Am., vol. 95, pp. spectral smoothness," IEEE Transactions on Speech and Audio Processing, vol. 11, November 2003. Original: magnitude spectrum (dB) 0.34 0.16 0.18 Originals phase spectrum (radians) frequency (kHz) Original: magnitude spectrum (dB) FIGURE 1C Original: phase spectrum (radians) frequency (kHz) FIGURE 1D 1i _ _ _ _ . . 4IJ///V_ _ _ _ . _ Ãu _ _ nU\I/V _ . _ 1. _ _ 1. _ _ . _ mW\J/IT " o o o o o 0 n n n n 4 3 4' 4' Xiuans(kv n) FIGURE 3A Xîiaiis `7-` 6.4` : : : C i\ FIGURE 3c copy: magnitude spectrum (dB) frequency WH?) 0.14 0.16 FIGURE 4A copy: phase spectrum (radians) 8" T: 9._ i i li." in:: i: FIGURE 4B copy: magnitude spectrum (dB) frequency (kHz) FIGURE 4c copy: phase spectrum (radians) frequency (kHz) FIGURE 4D /67 Et 98me ?Ev @8% - ..... W ............... W ............. W;;I;:;M ........... 'im .............. W-oq _ 5.›.w;.;.z.w..s.;.ß 11/67 time domain frequency domain 60 .I 1 9 ! 3 ! frequency (HZ) 12/67 EGEEN V M N P 1 mcmxmc 13/67 8:; :95& 14/67 time domain 0.4 -~ frequency domain 7-; ------- -i frequency (H7) /67 frequency (kHz) frequency (kHz) 16/67 original: phase derivative in time (radians) FIGURE 12A original phase derivative on frequency (radians) 1 ;11) . .s 2 =-' ii 1522." !:'-'J 'iiim . "i g'n!. .â i'i'i:!rII ii. frequency (kHz) frequency ('kHz) 17/67 original: phase derivative over time (radians) FIGURE 12c original: phase derivative over frequency (radians) FIGURE 12D 18/67 copy: phase derivative over time (radians) frequency (kHz) FIGURE 13A copy: phase derivative over frequency (radians) 19/67 copy: phase derivative over time (radians) e G'ÄEI. ':'îfiüi'it'î'5:'152':-,I.':=': I: 2 [2.. Ed. m' N maar... IÜ.." W W I I." "-.U F I II frequency (kHz) copy: phase derivative over frequency (radians) frequency (kHz) /67 450 45b' 21/67 2 ..ima 503.958 22/67 2 .:me 2 5.2& &3.0 8% Nmtmuoc F. gsb _gusmammmc . 4 ...N 2.01» :m /m 23/67 A3 :95% 24/67 corrected: Error in PDT (radians) corrected: Phase derivative over time (radians) /67 2 ..ima (58_ " m m E .5 _ 32296 " mcmxoc " %928 m _226595 " m m: - -4---L 26/67 of 9: of övoxmm " Ecußcm _ n n 953520 ?058% __aga 258: L_ 02 09 mm_ 27/67 03 09 mi o? m.: 9; 2: 2: 28/67 Decoding an audio signal in a time frame with a phase measurement calculator N2305 Determining a target phase measurement for the mentioned time frame @2310 with a target phase measurement determinant Correcting the phases of the audio signal in a time frame with a phase corrector using the calculated phase measurement and target phase measurement to obtain a processed audio signal N2315 29/67 Decoding an audio signal in a time frame with a decreasing number of subbands relative to the audio signal Patching a decoded audio signal with a set of subbands with a reduced number of subbands, where is the set of subbands. To obtain an audio signal with a regular number of adjacent subbands, a first patch is created with more subbands in the time frame to reduce the number of subbands Correcting the phases within the subbands of the first patch according to a target function with an audio processor /67 According to the audio signal Core encoding of the audio signal with a core encoder to obtain a core encoded audio signal with a reduced number of subbands. Analysis of the audio signal or its filtered version at a low signal with a basic frequency analyzer to obtain a fundamental frequency estimate of the audio signal. Extraction of the parameters of the subbands of the audio signal not included in the core encoded audio signal with a parameter extractor. Generating an output signal containing the core encoded audio signal parameters and the fundamental frequency estimate with an output signal generator. 31/67 corrector' 38/67 82296 205.98 208:me 39/67 Core positions peak encoded parameter fundamental frequency position audio signal estimation Determining a target phase measure for the audio signal within a time frame with a target phase measure N3405 Phase error calculator using the phase of the audio signal in the time frame and the target phase measure Using the phase error of the audio signal in the time frame Correction with a corrected phase 40/67 Decoding an audio signal in the time frame of a baseband with a core decoder @3505 Patching a set of subbands of the decoded baseband, where is the set of subbands. Creates a patch in the subbands in the time frame adjacent to the baseband to obtain an audio signal containing frequencies higher than the frequencies in the baseband &3510 Correction of phases with subbands of the first patch @3515 with an audio processor according to a target phase measure 41/67 Core encoding of the audio signal with a core encoder to obtain a core encoded audio signal containing a reduced number of subbands relative to the audio signal Analysis of the audio signal or a filtered version of the audio signal with a basic frequency analyzer to obtain a basic frequency estimate of the peak positions in the audio signal Parameters of the subbands of the audio signal not included in the core encoded audio signal with a parameter extractor Extraction of the core encoded audio signal and generation of an output signal with an output signal generator containing the parameters of the peak positions, the fundamental frequency and the peak position. 42/67 î cmEaN 43/67 Copying: error in phase spectrum (radians) FIGURE 38A Corrected: phase derivative over frequency (radians) -' .- 8' I I f 9 I "I 6 ; I I : i ii . I i i 4: 'i i 1 4-. . i ' I; i : â I 2-: I i 'i I .. s: ~ I 44/67 55› 95680 05%ch 45/67 8 ..all › _253_ _ mEamm , .980: I .amwwz _oczwm_ 6.5› ..93. . acma, . www: :932 E Gîx ::wa Nawwîsa ^._._.VV,.__HX men ?55an ;wgmvh 8.5.2..7X tmbcmwmî EÃX FO& 46/67 atm› @EGNDU h coâmbm› &m 65› _859.99_ wEHENso 859%› :2959 w coâwbse 47/67 Determining a variation of a phase of an audio signal with a variation comparator in a first and a second variation mode. Comparing the variation determined using a variation comparator with the first and second variation modes. Based on the comparison result, according to the first variation mode or the second variation mode. Calculation of phase correction with a correction data calculator 48/67 original: Standard deviation of PDT (radians) FIGURE 43A original: Standard deviation of PDF (radians) 49/67 original: Standard deviation of PDT (radians) time {SJ FIGURE 43c original: Standard deviation of PDF (radians) 50/67 original: Magnitude spectrum (dB) FIGURE 44A original: Phase spectrum (radians) i..»-3:.':U'w" .van-3 51/67 original: Phase derivative over time (radians) FIGURE 45A original: Phase derivative over frequency (radians) 52/67 corrected: Phase derivative over time (radians) frequency (kH7) FIGURE 46A corrected: Frequency Phase derivative (radian) on it FIGURE 4GB 53/67 A3 :95& (ZHH) SUBXSJJ 54/67 4.? ..Euw frequency (kHz) 55/67 (2H›i) 56/67 3 :wool (ZHH) suean; 57/67 corrected: Error in PDT (radians) 0.116 038 0?2 FIGURE 50A corrected: Phase derivative over time (radians) 58/67 59/67 1!iéiue 60/67 corrected: Error in phase spectrum (radians) FIGURE 52A corrected: Phase derivative over frequency (radians) 61/67 m 5 38 Sam Hamstmmmm Ac x:.N Nahöccxg Ac.±w%x Naomc _m:m .V _uccß mcsowav :.xxsrN _ _0N=mcm " ^:_xvN :wztm www 62/67 o ;MN/mn I IDE I I I | Imdmslcgmýgml | I Immoml I J cam _ " V: __g g gm 2.2" n 22 mmm Nm _ 28.21 En.: â_ mgmw: _ xnsm c -ammwc _ C__m....:m_ Ar_ Q: ...IN .szDU ll. .amvwi Nürg_ _ _mim :D EEÜE . _ .::X @EH-@NSUJ I I L _ EKI( _ -.anam A1 .Eâd _m:m _. I I_ .::N v/__oz 4 63/67 USQNoEow 64/67 N _gamammwc 633%& 32230 . 209 m 958: . i: " 65/67 activation '- X" `(k. n) correction data I 45 365 I 285a | Correction | Calculation of data for DL-V data | 295a | 285b | Correction | Calculation of data for DL-V data | 2 | 295b | 285c | Temporary Correction | Calculation of correction data | Metadata Generator | Metadata 66/67 Generating a target spectrum for a first time frame of a subband signal of the audio signal using the first target spectrum generator with the first correction data. Correction of a phase of the subband signal in the first time frame of the audio signal with a first phase regulator determined by a phase correction algorithm. Here, the correction is performed by reducing the difference between a measure of the subband signal in the first time frame of the audio signal and the target spectrum. Using a corrected phase of the audio subband signal for the first time frame and using the measure of the subband signal in the second time frame, generating audio subband signals for a second time frame different from the first time frame. Calculation using an audio subband signal calculator or using a phase calculation corrected according to a phase correction algorithm different from the phase correction algorithm 67/67 Determining the phase of an audio signal with a phase determiner Determining the phase correction data for an audio signal with a calculator based on the determined phase of the audio signal N59l0 Core encoding of an audio signal with a core encoder to obtain a core encoded audio signal containing a reduced number of subbands relative to the audio signal Extraction of parameters from the audio signal with a parameter extractor to obtain a low-resolution parameter representation for a second set of subbands not included in the core encoded audio signal Generating an output signal with a generator containing the parameters, core encoded audio signal and phase correction data TR TR TR TR TR TR TR TR TR TR