CN117877497A

CN117877497A - Time-domain audio steganography method based on generalized audio intrinsic energy and micro-amplitude suppression modification

Info

Publication number: CN117877497A
Application number: CN202311771469.0A
Authority: CN
Inventors: 苏文康
Original assignee: Guangzhou University
Current assignee: Guangzhou University
Priority date: 2023-12-21
Filing date: 2023-12-21
Publication date: 2024-04-12
Anticipated expiration: 2043-12-21
Also published as: CN117877497B

Abstract

The invention discloses a time domain audio steganography method based on generalized audio internal energy and micro amplitude suppression modification, which comprises the following steps: acquiring carrier audio, carrying out segmentation pretreatment on the carrier audio, and cutting the carrier audio into audio fragments with set lengths; dividing and slicing secret information bits to be embedded into equal sections according to the number of the audio fragments to obtain secret information bit fragments; based on a complexity priority criterion, firstly constructing a steganography distortion cost function based on generalized audio internal energy, and then providing a distortion cost design criterion of micro amplitude suppression modification, wherein the steganography distortion cost function and the distortion cost design criterion are combined to obtain a final steganography distortion cost function; calculating the steganography distortion cost of each audio segment, and embedding the secret information bit segment into each audio segment based on a steganography coding method to obtain a plurality of carrier audio segments; and splicing all the encrypted audio fragments to obtain the final encrypted audio. The invention has the advantages of higher steganography safety and better carrier audio quality.

Description

Time domain audio steganography based on generalized audio intrinsic energy and micro-amplitude suppression modification Method

技术领域Technical Field

本发明涉及时域音频隐写技术领域，具体涉及一种基于广义音频内在能量和微幅值抑制修改的时域音频隐写方法。The present invention relates to the technical field of time domain audio steganography, and in particular to a time domain audio steganography method based on generalized audio intrinsic energy and micro-amplitude suppression modification.

背景技术Background technique

随着近年来计算机和电子技术的快速发展，WAV格式的高质量无损音频(时域音频)在互联网上被广泛传播和分享。以此为载体的时域音频隐写(Audio Steganography inTemporal Domain，ASTD)技术就是将秘密信息以某种方式嵌入到时域音频中，同时不引起监控方的怀疑。目前主流且实用的时域音频隐写方案都是基于最小失真嵌入框架的设计的，该框架主要包括隐写编码和失真代价函数设计两部分，而随着高效且实用的STC、SPC等隐写编码的提出，现阶段该框架下的研究重点都集中在了隐写失真代价函数的设计上。With the rapid development of computer and electronic technology in recent years, high-quality lossless audio (time domain audio) in WAV format has been widely disseminated and shared on the Internet. The audio steganography in temporal domain (ASTD) technology based on this medium is to embed secret information into the time domain audio in a certain way without arousing suspicion from the monitoring party. At present, the mainstream and practical time domain audio steganography schemes are all based on the design of the minimum distortion embedding framework, which mainly includes two parts: stego coding and distortion cost function design. With the introduction of efficient and practical stego coding such as STC and SPC, the research focus under this framework is currently on the design of the stego distortion cost function.

在该框架下，有学者提出了基于导数滤波残差失真代价的时域音频隐写方法DFR。据公开报道，DFR是目前安全性能最好的时域音频隐写算法，具体来说，DFR在现有复杂度优先准则的指导下，引入了导数滤波残差来刻画载体的复杂度从而构建了相应的隐写失真代价函数。然而，这种方法的隐写安全性能通常在很大程度上取决于用来获取残差的滤波器的选取，不同的滤波器最终得到的隐写安全性能可能会有较大的差异，因此该方法的性能并不稳定；其次，尽管DFR方法中的大幅值优先(LAF)准则可以防止微小幅值采样点的修改，但它在一定程度上会与内容自适应隐写的复杂度优先准则相冲突，使得原本适合嵌入的高复杂度的低幅值(非微小幅值)采样点较少被修改，而原本不适合嵌入的低复杂度的高幅值采样点则较多被修改，从而可能引起隐写安全性能的急剧下降。Under this framework, some scholars proposed a time-domain audio steganography method DFR based on derivative filter residual distortion cost. According to public reports, DFR is currently the best time-domain audio steganography algorithm with the best security performance. Specifically, under the guidance of the existing complexity priority criterion, DFR introduces derivative filter residual to characterize the complexity of the carrier and constructs the corresponding stego distortion cost function. However, the stego security performance of this method usually depends to a large extent on the selection of the filter used to obtain the residual. The final stego security performance obtained by different filters may be quite different, so the performance of this method is not stable; secondly, although the large amplitude priority (LAF) criterion in the DFR method can prevent the modification of small amplitude sampling points, it will conflict with the complexity priority criterion of content adaptive steganography to a certain extent, making the high-complexity low-amplitude (non-small amplitude) sampling points originally suitable for embedding less modified, while the low-complexity high-amplitude sampling points originally not suitable for embedding are more modified, which may cause a sharp decline in stego security performance.

发明内容Summary of the invention

为了克服现有时域音频隐写算法DFR在其隐写失真代价函数设计中存在的缺陷和不足，本发明提供一种基于广义音频内在能量和微幅值抑制修改的时域音频隐写方法，用载体的内在能量来刻画载体复杂度，将微幅值抑制准则应用于复杂度优先准则之后，再把微小幅值采样点设置为“湿点”，致力于精准地预防微小幅值采样点被修改，同时还不影响其它幅值采样点的嵌入修改，这使得由本发明得到的载密音频更难被隐写分析器检测出来，进而提高了本发明的隐写安全性。In order to overcome the defects and shortcomings of the existing time-domain audio steganography algorithm DFR in the design of its steganalysis distortion cost function, the present invention provides a time-domain audio steganography method based on generalized audio intrinsic energy and micro-amplitude suppression modification, which uses the intrinsic energy of the carrier to characterize the complexity of the carrier, applies the micro-amplitude suppression criterion to the complexity priority criterion, and then sets the small amplitude sampling points as "wet points", which is committed to accurately preventing the small amplitude sampling points from being modified, while not affecting the embedded modification of other amplitude sampling points, which makes the secret audio obtained by the present invention more difficult to be detected by the steganalyzer, thereby improving the steganographic security of the present invention.

本发明的第二目的在于提供一种基于广义音频内在能量和微幅值抑制修改的时域音频隐写系统；The second object of the present invention is to provide a time domain audio steganography system based on generalized audio intrinsic energy and micro-amplitude suppression modification;

本发明的第三目的在于提供一种计算机可读存储介质；A third object of the present invention is to provide a computer readable storage medium;

本发明的第四目的在于提供一种计算机设备。A fourth object of the present invention is to provide a computer device.

为了达到上述目的，本发明采用以下技术方案：In order to achieve the above object, the present invention adopts the following technical solutions:

本发明提供一种基于广义音频内在能量和微幅值抑制修改的时域音频隐写方法，包括下述步骤：The present invention provides a time domain audio steganography method based on generalized audio intrinsic energy and micro-amplitude suppression modification, comprising the following steps:

获取载体音频，对载体音频进行切分预处理，将载体音频切割成设定长度的音频片段；Acquire carrier audio, perform segmentation preprocessing on the carrier audio, and cut the carrier audio into audio segments of a set length;

根据音频片段的数量将待嵌入的秘密信息比特进行等分切片，得到秘密信息比特片段；The secret information bits to be embedded are equally sliced according to the number of audio segments to obtain secret information bit segments;

基于复杂度优先准则，构建基于广义音频内在能量的隐写失真代价函数，将基于广义音频内在能量的隐写失真代价函数与微幅值抑制修改的失真代价设计准则相结合，得到最终的隐写失真代价函数；Based on the complexity priority criterion, a steganographic distortion cost function based on generalized audio intrinsic energy is constructed. The steganographic distortion cost function based on generalized audio intrinsic energy is combined with the distortion cost design criterion of micro-amplitude suppression modification to obtain the final steganographic distortion cost function.

计算每个音频片段的隐写失真代价，基于最小失真隐写框架的STC隐写编码方法将秘密信息比特片段嵌入每个音频片段，得到多个载密音频片段；The steganalysis distortion cost of each audio segment is calculated, and the STC steganalysis coding method based on the minimum distortion steganography framework is used to embed the secret information bit segment into each audio segment to obtain multiple secret audio segments.

将所有的载密音频片段拼接得到最终的载密音频。All the encrypted audio clips are spliced together to get the final encrypted audio.

作为优选的技术方案，构建基于广义音频内在能量的隐写失真代价函数，具体包括：As a preferred technical solution, a steganalysis distortion cost function based on generalized audio intrinsic energy is constructed, which specifically includes:

获取局部载体音频信号，基于频谱变换得到不同的载体音频频率成分；Acquire a local carrier audio signal, and obtain different carrier audio frequency components based on spectrum transformation;

不同的载体音频频率成分设置不同的权重，所述权重设置指数调整系数，计算局部载体音频信号用于隐写的内在能量；Different weights are set for different carrier audio frequency components, and the weights are used to set exponential adjustment coefficients to calculate the intrinsic energy of the local carrier audio signal for steganography;

将局部载体音频信号用于隐写的内在能量作为局部载体音频信号的中心点的复杂度指标，载体音频的隐写失真代价与复杂度成反比，构建得到基于广义音频内在能量的隐写失真代价函数。The intrinsic energy of the local carrier audio signal used for steganography is used as the complexity index of the center point of the local carrier audio signal. The steganalysis distortion cost of the carrier audio is inversely proportional to the complexity. A steganalysis distortion cost function based on the generalized audio intrinsic energy is constructed.

作为优选的技术方案，所述获取局部载体音频信号，采用设置滑动窗口截取或使用镜像填充获取局部载体音频信号。As a preferred technical solution, the local carrier audio signal is obtained by setting a sliding window to intercept or using mirror filling to obtain the local carrier audio signal.

作为优选的技术方案，所述频谱变换采用离散余弦变换、离散傅里叶变换或离散小波变换中的任意一种。As a preferred technical solution, the spectrum transformation adopts any one of discrete cosine transform, discrete Fourier transform or discrete wavelet transform.

作为优选的技术方案，构建基于广义音频内在能量的隐写失真代价函数，具体表示为：As a preferred technical solution, a steganalysis distortion cost function based on generalized audio intrinsic energy is constructed, which is specifically expressed as:

其中，ρ_c表示局部载体音频信号的中心点的失真代价，λ表示用于防止ρ_c过小的平衡常数，ε表示稳定常数，n表示不同的载体音频频率成分的个数，f_i表示第i个载体音频频率成分的幅值，w_i表示第i个载体音频频率成分对应的权重，p表示指数调整系数，表示局部载体音频信号x_l用于隐写的内在能量；计算音频的内在隐写能量时，排除直流频率成分的影响，即舍弃f₁。Wherein, ρ _c represents the distortion cost of the center point of the local carrier audio signal, λ represents the equilibrium constant used to prevent ρ _c from being too small, ε represents the stability constant, n represents the number of different carrier audio frequency components, _fi represents the amplitude of the i-th carrier audio frequency component, _wi represents the weight corresponding to the i-th carrier audio frequency component, and p represents the exponential adjustment coefficient. represents the intrinsic energy of the local carrier audio signal x _l used for steganography; when calculating the intrinsic steganographic energy of the audio, the influence of the DC frequency component is excluded, that is, f ₁ is discarded.

作为优选的技术方案，指数调整系数表示为：As a preferred technical solution, the index adjustment coefficient is expressed as:

其中，psehf表示每个载体中高频分量所占的能量百分比，k表示高频成分的选取阈值，k≥2，是每个载体中所有载体音频频率成分的平均幅值。Among them, psehf represents the energy percentage of high-frequency components in each carrier, k represents the selection threshold of high-frequency components, k≥2, is the average amplitude of all carrier audio frequency components in each carrier.

作为优选的技术方案，将基于广义音频内在能量的隐写失真代价函数与微幅值抑制修改的失真代价设计准则相结合，得到最终的隐写失真代价函数，具体表示为：As a preferred technical solution, the steganographic distortion cost function based on generalized audio intrinsic energy is combined with the distortion cost design criterion of micro-amplitude suppression modification to obtain the final steganographic distortion cost function, which is specifically expressed as:

其中，|x_c|为局部载体音频信号的中心点x_c的幅值，T为被抑制修改的采样点的幅值阈值，ψ为微幅值抑制准则下湿点的失真代价，w_i表示第i个载体音频频率成分对应的权重，p表示指数调整系数，n表示不同的载体音频频率成分的个数，ε表示稳定常数，ρ_c表示局部载体音频信号的中心点的失真代价。Among them, | _xc | is the amplitude of the center point _xc of the local carrier audio signal, T is the amplitude threshold of the suppressed and modified sampling point, ψ is the distortion cost of the wet point under the micro-amplitude suppression criterion, _wi represents the weight corresponding to the i-th carrier audio frequency component, p represents the exponential adjustment coefficient, n represents the number of different carrier audio frequency components, ε represents the stability constant, and _ρc represents the distortion cost of the center point of the local carrier audio signal.

为了达到上述第二目的，本发明采用以下技术方案：In order to achieve the above second purpose, the present invention adopts the following technical solutions:

本发明提供一种基于广义音频内在能量和微幅值抑制修改的时域音频隐写系统，包括：载体音频获取模块、载体音频预处理模块、秘密信息比特预处理模块、隐写失真代价函数构建模块、隐写失真代价计算模块、隐写编码模块、音频拼接模块；The present invention provides a time domain audio steganography system based on generalized audio intrinsic energy and micro-amplitude suppression modification, comprising: a carrier audio acquisition module, a carrier audio preprocessing module, a secret information bit preprocessing module, a steganalysis distortion cost function construction module, a steganalysis distortion cost calculation module, a steganalysis encoding module, and an audio splicing module;

所述载体音频获取模块用于获取载体音频；The carrier audio acquisition module is used to acquire carrier audio;

所述载体音频预处理模块用于对载体音频进行切分预处理，将载体音频切割成设定长度的音频片段；The carrier audio preprocessing module is used to perform segmentation preprocessing on the carrier audio, cutting the carrier audio into audio segments of a set length;

所述秘密信息比特预处理模块用于根据音频片段的数量将待嵌入的秘密信息比特进行等分切片，得到秘密信息比特片段；The secret information bit preprocessing module is used to equally slice the secret information bits to be embedded according to the number of audio segments to obtain secret information bit segments;

所述隐写失真代价函数构建模块用于基于复杂度优先准则，构建基于广义音频内在能量的隐写失真代价函数，将基于广义音频内在能量的隐写失真代价函数与微幅值抑制修改的失真代价设计准则相结合，得到最终的隐写失真代价函数；The steganalysis distortion cost function construction module is used to construct a steganalysis distortion cost function based on generalized audio intrinsic energy based on the complexity priority criterion, and combine the steganalysis distortion cost function based on generalized audio intrinsic energy with the distortion cost design criterion of micro-amplitude suppression modification to obtain the final steganalysis distortion cost function;

所述隐写失真代价计算模块用于计算每个音频片段的隐写失真代价；The steganalysis distortion cost calculation module is used to calculate the steganalysis distortion cost of each audio clip;

所述隐写编码模块用于基于最小失真隐写框架的STC隐写编码方法将秘密信息比特片段嵌入每个音频片段，得到多个载密音频片段；The stego-coding module is used to embed the secret information bit fragment into each audio fragment based on the STC stego-coding method of the minimum distortion steganographic framework to obtain multiple secret audio fragments;

所述音频拼接模块用于将所有的载密音频片段拼接得到最终的载密音频。The audio splicing module is used to splice all the encrypted audio clips to obtain the final encrypted audio.

为了达到上述第三目的，本发明采用以下技术方案：In order to achieve the third objective, the present invention adopts the following technical solutions:

一种计算机可读存储介质，存储有程序，所述程序被处理器执行时实现如上述基于广义音频内在能量和微幅值抑制修改的时域音频隐写方法。A computer-readable storage medium stores a program, which, when executed by a processor, implements the time-domain audio steganography method based on generalized audio intrinsic energy and micro-amplitude suppression modification as described above.

为了达到上述第四目的，本发明采用以下技术方案：In order to achieve the fourth objective, the present invention adopts the following technical solutions:

一种计算机设备，包括处理器和用于存储处理器可执行程序的存储器，所述处理器执行存储器存储的程序时，实现如上述基于广义音频内在能量和微幅值抑制修改的时域音频隐写方法。A computer device comprises a processor and a memory for storing a program executable by the processor. When the processor executes the program stored in the memory, the time-domain audio steganography method based on generalized audio intrinsic energy and micro-amplitude suppression modification as described above is implemented.

本发明与现有技术相比，具有如下优点和有益效果：Compared with the prior art, the present invention has the following advantages and beneficial effects:

(1)现有DFR法使用导数滤波残差来刻画载体的复杂度并构建其隐写失真代价函数，这种方法下的隐写安全性能通常在很大程度上取决于高效滤波器的选择，而本发明舍弃了用载体信号残差来表征载体复杂度的思路，用载体的内在能量来刻画载体复杂度，并基于此设计了一个新的隐写失真代价函数，这使得算法在应用时省去了精心挑选高效滤波器的麻烦，进而为本发明带来了更优且更稳定的隐写安全性。(1) The existing DFR method uses the derivative filter residual to characterize the complexity of the carrier and constructs its stegographic distortion cost function. The stegographic security performance under this method usually depends to a large extent on the selection of efficient filters. The present invention abandons the idea of using the carrier signal residual to characterize the carrier complexity, and uses the intrinsic energy of the carrier to characterize the carrier complexity. Based on this, a new stegographic distortion cost function is designed, which saves the trouble of carefully selecting efficient filters when applying the algorithm, thereby bringing better and more stable stegographic security to the present invention.

(2)现有DFR方法中提出的大幅值优先准则可以在一定程度上防止微小幅值采样点的修改，但它在一定程度上会与内容自适应隐写的复杂度优先准则相冲突，而本发明提出了微幅值抑制准则，将其应用于复杂度优先准则之后，再把微小幅值采样点设置为“湿点”，致力于精准地预防微小幅值采样点被修改，同时还不影响其它幅值采样点的嵌入修改，这使得由本发明得到的载密音频更难被隐写分析器检测出来，进而提高了本发明的隐写安全性，具有隐写安全性更高、载密音频质量更好的优点。(2) The large amplitude priority criterion proposed in the existing DFR method can prevent the modification of small amplitude sampling points to a certain extent, but it will conflict with the complexity priority criterion of content adaptive steganography to a certain extent. The present invention proposes a small amplitude suppression criterion, which is applied to the complexity priority criterion, and then the small amplitude sampling points are set as "wet points", which is committed to accurately preventing the small amplitude sampling points from being modified, while not affecting the embedded modification of other amplitude sampling points. This makes the encrypted audio obtained by the present invention more difficult to be detected by the steganalyzer, thereby improving the steganographic security of the present invention, and has the advantages of higher steganographic security and better quality of encrypted audio.

(3)本发明在载密音频的安全性和音频质量方面都超越了目前最先进的同类算法，取得了更优的性能，具有更好的隐写不可感知性。(3) The present invention surpasses the most advanced similar algorithms in terms of security and audio quality of encrypted audio, achieves better performance, and has better steganographic imperceptibility.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

图1为本发明基于广义音频内在能量和微幅值抑制修改的时域音频隐写方法的流程示意图；FIG1 is a flow chart of a time domain audio steganography method based on generalized audio intrinsic energy and micro-amplitude suppression modification according to the present invention;

图2为本发明构建隐写失真代价函数的实现过程框架示意图。FIG. 2 is a schematic diagram of the framework of the implementation process of constructing a steganographic distortion cost function according to the present invention.

具体实施方式Detailed ways

为了使本发明的目的、技术方案及优点更加清楚明白，以下结合附图及实施例，对本发明进行进一步详细说明。应当理解，此处所描述的具体实施例仅仅用以解释本发明，并不用于限定本发明。In order to make the purpose, technical solution and advantages of the present invention more clearly understood, the present invention is further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are only used to explain the present invention and are not intended to limit the present invention.

实施例1Example 1

如图1所示，本实施例提供一种基于广义音频内在能量和微幅值抑制修改的时域音频隐写方法，包括下述步骤：As shown in FIG1 , this embodiment provides a time domain audio steganography method based on generalized audio intrinsic energy and micro-amplitude suppression modification, comprising the following steps:

S1：获取载体音频，对载体音频进行切分预处理，将载体音频切割成设定长度的音频片段，本实施例每个音频片段的切割长度优选为1s；S1: Acquire carrier audio, perform segmentation preprocessing on the carrier audio, and cut the carrier audio into audio segments of a set length. In this embodiment, the cutting length of each audio segment is preferably 1 second;

S2：根据音频片段的数量将待嵌入的秘密信息比特bit也进行等分切片，得到秘密信息比特片段；S2: the secret information bits to be embedded are also equally sliced according to the number of audio segments to obtain secret information bit segments;

S3：针对每个音频片段，先构建隐写失真代价函数计算其隐写失真代价，然后基于最小失真隐写框架的STC隐写编码方法将秘密信息比特片段嵌入其中，得到多个载密音频片段；S3: For each audio clip, a stego-distortion cost function is first constructed to calculate its stego-distortion cost, and then the secret information bit fragment is embedded into it based on the STC stego-coding method of the minimum distortion steganography framework to obtain multiple secret audio clips;

S4：将所有的载密音频片段拼接起来就得到了最终的载密音频。S4: All the encrypted audio clips are spliced together to obtain the final encrypted audio.

在本实施例中，基于构建隐写失真代价函数计算载体相应的隐写嵌入失真代价，其隐写失真代价函数的构建过程为：先基于复杂度优先准则，提出基于广义音频内在能量的隐写失真代价函数，在此基础上，又提出微幅值抑制修改的失真代价设计准则对其进行补充；In this embodiment, the steganalysis distortion cost corresponding to the carrier is calculated based on the construction of the steganalysis distortion cost function. The construction process of the steganalysis distortion cost function is as follows: first, based on the complexity priority criterion, a steganalysis distortion cost function based on the generalized audio intrinsic energy is proposed. On this basis, a distortion cost design criterion of micro-amplitude suppression modification is proposed to supplement it.

在本实施例中，基于广义音频内在能量的隐写失真代价函数设计具体包括：In this embodiment, the design of the steganalysis distortion cost function based on the generalized audio intrinsic energy specifically includes:

数字信号的能量可以定义为其所有频率分量的幅值平方之和，对于一维音频信号而言，其频率分量可以利用一维的频率变换函数轻松获得，如DCT(离散余弦变换)、DFT(离散傅里叶变换)、DWT(离散小波变换)等；The energy of a digital signal can be defined as the sum of the squares of the amplitudes of all its frequency components. For a one-dimensional audio signal, its frequency components can be easily obtained using a one-dimensional frequency transform function, such as DCT (discrete cosine transform), DFT (discrete Fourier transform), DWT (discrete wavelet transform), etc.

如图2所示，本实施例首先取一个大小为n(n>1)的滑动窗口SW_n来截取一段局部载体音频信号，并将其表示为x_l＝[x₁,x₂...x_n]，其中n通常为奇数。对于载体音频频率成分的提取，本实施例优选DCT变换来实现，接着，通过对局部载体音频信号x_l进行DCT变换，可以得到n个不同的频率成分，即F₁,F₂...F_n。假设第i个频率分量F_i的强度(幅度)为f_i，那么参照数字信号能量的定义，局部载体音频信号x_l的内在能量就可以表示为：As shown in FIG2 , this embodiment first uses a sliding window SW _n of size n (n>1) to intercept a section of the local carrier audio signal, and represents it as x _l =[x ₁ ,x ₂ ...x _n ], where n is usually an odd number. For the extraction of the carrier audio frequency component, this embodiment preferably uses DCT transformation to achieve it. Then, by performing DCT transformation on the local carrier audio signal x _l , n different frequency components, namely F ₁ ,F ₂ ...F _n , can be obtained. Assuming that the intensity (amplitude) of the i-th frequency component F _i is _fi , then referring to the definition of digital signal energy, the intrinsic energy of the local carrier audio signal x _l is It can be expressed as:

在载体的高频区域(高复杂度区域)的嵌入修改一般比在低频区域(低复杂度区域)的嵌入修改更安全，这意味着，在上述能量公式中信号能量相等的情况下，高频区域理论上更适合嵌入，为此，在计算音频内在能量来设计隐写失真代价时，应该给公式(1)中的不同频率成分分配不同的权重w，其原则是频率越高，权重越大。此外，隐写失真代价一般与载体的内容复杂度有关，而与载体的内容(即载体信号的幅值)本身无关，DFR_based方法中的结果也证实了音频信号幅值不是很适合用于其隐写失真代价的设计，因此，需要去除公式中的直流(Direct Current)频率分量f₁，以便消除音频幅值对其内在能量计算的影响。另一方面，为了便于调整权重w，为权重w引入了一个指数调整系数p；至此，在本实施例提出的方案中，局部载体音频信号x_l用于隐写的内在能量的构造可以进一步表示成如下广义的形式：Embedding modification in the high frequency region (high complexity region) of the carrier is generally safer than embedding modification in the low frequency region (low complexity region). This means that when the signal energy is equal in the above energy formula, the high frequency region is theoretically more suitable for embedding. Therefore, when calculating the intrinsic energy of the audio to design the steganographic distortion cost, different weights w should be assigned to different frequency components in formula (1). The principle is that the higher the frequency, the greater the weight. In addition, the steganographic distortion cost is generally related to the content complexity of the carrier, but not to the content of the carrier (i.e., the amplitude of the carrier signal). The results in the DFR_based method also confirm that the amplitude of the audio signal is not very suitable for the design of its steganographic distortion cost. Therefore, it is necessary to remove the direct current frequency component _f1 in the formula to eliminate the influence of the audio amplitude on its intrinsic energy calculation. On the other hand, in order to facilitate the adjustment of the weight w, an exponential adjustment coefficient p is introduced for the weight w; so far, in the scheme proposed in this embodiment, the local carrier audio signal _xl is used for the intrinsic energy of steganography. The construction can be further expressed in the following generalized form:

在不失一般性的前提下，为了音频隐写失真代价函数设计的需要，在本实施例提出的方案中，将被视为x_l的中心点x_c的复杂度指标，载体音频的隐写失真代价被设计成与其复杂度成反比，按照这个范式，将所提出的基于广义音频内在能量的时域音频隐写的失真代价函数定义为：Without loss of generality, in order to meet the needs of audio steganography distortion cost function design, in the solution proposed in this embodiment, The complexity index of the center point _xc of _xl is considered, and the steganalysis distortion cost of the carrier audio is designed to be inversely proportional to its complexity. According to this paradigm, the distortion cost function of the proposed time-domain audio steganography based on generalized audio intrinsic energy is defined as:

其中，ρ_c是x_c的失真代价，λ是防止ρ_c过小的平衡常数，ε是稳定常数。针对逐个采样点依次滑动SW_n窗口，再利用上述公式就可以得到每个音频载体样本中所有采样点的失真代价，计算音频的内在隐写能量时，排除直流频率成分的影响，即舍弃f₁；Among them, ρ _c is the distortion cost of x _c , λ is the equilibrium constant to prevent ρ _c from being too small, and ε is the stability constant. Slide the SW _n window for each sampling point in turn, and then use the above formula to get the distortion cost of all sampling points in each audio carrier sample. When calculating the intrinsic steganographic energy of the audio, exclude the influence of the DC frequency component, that is, discard f ₁ ;

值得注意的是，如果需要的话，将使用镜像填充来获取x_l。由于本发明新提出的音频隐写失真代价函数是基于其广义的音频内在能量设计的，为了便于表述，将其命名为GAIE(Generalized Audio Intrinsic Energy)方法。It is worth noting that, if necessary, mirror padding will be used to obtain x _l . Since the audio steganography distortion cost function proposed in the present invention is designed based on its generalized audio intrinsic energy, it is named GAIE (Generalized Audio Intrinsic Energy) method for ease of description.

上述公式中稳定常数ε取10^-2，第i个频率成分对应的权重w_i被设置为w_i＝i(2≤i≤n)。考虑到指数权重w_i ^p的影响，公式中平衡常数λ被设置为n^p*10⁶。而对于滑动窗口大小n的参数选择，经实验发现通常随着n的增大，本发明的隐写安全性能也会随之提升，当n到达169时性能趋于稳定，因此n的值优选固定取169。此外，实验表明参数p的最优取值与数据集密切相关，经分析，参数p主要作用于频率成分的权重，因此尝试通过每个载体中高频分量所占的能量百分比(percentage of signal energy occupied by the high-frequencycomponents:pseh f)来确定参数p。psehf的具体定义如下：In the above formula, the stability constant ε is taken as ^10-2 , and the weight w _i corresponding to the i-th frequency component is set to w _i =i (2≤i≤n). Taking into account the influence of the exponential weight w _i ^p , the equilibrium constant λ in the formula is set to n ^p *10 ^6. As for the parameter selection of the sliding window size n, experiments have found that as n increases, the steganographic security performance of the present invention will also increase accordingly. When n reaches 169, the performance tends to be stable, so the value of n is preferably fixed to 169. In addition, experiments show that the optimal value of the parameter p is closely related to the data set. After analysis, the parameter p mainly acts on the weight of the frequency component. Therefore, an attempt is made to determine the parameter p by the percentage of signal energy occupied by the high-frequency components in each carrier (percentage of signal energy occupied by the high-frequency components: pseh f). The specific definition of psehf is as follows:

其中，k(k≥2)是高频成分的选取阈值，实验表明k的最佳取值为6。是每个载体中所有局部信号片段的频率F_i的平均强度，直流频率也需要被排除在外。样本的psehf越大，表明其高频分量的比例就越大。经实验表明，当psehf越大时，相对较大的p能取得更好的隐写安全性。由此，构造了psehf与参数p的具体对应关系如下：Among them, k (k≥2) is the selection threshold of high-frequency components. Experiments show that the best value of k is 6. is the average intensity of the frequency F _i of all local signal fragments in each carrier, and the DC frequency also needs to be excluded. The larger the psehf of the sample, the greater the proportion of its high-frequency component. Experiments have shown that when the psehf is larger, a relatively large p can achieve better steganographic security. Therefore, the specific correspondence between psehf and parameter p is constructed as follows:

在本实施例中，基于广义音频内在能量和微幅值抑制修改的隐写失真代价函数设计具体包括：In this embodiment, the design of the steganalysis distortion cost function based on generalized audio intrinsic energy and micro-amplitude suppression modification specifically includes:

针对DFR算法中LAF准则不通用的缺陷，本发明提出微幅值抑制准则(Micro-Amplitude-Suppression，MAS)，并通过将其与复杂度优先准则相配合，即在应用复杂度优先准则之后，再将微小幅值采样点设置为“湿点”(代表不能用的点)，致力于精准地预防微小幅值采样点被修改，同时还不影响其它幅值采样点的嵌入修改。通过将本MAS准则与上面的GAIE相结合，就可得到本发明的关键技术——基于广义音频内在能量和微幅值抑制修改的隐写失真代价函数(GAIE_MAS)，形式如下：In view of the defect that the LAF criterion in the DFR algorithm is not universal, the present invention proposes a micro-amplitude suppression criterion (Micro-Amplitude-Suppression, MAS), and combines it with the complexity priority criterion, that is, after applying the complexity priority criterion, the micro-amplitude sampling points are set as "wet points" (representing unusable points), which is committed to accurately preventing the micro-amplitude sampling points from being modified, while not affecting the embedded modification of other amplitude sampling points. By combining this MAS criterion with the above GAIE, the key technology of the present invention can be obtained - the steganographic distortion cost function based on generalized audio intrinsic energy and micro-amplitude suppression modification (GAIE_MAS), which is in the following form:

其中，|x_c|为采样点x_c的幅值，T为被抑制修改的采样点的幅值阈值。ψ为MAS准则下湿点的失真代价，本实施例MAS准则下湿点的失真代价优选设置为10²⁰。即当采样点幅值小于阈值时，其失真代价会非常之大，来防止该点被修改。Wherein, |x _c | is the amplitude of the sampling point x _c , and T is the amplitude threshold of the sampling point that is suppressed from modification. ψ is the distortion cost of the wet point under the MAS criterion. In this embodiment, the distortion cost of the wet point under the MAS criterion is preferably set to 10 ²⁰ . That is, when the amplitude of the sampling point is less than the threshold, its distortion cost will be very large to prevent the point from being modified.

现有DFR法使用导数滤波残差来刻画载体的复杂度并构建其隐写失真代价函数，这种方法下的隐写安全性能通常在很大程度上取决于高效滤波器的选择。为此，本发明舍弃了用载体信号残差来表征载体复杂度的思路，利用载体的内在能量来刻画载体复杂度，并基于此设计了一个新的隐写失真代价函数，这使得算法在应用时省去了精心挑选高效滤波器的麻烦，进而为本发明带来了更优且更稳定的隐写安全性。The existing DFR method uses the derivative filter residual to characterize the complexity of the carrier and construct its steganalysis distortion cost function. The steganalysis security performance under this method usually depends to a large extent on the selection of efficient filters. To this end, the present invention abandons the idea of using the carrier signal residual to characterize the complexity of the carrier, uses the intrinsic energy of the carrier to characterize the complexity of the carrier, and designs a new steganalysis distortion cost function based on this, which saves the trouble of carefully selecting efficient filters when applying the algorithm, thereby bringing better and more stable steganalysis security to the present invention.

此外，现有DFR方法中提出的大幅值优先准则可以在一定程度上防止微小幅值采样点的修改，但它在一定程度上会与内容自适应隐写的复杂度优先准则相冲突，为此，本发明提出了微幅值抑制准则(Micro-Amplitude-Suppression，MAS)，将其应用于复杂度优先准则之后，再把微小幅值采样点设置为“湿点”，致力于精准地预防微小幅值采样点被修改，同时还不影响其它幅值采样点的嵌入修改，这使得由本发明得到的载密音频更难被隐写分析器检测出来，进而提高了本发明的隐写安全性，本发明具有隐写安全性更高、载密音频质量更好的优点。In addition, the large amplitude priority criterion proposed in the existing DFR method can prevent the modification of small amplitude sampling points to a certain extent, but it will conflict with the complexity priority criterion of content adaptive steganography to a certain extent. For this reason, the present invention proposes a micro-amplitude suppression criterion (Micro-Amplitude-Suppression, MAS), which is applied to the complexity priority criterion, and then the small amplitude sampling points are set as "wet points", which is committed to accurately preventing the small amplitude sampling points from being modified, while not affecting the embedded modification of other amplitude sampling points. This makes the secret audio obtained by the present invention more difficult to be detected by the steganalyzer, thereby improving the steganalysis security of the present invention. The present invention has the advantages of higher steganalysis security and better quality of secret audio.

为验证本发明的隐写方法具有更好的隐写安全性，在CMU_ARCTIC(CMU)，LJSpeech(LJ)，TED-LIUM corpus(TED)，Arabic Speech(ARABIC)四个音频数据集上进行测试；In order to verify that the steganographic method of the present invention has better steganographic security, tests are conducted on four audio datasets: CMU_ARCTIC (CMU), LJSpeech (LJ), TED-LIUM corpus (TED), and Arabic Speech (ARABIC);

对于上述四个数据集中的每一个，将首先从中随机选择一些音频片段，然后将其裁剪成一系列时长为1秒的较短片段，并以WAV格式进行存储。从每个数据集中选择10000个1秒的音频片段进行实验，从而组成四个音频数据集；For each of the above four datasets, some audio clips will be randomly selected from them, and then they will be cropped into a series of shorter clips of 1 second in length and stored in WAV format. 10,000 1-second audio clips are selected from each dataset for experiments, thus forming four audio datasets;

在测试时，所有的隐写方案都是在其相应的负载-失真界限下模拟的，相对负载α∈{0.1,0.2,0.3,0.4,0.5}bps。对于隐写分析器的设计，采用目前最优的隐写特征TMF来评估隐写方案的经验安全性能，FLD被用来训练集成分类器。数据集中一半的原始载体/含密载体音频对将被用于训练分类器，另一半将被当作测试集。安全性能被定义为在测试集上执行20次随机测试得到的相同先验条件下的平均分类错误概率，记为平均分类错误概率/>越接近50％，则说明隐写方案的安全性越好。During testing, all steganographic schemes are simulated under their corresponding load-distortion bounds, with relative loads α∈{0.1, 0.2, 0.3, 0.4, 0.5} bps. For the design of the steganalyzer, the currently optimal steganalysis feature TMF is used to evaluate the empirical security performance of the steganographic scheme, and FLD is used to train the ensemble classifier. Half of the original carrier/secret carrier audio pairs in the dataset will be used to train the classifier, and the other half will be used as a test set. The security performance is defined as the average classification error probability under the same prior conditions obtained by performing 20 random tests on the test set, denoted as Average classification error probability/> The closer it is to 50%, the better the security of the steganography scheme.

为了体现所提出的GAIE_MAS方案的优势，将其与现有的最先进的隐写方案AAC和DFR进行安全性能对比，如下表1和表2所示，展示了测试的隐写方案在四个数据集上的性能对比，从表中可以发现，AAC和DFR方法在不同数据集上的性能并不稳定，波动较大。而本发明提出的综合方案GAIE_MAS几乎在所有数据集上均取得了最优或与最优方案相差无几的性能。In order to reflect the advantages of the proposed GAIE_MAS scheme, the security performance is compared with the existing most advanced steganography schemes AAC and DFR, as shown in Tables 1 and 2 below, which show the performance comparison of the tested steganography schemes on four data sets. It can be found from the table that the performance of the AAC and DFR methods on different data sets is not stable and fluctuates greatly. The comprehensive scheme GAIE_MAS proposed in this invention has achieved the best or almost the same performance as the best scheme on almost all data sets.

表1隐写方案在CMU和LJ数据集上抵抗TMF特征检测的安全性能对比Table 1 Security performance of steganographic schemes against TMF feature detection on CMU and LJ datasets Compared

表2隐写方案在TED和ARABIC数据集上抵抗TMF特征检测的安全性能对比Table 2 Security performance of steganographic schemes against TMF feature detection on TED and ARABIC datasets Compared

除隐写安全性之外，不可感知性也是评价隐写质量的一个重要指标。为了比较不同时域音频隐写方案的不可感知性，引入了音频质量感知评价(PEAQ)指标，这是ITU-RBS.1387推荐的音频感知的客观评价标准。其首先利用人耳的主观感知特性，计算出音频信号的掩蔽阅值和失真阅值，然后采取人工神经网络来融合评价参数ODG(对象差异等级)。其中，ODG分数越高，说明音频的质量越好。本实施例分别从CMU和LJ中选择了2000个音频片段，然后比较测试的隐写方案在不同负载下隐写的音频质量。如下表3所示，从ODG得分对比结果可见本发明提出的GAIE_MAS方案拥有最高的ODG得分，也就是说，其具有最好的隐写不可感知性。In addition to steganographic security, imperceptibility is also an important indicator for evaluating steganographic quality. In order to compare the imperceptibility of different time-domain audio steganographic schemes, the perceptual evaluation of audio quality (PEAQ) index is introduced, which is an objective evaluation standard for audio perception recommended by ITU-RBS.1387. It first uses the subjective perception characteristics of the human ear to calculate the masking value and distortion value of the audio signal, and then adopts an artificial neural network to fuse the evaluation parameter ODG (object difference level). Among them, the higher the ODG score, the better the quality of the audio. This embodiment selects 2000 audio clips from CMU and LJ respectively, and then compares the audio quality of the steganographic schemes tested under different loads. As shown in Table 3 below, it can be seen from the ODG score comparison results that the GAIE_MAS scheme proposed in the present invention has the highest ODG score, that is, it has the best steganographic imperceptibility.

表3在CMU和LJ上测试的隐写方案的ODG得分对比Table 3 Comparison of ODG scores of steganographic schemes tested on CMU and LJ

实施例2Example 2

本实施例提供一种基于广义音频内在能量和微幅值抑制修改的时域音频隐写系统，用于实现上述实施例1的基于广义音频内在能量和微幅值抑制修改的时域音频隐写方法，该系统包括：载体音频获取模块、载体音频预处理模块、秘密信息比特预处理模块、隐写失真代价函数构建模块、隐写失真代价计算模块、隐写编码模块、音频拼接模块；This embodiment provides a time-domain audio steganography system based on generalized audio intrinsic energy and micro-amplitude suppression modification, which is used to implement the time-domain audio steganography method based on generalized audio intrinsic energy and micro-amplitude suppression modification of the above-mentioned embodiment 1. The system includes: a carrier audio acquisition module, a carrier audio preprocessing module, a secret information bit preprocessing module, a stego distortion cost function construction module, a stego distortion cost calculation module, a stego encoding module, and an audio splicing module;

在本实施例中，载体音频获取模块用于获取载体音频；In this embodiment, the carrier audio acquisition module is used to acquire the carrier audio;

在本实施例中，载体音频预处理模块用于对载体音频进行切分预处理，将载体音频切割成设定长度的音频片段；In this embodiment, the carrier audio preprocessing module is used to perform segmentation preprocessing on the carrier audio, cutting the carrier audio into audio segments of a set length;

在本实施例中，秘密信息比特预处理模块用于根据音频片段的数量将待嵌入的秘密信息比特进行等分切片，得到秘密信息比特片段；In this embodiment, the secret information bit preprocessing module is used to equally slice the secret information bits to be embedded according to the number of audio segments to obtain secret information bit segments;

在本实施例中，隐写失真代价函数构建模块用于基于复杂度优先准则，构建基于广义音频内在能量的隐写失真代价函数，将基于广义音频内在能量的隐写失真代价函数与微幅值抑制修改的失真代价设计准则相结合，得到最终的隐写失真代价函数；In this embodiment, the steganalysis distortion cost function construction module is used to construct a steganalysis distortion cost function based on generalized audio intrinsic energy based on the complexity priority criterion, and combine the steganalysis distortion cost function based on generalized audio intrinsic energy with the distortion cost design criterion of micro-amplitude suppression modification to obtain the final steganalysis distortion cost function;

在本实施例中，隐写失真代价计算模块用于计算每个音频片段的隐写失真代价；In this embodiment, the steganalysis distortion cost calculation module is used to calculate the steganalysis distortion cost of each audio clip;

在本实施例中，隐写编码模块用于基于最小失真隐写框架的STC隐写编码方法将秘密信息比特片段嵌入每个音频片段，得到多个载密音频片段；In this embodiment, the stego-coding module is used to embed the secret information bit fragment into each audio fragment based on the STC stego-coding method of the minimum distortion steganographic framework, so as to obtain a plurality of secret audio fragments;

在本实施例中，音频拼接模块用于将所有的载密音频片段拼接得到最终的载密音频。In this embodiment, the audio splicing module is used to splice all the encrypted audio clips to obtain the final encrypted audio.

实施例3Example 3

本实施例提供一种存储介质，存储介质可以是ROM、RAM、磁盘、光盘等储存介质，该存储介质存储有一个或多个程序，程序被处理器执行时，实现实施例1的基于广义音频内在能量和微幅值抑制修改的时域音频隐写方法。This embodiment provides a storage medium, which may be a storage medium such as ROM, RAM, disk, or CD. The storage medium stores one or more programs. When the program is executed by a processor, the time-domain audio steganography method based on generalized audio intrinsic energy and micro-amplitude suppression modification of Embodiment 1 is implemented.

实施例4Example 4

本实施例提供一种计算设备，该计算设备可以是台式电脑、笔记本电脑、智能手机、PDA手持终端、平板电脑或其他具有显示功能的终端设备，该计算设备包括处理器和存储器，存储器存储有一个或多个程序，处理器执行存储器存储的程序时，实现实施例1的基于广义音频内在能量和微幅值抑制修改的时域音频隐写方法。This embodiment provides a computing device, which can be a desktop computer, a laptop computer, a smart phone, a PDA handheld terminal, a tablet computer or other terminal device with a display function. The computing device includes a processor and a memory, and the memory stores one or more programs. When the processor executes the program stored in the memory, the time-domain audio steganography method based on generalized audio intrinsic energy and micro-amplitude suppression modification of Embodiment 1 is implemented.

上述实施例为本发明较佳的实施方式，但本发明的实施方式并不受上述实施例的限制，其他的任何未背离本发明的精神实质与原理下所作的改变、修饰、替代、组合、简化，均应为等效的置换方式，都包含在本发明的保护范围之内。The above embodiments are preferred implementation modes of the present invention, but the implementation modes of the present invention are not limited to the above embodiments. Any other changes, modifications, substitutions, combinations, and simplifications that do not deviate from the spirit and principles of the present invention should be equivalent replacement methods and are included in the protection scope of the present invention.

Claims

1. A time domain audio steganography method based on generalized audio intrinsic energy and micro-amplitude suppression modification, characterized in that it includes the following steps:

Acquire carrier audio, perform segmentation preprocessing on the carrier audio, and cut the carrier audio into audio segments of a set length;

The secret information bits to be embedded are equally sliced according to the number of audio segments to obtain secret information bit segments;

Based on the complexity priority criterion, a steganographic distortion cost function based on generalized audio intrinsic energy is constructed. The steganographic distortion cost function based on generalized audio intrinsic energy is combined with the distortion cost design criterion of micro-amplitude suppression modification to obtain the final steganographic distortion cost function.

The steganalysis distortion cost of each audio segment is calculated, and the STC steganalysis coding method based on the minimum distortion steganography framework is used to embed the secret information bit segment into each audio segment to obtain multiple secret audio segments.

All the encrypted audio clips are spliced together to get the final encrypted audio.

2. According to claim 1, the time domain audio steganography method based on generalized audio intrinsic energy and micro-amplitude suppression modification is characterized in that a steganographic distortion cost function based on generalized audio intrinsic energy is constructed, specifically comprising:

Acquire a local carrier audio signal, and obtain different carrier audio frequency components based on spectrum transformation;

Different weights are set for different carrier audio frequency components, and the weights are used to set exponential adjustment coefficients to calculate the intrinsic energy of the local carrier audio signal for steganography;

The intrinsic energy of the local carrier audio signal used for steganography is used as the complexity index of the center point of the local carrier audio signal. The steganalysis distortion cost of the carrier audio is inversely proportional to the complexity. A steganalysis distortion cost function based on the generalized audio intrinsic energy is constructed.

3. According to claim 2, the time-domain audio steganography method based on generalized audio intrinsic energy and micro-amplitude suppression modification is characterized in that the local carrier audio signal is obtained by setting a sliding window to intercept or using mirror filling to obtain the local carrier audio signal.

4. According to claim 2, the time-domain audio steganography method based on generalized audio intrinsic energy and micro-amplitude suppression modification is characterized in that the spectrum transformation adopts any one of discrete cosine transform, discrete Fourier transform or discrete wavelet transform.

5. The time-domain audio steganography method based on generalized audio intrinsic energy and micro-amplitude suppression modification according to claim 1 or 2 is characterized in that a steganographic distortion cost function based on generalized audio intrinsic energy is constructed, which is specifically expressed as:

Wherein, ρ _c represents the distortion cost of the center point of the local carrier audio signal, λ represents the equilibrium constant used to prevent ρ _c from being too small, ε represents the stability constant, n represents the number of different carrier audio frequency components, _fi represents the amplitude of the i-th carrier audio frequency component, _wi represents the weight corresponding to the i-th carrier audio frequency component, and p represents the exponential adjustment coefficient. represents the intrinsic energy of the local carrier audio signal x _l used for steganography; when calculating the intrinsic steganographic energy of the audio, the influence of the DC frequency component is excluded, that is, f ₁ is discarded.

6. The time-domain audio steganography method based on generalized audio intrinsic energy and micro-amplitude suppression modification according to claim 2, characterized in that the exponential adjustment coefficient is expressed as:

Among them, psehf represents the energy percentage of high-frequency components in each carrier, k represents the selection threshold of high-frequency components, k≥2, is the average amplitude of all carrier audio frequency components in each carrier.

7. The time-domain audio steganography method based on generalized audio intrinsic energy and micro-amplitude suppression modification according to claim 1 is characterized in that the steganographic distortion cost function based on generalized audio intrinsic energy is combined with the distortion cost design criterion of micro-amplitude suppression modification to obtain the final steganographic distortion cost function, which is specifically expressed as:

Among them, | _xc | is the amplitude of the center point _xc of the local carrier audio signal, T is the amplitude threshold of the suppressed and modified sampling point, ψ is the distortion cost of the wet point under the micro-amplitude suppression criterion, _wi represents the weight corresponding to the i-th carrier audio frequency component, p represents the exponential adjustment coefficient, n represents the number of different carrier audio frequency components, ε represents the stability constant, and _ρc represents the distortion cost of the center point of the local carrier audio signal.

8. A time-domain audio steganography system based on generalized audio intrinsic energy and micro-amplitude suppression modification, characterized by comprising: a carrier audio acquisition module, a carrier audio preprocessing module, a secret information bit preprocessing module, a steganalysis distortion cost function construction module, a steganalysis distortion cost calculation module, a steganalysis encoding module, and an audio splicing module;

The carrier audio acquisition module is used to acquire carrier audio;

The carrier audio preprocessing module is used to perform segmentation preprocessing on the carrier audio, cutting the carrier audio into audio segments of a set length;

The secret information bit preprocessing module is used to equally slice the secret information bits to be embedded according to the number of audio segments to obtain secret information bit segments;

The steganalysis distortion cost function construction module is used to construct a steganalysis distortion cost function based on generalized audio intrinsic energy based on the complexity priority criterion, and combine the steganalysis distortion cost function based on generalized audio intrinsic energy with the distortion cost design criterion of micro-amplitude suppression modification to obtain the final steganalysis distortion cost function;

The steganalysis distortion cost calculation module is used to calculate the steganalysis distortion cost of each audio clip;

The stego-coding module is used to embed the secret information bit fragment into each audio fragment based on the STC stego-coding method of the minimum distortion steganographic framework to obtain multiple secret audio fragments;

The audio splicing module is used to splice all the encrypted audio clips to obtain the final encrypted audio.

9. A computer-readable storage medium storing a program, characterized in that when the program is executed by a processor, the time-domain audio steganography method based on generalized audio intrinsic energy and micro-amplitude suppression modification as described in any one of claims 1 to 7 is implemented.

10. A computer device comprising a processor and a memory for storing a program executable by the processor, wherein when the processor executes the program stored in the memory, the time domain audio steganography method based on generalized audio intrinsic energy and micro-amplitude suppression modification as described in any one of claims 1 to 7 is implemented.