The invention provides a highly efficient technique for encoding a multi-channel audio signal. The invention relies on the basic principle of encoding a first signal representation of one or more of the multiple channels in a first encoder and encoding a second signal representation of one or more of the multiple channels in a second, multi-stage, encoder. This procedure is significantly enhanced by providing a controller for adaptively allocating a number of encoding bits among the different encoding stages of the second, multi-stage, encoder in dependence on multi-channel audio signal characteristics.
Legal claims defining the scope of protection. Each claim is shown in both the original legal language and a plain English translation.
1. A decoding method to decode an encoded multi-channel audio signal, the decoding method comprising: receiving first signal reconstruction data; a first decoder decoding, in response to first signal reconstruction data, an encoded first signal representation of at least one channel of said multi-channel audio signal in a first signal decoding process; a second decoder decoding, in response to second signal reconstruction data, an encoded second signal representation of at least one channel of said multi-channel audio signal in a second signal decoding process, the second signal decoding process being a multi-stage decoding process comprising first and second decoding stages; a controller receiving bit allocation information representative of how a number of bits have been allocated among a parametric encoding stage and a non-parametric encoding stage in a second multistage hybrid parametric and non-parametric signal encoding process corresponding to the second signal decoding process; and the controller determining, based on said bit allocation information, how to interpret said second signal reconstruction data in said multi-stage signal decoding process including interpreting which bits of the second signal reconstruction data are allocated to the parametric encoding stage and which bits are allocated to the non-parametric encoding stage; wherein the first and second decoding stages of the multi-stage decoding processes are such that the first decoding stage receives at least one output generated from the second decoding stage, wherein determining, based on the bit allocation information, how to interpret the second signal reconstruction data comprises determining a second signal prediction portion of the second signal reconstruction data allocated to carry an encoded second signal prediction, the second signal prediction being a prediction of the second signal representation, and determining a second signal prediction error portion of the second signal reconstruction data allocated to carry an encoded second signal prediction error, the second signal prediction error being an estimated error of the second signal prediction, wherein the second decoding stage comprises decoding the second signal prediction error portion to determine the second signal prediction error, and wherein the first decoding stage comprises decoding the second signal prediction portion; determining the second signal representation based on the decoded second signal prediction portion and based on the first signal representation decoded by the first decoder; and applying the second signal prediction error received from the second decoding stage to the determined second signal representation.
A method for decoding a multi-channel audio signal involves receiving reconstruction data, decoding a first representation of at least one channel using a first decoder, and decoding a second representation of at least one channel using a second, multi-stage decoder. A controller uses bit allocation information (how bits are split between parametric and non-parametric encoding in the encoder) to interpret the second reconstruction data. The multi-stage decoding has two stages where the first stage uses output from the second. The controller uses bit allocation to identify portions of the second reconstruction data as an encoded prediction and an encoded prediction error. The second stage decodes the prediction error, and the first stage decodes the prediction. The second signal representation is determined from the decoded prediction and the first signal representation. Finally, the prediction error is applied to the determined second signal representation.
2. A decoding apparatus configured to decode an encoded multi-channel audio signal, the decoding apparatus comprising: a receiver configured to receive first signal reconstruction data; a first decoder configured to decode, in response to first signal reconstruction data, an encoded first signal representation of at least one channel of said multi-channel audio signal; a second decoder configured to decode, in response to second signal reconstruction data, an encoded second signal representation of at least one of said multiple channels, the second decoder being a multi-stage decoder comprising second and first decoding stages; and a controller configured to receive bit allocation information representative of how a number of bits have been allocated among a parametric encoding stage and a non-parametric encoding stage in a multi-stage hybrid parametric and non-parametric encoder corresponding to the second decoder, and determine, based on said bit allocation information, how to interpret said second signal reconstruction data in said second decoder for the purpose of decoding the second signal representation including interpreting which bits of the second signal reconstruction data are allocated to the parametric encoding stage and which bits are allocated to the non-parametric encoding stage; wherein the first and second decoding stages of the multi-stage decoder are in series such that the first decoding stage receives at least one output from the second decoding stage, wherein the controller is configured determine, based on the bit allocation information, a second signal prediction portion of the second signal reconstruction data allocated to carry an encoded second signal prediction, the second signal prediction being a prediction of the second signal representation, and a second signal prediction error portion of the second signal reconstruction data allocated to carry an encoded second signal prediction error, the second signal prediction error being an estimated error of the second signal prediction, wherein the second decoding stage is configured to decode the second signal prediction error portion to determine the second signal prediction error, and wherein the first decoding stage is configured to decode the second signal prediction portion, determine the second signal representation based on the decoded second signal prediction portion and based on the first signal representation decoded by the first decoder, and apply the second signal prediction error received from the second decoding stage to the determined second signal representation.
A decoding apparatus decodes multi-channel audio by receiving reconstruction data, using a first decoder to decode a first signal representation of at least one channel, and employing a second multi-stage decoder with two stages to decode a second signal representation. A controller uses bit allocation information indicating how bits are allocated between parametric and non-parametric encoding in the corresponding encoder. Based on this, it interprets the second signal reconstruction data to decode the second signal representation, including identifying bits allocated to parametric and non-parametric encoding. The two decoding stages operate in series, with the first stage using output from the second. Based on bit allocation, the controller identifies a prediction portion and a prediction error portion. The second decoding stage decodes the prediction error, and the first decoding stage decodes the prediction, determines the second representation using the decoded prediction and the first representation and applies the prediction error.
3. An audio transmission system, characterized in that said system comprises a decoding apparatus of claim 2 .
An audio transmission system includes the decoding apparatus described in claim 2, which decodes multi-channel audio by receiving reconstruction data, using a first decoder to decode a first signal representation of at least one channel, and employing a second multi-stage decoder with two stages to decode a second signal representation. A controller uses bit allocation information indicating how bits are allocated between parametric and non-parametric encoding in the corresponding encoder. Based on this, it interprets the second signal reconstruction data to decode the second signal representation, including identifying bits allocated to parametric and non-parametric encoding. The two decoding stages operate in series, with the first stage using output from the second. Based on bit allocation, the controller identifies a prediction portion and a prediction error portion. The second decoding stage decodes the prediction error, and the first decoding stage decodes the prediction, determines the second representation using the decoded prediction and the first representation and applies the prediction error.
4. An encoding method to encode a multi-channel audio signal, the encoding method comprising: receive the multi-channel audio signal over input channels; a first encoder encoding a first signal representation of at least one channel of said multi-channel audio signal in a first signal encoding process; a second encoder encoding a second signal representation of at least one channel of said multi-channel audio signal in a second signal encoding process, said second signal encoding process being a multi-stage encoding process comprising first and second encoding stages; and a controller adaptively allocating a number of encoding bits among the first and second encoding stages of the multi-stage signal encoding process based on inter-channel correlation characteristics of the multi-channel audio signal, wherein said adaptively allocating the number of encoding bits among the first and second encoding stages is performed based on an estimated performance of the first and/or the second encoding stage; wherein said first and second encoding stages of the multi-stage signal encoding process comprise a hybrid parametric and non-parametric encoding stages, and wherein the encoding bits are allocated between the parametric encoding stage and the non-parametric encoding stage based on the inter-channel correlation characteristics; wherein said adaptively allocating the number of encoding bits comprises allocating more bits to the non-parametric encoding stage when a performance of the parametric encoding stage is saturating.
An encoding method encodes multi-channel audio by receiving the audio, encoding a first signal representation of at least one channel using a first encoder, and encoding a second signal representation of at least one channel using a second, multi-stage encoder. A controller adaptively allocates encoding bits between the stages of the second encoder based on inter-channel correlation, considering the estimated performance of each stage. The encoding stages are hybrid parametric and non-parametric, with bit allocation favoring the non-parametric stage when the parametric stage's performance saturates.
5. The encoding method of claim 4 , wherein said adaptively allocating the number of bits among the first and second encoding stages is performed on a frame-by-frame basis.
The encoding method described in claim 4, which encodes multi-channel audio by receiving the audio, encoding a first signal representation of at least one channel using a first encoder, and encoding a second signal representation of at least one channel using a second, multi-stage encoder where a controller adaptively allocates encoding bits between the stages of the second encoder based on inter-channel correlation, considering the estimated performance of each stage, performs the adaptive bit allocation on a frame-by-frame basis.
6. The encoding method of claim 4 , wherein said multi-stage signal encoding process includes an adaptive inter-channel prediction in the first encoding stage for a prediction of said second signal representation based on the first signal representation and the second signal representation, and wherein said performance is estimated at least partly based on a signal prediction error, the signal prediction error being an estimated error of the prediction of the second signal representation.
The encoding method described in claim 4, which encodes multi-channel audio by receiving the audio, encoding a first signal representation of at least one channel using a first encoder, and encoding a second signal representation of at least one channel using a second, multi-stage encoder where a controller adaptively allocates encoding bits between the stages of the second encoder based on inter-channel correlation, considering the estimated performance of each stage, includes adaptive inter-channel prediction in the first stage to predict the second signal representation from the first. The performance estimation is partly based on the signal prediction error (the error in predicting the second signal).
7. The encoding method of claim 6 , wherein said performance is estimated also based on an estimation of a quantization error as a function of the number of bits allocated for quantization of second-signal reconstruction data generated by said adaptive inter-channel prediction.
The encoding method described in claim 6, which encodes multi-channel audio using two encoders where a controller adaptively allocates encoding bits between the stages of the second encoder based on inter-channel correlation, estimating performance based on prediction error, further estimates performance based on quantization error as a function of the number of bits allocated for quantizing reconstruction data generated by the adaptive inter-channel prediction.
8. The encoding method of claim 6 , wherein said multi-stage signal encoding process includes an encoding process in the second encoding stage for encoding a representation of the signal prediction error received from said first encoding stage.
The encoding method described in claim 6, which includes adaptive inter-channel prediction and estimates performance based on prediction error, also includes encoding the signal prediction error (the output of the inter-channel prediction) in the second encoding stage.
9. The encoding method of claim 4 , wherein the adaptively allocating the number of encoding bits comprises allocating the number of bits among the first and second encoding stages based on the estimated performance of the first and/or the second encoding stage in relation to the second signal representation currently being encoded through the second signal encoding process.
The encoding method described in claim 4, which encodes multi-channel audio by receiving the audio, encoding a first signal representation of at least one channel using a first encoder, and encoding a second signal representation of at least one channel using a second, multi-stage encoder where a controller adaptively allocates encoding bits between the stages of the second encoder based on inter-channel correlation, considering the estimated performance of each stage, allocates the number of encoding bits based on the estimated performance of the first and/or second encoding stage in relation to the second signal representation being encoded.
10. The encoding method of claim 4 , wherein said number of encoding bits is determined by a bit budget for said multi-stage signal encoding process, the method further comprising generating output data representative of the bit allocation.
The encoding method described in claim 4, which encodes multi-channel audio by receiving the audio, encoding a first signal representation of at least one channel using a first encoder, and encoding a second signal representation of at least one channel using a second, multi-stage encoder where a controller adaptively allocates encoding bits between the stages of the second encoder based on inter-channel correlation, considering the estimated performance of each stage, determines the total number of encoding bits using a bit budget and also generates output data indicating the bit allocation.
11. The encoding method of claim 4 , further comprising selecting a combination bit allocation and filter length for encoding so as to optimize a measure representative of the performance of said second signal encoding process.
The encoding method described in claim 4, which encodes multi-channel audio by receiving the audio, encoding a first signal representation of at least one channel using a first encoder, and encoding a second signal representation of at least one channel using a second, multi-stage encoder where a controller adaptively allocates encoding bits between the stages of the second encoder based on inter-channel correlation, considering the estimated performance of each stage, selects a combination of bit allocation and filter length to optimize a performance measure of the second signal encoding process.
12. The encoding method of claim 11 , further comprising selecting the combination bit allocation and filter length for encoding each frame so as to minimize a Mean Squared Error (MSE) of a prediction error over an entire encoding frame.
The encoding method described in claim 11, which selects a bit allocation and filter length to optimize the second signal encoding process, selects the bit allocation and filter length to minimize the Mean Squared Error (MSE) of the prediction error over an entire encoding frame.
13. The encoding method of claim 11 , further comprising generating output data representative of the selected bit allocation and filter length.
The encoding method described in claim 11, which selects a bit allocation and filter length to optimize the second signal encoding process, also generates output data indicating the selected bit allocation and filter length.
14. The encoding method of claim 4 , further comprising: selecting combination of frame division configuration of an encoding frame into a set of sub-frames, and bit allocation and filter length for encoding for each sub-frame, so as to optimize a measure representative of the performance of said second signal encoding process over an entire encoding frame; and encoding said second signal representation in each of the sub-frames of the selected set of sub-frames separately in accordance with the selected combination.
The encoding method described in claim 4, which encodes multi-channel audio by receiving the audio, encoding a first signal representation of at least one channel using a first encoder, and encoding a second signal representation of at least one channel using a second, multi-stage encoder where a controller adaptively allocates encoding bits between the stages of the second encoder based on inter-channel correlation, considering the estimated performance of each stage, selects a combination of frame division (splitting an encoding frame into sub-frames), bit allocation, and filter length for each sub-frame to optimize the performance of the second encoding process over the entire encoding frame and encodes the second signal representation in each sub-frame separately.
15. The encoding method of claim 14 , further comprising generating output data representative of the selected frame division configuration, and for each sub-frame of the selected frame division configuration, bit allocation and filter length.
The encoding method described in claim 14, which selects a frame division, bit allocation, and filter length for sub-frames, also generates output data indicating the selected frame division and, for each sub-frame, the selected bit allocation and filter length.
16. The encoding method of claim 15 , further comprising selecting the filter length, for each sub frame, based on the length of the sub-frame so that an indication of frame division configuration of an encoding frame into a set of sub-frames at the same time provides an indication of selected filter dimension for each sub-frame to thereby reduce the required signaling.
The encoding method described in claim 15, which selects frame division, bit allocation, and filter length for sub-frames, selects the filter length for each sub-frame based on the length of that sub-frame, so that indicating the frame division also implies the selected filter dimension, reducing signaling overhead.
17. The encoding method of claim 4 , further comprising selecting a combination of number of bits to be allocated to said first encoding stage and filter length to be used in said first encoding stage so as to optimize a measure representative of the performance of at least said first encoding stage.
The encoding method described in claim 4, which encodes multi-channel audio by receiving the audio, encoding a first signal representation of at least one channel using a first encoder, and encoding a second signal representation of at least one channel using a second, multi-stage encoder where a controller adaptively allocates encoding bits between the stages of the second encoder based on inter-channel correlation, considering the estimated performance of each stage, selects a combination of the number of bits allocated to the first encoding stage and the filter length used in the first encoding stage to optimize the performance of at least the first encoding stage.
18. An encoding apparatus configured to encode a multi-channel audio signal, the decoding apparatus comprising: input channels configured to receive the multi-channel audio signal; a first encoder configured to encode a first signal representation of at least one channel of said multi-channel audio signal; a second encoder configured to encode a second signal representation of at least one channel of said multi-channel audio signal, the second encoder being a multi-stage encoder comprising first and second encoding stages; and a controller configured to adaptively allocate a number of encoding bits among the first and second encoding stages of the second encoder based on inter-channel correlation characteristics of the multi-channel audio signal, wherein the controller is configured to adaptively allocate the number of encoding bits among the first and second encoding stages based on an estimated performance of the first and/or the second encoding stage; wherein said first and second encoding stages comprise hybrid parametric and non-parametric encoding stages, and wherein said controller is configured to allocate the encoding bits between the parametric encoding stage and the non-parametric encoding stage based on the inter-channel correlation characteristics; wherein said controller is configured to allocate more bits to the non-parametric encoding stage when a performance of the parametric encoding stage is saturating.
An encoding apparatus encodes multi-channel audio with input channels, a first encoder for a first signal representation of at least one channel, a second, multi-stage encoder for a second signal representation of at least one channel, and a controller that adaptively allocates encoding bits between the stages of the second encoder based on inter-channel correlation, considering the estimated performance of each stage. The encoding stages are hybrid parametric and non-parametric, with bit allocation favoring the non-parametric stage when the parametric stage saturates.
19. The encoding apparatus of claim 18 , wherein the controller is configured to adaptively allocate the number of bits among the first and second stages based on the estimated performance of the first and/or the second encoding stage in relation to the second signal representation currently being encoded by the second encoder.
The encoding apparatus described in claim 18, which encodes multi-channel audio using two encoders where a controller adaptively allocates encoding bits between the stages of the second encoder based on inter-channel correlation, considering the estimated performance of each stage, adaptively allocates the number of bits based on the estimated performance of the stages in relation to the second signal representation being encoded by the second encoder.
20. The encoding apparatus of claim 18 , wherein said number of encoding bits are determined by a bit budget for said second encoder, and wherein said second encoder is configured to generate output data representative of the bit allocation.
The encoding apparatus described in claim 18, which encodes multi-channel audio using two encoders where a controller adaptively allocates encoding bits between the stages of the second encoder based on inter-channel correlation, considering the estimated performance of each stage, determines the total number of encoding bits using a bit budget and generates output data indicating the bit allocation.
21. The encoding apparatus of claim 18 , wherein controller is configured to adaptively allocate the number of bits among the first and second encoding stages on a frame-by-frame basis.
The encoding apparatus described in claim 18, which encodes multi-channel audio using two encoders where a controller adaptively allocates encoding bits between the stages of the second encoder based on inter-channel correlation, considering the estimated performance of each stage, adaptively allocates the number of bits between the encoding stages on a frame-by-frame basis.
22. An audio transmission system, characterized in that said system comprises an encoding apparatus of claim 18 .
An audio transmission system includes the encoding apparatus described in claim 18, which encodes multi-channel audio with input channels, a first encoder for a first signal representation of at least one channel, a second, multi-stage encoder for a second signal representation of at least one channel, and a controller that adaptively allocates encoding bits between the stages of the second encoder based on inter-channel correlation, considering the estimated performance of each stage. The encoding stages are hybrid parametric and non-parametric, with bit allocation favoring the non-parametric stage when the parametric stage saturates.
23. The apparatus of claim 18 , wherein the controller is configured to select the combination bit allocation and filter length for encoding so as to minimize a Mean Squared Error (MSE) of a prediction error over an entire encoding frame.
The encoding apparatus described in claim 18, which encodes multi-channel audio using two encoders where a controller adaptively allocates encoding bits between the stages of the second encoder based on inter-channel correlation, considering the estimated performance of each stage, selects a combination of bit allocation and filter length to minimize a Mean Squared Error (MSE) of the prediction error over an entire encoding frame.
24. The encoding apparatus of claim 18 , wherein the first encoding stage includes an adaptive inter-channel prediction filter for a second-signal prediction based on the first signal representation and the second signal representation, and wherein said controller is configured to estimate the performance of at least said first encoding stage at least partly based on a signal prediction error, the signal prediction error being an estimated error of the second signal prediction.
The encoding apparatus described in claim 18, which encodes multi-channel audio using two encoders where a controller adaptively allocates encoding bits between the stages of the second encoder based on inter-channel correlation, considering the estimated performance of each stage, includes an adaptive inter-channel prediction filter in the first encoding stage for predicting the second signal based on the first, and the controller estimates performance partly based on the signal prediction error.
25. The encoding apparatus of claim 24 , wherein said controller is configured to assess the estimated performance of at least said first encoding stage based on assessment of an estimated quantization error as a function of the number of bits allocated for quantization of said adaptive inter-channel prediction filter.
The encoding apparatus described in claim 24, which includes an adaptive inter-channel prediction filter and estimates performance based on prediction error, assesses performance based on the estimated quantization error as a function of bits allocated for quantizing the adaptive inter-channel prediction filter.
26. The encoding apparatus of claim 24 , wherein said second encoding stage is configured to encode a representation of the signal prediction error received from said first encoding stage.
The encoding apparatus described in claim 24, which includes an adaptive inter-channel prediction filter and estimates performance based on prediction error, encodes a representation of the signal prediction error in the second encoding stage.
27. The encoding apparatus of claim 18 , wherein said controller is configured to assess an estimated performance of the first encoding stage of said second encoder as a function of a first number of encoding bits assumed to be allocated to said first encoding stage, and allocate said first number of encoding bits to said first encoding stage based on said assessment.
The encoding apparatus described in claim 18, which encodes multi-channel audio using two encoders where a controller adaptively allocates encoding bits between the stages of the second encoder based on inter-channel correlation, considering the estimated performance of each stage, assesses the estimated performance of the first encoding stage of the second encoder as a function of a first number of encoding bits assumed to be allocated to the first encoding stage, and allocates those bits based on the assessment.
28. The encoding apparatus of claim 27 , wherein the controller is configured to select a combination of i) frame division configuration of an encoding frame into a set of sub-frames, ii) number of bits to be allocated to said first encoding stage for each sub-frame, and iii) filter length to be used in said first encoding stage for each sub-frame, so as to optimize a measure representative of the performance of at least said first encoding stage over an entire encoding frame, and wherein the second encoder is configured to encode said second signal representation in each of the sub-frames of the selected set of sub-frames separately in accordance with the selected combination.
The encoding apparatus described in claim 27, which assesses the performance of the first encoding stage of the second encoder based on number of bits allocated to that stage, selects a combination of (i) frame division configuration, (ii) number of bits allocated to the first encoding stage for each sub-frame, and (iii) filter length for each sub-frame, to optimize performance of at least the first stage over the entire frame, and encodes the second signal representation in each sub-frame separately based on the selection.
29. The encoding apparatus of claim 27 , wherein the controller is configured to select a combination of number of bits to be allocated to said first encoding stage and filter length to be used in said first encoding stage so as to optimize a measure representative of the performance of at least said first encoding stage.
The encoding apparatus described in claim 27, which assesses the performance of the first encoding stage of the second encoder based on number of bits allocated to that stage, selects a combination of the number of bits allocated to the first encoding stage and the filter length to optimize performance of at least the first encoding stage.
30. The encoding apparatus of claim 18 , wherein the controller is configured to select a combination bit allocation and filter length for encoding so as to optimize a measure representative of the performance of said second encoder.
The encoding apparatus described in claim 18, which encodes multi-channel audio using two encoders where a controller adaptively allocates encoding bits between the stages of the second encoder based on inter-channel correlation, considering the estimated performance of each stage, selects a combination of bit allocation and filter length to optimize a performance measure of the second encoder.
31. The encoding apparatus of claim 30 , wherein said second encoder is configured to generate output data representative of the selected bit allocation and the filter length.
The encoding apparatus described in claim 30, which selects a bit allocation and filter length to optimize performance, generates output data representing the selected bit allocation and filter length.
32. The encoding apparatus of claim 18 , wherein the controller is configured to select a combination of frame division configuration of an encoding frame into a set of sub-frames, and bit allocation and filter length for encoding for each sub-frame, so as to optimize a measure representative of the performance of said second encoder over an entire encoding frame and wherein the second encoder is configured to encode said second signal representation in each of the sub-frames of the selected set of sub-frames separately in accordance with the selected combination.
The encoding apparatus described in claim 18, which encodes multi-channel audio using two encoders where a controller adaptively allocates encoding bits between the stages of the second encoder based on inter-channel correlation, considering the estimated performance of each stage, selects a combination of frame division, bit allocation, and filter length for each sub-frame to optimize performance of the second encoder over the entire frame, and encodes the second signal representation in each sub-frame separately.
33. The encoding apparatus of claim 32 , wherein said second encoder is configured to generate output data representative of the selected frame division configuration, and for each sub-frame of the selected frame division configuration, bit allocation and filter length.
The encoding apparatus described in claim 32, which selects frame division, bit allocation, and filter length to optimize performance of the second encoder, generates output data representing the selected frame division and, for each sub-frame, the bit allocation and filter length.
34. The encoding apparatus of claim 33 , wherein said second encoder is configured to select the filter length, for each sub frame, based on the length of the sub-frame so that an indication of frame division configuration of an encoding frame into a set of sub-frames at the same time provides an indication of selected filter dimension for each sub-frame to thereby reduce the required signaling.
The encoding apparatus described in claim 33, which selects frame division, bit allocation, and filter length to optimize performance of the second encoder, selects the filter length based on the sub-frame length so that the frame division implicitly indicates the selected filter dimension, reducing signaling needs.
35. An encoding apparatus configured to encode a multi-channel audio signal, the decoding apparatus comprising: input channels configured to receive the multi-channel audio signal; a first encoder configured to encode a first signal representation of at least one channel of said multi-channel audio signal; a second encoder configured to encode a second signal representation of at least one channel of said multi-channel audio signal, the second encoder being a multi-stage encoder comprising first and second encoding stages; and a controller configured to adaptively allocate a number of encoding bits among the first and second encoding stages of the second encoder based on inter-channel correlation characteristics of the multi-channel audio signal, wherein the controller is configured to adaptively allocate the number of encoding bits among the first and second encoding stages based on an estimated performance of the first and/or the second encoding stage; wherein said first and second encoding stages comprise hybrid parametric and non-parametric encoding stages, and wherein said controller is configured to allocate the encoding bits between the parametric encoding stage and the non-parametric encoding stage based on the inter-channel correlation characteristics; wherein the parametric encoding stage comprises an inter-channel prediction (ICP) filter and an associated first quantizer for quantization of the ICP filter, and wherein the non-parametric encoding stage comprises a second quantizer for quantization of a residual prediction error of the ICP filter.
An encoding apparatus encodes multi-channel audio with input channels, a first encoder for a first signal representation of at least one channel, a second, multi-stage encoder for a second signal representation of at least one channel, and a controller that adaptively allocates encoding bits between the stages of the second encoder based on inter-channel correlation, considering the estimated performance of each stage. The encoding stages are hybrid parametric and non-parametric, with bit allocation favoring the non-parametric stage when the parametric stage saturates. The parametric stage includes an inter-channel prediction (ICP) filter and a quantizer for the ICP filter, and the non-parametric stage includes a quantizer for the residual prediction error of the ICP filter.
36. An encoding method to encode a multi-channel audio signal, the encoding method comprising: receiving the multi-channel audio signal input channels; a first encoder encoding a first signal representation of at least one channel of said multi-channel audio signal in a first signal encoding process; a second encoder encoding a second signal representation of at least one channel of said multi-channel audio signal in a second signal encoding process, said second signal encoding process being a multi-stage encoding process comprising first and second encoding stages; and a controller adaptively allocating a number of encoding bits among the first and second encoding stages of the multi-stage signal encoding process based on inter-channel correlation characteristics of the multi-channel audio signal, wherein said adaptively allocating the number of encoding bits among the first and second encoding stages is performed based on an estimated performance of the first and/or the second encoding stage wherein the first and second encoding stages of the multi-stage encoding processes are such that the second encoding stage receives at least one output generated from the first encoding stage, wherein the first encoding stage comprises generating and quantizing a second signal prediction based on the first and second signal representations, the second signal prediction being a prediction of the section signal representation; and generating second signal prediction error based on the first and second signal representations, the second signal prediction error being an estimated error of the second signal prediction, wherein the second encoding stage comprises quantizing the second signal prediction error received from the first encoding stage, and wherein the number of encoding bits are adaptively allocated to encode the quantized second signal prediction and the quantized second signal prediction error, the encoding bits being transmitted to a multi-channel audio signal decoder.
An encoding method encodes multi-channel audio by receiving the audio, encoding a first signal representation using a first encoder, and encoding a second signal representation using a second, multi-stage encoder. A controller adaptively allocates encoding bits between the stages of the second encoder based on inter-channel correlation, considering the estimated performance of each stage. The second stage receives output from the first. The first stage generates and quantizes a prediction of the second signal and a prediction error. The second stage quantizes the prediction error received from the first. The encoding bits are adaptively allocated to encode both the quantized prediction and the quantized prediction error, and are then transmitted to a decoder.
37. An encoding apparatus configured to encode a multi-channel audio signal, the decoding apparatus comprising: input channels configured to receive the multi-channel audio signal; a first encoder configured to encode a first signal representation of at least one channel of said multi-channel audio signal; a second encoder configured to encode a second signal representation of at least one channel of said multi-channel audio signal, the second encoder being a multi-stage encoder comprising first and second encoding stages; and a controller configured to adaptively allocate a number of encoding bits among the first and second encoding stages of the second encoder based on inter-channel correlation characteristics of the multi-channel audio signal, wherein the controller is configured to adaptively allocate the number of encoding bits among the first and second encoding stages based on an estimated performance of the first and/or the second encoding stage; wherein the first and second encoding stages of the multi-stage encoder are in series such that the second encoding stage receives at least one output from the first encoding stage, wherein the first encoding stage is configured to generate and quantize a second signal prediction based on the first and second signal representations, the second signal prediction being a prediction of the section signal representation; and generating second signal prediction error based on the first and second signal representations, the second signal prediction error being an estimated error of the second signal prediction, wherein the second encoding stage is configured to quantize the second signal prediction error received from the first encoding stage, and wherein the controller is configured to adaptively allocate the number of encoding bits to encode the quantized second signal prediction and the quantized second signal prediction error, the encoding bits being transmitted to a multi-channel audio signal decoder.
An encoding apparatus encodes multi-channel audio with input channels, a first encoder for a first signal representation, a second, multi-stage encoder for a second signal representation, and a controller that adaptively allocates encoding bits between the stages of the second encoder based on inter-channel correlation, considering the estimated performance of each stage. The second stage receives output from the first. The first stage generates and quantizes a prediction of the second signal based on the first and second representations, along with a prediction error. The second stage quantizes the prediction error received from the first stage. The controller adaptively allocates bits to encode the quantized prediction and prediction error, transmitting those bits to a decoder.
38. An audio encoder configured to encode a multi-channel audio signal, the audio encoder comprising: input channels configured to receive the multi-channel audio signal; a main encoder configured to receive a main signal portion and to generate an encoded main signal frame based on the main signal portion, the main signal portion being a portion of a main representation of one or more channels of the multi-channel audio signal; a side encoder configured to receive a side signal portion and to generate an encoded side signal frame based on the side signal portion, the side signal portion being a portion of a side representation of the one or more channels of the multi-channel audio signal; and a controller configured to allocate bits of the encoded side signal frame so as to encode the corresponding side signal portion, wherein the side encoder is a multi-stage encoder comprising: a first stage encoder configured to generate and quantize an estimated side signal portion corresponding to the side signal portion, the estimated side signal portion representing an estimate of the corresponding side signal portion, the first stage encoder also being configured to generate a residual error portion representing an estimate of a difference between the side signal portion and the corresponding estimated side signal portion; and a second stage encoder configured to quantize the residual error portion received from the first stage encoder, wherein the encoded side signal frame includes the quantized estimated side signal portion and the quantized residual error portion, wherein the controller is configured to: estimate a performance of the first stage encoder in relation to a current encoded side signal frame being generated by the side encoder, and allocate bits of the current encoded side signal frame between a current estimated side signal portion and a current residual error portion based on the estimated performance of the first stage encoder in relation to the current encoded side signal frame.
An audio encoder encodes multi-channel audio using input channels, a main encoder for a main signal portion, and a side encoder for a side signal portion. A controller allocates bits within the encoded side signal frame to encode the side signal portion. The side encoder is multi-stage: the first stage generates and quantizes an estimated side signal portion and a residual error portion. The second stage quantizes the residual error. The controller estimates the first stage's performance for the current frame and allocates bits between the estimated side signal portion and the residual error based on that performance.
39. The audio encoder of claim 38 , wherein the side encoder is also configured to receive the main signal portion and/or the corresponding encoded main signal frame, and wherein the first stage encoder is configured to generate one or both of the estimated side signal portion and the residual error portion also based on the main signal portion and/or the corresponding encoded main signal frame.
The audio encoder described in claim 38, which uses main and side encoders and adaptively allocates bits in the side encoder, also provides the main signal portion or the encoded main frame to the side encoder, and the first stage generates the estimated side signal or residual error using either or both of these.
40. The audio encoder of claim 38 , wherein the controller is configured to receive the main and side signal portions, determine one or more inter-channel correlation characteristics of the multi-channel audio signal based on the main and side signal portions, the inter-channel correlation characteristics including a cross correlation between the main and side signal portions, and estimate the performance of the first stage encoder performance of the first stage encoder in relation to the current encoded side signal frame based on the inter-channel correlation characteristics of the multi-channel audio signal.
The audio encoder described in claim 38, which uses main and side encoders and adaptively allocates bits in the side encoder, receives the main and side signal portions and determines inter-channel correlation characteristics (including cross-correlation between the main and side signals). It then estimates the first stage's performance based on these correlation characteristics.
41. The audio encoder of claim 38 , wherein the first stage encoder is configured to generate inter-channel prediction (ICP) filter coefficients from filtering the main signal portion through a time varying finite impulse response (FIR) filter, the ICP filter coefficients representing the estimated side signal portion and being generated so as to minimize a side signal prediction error e(n) representing the residual error portion, and quantize the ICP filter coefficients, and wherein the second stage encoder is configured to quantize the side signal prediction error e(n).
The audio encoder described in claim 38, which uses main and side encoders and adaptively allocates bits in the side encoder, uses a first stage that generates inter-channel prediction (ICP) filter coefficients from filtering the main signal through a time-varying FIR filter. These coefficients represent the estimated side signal and minimize the side signal prediction error (residual error). The first stage then quantizes the ICP filter coefficients, and the second stage quantizes the prediction error.
42. The audio encoder of claim 38 , wherein the controller is configured to allocate a relatively greater portion of the bits of the encoded side signal frame to the residual error portion as the performance of the first stage encoder saturates.
The audio encoder described in claim 38, which uses main and side encoders and adaptively allocates bits in the side encoder, allocates more bits to the residual error portion as the performance of the first stage saturates.
43. An encoding method to encode a multi-channel audio signal, the encoding method comprising: receiving the multi-channel audio signal over input channels; a first encoder encoding a first signal representation of at least one channel of said multi-channel audio signal in a first signal encoding process; a second encoder encoding a second signal representation of at least one channel of said multi-channel audio signal in a second signal encoding process, said second signal encoding process being a multi-stage encoding process comprising first and second encoding stages; and a controller adaptively allocating a number of encoding bits among the first and second encoding stages of the multi-stage signal encoding process based on inter-channel correlation characteristics of the multi-channel audio signal, wherein said adaptively allocating the number of encoding bits among the first and second encoding stages is performed based on an estimated performance of the first and/or the second encoding stage; wherein said first and second encoding stages of the multi-stage signal encoding process comprise a hybrid parametric and non-parametric encoding stages, and wherein the encoding bits are allocated between the parametric encoding stage and the non-parametric encoding stage based on the inter-channel correlation characteristics; wherein the parametric encoding stage comprises an inter-channel prediction (ICP) filter and an associated first quantizer for quantization of the ICP filter, and wherein the non-parametric encoding stage comprises a second quantizer for quantization of a residual prediction error of the ICP filter.
This invention relates to multi-channel audio signal encoding, addressing the challenge of efficiently compressing audio signals while preserving quality. The method receives a multi-channel audio signal and processes it through two encoders. The first encoder generates a signal representation of at least one channel using a single-stage encoding process. The second encoder uses a multi-stage encoding process, comprising a parametric stage and a non-parametric stage. The parametric stage includes an inter-channel prediction (ICP) filter and a quantizer for the filter coefficients, while the non-parametric stage quantizes the residual prediction error from the ICP filter. A controller dynamically allocates encoding bits between the two stages based on the inter-channel correlation characteristics of the audio signal, optimizing bit distribution to improve encoding efficiency. The allocation is determined by estimating the performance of each stage, ensuring that more bits are assigned to the stage that provides better compression or quality. This adaptive approach enhances encoding efficiency by leveraging the correlation between audio channels, reducing redundancy while maintaining audio fidelity. The hybrid parametric and non-parametric encoding stages allow for flexible bit allocation, adapting to varying signal characteristics.
Cooperative Patent Classification codes for this invention. Click any code to explore related patents in that topic.
December 22, 2005
April 18, 2017
Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.