Chip123 科技應用創新平台

 找回密碼
 申請會員

QQ登錄

只需一步,快速開始

Login

用FB帳號登入

搜索
1 2 3 4
查看: 3355|回復: 0
打印 上一主題 下一主題

Advanced Audio Coding

[複製鏈接]
跳轉到指定樓層
1#
發表於 2007-1-13 20:08:43 | 只看該作者 回帖獎勵 |倒序瀏覽 |閱讀模式
Advanced Audio Coding (AAC) is a standardized, lossy digital audio compression scheme. It's a way of compressing audio files, such as WAV, AIFF or imported from a CD. AAC gets a better quality sound than MP3 for the same compression level or smaller file sizes for the same quality audio. For example: A 192 kbps song compressed by MP3 will be the same quality as a 128 kbps AAC file, which will be smaller in overall size. It's most commonly used as the standard format for compressing audio CD's for Apple's iPod and iTunes. Apple also uses this as their format for selling from the iTunes Store, although these files are restricted using Apple's own DRM, known as FairPlay. It was also used as the standard audio file, for SONY's Playstation 2. It's also used as the audio for the .M4V format that Apple uses for it's iPod with Video.7 t& q( E1 e7 n5 `; U3 d
  j4 x- N7 x' r7 |7 t- R
History0 h$ s- N. D9 O0 V: T7 Y# ]0 A1 |
AAC was developed with the cooperation and contributions of companies mainly including Dolby, Fraunhofer (FhG), AT&T, Sony and Nokia, and was officially declared an international standard by the Moving Pictures Experts Group in April 1997.- F, o4 j0 i0 A4 p: o- g/ G
' x1 }4 D7 \3 w# D1 P2 V
Standardization& M& t1 r6 W1 U* [
It is specified both as Part 7 of the MPEG-2 standard, and Part 3 of the MPEG-4 standard.
) Y9 c# s3 ]" `6 d; n5 [
% L( g. w1 i2 ^/ ]" PAs such, it can be referred to as MPEG-2 Part 7 and MPEG-4 Part 3 depending on its implementation, however it is most often referred to as MPEG-4 AAC, or AAC for short.
, E% K% l+ t0 U2 ^9 q2 ^- D& f) l& F% C0 B
AAC's improvements over MP3+ w- H; Y4 r9 P( L" l
AAC was designed to have better performance than MP3 (which was specified in MPEG-1 and MPEG-2) by the ISO/IEC in 11172-3 and 13818-3.
9 W# f7 M3 A5 A2 n/ J  [+ x5 l+ E) y8 J, d: X2 N+ A6 q+ g& M! Q
Improvements include:
) ]" \+ }$ {9 a. K" x4 y6 k) k! N* F: q5 F& ~$ |: G$ ]
More Sample frequencies (from 8 kHz to 96 kHz) than MP3 (16 kHz to 48 kHz) & r6 w: H# C# M( A9 a7 |
Up to 48 channels (MP3 supports up to two channels in MPEG-1 mode and up to 5.1 channels in MPEG-2 mode)
$ p+ u& b4 M. q' @/ {( X% r1 z! fHigher efficiency and simpler filterbank (hybrid → pure MDCT)
% ]9 P7 q& K, k; m( M. ^9 PHigher coding efficiency for stationary signals (blocksize: 576 → 1024 samples) / O9 q* F" m  A/ E( T
Higher coding efficiency for transient signals (blocksize: 192 → 128 samples) - ^% H% W; D) G4 @/ @
Can use Kaiser-Bessel derived window function to eliminate spectral leakage at the expense of widening the main lobe   O- g5 c; v* o. V2 r) c
Much better handling of frequencies above 16 kHz
. ]! W$ j3 q" J" ^1 ]4 o3 W( a# KMore flexible joint stereo (separate for every scale band) 3 K: g- s* p" g; v& A
This gives developers more flexibility to design codecs that offer more efficient compression as compared to MP3. However in terms of whether AAC is better than MP3, the advantages of AAC are not entirely conclusive, and the MP3 specification, while outdated, has proven surprisingly robust. AAC and HE-AAC are better than MP3 at very low bitrates, however at medium to higher bitrates, the two formats are more comparable.
# Q' v, O2 U, E' G- Z* `% x3 V5 K7 D5 @8 a
Marketing aspects
& L2 w: _; x  l, O8 u8 k
AAC was promoted as the successor to MP3 for audio coding at medium to high bitrates. Its popularity is currently maintained by it being the default Apple iTunes codec, the media player which powers iPod, the most popular digital audio player on the market. [1] Furthermore, the iTunes Store, whose sales account for 85% of the market for legal online downloads, [2] sells AAC-encoded songs (encapsulated with FairPlay Digital Rights Management).
# Q7 H% X; K( u9 H4 z: s  l8 v! m- r; E( C
How AAC works
  n8 a4 v7 E; L% D; L) c8 N
AAC is a wideband audio coding algorithm that exploits two primary coding strategies to dramatically reduce the amount of data needed to represent high-quality digital audio.. a/ z' o. n) G: T' R3 d
8 z2 o& z  @( b, }+ u8 M# g) }3 w0 ~
Signal components that are perceptually irrelevant are discarded; 1 E- g! A9 f$ P: y- \' ?
Redundancies in the coded audio signal are eliminated; - B! g' P  L! T  ~* f" W: ~6 k
The signal is processed by a modified discrete cosine transform (MDCT) according to its complexity;
1 `4 h6 I# G% BInternal error correction codes are added;
; k7 ]& v; [, v3 Q' d7 E8 GThe signal is stored or transmitted. 0 d3 _/ r2 c7 b5 @$ L! e7 I' Q2 a
The MPEG-4 audio standard does not define a single or small set of highly efficient compression schemes but rather a complex toolbox to perform a wide range of operations from low bitrate speech coding to high-quality audio coding and music synthesis.2 z# S0 l1 b$ a. c

" \" Z: o) [5 YThe MPEG-4 audio coding algorithm family spans the range from low bitrate speech encoding (down to 2 Kbit/s) to high-quality audio coding (at 64 Kbit/s per channel and higher). * \* L- m, g. F% o! |
AAC offers sampling frequencies between 8 kHz and 96 kHz and any number of channels between 1 and 48.
# n7 `" T$ g# Z4 ?5 j2 SIn contrast to MP3's hybrid filter bank, AAC uses the modified discrete cosine transform (MDCT) together with the increased window lengths of 1024 points. AAC is much more capable of encoding audio with streams of complex pulses and square waves than MP3 or MP2. ' d& w/ R  g2 }5 y
AAC encoders can switch dynamically between a single MDCT block of length 1024 points or 8 blocks of 128 points.# X" Q: ^. _) ?  w! J# E
! Z# c' o/ B, ^. Z8 I" F3 L
If a signal change or a transient occurs, 8 shorter windows of 128 points each are chosen for their better temporal resolution. " i- E* _+ F- l+ e4 i* D4 p
By default, the longer 1024-point window is otherwise used because the increased frequency resolution allows for a more sophisticated psychoacoustic model, resulting in improved coding efficiency.
# q. ?7 m1 }; I1 d2 K! d3 k; S
" Y* i2 k2 Q8 F' e. \ Modular encoding1 K& r% \7 i& z8 S
AAC takes a modular approach to encoding. Depending on the complexity of the bitstream to be encoded, the desired performance and the acceptable output, implementers may create profiles to define which of a specific set of tools they want use for a particular application. The standard offers four default profiles:
. V3 z. t4 @" h1 U$ ~4 M7 f0 W; a7 j4 g
Low Complexity (LC) - the simplest and most widely used and supported; 7 V# T7 a7 x4 ]/ c. l& |
Main Profile (MAIN) - like the LC profile, with the addition of backwards prediction; 3 O% K+ p  q# S" T7 d
Sample-Rate Scalable (SRS), a.k.a. Scalable Sample Rate (MPEG-4 AAC-SSR);
3 i0 \! h- }; i- V+ jLong Term Prediction (LTP); added in the MPEG-4 standard - an improvement of the MAIN profile using a forward predictor with lower computational complexity. . u, G, ]' ~" z0 H9 @
Depending on the AAC profile and the MP3 encoder, 96 kbit/s AAC can give nearly the same or better perceptional quality as 128 kbit/s MP3.[1]
% }' \' b( j3 R1 H' A" ]; ?) G+ I3 X- B1 ~, G/ s% _0 A
AAC Low Delay. g4 S0 x; r  u
The MPEG-4 Low Delay Audio Coder (AAC-LD) is designed to combine the advantages of perceptual audio coding with the low delay necessary for two-way communication. It is closely derived from the MPEG-2 Advanced Audio Coding (AAC) format.
- A. M& m: P9 v) Q+ g9 X4 C* d0 }1 ^
0 _3 {/ y4 c0 k, G- p, P, oThe most stringent requirements are a maximum algorithmic delay of only 20 ms and a good audio quality for all kind of audio signals including speech and music. The AAC-LD coding scheme bridges the gap between speech coding schemes and high quality audio coding schemes.5 P) n* E* B) q' {

/ `3 q3 Y3 k2 N' b! qAAC Low Delay compared to normal AAC codecs and ITU speech audio compression systems.
# b" i6 p0 z" x+ L# G0 W& V8 Z9 ^6 K" @! j. n. f
AAC Error protection toolkit+ i9 M1 e/ C2 v! L& E
Applying error protection enables error correction up to a certain extent. Error correcting codes are usually applied equally to the whole payload.) w5 C" a2 N) n+ x1 J
- p$ Q$ l6 S7 O& r2 T9 y
But since different parts of an AAC payload show different sensitivity to transmission errors, this would not be a very efficient approach.
" Z( S; x% b7 g1 V9 _3 l
  [! T: a$ G/ U- o: @0 G2 wThe AAC payload can be subdivided into parts with different error sensitivities.
) s( \+ |* o9 f! V  Y" P1 B+ w+ r% `2 W6 S% ]; U* L1 g: b& u
Independent error correcting codes can be applied to any of these parts using the Error Protection (EP) tool defined in MPEG-4 Audio. & G$ I3 ~/ u3 Q  ~' F! K, O) K
This toolkit provides the error correcting capability to the most sensitive parts of the payload in order to keep the additional overhead low.
$ _- q# J# D9 n( o$ L& o4 w& R8 |# n% t+ [& L
Error Resilient (ER) AAC
$ q- ^9 Q  d. Z3 c% A- s6 y5 K  }% E+ Y5 _
Error Resilience (ER) techniques can be used to make the coding scheme itself more robust against errors.
+ a1 ~. a* R4 d3 d. \& a2 L# A4 o' {+ P
For AAC, three custom-tailored methods were developed and defined in MPEG-4 Audio# J/ `9 T  D/ T7 f9 k, v( T) b

3 |  r; a% U$ f* `: E7 E' D0 sHuffman Codeword Reordering (HCR) to avoid error propagation within spectral data;
! P. I( t" J, K5 dVirtual Codebooks (VCB11) to detect serious errors within spectral data;
( J% i: J9 a: I& ?Reversible Variable Length Code (RVLC) to reduce error propagation within scale factor data. 4 b. h( o2 k5 d" u. d3 I4 `2 T+ u" n1 t

  T0 `6 Y' F* x' MAAC ISO standard1 U% K, _. a+ e$ \: c7 ]
AAC, which was first specified in the standard known formally as ISO/IEC 13818-7, was published in 1997 as a new "part" (distinct from ISO/IEC 13818-3) in the MPEG-2 family of international standards.
# {- L: E; g4 w' I, x  s: b5 j% g+ Z1 ?
[ 本帖最後由 masonchung 於 2007-1-20 02:43 PM 編輯 ]
分享到:  QQ好友和群QQ好友和群 QQ空間QQ空間 騰訊微博騰訊微博 騰訊朋友騰訊朋友
收藏收藏 分享分享 頂 踩 分享分享
您需要登錄後才可以回帖 登錄 | 申請會員

本版積分規則

首頁|手機版|Chip123 科技應用創新平台 |新契機國際商機整合股份有限公司

GMT+8, 2024-5-16 08:44 AM , Processed in 0.113514 second(s), 17 queries .

Powered by Discuz! X3.2

© 2001-2013 Comsenz Inc.

快速回復 返回頂部 返回列表