home *** CD-ROM | disk | FTP | other *** search
Wrap
Text File | 1991-12-13 | 108.7 KB | 4,306 lines
.rs .\" Troff code generated by TPS Convert from ITU Original Files .\" Not Copyright ( c) 1991 .\" .\" Assumes tbl, eqn, MS macros, and lots of luck. .TA 1c 2c 3c 4c 5c 6c 7c 8c .ds CH .ds CF .EQ delim @@ .EN .nr LL 40.5P .nr ll 40.5P .nr HM 3P .nr FM 6P .nr PO 4P .nr PD 9p .po 4P .rs \v | 5i' .sp 1P .ce 1000 \v'3P' SECTION\ 3 .ce 0 .sp 1P .ce 1000 \fBTRANSMISSION\ STANDARDS\fR \v'1P' .ce 0 .sp 1P .sp 2P .LP \fBRecommendation\ P.48\fR .RT .sp 2P .sp 1P .ce 1000 \fBSPECIFICATION\ FOR\ AN\ INTERMEDIATE\ REFERENCE\ SYSTEM\fR .EF '% Volume\ V\ \(em\ Rec.\ P.48'' .OF '''Volume\ V\ \(em\ Rec.\ P.48 %' .ce 0 .sp 1P .ce 1000 \fI(Geneva, 1976; amended at Geneva, 1980,\fR .sp 9p .RT .ce 0 .sp 1P .ce 1000 \fIMalaga\(hyTorremolinos, 1984, Melbourne, 1988)\fR .ce 0 .sp 1P .LP \fISummary\fR .sp 1P .RT .PP This Recommendation intends to specify the intermediate reference system (IRS) to be used for defining loudness ratings. The description should be sufficient to enable equipment having the required characteristics to be reproduced in different laboratories and maintained to standardized performance. .RT .sp 2P .LP \fB1\fR \fBDesign objectives\fR .sp 1P .RT .PP The chief requirements to be satisfied for an intermediate reference system to be used for tests carried out on handset telephones .FS For other types of telephone, e.g.\ headset or loudspeaking .PP telephone, a different IRS will be required. The IRS is specified for the range 100\(hy5000\ Hz. The nominal range 300\(hy3400\ Hz specified is intended to be consistent with the nominal 4\ kHz spacing of FDM systems, and should not be interpreted as restricting improvements in transmission quality which might be obtained by extending the transmitted frequency bandwidth. .FE are as follows: .RT .LP a) the circuit must be stable and specifiable in its electrical and electro\(hyacoustic performance. The calibration of the equipment should be traceable to national standards; .LP b) the circuit components that are seen and touched by the subjects should be similar in appearance and \*Qfeel\*U to normal types of subscribers' equipment; .LP c) the sending and receiving parts should have frequency bandwidths and response shapes standardized to represent commercial telephone circuits; .LP d) the system should include a junction which should provide facilities for the insertion of loss, and other circuit elements such as filters or equalizers; .LP e) the system should be capable of being set up and maintained with relatively simple test equipment. .PP \fINote\fR \ \(em\ The requirements of a) to d) have been met in the initial design of the IRS by basing the sending and receiving frequency responses on the mean characteristics of a large number of commercial telephone circuits and confining the bandwidths to the nominal range 300\(hy3400\ Hz. .RT .LP .sp 1 .bp .PP Since the detailed design of an IRS may vary between different Administrations, the following specification defines only those essential characteristics required to ensure standardization of the performance of the\ IRS. .PP The principles of the IRS are described and its nominal sensitivities are given in \(sc\(sc\ 2, 3, 4\ and 5\ below; requirements concerning stability, tolerances, noise limits, crosstalk and distortion are dealt with in \(sc\(sc\ 6 to\ 9\ below. Some information concerning secondary characteristics is given in \(sc\ 10\ below. .PP Certain information concerning installation and maintenance are given in\ [1]. .RT .sp 2P .LP \fB2\fR \fBUse of the IRS\fR .sp 1P .RT .PP The basic elements of the IRS comprise: .RT .LP a) the sending part, .LP b) the receiving part, .LP c) the junction. .PP When one example each of\ a), b) and\ c) are assembled, calibrated and interconnected, a reference (unidirectional) speech path is formed, as shown in Figure\ 1/P.48. For performing loudness rating determinations, suitable switching facilities are also required to allow the reference sending and receiving parts to be interchanged with their commercial counterparts. .LP .rs .sp 16P .ad r \fBFigure 1/P.48 p.\fR .sp 1P .RT .ad b .RT .sp 2P .LP \fB3\fR \fBPhysical characteristics of handsets\fR .sp 1P .RT .PP The sending and receiving parts of an IRS shall each include a handset symmetrical about its longitudinal place and the profile produced by a section through this plane should, for the sake of standardization, conform to the dimensions indicated in Figure\ 1/P.35. In practice, any convenient form may be considered use being made, for example, of handsets of the same type as those used by an Administration in its own network. The general shape of the complete handset shall be such that, in normal use, the position of the earcap on the ear shall be as definite as possible, and not subject to excessive variation. .RT .PP The microphone capsule , when placed in the handset, shall be capable of calibration in accordance with the method described in Recommendation\ P.64. The earcap shall be such that it can be sealed on the circular knife\(hyedge of the IEC/CCITT artificial ear for calibration in accordance with IEC\ 318, and the contour of the earcap shall be suitable for defining the ear reference point as described in Annex\ A to Recommendation\ P.64. .bp .PP Transducers shall be stable and linear, and their physical design shall be such that they can be fitted in the handset chosen. A handset shall always contain both microphone and earphone capsules, irrespective of whether either is inactive during tests. The weight of a handset, so equipped, shall not exceed 350\ g. .RT .sp 2P .LP \fB4\fR \fBSubdivision of the complete IRS and impedances at the interfaces\fR .sp 1P .RT .PP Figure\ 1/P.48 shows the composition of the complete IRS, subdivided as specified in \(sc\ 2\ above. The principal features of the separate parts are considered below. .RT .sp 1P .LP 4.1 \fISending part\fR .sp 9p .RT .PP The sending part of the IRS is defined as the portion\ A\(hyJS extending from the handset microphone\ A to the interface with the junction at\ JS. The sending part shall include such amplification and equalization as necessary to ensure that the requirements of \(sc\(sc\ 5.1 and\ 7 below are satisfied. .PP The return loss of the impedance at JS, towards\ A, against 600 /0\(de \ ohms, when the sending part is correctly set up and .PP calibrated, shall be not less than 20\ dB over a frequency range 200\(hy4000\ Hz, and not less than 15\ dB over a frequency range 125\(hy6300\ Hz. .RT .sp 1P .LP 4.2 \fIReceiving part\fR .sp 9p .RT .PP The receiving part of the IRS is defined as the portion JR\(hyB extending from the interface with the junction at JR to the handset earphone at\ B. The receiving part shall include such amplification and equalization as necessary to ensure that the requirements of \(sc\(sc\ 5.2 and\ 7 below are satisfied. .PP The return loss of the impedance at JR, towards\ B, against 600 /0\(de \ ohms, when the receiving part is correctly set up and calibrated, shall be not less than 20\ dB over a frequency range 200\(hy4000\ Hz, and not less than 15\ dB over a frequency range 125\(hy6300\ Hz. .RT .sp 1P .LP 4.3 \fIJunction\fR .sp 9p .RT .PP For loudness balance and sidetone tests, the junction of the IRS shall comprise means of introducing known values of attenuation between the sending and receiving parts, and shall consist of a calibrated 600\ ohm attenuator having a maximum value of not less than 100\ dB .PP (e.g. 10\ \(mu\ 10\ dB\ +\ 10\ \(mu\ 1\ dB\ +\ 10\ \(mu\ 0.1\ dB) .RT .LP and having a tolerance, when permanently fitted and wired in position in the equipment, of not more than\ \(+- | % of the dial reading or 0.1\ dB, whichever is numerically greater. Provision shall be made for the inclusion of additional circuit elements (e.g.\ attenuation/frequency distortion) in the junction. The circuit configuration of such additional elements shall be compatible both with that of the attenuator and the junction interfaces. The return loss of the junction against 600 /0\(de \ ohms, both with and without any additional circuit elements, shall be not less than 20\ dB over a frequency range 200\(hy4000\ Hz, and not less than 15\ dB over a frequency range 125\(hy6300\ Hz. For these tests, the port other than that being measured shall be closed with 600 /0\(de \ ohms. .sp 2P .LP \fB5\fR \fBNominal sensitivities of sending and receiving parts\fR .sp 1P .RT .PP The absolute values given below are provisional and may require changes to some extent as a result of the study of Question\ 19/XII\ [2]. .RT .sp 1P .LP 5.1 \fISending part\fR .sp 9p .RT .PP The sending sensitivity, \fIS\fR\d\fIm\fR\\d\fIJ\fR\uis given in Table\ 1/P.48, column\ (2) (see\ [3]). .RT .LP .sp 1P .LP 5.2 \fIReceiving part\fR .sp 9p .RT .PP The receiving sensitivity, \fIS\fR\d\fIJ\fR\\d\fIe\fR\u, on a CCITT/IEC measured artificial ear (see Recommendation\ P.64) is given in Table\ 1/P.48, column\ (3) (see\ [3]). .bp .RT .ce \fBH.T. [T1.48]\fR .ce TABLE\ 1/P.48 .ce \fBNominal sending sensitivities and receiving sensitivities of the .ce IRS\fR .ce (These values were adopted provisionally) .ps 9 .vs 11 .nr VS 11 .nr PS 9 .TS center box; cw(48p) | cw(48p) | cw(48p) . Frequency (Hz) \fIS\fR \d\fImJ\fR \u { \fIS\fR \d\fIJe\fR \u } _ .T& cw(48p) | cw(48p) | cw(48p) . dB V/Pa dB Pa/V (1) _ .T& cw(48p) | cw(48p) | cw(48p) . (2) (3) _ .T& cw(48p) | cw(48p) | cw(48p) . \ 100 \(em45.8 \(em27.5 .T& cw(48p) | cw(48p) | cw(48p) . \ 125 \(em36.1 \(em18.8 .T& cw(48p) | cw(48p) | cw(48p) . \ 160 \(em25.6 \(em10.8 .T& cw(48p) | cw(48p) | cw(48p) . \ 200 \(em19.2 \ \(em2.7 .T& cw(48p) | cw(48p) | cw(48p) . \ 250 \(em14.3 \ \ 2.7 .T& cw(48p) | cw(48p) | cw(48p) . \ 300 \(em11.3 \ \ 6.4 .T& cw(48p) | cw(48p) | cw(48p) . \ 315 \(em10.8 \ \ 7.2 .T& cw(48p) | cw(48p) | cw(48p) . \ 400 \ \(em8.4 \ \ 9.9 .T& cw(48p) | cw(48p) | cw(48p) . \ 500 \ \(em6.9 \ 11.3 .T& cw(48p) | cw(48p) | cw(48p) . \ 600 \ \(em6.3 \ 11.8 .T& cw(48p) | cw(48p) | cw(48p) . \ 630 \ \(em6.1 \ 11.9 .T& cw(48p) | cw(48p) | cw(48p) . \ 800 \ \(em4.9 \ 12.3 .T& cw(48p) | cw(48p) | cw(48p) . 1000 \ \(em3.7 \ 12.6 .T& cw(48p) | cw(48p) | cw(48p) . 1250 \ \(em2.3 \ 12.5 .T& cw(48p) | cw(48p) | cw(48p) . 1600 \ \(em0.6 \ 13.0 .T& cw(48p) | cw(48p) | cw(48p) . 2000 \ \ 0.3 \ 13.1 .T& cw(48p) | cw(48p) | cw(48p) . 2500 \ \ 1.8 \ 13.1 .T& cw(48p) | cw(48p) | cw(48p) . 3000 \ \ 1.5 \ 12.5 .T& cw(48p) | cw(48p) | cw(48p) . 3150 \ \ 1.8 \ 12.6 .T& cw(48p) | cw(48p) | cw(48p) . 3500 \ \(em7.3 \ \ 3.9 .T& cw(48p) | cw(48p) | cw(48p) . 4000 \(em37.2 \(em31.6 .T& cw(48p) | cw(48p) | cw(48p) . 5000 \(em52.2 \(em54.9 .T& cw(48p) | cw(48p) | cw(48p) . 6300 \(em73.6 \(em67.5 .T& cw(48p) | cw(48p) | cw(48p) . 8000 \(em90.0 \(em90.0 _ .TE .nr PS 9 .RT .ad r \fBTable 1/P.48 [T1.48], p.\fR .sp 1P .RT .ad b .RT .sp 2P .LP .sp 4 \fB6\fR \fBStability\fR .sp 1P .RT .PP The stability should be maintained, under reasonable ranges of ambient temperature and humidity, at least during the period between routine recalibrations. (See also\ [1).) .RT .sp 2P .LP \fB7\fR \fBShapes and tolerances on sensitivities of sending and receiving\fR \fBparts\fR .sp 1P .RT .PP The shape of the sensitivity/frequency characteristics of the sending and receiving parts of the IRS shall lie within the limits of masks formed by Table\ 2/P.48 and plotted in Figures\ 2/P.48 and 3/P.48. The sending and receiving loudness ratings shall both be set to 0\ \(+-\ 0.2\ dB when calculated in accordance with the principles laid down in Recommendation\ P.79. .PP \fINote\fR \ \(em\ One excursion above or one excursion below the limits is permitted provided that: .RT .LP a) the excursion is no greater than 2 dB above the upper or below the lower limit; .LP b) the width of the excursion as it breaks the appropriate limit is no greater than 1/10th of the frequency at the maximum or minimum of the excursion. .bp .ce \fBH.T. [T2.48]\fR .ce TABLE\ 2/P.48 .ce \fBCoordinates of sending and receiving sensitivity limit curves\fR .ps 9 .vs 11 .nr VS 11 .nr PS 9 .TS center box; cw(42p) | cw(48p) | cw(48p) | cw(42p) | cw(48p) . Limite curve Frequency (Hz) { Sending sensitivity (dB with respect to an arbitrary level) } Frequency (Hz) { Receiving sensitivity (dB with respect to an arbitrary level) } _ .T& lw(42p) | lw(48p) | lw(48p) | lw(42p) | lw(48p) . Upper limit { \ 100 \ 200 \ 400 3400 3600 6000 } { \(em41 \(em16 \ \(em6 \ +6 \ +4 \(em60 } { \ 100 \ 200 \ 300 \ 500 3400 3600 4500 } { \(em24 \ \ 0 \ +9 +14 +16 +13 \(em40 } _ .T& lw(42p) | lw(48p) | lw(48p) | lw(42p) | lw(48p) . Lower limit { Under 200 \ 200 \ 400 3000 3400 Over 3400 } { \(em\(if \(em21 \(em11 \ \(em1 \ \(em4 \(em\(if } { Under 200 \ 200 \ 300 \ 500 3200 3400 Over 3400 } { \(em\(if \(em20 \ +4 \ +9 +10 \ +4 \(em\(if } _ .TE .nr PS 9 .RT .ad r \fBTableau [T2.48] p. 3\fR .sp 1P .RT .ad b .RT .LP .rs .sp 25P .ad r \fBFigure 2/P.48, p. 4\fR .sp 1P .RT .ad b .RT .LP .bp .LP .rs .sp 25P .ad r \fBFigure 3/P.48, p. 5\fR .sp 1P .RT .ad b .RT .sp 2P .LP \fB8\fR \fBNoise limits\fR .sp 1P .RT .PP It is important that the noise level in the system be well controlled. See\ [4]. .RT .LP .sp 2P .LP \fB9\fR \fBNonlinear distortion\fR .sp 1P .RT .PP In order to ensure that nonlinear distortion will be negligible with the vocal levels normally used for loudness rating, requirements in respect of distortion shall be met. .RT .sp 2P .LP \fB10\fR \fBComplete specifications\fR .sp 1P .RT .PP Certain secondary characteristics of an IRS may be included in Administrations' specifications. Particularly, special care must be given to adjustable components, stability and tolerances, crosstalk, installation and maintenance operations,\ etc. Reference\ [1] gives some guidance on these points. .RT .sp 2P .LP \fBReferences\fR .sp 1P .RT .LP [1] \fIPrecautions to be taken for correct installation and maintenance of\fR \fIan IRS\fR , Orange Book, Vol.\ V, Supplement No.\ 1, ITU, Geneva,\ 1977. .LP [2] CCITT \(em Question 19/XII, Contribution COM XII\(hyNo.\ 1, Study Period 1985\(hy1988, ITU, Geneva,\ 1985. .LP [3] \fIPrecautions to be taken for correct installation and maintenance of\fR \fIan IRS\fR , Orange Book, Vol.\ V, Supplement No.\ 1, \(sc\ 9.2, ITU, Geneva,\ 1977. .LP [4] \fIIbid.\fR , \(sc\ 5. .LP .bp .sp 1P .ce 1000 \v'3P' SECTION\ 4 .ce 0 .sp 1P .ce 1000 \fBOBJECTIVE\ MEASURING\ APPARATUS\fR .ce 0 .sp 1P .sp 2P .LP \fBRecommendation\ P.50\fR .RT .sp 2P .sp 1P .ce 1000 \fBARTIFICIAL\ VOICES\fR .EF '% Volume\ V\ \(em\ Rec.\ P.50'' .OF '''Volume\ V\ \(em\ Rec.\ P.50 %' .ce 0 .sp 1P .ce 1000 \fI(Melbourne, 1988)\fR .sp 9p .RT .ce 0 .sp 1P .LP The\ CCITT, .sp 1P .RT .sp 1P .LP \fIconsidering\fR .sp 9p .RT .PP (a) that it is highly desirable to perform objective telephonometric measurements by means of a mathematically defined signal reproducing the characteristics of human speech; .PP (b) that the standardization of such a signal is a subject for general study by the CCITT, .sp 1P .LP \fIrecommends\fR .FS The specifications given here are subject to future enhancement and therefore should be regarded as provisional. .FE .sp 9p .RT .PP the use of the artificial voice described in this Recommendation. .PP \fINote 1\fR \ \(em\ For objective loudness rating measurements, less sophisticated signals such as pink noise or spectrum\(hyshaped Gaussian noise can be used instead of the artificial voice. .PP \fINote 2\fR \ \(em\ The artificial voice here recommended has not yet been exhaustively tested in all possible applications; further studies being carried out within Question\ 14/XII. .RT .sp 2P .LP \fB1\fR \fBIntroduction\fR .sp 1P .RT .PP The signal here described reproduces the characteristics of human speech for the purposes of characterizing linear and nonlinear telecommunication systems and devices, which are intended for the transduction or transmission of speech. It is known that for some purposes, such as objective loudness rating measurements , more simple signals can be used as well. Examples of such signals are pink noise or spectrum\(hyshaped Gaussian noise, which nevertheless cannot be referred to as \*Qartificial voice\*U for the purpose of this Recommendation. .PP The artificial voice is a signal that is mathematically defined and that reproduces the time and spectral characteristics of speech which significantly affect the performances of telecommunication systems [1]. Two kinds of artificial voice are defined, reproducing respectively the spectral characteristics of female and male speech. .PP The following time and spectral characteristics of real speech are reproduced by the artificial voice: .RT .LP a) long\(hyterm average spectrum, .LP b) short\(hyterm spectrum, .LP c) instantaneous amplitude distribution, .LP d) voiced and unvoiced structure of speech waveform, .LP e) syllabic envelope. .bp .sp 2P .LP \fB2\fR \fBScope, purpose and definition\fR .sp 1P .RT .sp 1P .LP 2.1 \fIScope and purpose\fR .sp 9p .RT .PP The artificial voice is aimed at reproducing the characteristics of real speech over the bandwidth 100\ Hz \(em 8\ kHz. It can be utilized for characterizing many devices, e.g.\ carbon microphones, loudspeaking telephone sets, nonlinear coders, echo controlling devices, syllabic compandors, nonlinear systems in general. .PP The use of the artificial voice instead of real speech has the advantage of both being more easily generated and having a smaller variability than samples of real voice. .PP Of course, when a particular system is tested, the characteristics of the transmission path preceding it are to be considered. The actual test signal has then to be produced as the convolution between the artificial voice and the path response. .RT .sp 1P .LP 2.2 \fIDefinition\fR .sp 9p .RT .PP The artificial voice is a signal, mathematically defined, which reproduces all human speech characteristics, relevant to the characterization of linear and nonlinear telecommunication systems. It is intended to give a satisfactory correlation between objective measurements and real speech tests. .RT .sp 2P .LP \fB3\fR \fBTerminology\fR .sp 1P .RT .PP The artificial voice can be produced both as an electric or as an acoustic signal, according to the system or device under test (e.g.\ communication channels, coders, microphones). The following definitions apply with reference to Figure\ 1/P.50. .RT .LP .rs .sp 15P .ad r \fBFigure 1/P.50, p.\fR .sp 1P .RT .ad b .RT .sp 1P .LP 3.1 \fIelectrical artificial voice\fR .sp 9p .RT .PP The artificial voice produced as an electrical signal, used for testing transmission channels or other electric devices. .RT .sp 1P .LP 3.2 \fIartificial mouth excitation signal\fR .sp 9p .RT .PP A signal applied to the artificial mouth in order to produce the acoustic artificial voice. It is obtained by equalizing the electrical artificial voice for compensating the sensitivity/frequency characteristic of the mouth. .PP \fINote 1\fR \ \(em\ The equalization depends on the particular artificial mouth employed and can be accomplished electrically or mathematically within the signal generation process. .RT .sp 1P .LP 3.3 \fBacoustic artificial voice\fR .sp 9p .RT .PP It is the acoustic signal at the MRP (Mouth Reference Point) of the artificial mouth and has to comply with the same time and spectral requirements of the electrical artificial voice. .bp .RT .sp 2P .LP \fB4\fR \fBCharacteristics\fR .sp 1P .RT .sp 1P .LP 4.1 \fILong\(hyterm average spectrum\fR .sp 9p .RT .PP The third octave filtered long\(hyterm average spectrum of the artificial voice is given in Figure\ 2/P.50 and Table\ 1/P.50, normalized for a wideband sound pressure level of \(em4.7\ dBPa. The table is calculated from the theoretical equation reported in\ [2]. .PP \fINote\fR \ \(em\ The values of the long\(hyterm spectrum of the artificial voice at the MRP can be derived from the equation: \v'6p' .RT .ce 1000 \fIS\fR (\fIf\fR ) = \(em376.44 + 465.439(log\d1\\d0\u\fIf\fR ) \(em 157.745(log\d1\\d0\u\fIf\fR )\u2\d + 16.7124(log\d1\\d0\u\fIf\fR )\u3\d .ce 0 .ad r (1\(hy1) .ad b .RT .LP .sp 1 .LP where \fIS\fR (\fIf\fR ) is the spectrum density in dB relative to 1\ pW/m\u2\d sound intensity per Hertz at the frequency\ \fIf\fR . The definition frequency range is from 100\ Hz to 8\ kHz. .PP The curve of the spectrum is shown in Figure\ 2/P.50. The values of \fIS\fR (\fIf\fR ) at 1/3 octave ISO frequencies are given in the fourth column of Table\ 1/P.50. The tolerances are given in the fifth column of Table\ 1/P.50. The tolerances below 200\ Hz apply onto to the male artificial voice. .PP The total sound pressure level of the spectrum defined in Equation (1\(hy1) is \(em4.7\ dBPa. However, this spectrum is also applicable for the levels from \(em19.7 to +10.3\ dPBa. In other words, the first term of Equation (1\(hy1) may range from \(em391.44 to \(em361.44. .RT .LP .rs .sp 27P .ad r \fBFigure 2/P.50, p.\fR .sp 1P .RT .ad b .RT .LP .bp .ce \fBH.T. [T1.50]\fR .ce TABLE\ 1/P.50 .ce \fBLong\(hyterm spectrum of the artificial voice\fR .ps 9 .vs 11 .nr VS 11 .nr PS 9 .TS center box; cw(36p) | cw(36p) | cw(36p) | cw(36p) | cw(36p) . { 1/3 octave center frequency (Hz) (1) } { Bandwidth correction factor 10 log 1 0 \(*D \fIf\fR (dB) (2) } { Sound pressure level (third octave) (dBPa) (3) } { Spectrum density (dB) (3) \(em (2) } Tolerance (dB) . _ .T& cw(36p) | cw(36p) | cw(36p) | cw(36p) | cw(36p) . \ 100 13.6 \(em23.1 \(em36.7 \(em .T& cw(36p) | cw(36p) | cw(36p) | cw(36p) | cw(36p) . \ 125 14.6 \(em19.2 \(em33.8 +3, \(em6 | ua\d\u)\d .T& cw(36p) | cw(36p) | cw(36p) | cw(36p) | cw(36p) . \ 160 15.6 \(em16.4 \(em32\fB,7\fR +3, \(em6 | ua\d\u)\d .T& cw(36p) | cw(36p) | cw(36p) | cw(36p) | cw(36p) . \ 200 16.6 \(em14.4 \(em31\fB,7\fR +3, \(em6 .T& cw(36p) | cw(36p) | cw(36p) | cw(36p) | cw(36p) . \ 250 17.6 \(em13.4 \(em31\fB,7\fR \(+-3.0 .T& cw(36p) | cw(36p) | cw(36p) | cw(36p) | cw(36p) . \ 315 18.6 \(em13.0 \(em31.6 \(+-3.0 .T& cw(36p) | cw(36p) | cw(36p) | cw(36p) | cw(36p) . \ 400 19.6 \(em13.3 \(em32.9 \(+-3.0 .T& cw(36p) | cw(36p) | cw(36p) | cw(36p) | cw(36p) . \ 500 20.6 \(em14.1 \(em34.7 \(+-3.0 .T& cw(36p) | cw(36p) | cw(36p) | cw(36p) | cw(36p) . \ 630 21.6 \(em15.4 \(em37\fB,7\fR \(+-3.0 .T& cw(36p) | cw(36p) | cw(36p) | cw(36p) | cw(36p) . \ 800 22.6 \(em17.0 \(em39.6 \(+-3.0 .T& cw(36p) | cw(36p) | cw(36p) | cw(36p) | cw(36p) . 1000 23.6 \(em18.9 \(em42.5 \(+-3.0 .T& cw(36p) | cw(36p) | cw(36p) | cw(36p) | cw(36p) . 1250 24.6 \(em21.0 \(em45.6 \(+-3.0 .T& cw(36p) | cw(36p) | cw(36p) | cw(36p) | cw(36p) . 1600 25.6 \(em23.0 \(em48.6 \(+-3.0 .T& cw(36p) | cw(36p) | cw(36p) | cw(36p) | cw(36p) . 2000 26.6 \(em25.1 \(em51.7 \(+-3.0 .T& cw(36p) | cw(36p) | cw(36p) | cw(36p) | cw(36p) . 2500 27.6 \(em26.9 \(em54.5 \(+-3.0 .T& cw(36p) | cw(36p) | cw(36p) | cw(36p) | cw(36p) . 3150 28.6 \(em28.6 \(em57.2 \(+-3.0 .T& cw(36p) | cw(36p) | cw(36p) | cw(36p) | cw(36p) . 4000 29.6 \(em29.8 \(em59.4 \(+-6.0 .T& cw(36p) | cw(36p) | cw(36p) | cw(36p) | cw(36p) . 5000 30.6 \(em30.6 \(em61.2 \(+-6.0 .T& cw(36p) | cw(36p) | cw(36p) | cw(36p) | cw(36p) . 6300 31.6 \(em30.9 \(em62.5 \(+-6.0 .T& cw(36p) | cw(36p) | cw(36p) | cw(36p) | cw(36p) . 8000 32.6 \(em30.5 \(em63.1 \(em .TE .LP \ua\d\u)\d The given tolerances apply to the long\(hyterm spectrum of male speech and must also be complied with by speech shaped noises. However, they do not apply to the female speech spectrum, whose energy content in this frequency range is virtually negligible. .nr PS 9 .RT .ad r \fBTable 1/P.50 [T1.50], p.\fR .sp 1P .RT .ad b .RT .LP .sp 5 .sp 1P .LP 4.2 \fIShort\(hyterm spectrum\fR .sp 9p .RT .PP The short\(hyterm spectrum characteristics of the male and female artificial voices are described in Annex\ A. .RT .sp 1P .LP 4.3 \fIInstantaneous amplitude distribution\fR .sp 9p .RT .PP The probability density distribution of the instantaneous amplitude of the artificial voice is shown in Figure\ 3/P.50\ [3]. .bp .RT .LP .rs .sp 21P .ad r \fBFigure 3/P.50, p.\fR .sp 1P .RT .ad b .RT .sp 1P .LP 4.4 \fISegmental power level distribution\fR .sp 9p .RT .PP The segmental power level distribution of the artificial voice, measured on time windows of 16\ ms, is shown in Figure\ 4/P.50. The upper and lower tolerance limits are reported as well. .PP \fINote\fR \ \(em\ The upper tolerance limit represents the typical segmental power level distribution of normal conversation, while the lower limit represents continuous speech (telephonometric phrases) [4], [5]. .RT .LP .rs .sp 19P .ad r \fBFigure 4/P.50, p.\fR .sp 1P .RT .ad b .RT .LP .bp .sp 1P .LP 4.5 \fISpectrum of the modulation envelope\fR .sp 9p .RT .PP The spectrum of the modulation envelope waveform is shown in Figure\ 5/P.50 and should be reproduced with a tolerance of \(+- | \ dB on the whole frequency range. .RT .LP .rs .sp 17P .ad r \fBFigure 5/P.50, p.\fR .sp 1P .RT .ad b .RT .sp 1P .LP 4.6 \fITime convergence\fR .sp 9p .RT .PP The artificial voice must exhibit characteristics as close as possible to real speech. Particularly, it should be possible to obtain the long\(hyterm spectrum and amplitude distribution characteristics in 10\ s. .RT .sp 2P .LP \fB5\fR \fBGeneration method\fR .sp 1P .RT .PP Figure 6/P.50 shows a block diagram of the generation process of the artificial voice . It is generated by applying two different types of excitation source signals, a glottal excitation signal and a random noise, to a time\(hyvariant spectrum shaping filter. The artificial voice generated by the glottal excitation signal and by the random noise corresponds respectively to voiced and unvoiced sounds. The frequency response of the spectrum shaping filter simulates the transmission characteristics of the vocal tract. .RT .LP .rs .sp 17P .ad r \fBFigure 6/P.50, p.\fR .sp 1P .RT .ad b .RT .LP .bp .sp 1P .LP 5.1 \fIExcitation source signal\fR .sp 9p .RT .PP The artifical voice is obtained by randomly alternating four basic unit elements, each containing voiced and unvoiced segments. While one unit element starts with an unvoiced sound, followed by a voiced one, the other three elements start with a voiced sound, followed by an unvoiced one and end with a voiced sound again (see also Figure\ 9/P.50). The ratio of the unvoiced sound duration \fIT\fR\d\fIu\fR\\d\fIv\fR\uto the total duration of voiced segments \fIT\fR\d\fIv\fR\ufor each unit element is 0.25. The duration \fIT\fR = \fIT\fR\d\fIu\fR\\d\fIv\fR\u+ \fIT\fR\d\fIv\fR\uof unit elements varies according to the following equation: \v'6p' .RT .sp 1P .ce 1000 \fIT\fR = \(em3.486 (log\d1\\d0\u\fIr\fR ) .ce 0 .sp 1P .LP .sp 1 where \fIr\fR | denotes a uniformly distributed random number (0.371\(= \fIr\fR \(= 0.609). .PP The time lengths of the voiced and unvoiced sounds of the four unit elements are as follows: .LP Element a: Unvoiced (\fIT\fR\d\fIu\fR\\d\fIv\fR\u) ; Voiced (\fIT\fR\d\fIv\fR\u) .LP Element b: Voiced (\fIT\fR\d\fIv\fR\u/4) + Unvoiced (\fIT\fR\d\fIu\fR\\d\fIv\fR\u) + Voiced (3\fIT\fR\d\fIv\fR\u/4) .LP Element c: Voiced (\fIT\fR\d\fIv\fR\u/2) + Unvoiced (\fIT\fR\d\fIu\fR\\d\fIv\fR\u) ; Voiced (\fIT\fR\d\fIv\fR\u/2) .LP Element d: Voiced (3\fIT\fR\d\fIv\fR\u/4) + Unvoiced (\fIT\fR\d\fIu\fR\\d\fIv\fR\u) + Voiced (\fIT\fR\d\fIv\fR\u/4) .PP Unit elements shall be randomly iterated for at least 10\ s in order to comply with the artificial voice characteristics as specified in \(sc\ 4. .sp 1P .LP 5.2 \fIGlottal excitation\fR .sp 9p .RT .PP The glottal excitation signal is a periodic waveform as shown in Figure\ 7/P.50. The pitch frequency (1/\fIT\fR\d0\uin Figure\ 7/P.50) varies according to the variation pattern shown in Figure\ 8/P.50 during the period \fIT\fR\d\fIv\fR\u. The starting value of the pitch frequency (\fIF\fR\d\fIs\fR\uin Figure\ 8/P.50) is determined according to the following relationships: .RT .LP \fIF\fR\d\fIs\fR\u= \fIF\fR\d\fIc\fR\u\(em 31.82 \fIT\fR\d\fIv\fR\u+ 39.4 \fIR\fR | for the male artificial voice .LP \fIF\fR\d\fIs\fR\u= \fIF\fR\d\fIc\fR\u\(em 51.85 \fIT\fR\d\fIv\fR\u+ 64.2 \fIR\fR | for the female artificial voice .LP where \fIF\fR\d\fIc\fR\uand \fIR\fR respectively denote the center frequency and a uniformly distributed random variable (\(em1\ <\ \fIR\fR \ <\ 1). \fIF\fR\d\fIc\fR\uis 128\ Hz for the male artificial voice and 215\ Hz for the female artificial voice. In the trapezoid of the pitch frequency variation pattern, the area of the trapezoid above \fIF\fR\d\fIc\fR\ushould be equal to that below \fIF\fR\d\fIc\fR\u(shaded in Figure\ 8/P.50). For the elements b), c) and d) in Figure\ 7/P.50 the pitch frequency variation pattern applies to the combination of the two voiced parts, irrespectively of where the unvoiced segment is inserted. .LP .rs .sp 22P .ad r \fBFigure 7/P.50, p.\fR .sp 1P .RT .ad b .RT .LP .bp .LP .rs .sp 15P .ad r \fBFigure 8/P.50, p.\fR .sp 1P .RT .ad b .RT .sp 1P .LP 5.3 \fIUnvoiced sounds\fR .sp 9p .RT .PP The transfer function of the low\(hypass filter located after the random noise generator (low emphasis) is 1/(1\ \(em\ \fIz\fR\d\\u(em\d1\u), where \fIz\fR \uD\dlF261\u1\d denotes the unit delay. .RT .sp 1P .LP 5.4 \fIPower envelope\fR .sp 9p .RT .PP The power envelope of each unit element of the excitation source signal is so controlled that the short\(hyterm segmental power (evaluated over 2\ ms intervals) of the artificial voice varies according to the patterns shown in\ a) to\ d) of Figure\ 9/P.50. This is obtained by utilizing the following relationship providing input and output signals of the spectrum shaping filter: \v'6p' .RT .ad r .ad b .RT .LP where: .LP \fIP\fR\d\fIi\fR\\d\fIn\fR\uis the input power to the spectrum shaping filter .LP \fIP\fR\d\fIo\fR\\d\fIu\fR\\d\fIt\fR\uis the output power from the spectrum shaping filter .LP \fIk\fR\d\fIi\fR\uis the \fIi\fR th coefficient of the spectrum shaping filter. .PP The rising, stationary and decay times of each trapezoid of a) to d) of Figure\ 9/P.50 shall be mutually related by the same proportionality coefficients (2 | | | | ) of the pitch frequency variation pattern shown in Figure\ 8/P.50. For each unit element, the average power of unvoiced sounds (\fIP\fR\d\fIu\fR\\d\fIv\fR\u) shall be 17.5\ dB less than the average power of voiced sounds (\fIP\fR\d\fIv\fR\u). .sp 1P .LP 5.5 \fISpectrum shaping filter\fR .sp 9p .RT .PP The spectrum shaping filter has a 12th order lattice structure as shown in Figure\ 10/P.50. Sixteen groups, each of 12 filtering coefficients (\fIk\fR\d1\u\(em \fIk\fR\d1\\d2\u), are defined; thirteen groups shall be used for generating the voiced part, while three groups shall be used for generating the unvoiced part. These coefficients are listed in Table\ 2/P.50 both for male and female artificial voices. .PP The twelve filter coefficients shall be updated every 60\ ms while generating the signal. More precisely, during each 60\ ms period the actual filtering coefficients must be adjourned every 2\ ms, by linearly interpolating between the two sets of values adopted for subsequent 60\ ms intervals. In the voiced sound part, each of 13\ groups of coefficients shall be chosen at random once every 780\ ms (=\ 60\ ms\ \(mu\ 13), and in the unvoiced sound part each of 3\ groups of coefficients shall be chosen at random once every 180\ ms (=\ 60\ ms\ \(mu\ 3). .PP \fINote\fR \ \(em\ The described implementation of the shaping filter should be considered as an example and is not an integral part of this Recommendation. Any other implementation providing the same transfer function can be alternatively used. .bp .RT .LP .rs .sp 30P .ad r \fBFigure 9/P.50, p. 15\fR .sp 1P .RT .ad b .RT .LP .rs .sp 12P .ad r \fBFigure 10/P.50, p. 16\fR .sp 1P .RT .ad b .RT .LP .bp .ce \fBH.T. [T2.50]\fR .ce TABLE\ 2/P.50 .ce \fBCoefficients\fR .ce \fI k\fR .ce \fIi\fR .ce \fIa)\ k\fI .ps 9 .vs 11 .nr VS 11 .nr PS 9 .TS center box; cw(30p) | cw(15p) | cw(15p) | cw(21p) | cw(15p) | cw(15p) | cw(15p) | cw(21p) | cw(15p) | cw(15p) | cw(15p) | cw(21p) | lw(15p) . \ \ \fIk\fR 1 \ \ \fIk\fR 2 \ \ \fIk\fR 3 \ \ \fIk\fR 4 \ \ \fIk\fR 5 \ \ \fIk\fR 6 \ \ \fIk\fR 7 \ \ \fIk\fR 8 \ \ \fIk\fR 9 \ \ | fIk\fR 1 0 \ \ | fIk\fR 1 1 \ | fIk\fR 1 2 _ .T& lw(24p) | lw(6p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) . Unvoiced \ 1 \ 2 \ 3 -0.471 -0.284 -0.025 -0.108 -0.468 -0.496 0.024 0.030 -0.176 -0.048 0.090 0.162 0.140 0.124 0.236 0.036 -0.020 -0.012 0.054 0.087 0.068 0.004 0.067 0.001 0.123 0.131 0.096 0.044 0.011 0.029 0.099 0.076 0.086 -0.003 -0.024 -0.018 _ .T& lw(24p) | lw(6p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) . \ 1 0.974 0.219 0.025 -0.123 -0.132 -0.203 -0.103 -0.174 -0.079 -0.153 -0.010 -0.061 .T& lw(24p) | lw(6p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) . \ 2 0,629 -0.152 -0.138 -0.142 -0.118 -0.135 0.147 0.019 0.077 -0.040 0.029 -0.007 .T& lw(24p) | lw(6p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) . \ 3 0.599 -0.119 0.067 0.051 0.103 0.023 0.106 0.036 -0.006 -0.133 -0.052 -0.094 .T& lw(24p) | lw(6p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) . \ 4 0.164 -0.364 -0.248 -0.076 0.168 0.072 0.103 0.045 0.112 0.010 0.048 -0.034 .T& lw(24p) | lw(6p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) . \ 5 0.842 0.022 0.171 0.173 0.067 -0.057 0.089 -0.045 -0.039 -0.134 -0.034 -0.122 .T& lw(24p) | lw(6p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) . \ 6 0.933 -0.537 -0.137 -0.161 -0.216 -0.139 0.115 -0.042 0.027 -0.163 0.102 -0.107 .T& lw(24p) | lw(6p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) . Voiced \ 7 0.937 -0.413 0.132 -0.059 -0.103 -0.134 0.047 -0.115 -0.105 -0.097 0.039 -0.108 .T& lw(24p) | lw(6p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) . \ 8 0.965 -0.034 0.032 0.001 -0.107 -0.189 -0.057 -0.175 -0.109 -0.163 -0.003 -0.055 .T& lw(24p) | lw(6p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) . \ 9 0.870 -0.476 -0.016 -0.136 -0.125 -0.107 0.091 -0.008 0.021 -0.128 0.042 -0.069 .T& lw(24p) | lw(6p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) . 10 0.686 -0.030 0.178 0.197 0.155 -0.026 0.078 0.004 -0.001 -0.128 -0.004 -0.102 .T& lw(24p) | lw(6p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) . 11 0.963 -0.232 0.086 -0.018 -0.147 -0.192 -0.040 -0.179 -0.144 -0.133 0.042 -0.042 .T& lw(24p) | lw(6p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) . 12 0.930 -0.461 0.071 -0.144 -0.122 -0.096 0.034 -0.066 -0.021 -0.171 0.067 -0.091 .T& lw(24p) | lw(6p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) | rw(15p) | rw(15p) | rw(21p) | rw(15p) . 13 0.949 -0.334 0.143 -0.040 -0.112 -0.161 0.010 -0.156 -0.123 -0.119 0.049 -0.070 _ .TE .nr PS 9 .RT .ad r \fBTableau 2/P.50 [T2.50], p. 17\fR .sp 1P .RT .ad b .RT .LP .rs .sp 7P .ad r Blanc .ad b .RT .LP .bp .ce 1000 ANNEX\ A .ce 0 .ce 1000 (to Recommendation P.50) .sp 9p .RT .ce 0 .ce 1000 \fBShort\(hyterm spectrum characteristics of the artificial voice\fR .sp 1P .RT .ce 0 .PP \fR The artificial voice is generated by randomly selecting each of sixteen short\(hyterm spectrum patterns once ever 960\ ms (=\ 60\ ms\ \(mu\ 16\ patterns). The spectrum density of each pattern is provided by Equation\ (A\(hy1) and Table\ A\(hy1/P.50, and the short\(hyterm spectrum of the signal during the 60\ ms interval occurring between any two subsequent pattern selections varies smoothly from one pattern to the next. .sp 1P .RT .PP \fINote\fR \ \(em\ The spectrum patterns in Equation (A\(hy10) and Table A\(hy1/P.50 are expressed in power normalized form. \v'6p' .ad r .ad b .RT .LP .rs .sp 37P .ad r Blanc .ad b .RT .LP .bp .ce \fBH.T. [T3.50]\fR .ps 9 .vs 11 .nr VS 11 .nr PS 9 .TS center box; cw(342p) . TABLE\ A\(hy1/P.50 .T& cw(342p) . { \fBCoefficients\fR \fIA\fR \fIi\fR \fIj\fR } .T& cw(342p) . { \fIa)\ A\fR \fIi\fR \fIj\fR \fIfor male artificial voice\fR } .TE .TS rw(18p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(30p) . \fIj\fR \fIi\fR \ 0 \ 1 \ 2 \ 3 \ 4 \ 5 \ 6 \ 7 \ 8 \ 9 \ | 0 \ | 1 | 2 _ .T& cw(18p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(30p) . \ 1 \ 2.09230 \ \(em1.33222 \ 1.32175 \ \(em1.14200 \ 0.99352 \ \(em0.94634 \ 0.72684 \ \(em0.63263 \ 0.41196 \ \(em0.42858 \ 0.22070 \(em0.19746 0.10900 .T& cw(18p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(30p) . \ 2 \ 9.34810 \ \(em8.55934 \ 7.35732 \ \(em6.35320 \ 5.33999 \ \(em4.47238 \ 3.62417 \ \(em2.85246 \ 2.12260 \ \(em1.49424 \ 0.93988 \(em0.44998 0.12400 .T& cw(18p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(30p) . \ 3 11.69068 \(em10.91138 \ 9.46588 \ \(em8.11729 \ 6.94160 \ \(em5.90977 \ 4.95137 \ \(em3.89587 \ 2.88750 \ \(em1.97671 \ 1.14892 \(em0.50255 0.12100 .T& cw(18p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(30p) . \ 4 12.56830 \(em11.81209 10.36030 \ \(em8.82879 \ 7.37947 \ \(em6.01017 \ 4.66740 \ \(em3.46913 \ 2.42182 \ \(em1.60880 \ 0.91652 \(em0.39648 0.12000 .T& cw(18p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(30p) . \ 5 \ 6.83438 \ \(em6.18275 \ 5.59089 \ \(em4.71866 \ 4.06004 \ \(em3.44767 \ 2.65380 \ \(em2.12140 \ 1.50334 \ \(em1.07904 \ 0.64553 \(em0.31816 0.11500 .T& cw(18p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(30p) . \ 6 12.37251 \(em11.52358 \ 9.89962 \ \(em8.31774 \ 6.99062 \ \(em5.86272 \ 4.69809 \ \(em3.56806 \ 2.53340 \ \(em1.70522 \ 0.99232 \(em0.45403 0.13400 .T& cw(18p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(30p) . \ 7 21.07637 \(em19.62125 16.56781 \(em13.67518 11.41379 \ \(em9.61940 \ 7.93529 \ \(em6.32841 \ 4.92443 \ \(em3.53539 \ 2.09095 \(em0.86543 0.18100 .T& cw(18p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(30p) . \ 8 30.77371 \(em29.17365 25.52254 \(em21.51978 17.80583 \(em14.30488 10.87190 \ \(em7.71572 \ 5.14643 \ \(em3.20113 \ 1.72149 \(em0.68054 0.14400 .T& cw(18p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(30p) . \ 9 \ 4.18618 \ \(em3.36611 \ 3.36793 \ \(em2.92133 \ 2.38452 \ \(em2.06047 \ 1.57550 \ \(em1.34240 \ 0.84994 \ \(em0.70462 \ 0.38685 \(em0.21857 0.12100 .T& cw(18p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(30p) . 10 14.12359 \(em13.14611 11.25804 \ \(em9.47510 \ 7.97588 \ \(em6.70717 \ 5.44803 \ \(em4.23843 \ 3.10807 \ \(em2.12879 \ 1.25096 \(em0.53230 0.12600 .T& cw(18p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(30p) . 11 26.36971 \(em24.95984 21.80496 \(em18.41045 15.30642 \(em12.49415 \ 9.84879 \ \(em7.40287 \ 5.29262 \ \(em3.43906 \ 1.84980 \(em0.71546 0.14800 .T& cw(18p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(30p) . 12 11.50808 \(em10.74609 \ 9.34328 \ \(em7.91953 \ 6.66959 \ \(em5.54500 \ 4.34328 \ \(em3.27036 \ 2.33714 \ \(em1.61333 \ 0.96597 \(em0.44666 0.13500 .T& cw(18p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(30p) . 13 \ 5.32020 \ \(em4.61998 \ 4.29145 \ \(em3.62118 \ 3.01310 \ \(em2.67071 \ 2.13992 \ \(em1.72147 \ 1.22163 \ \(em0.93163 \ 0.53317 \(em0.28989 0.11900 .T& cw(18p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(30p) . 14 20.61945 \(em19.39682 16.80034 \(em14.14817 11.84307 \ \(em9.78712 \ 7.73534 \ \(em5.77921 \ 4.06200 \ \(em2.66324 \ 1.49831 \(em0.59887 0.12600 .T& cw(18p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(30p) . 15 30.02641 \(em28.42244 24.75314 \(em20.70178 16.98199 \(em13.72247 10.81050 \ \(em8.20966 \ 5.94148 \ \(em3.90501 \ 2.11507 \(em0.81306 0.16400 .T& cw(18p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(24p) | rw(30p) . 16 27.62370 \(em26.17896 22.93678 \(em19.42253 16.18997 \(em13.17171 10.19859 \ \(em7.42299 \ 5.07437 \ \(em3.21481 \ 1.73980 \(em0.67818 0.14000 _ .TE .nr PS 9 .RT .ad r \fBTableau A\(hy1/P.50 [T3.50], A L'ITALIENNE, p. 18\fR .sp 1P .RT .ad b .RT .LP .bp .sp 2P .LP \fBReferences\fR .sp 1P .RT .LP [1] CCITT \(em Contribution COM XII\(hyNo. 76, Study Period 1981\(hy1984 .LP [2] CCITT \(em Contribution COM XII\(hyNo. 108, Study Period 1981\(hy1984 .LP [3] CCITT \(em Contribution COM XII\(hyNo. 11, Study Period 1981\(hy1984 .LP [4] CCITT \(em Contribution COM XII\(hyNo. 150, Study Period 1981\(hy1984 .LP [5] CCITT \(em Contribution COM XII\(hyNo. 132, Study Period 1981\(hy1984 \v'6p' .sp 2P .LP \fBRecommendation\ P.51\fR .RT .sp 2P .ce 1000 \fBARTIFICIAL\ EAR\ AND\ ARTIFICIAL\ MOUTH\fR .EF '% Volume\ V\ \(em\ Rec.\ P.51'' .OF '''Volume\ V\ \(em\ Rec.\ P.51 %' .ce 0 .ce 1000 \fI(amended at Mar del Plata, 1968, Geneva, 1972, 1976,\fR .sp 9p .RT .ce 0 .sp 1P .ce 1000 \fI1980, Malaga\(hyTorremolinos, 1984 and Melbourne, 1988)\fR .ce 0 .sp 1P .sp 2P .LP The\ CCITT, .sp 1P .RT .sp 1P .LP \fIconsidering\fR .sp 9p .RT .PP (a) that it is highly desirable to design an apparatus for telephonometric measurements such that in the future all of these measurements may be made with this apparatus, without having recourse to the human mouth and ear; .PP (b) that the standardization of the artificial ear and mouth used in the construction of such apparatus is a subject for general study by the CCITT, .sp 1P .LP \fIrecommends\fR .sp 9p .RT .PP (1) the use of the artificial ears described in \(sc\ 1 of this Recommendation; .PP (2) the use of the artificial mouth described in \(sc\ 2 of this Recommendation. .PP \fINote\fR \ \(em\ Administrations may, if they wish, use devices which they have been able to construct for large\(hyscale testing of telephone apparatus supplied by manufacturers, provided that the results obtained with these devices are in satisfactory agreement with results obtained by real voice\(hyear methods. .sp 2P .LP \fB1\fR \fBArtificial ears\fR .sp 1P .RT .PP Three types of artificial ears are defined: .RT .LP 1) a wideband type for audiometricand telephonometric measurements, .LP 2) a special type for measuring insert earphones, .LP 3) a type which faithfully reproduces the characteristics of the average human ear, for use in the laboratory. .PP Type 1 is covered by IEC Recommendation\ 318\ [1], the second IEC Recommendation\ 711\ [2] and the third is the object of further study in the IEC. .PP It is recommended that the artificial ear conforming to IEC\ 318\ [1] should be used for measurements on supra\(hyaural earphones, e.g.\ handsets, and that the insert ear simulator conforming to IEC\ 711 [2] should be used for measurements on insert earphones, e.g.\ some headsets. .PP \fINote 1\fR \ \(em\ For the calibration of NOSFER earphones with rubber earpads (types\ 4026A and DR\ 701) the method detailed in Annex\ B to Recommendation\ P.42 should be used. .PP \fINote 2\fR \ \(em\ The sound pressure measured by the IEC 711 artificial ear is referred to the eardrum. The correction function given in Table\ 1/P.51 shall be used for converting data to the ear reference point (ERP), where loudness rating algorithms (Recommendation\ P.79) are based. The corrections apply to free field open\(hyear conditions and to partially or totally occluded conditions as well. .bp .RT .ce \fBH.T. [T1.51]\fR .ce TABLE\ 1/P.51 .ps 9 .vs 11 .nr VS 11 .nr PS 9 .TS center box; cw(48p) | cw(48p) . Frequency (Hz) { \fIS\fR \d\fIDE\fR \u (dB) } _ .T& cw(48p) | rw(48p) . \ 100 0.0 .T& cw(48p) | rw(48p) . \ 125 0.0 .T& cw(48p) | rw(48p) . \ 160 0.0 .T& cw(48p) | rw(48p) . \ 200 0.0 .T& cw(48p) | rw(48p) . \ 250 0.0 .T& cw(48p) | rw(48p) . \ 315 \(em0.2 .T& cw(48p) | rw(48p) . \ 400 \(em0.5 .T& cw(48p) | rw(48p) . \ 500 \(em1.1 .T& cw(48p) | rw(48p) . \ 630 \(em1.0 .T& cw(48p) | rw(48p) . \ 800 \(em1.8 .T& cw(48p) | rw(48p) . 1000 \(em2.0 .T& cw(48p) | rw(48p) . 1250 \(em2.5 .T& cw(48p) | rw(48p) . 1600 \(em4.1 .T& cw(48p) | rw(48p) . 2000 \(em7.2 .T& cw(48p) | rw(48p) . 2500 \(em10.6 .T& cw(48p) | rw(48p) . 3150 \(em10.4 .T& cw(48p) | rw(48p) . 4000 \(em6.0 .T& cw(48p) | rw(48p) . 5000 \(em2.1 .TE .LP \fIS\fR \d\fIDE\fR \u is the transfer function eardrum to ERP: \fIS\fR \d\fIDE\fR \u = | 0 log @ { fIP\fR~\fIE\fR } over { fIP\fR~\fID\fR } @ (dB), where .LP \fIP\fI sound pressure at the ERP .LP \fIP\fI sound pressure at the eardrum. .nr PS 9 .RT .ad r \fBTable 1/P.51 [T1.51], p.\fR .sp 1P .RT .ad b .RT .sp 2P .LP .sp 2 \fB2\fR \fBArtificial mouth\fR .sp 1P .RT .sp 1P .LP 2.1 \fIIntroduction\fR .sp 9p .RT .PP The artificial mouth is a device that accurately reproduces the acoustic field generated by the human mouth in the near field. It is used for measuring objectively the sending characteristics of handset\(hyequipped telephone sets as specified in Recommendation\ P.64. It may also be used for measuring the sending characteristics of loudspeaking telephones at distances up to 0.5\ m from the lip plane, but the accuracy with which it reproduces the sound field of the human mouth is slightly reduced. .RT .sp 2P .LP 2.2 \fIDefinitions\fR .sp 1P .RT .sp 1P .LP 2.2.1 \fBlip ring\fR .sp 9p .RT .PP Circular ring of thin rigid rod, having a diameter of 25\ mm and less than 2\ mm thick. It shall be constructed of non\(hymagnetic material and be solidly fixed to the case of the artificial mouth. The lip ring defines both the reference axis of the mouth and the mouth reference point. .PP \fINote\fR \ \(em\ The provision of the lip ring for locating the lip planes and the reference axis is not mandatory. However, when not provided, adequate markings or other suitable geometric reference shall be alternatively available. .bp .RT .sp 1P .LP 2.2.2 \fBlip plane\fR .sp 9p .RT .PP Outer plane of the lip ring. .RT .sp 1P .LP 2.2.3 \fBreference axis\fR .sp 9p .RT .PP The line perpendicular to the lip plane containing the center of the lip ring. .RT .sp 1P .LP 2.2.4 \fBvertical plane\fR .sp 9p .RT .PP A plane containing the reference axis that divides the mouth into symmetrical halves. It shall be vertically oriented in order to reproduce the acoustic field generated by a person in the upright position. .RT .sp 1P .LP 2.2.5 \fBhorizontal plane\fR .sp 9p .RT .PP The plane containing the reference axis, perpendicular to the vertical plane. It shall be horizontally oriented in order to reproduce the acoustic field generated by a person in the upright position. .RT .sp 1P .LP 2.2.6 \fBmouth reference point (MRP)\fR .sp 9p .RT .PP The point on the reference axis, 25\ mm in front of the lip plane. .RT .sp 1P .LP 2.2.7 \fBnormalized free\(hyfield response\fR \fB(at a given point)\fR .sp 9p .RT .PP Difference between the third\(hyoctave spectrum level of the signal delivered by the artificial mouth at a given point in the free field and the third\(hyoctave spectrum level of the signal delivered simultaneously at the MRP. The characteristic is measured by feeding the artificial voice (see Recommendtion\ P.50) a speech\(hyshaped random noise or a pink noise. .RT .sp 1P .LP 2.2.8. \fBreference obstacle\fR .sp 9p .RT .PP Disc constructed of hard, stable and on\(hymegnetic material, such as brass, having a diameter of 63\ mm and 5\ mm thick. In order to measure the normalized obstacle diffraction, it shall be fitted with a \(14" pressure microphone, mounted at the centre with the diaphragm flush on the disc surface. .RT .sp 1P .LP 2.2.9 \fBnormalized obstacle diffraction\fR .sp 9p .RT .PP Difference between the third\(hyoctave spectrum level of the acoustic pressure delivered by the artificial mouth at the surface of the reference obstacle and the third\(hyoctave spectrum level of the pressure simultaneously delivered at the point on the reference axis, 500\ mm in front of the lip plane. The characteristic is defined for positions of the reference obstacle in front of the artificial mouth, with the disc axis coinciding with the reference axis, and is measured by feeding the artificial mouth with a complex signal such as the artificial voice, a speech shaped random noise or a pink noise. .RT .sp 2P .LP 2.3 \fIAcoustic characteristics of the artificial mouth\fR .sp 1P .RT .sp 1P .LP 2.3.1 \fINormalized free\(hyfield response\fR .sp 9p .RT .PP The normalized free\(hyfield response is specified at seventeen points: ten in the near field and seven in the far field. Near\(hyfield points are listed in Table\ 2/P.51, while far\(hyfield points are listed in Table\ 3/P.51. .PP Table 4/P.51 provides the normalized free\(hyfield response of the artificial mouth, together with tolerances, for the bandwidth between 100\ Hz and 8\ kHz. The requirements at each point not lying in the vertical plan shall also be met by the corresponding point in the symmetrical half\(hyspace. .PP The characteristic shall be checked by using appropriate microphones, as specified in Table\ 5/P.51. Pressure microphones shall be oriented with their axes perpendicular to the sound direction, while free\(hyfield microphones shall be oriented with their axes parallel to the direction of sound. .PP \fINote\fR \ \(em\ If a compressor microphone is used with the mouth, it (or an equivalent dummy) shall be left in place while checking the normalized free\(hyfield response. .bp .RT .ce \fBH.T. [T2.51]\fR .ce TABLE\ 2/P.51 .ce \fBCoordinates of points in the near field\fR .ps 9 .vs 11 .nr VS 11 .nr PS 9 .TS center box; cw(48p) | cw(72p) | cw(72p) . Measurement point { On\(hyaxis displacement from the lip plane (mm) } { Off\(hyaxis, perpendicular displacement (mm) } _ .T& cw(48p) | cw(72p) | lw(72p) . \ 1 \ 12.5 \ 0 .T& cw(48p) | cw(72p) | lw(72p) . \ 2 \ 50 | \ 0 .T& cw(48p) | cw(72p) | lw(72p) . \ 3 100 | \ 0 .T& cw(48p) | cw(72p) | lw(72p) . \ 4 140 | \ 0 .T& cw(48p) | cw(72p) | lw(72p) . \ 5 \ \ 0 | 20 horizontal .T& cw(48p) | cw(72p) | lw(72p) . \ 6 \ \ 0 | 40 horizontal .T& cw(48p) | cw(72p) | lw(72p) . \ 7 \ 25 | 20 horizontal .T& cw(48p) | cw(72p) | lw(72p) . \ 8 \ 25 | 40 horizontal .T& cw(48p) | cw(72p) | lw(72p) . \ 9 \ 25 | 20 vertical (downwards) .T& cw(48p) | cw(72p) | lw(72p) . 10 \ 25 | 40 vertical _ .TE .nr PS 9 .RT .ad r \fBTableau 2/P.51 [T2.51], p. 20\fR .sp 1P .RT .ad b .RT .LP .sp 5 .ce \fBH.T. [T3.51]\fR .ce TABLE\ 3/P.51 .ce \fBCoordinates of points in the far field\fR .ps 9 .vs 11 .nr VS 11 .nr PS 9 .TS center box; cw(48p) | cw(48p) | cw(48p) | cw(48p) . Measurement point { Distance from the lip plane (mm) } { Azimuth angle (horizontal) (degree) } { Elevation angle (vertical) (degree) } _ .T& cw(48p) | cw(48p) | cw(48p) | cw(48p) . 11 500 \ 0 \ \ 0 .T& cw(48p) | cw(48p) | cw(48p) | cw(48p) . 12 500 \ 0 +15 .T& cw(48p) | cw(48p) | cw(48p) | cw(48p) . 13 500 \ 0 +30 .T& cw(48p) | cw(48p) | cw(48p) | cw(48p) . 14 500 \ 0 \(em15 .T& cw(48p) | cw(48p) | cw(48p) | cw(48p) . 15 500 \ 0 \(em30 .T& cw(48p) | cw(48p) | cw(48p) | cw(48p) . 16 500 15 \ \ 0 .T& cw(48p) | cw(48p) | cw(48p) | cw(48p) . 17 500 30 \ \ 0 _ .TE .nr PS 9 .RT .ad r \fBTableau 3/P.51 [T3.51], p. 21\fR .sp 1P .RT .ad b .RT .LP .rs .sp 6P .ad r Blanc .ad b .RT .LP .bp .ce \fBH.T. [T4.51]\fR .ce TABLE\ 4a/P.51 .ce \fBNormalized free field response at points on axis in the near field\fR .ce .ps 9 .vs 11 .nr VS 11 .nr PS 9 .TS center box; cw(36p) | cw(30p) sw(24p) sw(30p) sw(24p) sw(30p) , l | l | l | l | l | l. Frequency { Measurement point (Hz) 1 (dB) 2 (dB) 3 (dB) 4 (dB) Tolerance (dB) } _ .T& cw(36p) | cw(30p) | cw(24p) | cw(30p) | cw(24p) | cw(30p) . \ 100 4.2 \(em5.0 \(em11.0 \(em13.6 \(+-1.5 .T& cw(36p) | cw(30p) | cw(24p) | cw(30p) | cw(24p) | cw(30p) . \ 125 4.2 \(em5.0 \(em10.9 \(em13.6 \(+-1.5 .T& cw(36p) | cw(30p) | cw(24p) | cw(30p) | cw(24p) | cw(30p) . \ 160 4.2 \(em5.0 \(em10.7 \(em13.6 \(+-1.5 .T& cw(36p) | cw(30p) | cw(24p) | cw(30p) | cw(24p) | cw(30p) . \ 200 4.0 \(em5.0 \(em10.7 \(em13.3 \(+-1.5 .T& cw(36p) | cw(30p) | cw(24p) | cw(30p) | cw(24p) | cw(30p) . \ 250 4.0 \(em5.0 \(em10.6 \(em13.2 \(+-1.5 .T& cw(36p) | cw(30p) | cw(24p) | cw(30p) | cw(24p) | cw(30p) . \ 315 4.0 \(em5.0 \(em10.6 \(em13.2 \(+-1.0 .T& cw(36p) | cw(30p) | cw(24p) | cw(30p) | cw(24p) | cw(30p) . \ 400 4.0 \(em5.0 \(em10.6 \(em13.2 \(+-1.0 .T& cw(36p) | cw(30p) | cw(24p) | cw(30p) | cw(24p) | cw(30p) . \ 500 4.1 \(em5.0 \(em10.6 \(em13.2 \(+-1.0 .T& cw(36p) | cw(30p) | cw(24p) | cw(30p) | cw(24p) | cw(30p) . \ 630 4.2 \(em4.9 \(em10.5 \(em13.4 \(+-1.0 .T& cw(36p) | cw(30p) | cw(24p) | cw(30p) | cw(24p) | cw(30p) . \ 800 4.2 \(em4.8 \(em10.5 \(em13.4 \(+-1.0 .T& cw(36p) | cw(30p) | cw(24p) | cw(30p) | cw(24p) | cw(30p) . 1000 4.1 \(em4.8 \(em10.4 \(em12.9 \(+-1.0 .T& cw(36p) | cw(30p) | cw(24p) | cw(30p) | cw(24p) | cw(30p) . 1250 3.9 \(em4.8 \(em10.2 \(em12.7 \(+-1.0 .T& cw(36p) | cw(30p) | cw(24p) | cw(30p) | cw(24p) | cw(30p) . 1600 3.8 \(em4.8 \(em10.0 \(em12.7 \(+-1.0 .T& cw(36p) | cw(30p) | cw(24p) | cw(30p) | cw(24p) | cw(30p) . 2000 3.6 \(em4.7 \(em10.0 \(em12.7 \(+-1.0 .T& cw(36p) | cw(30p) | cw(24p) | cw(30p) | cw(24p) | cw(30p) . 2500 3.5 \(em4.6 \ \(em9.4 \(em12.3 \(+-1.0 .T& cw(36p) | cw(30p) | cw(24p) | cw(30p) | cw(24p) | cw(30p) . 3150 3.6 \(em4.6 \ \(em9.4 \(em12.0 \(+-1.0 .T& cw(36p) | cw(30p) | cw(24p) | cw(30p) | cw(24p) | cw(30p) . 4000 3.7 \(em4.6 \ \(em9.7 \(em12.3 \(+-1.5 .T& cw(36p) | cw(30p) | cw(24p) | cw(30p) | cw(24p) | cw(30p) . 5000 3.7 \(em4.5 \ \(em9.7 \(em12.6 \(+-1.5 .T& cw(36p) | cw(30p) | cw(24p) | cw(30p) | cw(24p) | cw(30p) . 6300 3.8 \(em4.5 \ \(em9.7 \(em12.6 \(+-1.5 .T& cw(36p) | cw(30p) | cw(24p) | cw(30p) | cw(24p) | cw(30p) . 8000 3.8 \(em4.9 \(em10.0 \(em12.7 \(+-1.5 _ .TE .nr PS 9 .RT .ad r \fBTableau 4a/P.51 [T4.51], p. 22\fR .sp 9p .RT .ad b .RT .ce \fBH.T. [T5.51]\fR .ce TABLE\ 4b/P.51 .ce \fBNormalized free\(hyfield response at points on axis in the near field\fR .ce .ps 9 .vs 11 .nr VS 11 .nr PS 9 .TS center box; cw(30p) | cw(24p) sw(24p) sw(24p) sw(24p) sw(24p) sw(24p) sw(24p) , l | l | l | l | l | l | l | l. Frequency { Measurement point (Hz) 5 | ua\d\u)\d (dB) 6 (dB) 7 (dB) 8 (dB) 9 (dB) 10 (dB) Tolerance (dB) } _ .T& cw(30p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) . \ 100 5.2 \(em1.7 \(em1.4 \(em4.0 \(em1.6 \(em4.2 \(+-1.5 .T& cw(30p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) . \ 125 5.2 \(em1.7 \(em1.3 \(em3.8 \(em1.5 \(em4.2 \(+-1.5 .T& cw(30p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) . \ 160 5.2 \(em1.7 \(em1.2 \(em3.8 \(em1.5 \(em4.2 \(+-1.5 .T& cw(30p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) . \ 200 5.2 \(em1.7 \(em1.2 \(em3.8 \(em1.5 \(em4.2 \(+-1.5 .T& cw(30p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) . \ 250 5.2 \(em1.8 \(em1.3 \(em3.8 \(em1.4 \(em4.2 \(+-1.5 .T& cw(30p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) . \ 315 5.1 \(em1.8 \(em1.3 \(em3.8 \(em1.3 \(em4.2 \(+-1.0 .T& cw(30p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) . \ 400 5.1 \(em1.8 \(em1.3 \(em3.8 \(em1.3 \(em4.0 \(+-1.0 .T& cw(30p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) . \ 500 5.0 \(em1.6 \(em1.3 \(em3.8 \(em1.3 \(em3.9 \(+-1.0 .T& cw(30p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) . \ 630 5.0 \(em1.6 \(em1.3 \(em3.8 \(em1.3 \(em3.9 \(+-1.0 .T& cw(30p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) . \ 800 5.0 \(em1.6 \(em1.3 \(em3.8 \(em1.3 \(em4.0 \(+-1.0 .T& cw(30p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) . 1000 4.8 \(em1.7 \(em1.3 \(em3.9 \(em1.3 \(em4.1 \(+-1.0 .T& cw(30p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) . 1250 4.8 \(em1.8 \(em1.4 \(em4.0 \(em1.3 \(em4.3 \(+-1.0 .T& cw(30p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) . 1600 4.7 \(em1.8 \(em1.4 \(em3.8 \(em1.3 \(em4.0 \(+-1.0 .T& cw(30p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) . 2000 4.7 \(em1.8 \(em1.2 \(em3.7 \(em1.3 \(em3.6 \(+-1.0 .T& cw(30p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) . 2500 4.7 \(em1.9 \(em1.0 \(em3.6 \(em1.1 \(em3.5 \(+-1.0 .T& cw(30p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) . 3150 4.7 \(em2.1 \(em1.1 \(em3.5 \(em1.2 \(em3.4 \(+-1.0 .T& cw(30p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) . 4000 4.5 \(em2.9 \(em1.5 \(em4.1 \(em1.3 \(em3.0 \(+-1.5 .T& cw(30p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) . 5000 3.8 \(em3.6 \(em1.5 \(em4.8 \(em1.3 \(em3.7 \(+-1.5 .T& cw(30p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) . 6300 3.2 \(em4.8 \(em1.8 \(em5.2 \(em1.7 \(em3.7 \(+-1.5 .T& cw(30p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) | cw(24p) . 8000 2.5 \(em5.2 \(em2.0 \(em6.1 \(em2.2 \(em4.2 \(+-1.5 .TE .LP \ua\d\u)\d The measurements on the human mouth at point 5 are quite scattered, so the response at this point is only indicatively provided and no tolerances are specified. .nr PS 9 .RT .ad r \fBTableau 4b/P.51 [T5.51], p. 23\fR .sp 9p .RT .ad b .RT .LP .bp .ce \fBH.T. [T6.51]\fR .ce TABLE\ 4c/P.51 .ce \fBNormalized free field response in the far field\fR .ps 9 .vs 11 .nr VS 11 .nr PS 9 .TS center box; cw(48p) | cw(48p) sw(48p) , ^ | c | c. Measurement point { Frequency range 100 Hz\(hy8 kHz } Response (dB) Tolerance (dB) _ .T& cw(48p) | cw(48p) | cw(48p) . 11 \(em24.0 \(+- | .0 .T& cw(48p) | cw(48p) | cw(48p) . 12 \(em24.0 \(+- | .0 .T& cw(48p) | cw(48p) | cw(48p) . 13 \(em24.0 \(+- | .0 .T& cw(48p) | cw(48p) | cw(48p) . 14 \(em24.0 \(+- | .0 .T& cw(48p) | cw(48p) | cw(48p) . 15 \(em24.0 \(+- | .0 .T& cw(48p) | cw(48p) | cw(48p) . 16 \(em24.0 \(+- | .0 .T& cw(48p) | cw(48p) | cw(48p) . 17 \(em24.0 \(+- | .0 _ .TE .nr PS 9 .RT .ad r \fBTableau 4c/P.51 [T6.51], p. 24\fR .sp 1P .RT .ad b .RT .ce \fBH.T. [T7.51]\fR .ce TABLE\ 5/P.51 .ce \fBRecommended microphone types for free\(hyfield measurements\fR .ps 9 .vs 11 .nr VS 11 .nr PS 9 .TS center box; cw(60p) | cw(60p) | cw(48p) . Measurement point Microphone size (max.) Microphone equalization _ .T& lw(60p) | lw(60p) | lw(48p) . 1, 2, 5, 6, 7, 8, 9, 10 1/4" Pressure .T& lw(60p) | lw(60p) | lw(48p) . 3, 4 1/2" Pressure .T& lw(60p) | lw(60p) | lw(48p) . 11, 12, 13, 14, 15, 16, 17 1" Free\(hyfield .T& lw(60p) | lw(60p) | lw(48p) . MRP 1/4" Pressure _ .TE .nr PS 9 .RT .ad r \fBTableau 5/P.51 [T7.51], p. 25\fR .sp 1P .RT .ad b .RT .sp 1P .LP 2.3.2 \fINormalized obstacle diffraction\fR .sp 9p .RT .PP The normalized obstacle diffraction of the artificial mouth is defined at three points on the references axis, as specified in Table\ 6/P.51. .PP \fINote\fR \ \(em\ If a compressor microphone is used with the mouth, it (or an equivalent dummy) shall be left in place while checking the normalized obstacle diffraction. .RT .sp 1P .LP 2.3.3 \fIMaximum deliverable sound pressure level\fR .sp 9p .RT .PP The artificial mouth shall be able to deliver steadily the acoustic artificial voice at sound pressure levels up to at least +6\ dBPa at the MRP. .RT .sp 1P .LP 2.3.4 \fIHarmonic distortion\fR .sp 9p .RT .PP When delivering sine tones, with amplitudes up to +6\ dBPa at the MRP, the harmonic distortion of the acoustic signal shall comply with the limits specified in Table\ 7/P.51. .bp .RT .ce \fBH.T. [T8.51]\fR .ce TABLE\ 6/P.51 .ce \fBNormalized obstacle diffraction\fR .ps 9 .vs 11 .nr VS 11 .nr PS 9 .TS center box; cw(36p) | cw(30p) sw(24p) sw(30p) sw(24p) , l | l | l | l | l. Frequency { Measurement point (Hz) 18 (dB) 19 (dB) 20 (dB) Tolerance (dB) } _ .T& cw(36p) | cw(30p) | cw(24p) | cw(30p) | cw(24p) . \ 100 32.2 27.0 21.7 \(+-2.0 .T& cw(36p) | cw(30p) | cw(24p) | cw(30p) | cw(24p) . \ 125 32.0 27.0 21.4 \(+-2.0 .T& cw(36p) | cw(30p) | cw(24p) | cw(30p) | cw(24p) . \ 160 32.0 27.3 21.4 \(+-2.0 .T& cw(36p) | cw(30p) | cw(24p) | cw(30p) | cw(24p) . \ 200 31.2 26.5 20.6 \(+-2.0 .T& cw(36p) | cw(30p) | cw(24p) | cw(30p) | cw(24p) . \ 250 31.2 26.5 20.5 \(+-2.0 .T& cw(36p) | cw(30p) | cw(24p) | cw(30p) | cw(24p) . \ 315 31.9 27.0 21.0 \(+-1.5 .T& cw(36p) | cw(30p) | cw(24p) | cw(30p) | cw(24p) . \ 400 31.8 27.0 20.9 \(+-1.5 .T& cw(36p) | cw(30p) | cw(24p) | cw(30p) | cw(24p) . \ 500 31.3 26.4 20.4 \(+-1.5 .T& cw(36p) | cw(30p) | cw(24p) | cw(30p) | cw(24p) . \ 630 31.0 26.0 20.0 \(+-1.5 .T& cw(36p) | cw(30p) | cw(24p) | cw(30p) | cw(24p) . \ 800 30.1 25.1 19.4 \(+-1.5 .T& cw(36p) | cw(30p) | cw(24p) | cw(30p) | cw(24p) . 1000 29.3 24.4 18.8 \(+-1.5 .T& cw(36p) | cw(30p) | cw(24p) | cw(30p) | cw(24p) . 1250 29.0 24.3 18.8 \(+-1.5 .T& cw(36p) | cw(30p) | cw(24p) | cw(30p) | cw(24p) . 1600 28.9 24.5 19.6 \(+-1.5 .T& cw(36p) | cw(30p) | cw(24p) | cw(30p) | cw(24p) . 2000 28.6 25.2 20.5 \(+-1.5 .T& cw(36p) | cw(30p) | cw(24p) | cw(30p) | cw(24p) . 2500 29.0 26.3 23.2 \(+-1.5 .T& cw(36p) | cw(30p) | cw(24p) | cw(30p) | cw(24p) . 3150 29.0 26.5 21.8 \(+-1.5 .T& cw(36p) | cw(30p) | cw(24p) | cw(30p) | cw(24p) . 4000 29.6 27.3 22.8 \(+-2.0 .T& cw(36p) | cw(30p) | cw(24p) | cw(30p) | cw(24p) . 5000 31.2 26.9 22.4 \(+-2.0 .T& cw(36p) | cw(30p) | cw(24p) | cw(30p) | cw(24p) . 6300 31.7 26.0 22.5 \(+-2.0 .T& cw(36p) | cw(30p) | cw(24p) | cw(30p) | cw(24p) . 8000 30.0 23.0 18.0 \(+-2.0 _ .TE .nr PS 9 .RT .ad r \fBTableau 6/P.51 [T8.P.51], p. 26\fR .sp 1P .RT .ad b .RT .LP .sp 2 .ce \fBH.T. [T9.51]\fR .ce TABLE\ 7/P.51 .ce \fBMaximum harmonic distortion of the artificial mouth\fR .ps 9 .vs 11 .nr VS 11 .nr PS 9 .TS center box; cw(48p) sw(48p) , c | c. Harmonic distorsion 2\un\d\ud\d harmonic 3\ur\d\ud\d harmonic _ .T& cw(48p) | cw(48p) | cw(48p) . 100 Hz\(hy125 Hz < | 0% < | 0% .T& cw(48p) | cw(48p) | cw(48p) . 125 Hz\(hy200 Hz < | 4% < | 4% .T& cw(48p) | cw(48p) | cw(48p) . 200 Hz\(hy8 Hz < | 1% < | 1% _ .TE .nr PS 9 .RT .ad r \fBTableau 7/P.51 [T9.P.51], p. 27\fR .sp 1P .RT .ad b .RT .LP .rs .sp 4P .ad r Blanc .ad b .RT .LP .bp .sp 1P .LP 2.3.5 \fILinearity\fR .sp 9p .RT .PP A positive or negative variation of 6\ dB of the feeding electrical signal shall produce corresponding variation of 6\ dB \(+- 0.5\ dB at the MRP for outputs in the range \(em14\ dBPa to +6\ dBPa. This requirement shall be met both for complex excitations, such as the artificial voice, and for sine tones in the range 100\ Hz to 8\ kHz. .RT .sp 2P .LP 2.4 \fIMiscellaneous\fR .sp 1P .RT .sp 1P .LP 2.4.1 \fIDelivery conditions\fR .sp 9p .RT .PP The artificial mouth shall be delivered by the maufacturer with the mechanical fixtures required to place the \(12" calibration microphone at the MRP, as specified in Recommendation\ P.64. Suitable markings shall be engraved on the device housing for identifying the vertical plane position. .PP Each artificial mouth shall be delivered with a calibration chart specifying the free\(hyfield radiation and obstacle diffraction characteristics as defined in this Recommendation .RT .sp 1P .LP 2.4.2 \fIStability\fR .sp 9p .RT .PP The device shall be stable and reproducible. .RT .sp 1P .LP 2.4.3 \fIStray magnetic field\fR .sp 9p .RT .PP Neither the d.c. nor the a.c. magnetic stray fields generated by the artificial mouth shall neither influence the signal transduced by microphones under test. .PP It is recommended that the a.c. stray field produced at the MRP shall lie below the curve formed by the following coordinates: .RT .ce 1000 .sp 1 \fIFrequency\fR .ce 0 .ce 1000 (Hz) \fIMagnetic output\fR .ce 0 .LP (dB A/m/Pa) \ \ | 00 \(em10 \ 1 | 00 \(em40 10 | 00 \(em40 .PP It is also recommended that the d.c. stray field at the MRP be lower than 400\ A/m. .PP \fINote\fR \ \(em\ The recommended d.c. stray field limit of 400\ A/m applies specifically to mouths intended for measuring electromagnetic microphones. For measuring other kinds of microphones, a higher limit of 1200\ A/m is acceptable. .RT .sp 1P .LP 2.4.4 \fIChoice of model\fR .sp 9p .RT .PP The results of measurements made on the BK\ 4219 source (no longer produced) and on the newer BK\ 4227, with its mouthpiece replaced by the UA\ 0899 conical adaptor, show a satisfactory agreement between the two models and compliance with the present Recommendation. The models actually used in tests shall always be stated, together with the results of measurements. .PP \fINote\fR \ \(em\ It should be noted that the BK 4227 artificial mouth generates a d.c. stray magnetic field at the MRP which exceeds 400\ A/m. It is then not suitable for measuring electromagnetic microphones. .RT .sp 2P .LP \fBReferences\fR .sp 1P .RT .LP [1] International Electrotechnical Commission Recommendation, \fIAn\fR \fIartificial ear of the wideband type for the calibration of earphones used\fR \fIin audiometry\fR , IEC Publication\ 318, Geneva, 1970. .LP [2] International Electrotechnical Commission Recommendation, \fIOccluded\fR \fIear simulator for the measurement of earphones coupled to the ear by\fR \fIear insert\fR , IEC Publication\ 711, Geneva, 1981. .bp .sp 2P .LP \fBRecommendation\ P.52\fR .RT .sp 2P .sp 1P .ce 1000 \fBVOLUME\ METERS\fR .EF '% Volume\ V\ \(em\ Rec.\ P.52'' .OF '''Volume\ V\ \(em\ Rec.\ P.52 %' .ce 0 .sp 1P .PP The CCITT considers that, in order to ensure continuity with previous practice, it is not desirable to modify the specification of the volume meter of the ARAEN employed at the CCITT Laboratory. .sp 1P .RT .PP Table\ 1/P.52 gives the principal characteristics of various measuring devices used for monitoring the volume or peak values during telephone conversations or sound\(hyprogramme transmissions. .PP The measurement of active speech level is defined in Recommendation\ P.56. Comparison of results using the active speech level meter and some meters described in this Recommendation can be found in Supplement No.\ 18. .PP \fINote\fR \ \(em\ Descriptions of the following devices are contained in the Supplements to \fIWhite\ Book\fR , Volume\ V: .RT .LP \(em ARAEN volume meter or speech voltmeter : Supplement No.\ 10\ [1]. .LP \(em Volume meter standardized in the United States of America, termed the \*Q VU meter \*U: Supplement\ No.\ 11\ [2]. .LP \(em Peak indicator used by the British Broadcasting Corporation: Supplement No.\ 12\ [3]. .LP \(em Maximum amplitude indicator Types\ U\ 21 and U\ 71 used in the Federal Republic of Germany: Supplement No.\ 13\ [4]. .PP The volume indicator, SFERT, which formerly was used in the CCITT Laboratory is described in\ [5]. .LP .sp 1P .LP \fIComparative tests with different types of volume meters\fR .sp 9p .RT .PP A note which appears in [6] gives some information on the results of preliminary tests conducted at the SFERT Laboratory to compare the volume indicator with different impulse indicators. .PP The results of comparative tests made in 1952 by the United Kingdom Post Office appear in Supplement\ No.\ 14\ [7]. Further results can be found in Supplement No.\ 18 of the present volume. .RT .LP .rs .sp 25P .ad r Blanc .ad b .RT .LP .bp .ce \fBH.T. [T1.52]\fR .ce TABLE\ 1/P.52 .ce \fBPrincipal characteristics of the various instruments used for monitoring the volume or peaks\fR .ce \fBduring telephone conversations or sound\(hyprogramme .ce transmissions\fR .ps 9 .vs 11 .nr VS 11 .nr PS 9 .TS center box; cw(84p) | cw(30p) | cw(36p) | cw(30p) | cw(48p) . Type of instrument { Rectifier characteristic (see Note\ 3) } { Time to reach 99% of final reading (milliseconds) } { Integration time (milliseconds) (see Note\ 4) } { Time to return to zero (value and definition) } _ .T& lw(84p) | cw(30p) | cw(36p) | cw(30p) | lw(48p) . { (1) \*QSpeech voltmeter\*U United Kingdom Post Office Type\ 3 (S.V.3) identical to the speech power meter of the l'ARAEN } 2 230 100 (approx.) equal to the integration time _ .T& lw(84p) | cw(30p) | cw(36p) | cw(30p) | lw(48p) . { (2) VU meter (United States of America) (see No te 1) } 1.0 to 1.4 300 165 (approx.) equal to the integration time _ .T& lw(84p) | cw(30p) | cw(36p) | cw(30p) | lw(48p) . { (3) Speech power meter of the \*QSFERT volume indicator\*U } 2 around 400 to 650 200 equal to the integration time _ .T& lw(84p) | cw(30p) | cw(36p) | cw(30p) | lw(48p) . { (4) Peak indicator for sound\(hyprogramme transmissions used by the British Broadcasting Corporation (BBC Peak Programme Meter) (see Note\ 2) } 1 10 (see Note\ 5) { 3 seconds for the pointer to fall to 26\ dB } _ .T& lw(84p) | cw(30p) | cw(36p) | cw(30p) | lw(48p) . { (5) Maximum amplitude indicator used by the Federal German Republic (type U 21) } 1 around 80 5 (approx.) { 1 or 2 seconds from 100% to 10% of the reading in the steady state } _ .T& lw(84p) | lw(30p) | lw(36p) | cw(30p) | lw(48p) . { (6) OIRT | (em | rogramme level meter: \ type A sound meter \ type B sound meter } { for both types: less than 300 ms for meters with pointer indication and less than 150\ ms for meters with light indication } 10 | (+- | 60 | (+- | 0 { for both types: 1.5 to 2 seconds from the 0\ dB point which is at 30% of the length of the operational section of the scale } .TE .LP \fINote\ 1\fR \ \(em\ In France a meter similar to the one defined in line (2) of the table has been standardized. .LP \fINote\ 2\fR \ \(em\ In the Netherlands a meter (type NRU\(hyON301) similar to the one defined in line (4) of the table has been standardized. .LP \fINote\ 3\fR \ \(em\ The number given in the column is the index \fIn\fR in the formula [\fIV\fR (output) | | fIV\fR (input) \fIn\fR ] applicable for each half\(hycycle. .LP \fINote\ 4\fR \ \(em\ The \*Qintegration time\*U was defined by the CCIF as the \*Qminimum period during which a sinusoidal voltage should be applied to the instrument for the pointer to reach to within 0.2 neper or nearly 2\ dB of the deflection which would be obtained if the voltage were applied indefinitely\*U. A logarithmic ratio of 2\ dB corresponds to a percentage of 79.5% and a ratio of 0.2 neper to a percentage of 82%. .LP \fINote\ 5\fR \ \(em\ The figure of 4\ milliseconds that appeared in previous editions was actually the time taken to reach 80% of the final reading with a d.c. step applied to the rectifying/integrating circuit. In a new and somewhat different design of this programme meter using transistors, the performance on programme remains substantially the same as that of earlier versions and so does the response to an arbitrary, quasi\(hyd.c. test signal, but the integration time, as here defined, is about 20% greater at the higher meter readings. .LP \fINote\ 6\fR \ \(em\ In Italy a sound\(hyprogramme meter with the following characteristics is in use: \ \ \ Rectifier characteristic: 1 (see Note 3). \ \ \ Time to reach 99% of final reading: approx. 20\ ms. \ \ \ Integration time: approx. 1.5 ms. \ \ \ Time to return to zero: approx. 1.5 s from 100% to 10% of the reading in the steady state. .nr PS 9 .RT .ad r \fBTableau 1/P.52 [T1.52], p. 28\fR .sp 1P .RT .ad b .RT .LP .bp .sp 2P .LP \fBReferences\fR .sp 1P .RT .LP [1] \fIARAEN volume meter or speech voltmeter\fR , White Book, Vol.\ V, Supplement No.\ 10, ITU, Geneva,\ 1969. .LP [2] \fIVolume meter standardized in the United States of America, termed\fR \fIVU meter\fR , White Book, Vol.\ V, Supplement No.\ 11, ITU, Geneva,\ 1969. .LP [3] \fIModulation meter used by the British Broadcasting Corporation\fR , White Book, Vol.\ V, Supplement No.\ 12, ITU, Geneva,\ 1969. .LP [4] \fIMaximum amplitude indicators, types U 21 and U 71 used in the\fR \fIFederal Republic of Germany\fR , White Book, Vol.\ V, Supplement No.\ 13, ITU, Geneva,\ 1969. .LP [5] \fISFERT volume indicator\fR , Red Book, Vol\ V, Annex\ 18, Part\ 2, ITU, Geneva,\ 1962. .LP [6] CCIF \fIWhite Book\fR , Vol. IV, pp. 270\(hy293, ITU, Bern,\ 1934. .LP [7] \fIComparison of the readings given on conversational speech by\fR \fIdifferent types of volume meter\fR , White Book, Vol.\ V, Supplement No.\ 14, ITU, Geneva,\ 1969. \v'2P' .LP .sp 2P .LP \fBRecommendation\ P.53\fR .RT .sp 2P .sp 1P .ce 1000 \fBPSOPHOMETERS\ (APPARATUS\ FOR\ THE\ OBJECTIVE\ MEASUREMENT | fR \fBOF\ CIRCUIT\ NOISE)\fR .EF '% Volume\ V\ \(em\ Rec.\ P.53'' .OF '''Volume\ V\ \(em\ Rec.\ P.53 %' .ce 0 .sp 1P .ce 1000 Refer to Recommendation O.41, CCITT Blue Book, Volume IV, Fascicle IV.4 .sp 1P .RT .ce 0 .sp 1P .sp 2P .LP \fBRecommendation\ P.54\fR .RT .sp 2P .sp 1P .ce 1000 \fBSOUND\ LEVEL\ METERS\fR | \fB(APPARATUS\ FOR\ THE\ OBJECTIVE\ MEASUREMENT\ OF\ ROOM\ NOISE)\fR .EF '% Volume\ V\ \(em\ Rec.\ P.54'' .OF '''Volume\ V\ \(em\ Rec.\ P.54 %' .ce 0 .sp 1P .ce 1000 \fI(amended at Mar del Plata, 1968 and Geneva, 1972)\fR .sp 9p .RT .ce 0 .sp 1P .PP The CCITT recommends the adoption of the sound level meter specified in\ [1] in conjunction, for most uses, with the octave, half, and third octave filters in accordance with\ [2]. \v'1P' .sp 1P .RT .LP .sp 2P .LP \fBReferences\fR .sp 1P .RT .LP [1] International Electrotechnical Commission Standard, \fISound level\fR \fImeters\fR , IEC Publication 651 (179), Geneva,\ 1979. .LP [2] International Electrotechnical Recommendation, \fIOctave, half\(hyoctave\fR \fIand third\(hyoctave band filters intended for the analysis of sounds and\fR \fIvibrations\fR , IEC Publication 225, Geneva,\ 1966. .sp 2P .LP \fBRecommendation\ P.55\fR .RT .sp 2P .sp 1P .ce 1000 \fBAPPARATUS\ FOR\ THE\ MEASUREMENT\ OF\ IMPULSIVE\ NOISE\fR .EF '% Volume\ V\ \(em\ Rec.\ P.55'' .OF '''Volume\ V\ \(em\ Rec.\ P.55 %' .ce 0 .sp 1P .ce 1000 \fI(Mar del Plata, 1968)\fR .sp 9p .RT .ce 0 .sp 1P .PP Experiments have shown that clicks or other impulsive noises which occur in telephone calls come from a number of sources, such as faulty construction of the switching equipment, defective earthing at exchanges and electromagnetic couplings in exchanges or on the line. .sp 1P .RT .PP There is no practical way of assessing the disturbing effect of isolated pulses on telephone calls. A rapid succession of clicks is annoying chiefly at the start of a call. It is probable that these series of clicks affect data transmission more than they do the telephone call and that connections capable of transmitting data, according to the noise standards now under study, will also be satisfactory for speech transmission. .bp .PP In view of these considerations, the CCITT recommends that Administrations use the impulsive noise counter defined in Recommendation\ O.71\ [1] for measuring the occurrence of series of pulses on circuits for both speech and data transmission. .PP \fINote\fR \ \(em\ At the national level, Administrations might continue to study whether the use of this impulsive noise counter is sufficient to ensure that the conditions necessary to ensure good quality in telephone connections are met. In those studies, Administrations may use whatever measuring apparatus they consider most suitable\ \(em for example a psophometer with an increased overload factor\ \(em but the CCITT does not envisage recommending the use of such an instrument. .RT .sp 2P .LP \fBReference\fR .sp 1P .RT .LP [1] CCITT Recommendation \fISpecification for an impulsive noise\fR \fImeasuring instrument for telephone\(hytype circuits\fR , Vol.\ IV, Rec.\ O.71. .sp 2P .LP \fBRecommendation\ P.56\fR .RT .sp 2P .sp 1P .ce 1000 \fBOBJECTIVE\ MEASUREMENT\ OF\ ACTIVE\ SPEECH\ LEVEL\fR .EF '% Volume\ V\ \(em\ Rec.\ P.56'' .OF '''Volume\ V\ \(em\ Rec.\ P.56 %' .ce 0 .sp 1P .ce 1000 \fI(Melbourne, 1988)\fR .sp 9p .RT .ce 0 .sp 1P .LP \fB1\fR \fBIntroduction\fR .sp 1P .RT .PP The CCITT considers it important that there should be a standardized method of objectively measuring speech level, so that measurements made by different Administrations may be directly comparable. Requirements of such a meter are that it should measure active speech level and should be independent of operator interpretation. .PP In this Recommendation, a meter is a complete unit that includes the input circuitry, filter (if necessary), processor and display. The processor includes the algorithm of the detection method. .PP In its present form, this meter can safely be used for laboratory experiments or can be used with care on operational circuits. Further study is continuing on: .RT .LP a) how the meter can be used on 2\(hywire and 4\(hywire circuits to determine who is talking and whether it is an echo, and .LP b) how such an instrument can discriminate between speech and signalling, for example. .PP The method described herein maintains maximum comparability and continuity with past work, provided suitable monitoring is used, e.g.\ an operator performing the monitoring function. In particular, the new method yields data and conclusions compatible with those that have established the conventional value (22\ microwatts) of speech power at the input to the 4\(hywire point of the international circuit according to Recommendation\ G.223. A method using operator monitoring can be found in Annex\ A. .PP This Recommendation describes a method that can be easily implemented using current technology. It also acts as a reference against which other methods can be compared. The purpose of this Recommendation is not to exclude any other method but to ensure that results from different methods give the same result. .PP Active speech level shall be measured and reported in decibels relative to a stated reference according to the methods described below, namely, .RT .LP \(em \fIMethod\ A\fR \ \(em\ measuring a quantity called speech volume, used for the purpose of real\(hytime control of speech level (see \(sc\ 4); .LP \(em \fIMethod\ B\fR \ \(em\ measuring a quantity called active speech level, used for other purposes (see \(sc\ 5). .bp .PP Comparison of readings given by meters of methods A and B can be found in Supplement\ No.\ 18. .PP \fINote\fR \ \(em\ This meter cannot be used to determine peak levels but sufficient information exists\ [1] giving the instantaneous peak/r.m.s. ratio, provided the signal has not been restricted or modified in any way, e.g.\ peak clipping. .RT .sp 2P .LP \fB2\fR \fBTerminology\fR .sp 1P .RT .PP The recommended terminology is as follows: .RT .LP \fIspeech\ volume\fR until now used interchangeably with \fIspeech level\fR , should in future be used exclusively to denote a value obtained by method\ A; .LP \fIactive\ speech\ level\fR should be used exclusively to denote a value obtained by method\ B; .LP \fIspeech\ level\fR should be used as a general term to denote a value obtained by any method yielding a value expressed in decibels relative to a stated reference. .PP The definitions of these terms [2], and other related terms such as those for the meters themselves\ [3], should be adjusted accordingly. .sp 2P .LP \fB3\fR \fBGeneral\fR .sp 1P .RT .sp 1P .LP 3.1 \fIElectrical, acoustic and other levels\fR .sp 9p .RT .PP This Recommendation deals primarily with electrical measurements yielding results expressed in terms of electrical units, generally decibels relative to an appropriate reference value such as one volt. However, if the calibration and linearity of the transmission system in which the measurement takes place are assured, it is possible to refer the result backwards or forwards from the measurement point to any other point in the system, where the signal may exist in some non\(hyelectrical form (e.g.,\ acoustical). Power is proportional to squared voltage in the electrical domain, squared sound pressure in the acoustical domain, or the digital equivalent of either of these in the numerical domain, and the reference value must be of the appropriate kind (1\ volt, 1\ pascal, reference acoustic pressure equal to 20\ micropascals, or any other stated unit, as the case may be). .RT .sp 1P .LP 3.2 \fIUniversal requirements\fR .sp 9p .RT .PP For speech\(hylevel measurements of all types, the information reported should include: the designation of the measuring system, the method used (A, B, or B\(hyequivalent as explained in \(sc\ 4, or other specified method), the quantity observed, the units, and other relevant information such as the margin value (explained below) where applicable. .PP All the relevant conditions of measurement should also be stated, such as bandwidth, position of the measuring instrument in the communication circuit, and presence or absence of a terminating impedance. Apart from the stated band limitation intended to exclude spurious signals, no frequency weighting should be introduced in the measurement path (as distinct from the transmission path). .RT .sp 1P .LP 3.3 \fIAveraging\fR .sp 9p .RT .PP Where an average of several readings is reported, the method of averaging should be stated. The \fImean level\fR (mean speech volume or mean active speech level), formed by taking the mean of a number of decibel values, should be distinguished from the \fImean power\fR , formed by converting a number of decibel values to units of power, taking the mean of these, and then optionally restoring the result to decibels. .PP Any correction that has been applied should be mentioned, together with the facts or assumptions on which any such correction is based. For example, in loading calculations, when the active levels or durations of the individually measured portions of speech differ widely, 0.115 \(*s\u2\d is commonly added to the median or mean level in order to estimate the mean power, on the grounds that the distribution of mean active speech levels (dB\ values) is approximately Gaussian. .bp .RT .sp 2P .LP \fB4\fR \fBMethod A: immediate indication of speech volume for real\(hytime\fR \fBapplications\fR .sp 1P .RT .PP Measurement of speech volume for rapid real\(hytime control or adjustment of level by a human observer should be accomplished in the traditional manner by means of one of the devices listed in Recommendation\ P.52. .PP The choice of meter and the method of interpreting the pointer deflexions should be appropriate to the application, as in Table\ 1/P.56. .PP Values obtained by method A should be reported as \fIspeech volume\fR ; the meter employed, the quantity observed, and the units in which the result is expressed, should be stated. .RT .ce \fBH.T. [T1.56]\fR .ce TABLE\ 1/P.56 .ps 9 .vs 11 .nr VS 11 .nr PS 9 .TS center box; cw(80p) | cw(74p) | cw(74p) . Application Meter Quantity observed _ .T& lw(80p) | lw(74p) | lw(74p) . { Control of vocal level in live\(hyspeech loudness balances } ARAEN volume meter (SV3) Level exceeded in 3 s .T& lw(80p) | lw(74p) | lw(74p) . Avoidance of peak limiting Peak programme meter Highest reading .T& lw(80p) | lw(74p) | lw(74p) . { Maintenance of optimum level in making magnetic tape recording } VU meter { Average of peaks (excluding most extreme) } _ .TE .nr PS 9 .RT .ad r \fBTable 1/P.56 [T1.56], p.\fR .sp 1P .RT .ad b .RT .sp 2P .LP .sp 1 \fB5\fR \fBMethod B: active speech level for other applications than those\fR \fBmentioned in method\ A\fR .sp 1P .RT .sp 1P .LP 5.1 \fIPrinciple of measurement\fR .sp 9p .RT .PP Active speech level is measured by integrating a quantity proportional to instantaneous power over the aggregate of time during which the speech in question is present (called the active time), and then expressing the quotient, proportional to total energy divided by active time, in decibels relative to the appropriate reference. .PP The mean power of a speech signal when known to be present can be estimated with high precision from samples taken at a rate far below the Nyquist rate\fR . However, the all\(hyimportant question is what criterion should be used to determine when speech is present. .PP Ideally, the criterion should indicate the presence of speech for the same proportion of time as it appears to be present to a human listener, excluding noise that is not part of the speech (such as impulses, echoes, and steady noise during periods of silence), but including those brief periods of low or zero power that are not perceived as interruptions in the flow of speech\ [4]. It is not essential that the detector should operate exactly in synchronism with the beginnings and ends of utterances as perceived: there may be a delay in both operating and releasing, provided that the total active time is measured correctly. For this reason, complex real\(hytime voice\(hyactivity detectors depending on sampling at the Nyquist rate, such as those that have been successfully used in digital speech interpolation , are not necessarily the most suitable for this application. Their function is to indicate when a channel is available for transmission of information: this state does not always coincide with the absence of speech; on the one hand, it may occur during short intervals that ought to be considered part of the speech, and on the other hand, it may be delayed long after the end of an utterance (for reasons of convenience in the allocation of channels, for example). .PP This Recommendation describes the detection method that meets the requirements. The method involves applying a signal\(hydependent threshold which cannot be specified in advance, so that accurate results cannot be guaranteed while the measurement is actually in progress; despite that, by accumulating sufficient information during the process, it is possible to apply the correct threshold retrospectively, and hence to output a correct result almost as soon as the measurement finishes. Continuous adaptation of the threshold level in real time appears to yield similar results in simple cases, but further study is needed to find out how far this conclusion can be generalized. .bp .RT .sp 1P .LP 5.2 \fIDetails of realization\fR .sp 9p .RT .PP The algorithm for method B is as follows. .PP Let the speech signal be sampled at a rate not less than \fIf\fR samples per second, and quantized uniformly into a range of at least 2\u1\d\u2\d quantizing intervals (i.e.\ using 12\ bits per sample including the sign). .PP \fINote\fR \ \(em\ This requirement ensures that the dynamic range for instantaneous voltage is at least 66\ dB, but two factors combine to make the range of measurable active speech levels about 30\ dB less than this: .RT .LP 1) Allowance must be made for the ratio of peak power to mean power in speech, namely about 18\ dB where the probability of exceeding that value is 0.001. .LP 2) Envelope values down to at least 16\ dB below the mean active level must be calculated: these values may be fractional, but will not be accurate enough if computed from a quantizing interval much exceeding twice the sample value; that is to say, it should not be expected that an active speech level less than about 10\ dB above the quantizing interval would be measurable. .PP Let the successive sample values be denoted by \fIx\fR\d\fIi\fR\uwhere \fIi\fR \ =\ 1, 2, 3, | | | Let the time interval between consecutive samples be \fIt\fR \ =\ 1/ \fIf\fR \ seconds. .PP Other constants required are: .RT .LP \fIv\fR (volts/unit) scale factor of the analogue\(hydigital converter .LP \fIT\fR time constant of smoothing in seconds .LP \fIg\fR \ =\ exp\ (\(em\fIt\fR /\fIT\fR ) coefficient of smoothing .LP \fIH\fR hangover time in seconds .LP \fII\fR \ =\ \fIH\fR / \fIt\fR rounded up to next integer .LP \fIM\fR margin in dB, difference between threshold and active speech level. .PP Let the input samples be subjected to two distinct processes, 1 and 2. .sp 1P .LP \fIProcess 1\fR .sp 9p .RT .PP Accumulate the number of samples \fIn\fR , the sum \fIs\fR , and the sum of squares, \fIsq\fR : \v'6p' .RT .sp 1P .ce 1000 \fIn\fR\d\fIi\fR\u\ =\ \fIn\fR\d\fIi\fR\\d\\u(em\d1\u\ +\ 1 .ce 0 .sp 1P .ce 1000 \fIs\fR\d\fIi\fR\u\ =\ \fIs\fR\d\fIi\fR\\d\\u(em\d1\u\ +\ \fIx\fR\d\fIi\fR\u .ce 0 .sp 1P .ce 1000 \fIsq\fI\d\fIi\fR\u\ =\ \fIsq\fI\d\fIi\fR\\d\\u(em\d1\u\ +\ \fIx\fR $$Ei:2:\fIi\fR _ .ce 0 .sp 1P .LP .sp 1 where \fIs\fR\d0\u, \fIsq\fR\d0\uand \fIn\fR\d0\u(initial values) are zero. .sp 1P .LP \fIProcess 2\fR .sp 9p .RT .PP Perform two\(hystage exponential averaging on the rectified signal values: \v'6p' .RT .sp 1P .ce 1000 \fIp\fR\d\fIi\fR\u\ =\ \fIg\fR | (mu | fIp\fR\d\fIi\fR\\d\\u(em\d1\u+ (1\(em\fIg\fR ) | (mu | | fIx\fR\d\fIi\fR\u | .ce 0 .sp 1P .ce 1000 \fIq\fR\d\fIi\fR\u\ =\ \fIg\fR | (mu | fIq\fR\d\fIi\fR\\d\\u(em\d1\u+ (1\(em\fIg\fR ) | (mu | fIp\fR\d\fIi\fR\u .ce 0 .sp 1P .LP .sp 1 where \fIp\fR\d0\uand \fIq\fR\d0\u(initial values) are zero. .PP The sequence \fIq\fR\d\fIi\fR\uis called the envelope, \fIp\fR\d\fIi\fR\udenotes intermediate quantities. .PP Let a series of fixed threshold voltages \fIc\fR\d\fIj\fR\ube applied to the envelope. These should be spaced in geometric progression, at intervals of not more than 2:1 (6.02\ dB), from a value equal to about half the maximum code down to a value equal to one quantizing interval or lower. Let a corresponding series of activity counts\ \fIa\fR\d\fIj\fR\u, and a corresponding series of hangover counts, \fIh\fR\d\fIj\fR\u, be maintained: .PP for each value of \fIj\fR in turn, .RT .LP if \fIq\fR\d\fIi\fR\u> \fIc\fR\d\fIj\fR\uor \fIq\fR\d\fIi\fR\u= \fIc\fR\d\fIj\fR\u, then add 1 to \fIa\fR\d\fIj\fR\u and set \fIh\fR\d\fIj\fR\uto 0; .LP if \fIq\fR\d\fIi\fR\u< \fIc\fR\d\fIj\fR\uand \fIh\fR\d\fIj\fR\u< \fII\fR , then add 1 to \fIa\fR\d\fIj\fR\uand add 1 to \fIh\fR\d\fIj\fR\u; .LP if \fIq\fR\d\fIi\fR\u< \fIc\fR\d\fIj\fR\uand \fIh\fR\d\fIj\fR\u= \fII\fR , \fIthen\fR do nothing. .bp .PP In the first case, the envelope is at or above the \fIj\fR th threshold, so that the speech is active as judged by that threshold level. In the second case, the envelope is below the threshold, but the speech is still considered active because the corresponding hangover has not yet expired. In the third case, the speech is inactive as judged by the threshold level in question. .PP Initially, all the \fIa\fR\d\fIj\fR\uvalues are set equal to zero, and the \fIh\fR\d\fIj\fR\uvalues set equal to \fII\fR . .PP It should be noted that the suffix \fIi\fR in all the above cases is needed only to distinguish current values from previous values of accumulated quantities; for example, there is no need to hold more than one value of \fIsq\fR , but this value is continually updated. At the end of the measurement, therefore, the suffixes can be omitted from \fIs\fR , \fIsq\fR , \fIn\fR , \fIp\fR , and \fIq\fR . .PP Let all these processes continue until the end of the measurement is signalled. Then evaluate the following quantities: \v'6p' .RT .sp 1P .ce 1000 Total time = \fIn\fR \(mu \fIt\fR .ce 0 .sp 1P .ce 1000 Long\(hyterm power = \fIsq\fR \(mu \fIv\fR \u2\d/\fIn\fR . .ce 0 .sp 1P .LP .sp 1 .PP \fINote\fR \ \(em\ If it is suspected that there may be a significant d.c. offset, this may be estimated as \fIs\fR | (mu | fIv\fR /\fIn\fR , and used to evaluate a more accurate value of long\(hyterm power (a.c.) as \fIv\fR \u2\d [\fIsq\fR /\fIn\fR \(em(\fIs\fR /\fIn\fR )\u2\d]. However, in this case, the effect of the offset on the envelope must also be taken into account and appropriate corrections made. .PP For each value of \fIj\fR , the active\(hypower estimate is equal to \fIsq\fR | (mu | fIv\fR \u2\d/\fIa\fR\d\fIj\fR\u. .PP At this stage, the powers are in volts squared per unit time. Now express the long\(hyterm power and the active\(hypower estimates in decibels relative to the chosen reference voltage\ \fIr\fR : .RT .LP Long\(hyterm level, \fIL\fR = 10 log (\fIsq\fR | (mu | fIv\fR \u2\d/\fIn\fR )\(em20 log \fIr\fR .LP Active\(hylevel estimate, \fIA\fR\d\fIj\fR\u= 10 log (\fIsq\fR | (mu | fIv\fR \u2\d/\fIa\fR\d\fIj\fR\u) \(em20 log \fIr\fR .LP Threshold, \fIC\fR\d\fIj\fR\u= 20 log (\fIc\fR\d\fIj\fR\u | (mu | fIv\fR )\(em20 log \fIr\fR .PP For each value of \fIj\fR , compare the difference \fIA\fR\d\fIj\fR\u\(em \fIC\fR\d\fIj\fR\uwith the margin \fIM\fR , and determine (if necessary, by interpolation on a decibel scale between two consecutive values of \fIA\fR\d\fIj\fR\uand of \fIC\fR\d\fIj\fR\u) the true active level\ \fIA\fR and corresponding threshold\fIC\fR for which \fIA\fR \(em\fIC\fR \ =\ \fIM\fR . If one of the pairs of values\ \fIA\fR\d\fIj\fR\uand \fIC\fR\d\fIj\fR\ufulfils this condition exactly, then the true activity factor is \fIa\fR\d\fIj\fR\u/\fIn\fR , but in all cases it can be evaluated from the expression\ 10 \u(\fIL\fR \(em\fIA\fR )/10 \d. .PP For simplicity, the algorithm has been defined in terms of a digital process, but any equivalent process (one implemented on a programmable analogue computer, for example) should also be considered as fulfilling the definition. .RT .sp 1P .LP 5.3 \fIValues of the parameters\fR .sp 9p .RT .PP The values of the parameters given in Table 2/P.56 should be used. They have been found suitable for the purpose and have stood the test of many years of application by various organizations\ [4]. .RT .ce \fBH.T. [T2.56]\fR .ce TABLE\ 2/P.56 .ps 9 .vs 11 .nr VS 11 .nr PS 9 .TS center box; cw(48p) | cw(72p) | cw(48p) . Parameter Value Tolerance _ .T& cw(48p) | cw(72p) | cw(48p) . \fIf\fR 694 samples/second not less than 600 .T& cw(48p) | cw(72p) | cw(48p) . \fIT\fR 0.03 seconds \(+- | %\ | .T& cw(48p) | cw(72p) | cw(48p) . \fIH\fR 0.2 seconds\ \(+- | %\ | .T& cw(48p) | cw(72p) | cw(48p) . \fIM\fR 15.9 dB { \(+- | .5\ } .TE .LP \fINote\fR \ \(em\ The value \fIM\fR \ =\ 15\ dB might appear to be implied in [4], but the threshold level there described equals the \fImean absolute voltage\fR of a sine wave whose \fImean power\fR is 15\ dB below the reference. The difference of 0.9\ dB is 20\ log (voltage/mean absolute voltage) for a sine wave. .nr PS 9 .RT .ad r \fBTable 2/P.56 [T2.56], p.\fR .sp 1P .RT .ad b .RT .LP .bp .PP The result of a measurement made by means of the above algorithm with parameter values conforming to the above restrictions should be reported as \fIactive speech level\fR , and the system should be described as \fIusing\fR \fImethod\ B\fR of this Recommendation. .PP \fINote\fR \ \(em\ Where noise levels are very high, as they are for example in certain vehicles or in certain radio systems, it is often desirable to set the threshold higher (i.e.\ use a smaller margin) in order to exclude the noise. This may be done provided the margin is also reported. The result of a such a measurement should be reported as \fIactive speech level with margin\ M\fR , and the measurement system described as \fIusing method\ B with margin\ M\fR . .PP The activity factor should preferably be reported as a percentage, with a specification of the margin value if this is outside the standard range. .RT .sp 2P .LP \fB6\fR \fBApproximate equivalents of method B\fR .sp 1P .RT .PP Other methods under development use a broadly similar principle of measurement but depart in detail from the algorithm given above. .PP It is not the intention to exclude any such method, provided it is convincingly shown by experimental evidence to yield results consistent with those obtained by method\ B in a sufficiently wide range of conditions. For this reason, a class of methods called \fIB\(hyequivalent methods\fR is recognized. .PP A B\(hyequivalent method of speech\(hylevel measurement is defined as any method that satisfies the following test in all respects. .PP Measurements shall be carried out simultaneously by the method in question and by method\ B on two or more samples of speech in every combination of the following variables: .RT .PP Voices one male and one female voice .PP Speech material a list of independent sentences, a passage of continuous speech, and one channel of a conversation, each lasting at least 20\ s (active time) .PP Bandwidth 300 to 3400 Hz and 100 to 8000\ Hz .PP Added noise flat within the measurement band at levels (\fIM\fR \ +\ 5)\ dB and (\fIM\fR \ +\ 25)\ dB below the active speech level, where \fIM\fR (the margin) is normally 15.9\ dB, but smaller in high\(hynoise applications .PP Levels at intervals of 10 dB over the range claimed for the system in question. .PP From the results, 95% confidence limits for the difference between the level given by the method in question and the active speech level given by method\ B shall be calculated for each of the above 24\ combinations. .PP If, for every combination, the upper confidence limit of this difference is not higher than +1\ dB and the lower confidence limit is not lower than \(em1\ dB, then the method shall be deemed to be a B\(hyequivalent method. .PP This verification procedure is valid until a suitable speech\(hylike signal has been recommended and found suitable to perform this function (see Questions\ 12/XII and 13/XII). .PP Further, a method qualifies as B\(hyequivalent if it gives results that fall within the specified limits when corrected by the addition of a fixed constant, known in advance of the measurement and not dependent on any feature of the speech signal (except possibly the bandwidth if this is known independently). .PP The results of measurements by such a method should be reported as \fIB\(hyequivalent active speech level\fR , and the activity factor as \fIB\(hyequivalent activity factor\fR . .PP Certain measurement systems with fixed thresholds (instead of the retrospectively selected threshold as described in \(sc\ 5.3), may still give an active speech level according to the definition in cases where the margin turns out to be within the specified limits. .RT .sp 2P .LP \fB7\fR \fBSpecification\fR .sp 1P .RT .PP A speech voltmeter normally consists of three parts, namely: .RT .LP i) input circuitry, .LP ii) filter, and .LP iii) processor and display. .bp .PP Figure 1/P.56 shows a typical layout of such a meter. .PP Whether all or part of the components that make up i) and ii) are used will depend on where the meter is to be used. However, it is recommended that a meter for general usage should conform to this specification. .RT .LP .rs .sp 16P .ad r \fBFigure 1/P.56, p.\fR .sp 1P .RT .ad b .RT .sp 2P .LP 7.1 \fISignal input\fR .sp 1P .RT .sp 1P .LP 7.1.1 \fIInput impedance\fR .sp 9p .RT .PP The meter is normally used as a bridging instrument and, if so, its impedance must be high so as not to influence the results. An impedance of 100\ kohm is recommended. .RT .sp 1P .LP 7.1.2 \fICircuit protection\fR .sp 9p .RT .PP It is recommended that the meter should withstand voltages far in excess of those in the measurement range as accidental usage may occur and the circuit under test may have higher voltages than anticipated. Examples of this are mains 110/240\ V or 50\ V exchange voltages. .RT .sp 1P .LP 7.1.3 \fIConnection\fR .sp 9p .RT .PP It is recommended that the connection should be independent of polarity. The meter should have the facility of connection in both balanced and unbalanced modes. .RT .sp 1P .LP 7.2 \fIFilter\fR .sp 9p .RT .PP When measuring the speech levels of circuits in the conventional telephony speech bandwidth (300\(hy3400\ Hz), it is often practical to use a filter that will reject unwanted hum, tape noise,\ etc. yet pass the frequencies of greatest interest without affecting the speech level measurement. The set of coordinates in Table\ 3/P.56 meet these requirements. Figure\ 2/P.56 gives an example of such a filter. .PP The following noise requirements should also be met: .RT .LP Output noise level: .LP wideband (20\(hy20 | 00 Hz) <\(em75 dBm .LP telephone weighted\fR <\(em90 dBmp. .bp .ce \fBH.T. [T3.56]\fR .ce TABLE\ 3/P.56 .ps 9 .vs 11 .nr VS 11 .nr PS 9 .TS center box; cw(48p) | cw(108p) . Frequency (Hz) (dB) _ .T& cw(48p) | cw(108p) . { \fIUpper limit response relative to 1 kHz\fR } .T& cw(48p) | lw(108p) . \ \ | 16 \(em49.75 .T& cw(48p) | lw(108p) . \ \ | 60 \ +0.25 .T& cw(48p) | lw(108p) . \ 7 | 00 \ +0.25 .T& cw(48p) | lw(108p) . 70 | 00 \(em49.75 _ .T& lw(48p) | cw(108p) . { \fILower limit response relative to 1 kHz\fR } .T& cw(48p) | lw(108p) . Under 200 \ \(em\(if .T& cw(48p) | lw(108p) . 200 \ \(em0.25 .T& cw(48p) | lw(108p) . 5500 \ \(em0.25 .T& cw(48p) | lw(108p) . Over 5500 \ \(em\(if _ .TE .nr PS 9 .RT .ad r \fBTableau 3/P.56 [T3.56], p. 32\fR .sp 1P .RT .ad b .RT .LP .rs .sp 35P .ad r \fBFigure 2/P.56, p. 33\fR .sp 1P .RT .ad b .RT .LP .bp .sp 2P .LP 7.3 \fISpeech level measurements\fR .sp 1P .RT .sp 1P .LP 7.3.1 \fIWorking range for speech\fR .sp 9p .RT .PP The recommended working range for speech refers to the active level and should be at least 0 to \(em30\ dBV. .PP \fINote\ 1\fR \ \(em\ The dynamic range of the instrument will depend on the analogue\(hyto\(hydigital converter (ADC). If the ADC is set to a 10\ volt maximum input level (i.e.\ the all 1\ code) and 12\(hybit arithmetic is used, based on the most significant bits from the ADC, then 1\ sign bit +11\ bits magnitude provides a 66\ dB range. The measurable range sill be some 35\ dB less when allowance is made for the peak/mean ratio of 18\ dB (peaks of speech will only exceed the maximum input level for less than 0.1% of the time\ [1]) and margin\ \fIM\fR of 15.9\ dB; the largest speech signal is therefore around +2\ dBV with a smallest speech signal of \(em30\ dBV. However, the practical working range has been found to be +5\ dBV to \(em35\ dBV. .PP \fINote\ 2\fR \ \(em\ To cater for a wider range of speech levels, an attenuator or low noise amplifier may be inserted in the input circuitry. Care must be exercised to maintain the input requirements of \(sc\ 7.1.1. .RT .sp 1P .LP 7.3.2 \fILinearity\fR .sp 9p .RT .PP The linearity of the meter is specified for r.m.s. sine wave measurements since for speech the algorithm is correct by definition, and only the precision or repeatability of measurements need to be considered; this is specified in \(sc\ 7.3.4. .PP Assuming that: .RT .LP a) the measurement is for a minimum period of 5 s, .LP b) the sine wave is present for the whole of the measurement period, the linearity specified is: .ce 1000 .sp 1 \fIFrequency\fR .ce 0 .ce 1000 (Hz) \fIInput range\fR .ce 0 .ce 1000 (dBV) \fIAccuracy\fR .ce 0 .LP (dB) \ 100 to 4000 +16 to \(em45 \(+- 0.1 4000 to 8000 +13 to \(em45 \(+- 0.3 .PP \fINote\fR \ \(em\ The maximum input for the frequency range 4000 to 8000 Hz should ideally be the same as for 100 to 4000\ Hz, but practical limitations in commercially available ADCs (due to the limited \*Q slewing rate \*U of the input circuitry) means that this cannot be obtained. However, as the power in the 8000\ Hz band for speech is 30\ dB down on the level at 500\ Hz it is likely that any error will be extremely small. .sp 1P .LP 7.3.3 \fIFrequency response\fR .sp 9p .RT .PP The frequency response of the meter without filter when measured in the frequency range 100 to 8000\ Hz should be flat within the specified tolerances: .RT .ce 1000 .sp 1 \fIFrequency\fR .ce 0 .ce 1000 (Hz) \fIInput range\fR .ce 0 .ce 1000 (dBV) \fITolerance\fR .ce 0 .LP (dB) \ 100 to 4000 +16 to \(em45 \(+- 0.2 4000 to 8000 +13 to \(em45 \(+- 0.4 .PP \fINote\ 1\fR \ \(em\ Tolerances are referred to 1000 Hz. .PP \fINote\ 2\fR \ \(em\ The note of 7.3.2 applies. .RT .sp 1P .LP 7.3.4 \fIRepeatability\fR .sp 9p .RT .PP When a given speech signal, having its active level within the recommended working range and its duration not less than 5\ s active time, is repeatedly measured on the same meter, the active\(hylevel readings shall have a standard deviation of less than 0.1\ dB. .bp .RT .sp 2P .LP \fB8\fR \fBRoutine calibration of method\(hyB meter\fR .sp 1P .RT .PP The following routine calibration procedures, using non\(hyspeech\(hylike signals, will ensure that the meter is performing satisfactorily. The calibration can only be made using speech. .PP A suitable circuit arrangement is shown in Figure 3/P.56. Wherever suitable, measurements should be made with two settings of the attenuator, 0 and 20\ dB. All source signals are from a 600\ ohm source and the meter is terminated in 600\ ohm. .RT .LP .rs .sp 11P .ad r \fBFigure 3/P.56, p.\fR .sp 1P .RT .ad b .RT .sp 1P .LP 8.1 \fINo input signal\fR .sp 9p .RT .PP With no input applied the meter should display the following results: .RT .sp 1P .LP .sp 1 Activity factor 0 +\ 0.5% Active\(hylevel < \(em60 dBV Long\(hyterm level < \(em60 dBV 8.2 \fIContinuous tone\fR .sp 9p .RT .PP With a 1000 Hz sine wave calibrated to be 0 dBV, the meter should display the following results for the two settings of the attenuator when applied for 12\ +\ 0.2\ s: .RT .LP .sp 1 \fIAttenuator = 0 dB\fR \fIAttenuator = 20 dB\fR Activity factor 100 to 0.5% 100 to 0.5% Active\(hylevel 0 \(+- 0.1 dBV \(em20 \(+- 0.1 dBV Long\(hyterm level 0 \(+- 0.1 dBV \(em20 \(+- 0.1 dBV .sp 2P .LP 8.3 \fIWhite noise\fR .sp 1P .RT .sp 1P .LP 8.3.1 \fIWithout filter\fR .sp 9p .RT .PP With the meter having no filter in circuit and the white noise source calibrated to be 0\ dBV, the meter should display the following results for the two settings of the attenuator when applied for 12\ +\ 0.2\ s: .RT .LP .sp 1 \fIAttenuator = 0 dB\fR \fIAttenuator = 20 dB\fR Activity factor 100 to 0.5% 100 to 0.5% Active\(hylevel 0 \(+- 0.5 dBV \(em20 \(+- 0.5 dBV Long\(hyterm level 0 \(+- 0.5 dBV \(em20 \(+- 0.5 dBV .bp .sp 1P .LP 8.3.2 \fIWith filter\fR .sp 9p .RT .PP With the meter having the filter in circuit and the white noise source calibrated to be 0\ dBV, the meter should display the following results for the two settings of the attenuator when applied for 12\ +\ 0.2\ s: .RT .LP .sp 1 \fIAttenuator = 0 dB\fR \fIAttenuator = 20 dB\fR Activity factor 100 to 0.5% 100 to 0.5% Active\(hylevel \(em6.9 \(+- 0.5 dBV \(em26.9 \(+- 0.5 dBV Long\(hyterm level \(em6.9 \(+- 0.5 dBV \(em26.9 \(+- 0.5 dBV .sp 1P .LP 8.3.3 \fIPulsed noise\fR .sp 9p .RT .PP With the meter having no filter in circuit and the white noise source pulsed at 3\ s \*QON\*U and 3\ s \*QOFF\*U and calibrated to be 0\ dBV when \*QON\*U, the meter should display the following results for the two settings of the attenuator when applied for 12\ +\ 0.2\ s: .RT .LP .sp 1 \fIAttenuator = 0 dB\fR \fIAttenuator = 20 dB\fR Factor activity 55 \(+- 1.5% 55 \(+- 1.5% Active\(hylevel 0 \(+- 1 dBV \(em20 \(+- 1 dBV Long\(hyterm level \(em2.7 \(+- 1 dBV \(em22.7 \(+- 1 dBV .PP \fINote\fR \ \(em\ It is possible that \(sc\ 8 could be revised to calibrate both method\ B and B\(hyequivalent meters when a speech\(hylike signal has been found suitable to perform this function. \v'1P' .ce 1000 ANNEX\ A .ce 0 .ce 1000 (to Recommendation P.56) .sp 9p .RT .ce 0 .ce 1000 \fBA method using a speech voltmeter complying\fR .sp 1P .RT .ce 0 .ce 1000 \fBwith method B in network conditions\fR .ce 0 .PP A speech voltmeter complying with method B is not suitable in its present form for speech measurements (see, for example, Recommendation\ G.223) on real connections since the meter is unable to distinquish between speech coming from one or the other end of the connection. .sp 1P .RT .PP However, if the meter is connected to a 4\(hywire point in a connection of the type 2\(hy4\(hy2\ wire, then measurements may be made using an operator monitoring the beginning and the end of the conversation. The operator can perform this function using earphones (provided the subscriber's permission has been obtained) or by an auxiliary meter (for example conforming to P.52). The circuit arrangement is shown in Figure\ A\(hy1/P.56. .PP The operator monitors the conversation, using the auxiliary meter or earphones, and then by means of a start/stop button can measure the beginning and end of the relevant conversation. .bp .RT .LP .rs .sp 18P .ad r \fBFigure A\(hy1/P.56, p.\fR .sp 1P .RT .ad b .RT .sp 2P .LP \fBReferences\fR .sp 1P .RT .LP [1] RICHARDS (D. | .): Telecommunication by speech, \(sc\ 2.1.3.2, pp. 56\(hy69, \fIButterworks\fR , London,\ 1973. .LP [2] ITU \(em \fIList of Definitions of Essential Telecommunication Terms\fR , Definition\ 14.16, Second impression, Geneva,\ 1961. .LP [3] ITU \(em \fIList of Definitions of Essential Telecommunication Terms\fR , Definitions\ 12.34, 12.35, 12.36, Second impression, Geneva,\ 1961. .LP [4] BERRY (R. | .): Speech\(hyvolume measurements on telephone circuits, \fIProc.\ IEE\fR , Vol.\ 118, No.\ 2, pp.\ 335\(hy338, February\ 1971. .sp 2P .LP \fBBibliography\fR .sp 1P .RT .LP BRADY (P. | .): Equivalent Peak Level: a thre shold\(hyindependent speech level measure, \fIJournal of the Acoustical Society of America\fR , Vol.\ 44, pp.\ 695\(hy699, 1968. .LP CARSON (R.): A digital Speech Voltmeter \(em the S V6, \fIBritish Telecommunications Engineering\fR , Vol.\ 3, Part\ 1, pp.\ 23\(hy30, April\ 1984. .LP CCITT \(em Contribution COM XII\(hyNo. 43 \fIA method for sp\fR \fIeech\(hylevel\fR \fImeasurements\fR \fIusing IEC\(hyinterface bus and calculation\fR (Norway), Geneva,\ 1982. .LP .rs .sp 12P .ad r Blanc .ad b .RT .LP .bp .LP \fBMONTAGE: PAGE 122 = BLANCHE\fR .sp 1P .RT .LP .bp