By Topic

IEEE Transactions on Acoustics, Speech, and Signal Processing

Issue 1 • February 1975

Filter Results

Displaying Results 1 - 25 of 28
  • [Front cover and table of contents]

    Publication Year: 1975, Page(s): 0
    Request permission for commercial reuse | PDF file iconPDF (175 KB)
    Freely Available from IEEE
  • Guest editorial

    Publication Year: 1975, Page(s): 1
    Request permission for commercial reuse | PDF file iconPDF (241 KB)
    Freely Available from IEEE
  • Editorial

    Publication Year: 1975, Page(s): 149
    Request permission for commercial reuse | PDF file iconPDF (82 KB)
    Freely Available from IEEE
  • [Back cover]

    Publication Year: 1975, Page(s): c4
    Request permission for commercial reuse | PDF file iconPDF (1351 KB)
    Freely Available from IEEE
  • An objective parallel evaluator of segmentation/Classification performance for multiple systems

    Publication Year: 1975, Page(s):92 - 99
    Cited by:  Papers (9)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (816 KB)

    A major difficulty in the development of methodologies for segmentation and classification in automatic recognition of continuous speech is the determination of objective, reliable performance statistics. Compounding this difficulty is the large amount of data necessary to make reasonably accurate performance estimates. The system to be described provides for concurrent objective evaluation of up ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A description of a parametrically controlled modular structure for speech processing

    Publication Year: 1975, Page(s):87 - 91
    Cited by:  Papers (12)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (600 KB)

    A system, the modular acoustic processor (MAP) consisting of two major components, has been designed for work in speech recognition. A versatile spectral analysis system, the parametrically controlled analyzer (PCA), serves as input to an hierarchically operated string transcriber (HOST). In the design of this system, controllability and modularity for developmental extensibility were primary conc... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Pitch detection by data reduction

    Publication Year: 1975, Page(s):72 - 79
    Cited by:  Papers (22)  |  Patents (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (1016 KB)

    This paper presents an algorithm that determines the fundamental frequency of sampled speech by segmenting the signal into pitch periods. Segmentation is achieved by identifying those samples of the waveform corresponding to the beginning of each pitch period. The segmentation is accomplished in three phases. First, using zero crossing and energy measurements, a data structure is constructed from ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Where the phonemes are: Dealing with ambiguity in acoustic-phonetic recognition

    Publication Year: 1975, Page(s):50 - 53
    Cited by:  Papers (13)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (536 KB)

    Errors in acoustic-phonetic recognition occur not only because of the limited scope of the recognition algorithm, but also because certain ambiguities are inherent in analyzing the speech signal. Examples of such ambiguities in segmentation and labeling (feature extraction) are given. In order to allow for these phenomena and to deal effectively with acoustic recognition errors, we have devised a ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Minimum prediction residual principle applied to speech recognition

    Publication Year: 1975, Page(s):67 - 72
    Cited by:  Papers (681)  |  Patents (32)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (720 KB)

    A computer system is described in which isolated words, spoken by a designated talker, are recognized through calculation of a minimum prediction residual. A reference pattern for each word to be recognized is stored as a time pattern of linear prediction coefficients (LPC). The total log prediction residual of an input signal is minimized by optimally registering the reference LPC onto the input ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • The use of syntax in a speech understanding system

    Publication Year: 1975, Page(s):112 - 117
    Cited by:  Papers (13)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (832 KB)

    When a person hears an English sentence, he uses many sources of information to assign structure and meaning to the utterance. One of these sources, syntax, is concerned with the goal of producing a consistent, meaningful, grammatical structure for the sentence. The exact type of structure produced is not as crucial as the process of building that structure because the speech environment has inher... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A 16-bit A-D-A conversion system for high-fidelity audio research

    Publication Year: 1975, Page(s):146 - 149
    Cited by:  Papers (10)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (360 KB)

    An A-D and D-A converter system with exceptionally wide dynamic range and low distortion is discussed. The system utilizes 12-bit floating-point approximation at conversion instead of true 16-bit conversion. A deglitching track and hold is illustrated which avoids heterodyning between signal components and the sample clock. Rate-limiting during the transition from hold to track is discussed. View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • The role of phonological rules in speech understanding research

    Publication Year: 1975, Page(s):104 - 112
    Cited by:  Papers (46)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (3576 KB)

    This paper presents some phonological rules which describe systematic pronunciation variation occurring in natural continuous speech. It is argued that a speech understanding system must account for such variation by incorporating phonological rules, either implicitly or explicitly, into the system. Spectrographic evidence for the phonological phenomena described by the rules is included. View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A system for acoustic-phonetic analysis of continuous speech

    Publication Year: 1975, Page(s):54 - 67
    Cited by:  Papers (39)  |  Patents (4)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (1728 KB)

    A system for acoustic-phonetic analysis of continuous speech is being developed to serve as part of an automatic speech understanding system. The acoustic system accepts the speech wave-form as an input and produces as output a string of phoneme-like units referred to as acoustic phonetic elements (APEL's). This paper should be considered as a progress report, since the system is still under activ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • The CASPERS linguistic analysis system

    Publication Year: 1975, Page(s):118 - 123
    Cited by:  Papers (8)  |  Patents (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (504 KB)

    CASPERS (Computer-Automated Speech Perception System) is a user-modifiable facility for translating strings of acoustic Symbols into sentences. Three distinctive aspects of the system's design will be discussed: the dynamic application of acoustic-phonological rules across word boundaries, an acoustic-unit splitting-and merging strategy for treating the dictionary matching problem, and an extensiv... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A prosodically guided speech understanding strategy

    Publication Year: 1975, Page(s):30 - 38
    Cited by:  Papers (16)  |  Patents (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (1088 KB)

    Our strategy for computer understanding of speech uses prosodic features to break up continuous speech into sentences and phrases and locate stressed syllables in those phrases. The most reliable phonetic data are obtained by performing a distinguishing features analysis within the stressed syllables and by locating sibilants and other robust information in unstressed syllables. The numbers and lo... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Syllable as a unit of speech recognition

    Publication Year: 1975, Page(s):82 - 87
    Cited by:  Papers (50)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (864 KB)

    Basic problems involved in automatic recognition of continuous speech are discussed with reference to the recently developed template matching technique using dynamic programming. Irregularities in phonetic manifestations of phonemes are discussed and it is argued that the syllable, phonologically redefined, will serve as the effective minimal unit in the time domain. English syllable structures a... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Discrete-time systems

    Publication Year: 1975, Page(s): 150
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (192 KB)

    First Page of the Article
    View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Control concepts in a speech understanding system

    Publication Year: 1975, Page(s):136 - 140
    Cited by:  Papers (3)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (616 KB)

    Automatic speech understanding must accommodate the fact that an entirely accurate and precise acoustic transcription of speech is unattainable. By applying knowledge about the phonology, syntax, and semantics of a language and the constraints imposed by a task domain, much of the ambiguity in an attainable transcription can be resolved. This paper deals with how to control the application of such... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Computer exploration of fast-speech rules

    Publication Year: 1975, Page(s):100 - 103
    Cited by:  Papers (3)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (544 KB)

    A set of fast-speech rules has been tested on the computer using the phonological grammar tester (PGT) program of Friedman and Morin. We examine the types of difficulties encountered in the rules and discuss ways in which the program can be made more useful for studying fast-speech rules. View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Motivation and overview of SPEECHLIS: An experimental prototype for speech understanding research

    Publication Year: 1975, Page(s):2 - 10
    Cited by:  Papers (9)  |  Patents (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (1200 KB)

    SPEECHLIS is a research prototype of an intelligent speech understanding system which makes use of advanced techniques of artificial intelligence, natural language processing, and acoustical and phonological analysis in an integrated way to determine the interpretation of continuous speech utterances. This paper describes a number of the characteristics of the speech understanding task which influ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • The DRAGON system--An overview

    Publication Year: 1975, Page(s):24 - 29
    Cited by:  Papers (117)  |  Patents (36)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (736 KB)

    This paper briefly describes the major features of the DRAGON speech understanding system. DRAGON makes systematic use of a general abstract model to represent each of the knowledge sources necessary for automatic recognition of continuous speech. The model--that of a probabilistic function of a Markov process--is very flexible and leads to features which allow DRAGON to function despite high erro... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Experiments with a tree-search method for converting noisy phonetic representation into standard orthography

    Publication Year: 1975, Page(s):129 - 135
    Cited by:  Papers (4)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (712 KB)

    A tree-search algorithm was utilized in decoding the noisy phonetic output of an acoustic processor (AP) for purposes of automatic recognition of continuous speech. A 250-word lexicon and a finite-state grammar specify the tree to be searched. The search is performed in a best-first manner where partial paths having the highest similarity measure are extended first. The similarity measure of a wor... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Semantic support for a speech understanding system

    Publication Year: 1975, Page(s):124 - 129
    Cited by:  Papers (4)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (768 KB)

    One function of the Semantics component of SPEECH-LIS, the Bolt Beranek and Newman (BBN) Speech Understanding System, is to gather evidence for hypotheses it has made regarding the content of an utterance, as well as to evaluate the hypotheses made by other components. Another is to produce a representation of the utterance's meaning. Specifically, this involves forming consistent, meaningful coll... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • An introduction to discrete systems

    Publication Year: 1975, Page(s): 150
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (192 KB)

    First Page of the Article
    View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Organization of the Hearsay II speech understanding system

    Publication Year: 1975, Page(s):11 - 24
    Cited by:  Papers (110)  |  Patents (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (2024 KB)

    Hearsay II (HSII) is a system currently under development at Carnegie-Mellon University to study the connected speech understanding problem. It is similar to Hearsay I (HSI) in that it is based on the hypothesize-and-test paradigm, using cooperating independent knowledge sources communicating with each other through a global data structure (blackboard). It differs in the sense that many of the lim... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.

Aims & Scope

This Transactions ceased production in 1990. The current retitled publication is IEEE Transactions on Signal Processing.

Full Aims & Scope