Institute for Systems Research Technical Reports

Permanent URI for this collectionhttp://hdl.handle.net/1903/4376

This archive contains a collection of reports generated by the faculty and students of the Institute for Systems Research (ISR), a permanent, interdisciplinary research unit in the A. James Clark School of Engineering at the University of Maryland. ISR-based projects are conducted through partnerships with industry and government, bringing together faculty and students from multiple academic departments and colleges across the university.

Browse

Search Results

Now showing 1 - 10 of 141
  • Thumbnail Image
    Item
    Hierarchical Coding of High Data Rate Video for Asynchronous Transfer Mode Networks
    (1994) Sivarajan, R.; Narayan, P.; ISR; CSHCN
    High data rate video is an integral part of high-quality multimedia for broadband networks. Owing to the high rate, compression of video information is required for an efficient use of network bandwidth. A hierarchical DCT-based video codec is examined that prioritizes and compresses high data rate video for transmission over ATM networks.

    The video codec utilizes intraframe coding by independently processing each frame of the video sequence. The lossless compression part consists of run length coding to exploit zero values in the high frequency DCT coefficients and variable length coding (VLC) to further reduce the bit rate. Three compression schemes are examined: adaptive Huffman, arithmetic coding, and Lempel-Ziv-Welch coding. For the model-based compression algorithms, we study several models to characterize the input bit stream to the VLC: memoryless, and Markov with either fixed orders or orders determined by an order estimator. For the three VLCs in the codec, the best performance was obtained from a combination of a memoryless Huffman codec and two first-order Huffman codecs. Many of the models incorporating memory, performed poorly due to the small size of the input files.

    Due to the VLC, the output rate of the system is variable; however, since intraframe coding is utilized, rate variations are small. In order to fully utilize available bandwidth, we examine the rate control problem of converting the codec from a variable rate system to a fixed rate system. The rate control problem is formulated as one of constrained minimization, and analyzed for optimal solutions. Algorithms are presented for optimal rate control.

  • Thumbnail Image
    Item
    On Stochastic Approximations Driven by Sample Averages: Convergence Results via the ODE Method
    (1994) Bartusek, John D.; Makowski, Armand M.; ISR; CSHCN
    We consider a class of projected stochastic approximation algorithms drive by sample averages. These algorithms arise naturally in problems of on-line parametric optimization for discrete event dynamical systems., e.g., queueing systems and Petri net models. We develop a general framework for investigating the a.s. convergence of the iterate sequence, and show how such convergence results can be obtained by means of the ordinary differential equation (ODE) method under a condition of exponential convergence. We relate this condition of exponential convergence to certain Large Deviations upper bounds which are uniform in both the parameter q and the initial condition x. To demonstrate the applicability of the results, we specialize them to two specific classes of state processes, namely sequences of i.i.d. random variables and finite state time-homogeneous Markov chains. In both cases, we identify simple (and checkable) conditions that ensure the validity of a uniform Large Deviations upper bound.
  • Thumbnail Image
    Item
    Adaptive Wavelet Based Image Coding
    (1994) Jafarkhani, Hamid; Farvardin, N.; ISR
    New schemes for classification of images are suggested. An application of these methods in adaptive DCT of images is considered. A new method to combine classification and bit allocation is introduces. Also, an efficient wavelet based image coding system using classification for adaptation is developed. Finally, practical considerations concerning overhead, complexity and performance are discussed.
  • Thumbnail Image
    Item
    Comparison of Responses in the Anterior and Primary Auditory Fields of the Ferret Cortex
    (1994) Kowalski, Nina; Versnel, H.; Shamma, . S.A.; ISR
    Characteristics of an anterior auditory field (AAF) in the ferret auditory cortex are described in terms of its electrophysiological responses to tonal stimuli and compared to those of primary auditory cortex (AI). Units in both areas were presented with the same stimulus paradigms and their responses analyzed in the same manner so that a direct comparison of responses was possible. The AAF is located dorsal and rostral to AI on the ectosylvian gyrus and extends into the suprasylvian sulcus rostral to AI. The tonotopicity is organized with high frequencies at the top of the sulcus bordering the high- frequency area of AI, then reversing with lower BFs extending down into the sulcus. AAF contained single units that responded to a frequency range of 0.2 - 30 kHz. Stimuli consisted of single-tone bursts, two-tone bursts and frequency modulated (FM) stimuli swept in both directions at various rates. Best frequency (BF) range, rate-level functions at BF, directional sensitivity, and variation in asymmetries of response areas were all comparable characteristics between AAF and AI. The characteristics that were different between the two cortical areas were: latency to tone onset, excitatory bandwidth 20 dB above threshold (BW20) and preferred FM rate, as parameterized with the centroid (a weighted average of spike counts). The mean latency of AAF units was shorter than in AI (16.5 ms AAF, 19.4 ms AI). BW20 measurements in AAF were typically twice as large as those found in AI (2.5 oct AAF, 1.3 oct AI). There was a wider range of centroids found in AI than in AAF, and the relationships between BW20 and centroid were different for AAF and AI. The relationship between centroid and BW20 was examined to see if wider bandwidths were a factor in a unit's ability to detect fast sweeps. There was significant (P<0.05) linear correlation in AAF but not in AI. In both fields, the variance of the centroid population decreased with increasing BW20. BW20 decreased as BF increased for units in both auditory fields.
  • Thumbnail Image
    Item
    Detection of Binary Sources Over Discrete Channels with Additive Markov Noise
    (1994) Alajaji, Fady; Phamdo, N.; Farvardin, Nariman; Fuja, Tom E.; ISR
    We consider the problem of directly transmitting a binary source with an inherent redundancy over a binary channel with additive stationary ergodic Markov noise. Out objective is to design an optimum receiver which fully utilizes the source redundancy in order to combat the channel noise.

    We investigate the problem of detecting a binary iid non-uniform source transmitted across the Markov channel. Two maximum a posteriori (MAP) formulations are considered: a sequence MAP detection and an instantaneous MAP detection. The two MAP detection problems are implemented using a modified version of the Viterbi decoding algorithm and a recursive algorithm. Necessary and sufficient conditions under which the sequence MAP detector becomes useless as well as simulation results are presented. A comparison between the performance of the proposed system with that of a (substantially more complex) traditional tandem source-channel coding scheme exhibits a better performance for the proposed scheme at relatively high channel bit error rates.

    The same detection problem is then analyzed for the case of a binary symmetric Markov source. Analytical and simulation results show the existence of a "mismatch" between the source and the channel. This mismatch is reduced by the use of a rate-one convolutional encoder. Finally, the detection problem is generalized for the case of a binary non-symmetric Markov source.

  • Thumbnail Image
    Item
    Ripple Analysis in Ferret Primary Auditory Cortex. II. Topographic and Columnar Distribution of Ripple Response Parameters
    (1994) Versnel, H.; Kowalski, Nina; Shamma, S.A.; ISR
    We examined the columnar and topographic distribution of response parameters using spectral ripples and tonal stimuli in the primary auditory cortex (AI) of the barbiturate-anesthetized ferret. The ripple stimuli consisted of broadband stimuli (1-20 kHz) with sinusoidally modulated spectral envelopes.

    Responses to ripples were parametrized in terms of characteristic ripple Wo(ripple frequency where the magnitude of the ripple transfer function is maximal, i.e., where the cell responds best) and characteristic phase Fo (intercept of the phase of the ripple transfer function, i.e., phase where the cell responds best). The response area (measured with tones) was parametrized in terms of its excitatory bandwidth at 20 dB above threshold (BW20), and its asymmetry as reflected by the directional sensitivity index (C) to frequency-modulated (FM) tones. Columnar organization for the above four parameters was investigated in 66 single units from 23 penetrations. It was confirmed for Wo, Fo, and the C index, but it appeared to be ambiguous for BW20. The response parameters measured from multiunit recordings corresponded closely to those obtained from single units in the same cluster. In a local region, most cells exhibited closely matched, response fields (RFs, inverse Fourier transformed ripple transfer function) and response areas (measured with two-tone stimuli), and had correspondingly similar response parameters to ripples and tones. The topographic distribution of the response parameters across the surface of AI was studied with multiunit recordings in four animals. In all maps, systematic patterns or clustering of, response parameters could be discerned along the isofrequency planes.

    The distribution of the characteristic ripple Wo exhibited two trends. First, along the isofrequency planes, it was largest near the center of AI, gradually decreasing towards the edges of the field where often a secondary maximum was found.

    The second trend occurred along the tonotopic axis where the maximum Wo found in an isofrequency range increases with increasing BF. The tonal bandwidth BW20, which was inversely correlated with Wo, exhibited a similar topographic distribution along the tonotopic axis and the isofrequency planes. The distribution of the characteristic ripple phase, Fo which reflects the asymmetry in the response field, showed a systematic order along the isofrequency axis. At the center of AI symmetric responses (Fo 0) predominated. Towards the edges, the RFs became more asymmetric with Fo < 0 caudally, and Fo > 0 rostrally. The asymmetric response types tended to cluster along repeated bands that paralleled the tonotopic axis. The FM directional sensitivity (C index, reflecting asymmetry of tonal response areas) tends to have similar trends along the isofrequency axis as Fo.

  • Thumbnail Image
    Item
    Ripple Analysis in Ferret Primary Auditory Cortex. I. Response Characteristics of Single Units to Sinusoidally Rippled Spectra
    (1994) Shamma, Shihab A.; Versnel, Huib; Kowalski, Nina; ISR
    We compared the response properties of single units to tones and sinusoidally, rippled spectral stimuli in the primary auditory cortex (AI) of the, barbiturate-anesthetized ferret. Using two- tone stimuli, we determined the response area of each cell and parameterized it in terms of best frequency (BF), the bandwidth of the excitatory responses at 20dB above threshold (BW20), and an asymmetry index measuring the balance of inhibition and excitation around the BF.

    Using frequency-modulated (FM) tones, we also determined a directional sensitivity index for the cell. Using broadband stimuli (1-20 kHz) with sinusoidally modulated spectral envelopes (ripples), we measured the response magnitude of each cell as a function of ripple frequency (W) and ripple phase (F), and then reconstructed the magnitude and phase of a ripple transfer function. Most cells (approximately 90 %) were tuned to a specific ripple frequency, denoted as a characteristic ripple frequency (Wo) . Most cells also exhibited a linear ripple phase as a function of W. The intercept of the phase function defined as the characteristic ripple phase (Fo), and is interpreted as the best ripple phase to drive the cell; the slope of the phase function reflects the location of the response area of the cell along the tonotopic axis. By inverse Fourier transforming the transfer function, we obtain the response field (RF) of the cell, an analogue of the response area measured with tonal stimuli. Like the response area, the RF was parametrized by the following measures: BFRF, which is the location of the maximum of the RF along the tonotopic axis, Wo , which is roughly inversely proportional to the width of the RF, and Fo which reflects the asymmetry of the RF. In the ferret Wo , ranges from 0.2 to 3 cycles/octave, with the average of the distribution around 1.0. Fo , ranges over the full cycle in a Gaussian-like distribution around 0o. For a subgroup of cells the sinusoidal modulations of the spectrum were presented both on linear and logarithmic amplitude scale. The responses were not notably different. The effect of the variations of amplitude of the sinusoidal modulation was studied. The largest effect was observed for the magnitude transfer function, which increased with amplitude and then saturated. The parameters Wo and Fo did not vary significantly with ripple amplitude. Typically, cells respond best to intermediate sound levels of the ripple stimulus, i.e., the magnitude transfer function shows a nonmonotonic dependence on overall stimulus level. The phase function and Wo do not depend much on level. The effects of a few nonlinearities on the responses are examined briefly. Effects of nonlinearities as threshold and saturation of the neural firing rates are examined. It is found that (non)monoticity of the rate level function of a cell could be distinguished from its ripple response characteristics. The RF of a cell closely corresponds to the response area measured with tone stimuli. Regression analysis shows that: (A) BFRF is, very similar to the tonal BF; (B) Wo is inversely correlated to the excitatory bandwidth; (C) Fo is correlated to the asymmetry of the response area.

    Responses to rippled spectra in AI resemble closely the response properties to sinusoidal gratings in the primary visual cortex (VI). This provides a unified framework within which to interpret the functional organization of both corticies. Basic differences between the two systems, however, are also evident as the lack in AI of a substantial simple/complex distinction in the responses.

    It is hypothesized that AI effectively analyzes an arbitrary input spectrum into a weighted sum of ripple components of different ripple frequencies and phases. This analysis is performed locally around each BF by a two-dimensional bank of filters tuned to different Wo and Fo values. Psychophysical support and implications of this hypothesis are also discussed in relation to the perception of timbre and other auditory tasks.

  • Thumbnail Image
    Item
    Representation of Spectral Profiles in the Auditory System, II: Detection of Spectral Peak Shape Changes
    (1994) Vranic-Sowers, S.; Shamma, S.A.; ISR
    Based on the ripple analysis model outlined in Part I of this paper, predictions are made for the detection of shape changes in spectral peak profiles. Peak shape is uniquely described in terms of two parameters: bandwidth factor (BWF) which reflects the tuning or sharpness of a peak, and a symmetry factor (SF) which roughly measures the local evenness or oddness of a peak. Using profile analysis methods, thresholds to changes in these parameters (defined as dBWF/BWF and dSF) are measured together with the effects of several manipulations such as using different peak levels, varying spectral component densities, and randomizing the frequencies of the peaks. The new ripple analysis model accounts well for the measured thresholds. Predictions of the three previously published models for the same profiles are also evaluated and discussed.
  • Thumbnail Image
    Item
    Representation of Spectral Profiles in the Auditory System, I: A Ripple Analysis Model
    (1994) Vranic-Sowers, S.; Shamma, S.A.; ISR
    A model of profile analysis is proposed in which a spectral profile is assumed to be represented by a weighted sum of sinusoidally modulated spectra (ripples). The analysis is performed by a bank of bandpass filters, each tuned to a particular ripple frequency and ripple phase. The parameters of the model are estimated using data from ripple detection experiments in [Green} 1986; Hillier 1991]. Detection thresholds are computed from the filter outputs and compared with perceptual thresholds, for profile detection experiments with step, single component increment, and the alternating profiles. The model accounts well for the measured thresholds in these experiments. Physiological and psychophysical evidences from the auditory and visual systems in support of this type of a model are also reviewed. The implications of this model for pitch and timbre perception are briefly discussed.
  • Thumbnail Image
    Item
    Speech Coding over Noisy Channels
    (1994) Farvardin, Nariman; ISR
    This chapter contains a discussion of quantization over noisy channels. The effects of channel noise on the performance of vector quantizers are discussed and algorithms for the design of noisy-channel vector quantizers are presented. It is argued that in certain practical situations where delay and complexity place hard limits on system parameters, a combined source-channel coding approach might be preferable to the more traditional tandem source-channel coding. Examples of full-searched, multi- stage and finite-state vector quantization designed for a noisy channel are provided for coding of speech line spectrum pair parameters.