Biologically Inspired Processing of Image and Sound Data
A report describes progress in a research program oriented toward developing artificial-intelligence capabilities for processing images and sounds in changing environments. This research focuses on extending the state of the art in mid-level processing of visual and auditory signals, following a biologically inspired approach. Developments described in the report are summarized as follows:
- In psychophysical experiments, Kalman filters were shown to be useful for modeling hitherto-unexplained features of adaptation of the vision systems of human observers to changes in speed.
- There was a continuation of previous research on low-level features (and processing of the features) needed for detection of junctions (e.g., characterized by L-, T-, and Y-shaped intersections on objects), which are known to be critical for classification of objects.
- Another continuation of previous research included refinement of lowlevel features (and of processing the features) needed for classification of sounds in the presence and absence of noise. These features and the processing thereof are closely related to those of visual processing.
- There was a continuation of development of an integrated perceptual system that would combine low-level feature extraction, attentional mechanisms, and simple object recognition to control a robot arm engaged in a task.
This work was done by Bartlett W. Mel, Norberto M. Grzywacz, Laurent Itti, and Shri Narayanan of the University of Southern California for the Naval Research Laboratory.
NRL-0022
This Brief includes a Technical Support Package (TSP).

Biologically Inspired Processing of Image and Sound Data
(reference NRL-0022) is currently available for download from the TSP library.
Don't have an account?
Overview
The document is a progress report for a project titled "Next-Generation Image and Sound Processing Strategies: Exploiting the Biological Model," funded by the Office of Naval Research (ONR). The report covers the period from February 1 to April 30, 2007, and is authored by a team from the University of Southern California, including Principal Investigator Bartlett W. Mel and co-investigators Norberto M. Grzywacz, Laurent Itti, and Shri Narayanan.
The overarching goal of the project is to advance mid-level visual and auditory signal processing by utilizing biologically inspired methods. The report outlines two primary objectives: (1) to imitate biological sensory feature extraction methods, and (2) to apply these biologically inspired features to enhance attention on critical information within complex visual and auditory environments.
The report details progress in several key areas:
-
Nonlinear Center-Surround Interactions: New experimental measurements have been conducted to compare the responses of retinal ganglion cells to artificial stimuli versus natural images. This research aims to provide insights into the nonlinear processing capabilities of retinal cells.
-
Hierarchical Feature Learning Network: The team has made advancements in developing a biologically inspired hierarchical feature learning network. This includes new results related to learning and detecting junction features, which are essential for understanding visual scenes.
-
Visual Attentional Model: The project has further developed a visual attentional model that incorporates learned top-down influences to predict human attentional shifts. Experimental results are included to demonstrate the effectiveness of this model.
-
Auditory Processing: The report also discusses the adaptation of biologically inspired feature extraction and normalization operations from the visual attentional model to detect prominent syllables and words in speech, highlighting the interdisciplinary nature of the research.
The document emphasizes the importance of understanding the human visual system, particularly the retina, as it is the most effective image-understanding device known. The research aims to bridge the gap between artificial stimuli used in current models and the complexities of natural images, ultimately contributing to advancements in image processing technologies.
In summary, this progress report highlights significant strides in understanding and modeling sensory processing, with implications for both visual and auditory systems, aiming to enhance the technical state-of-the-art in these fields through a biologically inspired approach.
Top Stories
NewsRF & Microwave Electronics
Microvision Aquires Luminar, Plans Relationship Restoration, Multi-industry Push
INSIDERAerospace
A Next Generation Helmet System for Navy Pilots
INSIDERDesign
New Raytheon and Lockheed Martin Agreements Expand Missile Defense Production
INSIDERMaterials
How Airbus is Using w-DED to 3D Print Larger Titanium Airplane Parts
NewsPower
Ford Announces 48-Volt Architecture for Future Electric Truck
ArticlesAR/AI
Webcasts
Electronics & Computers
Cooling a New Generation of Aerospace and Defense Embedded...
Automotive
Battery Abuse Testing: Pushing to Failure
Power
A FREE Two-Day Event Dedicated to Connected Mobility
Unmanned Systems
Quiet, Please: NVH Improvement Opportunities in the Early Design Cycle
Automotive
Advantages of Smart Power Distribution Unit Design for Automotive &...
Energy
Sesame Solar's Nanogrid Tech Promises Major Gains in Drone Endurance



