Performance optimizations, implementation, and verification
of the SGI Challenge multiprocessor
Mike Galles and Eric Williams
Silicon Graphics Computer Systems
Abstract
This paper presents the architecture, implementation,
and performance results for the SGI Challenge symmetric
multiprocessor system. Novel aspects of the architecture will
be highlighted, as well as key design trade-offs targeted at
increasing performance and reducing complexity. Multiprocessor
design verification techniques and their impact will also be
presented. The SGI Challenge system architecture provides a
high-bandwidth, low-latency cache-coherent interconnect for several
high performance processors, I/O busses, and a scalable memory system.
Hardware cache coherence mechanisms maintain a consistent view of shared
memory for all processors, with no software overhead and minimal impact
on processor performance. HDL simulation with random, self checking vector
generation and a lightweight operating system on full processor models
contributed to a concept to customer shipment cycle of 26 months.
Introduction
The Silicon Graphics Challenge line of symmetric multiprocessing
computers is designed to provide engineers and scientists with a
powerful set of computational, de sign, and visualization tools at
aggressive price performance levels. Scalability and balance across
processor, memory, and I/O subsystems also makes these computers
attractive to a wide range of high performance computing applications,
from the creation of special effects for the entertainment industry to
database management in the commercial sector. This paper begins with
an overview of the system parameters and configurations, then moves
directly into presentation of the coherence protocol, latency reduction
techniques, and bus efficiency mechanisms, with emphasis on performance
impact. The I/O bus and subsystem is later described, followed up by a
discussion of effective design verification techniques used to reduce the
overall product cycle time. Finally, performance results are presented and
summarized.
How to Get This Article
If you have a graphical Web browser (eg: Mosaic)
and a PostScript file viewer (eg: Ghostview, xpsview),
you can
download and view the file right now.
Or, you may download a compressed, .ps version of this
paper using anonymous ftp.
The file is
called challenge_paper.ps.Z and is located in the
~ftp/sgi/whitepaper directory of
our ftp server ftp.sgi.com.
Once you have the compressed file, you must uncompress
it before it is viewable:
uncompress filename.ps.Z
This will make a larger file without the ".Z" which
you can then display.
If you need help using anonymous
ftp here is a sample session.