Program

Annual IEEE Workshop on Fault-Tolerant Parallel and Distributed Systems, April 19, 2002

Held in conjunction with the

International Parallel and Distributed Processing Symposium

April 15-19, 2002
Marriott Marina, Ft. Lauderdale, FL, USA

Theme

Increasingly large parallel computing systems provide unique challenges to the researchers in dependable computing, especially because of the high failure rates intrinsic to these systems. While commercial and scientific companies share the need for massive throughput and low latency, dependability of service is also a concern. In addition to providing uninterrupted service, commercial systems must be free from data corruption. Achieving dependability in highly scalable parallel and distributed systems poses a considerable challenge. As the number of components increases, so does the probability of a component failure. Therefore, improved fault-tolerant technology is required for high scalable parallel and distributed systems.

The goal of this workshop is to provide a forum for researchers and practitioners to discuss issues related to these issues of fault-tolerant parallel and distributed systems. All aspects of design, theory and realization of parallel and distributed systems are of interest.

Topics of interest include, but are not limited to:

- Dependable distributed systems

- Fault-tolerant protocols for distributed systems

- Fault tolerance in clusters of workstations and PCs

- Fault-tolerant interconnection networks

- Fault-tolerant parallel programming

- Using COTS for designing dependable network based computing systems

- Portable checkpointing for heterogeneous distributed systems

- Fault injection in parallel and distributed systems

- Dependability evaluation of fault-tolerant parallel and distributed systems

- Dependable Biocomputing

- Dependable Quantum Computing

Program Committee:

Chair:
D. Avresky, Northeastern University, Boston, USA

J. Bruck, Caltech, USA

B. Ciciani, University of Roma, Italy

D. Culler, University of California Berkley, USA

C. Fetzer, AT&T Labs - Research, USA

J. Hayes, University of Michigan, Ann Arbor, USA

H. Hellwagner, University of Klagenfurt, Austria

T. Katayama, AIST, Japan

H. Levendel, Motorola, USA

Q. Li, Santa Clara University, USA

F. Lombardi, Northeastern University, USA

E. Maehle, University of Lubeck, Germany

P. Mehra, Compaq Tandem Labs, USA

C. Petitpierre, EPFL, Switzerland

D. Powell, LAAS-CNRS, France

M. Raynal, IRISA, France

J. Sifakis, UMR Verimag, France

The workshop is sponsored by the IEEE Computer Society Techical Committee on Parallel Processing. For further information, please contact the Program Chair:

D. R. Avresky

e-mail: avresky@ece.neu.edu

Program

Opening: 8.15am-8.30am

Session: Invited talks and speakers

1. "Availability Requirement fro a Fault Management Server in High-Availability Communication Systems"
8.30am-9.15am
Invited talk : H. Sun, J. Han and H. Levendel

2."From Experimental Assessment of Fault-Tolerant Systems to Dependability Benchmarking"
9.15am- 10.00am
Invited speaker : J. Arlat, CNRS, LAAS, Toulouse, France

Coffee break: 10.00am -10.30am

3. "Fault-Tolerant Quantum Computers"
10.30am - 11.15am
Invited speaker : John P. Hayes, Advanced Computer Architecture Laboratory, Dept. of EE & CS, University of Michigan, Ann Arbor, USA

Session 1

4. Fault Tolerance in the Network Storage Stack
11.15am -11.45am
S. Atchley, S. Soltesz, J. Plank, M. Beck and T. Moore, Univ. of Tennessee, TN, USA

5. Protocols for Fault-tolerant Distributed-Shared-Memory on the SOME-Bus Multiprocessor Architecture
11.45am-12.15pm
D. Hecht and C. Katsinis, Drexel University, USA

Lunch Break 12.15pm-2.00pm

Session 2

6. Optimizing the Reliability of Component-Based n-Version Approaches
2.00pm -2.30 pm
K.E. Grosspietch, FIAIS, Germany

7. A Generalized Analytical Performance Model of Distributed Systems That Perform N Tasks Using P Faul-Prone Processors
2.30pm - 3.00pm
G. Weerasighe, I. Antonious and L. Lipsky, UCONN, Connecticut, USA

8 . Hardware-Software Co-Reliability in Field Reconfigurable Multi-Processor-Memory Systems
3.00pm-3.30pm
M. Choi, N. Park and F Lombardi, NEU, Boston, USA

Coffee break: 3.30pm -4.00pm

Session 3

9. A distributed Primary-Segmented Backup Scheme for Dependable Real-Time Communication in Multihop Networks
4.00pm-4.30pm
G. Ranjith, G.P. Krishna and C. S. Murthy, IIT, Madras, India

10. The Viator Approach : About Four Principals of Autopoietic Growth on the Way to future Hyperactive Network Architectures
4.30pm-5.00pm
P. Simeonov, TUI, Illmenau, Germany

11. Checking a Non-Byzantine FT Scheme Against Byzantine Faults
5.00pm -5.30pm
P. Sobe MUL, ICE, Lubeck, Germany

12. Survivable Computer Networks in the Presence of Partitioning
5.30pm-6.00pm
Y. Varoglu and D.R. Avresky Network Computing Lab, ECE, NEU, Boston, USA

Closing remarks: 6.00pm-6.15pm

Program Annual IEEE Workshop on Fault-Tolerant Parallel and Distributed Systems, April 19, 2002