Distributed System Fundamentals

What Is a Distributed System?

A distributed system can be defined in several ways:

Tanenbaum and van Steen: “A collection of independent computers that appears to its users as a single coherent system”
Coulouris, Dollimore and Kindberg: “One in which hardware or software components located at networked computers communicate and coordinate their actions only by passing messages”
Lamport: “One that stops you getting work done when a machine you’ve never even heard of crashes”

Geographic Distribution: Resources and users are naturally distributed
- Example: Banking services accessible from different locations while data is centrally stored
Fault Tolerance: Problems rarely affect multiple locations simultaneously
- Multiple database servers in different rooms provide better reliability
Performance and Scalability: Combining resources for enhanced capabilities
- High Performance Computing, replicated web servers, etc.

Eight classic assumptions that often lead to problematic distributed systems designs (identified at Sun Microsystems):

System Function: The intended purpose (features and capabilities)
System Behavior: How the system performs its functions
Quality Attributes: Core qualities determining success:
- Performance
- Cost
- Security
- Dependability

Distributed systems introduce complexity in: