Sukumar Ghosh
The deployment of distributed systems in real applications has significantly increased. With the growth of embedded systems, the man-to-processor ratio is decreasing at an alarming rate. Large distributed systems view failures and perturbations as events and not catastrophic exceptions. Due to the dwindling man-to-processor ratio, it is not always feasible to ask for external intervention every time a failure or a perturbation occurs: future systems should be able to recover on their own. There are different paradigms addressing such issues: These are known as self-stabilization, self-healing, self-reconfiguration, autonomic computing, recovery-oriented computing, adaptive distributed systems etc. These topics define the primary focus of my research.
Spontaneous recovery and adaptation to changing environments are sometimes accompanied by harmful side effects. For example, in self-stabilizing systems, even a single transient failure can corrupt the entire network before recovery begins. Paths to recovery can also compromise with the safety requirements. Examples are abundant in sensor networks and P2P networks. My research deals with various techniques for handling failures and recovery. The current areas of investigation are:
Graduate students
Sukumar Ghosh: Distributed Systems: An Algorithmic Approach, 2006
CRC Press (ISBN 1584885645)
Table of contents ** Solution to exercises ** All figures of the book in the .tar.gz format ** Partial list of errors
Past courses
Computer Organization (22C:60) (Spring 2008)
Peer-to-peer Networks (22C:196:001)) (Spring 2008)
Distributed Systems and Algorithms (22C:166)
(Fall 2007)
Computer Organization (22C:60) (Spring 2007)
Peer-to-peer Networks (22C:196:003)) (Spring 2007)
Computer Organization (22C:060) (Summer 2006)
High-Performance Computer Architecture (22C:160/55:132) (Spring 2006)
Seminar on Systems and Networks (22C:294)
(Spring 2006)
Distributed Systems and Algorithms (22C:166)
(Fall 2006)
Biographical data
Ninth Symposium on Self-stabilizing Systems (SSS 2007)
Ninth International Conference on Distributed Computing and Networking (ICDCN 2008)
Self-stabilization bibliography.
The APPLE project