Dataflow-based Rollback Recovery in Distributed and Multi-core Systems: a Novel Software Approach for Building Highly Reliable Distributed and Multi-core Systems - David Cummings - 图书 - VDM Verlag Dr. Müller - 9783639210194 - 2009年11月27日
如封面与标题不符,以标题为准

Dataflow-based Rollback Recovery in Distributed and Multi-core Systems: a Novel Software Approach for Building Highly Reliable Distributed and Multi-core Systems


商品到货时接收邮件提醒
Do you have a profile? 登录
添加至iMusic心愿单

Many computer programs, especially those involving scientific computing, are long running and rely on parallel processing. The long run times, as well as the increased probability of hardware failures as the number of processors increases and semiconductor feature sizes shrink, demand a high level of recoverability from hardware failures. To address this, we describe a novel approach to parallel programming based on the large grain dataflow model of computing. This approach provides a number of fault-tolerance features, including two forms of application-transparent rollback recovery, process restart and distributed checkpoint/rollback. We describe a simulator for a large grain dataflow system named COSMOS that was originally developed at NASA?s Jet Propulsion Laboratory and was based on a distributed-memory architecture. Using the COSMOS simulator, performance comparisons and tradeoffs are made between process restart and checkpoint/rollback, and an analytical model is developed to validate the empirical results. This is then used to predict the behavior of COSMOS programs in a multi-core environment, with very favorable results.

介质类型 图书     Paperback Book   (平装胶订图书)
已发行 2009年11月27日
ISBN13 9783639210194
出版商 VDM Verlag Dr. Müller
页数 236
商品尺寸 150 × 220 × 10 mm   ·   349 g
语言 英语  

David Cummings的更多作品

显示全部

Mere med samme udgiver