The goal of this project is to simplify the addition of visualization and online/post-hoc analysis support for distributed simulation runs. The proposed approach will use versioned SoftPM containers, one per simulation node. Versioned containers support the creation of persistence points (checkpoints) on a time-line which can then be retrieved and browsed through by a visualizer or analyzer, either online or post-hoc. The initial goals of the project would be:
[Note: This project assumes that an implementation of versioned containers in SoftPM is available.]
The goal of this project is to make PRIME recoverable after system crashes and power failures to support long-running simulation runs on large number of nodes. The project will implement the use SoftPM simple containers to store key data structures that constitute PRIME runtime state and to design the recovery of simulation state and simulation execution. The goal of this project will be address:
[Note: This project assumes that an implementation of simple containers in SoftPM is available.]
What is HEC?
The goal of this project is to explore and optimize the use of Solid-state Drives (SSDs) in HEC platforms. SSDs have very different cost/reliability/performance characteristics than hard disk dirves (HDD). Consequently, SSDs can be used in many ways and this project will conduct a systematic analysis of these possibilities and develop designs to address each of them:
[Note: This project assumes the availability of substantial SSD storage for constructing the testbed.]
Contact: Luis or Ricardo
In this project we want to instrument QEMU to get full system memory accesses. We already have an implementation that solve many of the challenges we found in this project.
Contact: Luis or Ricardo
The idea of this project is to design an I/O scheduler for requests which data is available in more than one location. Having the same data in more than one location could be available in storage systems with redundancy like RAID 1 or deduplicated systems.
For this project we were trying to create an scheduler with theoretical support using competitive analysis. This was already done for commodity storage systems in the paper “New Algorithms for Disk Scheduling”. We already have some results we obtained in the Topics of Algorithms class.
Contact: Luis
The idea of this project is to study how we can combine multiple heterogeneous devices and export a single block array to the system. As goal, the combination of these devices should improve performance, reliability, power efficiency, etc. This project brings two important challenges:
Some of this questions are partially answer in previous meetings a projects we developed before.
Contact: Ricardo