Lyapunov Inverse Iteration for Computing a few Rightmost Eigenvalues of Large Generalized Eigenvalue Problems
In linear stability analysis of a large-scale dynamical system, we need to compute the rightmost eigenvalue(s) for a series of large generalized eigenvalue problems. Existing iterative eigenvalue solvers are not robust ...
Understanding Multicore Cache Behavior of Loop-based Parallel Programs via Reuse Distance Analysis
Understanding multicore memory behavior is crucial, but can be challenging due to the cache hierarchies employed in modern CPUs. In today's hierarchies, performance is determined by complex thread interactions, such as ...
Learning to Detect Carried Objects with Minimal Supervision
We propose a learning-based method for detecting carried objects that generates candidate image regions from protrusion, color contrast and occlusion boundary cues, and uses a classifier to filter out the regions unlikely ...
Constructing Inverted Files: To MapReduce or Not Revisited
Current high-throughput algorithms for constructing inverted files all follow the MapReduce framework, which presents a high-level programming model that hides the complexities of parallel programming. In this paper, ...
Exploiting Multi-Loop Parallelism on Heterogeneous Microprocessors
Heterogeneous microprocessors integrate CPUs and GPUs on the same chip, providing fast CPU-GPU communication and enabling cores to compute on data "in place." These advantages will permit integrated GPUs to exploit a ...
Design and Evaluation of Monolithic Computers Implemented Using Crossbar ReRAM
A monolithic computer is an emerging architecture in which a multicore CPU and a high-capacity main memory system are all integrated in a single die. We believe such architectures will be possible in the near future due ...