Much work has been done to optimize wavelet transforms for the SIMD extensions of modern CPUs. However, these approaches are mostly restricted to the vertical part of 2-D transforms with line-wise organized memory layouts, because this leads to a rather straightforward SIMD implementation. Using a common wavelet filter as an example, this work presents new approaches that apply SIMD operations to 1-D transforms and achieve reasonable speedups. As a result, the performance of algorithms that rely on wavelet transforms, such as JPEG2000, can be increased significantly. Several variants of parallelization are presented and compared, and their advantages and disadvantages for general filters are discussed.
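As a rough illustration of the idea, and not the paper's actual scheme, the sketch below vectorizes the predict step of a 1-D lifting-based 5/3 transform with SSE2 intrinsics. It assumes the even and odd samples have already been de-interleaved into separate arrays; the function name and data layout are invented for this example.

    #include <emmintrin.h>   /* SSE2 intrinsics */

    /* Hypothetical sketch: d[i] = odd[i] - 0.5f * (even[i] + even[i+1]).
     * n must be a multiple of 4 and even[] must hold n+1 valid entries;
     * boundary handling is left to the caller. */
    static void predict_53_sse2(const float *even, const float *odd,
                                float *d, int n)
    {
        const __m128 half = _mm_set1_ps(0.5f);
        for (int i = 0; i < n; i += 4) {
            __m128 e0 = _mm_loadu_ps(even + i);       /* even[i..i+3]   */
            __m128 e1 = _mm_loadu_ps(even + i + 1);   /* even[i+1..i+4] */
            __m128 o  = _mm_loadu_ps(odd + i);
            __m128 p  = _mm_mul_ps(half, _mm_add_ps(e0, e1));
            _mm_storeu_ps(d + i, _mm_sub_ps(o, p));   /* detail coefficients */
        }
    }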
The latency of broadcast/reduction operations has a significant impact on the performance of SIMD processors. This is especially true for associative programs, which make extensive use of global search operations. Previously, we developed a prototype associative SIMD processor that uses hardware multithreading to overcome the broadcast/reduction latency. In this paper we show, through simulations of the processor running an associative program, that hardware multithreading is able to improve performance by increasing system utilization, even for processors with hundreds or thousands of processing elements. However, the choice of thread scheduling policy used by the hardware is critical in determining the actual utilization achieved. We consider three thread scheduling policies and show that a thread scheduler that avoids issuing threads that will stall due to pipeline dependencies or thread synchronization operations is able to maintain system utilization independent of the number of threads.
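A minimal sketch of the kind of issue policy described above, with invented names and fields: a round-robin scheduler that skips any thread whose next instruction would stall on a pipeline dependency or a pending broadcast/reduction. This is an illustration, not the simulated processor's actual scheduler.

    #include <stdbool.h>

    typedef struct {
        bool ready;          /* thread has an instruction ready to issue    */
        int  stall_cycles;   /* > 0: waiting on a dependency or a sync. op  */
    } thread_ctx;

    /* Return the next issuable thread after `last`, or -1 if every thread
     * would stall this cycle (the pipeline then issues a bubble). */
    static int pick_thread(const thread_ctx *t, int nthreads, int last)
    {
        for (int k = 1; k <= nthreads; k++) {
            int i = (last + k) % nthreads;
            if (t[i].ready && t[i].stall_cycles == 0)
                return i;
        }
        return -1;
    }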
The polyhedral model for loop parallelization has proved to be an effective tool for advanced optimization and automatic parallelization of programs in higher-level languages. Yet, to integrate such optimizations seamlessly into production compilers, they must be performed on the compiler's internal, low-level intermediate representation (IR). With Polly, we present an infrastructure for polyhedral optimizations on such an IR. We describe the detection of program parts amenable to a polyhedral optimization (so-called static control parts), their translation to a Z-polyhedral representation, optimizations on this representation, and the generation of optimized IR code. Furthermore, we define an interface for connecting external optimizers and present a novel way of using the parallelism they introduce to generate SIMD and OpenMP code. To evaluate Polly, we compile the PolyBench 2.0 benchmarks fully automatically with PLuTo as an external optimizer and parallelizer, and we report significant speedups.
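For illustration only: the first loop nest below has affine bounds and memory accesses, the kind of static control part such an infrastructure is designed to detect; the second shows roughly the shape of code that external optimization plus OpenMP/SIMD code generation could produce for it. Neither is taken from the paper or from Polly's actual output.

    /* A static control part (SCoP): affine loop bounds and accesses. */
    void matvec(int n, double A[n][n], const double x[n], double y[n])
    {
        for (int i = 0; i < n; i++)
            for (int j = 0; j < n; j++)
                y[i] += A[i][j] * x[j];
    }

    /* One possible transformed shape: the outer loop is parallelized with
     * OpenMP, the inner reduction is left to SIMD vectorization. */
    void matvec_opt(int n, double A[n][n], const double x[n], double y[n])
    {
        #pragma omp parallel for
        for (int i = 0; i < n; i++) {
            double s = y[i];
            #pragma omp simd reduction(+:s)
            for (int j = 0; j < n; j++)
                s += A[i][j] * x[j];
            y[i] = s;
        }
    }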
This paper deals with the problem of finding the K smallest elements out of a totally ordered but non-sorted set of N elements. This problem, called K-selection, arises often in statistics, image processing and distributed computing. We propose two algorithms to solve this problem in hypercubes. Our first algorithm is asymptotically optimal when K = O((log N)^β), for any constant β. The second enlarges the range of optimality to K = N^ε, ε < 1, using a recursive strategy. These are major improvements on previous results.
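To make the problem concrete, here is a sequential, partition-based sketch of K-selection (the problem statement only, not the paper's hypercube algorithms); names and the pivot choice are arbitrary.

    /* Sketch: after k_select(a, 0, n - 1, K), the elements a[0..K-1] are
     * the K smallest of the array, in no particular order, in expected
     * linear time.  The pivot is simply the last element of the range. */
    static void k_select(int *a, int lo, int hi, int k)
    {
        if (lo >= hi) return;
        int pivot = a[hi], p = lo;
        for (int i = lo; i < hi; i++)                 /* Lomuto partition */
            if (a[i] < pivot) { int t = a[i]; a[i] = a[p]; a[p] = t; p++; }
        int t = a[p]; a[p] = a[hi]; a[hi] = t;        /* pivot to position p */
        if (k <= p)          k_select(a, lo, p - 1, k);  /* all K lie left of p  */
        else if (k > p + 1)  k_select(a, p + 1, hi, k);  /* need more from right */
        /* k == p + 1: a[0..k-1] already hold the K smallest elements. */
    }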
This paper describes the implementation of a parallel image processing algorithm whose aim is to provide good contrast enhancement in real time, especially on the boundaries of an object of interest defined by grey-level homogeneity (for example, an object of medical interest with functional or morphological homogeneity, such as a bone or a tumor). The neural network algorithm that performs this contrast enhancement has been implemented on a massively parallel SIMD machine (a MasPar with 8192 processors), and the communication between its processors has been optimized.
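Purely to illustrate the kind of per-pixel, data-parallel operation such an enhancement step maps to on a SIMD array with one pixel per processing element, here is a plain linear contrast stretch; the paper's actual method is a neural network algorithm and is not reproduced here.

    /* Illustrative linear contrast stretch over an assumed grey range
     * [lo, hi]: every pixel is processed independently, so on a SIMD
     * array each processing element can apply the same instruction to
     * its own pixel.  Not the neural network method of the paper. */
    static void contrast_stretch(unsigned char *img, int npix,
                                 unsigned char lo, unsigned char hi)
    {
        for (int i = 0; i < npix; i++) {
            int v = img[i];
            if (v <= lo)      v = 0;
            else if (v >= hi) v = 255;
            else              v = (v - lo) * 255 / (hi - lo);
            img[i] = (unsigned char)v;
        }
    }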
Strategies for computing the continuous wavelet transform on massively parallel SIMD arrays are introduced and discussed. The different approaches are theoretically assessed and the results of implementations on a MasPar MP-2 are compared.
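As background only (a serial reference, not one of the strategies assessed in the paper), the discretized continuous wavelet transform below correlates the signal with a scaled Mexican-hat wavelet at each scale; every (scale, position) coefficient is independent, which is what makes the computation a natural fit for massively parallel SIMD arrays. The wavelet choice and function names are assumptions.

    #include <math.h>

    /* Mexican-hat (Ricker) mother wavelet, unnormalized. */
    static double mexican_hat(double t)
    {
        return (1.0 - t * t) * exp(-0.5 * t * t);
    }

    /* out[s*n + b] = (1/sqrt(scale[s])) * sum_t x[t] * psi((t - b) / scale[s]) */
    static void cwt(const double *x, int n,
                    const double *scale, int nscales, double *out)
    {
        for (int s = 0; s < nscales; s++)
            for (int b = 0; b < n; b++) {
                double acc = 0.0;
                for (int t = 0; t < n; t++)
                    acc += x[t] * mexican_hat((t - b) / scale[s]);
                out[s * n + b] = acc / sqrt(scale[s]);
            }
    }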