Searched for: subject%3A%22OpenCL%22
(1 - 14 of 14)
document
Bijl, Robin (author)
The ever-increasing demand for computing has led to the need for specialized heterogeneous hardware, and the frameworks required to utilize them. Besides the traditional central processing units, more and more programs will make use of specialized hardware to accelerate computations. However, the increase in computing also leads to shorter mean...
master thesis 2023
document
Berkers, Martijn (author)
The application of accelerators in HPC applications has seen enormous growth in the last decade. In the field of HPC demands on throughput are steadily growing. <br/>Not all of the algorithms used have a clear HW architecture which performs the best. Our work explores the performance of different HW architectures in solving a convex optimization...
master thesis 2020
document
Hoozemans, J.J. (author), van Straten, J. (author), Viitanen, Timo (author), Tervo, Aleksi (author), Kadlec, Jiri (author), Al-Ars, Z. (author)
The proliferation of processing hardware alternatives allows developers to use various customized computing platforms to run their applications in an optimal way. However, porting application code on custom hardware requires a lot of development and porting effort. This paper describes a heterogeneous computational platform (the ALMARVI...
journal article 2019
document
Berger, M.E.R. (author)
Using the histogram procedure, this work studies performance determining factors in computing in parallel on SIMD and SIMT devices. Modern graphics pro-cessing units (GPUs) support SIMT, multiple threads running the same instruction, whereas central processing units (CPUs) use SIMD, in which one instruction op-erates on multiple operands. As...
master thesis 2015
document
Van der Wijst, H. (author)
In recent years the use of co-processors to accelerate specific tasks is becoming more common. To simplify the use of these accelerators in software, the OpenCL framework has been developed. This framework provides programs a cross-platform interface for using accelerators. The ?-VEX processor is a run-time reconfigurable VLIW processor. It...
master thesis 2015
document
Shen, J. (author)
Heterogeneous platforms are mixes of different processing units in a compute node (e.g., CPUs+GPUs, CPU+MICs) or a chip package (e.g., APUs). This type of platforms keeps gaining popularity in various computer systems ranging from supercomputers to mobile devices. In this context, improving their efficiency and usability has become increasingly...
doctoral thesis 2015
document
Postma, J.H. (author)
Conventionally programmed systems (e.g. robots) are not able to adapt to unforeseen changes in their task or environment. Reinforcement learning (RL), a machine learning approach, could grant this flexibility. Many fields of work could greatly benefit from this, be it in terms of cost, time or some other parameter. With RL, a learning agent...
master thesis 2015
document
Timal, N.R.N. (author)
Detecting similarities between (RNA, DNA, and protein) sequences is an important part of bioinformatics. Among the algorithms used to accomplish this, the Smith-Waterman algorithm is very popular. A sequential implementation of Smith-Waterman requires quadratic running time with respect to the length of the sequences. As the amount of data in...
master thesis 2015
document
Fang, J. (author)
The architecture diversity of many-core processors - with their different types of cores, and memory hierarchies - makes the old model of reprogramming every application for every platform infeasible. Therefore, inter-platform portability has become a desirable feature of programming models. While functional portability is ensured by standards...
doctoral thesis 2014
document
Shamonin, D.P. (author), Bron, E.E. (author), Lelieveldt, B.P.F. (author), Smits, M. (author), Klein, S. (author), Staring, M. (author)
Nonrigid image registration is an important, but time-consuming task in medical image analysis. In typical neuroimaging studies, multiple image registrations are performed, i.e., for atlas-based segmentation or template construction. Faster image registration routines would therefore be beneficial. In this paper we explore acceleration of the...
journal article 2014
document
Spliet, R. (author)
Traditional (sequential) applications use malloc for a variety of dynamic data structures, like linked lists or trees. GPGPU is gaining attention and popularity because its massively-parallel architecture allows for great speed improvement for programs that can be parallelised and implemented for a platform like OpenCL. Programmers who try to...
master thesis 2013
document
Penders, A. (author)
Data analysis is a rising field of interest for computer science research due to the growing amount of information that is digitally available. This increase in data has as direct consequence that any analysis is significantly complex. By using structured representations for the data sets, like graphs, the analysis becomes feasible, but is still...
master thesis 2012
document
Hemminga, S. (author)
In the last few years, the computing industry has changed its course from ever higher clock speeds to multi- and many-core systems. These new parallel systems su?er a “programmability gap”: there is a large gap between the performance a platform is theoretically capable of, and what the programmer is able to achieve. Early programming models...
master thesis 2011
document
Raskovic, M. (author)
Multicore systems have become an indispensable part of our everyday life. They represent a viable alternative for increasing processor performance without hitting the memory and power walls. However, the shift from traditional programming to multicore programming has a critical influence in three dimensions: the applications, the software tools,...
master thesis 2010
Searched for: subject%3A%22OpenCL%22
(1 - 14 of 14)