Normal view MARC view ISBD view

Optimizing a GPU algorithm through hardware profiling analysis (Record no. 56103)

MARC details
000 -CABECERA
campo de control de longitud fija	02075naa a2200217 a 4500
003 - IDENTIFICADOR DEL NÚMERO DE CONTROL
campo de control	AR-LpUFIB
005 - FECHA Y HORA DE LA ÚLTIMA TRANSACCIÓN
campo de control	20250311170427.0
008 - DATOS DE LONGITUD FIJA--INFORMACIÓN GENERAL
campo de control de longitud fija	230201s2014 xx o 000 0 eng d
024 8# - Otro identificador estandar
Número estándar o código	DIF-M6920
--	7084
--	DIF006322
040 ## - FUENTE DE LA CATALOGACIÓN
Centro catalogador/agencia de origen	AR-LpUFIB
Lengua de catalogación	spa
Centro/agencia transcriptor	AR-LpUFIB
100 1# - ENTRADA PRINCIPAL--NOMBRE DE PERSONA
Nombre de persona	Tinetti, Fernando Gustavo
245 10 - MENCIÓN DE TÍTULO
Título	Optimizing a GPU algorithm through hardware profiling analysis
300 ## - DESCRIPCIÓN FÍSICA
Extensión	1 archivo (606,2 KB)
500 ## - NOTA GENERAL
Nota general	Formato de archivo: PDF. -- Este documento es producción intelectual de la Facultad de Informática - UNLP (Colección BIPA/Biblioteca)
520 ## - SUMARIO, ETC.
Sumario, etc.	Usage of GPU-based architectures for scientific computing has been steadily increasing in the last years. This new paradigm for both programming and execution has been applied to solve several classic problems much faster than using the conventional multiprocessor and/or multicomputer approach. These architectures allow an increase in performance - compared to conventional CPU processors - for specific types of algorithms that are particularly suitable for its greater number of simpler cores which execute one single instruction at a time, each one for different sets of data. Since this is still a relative new technology, GPU device manufacturers as well as independent researchers have published several experiences (success stories), best practices, and optimization guides to aid developers for obtaining the maximum program performance. However, there is still little information about the possible optimizations that can only be harnessed by analyzing the specific device's hardware performance counters. In this paper, we discuss several optimizations based on hardware profiling and share our learned lessons about how such data can be used to optimize a scientific algorithm on a GPU using CUDA.
534 ## - NOTA SOBRE LA VERSIÓN ORIGINAL
Encabezamiento principal del original	International Conference on Computational Science and Computational Intelligence (2014 mar. 10-13 : Las Vegas). Proceedings, vol. 1, pp. 45-51
650 #4 - PUNTO DE ACCESO ADICIONAL DE MATERIA--TÉRMINO DE MATERIA
Término de materia o nombre geográfico como elemento de entrada	HARDWARE
700 1# - PUNTO DE ACCESO ADICIONAL--NOMBRE DE PERSONA
Nombre de persona	Martín, Sergio M.
856 40 - LOCALIZACIÓN Y ACCESO ELECTRÓNICOS
Identificador Uniforme del Recurso	<a href="http://dx.doi.org/10.1109/CSCI.2014.16">http://dx.doi.org/10.1109/CSCI.2014.16</a>
942 ## - ELEMENTOS DE PUNTO DE ACCESO ADICIONAL (KOHA)
Tipo de ítem Koha	Capítulo de libro

Holdings
Estado de retiro	Estado de pérdida	Estado dañado	Disponibilidad	Colección	Biblioteca permanente	Biblioteca actual	Fecha de adquisición	Total de préstamos	Signatura topográfica completa	Fecha visto por última vez	Identificador Uniforme del Recurso	Precio válido a partir de	Tipo de ítem Koha
			Recurso en Línea	Biblioteca digital	Biblioteca de la Facultad de Informática	Biblioteca de la Facultad de Informática	11/03/2025		A0659	11/03/2025	http://catalogo.info.unlp.edu.ar/meran/getDocument.pl?id=960	11/03/2025	Capítulo de libro