Deliverables
Final implementation and evaluation of different versions of the BLAS based on the final specification from WP7.
Software for hybrid methods: Software for partitioning saddle point problems and overdetermined systems; improved block Cimmino methods incorporating new solvers from other deliverables. Includes extensive testing, documentation and benchmarking.
Software integration: Integration of the NLAFET library in the respective application environments.
Novel SVD algorithms: Prototypes for the standard SVD algorithm, the symmetric eigenvalue problem and the QDWH-based SVD algorithm.
Software for highly unsymmetric factorizations: Implementation of the methods proposed in D3.4 on top of the common task framework. Includes extensive testing, documentation and benchmarking.
Prototypes for tiled one-sided factorizations with algorithm-based fault tolerance: Prototype software for the Cholesky, LU, and QR factorizations with algorithm-based fault tolerance described in D6.6.
Prototype software for eigenvalue problem solvers: Prototypes for reduction to non-symmetric condensed forms (Hessenberg and Hessenberg-triangular), for the symmetric eigenvalue problem and the non-symmetric eigenvalue problems.
Prototype software, phase 2: Prototypes for Krylov-based iterative methods, and multilevel preconditioners.
Bidiagonal factorization: Prototypes for two-sided bidiagonal factorization.
Prototype software, phase 1: Prototype software for sparse matrix-matrix multiplication and sparse low-rank matrix approximation.
Prototype software for different versions of the BLAS: Implementation and evaluation of different versions of the BLAS based on the draft specification from WP7.
Software for symmetrically structured factorizations: Implementation and adaptation of methods from D3.2 on top of the common task framework. Extension from the symmetric to the unsymmetric (but symmetrically structured) case. Includes extensive testing, documentation and benchmarking.
Prototypes for runtime systems exhibiting novel types of scheduling: Prototype for runtime systems capable of scheduling at a varying level of granularity/abstraction (i.e., basic kernels, BLAS, LAPACK, etc.), and runtime systems that can execute the tasks along the critical path with an adaptive level of parallelism.
Report on algorithm design and approaches to address issues around use of DAGs for sparse factorizations. Includes reporting on prototype code testing possible solutions.
Scalability and tunability of factorization algorithms: Report on scalability and tunability of the software implementing novel factorization algorithms.
One-sided matrix factorizations: Report on tile algorithms and new experimental algorithms for matrix factorizations (LU, Cholesky, symmetric indefinite and QR). Includes reporting on and documentation of prototype code developed.
Analysis and algorithm design: Report on novel Krylov methods and multilevel preconditioners, focusing on numerical efficiency and theoretical properties.
Eigenvalue solvers for nonsymmetric problems: Evaluation of new eigenvalue solvers for the non-symmetric eigenvalue problem, including Krylov methods.
Draft specification for Hybrid BLAS: Requirements and draft specification for a common set of high performance linear algebra kernels on hybrid systems: Hybrid BLAS.
Evaluation of software prototypes: Report evaluating the prototype software developed in task 6.1 with regard to overall system performance on a selection of linear algebra algorithms.
Integration: Report on the integration of preconditioners and iterative methods from D4.3 into the NLAFET library. Evaluation of the parallel efficiency of the new preconditioned iterative solvers.
Algorithm design for highly unsymmetric factorizations: Report on experimental algorithms for parallel Markowitz ordering, analyzing various approaches and highlighting issues arising. Includes reporting on prototype code testing possible algorithms and solutions.
Evaluation of auto-tuning techniques: Report on the effect of applying the novel scheduling and auto-tuning prototypes to various linear algebra problems.
Novel methods for static and dynamic scheduling: Evaluation of existing and novel methods for static and dynamic scheduling in various types of HPC systems. Includes documentation of algorithms and prototypes developed.
Requirements analysis: Report describing the outcome of the requirements analysis for all applications.
First dissemination report: Dissemination report for the first reporting period, M1-M18.
Dissemination and community outreach plan: Plan for how to disseminate new results and interact with the broader community.
An off-line auto-tuning framework based on heuristic search: Review of techniques for pruning the search space in the context of auto-tuning, resulting in prototypes for an offline auto-tuning framework based on heuristic search. Includes reporting on optimal circumstances for switching scheduling approaches at run-time.
Second dissemination report: Final dissemination report, covering M19-M36.
Algorithm design for hybrid methods: Report on partitioning techniques and performance bottlenecks for hybrid methods including block Cimmino. Analysis of methods for saddle-point and overdetermined systems.
Eigenvalue problem solvers: Report on computation of eigenvectors and reordering of eigenvalues in Schur and generalized Schur forms. Includes evaluation of the scalability and tunability of the prototype software developed.
Theoretical bounds for communication in sparse operations: Report on theoretical lower bounds for key sparse matrix operations such as matrix-matrix multiplication and factorization.
Performance evaluation: Evaluation of the communication complexity and parallel performance of the prototypes from D4.3.
Final Hybrid BLAS specification: Revised versions of the BLAS specification based on collaborations with academic institutions and hardware vendors.
Algorithm-based fault tolerance techniques: Report on algorithm-based fault tolerance applied to the tiled Cholesky, LU, and/or QR factorizations.
Validation and evaluation: Evaluation of the NLAFET library in the context of the applications, leading to validation of the library and recommendations for future improvements.
Project website (will contain both public parts, and parts that are restricted to the consortium), public and private source code repositories, bug tracking system, online forum, coding style, and developer guidelines.
Beta release of the NLAFET library: Beta release of parts of the NLAFET library and User's Guide.
Release of the NLAFET library: First complete release of the NLAFET library and associated User's Guide.
Publications
Authors:
W. Liu, A. Li, J. Hogg, I. Duff, B. Vinter
Published in:
Proceedings of Euro-Par 2016, Springer Lecture Notes, Issue 9833, 2016, Page(s) 617-630, ISBN 978-3-319-43658-6
Publisher:
Springer Verlag
DOI:
10.1007/978-3-319-43659-3_45
Authors:
Jack Dongarra, Sven Hammarling, Nicholas J. Higham, Samuel D. Relton, Pedro Valero-Lara, Mawussi Zounon
Published in:
Proceedings of the Fourth Workshop on Sustainable Software for Science: Practice and Experiences (WSSSPE4, 2016), 2016
Publisher:
CEUR Workshop Proceedings
Authors:
Grey Ballard, James Demmel, Laura Grigori, Mathias Jacquelin, Nicholas Knight
Published in:
Proceedings of the 30th Symposium on Parallelism in Algorithms and Architectures - SPAA '18, 2018, Page(s) 55-65, ISBN 9781-450357999
Publisher:
ACM Press
DOI:
10.1145/3210377.3210415
Authors:
Mahmoud Eljammaly, Lars Karlsson, Bo Kågström
Published in:
Companion of the 2018 ACM/SPEC International Conference on Performance Engineering - ICPE '18, 2018, Page(s) 5-8, ISBN 9781-450356299
Publisher:
ACM Press
DOI:
10.1145/3185768.3186304
Authors:
Azzam Haidar, Stanimire Tomov, Jack Dongarra, Nick Higham
Published in:
Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis, 2019
Publisher:
Association for Computing Machinery
Authors:
B. Adlerborn, L. Karlsson, and B. Kågström
Published in:
NLAFET Working Papers, Issue 1, 2016
Publisher:
Umeå University, the NLAFET project
Authors:
B. Adlerborn, B. Kågström, D. Kressner
Published in:
NLAFET Working Notes, Issue 2, 2016
Publisher:
Umeå University, NLAFET project
Authors:
J. Dongarra, I. Duff, J. Hogg, M. Gates, A. Haidar, S. Hammarling, N. J. Higham, P. Valero-Lara, S.D. Relton, S. Tomov, M. Zounon
Published in:
2016, ISSN 1749-9097
Publisher:
University of Manchester
Authors:
I. Duff, J. Hogg, and F. Lopez
Published in:
NLAFET Working Notes, Issue 7, 2016
Publisher:
Umeå University, STFC
Authors:
S. Hammarling
Published in:
NLAFET Working Notes, Issue 4, 2016
Publisher:
Umeå University, University of Manchester
Authors:
L. Grigori, S. Cayrols, and J. Demmel
Published in:
NLAFET Working Notes, Issue 3, 2016
Publisher:
Umeå University, INRIA
Authors:
S.D. Relton, P. Valero-Lara, and M. Zounon
Published in:
NLAFET Working Notes, Issue 5, 2016
Publisher:
Umeå University, University of Manchester
Authors:
Jonathan Hogg
Published in:
NLAFET Working Notes, Issue 6, 2016
Publisher:
Umeå University, STFC
Authors:
Ayala, Alan; Claeys, Xavier; Grigori, Laura
Published in:
https://hal.inria.fr/hal-01893036, Issue 1, 2018
Publisher:
INRIA
Authors:
Miraçi, Ani; Papež, Jan; Vohralík, Martin
Published in:
https://hal.archives-ouvertes.fr/hal-02070981, Issue 1, 2019
Publisher:
INRIA
Authors:
Laura Grigori, Olivier Tissot
Published in:
2018
Publisher:
INRIA
Authors:
Laura Grigori, Olivier Tissot
Published in:
2017
Publisher:
INRIA
Authors:
Hussam Al Daas, Laura Grigori, Pierre Jolivet, Pierre-Henri Tournier
Published in:
2019
Publisher:
INRIA
Authors:
Iain Duff and Florent Lopez
Published in:
NLAFET Working Notes, Issue 14, 2017
Publisher:
Umeå University. Also published as Technical Report RAL-TR-2017-006, Science & Technology Facilities Council, UK
Authors:
Maksims Abalenkovs, Jack Dongarra, Mark Gates, Azzam Haidar, Jakub Kurzak, Piotr Luszczek, Mawussi Zounon, Samuel Relton, Jakub Sistek, David Stevens, Ichitaro Yamazaki, Asim YarKhan
Published in:
NLAFET Working Notes, Issue 15, 2017
Publisher:
Umeå University. Also published as LAPACK Working Notes 293
Authors:
Jan Papez, Laura Grigori, Radoslav Stompor
Published in:
NLAFET Working Notes, Issue 19, 2018
Publisher:
Umeå University. Also published as INRIA Research Report 9157, France
Authors:
Iain Duff, Florent Lopez and Stojce Nakov
Published in:
NLAFET Working Notes, Issue 17, 2017
Publisher:
Umeå University. Also published as Technical Report RAL-TR-2017-010, STFC, UK
Authors:
Sébastien Cayrols, Iain Duff and Florent Lopez
Published in:
NLAFET Working Notes, Issue 20, 2018
Publisher:
Umeå University. Also published as Technical Report RAL-TR-2018-008, STFC, UK
Authors:
Maksims Abalenkovs, Negin Bagherpour, Jack Dongarra, Mark Gates, Azzam Haidar, Jakub Kurzak, Piotr Luszczek, Samuel Relton, Jakub Sistek, David Stevens, Panruo Wu, Ichitaro Yamazaki, Asim YarKhan, Mawussi Zounon
Published in:
NLAFET Working Notes, Issue 16, 2017
Publisher:
Umeå University. Also published as LAPACK Working Notes 292
Authors:
Mahmoud Eljammaly, Lars Karlsson and Bo Kågström
Published in:
NLAFET Working Notes, Issue 18, 2017
Publisher:
Umeå University
Authors:
Iain Duff, Jonathan Hogg and Florent Lopez
Published in:
NLAFET Working Notes, Issue 21, 2018
Publisher:
Umeå University. Also published as Technical Report RAL-TR-2018-012, STFC, UK
Authors:
Timothy Davis, Iain Duff, Stojce Nakov
Published in:
NLAFET Working Notes, Issue 22, 2019
Publisher:
Umeå University. Also published as Technical Report RAL-TR-2019-003, STFC, UK
Authors:
L. Grigori and O. Tissot
Published in:
NLAFET Working Notes, Issue 13, 2017
Publisher:
Umeå University. Also published as INRIA Research Report 9023
Authors:
M. Eljammaly, L. Karlsson, and B. Kågström
Published in:
NLAFET Working Notes, Issue 8, 2017
Publisher:
Umeå University, the NLAFET project
Authors:
Sven Hammarling
Published in:
NLAFET Working Notes, Issue 12, 2017
Publisher:
Umeå University, University of Manchester
Authors:
B. Adlerborn, C.C. Kjelgaard Mikkelsen, L. Karlsson, and B. Kågström
Published in:
NLAFET Working Notes, Issue 10, 2017
Publisher:
Umeå University, the NLAFET project
Authors:
C.C. Kjelgaard Mikkelsen and L. Karlsson
Published in:
NLAFET Working Notes, Issue 9, 2017
Publisher:
Umeå University
Authors:
M. Myllykoski, C.C. Kjelgaard Mikkelsen, L. Karlsson, and B. Kågström
Published in:
NLAFET Working Notes, Issue 11, 2017
Publisher:
Umeå University, the NLAFET project
Authors:
J. Papež, L. Grigori, R. Stompor
Published in:
Astronomy & Astrophysics, Issue 620, 2018, Page(s) A59, ISSN 0004-6361
Publisher:
EDP Sciences
DOI:
10.1051/0004-6361/201832987
Authors:
Jack Dongarra, Sven Hammarling, Nicholas J. Higham, Samuel D. Relton, Pedro Valero-Lara, Mawussi Zounon
Published in:
Procedia Computer Science, Issue 108, 2017, Page(s) 495-504, ISSN 1877-0509
Publisher:
Elsevier
DOI:
10.1016/j.procs.2017.05.138
Authors:
Zvonimir Bujanović, Lars Karlsson, Daniel Kressner
Published in:
SIAM Journal on Matrix Analysis and Applications, Issue 39/3, 2018, Page(s) 1270-1294, ISSN 0895-4798
Publisher:
Society for Industrial and Applied Mathematics
DOI:
10.1137/17m1153637
Authors:
Björn Adlerborn, Lars Karlsson, Bo Kågström
Published in:
SIAM Journal on Scientific Computing, Issue 40/2, 2018, Page(s) C157-C180, ISSN 1064-8275
Publisher:
Society for Industrial and Applied Mathematics
DOI:
10.1137/16m1103890
Authors:
Alan Ayala, Xavier Claeys, Laura Grigori
Published in:
Journal of Scientific Computing, Issue 79/2, 2019, Page(s) 1135-1160, ISSN 0885-7474
Publisher:
Kluwer Academic/Plenum Publishers
DOI:
10.1007/s10915-018-0885-5
Authors:
Jack Dongarra, Negin Bagherpour, Sven Hammarling, Jakub Šístek, David Stevens, Mawussi Zounon, Samuel D. Relton, Mark Gates, Azzam Haidar, Jakub Kurzak, Piotr Luszczek, Panruo Wu, Ichitaro Yamazaki, Asim Yarkhan, Maksims Abalenkovs
Published in:
ACM Transactions on Mathematical Software, Issue 45/2, 2019, Page(s) 1-35, ISSN 0098-3500
Publisher:
Association for Computing Machinery, Inc.
DOI:
10.1145/3264491
Authors:
Iain Duff, Jonathan Hogg, Florent Lopez
Published in:
Numerical Algebra, Control & Optimization, Issue 8/2, 2018, Page(s) 237-260, ISSN 2155-3297
Publisher:
American Institute of Mathematical Sciences
DOI:
10.3934/naco.2018014
Authors:
Azzam Haidar, Ahmad Abdelfattah, Mawussi Zounon, Stanimire Tomov, Jack Dongarra
Published in:
IEEE Transactions on Parallel and Distributed Systems, Issue 29/5, 2018, Page(s) 973-984, ISSN 1045-9219
Publisher:
Institute of Electrical and Electronics Engineers
DOI:
10.1109/tpds.2017.2783929
Authors:
Laura Grigori, Sebastien Cayrols, James W. Demmel
Published in:
SIAM Journal on Scientific Computing, Issue 40/2, 2018, Page(s) C181-C209, ISSN 1064-8275
Publisher:
Society for Industrial and Applied Mathematics
DOI:
10.1137/16m1074527
Authors:
Carl Christian Kjelgaard Mikkelsen, Angelika Beatrix Schwarz, Lars Karlsson
Published in:
Concurrency and Computation: Practice and Experience, 2017, Page(s) e5064, ISSN 1532-0626
Publisher:
John Wiley & Sons Inc.
DOI:
10.1002/cpe.5064
Authors:
Weifeng Liu, Ang Li, Jonathan D. Hogg, Iain S. Duff, Brian Vinter
Published in:
Concurrency and Computation: Practice and Experience, Issue 29/21, 2017, Page(s) e4244, ISSN 1532-0626
Publisher:
John Wiley & Sons Inc.
DOI:
10.1002/cpe.4244
Authors:
M. Myllykoski, T. Rossi, J. Toivanen
Published in:
Journal of Parallel and Distributed Computing, Issue 115, 2018, Page(s) 56-66, ISSN 0743-7315
Publisher:
Academic Press
DOI:
10.1016/j.jpdc.2018.01.004
Authors:
Angelika Schwarz, Lars Karlsson
Published in:
Parallel Computing, Issue 85, 2019, Page(s) 131-140, ISSN 0167-8191
Publisher:
Elsevier BV
DOI:
10.1016/j.parco.2019.04.001
Authors:
Ichitaro Yamazaki, Jakub Kurzak, Panruo Wu, Mawussi Zounon, Jack Dongarra
Published in:
IEEE Transactions on Parallel and Distributed Systems, Issue 29/8, 2018, Page(s) 1879-1892, ISSN 1045-9219
Publisher:
Institute of Electrical and Electronics Engineers
DOI:
10.1109/tpds.2018.2808964
Authors:
Jack Dongarra, Sven Hammarling, Nicholas J. Higham, Samuel D. Relton, Mawussi Zounon
Published in:
Euro-Par 2017: Parallel Processing, Issue 10417, 2017, Page(s) 511-522, ISBN 978-3-319-64202-4
Publisher:
Springer International Publishing
DOI:
10.1007/978-3-319-64203-1_37
Authors:
Iain Duff, Florent Lopez, Stojce Nakov
Published in:
Numerical Analysis and Optimization, Issue 235, 2018, Page(s) 67-98, ISBN 978-3-319-90025-4
Publisher:
Springer International Publishing
DOI:
10.1007/978-3-319-90026-1_4
Authors:
Mahmoud Eljammaly, Lars Karlsson, Bo Kågström
Published in:
Parallel Processing and Applied Mathematics, Issue 10777, 2018, Page(s) 579-589, ISBN 978-3-319-78023-8
Publisher:
Springer International Publishing
DOI:
10.1007/978-3-319-78024-5_50
Authors:
Iain Duff, Florent Lopez
Published in:
Parallel Processing and Applied Mathematics, Issue 10777, 2018, Page(s) 197-206, ISBN 978-3-319-78023-8
Publisher:
Springer International Publishing
DOI:
10.1007/978-3-319-78024-5_18
Authors:
Azzam Haidar, Ahmad Abdelfattah, Mawussi Zounon, Panruo Wu, Srikara Pranesh, Stanimire Tomov, Jack Dongarra
Published in:
Computational Science – ICCS 2018, Issue 10860, 2018, Page(s) 586-600, ISBN 978-3-319-93697-0
Publisher:
Springer International Publishing
DOI:
10.1007/978-3-319-93698-7_45
Authors:
Carl Christian Kjelgaard Mikkelsen, Lars Karlsson
Published in:
Parallel Processing and Applied Mathematics, Issue 10777, 2018, Page(s) 68-78, ISBN 978-3-319-78023-8
Publisher:
Springer International Publishing
DOI:
10.1007/978-3-319-78024-5_7
Authors:
Mirko Myllykoski
Published in:
Parallel Processing and Applied Mathematics, Issue 10777, 2018, Page(s) 207-216, ISBN 978-3-319-78023-8
Publisher:
Springer International Publishing
DOI:
10.1007/978-3-319-78024-5_19