Primary tabs
Scalable heterogeneous CPU-GPU computations for unstructured tetrahedral meshes." IEEE Micro 35, no. 4 (2015): 6-15.
"Towards Detailed Tissue-Scale 3D Simulations of Electrical Activity and Calcium Handling in the Human Cardiac Ventricle In The 15th International Conference on Algorithms and Architectures for Parallel Processing (ICA3PP 2015). Lecture Notes in Computer Science, Springer Verlag, 2015.
ica3pp-2015.pdf (1.03 MB)

Towards Simulation of Subcellular Calcium Dynamics at Nanometre Resolution." International Journal of High Performance Computing Applications 29, no. 1 (2015): 51-63.
ijhpca-29-1-p51_63.pdf (4.13 MB)
"
Arrhythmogenic Mechanisms and Therapeutic Targets for Catecholaminergic Polymorphic Ventricular Tachycardia: A Simulation Study in a Human Ventricular Myocyte In Simula Research Laboratory., 2014.
Automated Transformation of GPU-Specific OpenCL Kernels Targeting Performance Portability on Multi-Core/Many-Core CPUs In Proceedings of Euro-Par 2014, Edited by F. Silva. Vol. 8632. LNCS 8632. Berlin Heidelberg New York: Springer, 2014.
Cellular Arrhythmogenesis in CPVT in a computational model of cardiac ventricular myocyte. Maastrich, Netherlands: European Working Group of Cardiac Cellular Electrophysiology, 2014.
Cellular Arrhythmogenesis in CPVT in a Computational Model of Cardiac Ventricular Myocyte. Scandinavian Physiological Society Meeting, 2014.
Effective Multi-GPU Communication Using Multiple CUDA Streams and Threads In 20th International Conference on Parallel and Distributed Systems (ICPADS 2014). IEEE, 2014.
sourouri_etal_icpads14.pdf (779.06 KB)

Heterogeneous CPU-GPU Computing for the Finite Volume Method on 3D Unstructured Meshes In 20th International Conference on Parallel and Distributed Systems (ICPADS 2014). IEEE, 2014.
icpads-hetero.pdf (852.85 KB)

High Efficient Sedimentary Basin Simulations on Hybrid CPU-GPU Clusters." Cluster Computing 17 (2014): 359-369.
"Mathematical Modeling of Ca Handling and Computational Studies of Ca-related Arrhythmogenesis in Heart In National University of Defense Technology, China. Changsha, China, 2014.
Performance Modeling of Serial and Parallel Implementations of the Fractional Adams-Bashforth-Moulton Method." Fractional Calculus and Applied Analysis 17 (2014): 617-637.
" Spontaneous Ca2+ Release and Ca2+ Waves Underlie Early and Delayed Afterdepolarizations, and Triggered Activity in Ryanodine Receptor Mutation associated with Catecholaminergic Polymorphic Ventricular Tachycardia. Scandinavian Physiological Society, 2014.
Time-Fractional Heat Equations and Negative Absolute Temperatures." Computers & Mathematics with Applications 67 (2014): 164-171.
"Utilizing Multiple Xeon Phi Coprocessors on One Compute Node In International Conference on Algorithms and Architectures for Parallel Processing, Edited by X. Sun. Vol. 8631. LNCS 8631. Berlin Heidelberg New York: Springer, 2014.
Adopting Heterogeneous Hardware Platforms for Scientific Computing In Guest lecture at Technical Unviersity of Denmark, December 5., 2013.
Simula.simula.2314.pdf (2.92 MB)

Balancing Efficiency and Accuracy for Sediment Transport Simulations." Computational Science & Discovery 6 (2013): 015011.
"On the GPU Performance of 3D Stencil Computations Implemented in OpenCL In Proceedings of International Supercomputing Conference, ISC 2013, Edited by J. M. Kunkel, T. Ludwig and H. W. Meuer. Vol. 7905. Lecture Notes in Computer Science 7905. Berlin Heidelberg New York: Springer, 2013.
On the GPU Performance of Cell-Centered Finite Volume Method Over Unstructured Tetrahedral Meshes In Proceedings of the 3rd Workshop on Irregular Applications: Architectures and Algorithms, Edited by A. Tumeo, J. Feo, O. Villa and S. Secchi. New York: ACM, 2013.
On the GPU-CPU Performance Portability of OpenCL for 3D Stencil Computations In Proceedings of IEEE 19th International Conference on Parallel and Distributed Systems. Los Alamitos, California • Washington • Tokyo: IEEE, 2013.
Introduction to Scientific Writing In Intensive course given at National University of Defence Technology, China, October 17-19., 2013.
Simula.simula.2316.pdf (201.5 KB)

Mint: a User-Friendly C-to-CUDA Code Translator In Talk given at SIAM CSE'13, February 25., 2013.
Simula.simula.2322.pdf (3.71 MB)

Performance of Sediment Transport Simulations on NVIDIA's Kepler Architecture In The International Conference on Computational Science, ICCS 2013, Edited by V. Alexandrov, M. Lees, V. Krzhizhanovskaya, J. Dongarra and P. M. A. Sloot. Vol. 18. Procedia Computer Science 18. Elsevier, 2013.