Publications

2024

  • A shared compilation stack for distributed-memory parallelism in stencil DSLs. Bisbas, G., Lydike, A., Bauer, E., Brown, N., Fehr, M., Mitchell, L., Rodriguez-Canal, G., Jamieson, M., Kelly, P., Steuwer, M., Grosser, T. To appear in 2024 ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS).
  • Performance characterisation of the 64-core SG2042 RISC-V CPU for HPC. Brown, N., Jamieson, M. To appear in Fourth International Workshop on RISC-V for HPC (RISC-V HPC) 2024.
  • Predicting accurate batch queue wait times on production supercomputers by combining machine learning techniques. Brown, N., Gibb, G., Belikov, E., Nash, R. To appear in Concurrency and Computation: Practice and Experience Special Issue.
  • Evaluating Versal AI Engines for option price discovery in market risk analysis. Klaisoongnoen, M., Brown, N. In the 32nd ACM/SIGDA International Symposium on Field-Programmable Gate Arrays (ISFPGA). (pdf)

2023

  • Fortran performance optimisation and auto-parallelisation by leveraging MLIR-based domain specific abstractions in Flang. Brown, N., Jamieson, M., Bauer, E., Lydike, A., Grosser, T. In The Ninth Annual Workshop on the LLVM Compiler Infrastructure in HPC. (pdf)
  • Is RISC-V ready for HPC prime-time: Evaluating the 64-core Sophon SG2042 RISC-V CPU. Brown, N., Jamieson, M., Lee, J., Wang, P. In The Second International Workshop on RISC-V for HPC. (pdf)
  • Stencil-HMLS: A multi-layered approach to the automatic optimisation of stencil codes on FPGA. Rodriguez-Canal, G., Brown, N., Jamieson, M., Bauer, E., Lydike, A., Grosser, T. In The Ninth IEEE/ACM International Workshop on Heterogeneous High-performance Reconfigurable Computing (H2RC). (pdf)
  • Task-based preemptive scheduling on FPGAs leveraging partial reconfiguration. Rodriguez-Canal, G., Brown, N., Torres, Y., Gonzalez-Escribano, A. In Concurrency and Computation: Practice and Experience. (pdf)
  • Fortran High-Level Synthesis: Reducing the barriers to accelerating HPC codes on FPGAs. Rodriguez-Canal, G., Brown, N., Dykes, T., Jones, J., Haus, U. In the 33rd International Conference on Field-Programmable Logic and Applications. (pdf)
  • VESTEC: Visual Exploration and Sampling Toolkit for Extreme Computing Urgent decision making meets HPC: Experiences and future challenges. Flatken, M., Podobas, A., Fellegara, R., Basermann, A., Holke, J., Knapp, D., Kontak, M., Krullikowski, C., Nolde, M., Brown, N., Nash, R., Gibb, G., Belikov, E., Chien, S., Markidis, S., Guillou, P., Tierny, J., Gueunet, C., Günther, J., Pawlowski, M., Poletti, P., Guzzetta, G., Manica, M., Zardini, A., Chaboureau, J., Mendes, M., Cardil, A., Monedero, S., Ramirez, J., Gerndt, A. In IEEE Access volume 11 pp 87805 – 87834. (pdf)
  • Experiences of running an HPC RISC-V testbed. Brown, N., Jamieson, M., Lee, J. Extended abstract in RISC-V Summit Europe. (pdf)
  • Backporting RISC-V Vector assembly. Lee, J., Jamieson, M., Brown, N. In the first international workshop on RISC-V for HPC. (pdf)
  • Test-driving RISC-V Vector hardware for HPC. Lee, J., Jamieson, M., Brown, N., Jesus, R. In the first international workshop on RISC-V for HPC. (pdf)
  • Morpheus unleashed: Fast cross-platform SpMV on emerging architectures. Stylianou, C., Klaisoongnoen, M., Jesus, R., Brown, N., Weiland, M. In Proceedings of the Cray User Group 2023 (CUG). (pdf)
  • Making the case: The role of FPGAs for efficiency-driven quantitative financial modelling. Klaisoongnoen, M., Brown, N. In Economics of Financial Technology Conference.
  • Exploring the Versal AI engines for accelerating stencil-based atmospheric advection simulation. Brown, N. In The 31st ACM/SIGDA International Symposium on Field-Programmable Gate Arrays (FPGA). (pdf)

2022

  • Fast and energy-efficient derivatives risk analysis: Streaming option Greeks on Xilinx and Intel FPGAs. Klaisoongnoen, M., Brown, N., Brown, O. In The Eighth IEEE/ACM International Workshop on Heterogeneous High-performance Reconfigurable Computing (H2RC). (pdf)
  • Web-based volunteer distributed computing for handling time-critical urgent workloads. Brown, N., Newby, S. In IEEE/ACM Combined International Workshop on Interactive Urgent Supercomputing (CIW-IUS). (pdf)
  • xDSL: A common compiler ecosystem for domain specific languages. Brown, N., Grosser, T., Fehr, M., Steuwer, M., Kelly, P. To appear as research poster at Supercomputing (SC22). (pdf)
  • Programming abstractions for preemptive scheduling in FPGAs using partial reconfiguration. Rodriguez-Canal, G., Brown, N., Torres, Y., Gonzalez-Escribano, A. In Springer LNCS Euro-Par Hetero-Par workshop proceedings. (pdf)
  • Performance of the Vipera framework for DSLs on micro-core architectures. Jamieson, M., Brown, N. In Springer LNCS Euro-Par DSL-HPC workshop proceedings. (pdf)
  • TensorFlow as a DSL on Cerebras WSE for stencil-based computation codes. Brown, N., Echols, B., Zarins, J., Grosser, T. In Springer LNCS Euro-Par DSL-HPC workshop proceedings. (pdf)
  • NVRAM as an Enabler to New Horizons in Graph Processing. Capelli, L., Brown, N., Bull, J. M. In Springer Nature Computer Science, Volume 3 Issue 5. September 2022. (pdf)
  • Low-power option Greeks: Efficiency-driven market risk analysis using FPGAs. Klaisoongnoen, M., Brown, N., Brown, O. T. In 12th International Symposium on Highly Efficient Accelerators and Reconfigurable Technologies (HEART). (pdf)
  • Workflows to driving high-performance interactive supercomputing for urgent decision making. Brown, N., Nash, R., Gibb, G., Belikov, E., Podobas, A., Chien, W. D., Markidis, S., Flatken, M., Gerndt, A. In LNCS High Performance Computing: ISC High Performance workshops 2022. (pdf)
  • FPGAs for accelerating HPC engineering workloads: the why and the how. Brown, N. Whitepaper published by EXCELLERAT Centre of Excellence (CoE). (pdf)
  • Predicting batch queue job wait times for informed scheduling of urgent HPC workloads. Brown, N., Gibb, G., Belikov, E., Nash, R. In Proceedings of the Cray User Group 2022 (CUG). (pdf)
  • A programming model for developing Application Specific Dataflow Machines on FPGAs. Brown, N. Extended abstract in the 30th IEEE International Symposium on Field-Programmable Custom Computing Machines (FCCM).

2021

  • Porting incompressible flow matrix assembly to FPGAs for accelerating HPC engineering simulations. Brown, N. In The Seventh IEEE/ACM International Workshop on Heterogeneous High-performance Reconfigurable Computing (H2RC). (pdf)
  • Utilising urgent computing to tackle the spread of mosquito-borne diseases. Brown, N., Nash, R., Poletti, P., Guzzetta, G., Manica, M., Zardini, A., Flatken, M., Vidal, J., Gueunet, C., Belikov, E., Tierny, J., Podobas, A., Der Chien, W., Markidis, S., Gerndt, A. In IEEE/ACM Third International Workshop on the use of HPC for Urgent Decision Making (UrgentHPC). (pdf)
  • Optimisation of an FPGA Credit Default Swap engine by embracing dataflow techniques. Brown, N., Klaisoongnoen, M., Brown, O. In IEEE Cluster workshop on FPGAs for HPC (pdf)
  • Accelerating advection for atmospheric modelling on Xilinx and Intel FPGAs. Brown, N. In IEEE Cluster workshop on FPGAs for HPC. (pdf)
  • I feel the need for speed: Exploiting latest generation FPGAs in providing new capabilities for high frequency trading. Klaisoongnoen, M., Brown, N., Brown, O. In PhD forum of International Symposium on Highly Efficient Accelerators and Reconfigurable Technologies (HEART ’21)
  • Application specific dataflow machine construction for programming FPGAs via Lucent. Brown, N. In Languages, Tools, and Techniques for Accelerator Design (LATTE). (pdf)
  • Compact native code generation for dynamic languages on micro-core architectures. Jamieson, M., Brown, N. In ACM SIGPLAN 2021 International Conference on Compiler Construction (CC 2021). (pdf)

2020

  • Porting incompressible flow matrix assembly to FPGAs for accelerating HPC engineering simulations. Brown, N. In The Sixth IEEE/ACM International Workshop on Heterogeneous High-performance Reconfigurable Computing (H2RC). (pdf)
  • Exploring the acceleration of Nekbone on reconfigurable architectures. Brown, N. In IEEE/ACM International Workshop on Heterogeneous High-performance Reconfigurable Computing (H2RC). (pdf)
  • Benchmarking micro-core architectures for detecting disasters at the edge. Jamieson, M., Brown, N. In IEEE/ACM Second International Workshop on the use of HPC for Urgent Decision Making (UrgentHPC). (pdf)
  • A Bespoke Workflow Management System for Data-Driven Urgent HPC. Gibb, G., Brown, N., Nash, R. W., Kontak, M., Mendes, M., Monedero, S., Fidalgo, H. D., Cisneros, J. R. & Cardil, A. In IEEE/ACM Second International Workshop on the use of HPC for Urgent Decision Making (UrgentHPC). (pdf)
  • Supercomputing with MPI meets the Common Workflow Language standards: an experience report. Nash, R., Brown, N., Kontak, M., Crusoe, M. R. In IEEE/ACM 15th Workshop on Workflows in Support of Large-Scale Science (WORKS). (pdf)
  • Investigating applications on the A64FX. Jackson, W. A., Weiland, M., Brown, N., Turner, A., Parsons, M. In EAHPC-2020 – Embracing Arm: a journey of porting and optimization to the latest Arm-based processors as part of IEEE Cluster 2020. (pdf)
  • Having your cake and eating it: Exploiting Python for programmer productivity and performance on micro-core architectures using ePython. Jamieson, M., Brown, N. & Liu, S. In Proceedings of the 19th Python in Science Conference: SciPy 2020 (more info) (pdf)
  • Weighing up the new kid on the block: Impressions of using Vitis for HPC software development. Brown, N. In 30th International Conference on Field Programmable Logic and Applications (FPL) (more info) (pdf)
  • High level programming abstractions for leveraging hierarchical memories with micro-core architectures. Jamieson, M. & Brown, N. In Journal of Parallel and Distributed Computing (more info) (pdf)
  • Machine Learning for Gas and Oil Exploration. Nordloh, V., Roubickova, A. & Brown, N. In Proceedings of 9th International Conference on Prestigious Applications of Intelligent Systems (more info) (pdf)
  • Machine learning on Crays to optimise petrophysical workflows in oil and gas exploration. Brown, N., Roubickova, A., Lampaki, I., MacGregor, L., Ellis, M. & Vera de Newton. In Concurrency and Computation: Practice and Experience (more info) (pdf)
  • Modelling the earth’s geomagnetic environment on Cray machines using PETSc and SLEPc. Brown, N., Bainbridge, B., Beggan, C. D., Brown, W., Hamilton, B. & Macmillan, S. In Concurrency and Computation: Practice and Experience (more info) (pdf)

2019

  • It’s all about data movement: Optimising FPGA data access to boost performance. Brown, N. & Dolman, D. In The Fifth IEEE/ACM International Workshop on Heterogeneous High-performance Reconfigurable Computing (H2RC) (more info) (pdf)
  • The Technologies Required for Fusing HPC and Real-Time Data to Support Urgent Computing. Gibb, G., Nash, R., Brown, N. & Prodan, B. In IEEE/ACM first international workshop on using HPC for urgent decision making (UrgentHPC) (more info) (pdf)
  • iPregel: Strategies to Deal with an Extreme Form of Irregularity in Vertex-Centric Graph Processing. Capelli, L., Brown, N. & Bull, J. In IEEE/ACM 9th Workshop on Irregular Applications: Architectures and Algorithms (IA3) (more info) (pdf)
  • Exploring the acceleration of the Met Office NERC Cloud model using FPGAs. Brown, N. In LNCS High Performance Computing: ISC High Performance 2019 (more info) (pdf)
  • The role of interactive super-computing in using HPC for urgent decision making. Brown, N., Nash, R., Gibb, G., Prodan, B., Kontak, M., Olshevsky, V. & Der Chien, W. In LNCS High Performance Computing: ISC High Performance 2019 (more info) (pdf)
  • Eithne: A framework for benchmarking micro-core accelerators. Jamieson, M. & Brown, N. Poster in Supercomputing 2019 (more info) (pdf)
  • Using machine learning to reduce ensembles of geological models for oil and gas exploration. Roubickova, A., Brown, N., Brown, O., MacGregor, L. & Stewart, M. In IEEE/ACM 5th International Workshop on Data Analysis and Reduction for Big Scientific Data (DRBSD-5) (more info) (pdf)
  • iPregel: Vertex-Centric Programmability vs Memory Efficiency and Performance, Why Choose? Capelli, L., Hu, Z., Zakian, T., Brown, N. & Bull, J. In Parallel Computing (more info) (pdf)
  • Machine learning on Crays to optimise petrophysical workflows in oil and gas exploration. Brown, N., Roubickova, A., Lampaki, I., MacGregor, L., Ellis, M. & Vera de Newton. In Proceedings of the Cray User Group 2019 (CUG) (more info) (pdf)
  • Modelling the earth’s geomagnetic environment on Cray machines using PETSc and SLEPc. Brown, N., Bainbridge, B., Beggan, C. D., Macmillan, S., Brown, W. & Hamilton, B. In Proceedings of the Cray User Group 2019 (CUG) (more info) (pdf)
  • Massively parallel parcel-based simulation of moist convection. Boeing, S., Gibb, G., Brown, N., Weiland, M. & Dritschel, D. G. In EGU General Assembly 2019 (more info)

2018

  • Driving asynchronous distributed tasks with events. Brown, N., Thomson Brown, O. & Bull, J. In 4th Workshop on Open Source Supercomputing (more info) (pdf)
  • Leveraging MPI RMA to optimise halo-swapping communications in MONC on Cray machines. Brown, N., Bareford, M. & Weiland, M. In Concurrency and Computation: Practice and Experience (more info) (pdf)
  • Event Driven Asynchronous Tasks (EDAT). Brown, N. Poster in ISC 2018 (more info)
  • Leveraging hierarchical memories for micro-core architectures. Brown, N. & Jamieson, M. In the 5th Exascale Applications and Software Conference (more info)
  • Leveraging SLEPc in modeling the earth’s magnetic environment. Brown, N., Hamilton, B., Brown, W., Beggan, C. D., Bainbridge, B. & Macmillan, S. In the 5th Exascale Applications and Software Conference (more info)
  • A directory/cache for leveraging the efficient use of distributed memory by task-based runtime systems. Rotaru, T., Lörwald, B., Brown, N., Rahn, M., Aumage, O., Beltran, V., Teruel, X., Ciesko, J. & Sistek, J. In the 5th Exascale Applications and Software Conference (more info)
  • In situ data analytics for highly scalable cloud modelling on Cray machines. Brown, N., Weiland, M., Hill, A. & Shipway, B. In Concurrency and Computation: Practice and Experience (more info) (pdf)

2017

  • ePython: An Implementation of Python Enabling Accessible Programming of Micro-Core Architectures. Brown, N. In Computing Insight UK (more info)
  • Offloading Python kernels to micro-core architectures. Brown, N. Poster in Supercomputing 2017 (more info)
  • A comparison of techniques for solving the Poisson equation in CFD. Brown, N. In the Journal of Civil Aircraft Design and Research (more info) (pdf)
  • Type oriented programming for task based parallelism. Brown, N., Capelli, L. & Bull, J. In proceedings of the Type-driven Development Workshop (more info) (pdf)
  • Type oriented parallel programming for Exascale. Brown, N. In Advances in engineering software (more info) (pdf)
  • Directory/Cache API for Sharing Data in Distributed Memory Systems. Rotaru, T., Lörwald, B., Rahn, M., Querol, V., Aumage, O., Teruel, X. & Brown, N. Poster in ISC 2017 (more info)
  • MPI RMA as a directory/cache interoperability layer. Brown, N., Rotaru, T. & Lörwald, B. Poster in ISC 2017 (more info)
  • In-situ data analytics for highly scalable cloud modelling on Cray machines. Brown, N., Weiland, M., Hill, A. & Shipway, B. In proceedings of the Cray User Group (CUG) (more info) (pdf)
  • Porting the microphysics model CASIM to GPU and KNL Cray machines. Brown, N., Nigay, A., Weiland, M., Hill, A. & Shipway, B. In proceedings of the Cray User Group (CUG) (more info) (pdf)

2016

  • MONC – highly scalable cloud modelling on the latest supercomputers. Brown, N. In Computing Insight UK (more info)
  • ePython: An implementation of Python for the many-core Epiphany coprocessor. Brown, N. In the 6th Workshop on Python for High-Performance and Scientific Computing (more info) (pdf)
  • On Processing Extreme Data. Petcu, D., Iuhasz, G., Pop, D., Talia, D., Carretero, J., Prodan, R., Fahringer, T., Grasso, I., Doallo, R., Martin, M. J., Fraguela, B. B., Trobec, R., Depolli, M., Rodriguez, F. A., De Sande, F., Da Costa, G., Pierson, J., Anastasiadis, S., Bartzokas, A., Lolis, C., Goncalves, P., Brito, F. & Brown, N. In the Journal of Scalable Computing: Practice and Experience (more info)

2015

  • A highly scalable Met Office NERC Cloud model. Brown, N., Weiland, M., Hill, A., Shipway, B., Maynard, C., Allen, T. & Rezny, M. In proceedings of the 3rd International Conference on Exascale Applications and Software (more info) (pdf)
  • A directive based hybrid Met Office NERC cloud model. Brown, N., Lepper, A., Weiland, M., Hill, A., Shipway, B. & Maynard, C. In proceedings of the Second Workshop on Accelerator Programming using Directives (more info) (pdf)
  • Report on the state of the art of worldwide co-design centres. Brown, N. EESI-2 whitepaper (more info) (pdf)

2014

  • A type-oriented Graph500 benchmark. Brown, N. In Lecture Notes in Computer Science vol. 8488 (more info) (pdf)

2013

  • A hybrid approach for extreme scalability when solving linear systems. Brown, N., Bull, J. & Bethune, I. In proceedings of the First International Conference on Exascale Applications and Software (more info) (pdf)
  • Applying Type Oriented Programming to the PGAS Memory Model. Brown, N. In 7th International Conference on PGAS Programming Models (more info) (pdf)
  • Type oriented parallel programming for Exascale. Brown, N. In proceedings of the First International Conference on Exascale Applications and Software (more info) (pdf)