Our Publications
2024
HPCAsia 2024 Workshops: Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region Workshops, 2024
Abstract
The adoption of ARM processor architectures is on the rise in the HPC ecosystem. The Fugaku supercomputer is a homogeneous ARM-based machine and one of the most powerful machines in the world. In the programming world, dependent task-based programming models are gaining traction due to their many advantages, such as dynamic load balancing, implicit expression of communication/computation overlap, and early-bird communication posting. MPI and OpenMP are two widespread programming standards that make task-based programming possible at the distributed memory level. Despite its many advantages, mixed use of these standard programming models with dependent tasks is still under-evaluated on large-scale machines. In this paper, we provide an overview of mixing the OpenMP dependent tasking model with MPI using the state-of-the-art software stack (GCC 13, Clang 17, MPC-OMP). We report the level of performance to expect when porting applications to such mixed use of the standards on the Fugaku supercomputer, using two benchmarks (Cholesky, HPCCG) and a proxy application (LULESH). We show that the software stack, resource binding, and communication progression mechanisms are factors with a significant impact on performance. On distributed applications, performance reaches up to 80% efficiency for task-based applications like HPCCG. We also point out a few areas of improvement in OpenMP runtimes.
Computers & Mathematics with Applications, Volume 158, Pages 56-73, ISSN 0898-1221, 2024
Abstract
We propose in this article a monotone finite volume diffusion scheme on 3D general meshes for radiation hydrodynamics. Primary unknowns are averaged values over the cells of the mesh. The scheme requires the evaluation of intermediate unknowns located at the vertices of the mesh; these vertex unknowns are computed using an interpolation method. In a second step, the scheme is made monotone by combining the computed fluxes. This recovers monotonicity, at the price of making the scheme nonlinear. The scheme is inserted into a radiation hydrodynamics solver and assessed on radiation shock solutions on deformed meshes.
Proceedings of the 2024 International Meshing Roundtable (IMR), 2024
Abstract
Computational analysis with the finite element method requires geometrically accurate meshes. It is well known that high-order meshes can accurately capture curved surfaces with fewer degrees of freedom in comparison to low-order meshes. Existing techniques for high-order mesh generation typically output meshes with the same polynomial order for all elements. However, high-order elements away from curvilinear boundaries or interfaces increase the computational cost of the simulation without increasing geometric accuracy. In prior work [5, 21], we presented an approach for generating body-fitted uniform-order meshes that takes a given mesh and morphs it to align with the surface of interest, prescribed as the zero isocontour of a level-set function. We extend this method to generate mixed-order meshes such that curved surfaces of the domain are discretized with high-order elements, while low-order elements are used elsewhere. Numerical experiments demonstrate the robustness of the approach and show that it can be used to generate mixed-order meshes that are much more efficient than uniformly high-order meshes. The proposed approach is purely algebraic and extends to different types of elements (quadrilaterals/triangles/tetrahedra/hexahedra) in two and three dimensions.
Collection PROfil, EDP Sciences, 248 p., March 2024
Journal of Computational Physics, Volume 518, 2024, 113325, ISSN 0021-9991, 2024
Abstract
Monotonicity is very important in most applications solving elliptic problems. Many positivity-preserving schemes have been proposed, but they are at most second-order convergent; conversely, high-order schemes in general do not preserve positivity. In the present paper, we propose an arbitrary-order monotonic method for elliptic problems in 2D. We show how to adapt our method to the case of a discontinuous and/or tensor-valued diffusion coefficient, while keeping the order of convergence. We assess the new scheme on several test problems.
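For context, the continuous problem behind such schemes can be written (in generic notation of ours; the paper's exact setting may differ) as:

```latex
% Model elliptic (diffusion) problem; K may be discontinuous and/or
% tensor-valued, as in the paper.
-\,\nabla \cdot \bigl( K(x)\, \nabla u \bigr) = f \quad \text{in } \Omega \subset \mathbb{R}^2,
\qquad u = g \quad \text{on } \partial \Omega .
```

Monotonicity here means that $f \ge 0$ and $g \ge 0$ imply a nonnegative discrete solution. For linear schemes this is classically guaranteed when the system matrix is an M-matrix, which is precisely what makes combining monotonicity with high order difficult.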
Doctoral Thesis, Université Paris-Saclay, 2024
Abstract
This thesis deals with the representation and generation of block-structured hexahedral meshes. To date, no method can generate satisfactory block structures for arbitrary geometric domains. In practice, expert engineers generate these meshes with interactive software, which can require several weeks of work. Moreover, adding modification operations to such interactive software is a delicate task, as the consistency of the block structure and its relation to the geometric domain to be discretized must be maintained. To improve this process, we first propose to define manipulation operations on hexahedral meshes based on the generalized map model. Then, considering block structures obtained with the polycube method, we provide methods that optimize the topology of these structures to satisfy geometric constraints. We thus propose a first method in dimension 2, which takes a local approach to the problem, building on the experience of engineers who use interactive software. We then propose a second method, this time using the ant colony optimization metaheuristic for sheet selection in dimension 3.
2023
IEEE International Conference on Quantum Computing and Engineering, 2023
Abstract
Quantum computers exploit the particular behavior of quantum physical systems to solve some problems in a different way than classical computers. We are now approaching the point where quantum computing could provide real advantages over classical methods. The computational capabilities of quantum systems will soon be available in future supercomputer architectures as hardware accelerators called Quantum Processing Units (QPU). From optimizing compilers to task scheduling, the High-Performance Computing (HPC) software stack could benefit from the advantages of quantum computing. We look here at the problem of register allocation, a crucial part of modern optimizing compilers. We propose a simple proof-of-concept hybrid quantum algorithm based on QAOA to solve this problem. We implement the algorithm and integrate it directly into GCC, a well-known modern compiler. The performance of the algorithm is evaluated against the simple Chaitin-Briggs heuristic as well as GCC's register allocator. While our proposed algorithm lags behind GCC's modern heuristics, it is a good first step in the design of useful quantum algorithms for the classical HPC software stack.
Communications in Computational Physics, 2023
Abstract
The DDFV (Discrete Duality Finite Volume) method is a finite volume scheme mainly dedicated to diffusion problems, with some outstanding properties. This scheme has been found to be one of the most accurate finite volume methods for diffusion problems. In the present paper, we propose a new monotonic extension of DDFV, which can handle a discontinuous tensorial diffusion coefficient. Moreover, we compare its performance to a diamond-type method with an original interpolation method relying on polynomial reconstructions. Monotonicity is achieved by adapting the method of Gao et al. [A finite volume element scheme with a monotonicity correction for anisotropic diffusion problems on general quadrilateral meshes] to our schemes. Such a technique does not require the positiveness of the secondary unknowns. We show that the two new methods are second-order accurate and are indeed monotonic on some challenging benchmarks, such as a Fokker-Planck problem.
Kinetic and Related Models, 2023
Journal of Computational Physics, p. 111721, 2023
Computational & Applied Mathematics, vol 42, 2023
Abstract
When solving an elliptic problem numerically, it is important in most applications that the scheme preserves the positivity of the solution. For finite volume schemes on deformed meshes, the question has been solved rather recently. Such schemes are usually (at most) second-order convergent, and non-linear. On the other hand, many high-order schemes have been proposed that do not ensure positivity of the solution. In this paper, we propose a very high-order monotonic (that is, positivity-preserving) numerical method for elliptic problems in 1D. We prove that this method converges to an arbitrary order (under reasonable assumptions on the mesh) and is indeed monotonic. We also show how to handle discontinuous sources or diffusion coefficients, while keeping the order of convergence. We assess the new scheme on several test problems with arbitrary (regular, distorted, and random) meshes.
Abstract
Quad meshing has been a well-studied domain for many years. Although the problem can generally be considered solved, many approaches do not provide adequate inputs for Computational Fluid Dynamics (CFD) and, in our case, hypersonic flow simulations. Such simulations require very strong control of cell size and direction. To our knowledge, engineers do this manually with the help of interactive software. In this work, we propose an automatic algorithm to generate full quadrilateral block-structured meshes for the purpose of hypersonic flow simulation. Using this approach, we can handle simulation inputs like the angle of attack and the boundary layer definition. We present 2D results of computations on a hypersonic vehicle using the meshes generated by our method.
SIAM CSE 2023 - SIAM Conference on Computational Science and Engineering, 2023
Abstract
Heterogeneous supercomputers with GPUs are among the best candidates for building Exascale machines. However, porting scientific applications with millions of lines of code is challenging. Data transfers/locality and exposing enough parallelism determine the maximum achievable performance on such systems. Porting efforts thus force developers to rewrite parts of the application, which is tedious and time-consuming and does not guarantee performance in all cases. Being able to detect which parts can be expected to deliver performance gains on GPUs is therefore a major asset for developers. Moreover, the task-parallel programming model is a promising alternative for exposing enough parallelism while allowing asynchronous execution between CPU and GPU. OpenMP 4.5 introduces the "target" directive to offload computation to the GPU in a portable way. Target constructs are treated as explicit OpenMP tasks in the same way as on the CPU, but executed on the GPU. In this work, we propose a methodology to detect the most profitable loops of an application that can be ported to the GPU. While we applied the detection part to several mini-applications (LULESH, miniFE, XSBench and Quicksilver), we experimented with the full methodology on LULESH through the MPI+OpenMP task programming model with target directives. It relies on runtime modifications to enable overlapping of data transfers and kernel execution through tasks. This work has been integrated into the MPC framework and validated on a distributed heterogeneous system.
52nd International Conference on Parallel Processing (ICPP 2023), 2023
Abstract
The architecture of supercomputers is evolving to expose massive parallelism. MPI and OpenMP are widely used in application codes on the largest supercomputers in the world. The community primarily focused on composing MPI with OpenMP before its version 3.0 introduced task-based programming. Recent advances in the OpenMP task model and its interoperability with MPI enabled fine model composition and seamless support for asynchrony. Yet, OpenMP tasking overheads limit the gain of task-based applications over their historical loop parallelization (parallel for construct). This paper identifies the OpenMP task dependency graph discovery speed as a limiting factor in the performance of task-based applications. We study its impact on intra- and inter-node performance over two benchmarks (Cholesky, HPCG) and a proxy application (LULESH). We evaluate the performance impacts of several discovery optimizations, and introduce a persistent task dependency graph reducing overheads by a factor of up to 15 at run time. We measure 2x speedup over parallel for versions weak-scaled to 16K cores, due to improved cache memory use and communication overlap, enabled by task refinement and depth-first scheduling.
IWOMP 23 - International Workshop on OpenMP, 2023
Abstract
Many-core and heterogeneous architectures now require programmers to compose multiple asynchronous programming models to fully exploit hardware capabilities. As a shared-memory parallel programming model, OpenMP has the responsibility of orchestrating the suspension and progression of asynchronous operations occurring on a compute node, such as MPI communications or CUDA/HIP streams. Yet, the specification only provides the task detach(event) API to suspend tasks until an asynchronous operation completes, which presents a few drawbacks. In this paper, we introduce the design and implementation of an extension of the taskwait construct that suspends a task until an asynchronous event completes. It aims to reduce the runtime costs induced by the current solution and to provide a standard API for automating portable task suspension. The results show half the overhead of the existing task detach clause.
Abstract
High-Performance Computing (HPC) is currently facing significant challenges. The hardware pressure has become increasingly difficult to manage due to the lack of parallel abstractions in applications. As a result, parallel programs must undergo drastic evolution to effectively exploit underlying hardware parallelism. Failure to do so results in inefficient code. In this pressing environment, parallel runtimes play a critical role, and their testing becomes crucial. This paper focuses on the MPI interface and leverages the MPI binding tools to develop a multi-language test suite for MPI. By doing so, and building on previous work from the Forum's document editors, we implement systematic testing of MPI symbols in the context of the Parallel Computing Validation System (PCVS), an HPC validation platform dedicated to running and managing test suites at scale. We first describe PCVS, then outline the process of generating the MPI API test suite, and finally run these tests at scale. All data sets, code generators, and implementations are made available to the community as open source. We also set up a dedicated website showcasing the results, which self-updates thanks to the Spack package manager.
ISC High Performance 2023: High Performance Computing pp 28–41, 2023
Abstract
The field of High-Performance Computing is rapidly evolving, driven by the race for computing power and the emergence of new architectures. Despite these changes, the process of launching programs has remained largely unchanged, even with the rise of hybridization and accelerators. However, there is a need to express more complex deployments for parallel applications to enable more efficient use of these machines. In this paper, we propose a transparent way to express malleability within MPI applications. This process relies on MPI process virtualization, facilitated by a dedicated privatizing compiler and a user-level scheduler. With this framework, using the MPC thread-based MPI context, we demonstrate how code can mold its resources without any software changes, opening the door to transparent MPI malleability. After detailing the implementation and associated interface, we present performance results on representative applications.
Abstract
MPI is the most widely used interface for high-performance computing (HPC) workloads. Its success lies in its embrace of libraries and ability to evolve while maintaining backward compatibility for older codes, enabling them to run on new architectures for many years. In this paper, we propose a new level of MPI compatibility: a standard Application Binary Interface (ABI). We review the history of MPI implementation ABIs, identify the constraints from the MPI standard and ISO C, and summarize recent efforts to develop a standard ABI for MPI. We provide the current proposal from the MPI Forum’s ABI working group, which has been prototyped both within MPICH and as an independent abstraction layer called Mukautuva. We also list several use cases that would benefit from the definition of an ABI while outlining the remaining constraints.
Abstract
The coupling through both drag force and volume fraction (of gas) of a kinetic equation of Vlasov type and a system of Euler or Navier–Stokes type (in which the volume fraction explicitly appears) leads to the so-called thick sprays equations. These equations are used to describe sprays (droplets or dust specks in a surrounding gas) in which the volume fraction of the disperse phase is non-negligible. As for other multiphase flow systems, the issues related to linear stability around homogeneous solutions are important for applications. We show in this paper that this stability indeed holds for the thick sprays equations, under physically reasonable assumptions. The analysis makes use of Lyapunov functionals for the linearized equations.
ACM Transactions on Mathematical Software, Volume 48, Issue 4, 2023
Abstract
Floating-point numbers represent only a subset of real numbers. As such, floating-point arithmetic introduces approximations that can compound and have a significant impact on numerical simulations. We introduce encapsulated error, a new way to estimate the numerical error of an application, and provide a reference implementation, the Shaman library. Our method uses dedicated arithmetic over a type that encapsulates both the result the user would have had with the original computation and an approximation of its numerical error. We can thus measure the number of significant digits of any result or intermediate result in a simulation. We show that this approach, although simple, gives results competitive with state-of-the-art methods. It has a smaller overhead, and it is compatible with parallelism, making it suitable for the study of large-scale applications.
2023
Abstract
CNES is currently carrying out a Phase A study to assess the feasibility of a future hyperspectral imaging sensor (10 m spatial resolution) combined with a panchromatic camera (2.5 m spatial resolution). This mission focuses on both high spatial and spectral resolution requirements, as inherited from previous French studies such as HYPEX, HYPXIM, and BIODIVERSITY. To meet user requirements, cost, and instrument compactness constraints, CNES asked the French hyperspectral Mission Advisory Group (MAG), representing a broad French scientific community, to provide recommendations on spectral sampling, particularly in the Short Wave InfraRed (SWIR), for various applications. This paper presents the tests carried out with the aim of defining the optimal spectral sampling and spectral resolution in the SWIR domain for quantitative estimation of physical variables and classification purposes. The targeted applications are geosciences (mineralogy, soil moisture content), forestry (tree species classification, leaf functional traits), coastal and inland waters (bathymetry, water column, bottom classification in shallow water, coastal habitat classification), urban areas (land cover), industrial plumes (aerosols, methane and carbon dioxide), cryosphere (specific surface area, equivalent black carbon concentration), and atmosphere (water vapor, carbon dioxide and aerosols). All the products simulated in this exercise used the same CNES end-to-end processing chain, with realistic instrument parameters, enabling easy comparison between applications. 648 simulations were carried out with different spectral strategies, radiometric calibration performances and signal-to-noise ratios (SNR): 24 instrument configurations × 25 datasets (22 images + 3 spectral libraries). The results show that a 16/20 nm spectral sampling in the SWIR domain is sufficient for most applications. However, 10 nm spectral sampling is recommended for applications based on specific absorption bands such as mineralogy, industrial plumes or atmospheric gases. In addition, a slight performance loss is generally observed when radiometric calibration accuracy decreases, with a few exceptions in bathymetry and the cryosphere, for which the observed performance is severely degraded. Finally, most applications can be achieved with the lowest SNR, with the exception of bathymetry, shallow water classification, and carbon dioxide and methane estimation, which require the highest SNR level tested. On the basis of these results, CNES is currently evaluating the best compromise for designing the future hyperspectral sensor to meet the objectives of priority applications.
Abstract
The exploitation of urban-material spectral properties is of increasing importance for a broad range of applications, such as urban climate-change modeling and mitigation or specific/dangerous roof-material detection and inventory. A new spectral library dedicated to the detection of roof material was created to reflect the regional diversity of materials employed in Wallonia, Belgium. The Walloon Roof Material (WaRM) spectral library accounts for 26 roof material spectra in the spectral range 350–2500 nm. Spectra were acquired using an ASD FieldSpec3 Hi-Res spectrometer in laboratory conditions, using a spectral sampling interval of 1 nm. The analysis of the spectra shows that spectral signatures are strongly influenced by the color of the roof materials, at least in the VIS spectral range. The SWIR spectral range is in general more relevant to distinguishing the different types of material. Exceptions are the similar properties and very close spectra of several black materials, meaning that their spectral signatures are not sufficiently different to distinguish them from each other. Although building materials can vary regionally due to different available construction materials, the WaRM spectral library can certainly be used for wider applications; Wallonia has always been strongly connected to the surrounding regions and has always encountered climatic conditions similar to all of Northwest Europe.
International Meshing Roundtable, 2023
Abstract
Quad meshing has been a well-studied domain for many years. While the problem can globally be considered solved, many approaches do not provide suitable inputs for Computational Fluid Dynamics (CFD), and in our case for supersonic flow simulations. Such simulations require very strong control of cell size and direction. To our knowledge, engineers ensure this control manually using interactive software. In this work, we propose an automatic algorithm to generate full quadrilateral block-structured meshes for the purpose of supersonic flow simulation. We handle simulation inputs like the angle of attack and the boundary layer definition. Our approach generates adequate 2D meshes and is designed to be extensible to 3D.
Doctoral Thesis, Université Paris-Saclay, 2023
Abstract
This study falls within the field of performance optimization for large-scale distributed mesh-based numerical simulations. In this field, we are interested in achieving a good load balance between the computing units on which the simulation runs. Balancing the load of a mesh-based simulation generally requires taking into account the amount of computation needed for each cell, as well as the amount of data that must be transferred between computing units. The tools commonly used to solve this problem do so in a way that is not necessarily optimal for a given simulation, because they target many use cases beyond load balancing and mesh partitioning. Our study consists in designing and implementing a new partitioning tool dedicated to meshes and load balancing. After a thorough explanation of the context of the study, of partitioning problems, and of the state of the art of partitioning algorithms, we show the benefit of chaining algorithms to optimize a mesh partition in different ways. We then extend this chaining method on two points: first, by extending the number-partitioning algorithm VNBest to load balancing on heterogeneous computing units, and second, by specializing the geometric partitioning algorithm RCB to improve its performance on Cartesian meshes. We describe in detail the design process of our partitioning tool, which works exclusively in shared memory. We show that our tool can obtain partitions with better load balance than two existing shared-memory partitioning tools, Scotch and Metis. However, we do not minimize data transfers between computing units as well as they do. We present the performance characteristics of the implemented algorithms in a multithreaded setting.
SIAM International Meshing Roundtable, 2023
Abstract
Nowadays, for real study cases, the generation of full block-structured hexahedral meshes is mainly an interactive and very time-consuming process carried out by highly qualified engineers. To this end, they use interactive software in which they handle and modify complex block structures with operations like block removal, block insertion, O-grid insertion, propagation of block splitting, propagation of meshing parameters along layers of blocks, and so on. Such operations are error-prone, and modifying or adding an operation is very tedious work. In this work, we propose to formally define hexahedral block structures and the main associated operations in the model of n-dimensional generalized maps. This model provides topological invariants and a systematic handling of geometric data that allow us to ensure the expected robustness.
SLE '23: 16th ACM SIGPLAN International Conference on Software Language Engineering, 2023
Abstract
Software languages have pros and cons, and are usually chosen accordingly. In this context, it is common to involve different languages in the development of complex systems, each one specifically tailored for a given concern. However, these languages create de facto silos, and offer little support for interoperability with other languages, be it statically or at runtime. In this paper, we report on our experiment in extracting a relevant behavioral interface from an existing language and using it to enable interoperability at runtime. In particular, we present a systematic approach to define the behavioral interface, and we discuss the expertise required to define it. We illustrate our work on the case study of SciHook, a C++ library enabling the runtime instrumentation of scientific software in Python. We present how the proposed approach, combined with SciHook, enables interoperability between Python and a domain-specific language dedicated to numerical analysis, namely NabLab, and discuss the runtime overhead.
Doctoral Thesis, Université Paris Cité, 2023
Abstract
The objective of this thesis is the development and analysis of robust and accurate finite volume schemes for approximating the solution of the diffusion equation on deformed meshes, with a diffusion coefficient that can be anisotropic and/or discontinuous. To satisfy these properties, our schemes must preserve positivity and achieve high-order accuracy. In this manuscript, we propose the first positivity-preserving arbitrary-order scheme for diffusion. Our approach is first to study the problem in 1D. In that case, the positivity problem only appears at order 3 and higher. The 1D setting allows us to perform the mathematical analysis of this problem, including a proof of convergence of the scheme to an arbitrary order under a stability assumption. We then extend it to 2D at order 2, relying on well-known schemes. We study two possibilities: a DDFV-type scheme (Discrete Duality Finite Volume), which we compare with a method using polynomial reconstruction. Finally, this allows us to develop a monotonic scheme of arbitrary order on any mesh, with a diffusion coefficient κ that can be discontinuous and/or anisotropic. Improving the order is achieved through polynomial reconstruction, and monotonicity is obtained by reducing to an M-matrix structure, which yields nonlinear schemes. Each scheme is validated by numerical simulations showing the order of convergence and the positivity of the solution obtained.
2022
Journal of Computational Physics, p. 110859, 2022
Euro-Par 2022: Parallel Processing - 28th International Conference on Parallel and Distributed Computing, Glasgow, UK, August 22-26, 2022, Proceedings, Springer, p. 85-99, 2022
Proceedings of SBAC-PAD 2022, IEEE, 2022
Abstract
HPC systems have experienced significant growth over the past years, with modern machines having hundreds of thousands of nodes. The Message Passing Interface (MPI) is the de facto standard for distributed computing on these architectures. On the MPI critical path, the message-matching process is one of the most time-consuming operations. In this process, searching for a specific request in a message queue represents a significant part of the communication latency. So far, no miracle algorithm performs well in all cases. This paper explores potential matching specializations thanks to the hints introduced in the latest MPI 4.0 standard. We propose a hash-table-based algorithm that performs constant-time message matching for requests without wildcards. This approach is suitable for the intensive point-to-point communication phases found in many applications (more than 50% of the CORAL benchmarks). We demonstrate that our approach can improve the overall execution time of real HPC applications by up to 25%. We also analyze the limitations of our method and propose a strategy for identifying the most suitable algorithm for a given application, applying machine learning techniques to classify applications depending on their message pattern characteristics.
Concurr. Comput. Pract. Exp., 2022
Abstract
By allowing computation/communication overlap, MPI nonblocking collectives (NBC) are supposed to improve application scalability and performance. However, it is known that to actually get overlap, the MPI library has to implement progression mechanisms in software or rely on the network hardware. These mechanisms may be present or not, adequate or perfectible; they may have an impact on communication performance or may interfere with computation by stealing CPU cycles. From a user's point of view, assessing and understanding the behavior of an MPI library with respect to computation/communication overlap is difficult. In this article, we propose a methodology to assess the computation/communication overlap of NBC. We propose new metrics to measure how much communication and computation overlap, and to evaluate how they interfere with each other. We integrate these metrics into a complete methodology. We compare our methodology with state-of-the-art metrics and benchmarks, and show that ours provides more meaningful information. We perform experiments on a large panel of MPI implementations and network hardware and show when and why overlap is efficient, nonexistent, or even degrades performance.
Parallel Computing, p. 102860, 2022
Abstract
The ablation of a vehicle during atmospheric reentry leads to a degradation of its surface condition. The ablated wall interacts with the boundary layer that develops around the object. The deformation can be seen as a ripple or roughness pattern with different characteristic amplitudes and wavelengths. The effect of this defect on the flow is taken into account either through models or by direct simulation, applying the strains to the mesh. Mesh adaptation techniques can be used to account for wall deformations during a simulation. The principle is to start from an initially smooth mesh, apply a strain law, then use regularization and refinement methods. The meshes are adapted for use in a parallel CFD Navier-Stokes code. Refinement of the mesh close to the wall is required to correctly capture the boundary layer [2], but also to accurately represent the geometry of the wall deformation. For the numerical methods used, an orthogonality constraint is added to the mesh impinging on the wall. The developments are, for the moment, carried out in an independent external tool. The regularization methods are compared on simulation results with different meshes. The method can easily be coupled with a CFD code and can be extended to 3D geometries.
22nd IEEE International Symposium on Cluster, Cloud and Internet Computing, CCGrid 2022, Taormina, Italy, May 16-19, 2022, IEEE, p. 736-746, 2022
abstract
Abstract
Overlapping communications with computation is an efficient way to amortize the cost of communications in an HPC application. To do so, it is possible to use MPI nonblocking primitives so that communications run in the background alongside computation. However, these mechanisms rely on communications actually making progress in the background, which may not be true for all MPI libraries. Some MPI libraries dedicate a core to communications to ensure progression. However, taking a core away from the application for this purpose may have a negative impact on the overall execution time, and it may be difficult to know when such a dedicated core is actually helpful. In this paper, we propose a model for the performance of applications using MPI nonblocking primitives running on top of an MPI library with a dedicated communication core. This model is used to understand the compromise between the computation slowdown due to the communication core being unavailable for computation and the communication speed-up thanks to the dedicated core; to evaluate whether nonblocking communication actually obtains the expected performance in the context of a given application; and to predict the performance of a given application if run with a dedicated core. We describe the performance model, evaluate it on different applications, and compare the model's predictions with actual executions.
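The trade-off this abstract describes can be made concrete with a toy first-order model (an assumption for illustration, not the paper's model): dedicating one of `cores` to progression shrinks the compute pool but lets communication overlap fully, while keeping all cores for compute means the communication cost is paid serially after the compute phase.

```python
def toy_runtime(work, comm, cores, dedicated):
    """First-order runtime sketch (illustrative, not the paper's model).

    work      -- total compute time on one core
    comm      -- communication time
    cores     -- number of cores on the node
    dedicated -- True if one core is reserved for communication progression
    """
    if dedicated:
        # compute on cores-1 cores, communication fully overlapped
        return max(work / (cores - 1), comm)
    # compute on all cores, then pay communication serially (no progression)
    return work / cores + comm
```

Even this crude model reproduces the paper's qualitative conclusion: with many cores the dedicated core pays off, while on small nodes the lost compute capacity dominates.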
EuroMPI/USA'22: 29th European MPI Users' Group Meeting, Chattanooga, TN, USA, September 26 - 28, 2022, ACM, p. 27-36, 2022
Abstract
Polycube-maps are used as base-complexes in various fields of computational geometry, including the generation of regular all-hexahedral meshes free of internal singularities. However, the strict alignment constraints behind polycube-based methods make their computation challenging for CAD models used in numerical simulation via finite element method (FEM). We propose a novel approach based on an evolutionary algorithm to robustly compute polycube-maps in this context. We address the labelling problem, which aims to precompute polycube alignment by assigning one of the base axes to each boundary face on the input. Previous research has described ways to initialize and improve a labelling via greedy local fixes. However, such algorithms lack robustness and often converge to inaccurate solutions for complex geometries. Our proposed framework alleviates this issue by embedding labelling operations in an evolutionary heuristic, defining fitness, crossover, and mutations in the context of labelling optimization. We evaluate our method on a thousand smooth and CAD meshes, showing Evocube converges to accurate labellings on a wide range of shapes. The limitations of our method are also discussed thoroughly.
IWOMP 2022 - 18th International Workshop on OpenMP, p. 1-14, 2022-09
abstract
Abstract
Heterogeneous supercomputers are widespread among HPC systems, and programming efficient applications on these architectures is a challenge. Task-based programming models are a promising way to tackle this challenge. Since OpenMP 4.0 and 4.5, the target directives enable offloading pieces of code to GPUs and expressing them as tasks with dependencies. Heterogeneous machines can therefore be programmed using MPI+OpenMP(task+target) to exhibit a very high level of concurrent asynchronous operations, for which data transfers, kernel executions, communications, and CPU computations can be overlapped. Hence, it is possible to suspend tasks performing these asynchronous operations on the CPUs and to overlap their completion with another task's execution. Suspended tasks can resume once the associated asynchronous event completes, opportunistically at every scheduling point. We have integrated this feature into the MPC framework, validated it on an AXPY microbenchmark, and evaluated it on an MPI+OpenMP(tasks) implementation of the LULESH proxy application. The results show that we are able to improve asynchronism and overall HPC performance, allowing applications to benefit from asynchronous execution on heterogeneous machines.
Euro-Par 2022: Parallel Processing - 28th International Conference on Parallel and Distributed Computing, Glasgow, UK, August 22-26, 2022, Proceedings, Springer, p. 136–151, 2022
ESAIM M2AN Volume 57, Number 2, March-April 2023, 2022
Journal of Computational Physics, Volume 478, 1 April 2023, 2022
Physical Review E, APS, p. 27269, 2022
Communications in Computational Physics, Global Science Press, p. 398-448, 2022
ECCOMAS Congress 2022 - 8th European Congress on Computational Methods in Applied Sciences and Engineering, 2022-11
IEEE Transactions on Emerging Topics in Computing, p. 1-10, 2022-07
Mesh Generation and Adaptation: Cutting-Edge Techniques, Springer International Publishing, p. 69-94, 2022
abstract
Abstract
In this chapter, we deal with the problem of mesh conversion for coupling Lagrangian and Eulerian simulation codes. More specifically, we focus on hexahedral meshes, which are known to be particularly difficult to generate and handle. Starting from an Eulerian hexahedral mesh, i.e. a hexahedral mesh where each cell may contain several materials, we provide a fully automatic process that generates a Lagrangian hexahedral mesh, i.e. a hexahedral mesh where each cell contains a single material. This process is simulation-driven in the sense that we guarantee that the generated mesh can be used by a simulation code (minimal quality for individual cells), and we try to preserve the volume and location of each material as best as possible. In other words, the obtained Lagrangian mesh fits the input Eulerian mesh with high fidelity. To achieve this, we interleave several advanced meshing treatments (mesh smoothing, mesh refinement, sheet insertion, discrete material reconstruction, discrepancy computation) in a fully integrated pipeline. Our solution is evaluated on 2D and 3D examples representative of CFD (Computational Fluid Dynamics) simulation.
Habilitation à Diriger des Recherches in Applied Mathematics, Sorbonne Université, 2021. ⟨tel-03572029⟩, 2022
PhD Thesis, Université Paris-Saclay, 2022
abstract
Abstract
Numerical simulation codes based on finite element and finite volume methods require discretizing the studied domain (for example a mechanical part such as an engine, an aircraft wing, a turbine, etc.) using a mesh. In dimension 3, a mesh is a set of simple volume elements, most often tetrahedra or hexahedra, that partition the domain of study. The choice of tetrahedra or hexahedra is mainly dictated by the application (fluid-structure interaction, hydrodynamics, etc.). While the automatic generation of tetrahedral meshes is a relatively well-mastered process today, generating hexahedral meshes remains an open problem. This is problematic for applications that strictly require hexahedral meshes, since their generation is semi-automatic and can take several weeks to several months of engineering time! While the time devoted to the numerical simulation itself tends to decrease thanks to the power of the machines used, the bottleneck now lies in data preparation, namely obtaining a CAD model suited to computation and then generating a mesh from it. This thesis takes place in this context, following a hybrid approach combining: 1. The development of (semi-)automatic algorithms to generate and modify block-structured hexahedral meshes; 2. The implementation of an interactive graphical tool dedicated to the manipulation of block structures.
The interaction mechanisms will also be used to guide the algorithms in their decision making, either at initialization (criteria attached to particular CAD entities) or during execution (choosing between several options on which the algorithm cannot decide alone). The goal of this thesis is therefore not to provide a universal automatic solution, which currently seems out of reach, but rather to reduce the engineering time devoted to mesh generation by providing better-suited tools. With this in mind, we place this study in the continuation of [LED10, KOW12, GAO15, GAO17], which consider the problem of simplifying and enriching hexahedral meshes by inserting and removing layers of cells. In all of these works, the proposed algorithms are simple greedy algorithms in which the mesh is modified step by step to converge toward a final solution Ef: at each step Ei, the assumption is made that the "best" solution Ef will be obtained by making the "optimal" choice for Ei. In operations research, however, such an approach is known to be suboptimal as soon as the optimization problem at hand is nonlinear. The idea is therefore to use standard operations-research approaches, and more specifically multi-agent systems, coupled with interactive tools, to enable the generation of block structures on complex CAD models.
PhD Thesis, Université de Bordeaux, 2022
abstract
Abstract
Nowadays, MPI is the de facto standard for distributed-memory programming on supercomputers. Nonblocking communications are one of the models proposed by the MPI standard. These operations can be used to overlap communications with computation (or with other communications) in order to amortize their cost. However, to be used efficiently, these operations require asynchronous progression, which can regularly consume a non-negligible amount of compute resources (especially for nonblocking collectives). Moreover, sharing compute resources with the application can cause a global slowdown. The mechanisms used for this asynchronous progression struggle to reconcile good overlap with a minimal impact on the application, which limits their adoption. To address these problems, we proceeded in several steps. First, we propose an in-depth study of asynchronous progression in MPI implementations, using new metrics focused on evaluating progression mechanisms and their impact on the overall system. After exposing the weaknesses of these MPI implementations, we propose a new solution for the progression of nonblocking collectives, using dedicated cores combined with event-based collective algorithms. We measured the efficiency of this solution using our metrics, comparing against the MPI implementations studied in the first step. Finally, we developed a model to predict the potential gain and the overhead induced by using nonblocking operations with dedicated cores. This model can be used to assess whether transforming an application based on blocking operations into one using nonblocking operations is worthwhile to benefit from overlap. We evaluate this model on several benchmarks.
Computing in Science & Engineering ( Volume: 24, Issue: 4, 01 July-Aug. 2022), 2022
abstract
Abstract
Scientific codes are complex software systems. Their engineering involves various stakeholders using various computer languages for defining artifacts at different abstraction levels and for different purposes. In this article, we review the overall processes leading to the development of scientific software, and discuss the role of computer languages in the definition of the different artifacts. We provide guidelines to make informed decisions when the time comes to choose a computer language to develop scientific software.
EuroVis 2019 - 21th EG/VGTC Conference on Visualization, 2022
abstract
Abstract
With the constant increase in compute power of supercomputers, high-performance computing simulations produce higher-fidelity results and potentially massive amounts of data. To keep visualization of such results interactive, existing techniques such as Adaptive Mesh Refinement (AMR) can be of use. In particular, Tree-Based AMR methods (TB-AMR) are widespread in simulations and are becoming more present in general-purpose visualization pipelines such as VTK. In this work, we show how TB-AMR data structures can lead to more efficient exploration of massive data sets in the Exascale era. We discuss how algorithms (filters) should be designed to take advantage of tree-like data structures for both data filtering and rendering. By introducing controlled hierarchical data reduction, we greatly reduce the processing time of existing algorithms, sometimes with no visual impact, and drastically decrease exploration time for analysts. Thanks to the techniques and implementations we propose, visualization of very large data is also made possible on very constrained resources. These ideas are illustrated on million- to billion-scale native TB-AMR or resampled meshes, with the HyperTreeGrid object and associated filters that we have recently optimized and made available in the Visualization Toolkit (VTK) for use by the scientific community.
2021
Tools for High Performance Computing 2018 / 2019, Springer International Publishing, p. 151-168, 2021
abstract
Abstract
The backtrace is one of the most common operations performed by profiling and debugging tools. It consists of determining the nesting of functions leading to the current execution state. Frameworks and standard libraries provide facilities enabling this operation; however, it generally incurs both computational and memory costs. Indeed, walking the stack up and then possibly resolving function pointers (to function names) before storing them can lead to non-negligible costs. In this paper, we explore a means of extracting optimized backtraces with O(1) storage size by defining the notion of stack tags. We define a new data structure, which we call a hashed trie, used to encode stack traces at runtime through chained hashing. Our process, called stack-tagging, is implemented in a GCC plugin, enabling its use with C and C++ applications. A library enabling the decoding of stack locators through both static and brute-force analysis is also presented. This work introduces a new manner of capturing execution state which greatly simplifies both extraction and storage, two important issues in parallel profiling.
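The chained-hashing idea can be sketched as follows: each frame's tag is a fixed-size hash of its parent's tag and a frame identifier, and a side table (the "hashed trie") records the (parent, frame) pair so a tag can later be decoded back into a full call path. This is an illustrative Python sketch only; the paper's mechanism is a GCC plugin operating on C/C++, and `StackTagger` and its methods are hypothetical names.

```python
import hashlib

def chain(parent_tag: int, frame_id: str) -> int:
    """Derive a fixed-size tag for the current frame from the parent's tag."""
    h = hashlib.blake2b(digest_size=8)
    h.update(parent_tag.to_bytes(8, "little"))
    h.update(frame_id.encode())
    return int.from_bytes(h.digest(), "little")

class StackTagger:
    """Maintains an O(1)-size stack locator (a single 64-bit tag).

    The trie maps each tag back to (parent_tag, frame_id), enabling
    offline decoding of any recorded locator.
    """
    ROOT = 0

    def __init__(self):
        self.trie = {}       # tag -> (parent_tag, frame_id)
        self.tag = self.ROOT

    def enter(self, frame_id):
        child = chain(self.tag, frame_id)
        self.trie[child] = (self.tag, frame_id)
        self.tag = child

    def leave(self):
        self.tag = self.trie[self.tag][0]

    def decode(self, tag):
        """Walk the trie back to the root to recover the call path."""
        frames = []
        while tag != self.ROOT:
            tag, frame_id = self.trie[tag]
            frames.append(frame_id)
        return list(reversed(frames))
```

Recording a backtrace thus costs one hash per call, and storing it costs a single integer, regardless of stack depth.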
Abstract
Heterogeneous supercomputers are now considered the most viable path to Exascale, and compute nodes frequently comprise more than one GPU accelerator. Programming such architectures efficiently is challenging. MPI is the de facto standard for distributed computing. CUDA-aware libraries were introduced to ease GPU inter-node communications, but they induce overhead that can degrade overall performance. The MPI 4.0 specification draft introduces the MPI Sessions model, which offers the ability to initialize specific resources for a specific component of the application. In this paper, we present a way to reduce the overhead induced by CUDA-aware libraries with a solution inspired by MPI Sessions. In this way, we minimize the overhead induced by GPUs in an MPI context and improve the efficiency of CPU+GPU programs. We evaluate our approach on various micro-benchmarks and proxy applications such as LULESH, MiniFE, Quicksilver, and CloverLeaf. We demonstrate how this approach can provide up to a 7x speedup compared to the standard MPI model.
Workshop on Exascale MPI, ExaMPI@SC 2021, St. Louis, MO, USA, November 14, 2021, IEEE, p. 9-17, 2021
IWOMP 2021 - 17th International Workshop on OpenMP, p. 1-15, 2021-09
2021 IEEE 28th Symposium on Computer Arithmetic (ARITH), p. 9-16, 2021-06
International Journal of Parallel Programming, Springer, p. 81-103, 2021
abstract
Abstract
Many physics modeling applications use regular meshes on which computations of highly variable cost over time can occur. Distributing the underlying cells over manycore architectures is a critical load-balancing step that should be performed as infrequently as possible. Graph partitioning tools are known to be very effective for such problems, but they exhibit scalability problems as the number of cores and the number of cells increase. We introduce a dynamic task scheduling and mesh partitioning approach inspired by physical particle interactions. Our method virtually moves cores over a 2D/3D mesh of tasks and uses a Voronoi domain decomposition to balance workload. Displacements of the cores result from force computations using a carefully chosen pair potential. We evaluate our method against graph partitioning tools and existing task schedulers with a representative physical application, and demonstrate the relevance of our approach.
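The idea of sites (cores) moving over a mesh of tasks under a Voronoi decomposition can be illustrated with a simpler, related scheme: a load-weighted Lloyd-style relaxation, where each site moves to the weighted centroid of its Voronoi region. This is an assumption for illustration; the paper drives site displacements with pair-potential forces, not with this centroid rule.

```python
def voronoi_assign(cells, sites):
    """Assign each (x, y, load) cell to the index of its nearest site."""
    return [min(range(len(sites)),
                key=lambda i: (sites[i][0] - x) ** 2 + (sites[i][1] - y) ** 2)
            for x, y, _ in cells]

def relax(cells, sites, iters=20):
    """Move each site toward the load-weighted centroid of its Voronoi region.

    Heavily loaded regions pull sites toward them, so the final Voronoi
    partition roughly balances the work assigned to each site.
    """
    sites = [list(s) for s in sites]
    for _ in range(iters):
        owner = voronoi_assign(cells, sites)
        for i in range(len(sites)):
            mine = [c for c, o in zip(cells, owner) if o == i]
            w = sum(c[2] for c in mine)
            if w > 0:
                sites[i][0] = sum(c[0] * c[2] for c in mine) / w
                sites[i][1] = sum(c[1] * c[2] for c in mine) / w
    return sites, voronoi_assign(cells, sites)
```

On two well-separated clusters of equally loaded cells, each site converges to one cluster's centroid and the Voronoi partition splits the work evenly.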
Numerical Methods part of the monograph, e-den collection, 2021
2021
abstract
Abstract
Methane (CH4) is one of the anthropogenic greenhouse gases (GHGs) contributing most to global warming. Industry is one of the largest anthropogenic sources of methane, and these sources are currently only roughly estimated. New satellite hyperspectral imagers, such as PRISMA, open up daily temporal monitoring of industrial methane sources at a spatial resolution of 30 m. Here, we developed the Characterization of Effluents Leakages in Industrial Environment (CELINE) code to invert images of the Korpezhe industrial site. In this code, the in-Scene Background Radiance (ISBR) method is combined with a standard Optimal Estimation (OE) approach. The ISBR-OE method avoids the use of a complete and time-consuming radiative transfer model. The ISBR-OEM developed here overcomes the underestimation issues of the linear method (LM) used in the literature for high-concentration plumes and controls a posteriori uncertainty. For the Korpezhe site, using the ISBR-OEM instead of the LM-retrieved CH4 concentration map led to a bias correction on CH4 mass from 4 to 16% depending on the source strength. The most important CH4 source has an estimated flow rate ranging from 0.36 ± 0.3 kg·s−1 to 4 ± 1.76 kg·s−1 on nine dates. These local and variable sources contribute to the CH4 budget and can better constrain climate change models.
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing ( Volume: 14), 2021
abstract
Abstract
Reflectance spectroscopy is a widely used technique for mineral identification and characterization. Since modern airborne and satellite-borne sensors yield an increasing number of hyperspectral data, it is crucial to develop unsupervised methods to retrieve relevant spectral features from reflectance spectra. Spectral deconvolution aims to decompose a reflectance spectrum as the sum of a continuum modeling its overall shape and some absorption features. We present a flexible and automatic method able to deal with various minerals. The approach is based on a physical model and allows us to include noise statistics. It consists of three successive steps: first, continuum pre-estimation based on nonlinear least squares; second, pre-estimation of absorption features using a greedy algorithm; third, refinement of the continuum and absorption estimates. The procedure is first validated on synthetic spectra, including a sensitivity study to instrumental noise and a comparison to other approaches. Then, it is tested on various laboratory spectra. In most cases, absorption positions are recovered with an error below 5 nm, enabling mineral identification. Finally, the proposed method is assessed using hyperspectral images of quarries acquired during a dedicated airborne campaign. Minerals such as calcite and gypsum are accurately identified based on their diagnostic absorption features, including when they are in a mixture. Small changes in the shape of the kaolinite doublet are also detected and could be related to crystallinity or mixtures with other minerals such as gibbsite. The potential of the method to produce mineral maps is also demonstrated.
Proceedings Volume 11727, Algorithms, Technologies, and Applications for Multispectral and Hyperspectral Imaging XXVII, 2021
abstract
Abstract
We present a fuzzy logic approach for identifying minerals from reflectance spectra acquired by hyperspectral sensors in the VNIR and SWIR ranges. The fuzzy logic system is based on human reasoning: it compares the positions of the main and secondary absorptions of the unknown spectrum (spectral characteristics estimated beforehand) with those of a reference database (derived from mineralogical knowledge). The proposed solution is first evaluated on laboratory spectra. It is then applied to airborne HySpex and satellite-borne PRISMA images acquired during a dedicated campaign over two quarries in France. This demonstrates the relevance of the method for automatically identifying minerals in different mineralogical contexts and in the presence of mixtures.
EARSeL Joint Workshop - Earth Observation for Sustainable Cities and Communities, Liège, Belgium, 2021
abstract
Abstract
Roof materials can be a significant source of pollution for the environment and can have negative health effects. Analyses of runoff water have revealed high levels of metal traces, but also polycyclic aromatic hydrocarbons and phthalates. This contamination is thought to result from corrosion and alteration of roof materials. Similarly, the alteration or combustion of asbestos contained in certain types of roofs may release and disperse asbestos fibres into the environment. Acquiring information on roof materials is therefore of great interest for decreasing runoff water pollution and improving air and environmental quality around our homes. To this end, remote sensing is a particularly relevant tool, since it allows semi-automatic mapping of roof materials using multispectral or hyperspectral data. The CASMATTELE project aims to develop a semi-automatic identification tool for roofing materials over the Liège area using remote sensing and machine learning, for public authorities.
Computer (Volume: 54, Issue: 12, December 2021), 2021
abstract
Abstract
We investigate the different levels of abstraction, linked to the diverse artifacts of the scientific software development process, that a software language can propose and the validation and verification facilities associated with the corresponding level of abstraction the language can provide to the user.
Proceedings of the 14th ACM SIGPLAN International Conference on Software Language Engineering, p. 2-15, 2021
abstract
Abstract
Runtime monitoring and logging are fundamental techniques for analyzing and supervising the behavior of computer programs. However, supporting these techniques for a given language induces significant development costs that can hold language engineers back from providing adequate logging and monitoring tooling for new domain-specific modeling languages. Moreover, runtime monitoring and logging are generally considered two different techniques: they are thus implemented separately, which makes users prone to overlooking their potentially beneficial mutual interactions. We propose a language-agnostic, unifying framework for runtime monitoring and logging and demonstrate how it can be used to define loggers, runtime monitors, and combinations of the two, a.k.a. moniloggers. We provide an implementation of the framework that can be used with Java-based executable languages, and evaluate it on two implementations of the NabLab interpreter, leveraging in turn the instrumentation facilities offered by Truffle and those offered by AspectJ.
Abstract
Dealing with complexity is an important challenge in software and systems engineering. In particular, designing such systems requires expertise in various heterogeneous domains. Model-Driven Engineering (MDE) is a development paradigm that copes with this complexity through the conception and use of Domain-Specific Languages (DSLs). A DSL captures all the concepts required to solve a set of problems belonging to a particular domain. DSLs are geared toward domain experts without requiring experience with programming languages. Using DSLs, domain experts are able to model parts of a system using only concepts of their domain of expertise. A particular category of DSLs, Executable DSLs (xDSLs), goes further: through a provided execution semantics, xDSLs enable the definition of dynamic models, which in turn enables early dynamic Verification and Validation (V&V) activities on these models. All xDSLs share a common need for an ecosystem of tools to create, manipulate, and analyze models. But xDSLs come in many shapes and forms, as each is tailored to a particular domain, both syntactically and semantically. Thus, for each new xDSL, tools must be developed anew, or existing tools adapted. This is a tedious and error-prone task that prompted advances in the field, enabling core and advanced V&V activities for xDSLs in a unifying way through well-defined metaprogramming approaches and generic tools leveraging them. Yet, important aspects of xDSLs and V&V activities stand to benefit from dedicated metaprogramming approaches and generic tooling, respectively. On the one hand, no metaprogramming approach currently allows defining the interactions between the models conforming to an xDSL and their environment. On the other hand, features at the heart of important V&V activities such as testing and debugging remain challenging to offer in a generic way. In this thesis, we provide solutions to this problem for a set of tools dedicated to offline and online analysis for xDSLs.
This comes in the form of three distinct contributions. First, we provide a new metaprogramming approach that extends the definition of xDSLs to incorporate a clear definition of the possible interactions between conforming models and their environment. Second, we leverage the extended foundations for the definition of xDSLs offered by our metaprogramming approach to provide generic support for offline and online analysis for a broader scope of xDSLs, in the form of trace comprehension operators and runtime monitoring, respectively. Finally, we leverage the contributions of this thesis to provide an advanced generic modeling environment. We provide implementation details of the various tools derived from our contributions that constitute this modeling environment and illustrate how they interact with one another in different V&V scenarios. For instance, we show how they can be combined to enable the definition of test scenarios and oracles for models of reactive systems from execution traces collected during an interactive debugging session. In the context of MDE, where the diversity of xDSLs hampers the reuse of tools from one language to another, this thesis extends the foundations upon which generic tools can be defined, and builds upon these extended foundations to provide generic support for defining features of V&V activities such as testing and debugging. This results in an advanced and improved framework in terms of V&V for xDSLs, compared to the state of the art.
2020
2020 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2020, New Orleans, LA, USA, May 18-22, 2020, IEEE, p. 958-966, 2020
OpenMP: Portable Multi-Level Parallelism on Modern Systems - 16th International Workshop on OpenMP, IWOMP 2020, Austin, TX, USA, September 22-24, 2020, Proceedings, Springer, p. 313-327, 2020
EuroMPI/USA '20: 27th European MPI Users' Group Meeting, Virtual Meeting, Austin, TX, USA, September 21-24, 2020, ACM, p. 51-60, 2020
2020 IEEE/ACM International Workshop on Runtime and Operating Systems for Supercomputers, ROSS@SC 2020, Atlanta, GA, USA, November 13, 2020, IEEE, p. 1-11, 2020
4th IEEE/ACM International Workshop on Software Correctness for HPC Applications, Correctness@SC 2020, Atlanta, GA, USA, November 11, 2020, IEEE, p. 31-39, 2020
High Performance Computing - ISC High Performance 2020 International Workshops, Frankfurt, Germany, June 21-25, 2020, Revised Selected Papers, Springer, p. 43-54, 2020
Journal of Computational Physics, p. 109405, 2020
Journal of Computational and Theoretical Transport, p. 162-194, 2020
2020 Proceedings of the SIAM Workshop on Combinatorial Scientific Computing, p. 85-95, 2020
Journal of Computational Physics, p. 109275, 2020-05
Computer Physics Communications, Elsevier, 2020
abstract
Abstract
Accurate simulations of metal under heavy shocks, leading to fragmentation and ejection of particles, cannot be achieved by hydrodynamic models alone and must be performed at the atomic scale using molecular dynamics methods. In order to cope with billions of particles subject to short-range interactions, such molecular dynamics methods need to be highly optimized for massively parallel supercomputers. In this paper, we propose leveraging Adaptive Mesh Refinement techniques to improve the efficiency of molecular dynamics codes on highly heterogeneous particle configurations. We introduce a series of techniques that optimize the force computation loop using multi-threading and vectorization-friendly data structures. Our design is guided by the need for load balancing and adaptivity raised by highly dynamic particle sets. We analyze performance results on several simulation scenarios, such as the production of an ejecta cloud from shock-loaded metallic surfaces, using a large number of nodes equipped with Intel Xeon Phi Knights Landing processors. The performance obtained with our new molecular dynamics code achieves speedups greater than 1.38 over the state-of-the-art LAMMPS implementation.
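The force-computation loop such codes optimize rests on the classic linked-cell decomposition: particles are bucketed into cells at least as wide as the interaction cutoff, so pair searches only visit neighboring cells. The sketch below shows only this standard, well-known idea (here with a cubic periodic box), not the paper's AMR data structure, which refines this decomposition adaptively.

```python
def cell_lists(positions, box, rcut):
    """Bucket particles into cells of side >= rcut in a cubic periodic box.

    positions -- list of (x, y, z) tuples with coordinates in [0, box)
    box       -- box edge length
    rcut      -- interaction cutoff radius

    Returns (cells, n): a dict mapping (i, j, k) cell keys to particle
    indices, and the number of cells per dimension. Force evaluation then
    only needs to scan each cell and its 26 neighbors.
    """
    n = max(1, int(box // rcut))   # cells per dimension; side = box/n >= rcut
    side = box / n
    cells = {}
    for i, (x, y, z) in enumerate(positions):
        key = (int(x // side) % n, int(y // side) % n, int(z // side) % n)
        cells.setdefault(key, []).append(i)
    return cells, n
```

With uniform density this makes the neighbor search O(N) instead of O(N²); the heterogeneity the paper targets (dense ejecta next to near-vacuum) is precisely where a single uniform grid becomes wasteful and AMR pays off.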
Journal of Applied Physics, American Institute of Physics, 2020
abstract
Abstract
We perform very large scale molecular dynamics (MD) simulations to investigate the ejection process from shock-loaded tin surfaces in regimes where the metal first undergoes solid to solid phase transitions and then melts on release. In these conditions, a classical two-wave structure propagates within the metal. When it interacts with the surface, our MD simulations reveal very different behaviors. If the surface geometry is perfectly flat or contains almost flat perturbations (sinusoidal type), a solid cap made of crystallites forms at the free surface, over a thickness of a few tens of nanometers. This surface cap melts more slowly than the bulk, and as a result, the ejection process is greatly slowed down. If the surface geometry contains V-shape geometrical perturbations, the oblique interaction of the incident shock wave with the planar interface of the defect leads to a sharp increase of temperature at the defect's bottom. At this place, the metal undergoes a solid to liquid phase change over the entire length of the groove, and this promotes the ejection of matter in the form of sheets of liquid metal. However, this phase change is not spatially uniform, and the sheets keep in memory this process by exhibiting a non-uniform leading edge and large ripples. These ripples grow over time, which ends up causing the fragmentation of the sheets as they develop. In this case, the fragmentation is non-uniform, and it differs from the rather uniform fragmentation process observed when the metal directly melts upon receiving the shock.
Université Paris-Saclay, 2020-12
abstract
Abstract
This thesis addresses the problem of the automatic generation of purely hexahedral meshes for simulation codes when the input is a mesh carrying volume fraction data, meaning that there can be several materials inside one cell. The proposed approach should create a hexahedral mesh where each cell corresponds to a single material, and where interfaces between materials form smooth surfaces. From a theoretical standpoint, we aim at adapting and extending state-of-the-art techniques, and we apply them on examples, some classically derived from CAD models (and imprinted onto a mesh to obtain volume fractions), some procedurally generated, and others in an intercode capacity where the results of a first simulation code are taken as our inputs. We first define a metric that allows the evaluation of our (or others') results and a method to improve those; we then introduce a discrete material interface reconstruction method inspired by the scientific visualization field, and finally we present an algorithmic pipeline, called ELG, that offers a guarantee on the mesh quality by performing geometrical and topological mesh adaptation.
IGARSS 2020 - 2020 IEEE International Geoscience and Remote Sensing Symposium, 2020
abstract
Abstract
The absorption positions and shapes are key information for identifying and characterizing a mineral from its reflectance spectrum. With the development of new airborne and satellite-borne hyperspectral sensors, automatic methods have to be developed to extract and analyze this useful information. A flexible deconvolution procedure, able to deal with various sensor characteristics and a wide variety of minerals of interest, is proposed. The approach is based on the sparse representation of the spectrum and the use of a greedy algorithm, the Non-Negative Orthogonal Matching Pursuit (NNOMP) algorithm. First, NNOMP is adapted to deal with a parametric physical model of mineral reflectance spectra. Then, noise statistics are taken into account to improve the detection of small absorptions while minimizing overfitting effects. The procedure is tested on real data from two quarries in France. Results show the potential of our procedure for the estimation of a consistent number of absorptions whose parameters can be used to analyze the mineralogy.
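The greedy selection loop at the core of NNOMP can be sketched as follows. This is an illustrative plain-dictionary version in numpy/scipy; the paper's parametric reflectance model and noise-aware stopping rule are not reproduced, and all function names here are ours:

```python
import numpy as np
from scipy.optimize import nnls

def nnomp(D, y, k, tol=1e-8):
    """Greedy Non-Negative Orthogonal Matching Pursuit (illustrative).
    D: dictionary with unit-norm columns; y: signal; k: max number of atoms."""
    y = np.asarray(y, dtype=float)
    residual = y.copy()
    support, coeffs = [], np.zeros(D.shape[1])
    for _ in range(k):
        corr = D.T @ residual              # correlation of each atom with residual
        j = int(np.argmax(corr))
        if corr[j] <= tol:                 # no positively-correlated atom left
            break
        if j not in support:
            support.append(j)
        x, _ = nnls(D[:, support], y)      # non-negative refit on the support
        residual = y - D[:, support] @ x
        coeffs[:] = 0.0
        coeffs[support] = x
    return coeffs
```

The non-negativity constraint is what allows the method to model absorptions as purely additive contributions, unlike plain OMP.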
Computer Methods in Applied Mechanics and Engineering , 2020
abstract
Abstract
Numerical codes using the lattice Boltzmann methods (LBM) for simulating one- or two-phase flows are widely compiled and run on graphics processing units (GPUs). However, those computational units require rewriting the program in a low-level language suited to those architectures (e.g., CUDA for NVIDIA® GPUs, or OpenCL). In this paper we focus our effort on the performance portability of LBM, i.e., the possibility of writing LB algorithms with a high level of abstraction while remaining efficient on a wide range of architectures such as multicore x86, NVIDIA® GPUs, ARM, and so on. For such a purpose, the implementation of LBM is carried out by developing a unique code, LBM_saclay, written in the C++ language and coupled with the Kokkos library for performance portability in the context of High Performance Computing. In this paper, the LBM is used to simulate a phase-field model for two-phase flow problems with phase change. The mathematical model is composed of the incompressible Navier–Stokes equations coupled with the conservative Allen–Cahn model. Initially developed in the literature for immiscible binary fluids, the model is extended here to simulate phase change occurring at the interface between liquid and gas. For that purpose, a heat equation is added with a source term involving the time derivative of the phase field. In the phase-field equation a source term is added to approximate the mass production rate at the interface. Several validations are carried out to check the implementation of the full model step by step. Finally, computational times are compared on CPU and GPU platforms for the physical problem of film boiling.
Abstract
Executable domain-specific languages (DSLs) enable the execution of behavioral models. While an execution is mostly driven by the model content (e.g., control structures), many use cases require interacting with the running model, such as simulating scenarios in an automated or interactive way, or coupling the model with other models of the system or environment. The management of these interactions is usually hardcoded into the semantics of the DSL, which prevents its reuse for other DSLs and the provision of generic interaction-centric tools (e.g., event injector). In this paper, we propose a metalanguage for complementing the definition of executable DSLs with explicit behavioral interfaces to enable external tools to interact with executed models in a unified way. We implemented the proposed metalanguage in the GEMOC Studio and show how behavioral interfaces enable the realization of tools that are generic and thus usable for different executable DSLs.
Automated Software Engineering 27(3), 2020
abstract
Abstract
Model transformations play an important role in the evolution of systems in various fields such as healthcare and the automotive and aerospace industries. Thus, it is important to check the correctness of model transformation programs. Several approaches have been proposed to generate test cases for model transformations based on different coverage criteria (e.g., statements, rules, metamodel elements, etc.). However, the execution of a large number of test cases during the evolution of transformation programs is time-consuming and may include a lot of overlap between the test cases. In this paper, we propose a test case selection approach for model transformations based on multi-objective search. We use the non-dominated sorting genetic algorithm (NSGA-II) to find the best trade-offs between two conflicting objectives: (1) maximizing the coverage of rules and (2) minimizing the execution time of the selected test cases. We validated our approach on several evolution cases of medium and large ATLAS Transformation Language programs.
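The two-objective trade-off driving the NSGA-II search can be illustrated with the non-dominated comparison alone. This is a minimal sketch (the full algorithm adds non-dominated sorting into ranked fronts, crowding distance, and genetic operators; function names are ours):

```python
def dominates(a, b):
    """a, b are (coverage, time) pairs; coverage is maximized, time minimized.
    a dominates b if it is at least as good on both objectives and
    strictly better on one."""
    cov_a, t_a = a
    cov_b, t_b = b
    return (cov_a >= cov_b and t_a <= t_b) and (cov_a > cov_b or t_a < t_b)

def pareto_front(candidates):
    """Keep the candidate test suites not dominated by any other
    (the first front that NSGA-II converges toward)."""
    return [c for c in candidates
            if not any(dominates(o, c) for o in candidates if o is not c)]
```

For example, a suite covering 10 rules in 7 s is discarded when another covers the same 10 rules in 5 s, while suites trading coverage against time both survive on the front.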
Abstract
Runtime monitoring is a fundamental technique used throughout the lifecycle of a system for many purposes, such as debugging, testing, or live analytics. While runtime monitoring for general-purpose programming languages has seen a great amount of research, developing such complex facilities for any executable Domain-Specific Language (DSL) remains a challenging, recurring and error-prone task. A generic solution must both support a wide range of executable DSLs (xDSLs) and induce as little execution time overhead as possible. Our contribution is a fully generic approach based on a temporal property language with a semantics tailored for runtime verification. Properties can be compiled to efficient runtime monitors that can be attached to any kind of executable discrete-event model within an integrated development environment. Efficiency is bolstered using a novel combination of structural model queries and complex event processing. Our evaluation on 3 xDSLs shows that the approach is applicable with an execution time overhead ranging from 121% (on executions shorter than 1 s) down to 79% (on executions shorter than 20 s), making it suitable for model testing and debugging.
Astronomy and Astrophysics, Volume 643, 2020
abstract
Abstract
We present the Extreme-Horizon (EH) cosmological simulation, which models galaxy formation with stellar and active galactic nuclei (AGN) feedback and uses a very high resolution in the intergalactic and circumgalactic medium. Its high resolution in low-density regions results in smaller-size massive galaxies at a redshift of z = 2, which is in better agreement with observations compared to other simulations. We achieve this result thanks to the improved modeling of cold gas flows accreting onto galaxies. In addition, the EH simulation forms a population of particularly compact galaxies with stellar masses of 10¹⁰–10¹¹ M⊙ that are reminiscent of observed ultracompact galaxies at z ≃ 2. These objects form primarily through repeated major mergers of low-mass progenitors and independently of baryonic feedback mechanisms. This formation process can be missed in simulations with insufficient resolution in low-density intergalactic regions.
2019
Comput. Fluids, p. 372 - 393, 2019
Tools for High Performance Computing 2017, Springer International Publishing, p. 57-71, 2019
abstract
Abstract
Several instrumentation interfaces have been developed for parallel programs to make observable the actions that take place during execution and to make accessible information about the program’s behavior and performance. Following in the footsteps of the successful profiling interface for MPI (PMPI), new rich interfaces to expose the internal operation of MPI (MPI-T) and OpenMP (OMPT) runtimes are now in the standards. Taking advantage of these interfaces requires tools to selectively collect events from multiple interfaces by various techniques: function interposition (PMPI), value read (MPI-T), and callbacks (OMPT). In this paper, we present the unified instrumentation pipeline proposed by the MALP infrastructure that can be used to forward a variety of fine-grained events from multiple interfaces online to multi-threaded analysis processes implemented orthogonally with plugins. In essence, our contribution complements “front-end” instrumentation mechanisms by a generic “back-end” event consumption interface that allows “consumer” callbacks to generate performance measurements in various formats for analysis and transport. With such support, online and post-mortem cases become similar from an analysis point of view, making it possible to build more unified and consistent analysis frameworks. The paper describes the approach and demonstrates its benefits with several use cases.
Int. J. High Perform. Comput. Appl., 2019
OpenMP: Conquering the Full Hardware Spectrum - 15th International Workshop on OpenMP, IWOMP 2019, Auckland, New Zealand, September 11-13, 2019, Proceedings, Springer, p. 231-245, 2019
abstract
Abstract
The advent of the multicore era led to the duplication of functional units through an increasing number of cores. To exploit those processors, a shared-memory parallel programming model is one possible direction. Thus, OpenMP is a good candidate to enable different paradigms: data parallelism (including loop-based directives) and control parallelism, through the notion of tasks with dependencies. But it is the programmer's responsibility to ensure that data dependencies are complete, so that no data races may happen. It might be complex to guarantee that no issue will occur and that all dependencies have been correctly expressed in the context of nested tasks. This paper proposes an algorithm to detect the data dependencies that might be missing on the OpenMP task clauses between tasks that have been generated by different parents. This approach is implemented inside a tool relying on the OMPT interface.
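The pairwise check such a tool performs can be sketched as follows. This is a toy model only: the actual detection works on OMPT events and memory addresses observed at runtime, whereas here tasks, their accessed data, and their declared dependencies are given explicitly, and all names are hypothetical:

```python
def missing_dependencies(tasks):
    """tasks: list of (name, reads, writes, declared_deps) tuples, where
    declared_deps is the set of task names this task is ordered against
    (e.g. via OpenMP depend clauses). Flags unordered task pairs that
    touch the same data with at least one write: a potential data race."""
    issues = []
    for i, (na, ra, wa, da) in enumerate(tasks):
        for nb, rb, wb, db in tasks[i + 1:]:
            # write/read, read/write or write/write overlap between the two tasks
            conflict = (wa & (rb | wb)) | (wb & ra)
            if conflict and na not in db and nb not in da:
                issues.append((na, nb, sorted(conflict)))
    return issues
```

Declaring the dependency (e.g. an `out`/`in` pair on the shared variable) removes the pair from the report, mirroring how a complete set of depend clauses silences the detector.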
Proceedings of the 26th European MPI Users’ Group Meeting, EuroMPI 2019, Zürich, Switzerland, September 11-13, 2019, ACM, p. 10:1-10:10, 2019
Journal of Computational Physics, p. 339-364, 2019
Proceedings of the 26th European MPI Users' Group Meeting, EuroMPI 2019, Zürich, Switzerland, September 11-13, 2019, ACM, p. 2:1-2:10, 2019
Journal of Computer Languages, Elsevier, p. 100919, 2019
2019
Proceedings of the 28th International Meshing Roundtable, 2019
abstract
Abstract
Hexahedral mesh generation using overlay grid methods has the benefit of being fully automatic, requiring minimal user input. These methods follow a mesh-first approach where an initial mesh, usually a grid, is used to overlay the reference geometry. Procedures to modify the initial mesh are then employed to best capture the geometry and get a conformal all-hex mesh [1]. One of the main drawbacks of those methods is the resulting mesh quality. While the interior of the mesh remains the same as the initial mesh, cells located at the material interfaces can end up quite deformed or even inverted, making the mesh useless for most numerical simulation codes. Considering an input mesh carrying volume fractions of the materials, the main purpose of the presented work is to ensure a minimal cell quality. Our method draws upon the overlay grid pipeline described in [2], where several steps (cell assignment correction, interface reconstruction, mesh adaptation) are altered to control cell quality.
Surveys in Geophysics 40, 431–470, 2019
abstract
Abstract
Natural and anthropogenic hazards have the potential to impact all aspects of society, including its economy and the environment. Diagnostic data to inform decision-making are critical for hazard management, whether for emergency response, routine monitoring or assessments of potential risks. Imaging spectroscopy (IS) has unique contributions to make via its ability to provide some key quantitative diagnostic information. In this paper, we examine a selection of key case histories representing the state of the art to gain an insight into the achievements and perspectives in the use of visible to shortwave infrared IS for the detection, assessment and monitoring of a selection of significant natural and anthropogenic hazards. The selected key case studies provide compelling evidence for the use of IS technology and its ability to contribute diagnostic information currently unattainable from operational spaceborne Earth observation systems. User requirements for the applications were also evaluated. The evaluation showed that these requirements should be met by the projected launch of spaceborne IS sensors in the near-, mid- and long-term future, together with the increasing availability, quality and moderate cost of off-the-shelf sensors and the possibility of coupling unmanned autonomous systems with miniaturized sensors. The challenges and opportunities for the scientific community when such data become available will then be ensuring consistency between data from different sensors; developing techniques to efficiently handle, process, integrate and deliver the large volumes of data; and, most importantly, translating the data into information that meets the specific needs of the user community, in a form that they can digest and understand. The latter is especially important for transforming the technology from a scientific to an operational tool. Additionally, the information must be independently validated using current trusted practices, and uncertainties must be quantified, before IS-derived measurements can be integrated into operational monitoring services.
Wear, Elsevier, p. 1102-1109, 2019
abstract
Abstract
Secondary air systems of civil aircraft engines require labyrinth seals with a minimum gap clearance for optimal functioning. In the case of high-speed contact during the engine running-in period, an abradable material is deposited on the stationary part of the seal to limit the damage to the rotating shaft, which is made of a titanium alloy. Such situations are potentially critical for the seal; hence, the present study aims to observe the material behaviour under these contact conditions and to establish the tribological circuit of a third body through the interface. A high-speed contact test rig was developed to recreate contact conditions occurring in an aircraft engine. Two contact configurations occurring in different locations of the engine, with different materials and surface areas, were explored. Thermal and mechanical instrumentation was used in each test. The influence of the contact geometry and the test conditions shows that material flows through the contact determine the life cycle of the contact (by establishing a balance between the source, internal and material flows) and allow for the control of the thermomechanical constraints in a high-speed contact.
Proceedings of the 7th International Conference on Fracture Fatigue and Wear: FFW 2018, 9-10 July 2018, Ghent University, Belgium, Springer, p. 638-660, 2019
abstract
Abstract
Civil aircraft engines present a wide range of labyrinth seals to ensure good airtightness between the different components of the secondary air system. An increase in efficiency requires lower clearance gaps. As a consequence, brief contacts between rotating and stationary parts may occur, especially during the engine running-in period. Such events can cause critical situations (seizure…) depending on the working conditions. In this paper, a high-speed contact test device (76 m s−1) was developed to precisely recreate the friction conditions occurring in a turboshaft labyrinth seal and to better understand the material behavior in such tribological cases. This device was instrumented to carry out mechanical (axial and tangential forces and torque) and thermal measurements (IR camera and pyrometer). An experimental campaign was carried out to study the contact between a Ti6Al4V rotor and an abradable coating of Al-Si polyester. The presented results show the complex interactions that strongly depend on the way the worn material behaves in the contact area. Local interaction dynamics are analysed with regard to mechanical and thermal measurements at different rotating speeds, incursion depths, and interaction speeds.
PHYSICAL REVIEW E 99(5), 2019
abstract
Abstract
It is still not known whether solutions to the Navier-Stokes equation can develop singularities from regular initial conditions. In particular, a classical and unsolved problem is to prove that the velocity field is Hölder continuous with some exponent h<1 (i.e., not necessarily differentiable) at small scales. Different methods have already been proposed to explore the regularity properties of the velocity field and the estimate of its Hölder exponent h. A first method is to detect potential singularities via extrema of an “inertial” dissipation D*=limℓ→0DℓI that is independent of viscosity [Duchon and Robert, Nonlinearity 13, 249 (2000)]. Another possibility is to use the concept of multifractal analysis that provides fractal dimensions of the subspace of exponents h. However, the multifractal analysis is a global statistical method that only provides global information about local Hölder exponents, via their probability of occurrence. In order to explore the local regularity properties of a velocity field, we have developed a local statistical analysis that estimates locally the Hölder continuity. We have compared outcomes of our analysis with results using the inertial energy dissipation DℓI. We observe that the dissipation term indeed gets bigger for velocity fields that are less regular according to our estimates. The exact spatial distribution of the local Hölder exponents however shows nontrivial behavior and does not exactly match the distribution of the inertial dissipation.
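The local estimate described above rests on the scaling |v(x+ℓ) − v(x)| ~ ℓʰ of velocity increments. A minimal illustrative version, fitting h at one point by log-log regression over a set of scales (the paper's local statistical analysis is more careful; the function name is ours):

```python
import numpy as np

def local_holder(v, x, scales):
    """Estimate a local Hoelder exponent h at sample index x from the
    scaling |v[x + l] - v[x]| ~ l**h, via a log-log least-squares fit."""
    incr = np.array([abs(v[x + l] - v[x]) for l in scales], dtype=float)
    incr = np.maximum(incr, 1e-15)                  # guard against log(0)
    slope, _ = np.polyfit(np.log(scales), np.log(incr), 1)
    return float(slope)
```

On a field behaving like sqrt(x) near the origin, the fit recovers h = 1/2; smaller estimated h flags less regular, potentially singular regions.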
The Astrophysical Journal, Volume 876, Number 2 (144), 2019
abstract
Abstract
By generalizing the theory of convection to any type of thermal and compositional source terms (diabatic processes), we show that thermohaline convection in Earth's oceans, fingering convection in stellar atmospheres, and moist convection in Earth's atmosphere are derived from the same general diabatic convective instability. We also show that "radiative convection" triggered by the CO/CH4 transition with radiative transfer in the atmospheres of brown dwarfs is analogous to moist and thermohaline convection. We derive a generalization of the mixing-length theory to include the effect of source terms in 1D codes. We show that CO/CH4 "radiative" convection could significantly reduce the temperature gradient in the atmospheres of brown dwarfs similarly to moist convection in Earth's atmosphere, thus possibly explaining the reddening in brown dwarf spectra. By using idealized 2D hydrodynamic simulations in the Ledoux unstable regime, we show that compositional source terms can indeed provoke a reduction of the temperature gradient. The L/T transition could be explained by a bifurcation between the adiabatic and diabatic convective transports and seen as a giant cooling crisis: an analog of the boiling crisis in liquid/steam-water convective flows. This mechanism, with other chemical transitions, could be present in many giant and Earth-like exoplanets. The study of the impact of different parameters (effective temperature, compositional changes) on CO/CH4 radiative convection and the analogy with Earth moist and thermohaline convection is opening the possibility of using brown dwarfs to better understand some aspects of the physics at play in the climate of our own planet.
The Astrophysical Journal, Volume 875, Number 2, 2019
abstract
Abstract
Convection is an important physical process in astrophysics that is well studied using numerical simulations under the Boussinesq and/or anelastic approximations. However, these approaches reach their limits when compressible effects are important in the high-Mach flow regime, e.g., in stellar atmospheres or in the presence of accretion shocks. In order to tackle these issues, we propose a new high-performance and portable code called “ARK” with a numerical solver well suited for the stratified compressible Navier–Stokes equations. We take a finite-volume approach with machine-precision conservation of mass, transverse momentum, and total energy. Based on previous works in applied mathematics, we propose the use of a low-Mach correction to achieve good precision in both low- and high-Mach regimes. The gravity source term is discretized using a well-balanced scheme in order to reach machine-precision hydrostatic balance. This new solver is implemented using the Kokkos library in order to achieve high-performance computing and portability across different architectures (e.g., multi-core, many-core, and GP-GPU). We show that the low-Mach correction allows us to reach the low-Mach regime with much better accuracy than a standard Godunov-type approach. The combined well-balanced property and low-Mach correction allowed us to trigger Rayleigh–Bénard convective modes close to the critical Rayleigh number. Furthermore, we present 3D turbulent Rayleigh–Bénard convection with low diffusion using the low-Mach correction, leading to a higher kinetic energy power spectrum. These results are very promising for future studies of high-Mach and highly stratified convective problems in astrophysics.
2018
J. Comput. Phys., p. 268 - 301, 2018
Euro-Par 2018: Parallel Processing - 24th International Conference on Parallel and Distributed Computing, Turin, Italy, August 27-31, 2018, Proceedings, Springer, p. 616-627, 2018
Euro-Par 2018: Parallel Processing - 24th International Conference on Parallel and Distributed Computing, Turin, Italy, August 27-31, 2018, Proceedings, Springer, p. 560-572, 2018
Proceedings of the 25th European MPI Users’ Group Meeting, Barcelona, Spain, September 23-26, 2018, ACM, p. 12:1-12:11, 2018
Journal of Computational Physics, p. 228-257, 2018
Euro-Par 2018: Parallel Processing Workshops - Euro-Par 2018 International Workshops, Turin, Italy, August 27-28, 2018, Revised Selected Papers, Springer, p. 123-133, 2018
Proceedings of the International Symposium on Memory Systems, MEMSYS 2018, Old Town Alexandria, VA, USA, October 01-04, 2018, ACM, p. 169-182, 2018
Université Grenoble Alpes, 2018-02
ISAV 18: Proceedings of the Workshop on In Situ Infrastructures for Enabling Extreme-Scale Analysis and Visualization, p. 7-12, 2018
Proceedings of the 11th ACM SIGPLAN International Conference on Software Language Engineering, p. 200-204, 2018
2018 Proceedings of the SIAM Workshop on Combinatorial Scientific Computing (CSC), Society for Industrial and Applied Mathematics, p. 66-75, 2018-01
Abstract
In this work, we provide a new post-processing procedure for automatically adjusting node locations of an all-hex mesh to better match the volume of a reference geometry. This process is particularly well suited for mesh-first approaches, such as overlay grid ones. In practice, hexahedral meshes generated via an overlay grid procedure, where a precise reference geometry representation is unknown or impractical to use, do not provide precise volumetric preservation. A discrete volume fraction representation of the reference geometry MI on an overlay grid is compared with a volume fraction representation of a 3D finite element mesh MO. This work introduces the notion of localized discrepancy between MI and MO and uses it to design a procedure that relocates mesh nodes to more accurately match a reference geometry. We demonstrate this procedure on a wide range of hexahedral meshes generated with the Sculpt code and show improved volumetric preservation while still maintaining acceptable mesh quality.
ZAMM Journal of applied mathematics and mechanics (Zeitschrift für angewandte Mathematik und Mechanik), vol 98-3, p. 448-453, 2018
Applied Mathematics and Computation, vol 332, p. 160-166, 2018
2018
abstract
Abstract
This paper is focused on the retrieval of industrial aerosol optical thickness (AOT) and microphysical properties by means of airborne imaging spectroscopy. Industrial emissions generally lead to optically thin plumes, requiring an adapted detection method that takes into account the small proportion of the particles sought in the atmosphere. To this end, a semi-analytical model combined with the Cluster-Tuned Matched Filter (CTMF) algorithm is presented to characterize those plumes, requiring knowledge of the ground under the plume. The model allows the direct computation of the at-sensor radiance when a plume is included in the radiative transfer. When applied to industrial aerosol classes as defined in this paper, simulated spectral radiances can be compared to ‘real’ MODTRAN (Moderate Resolution Atmospheric Transmission) radiances using the Spectral Angle Mapper (SAM). Over the range from 0.4 to 0.7 µm, for three grounds (water, vegetation, and a bright one), SAM scores are lower than 0.043 in the worst case (a particle that is both absorbing and scattering, over a bright ground), and usually lower than 0.025. The darker the ground reflectance, the more accurate the results (typically for reflectances lower than 0.3). Concerning AOT retrieval capabilities, with a model pre-calculated for a reference optical thickness of 0.25, we are able to retrieve plume AOT at 550 nm in the range 0.0 to 0.4 with an error usually ranging between 9% and 13%. The first test case is a CASI (Compact Airborne Spectrographic Imager) image acquired over the metallurgical industry of Fos-sur-Mer (France). First results of the model coupled with the CTMF algorithm reveal a scattering aerosol plume with particle sizes increasing with the distance from the stack (from a detection score of 54% near the stack for particles with a diameter of 0.1 µm, to 69% away from it for 1.0 µm particles).
A refinement is then made to estimate aerosol plume properties more precisely, using a multimodal distribution based on the previous results. This leads to a mixture of sulfate and brown carbon particles with a plume AOT ranging between 0.2 and 0.5. The second test case is an AHS (Airborne Hyperspectral Scanner) image acquired over the petrochemical site of Antwerp (Belgium). The first CTMF application results in the detection of a brown carbon aerosol with a 0.1 µm mode (detection score: 51%). Refined results show the AOT decreasing from 0.15 to 0.05 along the plume, for a mixture of a brown carbon fine mode and sulfate aerosol with a 0.3 µm radius.
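The SAM score used above to compare simulated and MODTRAN radiances is a standard measure: the angle between two spectra viewed as vectors, which is insensitive to overall illumination scaling. A minimal numpy version (function name is ours):

```python
import numpy as np

def spectral_angle(x, y):
    """Spectral Angle Mapper: angle in radians between two spectra.
    0 means identical shape (up to a positive scale factor)."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    cos_theta = x @ y / (np.linalg.norm(x) * np.linalg.norm(y))
    return float(np.arccos(np.clip(cos_theta, -1.0, 1.0)))  # clip guards rounding
```

A SAM score below 0.043 rad, as reported, thus means the simulated and reference radiance shapes are nearly collinear across the 0.4 to 0.7 µm bands.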
Remote Sensing 10(1):146, 2018
abstract
Abstract
The identification and mapping of the mineral composition of by-products and residues on industrial sites is a topic of growing interest because it may provide information on plant-processing activities and their impact on the surrounding environment. Imaging spectroscopy can provide such information based on the spectral signatures of soil mineral markers. In this study, we use the automatized Gaussian model (AGM), an automated, physically based method relying on spectral deconvolution. Originally developed for the short-wavelength infrared (SWIR) range, it has been extended to include information from the visible and near-infrared (VNIR) range to take iron oxides/hydroxides into account. We present the results of its application to two French industrial sites: (i) the Altéo Environnement site in Gardanne, southern France, dedicated to the extraction of alumina from bauxite; and (ii) the Millennium Inorganic Chemicals site in Thann, eastern France, which produces titanium dioxide from ilmenite and rutile, and its associated Séché éco Services site used to neutralize the resulting effluents, producing gypsum. HySpex hyperspectral images were acquired over Gardanne in September 2013 and an APEX image was acquired over Thann in June 2013. In both cases, reflectance spectra were measured and samples were collected in the field and analyzed for mineralogical and chemical composition. When applying the AGM to the images, both in the VNIR and SWIR ranges, we successfully identified and mapped minerals of interest characteristic of each site: bauxite, Bauxaline® and alumina for Gardanne; and red and white gypsum and calcite for Thann. Identifications and maps were consistent with in situ measurements.
CIMNE, p. 1314, 2018
Journal of Physics: Conference Series, Volume 1125, Joint Varenna-Lausanne International Workshop on the Theory of Fusion Plasmas 2018 27–31 August 2018, Varenna, Italy, 2018
abstract
Abstract
This contribution deals with the fluid modeling of multicomponent magnetized plasmas in thermo-chemical non-equilibrium from the partially- to fully-ionized collisional regimes, aiming at the predictive simulation of magnetic reconnection in Sun chromosphere conditions. Such fluid models are required for large-scale simulations by relying on high performance computing. The fluid model is derived from a kinetic theory approach, yielding a rigorous description of the dissipative and non-equilibrium effects and a well-identified mathematical structure. We start from a general system of equations that is obtained by means of a multiscale Chapman-Enskog method, based on a non-dimensional analysis accounting for the mass disparity between the electrons and heavy particles, including the influence of the electromagnetic field and transport properties. The latter are computed by using a spectral Galerkin method based on a converged Laguerre-Sonine polynomial approximation. Then, in the limit of small Debye length with respect to the characteristic scale in the Sun chromosphere, we derive a two-temperature single-momentum multicomponent diffusion model coupled to Maxwell's equations, which is able to describe fully- and partially-ionized plasmas, beyond the multi-fluid model of Braginskii, valid for the whole range of the Sun chromosphere conditions. The second contribution is the development and verification of an accurate and robust numerical strategy that is based on CanoP, a massively parallel code with adaptive mesh refinement capability, which is able to cope with the full spectrum of scales of the magnetic reconnection process, without additional constraint on the time steps compared to single-fluid Magnetohydrodynamics (MHD) models. The final contribution is a study of the physics of magnetic reconnection in collaboration with the heliophysics team of NASA Ames Research Center. 
We show that the model and methods allow us to retrieve the results of usual single-fluid MHD models in the highly collisional case at equilibrium, while achieving a more detailed physics description relevant to such applications in the weakly collisional case, where non-equilibrium effects become important.
14th European Conference on Modelling Foundations and Applications, 2018
abstract
Abstract
Recent approaches contribute facilities to breathe life into metamodels, thus making behavioral models directly executable. Such facilities are particularly helpful to better utilize a model over the time dimension, e.g., for early validation and verification. However, when even a small change is made to the model, to the language definition (e.g., semantic variation points), or to the external stimuli of an execution scenario, it remains difficult for a designer to grasp the impact of such a change on the resulting execution trace. This prevents accessible trade-off analysis and design-space exploration on behavioral models. In this paper, we propose a set of formally defined operators for analyzing execution traces. The operators include dynamic trace filtering, trace comparison with diff computation and visualization, and graph-based view extraction to analyze cycles. The operators are applied and validated on a demonstrative example that highlights their usefulness for the comprehension of specific aspects of the underlying traces.
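At its simplest, the trace comparison operator reduces to a sequence diff over execution steps. A minimal sketch using Python's standard difflib (the paper's operators are formally defined over richer execution traces; this only illustrates the diff idea, and the function name is ours):

```python
import difflib

def trace_diff(trace_a, trace_b):
    """Compare two execution traces (lists of step labels) and return
    (op, steps_a, steps_b) chunks, where op is 'equal', 'insert',
    'delete' or 'replace'."""
    matcher = difflib.SequenceMatcher(a=trace_a, b=trace_b, autojunk=False)
    return [(op, trace_a[i1:i2], trace_b[j1:j2])
            for op, i1, i2, j1, j2 in matcher.get_opcodes()]
```

Running it on the traces of a model before and after a change pinpoints exactly which execution steps appeared, disappeared, or were replaced.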
IEEE Scientific Visualization Conference, 2018
abstract
Abstract
We present a highly efficient solution to interact with the Deep Water Impact Ensemble Data Set provided for the Scientific Visualization Contest 2018. Interactive visualization is made possible on one core of a laptop with the full resolution and the same accuracy as in the original data set, whereas 256 to 2048 supercomputer nodes were originally required to generate the data. As far as we know, this is the only way to achieve full-resolution exploration on a laptop. We first show how our approach enables more efficient visualization by using the Tree-Based Adaptive Mesh Refinement grid data structure we introduced in VTK, vtkHyperTreeGrid [1], as compared to structured or unstructured approaches. Then we elaborate on the visualization capabilities offered by vtkHyperTreeGrid-optimized algorithms and the performance achieved on the limited resources available on a laptop. Next, we present how the hierarchical structure makes possible novel ways of exploring data interactively and helps achieve accelerated data exploration by hierarchically driving decimation of values. Finally, we show preliminary results of interactive volume rendering using splatting.
2017
46th International Conference on Parallel Processing Workshops, ICPP Workshops 2017, Bristol, United Kingdom, August 14-17, 2017, IEEE Computer Society, p. 251-260, 2017
Scaling OpenMP for Exascale Performance and Portability - 13th International Workshop on OpenMP, IWOMP 2017, Stony Brook, NY, USA, September 20-22, 2017, Proceedings, Springer, p. 203-216, 2017
Parallel Computing is Everywhere, Proceedings of the International Conference on Parallel Computing, ParCo 2017, 12-15 September 2017, Bologna, Italy, IOS Press, p. 465-474, 2017
29th International Symposium on Computer Architecture and High Performance Computing, SBAC-PAD 2017, Campinas, Brazil, October 17-20, 2017, IEEE Computer Society, p. 177-184, 2017
Journal of Computational Physics, 2017
Procedia Engineering, p. 258-270, 2017-01
abstract
Abstract
We propose a new post-processing procedure for automatically adjusting node locations of an all-hex mesh to better match the volume of a reference geometry. Hexahedral meshes generated via an overlay grid procedure, where a precise reference geometry representation is unknown or is impractical to use, do not provide for precise volumetric preservation. A discrete volume fraction representation of the reference geometry MI on an overlay grid is compared with a volume fraction representation of a 3D finite element mesh MO. This work proposes a procedure that uses the localized discrepancy between MI and MO to drive node relocation operations to more accurately match a reference geometry. We demonstrate this procedure on a wide range of hexahedral meshes generated with the Sculpt code and show improved volumetric preservation while still maintaining acceptable mesh quality.
Euro-Par 2017: Parallel Processing, Springer International Publishing, p. 594-606, 2017
abstract
Abstract
In this paper, we present a fine-grained multi-stage metric-based triangular remeshing algorithm on manycore and NUMA architectures. It is motivated by the dynamically evolving data dependencies and workload of such irregular algorithms, often resulting in poor performance and data locality at high number of cores. In this context, we devise a multi-stage algorithm in which a task graph is built for each kernel. Parallelism is then extracted through fine-grained independent set, maximal cardinality matching and graph coloring heuristics. In addition to index ranges precalculation, a dual-step atomic-based synchronization scheme is used for nodal data updates. Despite its intractable latency-boundness, a good overall scalability is achieved on a NUMA dual-socket Intel Haswell and a dual-memory Intel KNL computing nodes (64 cores). The relevance of our synchronization scheme is highlighted through a comparison with the state-of-the-art.
JOURNAL OF APPLIED PHYSICS, AMER INST PHYSICS, 2017
abstract
Abstract
We compare, at similar scales, the processes of microjetting and ejecta production from shocked roughened metal surfaces by using atomistic and continuous approaches. The atomistic approach is based on very large scale molecular dynamics (MD) simulations with systems containing up to 700 × 10^6 atoms. The continuous approach is based on Eulerian hydrodynamics simulations with adaptive mesh refinement; the simulations take into account the effects of viscosity and surface tension, and the equation of state is calculated from the MD simulations. The microjetting is generated by shock-loading a three-dimensional tin crystal above its melting point, with an initial sinusoidal free surface perturbation, the crystal being set in contact with a vacuum. Several samples with homothetic wavelengths and amplitudes of defect are simulated in order to investigate the influence of viscosity and surface tension of the metal. The simulations show that the hydrodynamic code reproduces with very good agreement the profiles, calculated from the MD simulations, of the ejected mass and velocity along the jet. Both codes also exhibit a similar fragmentation phenomenology of the ejected metallic liquid sheets, although the fragmentation seed is different. We show in particular that it depends on the mesh size in the continuous approach. Published by AIP Publishing.
Abstract
In this article, the scientific life of D. Gogny is recounted by several collaborators. His strong involvement in research related to various fields of physics (such as nuclear, atomic and plasma physics, as well as electromagnetism) appears clearly, as does the progress made in the understanding of fundamental physics.
IEEE Trans. AP, vol 65, n 2, p. 794-804, 2017
Discrete and continuous dynamical systems ,Volume 37, Number 3, 2017
19th International Conference on Solid-State Sensors, Actuators and Microsystems (TRANSDUCERS), IEEE, p. 520-523, 2017
abstract
Abstract
This paper reports a novel method to evaluate and improve the reliability of mechanical stops during the design and validation phases of MEMS (Micro ElectroMechanical Systems) in shock environments. First, in-plane stop contact behavior is modeled through both steady-state and dynamic mechanical FEM (Finite-Element Modeling) to validate the physics package and to extract the nonlinear stiffness and stress distribution as functions of the contact force applied on a cylinder-to-plane Hertz contact. Then, the transient response of a MEMS including stop behavior is modeled with a lumped impact element approach, which allows computing the contact force as a function of the applied half-sine shock parameters. Finally, several shock tests have been performed on numerous devices embedding the previously modeled stops to evaluate the experimental survival rate. Fitting the experimental data to the numerical results, combined with Weibull theory, shows good agreement and allows estimating the silicon Weibull parameters at 0.7 GPa, 1.1 GPa and 4 for the threshold stress, average stress and Weibull modulus, respectively.
The Astrophysical Journal, Volume 840, Number 1, 2017
abstract
Abstract
Magnetohydrodynamic (MHD) turbulence driven by the magnetorotational instability can provide diffusive transport of angular momentum in astrophysical disks, and a widely studied computational model for this process is the ideal, stratified, isothermal shearing box. Here we report results of a convergence study of such boxes up to a resolution of $N = 256$ zones per scale height, performed on Blue Waters at NCSA with RAMSES-GPU. We find that the time and vertically integrated dimensionless shear stress $\overline{\alpha} \sim N^{-1/3}$, i.e. the shear stress is resolution dependent. We also find that the magnetic field correlation length decreases with resolution, $\lambda \sim N^{-1/2}$. This variation is strongest at the disk midplane. We show that our measurements of $\alpha$ are consistent with earlier studies. We discuss possible reasons for the lack of convergence.
Communications in Mathematical Sciences, Volume 15, Number 3, 2017
abstract
Abstract
This work focuses on the numerical approximation of the shallow water equations (SWE) using a Lagrange-projection type approach. We propose to extend to this context the recent implicit-explicit schemes developed in [C. Chalons, M. Girardin, and S. Kokh, SIAM J. Sci. Comput., 35(6):a2874–a2902, 2013], [C. Chalons, M. Girardin, and S. Kokh, Commun. Comput. Phys., to appear, 20(1):188–233, 2016] in the framework of compressible flows, with or without stiff source terms. These methods enable the use of time steps that are no longer constrained by the sound velocity thanks to an implicit treatment of the acoustic waves, and maintain accuracy in the subsonic regime thanks to an explicit treatment of the material waves. In the present setting, particular attention is also given to the discretization of the non-conservative terms in the SWE and, more specifically, to the well-known well-balanced property. We prove that the proposed numerical strategy enjoys important nonlinear stability properties, and we illustrate its behaviour on several relevant test cases.
Journal of Systems and Software Volume 137, March 2018, Pages 261-288, 2017
abstract
Abstract
Omniscient debugging is a promising technique that relies on execution traces to enable free traversal of the states reached by a model (or program) during an execution. While a few General-Purpose Languages (GPLs) already have support for omniscient debugging, developing such a complex tool for any executable Domain Specific Language (DSL) remains a challenging and error-prone task. A generic solution must: support a wide range of executable DSLs independently of the metaprogramming approaches used for implementing their semantics; be efficient for good responsiveness. Our contribution relies on a generic omniscient debugger supported by efficient generic trace management facilities. To support a wide range of executable DSLs, the debugger provides a common set of debugging facilities, and is based on a pattern to define runtime services independently of metaprogramming approaches. Results show that our debugger can be used with various executable DSLs implemented with different metaprogramming approaches. As compared to a solution that copies the model at each step, it is on average six times more efficient in memory, and at least 2.2 times faster when exploring past execution states, while only slowing down the execution 1.6 times on average.
2017
abstract
Abstract
We present here the result of continuation work, performed to further fulfill the vision we outlined in [Harel,Lekien,Pébaÿ-2017] for the visualization and analysis of tree-based adaptive mesh refinement (AMR) simulations, using the hypertree grid paradigm which we proposed. The first filter presented hereafter implements an adaptive approach in order to accelerate the rendering of 2-dimensional AMR grids, thereby solving the problem posed by the loss of interactivity that occurs when dealing with large and/or deeply refined meshes. Specifically, view parameters are taken into account in order to: on the one hand, avoid creating surface elements that are outside of the view area; on the other hand, utilize level-of-detail properties to cull those cells that are deemed too small to be visible with respect to the given view parameters. This adaptive approach often results in a massive increase in rendering performance. In addition, two new selection filters provide data analysis capabilities, by allowing for the extraction of those cells within a hypertree grid that are deemed relevant in some sense, either geometrically or topologically. After a description of these new algorithms, we illustrate their use within the Visualization Toolkit (VTK), in which we implemented them. This note ends with some suggestions for subsequent work.
2017
abstract
Abstract
We present here the first systematic treatment of the problems posed by the visualization and analysis of large-scale, parallel adaptive mesh refinement (AMR) simulations on an Eulerian grid. When compared to those obtained by constructing an intermediate unstructured mesh with fully described connectivity, our primary results indicate a gain of at least 80% in terms of memory footprint, with a better rendering while retaining similar execution speed. In this article, we describe the key concepts that allow us to obtain these results, together with the methodology that facilitates the design, implementation, and optimization of algorithms operating directly on such refined meshes. This native support for AMR meshes has been contributed to the open source Visualization Toolkit (VTK). This work pertains to a broader long-term vision, with the dual goal to both improve interactivity when exploring such data sets in 2 and 3 dimensions, and optimize resource utilization.
2016
ESAIM: Math. Model. Numer. Anal., EDP Sciences, p. 187-214, 2016
Proceedings of the 23rd European MPI Users’ Group Meeting, EuroMPI 2016, Edinburgh, United Kingdom, September 25-28, 2016, ACM, p. 51-63, 2016
High Performance Computing for Computational Science - VECPAR 2016 - 12th International Conference, Porto, Portugal, June 28-30, 2016, Revised Selected Papers, Springer, p. 248-255, 2016
Conférence d'informatique en Parallélisme, Architecture et Système (Compas'2016), 2016-07
IWOMP 2016, 2016-10
Oil & Gas Science and Technology - Revue d'IFP Energies nouvelles, Institut Français du Pétrole, p. 65:1-13, 2016-11
Conférence d'informatique en Parallélisme, Architecture et Système (Compas'2016), 2016-07
Euro-Par 2016: Parallel Processing - 22nd International European Conference on Parallel and Distributed Computing, Grenoble, FR, August 24-26, 2016, Proceedings, p. 196-208, 2016
Thèse de doctorat, spécialité informatique, CEA, Université d'Evry-Val-d'Essonne, 2016
Oil & Gas Science and Technology--Revue d'IFP Energies nouvelles, EDP Sciences, p. 57, 2016
International Journal for Numerical Methods in Engineering, Wiley Online Library, p. 496-519, 2016
Journal of Computational Physics, Elsevier, p. 549-582, 2016
Conférence d'informatique En Parallélisme, Architecture et Système (COMPAS), 2016-07
Comptes Rendus Mathematique, 2016-01
Journal of Sound and Vibration, vol 374, p. 185-198, 2016
Remote Sensing Letters 7(6):581-590, 2016
abstract
Abstract
Hyperspectral sensors generally acquire images in more than one hundred contiguous narrow spectral channels with a (deca)metric spatial resolution. Each pixel of the image is thus associated with a continuous spectrum which can be used to identify or map surface minerals. The most powerful algorithms (e.g., USGS (United States Geological Survey) Tetracorder) run with a standardized spectral library, are often supervised and require some expert knowledge. In this paper, we present an original method for mineral identification and mapping. Its originality lies in its fully automatic functioning over the full spectral range, from initialization using spectral derivatives, to spectral deconvolution and mineral identification, with a global approach. The modelling combines exponential Gaussians with a continuum including the fundamental water absorption, and deals with overfitting to keep only the relevant Gaussians. We tested the method in the SWIR (Short-Wave InfraRed) for 14 minerals representative of industrial environments (e.g., quarries, mines, industries). More than 98% of the simulated spectra were correctly identified. When applied to two AVIRIS (Airborne Visible/InfraRed Imaging Spectrometer) images, results were consistent with ground truth data. The method could be improved by extending it to the VNIR (Visible and Near-InfraRed) spectral range to include iron oxides, and by managing spectral mixtures.
Photoniques, 2016
abstract
Abstract
The characterization of man-made aerosols and gases is a major challenge for society, since these components have a direct impact on health and climate. Several characterization techniques exist, but airborne remote sensing is a potentially well-suited answer for studying these sources when access to their spatial extent is needed. Moreover, since hyperspectral imaging covers the entire optical domain, it can meet all the needs for the detection and characterization of aerosols and gases.
2015
J. Comput. Phys., Academic Press, p. 28-54, 2015
Concurr. Comput. Pract. Exp., p. 1528-1539, 2015
Proceedings of the 22nd European MPI Users’ Group Meeting, EuroMPI 2015, Bordeaux, France, September 21-23, 2015, ACM, p. 3:1-3:9, 2015
Proceedings of the 22nd European MPI Users' Group Meeting, EuroMPI 2015, Bordeaux, France, September 21-23, 2015, ACM, p. 16:1-16:2, 2015
Purdue Univ., West Lafayette, IN (United States); Sandia National Lab. (SNL-NM), Albuquerque, NM (United States); Argonne National Lab. (ANL), Argonne, IL (United States), 2015-01
JOURNAL OF APPLIED PHYSICS, AMER INST PHYSICS, 2015
abstract
Abstract
The propagation of an incident shock and subsequent rarefaction and compression waves in a porous medium is analysed from a set of large-scale molecular dynamics simulations. The porous material is modelled as a collection of spherical pores, empty or filled with dense gaseous argon, enclosed in a copper matrix. We observe that the pore collapse induces strong local disorder in the matrix, even for shock intensities below the melting point of shocked copper. Various mechanisms are considered, and a detailed analysis of the numerical results shows that the melting around an isolated pore is mainly due to the plastic work induced by the collapse: a result that can be extended to more complicated pore shapes. The systematic study of the influence of the shock intensity, the pore size, and the presence of a filling gas shows that the melting is mainly inhibited by the presence of the gas. The final structure strongly depends on the interactions between the waves resulting from the various reflections of the initial shock at the sample boundaries, implying that the evaluation of the incident shock intensity based on post-mortem analyses requires knowledge of the full history of the sample. (C) 2015 AIP Publishing LLC.
JOURNAL OF APPLIED PHYSICS, AMER INST PHYSICS, 2015
abstract
Abstract
We present a series of molecular dynamics simulations of the shock compression of copper matrices containing a single graphite inclusion: these model systems can be related to some specific carbon-rich rocks which, after a meteoritic impact, are found to contain small fractions of nanodiamonds embedded in graphite in the vicinity of high impedance minerals. We show that the graphite to diamond transformation occurs readily for nanometer-sized graphite inclusions, via a shock accumulation process, provided the pressure threshold of the bulk graphite/diamond transition is overcome, independently of the shape or size of the inclusion. Although high diamond yields (~80%) are found after a few picoseconds in all cases, the transition is non-isotropic and depends substantially on the relative orientation of the graphite stack with respect to the shock propagation, leading to distinct nucleation processes and size-distributions of the diamond grains. A substantial regraphitization process occurs upon release and only inclusions with favorable orientations likely lead to the preservation of a fraction of this diamond phase. These results agree qualitatively well with the recent experimental observations of meteoritic impact samples. (C) 2015 AIP Publishing LLC.
2014
Int. J. Numer. Meth. Fluids, p. 1043-1063, 2014
Comp. Meth. Appl. Mech. Eng., Elsevier, p. 315-333, 2014
Euro-Par 2014 Parallel Processing - 20th International Conference, Porto, Portugal, August 25-29, 2014. Proceedings, Springer, p. 596-607, 2014
21st European MPI Users’ Group Meeting, EuroMPI/ASIA ’14, Kyoto, Japan - September 09 - 12, 2014, ACM, p. 121, 2014
Finite Volumes for Complex Applications VII-Elliptic, Parabolic and Hyperbolic Problems, Springer International Publishing, p. 901-909, 2014
Communications in Computational Physics, 2014-06
2014-12
ROADEF - 15ème Congrès Annuel de La Société Française de Recherche Opérationnelle et d'aide à La Décision, Société française de recherche opérationnelle et d'aide à la décision, 2014-02
CSC 14, p. 2, 2014
AGU Fall Meeting Abstracts, p. IN21A-3700, 2014
Acta Applicandae Mathematicae, Volume 130, Issue 1, p. 151-162, 2014
Journal of the Acoustical Society of America 136(1), p. 37-52, 2014
Bulletin of the American Physical Society, 59(20), BAPS.2014.DFD.D20.3, 2014
Finite Volumes for Complex Applications VII-Elliptic, Parabolic and Hyperbolic Problems, 2014
abstract
Abstract
We are interested in the study of numerical schemes for the homogeneous in space asymptotic limit in the non equilibrium regime of the relativistic transfer equation. This limit leads to a frequency drift term modeling the Doppler effects for photons, and our aim is to design costless well-balanced schemes. One difficulty is that wave speed may vanish, which implies that standard well-balanced schemes constructed by discretizing the source term at the interfaces and by using a Godunov scheme may become inconsistent in this limit. This is indeed observed numerically.
Finite Elements in Analysis and Design, p. 23-33, 2014
abstract
Abstract
Simulation of low energy impacts on composite structures is a key feature in aeronautics. Unfortunately, it involves very expensive numerical simulations: on the one hand, the structures of interest have large dimensions and need fine volume meshes (at least locally) in order to properly capture damage. On the other hand, explicit schemes are commonly used for this kind of simulation (Lopes et al., 2009 [1]; Bouvet, 2009 [2]), which results in very small time steps to ensure the CFL condition (Courant et al., 1967 [3]). Implicit algorithms are actually more difficult to use in this situation because of the lack of smoothness of the solution, which can lead to a prohibitive number of time steps or even to non-convergence of Newton-like iterative processes. It is also observed that non-smooth phenomena are localized in space and time (near the impacted zone). It may therefore be advantageous to adopt a multiscale space/time approach by splitting the structure into several substructures, each with its own space/time discretization and its own integration scheme. The purpose of this decomposition is to take advantage of the specificities of both algorithm families: the explicit scheme focuses on non-smooth areas, while smoother parts of the solution (actually linear in this work) are computed with larger time steps using an implicit scheme. We propose here an implementation of the Gravouil–Combescure method (GC) (Combescure and Gravouil, 2002 [4]) by means of a low-intrusive coupling between the implicit finite element analysis (FEA) code Zset/Zébulon (Z-set official website, 2013 [5]) and the explicit FEA code Europlexus (Europlexus official website, 2013 [6]). Simulations of low energy impacts on composite stiffened panels are presented. It is shown on this application that large time step ratios can be reached, thus saving computation time.
2013
Comp. Meth. Appl. Mech. Eng., Elsevier, p. 56-65, 2013
Euro-Par 2013: Parallel Processing Workshops - BigDataCloud, DIHC, FedICI, HeteroPar, HiBB, LSDVE, MHPC, OMHI, PADABS, PROPER, Resilience, ROME, and UCHPC 2013, Aachen, Germany, August 26-27, 2013. Revised Selected Papers, Springer, p. 168-177, 2013
42nd International Conference on Parallel Processing, ICPP 2013, Lyon, France, October 1-4, 2013, IEEE Computer Society, p. 985-994, 2013
Proceedings of the ACM SIGPLAN Workshop on Memory Systems Performance and Correctness, June, 21, 2013, Seattle, Washington, USA, Co-located with PLDI 2013, ACM, p. 3:1-3:9, 2013
Parallel Computing: Accelerating Computational Science and Engineering (CSE), Proceedings of the International Conference on Parallel Computing, ParCo 2013, 10-13 September 2013, Garching (near Munich), Germany, IOS Press, p. 783-792, 2013
SIAM Journal on Scientific Computing, 2013-01
Proceedings of the 21st International Meshing Roundtable, Springer Berlin Heidelberg, p. 315-332, 2013
abstract
Abstract
Generating a full hexahedral mesh for any 3D geometric domain is still a challenging problem. Among the different attempts, the octree-based methods are the most efficient from an engineering point of view. But the main drawback of such methods is the lack of control near the boundary. In this work, we propose an a posteriori technique based on the notion of the fundamental mesh in order to improve the mesh quality near the boundary. This approach is based on the resolution of a constraint problem defined on the topology of the CAD model that we have to discretize.
Proceedings of the 6th International Conference on Adaptive Modeling and Simulation, ADMOS 2013, p. 412-422, 2013
abstract
Abstract
In numerous computational engineering applications, hexahedral meshes may be preferred over tetrahedral meshes. However, automatic hexahedral meshing remains an unsolved issue, and thus generating a hexahedral mesh is known as a time-consuming stage that requires a lot of user interaction in the simulation process. A possible way of designing and optimizing a CAD model or a geometric shape requires parametric studies where the shape is enriched by inserting geometric details into it. Then we must "adapt" the initial mesh rather than generate it anew for each new detail taken into account. In order to perform such studies with hexahedral meshes, we provide an imprinting method allowing us to automatically add geometric details into an existing mesh. This addition is done using geometric projections, sheet (layers of hexahedral elements) insertions and combinatorial algorithms, while preserving the hexahedral mesh structure as well as possible.
Communications in Nonlinear Science and Numerical Simulation 18, p. 2679-2688, 2013
Maths in action, vol 6, n°2, p. 1-14, 2013
International Journal of Remote Sensing, 34(19), 6837–6864, 2013
abstract
Abstract
Hyperspectral imagery is a widely used technique to study atmospheric composition. For several years, many methods have been developed to estimate the abundance of gases. However, existing methods do not simultaneously retrieve the properties of aerosols and often use standard aerosol models to describe the radiative impact of particles. This approach is not suited to the characterization of plumes, because plume particles may have a very different composition and size distribution from aerosols described by the standard models given by radiative transfer codes. This article presents a new method to simultaneously retrieve carbon dioxide (CO2) and aerosols inside a plume, combining an aerosol retrieval algorithm using visible and near-infrared (VNIR) wavelengths and a CO2 estimation algorithm using shortwave infrared (SWIR) wavelengths. The microphysical properties of the plume particles, obtained after aerosol retrieval, are used to calculate their optical properties in the SWIR. Then, a database of atmospheric terms is generated with the radiative transfer code Moderate Resolution Atmospheric Transmission (MODTRAN). Finally, pixel radiances around the 2.0 μm absorption feature are used to retrieve the CO2 abundances. After conducting a signal sensitivity analysis, the method was applied to two airborne visible/infrared imaging spectrometer (AVIRIS) images acquired over areas of biomass burning. For the first image, in situ measurements were available. The results show that including the aerosol retrieval step before the CO2 estimation: (1) induces better agreement between in situ measurements and retrieved CO2 abundances (the CO2 overestimation of about 15% induced by neglecting aerosols has been corrected, especially for pixels where the plume is not very thick); (2) reduces the standard deviation of the estimated CO2 abundance by a factor of four; and (3) makes the spatial distribution of the retrieved concentrations coherent.
11e colloque national en calcul des structures, 2013
COUPLED V: proceedings of the V International Conference on Computational Methods for Coupled Problems in Science and Engineering, CIMNE, p. 1373-1394, 2013
2012
J. Comput. Phys., p. 4324-4354, 2012
J. Comput. Phys., p. 6559 - 6595, 2012
26th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2012, Shanghai, China, May 21-25, 2012, IEEE Computer Society, p. 366-377, 2012
OpenMP in a Heterogeneous World - 8th International Workshop on OpenMP, IWOMP 2012, Rome, Italy, June 11-13, 2012. Proceedings, Springer, p. 254-257, 2012
Recent Advances in the Message Passing Interface - 19th European MPI Users’ Group Meeting, EuroMPI 2012, Vienna, Austria, September 23-26, 2012. Proceedings, Springer, p. 37-46, 2012
Laser and Particle Beams, p. 415-419, 2012
Phys. Rev. E, p. 066307, 2012
Discrete and Continuous Dynamical Systems S, AIMS, p. 345-367, 2012
Versailles Saint-Quentin-en-Yvelines University, France, 2012
19th International Conference on High Performance Computing, HiPC 2012, Pune, India, December 18-22, 2012, IEEE Computer Society, p. 1-10, 2012
International journal for numerical methods in engineering, Wiley Online Library, p. 1331-1357, 2012
Scientific Programming, Hindawi Ltd, p. 129-150, 2012
The Eighth International Conference on Engineering Computational Technology, p. 4, 2012
Parallel Partitioning, Coloring, and Ordering in Scientific Computing, Chapman & Hall/Crc Press, p. 351-371, 2012
Journal of Mathematical Sciences volume 185, p. 517–522, 2012
2011
2011
OpenMP in the Petascale Era - 7th International Workshop on OpenMP, IWOMP 2011, Chicago, IL, USA, June 13-15, 2011. Proceedings, Springer, p. 80-93, 2011
Esaim Proceedings, p. 195-210, 2011
Journal of Computational Physics, Elsevier, p. 1793-1821, 2011
Geophysical Journal International, p. 721-739, 2011-08
abstract
Abstract
We present forward and adjoint spectral-element simulations of coupled acoustic and (an)elastic seismic wave propagation on fully unstructured hexahedral meshes. Simulations benefit from recent advances in hexahedral meshing, load balancing and software optimization. Meshing may be accomplished using a mesh generation tool kit such as CUBIT, and load balancing is facilitated by graph partitioning based on the SCOTCH library. Coupling between fluid and solid regions is incorporated in a straightforward fashion using domain decomposition. Topography, bathymetry and Moho undulations may be readily included in the mesh, and physical dispersion and attenuation associated with anelasticity are accounted for using a series of standard linear solids. Finite-frequency Fréchet derivatives are calculated using adjoint methods in both fluid and solid domains. The software is benchmarked for a layercake model. We present various examples of fully unstructured meshes, snapshots of wavefields and finite-frequency kernels generated by Version 2.0 'Sesame' of our widely used open source spectral-element package SPECFEM3D.
Journal of Computational and Applied Mathematics 235, p. 5394–5410, 2011
Applied Numerical Mathematics 61, p. 1114-1131, 2011
Progress In Electromagnetics Research B, Vol. 29, p. 209-231, 2011
Remote Sensing of Environment 115(2):404-414, 2011
abstract
Abstract
Vegetation water content retrieval using passive remote sensing techniques in the 0.4–2.5 μm region (reflection of solar radiation) and the 8–14 μm region (emission of thermal radiation) has given rise to an abundant literature. The wavelength range in between, where the main water absorption bands are located, has surprisingly received very little attention because of the complexity of the radiometric signal that mixes both reflected and emitted fluxes. Nevertheless, it is now covered by the latest generation of passive optical sensors (e.g. SEBASS, AHS). This work aims at modeling leaf spectral reflectance and transmittance in the infrared, particularly between 3 μm and 5 μm, to improve the retrieval of vegetation water content using hyperspectral data. Two unique datasets containing 32 leaf samples each were acquired in 2008 at the USGS National Center, Reston (VA, USA) and the ONERA Research Center, Toulouse (France). Reflectance and transmittance were recorded using laboratory spectrometers in the spectral region from 0.4 μm to 14 μm, and the leaf water and dry matter contents were determined. It turns out that these spectra are strongly linked to water content up to 5.7 μm. This dependence is much weaker further into the infrared, where spectral features seem to be mainly associated with the biochemical composition of the leaf surface. The measurements show that leaves transmit light in this wavelength domain and that the transmittance of dry samples can reach 0.35 of incoming light around 5 μm, and 0.05 around 11 μm. This work extends the PROSPECT leaf optical properties model by taking into account the high absorption levels of leaf constituents (by the insertion of the complex Fresnel coefficients) and surface phenomena (by the addition of a top layer). 
The new model, PROSPECT-VISIR (VISible to InfraRed), simulates leaf reflectance and transmittance between 0.4 μm and 5.7 μm (at 1 nm spectral resolution) with a root mean square error (RMSE) of 0.017 and 0.018, respectively. Model inversion also allows the prediction of water (RMSE = 0.0011 g/cm²) and dry matter (RMSE = 0.0013 g/cm²) contents.
2010
Int. J. Finite Volumes, p. 30-65, 2010
24th IEEE International Symposium on Parallel and Distributed Processing, IPDPS 2010, Atlanta, Georgia, USA, 19-23 April 2010 - Workshop Proceedings, IEEE, p. 1-7, 2010
Beyond Loop Level Parallelism in OpenMP: Accelerators, Tasking and More, 6th Internationan Workshop on OpenMP, IWOMP 2010, Tsukuba, Japan, June 14-16, 2010, Proceedings, Springer, p. 1-14, 2010
Comptes Rendus Académie des Sciences, Paris, Série I, p. 105-110, 2010
Comptes Rendus Mathématique, Elsevier, p. 1027-1032, 2010
ESAIM: Mathematical Modelling and Numerical Analysis, EDP Sciences, p. 693-713, 2010
2009
J. Comput. Phys., p. 5160-5183, 2009
ESAIM: Proc., p. 1008-1024, 2009
Recent Advances in Parallel Virtual Machine and Message Passing Interface, 16th European PVM/MPI Users’ Group Meeting, Espoo, Finland, September 7-10, 2009. Proceedings, Springer, p. 94-103, 2009
Physics of Plasmas, p. 044502, 2009
Journal of Computational Physics, p. 833-860, 2009
Computers & Fluids, Elsevier, p. 765-777, 2009
Proceedings of the 8th Workshop on Parallel/High-Performance Object-Oriented Scientific Computing, Association for Computing Machinery, 2009
abstract
Abstract
In this paper, we introduce the Arcane software development framework for 2D and 3D numerical simulation codes. First, we describe the Arcane core, the mesh management and the parallelism strategy. Then, we focus on the concepts introduced to speed up the development of numerical codes: numerical modules, variables, entry points and services. We explain the execution model and enumerate the available debugging tools. Finally, the main functionalities of Arcane are described through an example. We conclude with an outline of future work.
Journal of computational Physics, Elsevier, p. 5763-5786, 2009
Journal of Physics: Conference Series, p. 12008, 2009-07
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), p. 191-205, 2009
Proc. of 2009 Dagstuhl Seminar on Combinatorial Scientific Computing, 2009
Remote Sensing of Environment 113(4):781-793, 2009
abstract
Abstract
This paper presents the retrieval method L-APOM, which aims at characterizing the microphysical and optical properties of aerosol plumes from hyperspectral images with high spatial resolution. The inversion process is divided into three steps: estimation of the ground reflectance below the plume, characterization of the standard atmosphere (gases and background aerosols) and estimation of the plume aerosol properties. As spectral information alone is not sufficient to ensure uniqueness of solutions, original constraints are added by assuming slow spatial variations of particle properties within the plume. The whole inversion process is validated on a large set of simulated images and proves to remain accurate even in the worst cases of noise: relative estimation errors of aerosol properties remain between 10% and 20% in most cases. L-APOM is applied to a real AVIRIS hyperspectral image of a biomass burning plume for which in situ measurements are available. The retrieved properties appear globally consistent with the measurements.
2008
Comput. Fluids, p. 877 - 886, 2008
Euro-Par 2008 Workshops - Parallel Processing, VHPC 2008, UNICORE 2008, HPPC 2008, SGS 2008, PROPER 2008, ROIA 2008, and DPA 2008, Las Palmas de Gran Canaria, Spain, August 25-26, 2008, Revised Selected Papers, Springer, p. 53-62, 2008
Euro-Par 2008 - Parallel Processing, 14th International Euro-Par Conference, Las Palmas de Gran Canaria, Spain, August 26-29, 2008, Proceedings, Springer, p. 78-88, 2008
High Performance Computing - HiPC 2008, 15th International Conference, Bangalore, India, December 17-20, 2008. Proceedings, Springer, p. 30-41, 2008
ESAIM: Proceedings, EDP Sciences, p. 46-59, 2008
Comptes Rendus Mathematique, Elsevier, p. 533-538, 2008
International journal for numerical methods in engineering, Wiley Online Library, p. 1065-1089, 2008
Finite Volumes for Complex Applications V, John Wiley & Sons, p. 851-864, 2008
Journal of Computational Physics, Elsevier, p. 9365-9388, 2008
Proposed for publication in Parallel Computing., Sandia National Laboratories (SNL), Albuquerque, NM, and Livermore, CA (United States), 2008-12
High Performance Computing for Computational Science - VECPAR, Springer, p. 350-363, 2008
abstract
Abstract
In order to better understand the internal structure of asteroids orbiting in the Solar system, and hence the response of such objects to impacts, seismic wave propagation in asteroid 433-Eros is simulated numerically based on a spectral-element method at frequencies lying between 2 Hz and 22 Hz. In the year 2000, the NEAR Shoemaker mission to Eros provided images of the asteroid's surface, which contains numerous fractures that likely extend into its interior. Our goal is to be able to propagate seismic waves resulting from an impact in such models. For that purpose we create and mesh both homogeneous and fractured models with a highly dispersive regolith layer at the surface using the CUBIT mesh generator developed at Sandia National Laboratories (USA). The unstructured meshes are partitioned using the METIS software package in order to minimize edge cuts and therefore optimize load balancing in our parallel blocking and non-blocking MPI implementations. We show the results of several simulations and illustrate the fact that they exhibit good scaling.
SC '08: Proceedings of the 2008 ACM/IEEE Conference on Supercomputing, p. 1-11, 2008-11
abstract
Abstract
SPECFEM3D_GLOBE is a spectral element application enabling the simulation of global seismic wave propagation in 3D anelastic, anisotropic, rotating and self-gravitating Earth models at unprecedented resolution. A fundamental challenge in global seismology is to model the propagation of waves with periods between 1 and 2 seconds, the highest frequency signals that can propagate clear across the Earth. These waves help reveal the 3D structure of the Earth's deep interior and can be compared to seismographic recordings. We broke the 2 second barrier using the 62K processor Ranger system at TACC. Indeed we broke the barrier using just half of Ranger, by reaching a period of 1.84 seconds with a sustained 28.7 Tflops on 32K processors. We obtained similar results on the XT4 Franklin system at NERSC and the XT4 Kraken system at the University of Tennessee Knoxville, while a similar run on the 28K processor Jaguar system at ORNL, which has better memory bandwidth per processor, sustained 35.7 Tflops (a higher flops rate) with a shortest period of 1.94 seconds. Thus we have enabled a powerful new tool for seismic wave simulation, one that operates in the same frequency regimes as nature; in seismology there is no need to pursue much smaller periods because higher frequency signals do not propagate across the entire globe. We employed performance modeling methods to identify performance bottlenecks and worked through issues of parallel I/O and scalability. Improved mesh design and numbering result in excellent load balancing and few cache misses. The primary achievements are not just the scalability and high teraflops number, but a historic step towards understanding the physics and chemistry of the Earth's interior at unprecedented resolution.
Wave Motion 45, p. 400-411, 2008
IEEE Transactions on Antennas Propagation, p. 1984-1992, 2008
International Journal of Finite Volumes, V. 5, p. 1-16, 2008
Applied Optics 47(11):1851-1866, 2008
abstract
Abstract
A semianalytical model, named APOM (aerosol plume optical model), predicting the radiative effects of aerosol plumes in the spectral range [0.4, 2.5 μm], is presented in the case of nadir viewing. It is devoted to the analysis of plumes arising from single strong emission events (high optical depths) such as fires or industrial discharges. The scene is represented by a standard atmosphere (molecules and natural aerosols) to which a plume layer is added at the bottom. The estimated at-sensor reflectance depends on the atmosphere without the plume, the solar zenith angle, the plume optical properties (optical depth, single-scattering albedo, and asymmetry parameter), the ground reflectance, and the wavelength. Its mathematical expression as well as its numerical coefficients are derived from MODTRAN4 radiative transfer simulations. The DISORT option is used with 16 fluxes to provide a sufficiently accurate calculation of the multiple scattering effects that are important for dense smokes. Model accuracy is assessed using a set of simulations performed in the case of biomass burning and industrial plumes. APOM proves to be accurate and robust for solar zenith angles between 0° and 60°, whatever the sensor altitude and the standard atmosphere, for plume phase functions defined from urban and rural models, and for plume locations that extend from the ground to a height below 3 km. The modeling errors in the at-sensor reflectance are on average below 0.002. They can reach values of 0.01, but these then correspond to low relative errors (below 3% on average). This model can be used for forward modeling (quick simulations of multi/hyperspectral images and help in sensor design) as well as for the retrieval of the plume optical properties from remotely sensed images.
2007
Int. J. Multiphase Flow, p. 1 - 39, 2007
Plasma Physics and Controlled Fusion, p. B601-B610, 2007
Shock compression of condensed matter, p. 47-50, 2007
Journal of Computational Physics, p. 464-490, 2007
Computer Methods in Applied Mechanics and Engineering, p. 3127-3140, 2007
APS shock compression of condensed matter meeting abstracts, p. G4.004, 2007
Numerical Analysis and Scientific Computing for PDEs and their challenging applications, 2007
Computer methods in applied mechanics and engineering, Elsevier, p. 2497-2526, 2007
PPAM 2007 - Seventh International Conference on Parallel Processing and Applied Mathematics, 2007-09
Université Sciences et Technologies - Bordeaux I, 2007-09
IEEE Xplore, 2007
abstract
Abstract
This letter presents a new theoretical approach for anomaly detection using a priori information about targets. This a priori knowledge deals with the general spectral behavior and the spatial distribution of targets. In this letter, we consider subpixel and isolated targets that are spectrally anomalous in one region of the spectrum but not in another. This method is totally different from matched filters, which suffer from a relative sensitivity to small errors in the target spectral signature. We incorporate the spectral a priori knowledge into a new detection distance, and we propose a Bayesian approach with a Markovian regularization to suppress the potential targets that do not respect the spatial a priori. The interest of the method is illustrated on simulated data consisting of realistic anomalies superimposed on a real HyMap hyperspectral image.
2006
PhD thesis in computer science, CEA, Université de Bordeaux, 2006
Comptes Rendus Mathematique, Elsevier, p. 441-446, 2006
Euro-Par 2006 Parallel Processing, Springer, p. 243-252, 2006
PIER 59, p. 215-230, 2006
La Recherche, 2006
IEEE Transactions on Geoscience and Remote Sensing 44(6):1566 - 1574, 2006
abstract
Abstract
A method [atmospheric correction via simulated annealing (ACSA)] is proposed that enhances the atmospheric correction of hyperspectral images over dark surfaces. It is based on the minimization of a smoothness criterion to avoid the assumption of linear variations of the reflectance within gas absorption bands. We first show that this commonly used approach generally fails over dark surfaces when the signal to noise ratio strongly declines. In this case, important residual features highly correlated with the shape of gas absorption bands are observed in the estimated surface reflectance. We add a geometrical constraint to deal with this correlation. A simulated annealing approach is used to solve this constrained optimization problem. The parameters involved in the implementation of the algorithm (initial temperature, number of iterations, cooling schedule, and correlation threshold) are automatically determined by using a standard simulated annealing theory, reflectance databases, and sensor characteristics. Applied to a HyMap image with available ground truths, we verify that ACSA adequately recovers ground reflectance over clear land surfaces, and that the added spectral shape constraint does not introduce any spurious feature in the spectrum. The analysis of an AVIRIS image of Central Switzerland clearly shows the ability of the method to perform enhanced water vapor estimations over dark surfaces. Over a lake (reflectance equal to 0.02, low signal to noise ratio equal to about 6), ACSA retrieves unbiased water vapor amounts (2.86 cm ± 0.36 cm) in agreement with in situ measurements (2.97 cm ± 0.30 cm). This corresponds to a reduction of the standard deviation by a factor 3 in comparison with standard unconstrained procedures (1.95 cm ± 1.08 cm). Similar results are obtained using a Hyperion image of the DoE ARM SGP test site containing a very dark area of the land surface.
2005
Numerical Methods for Hyperbolic and Kinetic Problems, IRMA lectures in Mathematics and Theoretical Physics, p. 177-207, 2005
Annales des Télécomm, vol.60, n°5-6, p. 630-648, 2005
2004
Arch. Comput. Methods Eng., p. 199-256, 2004
AIAA J., New York, etc. American Institute of Aeronautics; Astronautics., p. 469-477, 2004
Journal of Computational Physics, p. 80-105, 2004
Computational Geosciences, Springer, p. 149-162, 2004
Proceedings of the 5th Eurographics conference on Parallel Graphics and Visualization, Eurographics Association, p. 49-58, 2004
Comptes Rendus Mathematique, Elsevier, p. 893-898, 2004
IEEE Transactions on Geoscience and Remote Sensing 42(4):854-864, 2004
abstract
Abstract
A method [joint reflectance and gas estimator (JRGE)] is developed to estimate a set of atmospheric gas concentrations in an unknown surface reflectance context from hyperspectral images. It is applicable for clear atmospheres without any aerosol in a spectral range between approximately 800 and 2500 nm. Standard gas-by-gas methods yield a 6% rms error in H₂O retrieval from Airborne Visible/Infrared Imaging Spectrometer (AVIRIS) data, reaching several tens of percent for a set of widespread ground materials, resulting from a simplifying assumption of linear variations of the reflectance model within gas absorption bands and partial accounting of the gas-induced signal. JRGE offers a theoretical framework consisting of a two-step algorithm that accounts for sensor characteristics, assumptions on gas concentrations and reflectance variations. It estimates variations in gas concentrations relative to a standard atmosphere model. An adaptive cubic smoothing-spline estimation of the reflectance is first performed. Concentrations of several gaseous species are then simultaneously retrieved using a nonlinear procedure based on radiative transfer calculations. Applied to AVIRIS spectra simulated from reflectance databases and sensor characteristics, JRGE reduces the errors in H₂O retrieval to 2.87%. For an AVIRIS image acquired over the Quinault prescribed fire, the far-field CO₂ estimate (348 ppm, about 6% to 7% rms) is in agreement with in situ measurements (345-350 ppm), and aerosols yield an underestimation of total atmospheric CO₂ content equal to 5.35% about 2 km downwind of the fire. JRGE smoothes and interpolates the reflectance for gas estimation but also provides nonsmoothed reflectance spectra. JRGE is shown to preserve various mineral absorption features included in the AVIRIS image of the Cuprite Mining District test site.
2003
Numerical methods for scientific computing variational problems and applications, Barcelona, 2003
2002
J. Comput. Phys., p. 301 - 336, 2002
Mathematical Modeling and Numerical Simulation in Continuum Mechanics, Springer, p. 125-135, 2002
2001
2000
1998
Comptes Rendus de l'Academie des Sciences Series I Mathematics, p. 1433-1436, 1998
Theoretical Computer Science, p. 31-44, 1998
abstract
Abstract
Program environments are now commonly used for parallelism on networks of workstations. There is a need for simple and consistent tools to measure algorithm performance on heterogeneous networks. In this work we propose a generalization to heterogeneous networks of the classical efficiency formula E(N) = S(N)/N, where S(N) is the speedup on N processors.
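The classical formula E(N) = S(N)/N can be sketched in a few lines; the heterogeneous variant shown here, which replaces the processor count N by a total relative computing power, is only an illustrative guess at the kind of generalization involved, not necessarily the formula proposed in the paper.

```python
# Classical parallel efficiency, plus one plausible heterogeneous
# generalization (weights = relative processor speeds). The heterogeneous
# formula is illustrative, not taken from the paper.

def speedup(t_serial, t_parallel):
    # S(N) = serial time / parallel time
    return t_serial / t_parallel

def efficiency_homogeneous(t_serial, t_parallel, n):
    # E(N) = S(N) / N
    return speedup(t_serial, t_parallel) / n

def efficiency_heterogeneous(t_serial, t_parallel, powers):
    # Replace the processor count N by the total relative computing power,
    # normalized to the fastest node (a hypothetical generalization).
    fastest = max(powers)
    effective_n = sum(p / fastest for p in powers)
    return speedup(t_serial, t_parallel) / effective_n

# Serial run: 100 s; parallel run on 4 identical nodes: 30 s.
print(efficiency_homogeneous(100.0, 30.0, 4))
# Same timings on 4 nodes of unequal speed (one node twice as slow).
print(efficiency_heterogeneous(100.0, 30.0, [1.0, 1.0, 1.0, 0.5]))
```

On the unequal-speed network the effective processor count drops below 4, so the same timings yield a higher reported efficiency, which is the intuition behind such a generalization.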
1997
Parallel Computing, p. 165-180, 1997
abstract
Abstract
We present in this paper the strong points and limitations of semi-automatic parallelization, data-parallel programming and message-passing programming. We apply these to two numerical algorithms, namely a bi-dimensional Fourier transform and a conjugate gradient program. We implemented each program with each of the different methods on a Cray T3D. The results of these experiments support our proposition that when the three methods are combined, efficiency, portability and ease of parallel programming may be achieved.
1996
Euro-Par'96 Parallel Processing, Springer Berlin Heidelberg, p. 651-664, 1996
abstract
Abstract
Program environments are now commonly used for parallelism on networks of workstations. That is the reason why there is a need for simple and consistent tools to measure algorithm performance on heterogeneous networks. In this work we propose a generalization to heterogeneous networks of the classical efficiency formula E(N)=S(N)/N, where S(N) is the speedup on N processors.
Parallel Computing, p. 289-310, 1996
abstract
Abstract
We propose in this paper a new parallel algorithm for computing the matrix-vector product on a ring of p processors. This solution overlaps as many communications as possible with computation. Some simulations and experiments on a Paragon are given in order to confirm the interest of this algorithm.
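A ring matrix-vector product of this general family can be simulated sequentially: each of the p "processors" owns a block of rows of A and one block of x, multiplies with the x block it currently holds, and passes that block to its ring neighbor. This is a generic sketch of the technique, not the paper's exact algorithm; on a real machine the send/receive would be issued asynchronously so it overlaps with the local multiply.

```python
# Sequential simulation of a ring algorithm for y = A x on p "processors".
def ring_matvec(A, x, p):
    n = len(A)
    assert n % p == 0 and len(x) == n
    b = n // p                                          # block size
    y = [0.0] * n
    held = [x[i * b:(i + 1) * b] for i in range(p)]     # x block held by each proc
    owner = list(range(p))                              # which block each proc holds
    for _ in range(p):
        for i in range(p):                              # local partial product
            j = owner[i]
            for r in range(i * b, (i + 1) * b):
                y[r] += sum(A[r][j * b + c] * held[i][c] for c in range(b))
        # rotate the x blocks one position around the ring (the step that
        # would be overlapped with computation on a real machine)
        held = held[1:] + held[:1]
        owner = owner[1:] + owner[:1]
    return y

A = [[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12], [13, 14, 15, 16]]
x = [1, 1, 1, 1]
print(ring_matvec(A, x, 2))   # matches the direct product A x
```

After p steps every processor has seen every block of x exactly once, so each owns its finished block of y with no global gather needed.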
Parallel Computing, p. 1413-1427, 1996
abstract
Abstract
We present in this paper the results of various communication benchmarks on a Cray T3D MPP system. They are composed of the communication schemes most used in parallel applications and numerical kernels. They have been implemented using the PVM message-passing libraries on the Cray T3D system. For each of these benchmarks, we propose a model depending on the size of the message communicated and the number of processors involved. We verify that the error between the proposed model and the measurements is very small (0.8% on average for point-to-point communications and 3% on average for collective communications).
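Communication models of this kind are typically of the latency-plus-bandwidth form t(m) = α + β·m, fitted to measured timings. The sketch below fits such a model by ordinary least squares; the timing numbers are made up for the example and the exact model form in the paper may differ.

```python
# Fit a point-to-point communication model t(m) = alpha + beta * m
# (startup latency plus per-byte cost) by ordinary least squares.
# Illustrative only: the measurements below are invented.

def fit_linear(sizes, times):
    n = len(sizes)
    mean_m = sum(sizes) / n
    mean_t = sum(times) / n
    beta = sum((m - mean_m) * (t - mean_t) for m, t in zip(sizes, times)) \
           / sum((m - mean_m) ** 2 for m in sizes)
    alpha = mean_t - beta * mean_m
    return alpha, beta

sizes = [1024, 4096, 16384, 65536]      # message sizes in bytes
times = [52.0, 98.0, 280.0, 1010.0]     # measured times in microseconds (made up)
alpha, beta = fit_linear(sizes, times)
print(f"latency ~ {alpha:.1f} us, per-byte cost ~ {beta * 1e3:.3f} ns/byte")
```

Comparing such a fitted model against the raw measurements is how a small average relative error like the 0.8% reported above would be established.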
1995
High-Performance Computing and Networking, Springer Berlin Heidelberg, p. 600-605, 1995
abstract
Abstract
We present in this paper general techniques for overlapping communications in parallel numerical kernels. We first describe some dependency schemes which can be found in most parallel numerical algorithms, and we apply to these schemes methods based on changing the granularity of the computational tasks. The choice of granularity needed to obtain a good overlap depends on the main parameters of the target machines. We apply the preceding overlapping techniques to classical numerical kernels, namely the matrix-vector product and the bi-dimensional FFT, and implemented them on a T3D and a Paragon. The results of these experiments demonstrate the accuracy of this approach.
Journal of Electromagnetic Waves and Applications JEWA 9, p. 503-520, 1995
1994
Parallel Processing: CONPAR 94 --- VAPP VI, Springer Berlin Heidelberg, p. 605-615, 1994
abstract
Abstract
This paper presents a technique for overlapping communications with computations based on pipelined communications. It improves the execution time of most parallel numerical algorithms. Some simple examples are developed to illustrate the efficiency of this technique: the matrix-vector product and the bi-dimensional Fast Fourier Transform. Moreover, we propose a unified formalism to easily express the pipelined versions of these algorithms. Finally, we report some experiments on various parallel machines.
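The benefit of pipelining can be seen with a back-of-the-envelope fill-drain model: splitting a message of size m into k chunks lets each chunk's transfer overlap with the computation on the previous one. This toy model and its parameters are illustrative, not taken from the paper.

```python
# Toy cost model for pipelined communication/computation overlap.
# alpha = startup latency, beta = per-unit transfer cost,
# gamma = per-unit computation cost (all made-up units).

def time_no_pipeline(m, alpha, beta, gamma):
    # send the whole message, then compute on it
    return (alpha + beta * m) + gamma * m

def time_pipelined(m, k, alpha, beta, gamma):
    chunk = m / k
    send = alpha + beta * chunk
    comp = gamma * chunk
    # fill: first chunk must arrive before any computation starts;
    # steady state: the slower of (send, compute) dominates each of the
    # remaining k - 1 stages; drain: last chunk's computation.
    return send + (k - 1) * max(send, comp) + comp

m, alpha, beta, gamma = 1_000_000, 50.0, 0.01, 0.01
print(time_no_pipeline(m, alpha, beta, gamma))
print(time_pipelined(m, 10, alpha, beta, gamma))
```

With these made-up costs the pipelined version approaches the larger of the pure communication and pure computation times, rather than their sum, which is the essence of the technique.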
Annales des Télécomm. , 3-4, p. 194-198, 1994
1993
Proceedings The 2nd International Symposium on High Performance Distributed Computing, p. 121-128, 1993
Revue Science et Défense, p. 89-124, 1993
1992
Annales des Télécomm, n°47, p. 391-399, 1992
Annales des Télécomm, n°47, p. 400-412, 1992
Annales des Télécomm, n°47, p. 413-420, 1992
abstract
Abstract
We consider an axisymmetric object illuminated by a plane wave at axial incidence. The surface fields in the shadow zone are due to creeping waves and are given, far from the axis of symmetry, by the formulas of the geometrical theory of diffraction (GTD). The point on the axis of symmetry is a focus for the creeping waves, and the preceding formulas predict an infinite result there. Using an asymptotic expansion method, we determine a solution for the fields in the vicinity of the focus. This solution tends to the GTD results far from the focus and remains bounded at the focus. The comparison with results obtained by an integral equation method on prolate and oblate spheroids is satisfactory.
Journal d’Acoustique, 5, p. 507-530, 1992
1991
Mathematical and numerical aspects of wave propagation phenomena G. Cohen, L. Halpern, P. Joly, Ed, Siam, 1991