Astaroth DSL compiler

Dependencies

Debian/Ubuntu

apt install flex bison build-essential

Usage

  • ./build_acc.sh # Builds the ASPL compiler (acc)
  • ./compile.sh <.sps or .sas source> # Compiles the given stage into CUDA
  • ./test.sh # Tries to compile the sample stages
  • ./clean.sh # Removes directories generated by build_acc.sh and test.sh

Example

  • ./compile.sh src/stencil_assembly.sas # Generates stencil_assembly.cuh
  • ./compile.sh src/stencil_process.sps # Generates stencil_process.cuh

What happens under the hood

The compiler consists of a scanner (flex), a parser (bison), an abstract syntax tree (AST) implementation, and a code generator. The language is defined by the tokens and grammar rules found in acc.l and acc.y. These files are given as input to flex and bison, which generate the scanning and parsing stages of the compiler. The AST built by the parser is defined in ast.h. Finally, the code generator traverses the AST and emits CUDA code.
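
As a rough illustration of the last two pieces, the sketch below shows one way an AST node and a recursive code generator could fit together. The node kinds and field names here are invented for the example; the actual definitions live in ast.h and code_generator.c.

    #include <stdio.h>

    /* Illustrative AST node: node kinds and fields are assumptions,
     * not the definitions from ast.h. */
    typedef enum { NODE_NUMBER, NODE_BINARY_OP } NodeType;

    typedef struct AstNode {
        NodeType type;
        double value;               /* used by NODE_NUMBER */
        char op;                    /* used by NODE_BINARY_OP: '+', '*', ... */
        struct AstNode *lhs, *rhs;  /* children of NODE_BINARY_OP */
    } AstNode;

    /* Traverse the AST recursively and print an equivalent CUDA/C expression. */
    static void generate(const AstNode *node, FILE *out)
    {
        switch (node->type) {
        case NODE_NUMBER:
            fprintf(out, "%g", node->value);
            break;
        case NODE_BINARY_OP:
            fprintf(out, "(");
            generate(node->lhs, out);
            fprintf(out, " %c ", node->op);
            generate(node->rhs, out);
            fprintf(out, ")");
            break;
        }
    }

    int main(void)
    {
        AstNode two   = { NODE_NUMBER, 2.0, 0, NULL, NULL };
        AstNode three = { NODE_NUMBER, 3.0, 0, NULL, NULL };
        AstNode sum   = { NODE_BINARY_OP, 0.0, '+', &two, &three };
        generate(&sum, stdout);  /* prints: (2 + 3) */
        printf("\n");
        return 0;
    }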

ACC compilation stages

In short:

  • Preprocess .ac
  • Compile preprocessed .ac to .cuh
  • Compile .cuh

More detailed:

  1. A parser is generated: bison --verbose -d acc.y
  2. A scanner is generated: flex acc.l
  3. The compiler is built: gcc -std=gnu11 code_generator.c acc.tab.c lex.yy.c -lfl
  4. Source files (.sps and .sas) are preprocessed with the GCC preprocessor and then cleaned of any residual directives (see the sketch after this list). Such directives would be useful if the code were compiled further with GCC, but they are not needed when compiling with ACC and are not recognized by our grammar.
  5. Either the stencil processing stage (.sps) or the stencil assembly stage (.sas) is generated by passing the preprocessed file to acc, which emits the final CUDA code.
  6. Compilation is continued with the NVIDIA CUDA compiler (nvcc)
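
As a concrete illustration of step 4, the minimal sketch below (an assumption about the cleanup, not the actual implementation) filters out the linemarkers that gcc -E leaves behind:

    #include <stdio.h>

    /* Drop preprocessor linemarkers such as
     *     # 1 "src/stencil_process.sps"
     * which gcc -E emits but which our grammar does not recognize. */
    int main(void)
    {
        char line[4096];
        while (fgets(line, sizeof line, stdin)) {
            if (line[0] == '#')
                continue;  /* residual directive: skip it */
            fputs(line, stdout);
        }
        return 0;
    }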

Even more detailed:

The NVIDIA CUDA compiler compiles .cuh to .fatbin, which is embedded into a C++ binary containing the host code of the program. A fatbin contains .cubin files, which hold the configuration of the GPU and the kernels in streaming assembly code (.sass). We could also compile for a virtual architecture (.ptx) instead of the actual hardware-specific machine code (.cubin) by passing the -code=compute_XX flag to nvcc; the CUDA sources would then be compiled at runtime (just-in-time compilation, JIT) when the CUDA context is created. However, we always know which architecture we want to run the code on, and JIT compilation would only increase the time it takes to launch the program.

nvcc -DAC_DOUBLE_PRECISION=1 -ptx --relocatable-device-code true -O3 -std=c++11 --maxrregcount=255 -ftz=true -gencode arch=compute_60,code=sm_60 device.cu -I ../../include -I ../../
nvcc -DAC_DOUBLE_PRECISION=1 -cubin --relocatable-device-code true -O3 -std=c++11 --maxrregcount=255 -ftz=true -gencode arch=compute_60,code=sm_60 device.cu -I ../../include -I ../../
cuobjdump --dump-sass device.cubin > device.sass
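
For comparison, the JIT path described above corresponds roughly to loading the module at runtime with the CUDA driver API, as in the sketch below. The module and kernel names are placeholders; loading device.ptx triggers JIT compilation for the current GPU, while loading device.cubin would not.

    #include <stdio.h>
    #include <cuda.h>

    /* Build with, e.g.: gcc jit_load.c -I/usr/local/cuda/include -lcuda */
    int main(void)
    {
        CUdevice dev;
        CUcontext ctx;
        CUmodule mod;
        CUfunction fn;

        cuInit(0);
        cuDeviceGet(&dev, 0);
        cuCtxCreate(&ctx, 0, dev);

        /* PTX is JIT-compiled on load; a cubin would be loaded as-is. */
        if (cuModuleLoad(&mod, "device.ptx") != CUDA_SUCCESS) {
            fprintf(stderr, "failed to load module\n");
            return 1;
        }
        cuModuleGetFunction(&fn, mod, "my_kernel");  /* placeholder name */
        /* ... set up arguments and launch with cuLaunchKernel ... */

        cuModuleUnload(mod);
        cuCtxDestroy(ctx);
        return 0;
    }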