astaroth

Author	SHA1	Message	Date
jpekkila	0db61dd411	Disabled the project-wide maxrregcount flag by default since it is only beneficial for resource-heavy kernels. The maximum register count should be defined per kernel instead if needed.	2019-07-22 12:58:28 +03:00
Miikka Vaisala	a8caad1ade	A draft of the sink particle plan.	2019-07-18 17:34:09 +08:00
Miikka Vaisala	8f46fc1c64	Documentation for the planned sink particle property.	2019-07-18 16:20:00 +08:00
jpekkila	eb589def71	Added some additional warning flags for gcc. Disabled them by default until I get the new warnings fixed.	2019-07-18 08:34:52 +03:00
Miikka Vaisala	074fb26df9	Added TODO_SINK comments. The comments were written to map out what essential part are needed for resolving a system with graviating sink particles. No changes to the code itself.	2019-07-17 14:05:48 +08:00
jpekkila	78aba6428e	Updated the copyright years throughout the project	2019-07-16 14:28:32 +03:00
jpekkila	93fc121f5c	Introduced versions of the asynchronous functions which take a stream as a parameter	2019-07-10 15:49:21 +03:00
jpekkila	bd98eaf9f7	Added a stream to loadDeviceConstant call.	2019-07-10 15:29:54 +03:00
jpekkila	b08d5b26f5	cudaMemcpyToSymbol -> cudaMemcpyToSymbolAsync	2019-07-10 15:05:57 +03:00
jpekkila	976bf05c8d	Wrong scope for num_iterations in the last commit, fixed	2019-07-10 14:37:32 +03:00
jpekkila	866ec8a192	Removed some old hack I used for benchmarking a while back	2019-07-10 14:34:05 +03:00
jpekkila	9af7193ffb	bitbucket-pipelines.yml edited online with Bitbucket	2019-07-10 10:24:28 +00:00
jpekkila	4eb1f74140	bitbucket-pipelines.yml edited online with Bitbucket	2019-07-10 10:12:17 +00:00
jpekkila	a0d4f574b1	bitbucket-pipelines.yml edited online with Bitbucket	2019-07-10 10:07:01 +00:00
jpekkila	22b30d8c78	bitbucket-pipelines.yml edited online with Bitbucket	2019-07-10 09:46:29 +00:00
jpekkila	f38456757e	bitbucket-pipelines.yml edited online with Bitbucket	2019-07-10 09:42:14 +00:00
jpekkila	897a5c820c	Revert "bitbucket-pipelines.yml edited online with Bitbucket" This reverts commit `18e7a727c5`.	2019-07-10 12:34:50 +03:00
jpekkila	499bc1966f	Revert "bitbucket-pipelines.yml edited online with Bitbucket" This reverts commit `b27f598a86`.	2019-07-10 12:34:36 +03:00
jpekkila	b27f598a86	bitbucket-pipelines.yml edited online with Bitbucket	2019-07-10 09:28:30 +00:00
jpekkila	18e7a727c5	bitbucket-pipelines.yml edited online with Bitbucket	2019-07-10 09:20:23 +00:00
jpekkila	e14e19774d	Added a synchronization to benchmark.cc that is now required when calling acIntegrateStep	2019-07-09 19:03:45 +03:00
jpekkila	8cc9281045	Double versions of some sqrt, cos and sin were used in model_rk3.cc instead of the long double versions, fixed.	2019-07-09 19:03:15 +03:00
jpekkila	e6c770cbee	Added a synchronization after acLoadDeviceConstant since it is now stated to be asynchronous	2019-07-09 19:00:08 +03:00
jpekkila	d0b95c39b6	Disabled writing out unnecessary files when auto-optimizing the code	2019-07-09 18:51:04 +03:00
jpekkila	0bda016e17	Reviewed the Astaroth interface. Now there's a clear distinction between synchronous and asynchronous functions. For basic usage, we provide a set of functions that are always safe to call (acIntegrate, acLoad, etc), but because of this, must be quite restricted in the sense that f.ex. the whole mesh must be loaded at once and computations cannot be executed concurrently on multiple GPUs. For more advanced users we provide asynchronous functions (such as acLoadWithOffset). Since we cannot know how the asynchronous functions are called (for example, when the integration step has been fully completed and the halos of neighboring subgrids can be safely communicated between GPUs), the responsibility of synchronization must be left to the user. In the existing implementations we currently use only the basic "safe" set of functions (except in renderer.cc), so the existing functionality has not been changed with these latests commits. Autotests also pass.	2019-07-09 18:42:00 +03:00
jpekkila	1251f61570	Removed a stray acBoundcondStep() in acStore where it definitely shouln't be. Removed code duplication: acBoundcondStep now uses the new acLocalBoundcondStep and acGlobalBoundcondStep functions.	2019-07-09 17:08:18 +03:00
jpekkila	6c7b2dbd8d	Merge branch 'master' into multigpu_optimization_2019-07-05	2019-07-09 16:44:55 +03:00
jpekkila	6ec5e5a2c6	bitbucket-pipelines.yml edited online with Bitbucket	2019-07-09 13:21:16 +00:00
jpekkila	bc6f91f610	bitbucket-pipelines.yml edited online with Bitbucket	2019-07-09 12:53:50 +00:00
jpekkila	fba09d2427	bitbucket-pipelines.yml edited online with Bitbucket	2019-07-09 12:51:36 +00:00
jpekkila	afb17c78d4	bitbucket-pipelines.yml edited online with Bitbucket	2019-07-09 12:48:14 +00:00
jpekkila	314f3c1fcc	bitbucket-pipelines.yml edited online with Bitbucket	2019-07-09 12:31:16 +00:00
jpekkila	5ceb8ddb7e	bitbucket-pipelines.yml edited online with Bitbucket	2019-07-09 12:26:08 +00:00
jpekkila	d56e6f1492	Initial Bitbucket Pipelines configuration	2019-07-09 12:23:15 +00:00
jpekkila	10a98b01a9	Experimental change: now the integration function is automatically optimized during acInit	2019-07-09 14:46:24 +03:00
jpekkila	a086821e7c	Added a function acAutoOptimize to the interface and removed rk3_step_async in kernels.cuh (moved into rkStep)	2019-07-09 14:21:22 +03:00
jpekkila	84d96de42b	Merge branch 'master' into multigpu_optimization_2019-07-05	2019-07-09 13:40:33 +03:00
jpekkila	508d15b578	Switched from math.h to cmath in math_utils.h. The old-school C math functions are bugged/not overloaded properly in GCC < 6.0 when compiling C++.	2019-07-09 13:37:08 +03:00
jpekkila	deebe570da	Merge branch 'master' into multigpu_optimization_2019-07-05	2019-07-08 16:11:24 +03:00
jpekkila	eda2f6543b	Created a new ForcingParams structure and some functions for generating and transferring the forcing parameters to the host/device	2019-07-08 15:43:37 +03:00
Miikka Vaisala	f9be905703	Corrected an unit coversion issue from forcing. Now noticing these because of switching to gcc 8.	2019-07-08 16:43:37 +08:00
Miikka Vaisala	df1ba6264a	Update to readme.	2019-07-08 11:08:45 +08:00
Miikka Vaisala	6ba15c3a7c	props.totalConstMem and props.sharedMemPerBlock cause assembler error while compiling on TIARA gp cluster. Therefore commeted out.	2019-07-08 11:00:12 +08:00
jpekkila	5fdfdeca9e	Multi-GPU optimizations: removed some unnecessary synchronization and divided the calculation of boundary conditions to local and global steps.	2019-07-05 18:21:44 +03:00
jpekkila	f1066a2c11	Added preliminary pragmas for dispatching commands simultaneously to multiple GPUs (commented out)	2019-07-05 17:16:12 +03:00
jpekkila	2092adc0f6	Preparations for multi-GPU optimizations	2019-07-05 15:44:30 +03:00
jpekkila	ce8fe53f91	Moved explanations and comments to the beginning of astaroth.cu. No code changes.	2019-07-05 15:39:52 +03:00
jpekkila	d87eb36f5a	Formatting: brackets around a for loop for consistency	2019-07-05 15:26:19 +03:00
jpekkila	224b91b83a	Added more control for synchronizing streams and halos among the GPUs	2019-07-05 15:17:20 +03:00
jpekkila	332f1a4f40	Reordered some of the functions in astaroth.cu and introduced acExchangeHalos() for synchronizing the part of the grid that is independent from the chosen boundary conditions between subgrids.	2019-07-05 15:01:51 +03:00

... 14 15 16 17 18 ...

955 Commits