You can get the GPU-targeted assembly (sometimes called SASS by NVIDIA) through specifically compiling to a given GPU then using nvdisasm, which also has a very terse definition of the underlying instruction set in the docs (https://docs.nvidia.com/cuda/cuda-binary-utilities/index.htm...).
But it's one way only, NVIDIA ships a disassembler, but explicitly doesn't ship an assembler.
But it's one way only, NVIDIA ships a disassembler, but explicitly doesn't ship an assembler.