tf::cudaSequentialCapturing class

class to capture the described graph into a native cudaGraph using a single stream

A sequential capturing algorithm finds a topological order of the described graph and captures dependent GPU tasks using a single stream. All GPU tasks run sequentially without breaking inter dependencies.

Constructors, destructors, conversion operators

cudaSequentialCapturing() defaulted
constructs a sequential optimizer