Inhaltsverzeichnis
For programmers, accessing tensor cores in any of the Volta, Turing, or Ampere chips is easy: the code simply needs to use a flag to tell the API and drivers that you want to use tensor cores, the data type needs to be one supported by the cores, and the dimensions of the matrices need to be a multiple of 8. In the world of graphics, a huge amount of data needs to be moved about and processed in the form of vectors, all at the same time. But for now, if you want to experience super fast GEMM throughput and all the benefits this can bring, you've got two choices: get yourself a bunch of huge multicore CPUs or just one GPU with tensor cores. The connections between the various data values are given a specific weight -- a number that expresses how important that connection is.