VideoCardz believes the full version of this Ampere GPU will likely have 128 CUs, which would result in a massive 8192 CUDA cores. These "full" parts may or may not make it to the market, depending on how well yields are. The site points out that the full GV100 "Volta" GPU offered 84 CUs but we never saw parts with more than 80 enabled.
The parts that showed up in the GeekBench database are likely Tesla or Quadro models.
That said, it is possible that the leaked 124 CU (SM) device is the next-gen Tesla (it would lack just as many SMs as Tesla Volta). We do not know for sure if the next-gen architecture for Tesla is codenamed Ampere, Hopper or something else. However, NVIDIA CEO will host an opening keynote at GTC 2020 (which shifted to an online event) where he might provide an update on these GPUs, next-gen architecture and future roadmaps.NVIDIA is expected to introduce Ampere later this month at its GTC 2020, which will be a web-only event this time due to the coronavirus outbreak.