Guess you got confused with terminology. FP32/64 cores are single-precision (FP32) shader processors and double-precision (FP64) shader processors.
nVidia calls their shader processors CUDA cores. Do you see it now? FP32/64 cores=CUDA cores=shader processors. So it is CUDA that does floating point calculations (nVidia). "Stream Processor" is an AMD-invented term for the same thing - shader processors.
What is a Shader Processing Unit? Also known as Stream Processor (AMD), CUDA Core (NVIDIA), the Shader Processing Unit became the most important component on a Graphics Card, upon the release of Shader Unified Architectures, back in 2006.
Before the release of Unified Shaders, GPUs had 2 types of Shaders: Pixel Shaders and Vertex Shaders, whereas Pixel Shaders were in charge of computing color and Vertex Shaders allowed the control of movement, lightning, position, and color in any scene involving 3D models.
Both are important and GPUs always had a major problem: an unbalance between Pixel and Vertex Shaders. Example: Radeon X1900 - 36 Pixel Shaders and 8 Vertex Shaders. The problem? Depending on the need of 3D application, often Pixel or Vertex would be idling. Pretty much like a computer waiting for its printer to finish a print order. Another term would be: bottleneck.
Well, unified Shaders will switch between the two depending upon what work needs to be done. In other words, you don't have one or the other, ideally, sitting idle waiting for the other group to finish their work and thus allows higher efficiency.