@@ -56,3 +56,9 @@ We need to decide what level of optimization we want to do with gcc.
For example, with O1, the img\_gray.c will *not* vectorize the grayscale
conversion, but with O2, it will.
One thought is that if we compare the speedup between vectorized and non-vectorized, that wil give us an idea of the impact of the vector instructions?
## Basic CPU Configuration
I copied the given class configuration for the CPU microachitectural details.
It uses an Intel Skylake style cache hierarchy.
We should think about if the actual details matter.