Note that there are some explanatory texts on larger screens.

plurals
  1. PO
    primarykey
    data
    text
    <p>That question can't be answered without knowing the clock-rates of DSP and the ARM.</p> <p>Here is some background:</p> <p>I just checked the cycles of a floating point multiplication on the c674x DSP:</p> <p>It can issue two multiplications per cycle, and each multiplication has a result latency of three cycles (that means you have to wait three additional cycles before the result appears in the destination register). </p> <p>You can however start two multiplications <em>each</em> cycle because the DSP will not wait for the result. The compiler/assembler will do the required scheduling for you.</p> <p>That only uses two of the available eight functional units of the DSP, so while you do the two multiplications you can per cycle also do:</p> <ul> <li>two load/stores (64 bit wide)</li> <li>six floating point add/subtract instructions (or integer instructions)</li> </ul> <p>Loop control and branching is free and does not cost you anything on the DSP.</p> <p>That makes a total of six floating point operations per cycle with parallel loads/stores and loop control.</p> <p>ARM-NEON on the other hand can, in floating point mode:</p> <p>Issue two multiplications per cycle. Latency is comparable, and the instructions are also pipeline-able like on the DSP. Loading/storing takes extra time as does add/subtract stuff. Loop control and branching will very likely go for free in well written code.</p> <p><strong>So in summary the DSP does three times as much work per cycle as the Cortex-A9 NEON unit.</strong> </p> <p>Now you can check the clock-rates of DSP and the ARM and see what is faster for your job.</p> <p>Oh, one thing: With well-written DSP code you will almost never see a cache miss during loads because you move the data from RAM to the cache using DMA before you access the data. This gives impressive speed advantages for the DSP as well. </p>
    singulars
    1. This table or related slice is empty.
    1. This table or related slice is empty.
    plurals
    1. This table or related slice is empty.
    1. This table or related slice is empty.
    1. This table or related slice is empty.
    1. VO
      singulars
      1. This table or related slice is empty.
    2. VO
      singulars
      1. This table or related slice is empty.
    3. VO
      singulars
      1. This table or related slice is empty.
 

Querying!

 
Guidance

SQuiL has stopped working due to an internal error.

If you are curious you may find further information in the browser console, which is accessible through the devtools (F12).

Reload