The document proposes a new VLSI algorithm for computing the discrete Hartley transform (DHT) that is well-suited for implementation on a highly parallel and modular VLSI architecture. The algorithm can efficiently split the DHT computation into multiple parallel parts that can be executed concurrently. It also leverages sub-expression sharing to significantly reduce hardware complexity. By efficiently sharing multipliers with the same constant, the number of required multipliers is much smaller than existing algorithms, and these constant multipliers can be efficiently implemented in VLSI.