Novel Time-Domain Interleaved Multiplier Sharing Architecture for Ultra-High Throughput FFT Processors

Authors

  • Nancharaiah Vejendla
  • Dr.R. Ramana Reddy
  • Dr.N.Balaji

Keywords:

Fast Fourier Transform (FFT), high-throughput DSP, multiplier sharing, time-domain interleaving, mixed-radix architecture, OFDM, 5G NR, VLSI design.

Abstract

This paper presents a novel Time-Domain Interleaved Multiplier Sharing (TDIMS) architecture designed to achieve ultra-high throughput in Fast Fourier Transform (FFT) processors while minimizing hardware complexity. The proposed architecture integrates a dynamically reconfigurable radix-4/8 butterfly processing engine with a centralized shared multiplier pool that is time-interleaved across eight parallel data paths. By exploiting temporal and spatial redundancies in twiddle factor computations, TDIMS reduces the number of complex multipliers by up to 35% compared to conventional multipath delay commutator (MDC) designs. The system also employs a predictive twiddle factor addressing scheme to eliminate memory access bottlenecks. Implemented and synthesized and validated on a Xilinx Virtex-7 FPGA, the processor supports mixed-radix operation and achieves a throughput of 128 GS/s at a 2 GHz clock frequency for 1024/2048-point FFTs. Performance evaluations demonstrate a 4.6× improvement in throughput-per-gate efficiency over state-of-the-art designs, along with significant reductions in logic resource utilization and competitive power consumption. The architecture is highly scalable and well-suited for next-generation wireless applications, including 5G NR and optical OFDM systems, where high speed, flexibility, and area efficiency are critical.

References

M. Ayinala, M. Brown, and K. K. Parhi, “Pipelined Parallel FFT Architectures via Folding Transformation,” IEEE Transactions on Very Large Scale Integration (VLSI) Systems, vol. 20, no. 6, pp. 1068–1081, Jun. 2012.

T. Cho and H. Lee, “A High-Speed Low-Complexity Modified ${rm Radix}-2^{5} $ FFT Processor for High Rate WPAN Applications,” IEEE Transactions on Very Large Scale Integration Systems, vol. 21, no. 1, pp. 187–191, Jan. 2013

X. Guan, Y. Fei, and H. Lin, “Hierarchical Design of an Application-Specific Instruction Set Processor for High-Throughput and Scalable FFT Processing,” IEEE Transactions on Very Large Scale Integration (VLSI) Systems, vol. 20, no. 3, pp. 551–563, Mar. 2012.

S.-J. Huang and S.-G. Chen, “A High-Throughput Radix-16 FFT Processor with Parallel and Normal Input/Output Ordering for IEEE 802.15.3c Systems,” IEEE Transactions on Circuits and Systems I-regular Papers, vol. 59, no. 8, pp. 1752–1765, Aug. 2012.

IEEE, “IEEE Standard for Information Technology--Telecommunications and Information Exchange Between Systems Local and Metropolitan Area Networks--Specific Requirements - Part 11: Wireless LAN Medium Access Control (MAC) and Physical Layer,” IEEE Access, Sep. 2024.

I.-C. Lin, Y. Yang, and C.-C. Lin, “High-Performance Low-Power Carry Speculative Addition with Variable Latency,” IEEE Transactions on Very Large Scale Integration (VLSI) Systems, vol. 23, no. 9, pp. 1591–1603, Sep. 2015.

J. H. Baek, S. D. Kim, and M. H. Sunwoo, “SPOCS: Application Specific Signal Processor for OFDM Communication Systems,” Journal of Signal Processing Systems, vol. 53, no. 3, pp. 383–397, May 2008.

M. Ayinala and K. K. Parhi, “FFT Architectures for Real-Valued Signals Based on Radix-$2^{3}$ and Radix-$2^{4}$ Algorithms,” IEEE Transactions on Circuits and Systems I Regular Papers, vol. 60, no. 9, pp. 2422–2430, Sep. 2013.

W. Shieh, Q. Yang, and Y. Ma, “107 Gb/s coherent optical OFDM transmission over 1000-km SSMF fiber using orthogonal band multiplexing,” Optics Express, vol. 16, no. 9, p. 6378, Apr. 2008.

M. Garrido, J. Grajal, del Carmen, and O. Gustafsson, “Pipelined Radix-$2^{k}$ Feedforward FFT Architectures,” IEEE Transactions on Very Large Scale Integration Systems, vol. 21, no. 1, pp. 23–32, Jan. 2013.

S.-N. Tang, J.-Y. Tsai, and T.-Y. Chang, “A 2.4-GS/s FFT Processor for OFDM-Based WPAN Applications,” IEEE Transactions on Circuits and Systems II-Express Briefs, vol. 57, no. 6, pp. 451–455, Jun. 2010.

P. Tsai and C. Lin, “A Generalised Conflict-Free Memory Addressing Scheme for Continuous-Flow Parallel-Processing FFT Processors with Rescheduling,” IEEE Transactions on Very Large Scale Integration Systems, vol. 19, no. 12, pp. 2290–2302, Dec. 2011.

Y. Lin, H.-Y. Liu, and C. Lee, “A 1-GS/s FFT/IFFT processor for UWB applications,” IEEE Journal of Solid-state Circuits, vol. 40, no. 8, pp. 1726–1735, Aug. 2005.

K. Yang, S.-H. Tsai, and Gene, “MDC FFT/IFFT Processor with Variable Length for MIMO-OFDM Systems,” IEEE Transactions on Very Large Scale Integration Systems, vol. 21, no. 4, pp. 720–731, Apr. 2013.

Published

2026-02-09

How to Cite

Vejendla, N., Dr.R., . R. R., & Narayanam, B. (2026). Novel Time-Domain Interleaved Multiplier Sharing Architecture for Ultra-High Throughput FFT Processors. Journal of VLSI Design and Signal Processing, 12(1), 28–38. Retrieved from https://matjournals.net/engineering/index.php/JOVDSP/article/view/3026

Issue

Section

Articles