Expand this Topic clickable element to expand a topic
Skip to content
Optica Publishing Group

Monolithically integrated 112 Gbps PAM4 optical transmitter and receiver in a 45 nm CMOS-silicon photonics process

Open Access Open Access

Abstract

We demonstrate a transmitter and receiver in a silicon photonics platform for O-band optical communication that monolithically incorporates a modulator driver, traveling-wave Mach-Zehnder modulator, control circuitry, photodetector, and transimpedance amplifier (TIA) in the GlobalFoundries Fotonix (45SPCLO) platform. The transmitter and receiver show an open 112 Gbps PAM4 eye at a 4.3 pJ/bit energy efficiency, not including the laser. Extensive use of gain-peaking enables our modulator driver and TIA to achieve the high bandwidths needed in the 45 nm CMOS-silicon photonics process. Our results suggest an alternative to the frequent approach of bump-bonding BiCMOS drivers and TIAs to silicon photonics.

© 2023 Optica Publishing Group under the terms of the Optica Open Access Publishing Agreement

1. Introduction

Performance per Watt is one of the critical metrics by which a computer system is judged. The numerator, performance, is typically expressed as the number of operations per second performed when the computer system is operating at peak throughput. The denominator, Watts, is the overall power consumption of the system during this period. Power dissipated in the processors, switches, communication links, power supplies, memory, cooling systems, and associated supporting infrastructure all contribute to the denominator. As overall bandwidths increase, the power required for data movement in an exascale computing system becomes a significant contributor to the power consumption of a system. Existing systems implement more than 1000 Pbps of total memory bandwidth [1]. A future disaggregated memory system could see further scaling with a significant fraction of the bandwidth transmitted in the optical domain [2]. In terms of energy efficiency of such links, an end-to-end optical link using pluggable optics may consume on the order of 25 pJ/bit, including the SERDES in the host ASIC [3]. At 2000 Pbps of bidirectional traffic and 25 pJ/bit, the power dissipated in optical links would multiply to 50 MW for such an optically-connected memory system, significantly more than the entire system power of existing Top500 HPC systems [4]. Here, we have ignored the power from optical links to support switching and other functionality to express orders-of-magnitude. It follows that continued bandwidth scaling for new computer architectures will require further improvements in the energy efficiency of optical links in order to preserve or improve performance per Watt.

Co-packaged optics improves the performance per Watt of computer systems by eliminating the need for additional electrical retimer chips, such as those used in conventional pluggable optics. Figure 1(a) shows an illustration of the electrical connectivity from a host ASIC (a switch or processor, for example) to a pluggable optical module that provides the interface to optical fibers. Inside of the pluggable module, it is common to include a DSP chip, which converts the host-side interface, such as 8 × 50 Gbps signaling, into an optical interface, such as 4 × 100 Gbps signaling [5]. Crucially, the DSP chip also provides retiming and equalization functionality, as closing the full link from the ASIC through the pluggable connector to the optics directly would be too challenging. Figure 1(b,c) shows example links in which the DSP chip is not required. In (b), the separately-packaged optical engine is located near the ASIC, similar to the orientation inside of the pluggable, while in (c), the optical engine and the ASIC are co-packaged together. While both (b) and (c) eliminate the DSP chip, the co-packaged solution requires less signal equalization, which in turn can lead to improved optical sensitivity and/or lower electrical power consumption within the SERDES.

 figure: Fig. 1.

Fig. 1. (a) A conventional optical link from an ASIC on a host PCB to a pluggable optical module with a retimer or DSP chip. (b) An alternative link in which the optical engine is placed in a package near the ASIC, (c) Co-packaged optics in which the optical engine is on the same substrate as the ASIC. The arrangements shown in (b) and (c) provide net power consumption savings over (a) by eliminating the retimer/DSP chip.

Download Full Size | PDF

The requirements of a co-packaged optical link motivate a technology that can support high-density I/O and excellent energy efficiency [6,7]. To meet these needs, silicon photonics (SiPh) emerges as a promising platform for next-generation interconnects because of the ease with which it is possible to take advantage of existing multi-chip module assembly infrastructure [8,9]. In particular, a monolithic CMOS-SiPh process is ideal for CPO because it enables: (1) Device density that exceeds conventional pad-pitch limitations of wirebond, flip-chip, or advanced packaging techniques, (2) simplified electrical assembly compared to heterogeneous electrical amplifier and photonics chipsets, and (3) close integration between ASICs and optical I/O [1012].

However, one large disadvantage of a monolithic CMOS-SiPh process is that, typically, the performance of the CMOS is not state-of-the-art [13]. Thus, an open question exists whether such monolithic CMOS-SiPh processes can enable optical links at the latest-generation of bit rates, modulation formats, and SERDES capabilities. Importantly, for most chip-to-chip communications inside of exascale compute systems, this typically requires interoperability with the interfaces that support PCB or backplane communications. If a link can be connected electrically rather than optically, it typically should from an energy & cost perspective. It is also desirable to be able to use a single ASIC design for both electrical connectivity and optical connectivity. Thus, interfacing optics to ASICs at the latest generation of standard 112 Gbps, or similar, rates enables a flexible I/O architecture.

Designing both a transmitter and a receiver in a monolithic CMOS-SiPh process at these data rates is as of yet undemonstrated in literature. Many results have demonstrated heterogeneous chipsets integrated in a 2D, 2.5D, or 3D assembly with separate electrical ICs and photonic ICs [3,5,1421]. Prior monolithic literature with both a transmitter and receiver on-wafer demonstrated links at up to 25 Gbps [2224] with published simulation models for a monolithic die at 106 Gbps [25]. Prior individual monolithic results (i.e., only a transmitter or only a receiver) demonstrate a transmitter at 44 Gbps NRZ [26], a receiver at 56Gbps NRZ [27], and a coherent receiver at up to 66 Gbaud QPSK [28], all in a SiGe BiCMOS-SiPh monolithic process; as well as a 100 Gbps PAM4 transmitter [29], a 56 Gbps PAM4 transmitter [30], and separately a 28 Gbps NRZ receiver [31] in a CMOS-SiPh process.

In this paper, we describe in detail the first CMOS-SiPh monolithic transmitter and receiver for 112 Gbps optical communication designed and fabricated in the GlobalFoundries Fotonix (45SPCLO) platform. The chips operate near 1310 nm and monolithically incorporate all the electrical and optical functionality for an optical transmitter and receiver, except for the laser which is packaged separately and fiber-coupled. The transmitter (TX) and receiver (RX) photonic-analog chips (PAC) are fabricated on the same wafer without any process modifications between the two designs. The TX PAC comprises a linear modulator driver, traveling-wave Mach-Zehnder (TWMZ) modulator, an integrated feedback control system, and digital serial peripheral interface (SPI). The RX PAC comprises an integrated photodetector, linear transimpedance amplifier (TIA), and a separate SPI bus. Both the TX PAC and RX PAC are shown to operate with open eyes at 112 Gbps PAM4.

2. Design and fabrication

The transmitter and receiver are designed as separate chips on the same reticle and wafer. No process splits are utilized between the two chips. Each TX PAC contains 16 channels with 8 total laser inputs where each laser is split two ways. Each RX PAC also contains 16 channels with on-chip polarization diversity. V-groove edge couplers are used as spot-size converters between optical fibers and on-chip waveguides [12]. Grating-couplers are used throughout both chips for wafer-level and some chip-level testing. Refer to Fig. 2 for a schematic representation of the on-chip signal paths.

 figure: Fig. 2.

Fig. 2. Schematic representation of the on-chip signal path, including spot-size converter (SSC), 1 × 2 laser splitter, monitor photodiodes (MPD), serial peripheral interface (SPI) for control, traveling wave Mach-Zehnder modulator (TWMZ), high-speed pn-junction phase shifter (PN), thermal phase shifter (THPH), adjustable termination resistor (Rterm), high-speed photodetectors (HSPD), polarization splitter & rotators (PSR), grating couplers for testing (GC), and integrated control system (CTRL).

Download Full Size | PDF

A current mode driver chain is used to drive a differential TWMZ that is terminated by an adjustable termination resistor; the resistance range was designed to be 18-27 Ω for the differential mode with 4 bits of adjustment [32]. Based on Process Development Kit (PDK) documents from the foundry, the TWMZ performance is calculated to achieve around 4.6 V Vπ with a 3 dB bandwidth of around 26 GHz and a low-speed 3 dB cutoff of 6 MHz, a differential impedance of 24 ohm, and optical loss of 1.5 dB at full transmission. The pn junctions in each arm are each driven in push-pull configuration and see the full drive voltage, with an overall differential transmission line [33,34]. Therefore, the driver is designed to be loaded by a 24 Ω differential impedance transmission line and provide 1.6 V peak-to-peak differential voltage. We note the ability to use such a low differential-impedance with minimal penalty is enabled by the lack of transmission line between the modulator and driver, an aspect of our monolithic integration. In combination with the modulator, the TX PAC is calculated to achieve a low-frequency phase shift of 1.1 radians. During operation, applied equalization will lower the total modulation depth. The driver chain consists of an input termination followed by 5 Current Mode Logic (CML) stages [35]. The input of the driver chain is matched to a 100 Ω differential resistor and DC coupled with a common mode voltage of 650 mV.

All driver stages are CML-based amplifiers with different functionalities, as shown in Fig. 3(a). The input stage is sized to provide minimum capacitive loading at the input to achieve low input return loss. The second stage is a CTLE with switchable RC source degeneration to provide high frequency peaking for bandwidth improvement. The third stage is a variable gain amplifier (VGA) to adjust the driver gain. The input termination and CML stages benefit from T-coils at their output to improve the bandwidth. The last stage of the driver, shown in Fig. 3(b), is a cascode CML stage with thick oxide FETs as output devices to tolerate high output swing. In order to achieve the optimal linear swing, the output stage is powered using a 1.8 V supply. The driver is double-terminated with adjustable resistors to reduce the reflections at both ends. However, the termination on the driver side must be kept higher to preserve the voltage swing amplitude. Figure 3(c) plots the simulated |EOS21| response of the PAC-TX from driver input to fiber output at nominal process and temperature corners. We define EOS21 in the usual manner; a decrease of the output RF power from a photodetector attached to the modulated signal by 50% corresponds to a 3 dB rolloff. All simulations are performed at 75C with Typical-Typical (TT) corners, with post-layout extracted netlists.

 figure: Fig. 3.

Fig. 3. (a) Modulator driver architecture, (b) output driver stage schematic diagram, and (c) simulated |EOS21| response of the TX PAC. For 3 (c), the TWMZ is biased at the 3 dB point of maximal linear response.

Download Full Size | PDF

The RX lane architecture is shown in Fig. 4(a). An illuminated single-ended photodiode drives a balanced RX implementation. The photodiode is a PDK element with nominal performance of 0.9 A/W, 50 GHz bandwidth, and under 40 nA dark current reported by the foundry [12]. A dark photodiode is connected to the complementary RX input. The first stage is a pseudo-differential inverter-based shunt-feedback TIA, followed by an inverter-based Cherry-Hooper amplifier. Both inverter-based stages have symmetric T-coil inductive peaking [36] and are powered by an externally regulated low-noise power supply. The subsequent stages are implemented as fully differential CML-based amplifiers with no supply regulation. The first CML stage performs single-ended to differential (S2D) conversion, followed by a programmable continuous-time linear equalization (CTLE), a programmable gain amplifier (PGA), a pre-driver, and an output pad driver. Most CML stages use bridged-shunt inductive peaking networks, chosen for robustness, while the pad driver has a symmetric T-coil network for broadband output matching [37]. We note that both the TIA and modulator driver make heavy use of inductive peaking to substantially enhance the bandwidth of both circuits. This was required to meet the system bandwidth requirements in a 45 nm CMOS node [38].

 figure: Fig. 4.

Fig. 4. (a) TIA & pad driver architecture and (b) simulated AC response of the PAC RX TIA loaded with the photodetector at three different gain settings.

Download Full Size | PDF

To minimize power supply noise both at the input (PD bias) and output (TIA) and achieve better linearity at the single-ended to differential block, as shown in Fig. 4(a), a dummy PD is used to balance the input to the TIA [3941]. Additionally, employing a dummy PD allows the option of integrating a PD emulator on-chip for testing purposes without adversely affecting bandwidth and noise, should a PD emulator be added to the main PD. Nevertheless, the dummy PD architecture does contribute to an increase in the input-referred noise.

Independent common-mode feedback circuits sense the output of each Cherry-Hooper amplifier and feed back a DC current into the respective photo-detector anode. This scheme allows for optimal S2D input common-mode biasing, and the use of independent voltage references allow for foreground offset calibration. The low-pass filter cut-off is lower than 50 kHz, resulting in negligible baseline wander at 112 Gbps. On-chip current and voltage generators bias the RX, and an SPI interface allows for digital programming of internal registers. At high gain, simulated noise at 30 GHz is 17.7 pA/Hz1/2. Figure 4(b) plots the simulated AC response at three different gain settings, which we describe as “low”, “medium” and “high”, having transimpedance gains of 63 dB-ohm, 66 dB-ohm and 68 dB-ohm respectively.

The wafers are fabricated in GlobalFoundries’ 300 mm 45 nm SOI process (GF Fotonix). The cross-sectional diagram of the process is shown in Fig. 5. The wafers monolithically integrate high-speed transistors, modulators, photodetectors, ridge waveguides, rib waveguides, silicon nitride waveguides, V-groove edge couplers, and a full metal stack.

 figure: Fig. 5.

Fig. 5. (a) Cross-section diagram of the IOSMF with v-groove and the attached fiber. (b) Cross-section showing the front-end and middle-of-line of 45CLO technology. Figure taken from: [12].

Download Full Size | PDF

3. Experimental results

The transmitter and receiver chips are designed for co-packaged applications wherein the full-flow wafers are bumped with V-groove edge couplers and the high-speed I/O are directly connected to the host ASIC SERDES. High-speed probes are used for testing the individual performance of the transmitter and receiver since the flip-chip substrate application circuit limits access to the high-speed inputs and outputs of the chip. Figure 6 shows photographs of the full transmitter and receiver chips, as well as the devices under test. The TX PAC uses a separate glass array of polarization-maintaining fibers for laser inputs and standard single-mode fibers for modulator outputs. The RX PAC uses a single array of standard single-mode fiber. The low-speed I/O and power supplies are connected via wirebonds to a printed circuit board while the high-speed I/O is probed in a ground-signal-signal-ground configuration for both TX and RX PACs.

 figure: Fig. 6.

Fig. 6. Photographs of (a) the TX and RX chips on-wafer, (b) the TX PAC under test, and (c) the RX PAC under test.

Download Full Size | PDF

Figure 7 shows a block diagram of the transmitter and receiver test setups. A Keysight M8194A arbitrary waveform generator (AWG) with 120 GSa/s generates the high-speed PAM4 signal using root-raised cosine pulse shaping, while a Keysight DCA-N1092C oscilloscope (DCA) with 28 GHz bandwidth captures the signal. An FPGA and microcontroller are used to communicate to the on-chip SPI and external DC power supplies provide power and biasing. For receiver characterization, an external EOSpace 40 GHz lithium niobate modulator is used to generate the input optical waveform. For all testing, the laser outputs near 1310 nm. The laser is an Applied Optoelectronics Inc DFB laser with 100 kHz linewidth.

 figure: Fig. 7.

Fig. 7. Block diagram of the (a) transmitter and (b) receiver experimental test setup; the fibers indicated are short, on the order of 5 m or less, and just used to connect different pieces of test equipment. The monitor PD indicated in (a) is used to monitor optical output power and TWMZ bias point. We utilize a Source Measurement Unit (SMU) to provide a variable DC bias to the commercial Mach-Zehnder modulator (MZM) used for RX PAC testing, as well as a Power Supply Unit (PSU) to power the SHF807C, which is an RF amplifier.

Download Full Size | PDF

The TX PAC is tested with a 50 mW input in the PM fiber. On-chip, each laser input is split once so that each channel requires approximately 25 mW of laser power. The PDFA optical amplifier is used to amplify the Tx output to approximately 0 dBm before entering the DCA. During operation, the on-chip control loops bias and stabilize the thermal tuner inside the TWMZ. A SPI command is used to flip a single polarity bit that determines whether the control loop stabilizes on a rising edge or falling edge of the Mach-Zehnder transfer function. On-chip ADCs, DACs, low-speed TIAs, and low dropout voltage regulators sample the MPD photocurrent and drive the thermal tuners. During transmitter output eye diagram characterization, the s-parameter response for the 1.85 mm cabling up to the probe input (but not including the probe or TX PAC) is equalized from the AWG output. A 5-tap T-spaced feed forward equalizer is then optimized & applied at the AWG to compensate for the probe and TX PAC electro-optic response. The measured fiber-to-fiber insertion loss of the transmitter when biased at the maximum TWMZ transmission point is 15.3 dB; we note this includes an intrinsic 3 dB loss for splitting each laser into two channels.

At 112 Gbps we measure clean open eyes from the TX PAC, as shown in Fig. 8. We characterize TDECQ out of the transmitter under a variety of scenarios. TDECQ (transmitter and dispersion eye closure quaternary) is a figure of merit for the eye quality of PAM4 optical transmitters [42]. The TX PAC measures an extinction ratio of 3.8 dB and a TDECQ of 2.3 dB with a standard 5-tap feed forward equalizer (FFE) [43] which meets the IEEE 802.3 requirement for a 400GBASE-DR4 transmitter. With the aid of a continuous time linear equalizer (CTLE) analog filter and additional FFE taps, the TX eye opens up further. The CTLE filter implemented on the DCA has a 21 GHz zero, 27 GHz first pole, and 32 GHz second pole. In addition to the bandwidth limitations from the TX PAC, there is also significant rolloff from the DCA input bandwidth of 28 GHz and other elements in the test setup that are compensated for with this equalization. A 3.8 dB extinction ratio indicates an effective 0.86 radian peak to peak phase shift, which can be compared to the 1.1 radians we expected based on calculation in the low-speed limit for our modulator + driver.

 figure: Fig. 8.

Fig. 8. Measured eye diagrams from TX PAC testing at 112 Gbps with various filters implemented on the DCA, sampled with a baud/2 bandwidth receiver: (a) A “raw” TX without further signal processing; (b) processed with an analog CTLE where the CTLE has a 21 GHz zero, 27 GHz first pole, and 32 GHz second pole; (c) with a 5-tap FFE; and (d) with the CTLE and a 15-tap FFE.

Download Full Size | PDF

Similar to the TX PAC performance, the PAC RX also shows open eyes when tested at 112 Gbps. A modulated signal from the reference modulator with mean power of 1.0 mW is input at the SM fiber. We stress this reference modulator is an ideal optical test component, and not the TX PAC. A polarization controller is adjusted to ensure maximal transmission of input light into the RX PAC, which corresponds to TE input polarization. Measured fiber-referred high-speed photodiode responsivity for the assembly under test is 245 mA/W for TE-polarized and 210 mA/W for TM-polarized input light, corresponding to 0.7 dB polarization-dependent loss. Figure 9 shows the optical input to the PAC RX from the reference modulator in (a), the sampled eye after a baud/2 receiver in (b), and the filtered eye after a feed forward equalizer in (c). At 112 Gbps, only a 5-tap FFE is required to open the eye, well within the capabilities of an LR SERDES. Many long reach SERDES also implement CTLE and/or a decision feedback equalizer, which could further improve performance [4446]. The average DC photocurrent seen on the photodetector during test was 150 uA. During the test, the high gain setting was used for the TIA.

 figure: Fig. 9.

Fig. 9. Measured eye diagrams from RX PAC testing at 112 Gbps. Shown in (a) is the optical input from the reference modulator used as stimulus. (b) Shows the “raw” eye after a baud/2 bandwidth sampler, without equalization, in the DCA and (c) shows the eye after a 5-tap FFE. The DCA did not use a pulse-shaping filter.

Download Full Size | PDF

In operation our modulator driver, TIA, biasing, and active control require 483 mW. At 112 Gbps, this corresponds to 4.3 pJ/bit, not including the laser or SERDES. Future TX PAC devices may incorporate a local undercut, which would reduce the required thermal tuning power. A breakdown of the power consumption is shown in Table 1. All values are measurements except for the heater and associated drive circuitry overhead, which is projected at a required optical phase shift of π/2 radians.

Tables Icon

Table 1. Breakdown of power consumption for the TX PAC and RX PAC and extrapolation to typical total power consumption for transmitter and receiver

To help put our work in a wider context, we add energy/bit metrics for a variety of external modulator drivers and TIAs in Tables 2 and 3. We note that these competing results are not monolithically integrated with silicon photonics and are in processes that typically outperform 45 nm CMOS. Our work is competitive with or consumes lower power than all of these competing results, with the exception of two low-node CMOS TIAs from significantly more advanced processes than 45 nm CMOS. To provide a sense for the typical power budget from the rest of the SERDES chain, we include a table showing the power budget for a TX and RX portion of a SERDES in Table 4.

Tables Icon

Table 2. Comparison of high-baud and linear modulator drivers

Tables Icon

Table 3. Comparison of high-baud and linear TIAs

Tables Icon

Table 4. Power budget for TX ad RX portions of SERDES

4. Conclusion

Monolithic CMOS-silicon photonics processes are attractive platforms for next-generation co-packaged optics applications. A critical unanswered question up to now is whether such platforms are capable of operating at high-speed, particularly in 45 nm CMOS. We demonstrate that such a monolithic platform is suitable for 112 Gbps optical communications. Co-packaging these transmitter and receiver chips with a processor, switch, or other ASIC will ultimately result in improvements in computer system performance per Watt compared to what would be achieved by utilizing conventional pluggable optics.

Acknowledgments

The authors would like to thank Anthony Yu and the GlobalFoundries team for their support during the development and fabrication of these chips. Special thanks go to Jenny Huang and Adrian Chang for the assembly of the tested modules. The authors would also like to thank our CEO, Marcus Gomez; CTO, Mitchell Nahmias; president, Michael Hochberg; SVP of engineering, Dominick Scordo; chief architect, David Baker; program manager, Jonathan Centofanti; and entire Luminous Computing team for their invaluable support, guidance, and discussions.

Disclosures

The authors declare no conflicts of interest.

Data availability

All data needed to evaluate the conclusions in the paper are available in the main text.

References

1. “Specifications - Supercomputer Fugaku: Fujitsu Global,” https://www.fujitsu.com/global/about/innovation/fugaku/specifications/.

2. J. Gonzalez, A. Gazman, M. Hattink, M. G. Palma, M. Bahadori, R. Rubio-Noriega, L. Orosa, M. Glick, O. Mutlu, K. Bergman, and R. Azevedo, “Optically connected memory for disaggregated data centers,” J. Parallel and Distributed Computing 163, 300–312 (2022). [CrossRef]  

3. R. Nagarajan, L. Ding, R. Coccioli, M. Kato, R. Tan, P. Tumne, M. Patterson, and L. Liu, “2.5D Heterogeneous Integration for Silicon Photonics Engines in Optical Transceivers,” IEEE J. Select. Topics Quantum Electron. 29(3), 1–9 (2022). [CrossRef]  

4. https://www.top500.org/.

5. C. Xie, P. Magill, D. Li, Y. Zhang, L. Zheng, A. Wang, Y. Bao, C. Sui, M. Streshinsky, J. Mu, S. Yang, and W. Sun, “Real-Time Demonstration of Silicon-Photonics-Based QSFP-DD 400GBASE-DR4 Transceivers for Datacenter Applications,” in Optical Fiber Communication Conference (Optical Society of America, 2020), paper T3H.5.

6. C. Minkenberg, R. Krishnaswamy, A. Zilkie, and D. Nelson, “Co-packaged datacenter optics: Opportunities and challenges,” IET Optoelectron. 15(2), 77–91 (2021). [CrossRef]  

7. B. G. Lee, N. Nedovic, T. H. Greer III, and C. T. Gray, “Beyond CPO: A Motivation and Approach for Bringing Optics Onto the Silicon Interposer,” IEEE Journal of Lightwave Technology 41(4), 1152–1162 (2023). [CrossRef]  

8. M. Bohn, P. Magill, M. Hochberg, D. Scordo, A. Novack, and Matthew Streshinsky, “Next-Generation Silicon Photonic Interconnect Solutions,” in Optical Fiber Communication Conference (Optical Society of America, 2019), paper M3J.3.

9. M. Lipson, “The revolution of silicon photonics,” Nat. Mater. 21(9), 974–975 (2022). [CrossRef]  

10. V. Stojanovic, R. J. Ram, M. Popovic, S. Lin, S. Moazeni, M. Wade, C. Sun, L. Alloatti, A. Atabaki, F. Pavanello, N. Mehta, and P. Bhargava, “Monolithic silicon-photonic platforms in state-of-the-art CMOS SOI processes,” Opt. Express 26(10), 13106–13121 (2018). [CrossRef]  

11. K. Giewont, K. Nummy, F. A. Anderson, J. Ayala, T. Barwicz, Y. Bian, K. K. Dezfulian, D. M. Gill, T. Houghton, S. Hu, B. Peng, M. Rakowski, S. Rauch III, J. C. Rosenberg, A. Sahin, I. Stobert, and A. Stricker, “300-mm Monolithic Silicon Photonics Foundry Technology,” IEEE J. Select. Topics Quantum Electron. 25(5), 1–11 (2019). [CrossRef]  

12. M. Rakowski, C. Meagher, K. Nummy, et al., “45 nm CMOS - Silicon Photonics Monolithic Technology (45CLO) for next-generation, low power and high speed optical interconnects,” in Optical Fiber Communication Conference (Optical Society of America, 2020), paper T3H.3.

13. M. Streshinsky, R. Ding, Y. Liu, A. Novack, C. Galland, A. E.-J. Lim, P. Guo-Qiang Lo, T. Baehr-Jones, and M. Hochberg, “The road to affordable, large-scale silicon photonics,” Optics and Photonics News 24(9), 32–39 (2013). [CrossRef]  

14. J. S. Levy, E. Timurdogan, Y.-S. Kuo, et al., “4×100Gb/s PAM4 Multi-Channel Silicon Photonic Chipset with Hybrid Integration of III-V DFB Lasers and Electro-Absorption Modulators,” IEEE J. Lightwave Technology, Early Access 1–10 (2023).

15. Y. Zhao, L. Chen, R. Aroca, N. Zhu, D. Ton, D. Inglis, and C. Doerr, “Silicon Photonic Based Stacked Die Assembly for 4√ó200-Gbit/s Short-Reach Transmission,” in Optical Fiber Communication Conference (Optical Society of America, 2021), paper F2F.2.

16. E. Timurdogan, Z. Su, R.-J. Shiue, M. J. Byrd, C. V. Poulton, K. Jabon, C. DeRose, B. R. Moss, E. S. Hosseini, I. Duzevik, M. Whitson, R. P. Millman Jr., D. A. Atlas, and M. R. Watts, “400 G Silicon Photonics Integrated Circuit Transceiver Chipsets for CPO, OBO, and Pluggable Modules,” in Optical Fiber Communication Conference (Optical Society of America, 2020), paper T3H.2.

17. M. Traverso, M. Mazzini, K. Lakshmikumar, S. Sunder, A. Kurylak, C. Appel, C. Muzio, R. Tummidi, A. Cervasio, M. Nadeau, W. Li, J. Neiman, and M. Webster, “Integrated Silicon Photonics Transceiver Module for 100Gbit/s 20 km Transmission,” in Optical Fiber Communication Conference (Optical Society of America, 2021), paper M3A.3.

18. R. Nagarajan, M. Filer, Y. Fu, M. Kato, T. Rope, and J. Stewart, “Silicon Photonics-Based 100 Gbit/s, PAM4, DWDM Data Center Interconnects,” J. Opt. Commun. Netw. 10(7), B25–B36 (2018). [CrossRef]  

19. S. Fathololoumi, C. Malouin, D. Hui, K. Al-hemyari, K. Nguyen, P. Seddighian, Y.-J. Chen, Y. Wang, A. Yan, R. Defrees, T. Liljeberg, and L. Liao, “Highly Integrated 4 Tbps Silicon Photonic IC for Compute Fabric Connectivity,” in Symposium on High-Performance Interconnects, pp. 1–4 (IEEE, 2022).

20. V. Raghunathan, K. Muth, B. Vinnakota, P. Venugopal, R. Schaevitz, and M. Mehta, “SCIP to the Next Generation of Computing: Extending More than Moore with Silicon Photonics Chiplets in Package (SCIP),” in 23rd International Symposium on Quality Electronic Design, 1–6 (2022).

21. R.-C. Yu and D. Pan, “Silicon Photonics Applications for 5 G and Data Centers,” in Optical Fiber Communication Conference (Optical Society of America, 2021), paper M3A.5.

22. T. Pinguet, P. M. De Dobbelaere, D. Foltz, S. Gloeckner, S. Hovey, Y. Liang, M. Mack, G. Masini, A. Mekis, M. Peterson, T. Pinguet, S. Sahni, J. Schramm, and M. Sharp, L. Verslegers, B. P. Welch, K. Yokoyama, and S. Yu, “25 Gb/s Silicon Photonic Transceivers,” in 9th International Conference on Group IV Photonics, paper ThC1 (2012).

23. F. Testa, M. T. Wade, M. Lostedt, F. Cavaliere, M. Romagnoli, and V. Stojanovic, “Optical Interconnects for Future Advanced Antenna Systems: Architectures, Requirements and Technologies,” J. Lightwave Technol. 40(2), 393–403 (2022). [CrossRef]  

24. J. F. Buckwalter, X. Zheng, G. Li, K. Raj, and A. V. Krishnamoorthy, “A Monolithic 25-Gb/s Transceiver With Photonic Ring Modulators and Ge Detectors in a 130-nm CMOS SOI Process,” IEEE J. Solid-State Circuits 47(6), 1309–1322 (2012). [CrossRef]  

25. C. Schulien, “Enabling Scalable Application-Specific Optical Engines (ASOE) by Monolithic Integration of Photonics and Electronics,” in Hot Chips 34 Symposium, 1–32 (IEEE, 2022).

26. F. Iseini, M. Inac, A. Malignaggi, A. Peczek, and G. Kahmen, “Monolithically Integrated Optoelectronic Transmitter based on Segmented Mach-Zehnder Modulator in EPIC 250 nm BiCMOS Technology,” in 23rd Topical Meeting on Silicon Monolithic Integrated Circuits in RF Systems, 51–54 (IEEE, 2023).

27. G. Dziallas, A. Fatemi, A. Peczek, L. Zimmermann, A. Malignaggi, and G. Kahmen, “A 56-Gb/s Optical Receiver With 2.08-µA Noise Monolithically Integrated into a 250-nm SiGe BiCMOS Technology,” IEEE Trans. Microwave Theory Techn. 70(1), 392–401 (2022). [CrossRef]  

28. P. M. Seiler, K. Voigt, A. Peczek, G. Georgieva, S. Lischke, A. Malignaggi, and L. Zimmerman, “Multiband Silicon Photonic ePIC Coherent Receiver for 64 GBd QPSK,” J. Lightwave Technol. 40(10), 3331–3337 (2022). [CrossRef]  

29. M. Wade, E. Anderson, S. Ardalan, P. Bhargava, S. Buchbinder, M. L. Davenport, J. Fini, H. Lu, C. Li, R. Meade, C. Ramamurthy, M. Rust, F. Sedgwick, V. Stojanovic, D. V. Orden, C. Zhang, and C. Sun, “TeraPHY: A Chiplet Technology for Low-Power, High-Bandwidth In-Package Optical I/O,” IEEE Micro 40(2), 63–71 (2020). [CrossRef]  

30. C. Xiong, D. M. Gill, J. E. Proesel, J. S. Orcutt, W. Haensch, and W. M. J. Green, “Monolithic 56 Gb/s silicon photonic pulse-amplitude modulation transmitter,” Optica 3(10), 1060–1065 (2016). [CrossRef]  

31. S. Assefa, H. Pan, S. Shank, W. M. J. Green, A. Rylyakov, C. Schow, M. Khater, S. Kamlapurkar, E. Kiewra, C. Reinholm, T. Topuria, P. Rice, C. Baks, and Y. Vlasov, “Monolithically Integrated Silicon Nanophotonics Receiver in 90 nm CMOS Technology Node,” in Optical Fiber Communication Conference (Optical Society of America, 2013), paper OM2H.4.

32. M. Raj, Y. Frans, P.-C. Chiang, S. L. C. Ambatipudi, D. Mahashin, P. D. Heyn, S. Balakrishnan, J. V. Campenhout, J. Grayson, M. Epitaux, and K. Chang, “Design of a 50-Gb/s Hybrid Integrated Si-Photonic Optical Link in 16-nm FinFET,” IEEE J. Solid-State Circuits 55(4), 1086–1095 (2020). [CrossRef]  

33. J. Witzens, T. Baehr-Jones, and M. Hochberg, “Design of transmission line driven slot waveguide Mach-Zehnder interferometers and application to analog optical links,” Opt. Express 18(16), 16902–16928 (2010). [CrossRef]  

34. J. Witzens, “High-Speed Silicon Photonics Modulators,” in Proceedings of the IEEE106(12), 2158–2182 (2018).

35. M. Mizuno, M. Yamashina, K. Furuta, H. Igura, H. Abiko, K. Okabe, A. Ono, and H. Yamada, “A GHz MOS adaptive pipeline technique using MOS current-mode logic,” IEEE J. Solid-State Circuits 31(6), 784–791 (1996). [CrossRef]  

36. S. Shekhar, J. S. Walling, and D. J. Allstot, “Bandwidth Extension Techniques for CMOS Amplifiers,” IEEE J. Solid-State Circuits 41(11), 2424–2439 (2006). [CrossRef]  

37. B. Analui, D. Guckenberger, D. Kucharski, and A. Narasimha, “A Fully Integrated 20-Gb/s Optoelectronic Transceiver Implemented in a Standard 0.13-µm CMOS SOI Technology,” IEEE J. Solid-State Circuits 41(12), 2945–2955 (2006). [CrossRef]  

38. S. S. Mohan, M. D. M. Hershenson, S. P. Boyd, and T. H. Lee, “Bandwidth extension in CMOS with optimized on-chip inductors,” IEEE J. Solid-State Circuits 35(3), 346–355 (2000). [CrossRef]  

39. K. R. Lakshmikumar, A. Kurylak, M. Nagaraju, R. Booth, R. K. Nandwana, J. Pampanin, and V. Boccuzzi, “A Process and Temperature Insensitive CMOS Linear TIA for 100 Gb/s/ λ PAM-4 Optical Links,” IEEE J. Solid-State Circuits 54(11), 3180–3190 (2019). [CrossRef]  

40. H. Andrade, A. Maharry, T. Hirokawa, L. Valenzuela, S. Simon, C. L. Schow, and J. F. Buckwalter, “Comparison of three monolithically integrated TIA topologies for 50 Gb/s OOK and PAM4,” Proceedings of SPIE, 11286–11289 (2020).

41. K. Lakshmikumar, A. Kurylak, R. K. Nandwana, B. Das, J. Pampanin, M. Brubaker, and P. K. Hanumolu, “A 7 pA/√Hz Asymmetric Differential TIA for 100Gb/s PAM-4 links with −14dBm Optical Sensitivity in 16 nm CMOS,” in International Solid-State Circuits Conference, 206–208 (IEEE, 2023).

42. “IEEE Standard for Ethernet,” IEEE Std 802.3-2022 (Revision of IEEE Std 802.3-2018).

43. N. P. Kelly, D. L. Ray, and D. W. Vogel, “A mixed-signal DFE/FFE receiver for 100Base-TX applications,” in International Solid-State Circuits Conference, 310–311 (IEEE, 2000).

44. J. W. Kim, S. Chen, Y. Wang, H.-W. Hung, K. Tan, W. Lin, A. B. Roldan, D. Carey, I. Chlis, R. Casey, A. Bekele, Y. Cao, D. Mahashin, H. Ahn, H. Zhang, Y. Frans, and K. Chang, “A 112-Gb/s PAM-4 Long-Reach Wireline Transceiver Using a 36-Way Time-Interleaved SAR ADC and Inverter-Based RX Analog Front-End in 7-nm FinFET,” IEEE J. Solid-State Circuits 56(1), 7–18 (2021). [CrossRef]  

45. H. Lin, C. Boecker, M. Hossain, S. Tangirala, R. Vu, S. D. Vamvakos, E. Groen, S. Li, P. Choudhary, N. Wang, M. Shibata, H. Taghavi, M. van Ierssel, A. H. Maniyar, Adam Brar, N. Nguyen, and S. Desai, “ADC-DSP-Based 10-to-112-Gb/s Multi-Standard Receiver in 7-nm FinFET,” IEEE J. Solid-State Circuits 56(4), 1265–1277 (2021). [CrossRef]  

46. Y. Krupnik, Y. Perelman, I. Levin, Y. Sanhedrai, R. Eitan, A. Khairi, Y. Shifman, Y. Landau, U. Virobnik, N. Dolev, A. Meisler, and A. Cohen, “112-Gb/s PAM4 ADC-Based SERDES Receiver With Resonant AFE for Long-Reach Channels,” IEEE J. Solid-State Circuits 55(4), 1077–1085 (2020). [CrossRef]  

47. A. H. Ahmed, D. Lim, A. Elmoznine, Y. Ma, T. Huynh, C. Williams, L. Vera, Y. Liu, R. Shi, M. Streshinsky, A. Novack, R. Ding, R. Younce, R. Sukkar, J. Roman, M. Hochberg, S. Shekhar, and A. Rylyakov, “A 6 V Swing 3.6% THD >40 GHz Driver with 4.5x Bandwidth Extension for a 272 Gb/s Dual-Polarization 16-QAM Silicon Photonic Transmitter,” in International Solid-State Circuits Conference (IEEE, 2019), 484–486.

48. H. Li, G. Balamurugan, J. Jaussi, and B. Casper, “A 112 Gb/s PAM4 Linear TIA with 0.96 pJ/bit Energy Efficiency in 28 nm CMOS,” in 44th European Solid State Circuits Conference, 238–241 (IEEE, 2018).

49. S. Daneshgar, H. Li, T. Kim, and G. Balamurugan, “A 128 Gb/s, 11.2 mW Single-Ended PAM4 Linear TIA With 2.7 µArms Input Noise in 22 nm FinFET CMOS,” IEEE J. Solid-State Circuits 57(5), 1397–1408 (2022). [CrossRef]  

50. D. Li, S. Gao, Y. Shi, X. Gui, N. Qi, Z. Li, Q. Pan, P. Chiang, and L. Geng, “A 112-Gb/s PAM-4 Linear Optical Receiver in 130-nm SiGe BiCMOS,” in International Symposium on Circuits and Systems, 2158 (IEEE, 2020).

51. E. Groen, C. Boecker, M. Hossain, R. Vu, S. D. Vamvakos, H. Lin, S. Li, M. V. Ierssel, P. Choudhary, N. Wang, M. Shibata, M. H. Taghavi, K. Brar, N. Nguyen, and S. Desai, “10-to-112-Gb/s DSP-DAC-Based Transmitter in 7-nm FinFET With Flex Clocking Architecture,” IEEE J. Solid-State Circuits 56(1), 30–42 (2021). [CrossRef]  

Data availability

All data needed to evaluate the conclusions in the paper are available in the main text.

Cited By

Optica participates in Crossref's Cited-By Linking service. Citing articles from Optica Publishing Group journals and other participating publishers are listed here.

Alert me when this article is cited.


Figures (9)

Fig. 1.
Fig. 1. (a) A conventional optical link from an ASIC on a host PCB to a pluggable optical module with a retimer or DSP chip. (b) An alternative link in which the optical engine is placed in a package near the ASIC, (c) Co-packaged optics in which the optical engine is on the same substrate as the ASIC. The arrangements shown in (b) and (c) provide net power consumption savings over (a) by eliminating the retimer/DSP chip.
Fig. 2.
Fig. 2. Schematic representation of the on-chip signal path, including spot-size converter (SSC), 1 × 2 laser splitter, monitor photodiodes (MPD), serial peripheral interface (SPI) for control, traveling wave Mach-Zehnder modulator (TWMZ), high-speed pn-junction phase shifter (PN), thermal phase shifter (THPH), adjustable termination resistor (Rterm), high-speed photodetectors (HSPD), polarization splitter & rotators (PSR), grating couplers for testing (GC), and integrated control system (CTRL).
Fig. 3.
Fig. 3. (a) Modulator driver architecture, (b) output driver stage schematic diagram, and (c) simulated |EOS21| response of the TX PAC. For 3 (c), the TWMZ is biased at the 3 dB point of maximal linear response.
Fig. 4.
Fig. 4. (a) TIA & pad driver architecture and (b) simulated AC response of the PAC RX TIA loaded with the photodetector at three different gain settings.
Fig. 5.
Fig. 5. (a) Cross-section diagram of the IOSMF with v-groove and the attached fiber. (b) Cross-section showing the front-end and middle-of-line of 45CLO technology. Figure taken from: [12].
Fig. 6.
Fig. 6. Photographs of (a) the TX and RX chips on-wafer, (b) the TX PAC under test, and (c) the RX PAC under test.
Fig. 7.
Fig. 7. Block diagram of the (a) transmitter and (b) receiver experimental test setup; the fibers indicated are short, on the order of 5 m or less, and just used to connect different pieces of test equipment. The monitor PD indicated in (a) is used to monitor optical output power and TWMZ bias point. We utilize a Source Measurement Unit (SMU) to provide a variable DC bias to the commercial Mach-Zehnder modulator (MZM) used for RX PAC testing, as well as a Power Supply Unit (PSU) to power the SHF807C, which is an RF amplifier.
Fig. 8.
Fig. 8. Measured eye diagrams from TX PAC testing at 112 Gbps with various filters implemented on the DCA, sampled with a baud/2 bandwidth receiver: (a) A “raw” TX without further signal processing; (b) processed with an analog CTLE where the CTLE has a 21 GHz zero, 27 GHz first pole, and 32 GHz second pole; (c) with a 5-tap FFE; and (d) with the CTLE and a 15-tap FFE.
Fig. 9.
Fig. 9. Measured eye diagrams from RX PAC testing at 112 Gbps. Shown in (a) is the optical input from the reference modulator used as stimulus. (b) Shows the “raw” eye after a baud/2 bandwidth sampler, without equalization, in the DCA and (c) shows the eye after a 5-tap FFE. The DCA did not use a pulse-shaping filter.

Tables (4)

Tables Icon

Table 1. Breakdown of power consumption for the TX PAC and RX PAC and extrapolation to typical total power consumption for transmitter and receiver

Tables Icon

Table 2. Comparison of high-baud and linear modulator drivers

Tables Icon

Table 3. Comparison of high-baud and linear TIAs

Tables Icon

Table 4. Power budget for TX ad RX portions of SERDES

Select as filters


Select Topics Cancel
© Copyright 2024 | Optica Publishing Group. All rights reserved, including rights for text and data mining and training of artificial technologies or similar technologies.