

## The Efficiency and Power Utilization of Current-Scaling Digital Transmitters

Mul, Dieuwert P.N.; Bootsman, Robert J.; Alavi, Morteza S.; de Vreede, Leo C.N.; Beikmirza, M.R.

DOI 10.1109/TMTT.2023.3336984

Publication date 2024 **Document Version** Final published version

Published in IEEE Transactions on Microwave Theory and Techniques

**Citation (APA)** Mul, D. P. N., Bootsman, R. J., Alavi, M. S., de Vreede, L. C. N., & Beikmirza, M. R. (2024). The Efficiency and Power Utilization of Current-Scaling Digital Transmitters. *IEEE Transactions on Microwave Theory and Techniques*, *72*(7), 4350-4366. https://doi.org/10.1109/TMTT.2023.3336984

#### Important note

To cite this publication, please use the final published version (if applicable). Please check the document version above.

Copyright

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Takedown policy

Please contact us and provide details if you believe this document breaches copyrights. We will remove access to the work immediately and investigate your claim.

# Green Open Access added to TU Delft Institutional Repository

# 'You share, we take care!' - Taverne project

https://www.openaccess.nl/en/you-share-we-take-care

Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

# The Efficiency and Power Utilization of Current-Scaling Digital Transmitters

Dieuwert P. N. Mul<sup>®</sup>, *Student Member, IEEE*, Robert J. Bootsman<sup>®</sup>, *Member, IEEE*, Mohammadreza Beikmirza<sup>®</sup>, *Member, IEEE*, Morteza S. Alavi<sup>®</sup>, *Member, IEEE*, and Leo C. N. de Vreede<sup>®</sup>, *Senior Member, IEEE* 

*Abstract*— The RF performance of current-scaling digital transmitters (DTX) with polar, unsigned Cartesian, signed Cartesian, and multiphase architectures have been compared regarding power utilization of their output-stage switch banks and drain efficiency. The analysis includes various switch bank operation modes, such as switch bank sharing, segment activation interleaving, and their activation times (RF duty cycle of the segments). Current-scaling DTXs can be made compatible with high-power operations while offering high system efficiency and RF bandwidth. The average efficiency using Doherty power back-off efficiency enhancement is analyzed, and a comparison of the different proposed DTX implementations is presented.

*Index Terms*—Current mode, current scaling, digital transmitter (DTX), Doherty, efficiency, multiphase, peak-to-averagepower ratio (PAPR), polar, power utilization, RF-DAC, signed Cartesian (SC), upconversion.

#### I. INTRODUCTION

**D** IGITAL transmitters (DTXs) can offer higher integration and system efficiency than their analog counterparts. However, DTXs come in different flavors, each with benefits and shortcomings. This work provides the theoretical background of current-scaling DTXs, which show remarkable properties in terms of drain efficiency, linearity, RF bandwidth, and scaling of their output power. Furthermore, these transmitters have relaxed demands on their output-matching network and breakdown voltage, which are comparable to those used in analog class-B transmitters. However, different from analog designs, input matching is omitted, allowing a close to perfect frequency-agile operation of transmitter output stages without any stability issues [1].

Current-scaling DTXs use segmented output stages preferably operating in saturated, or current-limited, mode (Fig. 1). The total RF output signal results from the current summation in the segmented output stages, referred to as switch

Manuscript received 29 June 2023; revised 28 September 2023 and 11 November 2023; accepted 15 November 2023. Date of publication 13 December 2023; date of current version 2 July 2024. This work was supported in part by the Dutch Research Council (NWO), in part by Ampleon Netherlands B. V., in part by Nokia Bell Laboratories, and in part by MediaTek through the Project (DIPLOMAT) 16336 and Project DRASTIC. (*Corresponding author: Dieuwert P. N. Mul.*)

The authors are with the Department of Microelectronics, Delft University of Technology, 2628 CD Delft, The Netherlands (e-mail: d.p.n.mul@tudelft.nl). Color versions of one or more figures in this article are available at

https://doi.org/10.1109/TMTT.2023.3336984. Digital Object Identifier 10.1109/TMTT.2023.3336984



Fig. 1. Simplified concept illustrations of current-scaling DTX approaches (a) low-power CMOS DTX switch bank using unit cells and (b) high-power LDMOS DTX switch bank with CMOS driver DTX using gate segmentation in its output stage.

banks. The output stage segments act to a great extent like current sources, which are digitally controlled by an amplitude codeword (ACW). Doing so, the effective gate width  $(w_g)$  of the output device is scaled, rather than the drain current  $(i_d)$  as a function of the gate voltage  $(v_{gs})$ , as is the case in analog implementations. Since the output current scales (in first-order approximation) linearly with the ACW, current-scaling DTXs behave linearly up to the point, where  $g_m$  drops due to compression [2].

All segments of the switch bank are directly connected to the drain terminal; therefore, the combined output capacitance of the segments is  $C_{out} = C_{ds,bank}$ . At the input of the switch bank(s), the effective loading of the individual drivers for the equal-sized gate segments is  $C_{gs,bank}/N$ , where  $C_{gs,bank}$  is the gate capacitance of the total bank, and N is the number of gate segments [1]. In the case of a high-power, dual-chip implementation, the bond wires introduce a parasitic inductance. The resonance frequency of this inductance combined with the segment gate capacitance  $C_{gs,bank}/N$  should be well above the operating frequency. Additionally, the mutual inductance between the bond wires should be minimized [1].  $C_{out}$  will have a significant value when aiming for higher output powers. Consequently,  $C_{out}$  needs to be compensated by the output matching network when operating at RF. To handle  $C_{out}$ ,

0018-9480 © 2023 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See https://www.ieee.org/publications/rights/index.html for more information.



Fig. 2. Top-level block diagrams of four  $g_m$ -scaling DTX architectures. (a) Unsigned Cartesian configuration using fixed clocks, (b) SC configuration using a clock mapper to switch the *I* and *Q* clocks by 180°, (c) multiphase configuration, using more clock phases, and (d) polar configuration using a continuously varying phase-modulated clock to drive the (single) switch bank.

conventional analog transmitters mostly employ class-B-like matching to reach the best compromise between linearity, efficiency, and RF bandwidth, even though other operating classes (e.g., class-F) promise higher theoretical efficiency. However, these operating classes typically rely on open conditions for some of their harmonics (e.g., the third for class-F), which is difficult to achieve over a large bandwidth when  $C_{out}$  is high. Continuous operating classes [3] can offer higher bandwidth, but this comes at the expense of higher drain voltage swings and varying complex loading conditions across the frequency band of interest, of which the latter is challenging to combine with *N*-way wideband Doherty operation.

Furthermore, in contrast to analog implementations, where the current waveform is a half-sine wave, DTX typically operates with square-wave-shaped drain currents due to the digital (voltage) control of its gate segments. A square wave drain current will have a higher fundamental content (boosting the output power) than a rectified half-sine wave drain current having the same amplitude and (RF) duty cycle. However, the 50% duty cycle of a square wave also yields a more extensive overlap with the typical sinusoidal drain voltage resulting from class-B operation, increasing the power dissipation of the output stage switch bank. Likewise, as in analog, there is a direct relationship between the duty cycle (thus, conduction angle) of the DTX-generated drain current pulses, the output power, and efficiency. However, unlike in analog class-C operation, which suffers from gain expansion, in a current-scaling DTX



Fig. 3. Exemplary waveforms for the different DTX architectures using different RF duty cycles, addressing the IQ-point  $(3)^{1/2}/2+1/2j$ , or  $\rho_{BB} = 1$  and  $\theta_{BB} = 30^{\circ}$ . The multiphase waveform has a smaller effective duty cycle than Cartesian concepts, yielding a better approximation of the polar architecture.

transmitter, the linearity of the transfer (e.g., ACW-RF output current) is inherently preserved when changing the amplitude [2]. In DTX, the duty cycle is an important design parameter and will, therefore, be included in our discussions.

This article first provides an overview of different (current scaling) DTX upconversion and switch bank architectures and, second, compares them for output power and efficiency. Third, the theoretical analysis is verified by measurements, and finally, a comprehensive literature review is performed in which alternative DTX architectures are compared for their output power and peak-to-average-power ratio (PAPR).

#### **II. TYPES OF UPCONVERSION ARCHITECTURES**

Four different current-scaling DTX architectures are considered in this work (Fig. 2). They range from Cartesian, which is suited for handling wide modulation bandwidths, to efficient "narrow-band" polar TX lineups. Next, we will discuss their operation principles.

#### A. Unsigned Cartesian DTX

The unsigned Cartesian DTX architecture uses two nonmodulated clocks to perform the baseband-IQ to RF upconversion [Fig. 2(a)] [4], [5], [6]. Because of this, the clocks used in the switch banks cannot be adjusted for their activation phase, so the activation itself must always be positive. Consequently, only class-A-like operation is possible, and the input IQ data are shifted to the top-right quadrant, as shown in Table I. As a result, this architecture suffers from a high constant dc current in the output stage, which cannot be tolerated in a high-efficiency transmitter concept. However, this constant current tends to reduce the impact of bias network-induced memory effects, making this configuration highly linear and, therefore, an attractive option for direct digital RF modulators (DDRMs) [4], [5]. Since the emphasis of this work is on



Fig. 4. Plots showing the top-right quadrant of the IQ plane with vector representations of the IQ point  $(3)^{1/2}/2 + 1/2j$ , using a DTX architecture based on (a) SC (green), (b) (eight-phase) multiphase (blue), and (c) polar (purple) architectures.

high-efficiency TX output stages, we will, in the following paragraphs, focus on the DTX architectures capable of achieving higher efficiencies.

#### B. Signed Cartesian DTX

In an SC architecture [Fig. 2(b)], both the in-phase (I) and quadrature (Q) clocks can be switched by  $180^{\circ}$  [6], [7], allowing for negative I and Q values to be transmitted by shifting the RF phase of a positive activation by 180°. For consistency in this work, we define the two (current) summing vectors as A and B, with a phase angle  $\phi_{AB}$  between them. For SC, A = I, B = Q, and  $\phi_{AB} = 90^{\circ}$  (Table I). In this case, the phase mapper selects the required clocks from four input clocks based on the sign bits of the original I and Q data [8]. Fig. 3 provides an example of the time domain current waveform that addresses the IQ point  $(3)^{1/2}/2 + 1/2j$ , or  $\rho_{BB} = 1$  and  $\theta_{BB} = 30^{\circ}$ . It should be mentioned that the positive phase shifts lag in time, in correspondence to the trigonometry:  $\rho_{BB} \cos(\omega_0 t + \theta_{BB}) = I \cos(\omega_0 t) - Q \sin(\omega_0 t)$ . Fig. 4(a) illustrates the corresponding vector summation of the current pulses. Digital word decoders send the in-phase and quadrature amplitudes to the unit cells in the corresponding banks.

SC DTX supports class-B-like operation since quiescent currents are no longer required, and the switch bank can cover all quadrants. Furthermore, the I and Q paths are very similar in terms of their hardware, and the amplitude decoder and the phase mapper can operate synchronously in a clocked domain. It should be noted that, due to the splitting of the sign and amplitude information, the SC operation is, in its essence, not fully linear. Namely, the phase mapper is effectively a high-speed, 1-bit phase modulator, creating a phase-modulated clock. Moreover, taking the absolute value of a signal is also a nonlinear operation. Only after combining the amplitude data and phase-modulated clock in the switch banks, the desired (linear) output signal is retrieved. Due to its digital nature, the amplitude and phase paths operate in a fully synchronous, clocked domain, allowing the handling of wideband modulated signals.

However, the 90° phase angle between the vectors [as shown in Fig. 4(a)] unfortunately results in complex loading conditions arise for the I and Q switch banks when both are active. As a consequence, the efficiency of a Cartesian DTX is compromised for the off-axis constellation points, which is discussed in detail in Section V-C.

#### C. Multiphase DTX

The SC architecture can be extended to operate with more than four possible phases (e.g., with eight phases) to drive its cells: yielding the multiphase architecture [Fig. 2(c)] [6], [9], [10], [11], [12], [13]. Doing so, the phase angle ( $\phi_{AB}$ ) between the clocks driving the unit cells (or gate segments) in the switch bank is reduced, alleviating the reactive loading of the switch bank segments for off-axis constellation points. As such, increasing the efficiency for these points compared with the SC operation [Figs. 4(b) and 5]. Further increasing the number of driving phases for the unit cells in the switch banks, e.g., from 8 to 16 or more, yields an even closer approximation of the behavior of the polar architecture. The phase angle between the vectors equals 360° divided by the number of phases (assuming uniform distribution), yielding 45° for eight-phase and 22.5° for 16-phase operations.

For multiphase operation, the input IQ data needs to be mapped to the A and B vectors, with their corresponding activation phases (see Tables I and II) [9]. This phase selection again can be performed by the phase mapper [6], which controls the moment of activation by selecting the proper clocks provided by the phase generator, using the eight-phase grid (Fig. 3). Just like in the SC architecture, the high-speed phase selection is performed in the clocked domain, and thus, the amplitude and phase of the switch bank can be updated synchronously. Mapping the IQ data to A and B vector representations over more than four phases compromises the orthogonal nature of the TX unit, compared with working with the IQ data directly. This feature can affect the spectral purity when dealing with very high modulation bandwidths (e.g., beyond 200 MHz). However, the synchronous operation

TABLE I IQ (BASEBAND) DATA MAPPING TO POLAR, CARTESIAN, SC, AND Eight-Phase DTX Signal Representations

| Unsigned<br>Cartesian                   | Signed Cartesian                                                                      | 8-phase<br>operation                           | Polar                                                                     |
|-----------------------------------------|---------------------------------------------------------------------------------------|------------------------------------------------|---------------------------------------------------------------------------|
| $A = \frac{I+1}{2}$ $B = \frac{Q+1}{2}$ | Vector amplitude:<br>A =  I  and<br>B =  Q <br>Phase selection:<br>sgn(I) or $sgn(Q)$ | $A =   I  -  Q  $ $B = \sqrt{2}\min( I ,  Q )$ | $\rho = \over \sqrt{I^2 + Q^2} \\ \theta = \\ \operatorname{atan2}(Q, I)$ |

TABLE II Selection of the Driving Bank Phases for the Eight-Phase Operation

| Octant | Logic Expression                                | $	heta_{ m A}$   | $\theta_{\mathrm{B}}$ |
|--------|-------------------------------------------------|------------------|-----------------------|
| Ι      | $(I \ge 0) \land (Q \ge 0) \land ( I  \ge  Q )$ | 0                | $\frac{\pi}{4}$       |
| II     | $(I \ge 0) \land (Q \ge 0) \land ( I  <  Q )$   | $\frac{\pi}{2}$  | $\frac{\pi}{4}$       |
| III    | $(I<0)\wedge(Q\geq 0)\wedge( I < Q )$           | $\frac{\pi}{2}$  | $\frac{3\pi}{4}$      |
| IV     | $(I < 0) \land (Q \ge 0) \land ( I  \ge  Q )$   | π                | $\frac{3\pi}{4}$      |
| V      | $(I<0)\wedge(Q<0)\wedge( I \geq  Q )$           | $\pi$            | $\frac{3\pi}{4}$      |
| VI     | $(I<0)\wedge(Q<0)\wedge( I < Q )$               | $-\frac{\pi}{2}$ | $-\frac{3\pi}{4}$     |
| VII    | $(I \ge 0) \land (Q < 0) \land ( I  <  Q )$     | $-\frac{\pi}{2}$ | $-\frac{\pi}{4}$      |
| VIII   | $(I \ge 0) \land (Q < 0) \land ( I  \ge  Q )$   | 0                | $-\frac{\pi}{4}$      |

of a multiphase transmitter makes this modulation bandwidth limitation much less severe than in polar architectures.

#### D. Polar DTX

In a polar architecture, there is only one activation phase for the switch bank, which is typically created by a high-resolution phase modulator, using a CORDIC to determine its digital phase word [Fig. 2(d)] [14], [15], [16], [17], [18]. The number of activated unit cells/gate segments directly sets the RF output amplitude without any vector summation. For this reason, maximum efficiency and output power can be achieved at any phase (Fig. 5). However, a CORDIC implements a nonlinear IQ-to-amplitude and phase transformation (see Table I), yielding severe bandwidth expansion [19], [20], [21]. Furthermore, the amplitude and phase paths are not synchronized (Fig. 3), requiring exact time alignment of these paths when they combine in the output stage to reconstruct the desired output signal. Any minuscule delay mismatch between the amplitude and phase path will result in severe adjacent channel power ratio (ACPR) and error vector magnitude (EVM) degradation, especially for wideband signals. In practice, excellent time alignment is not easy to achieve due to the different hardware nature of the two paths. As a result, practical polar transmitter implementations are generally considered unsuited for



Fig. 5. Effective DTX signal constellation-area coverage for a given fixed total (normalized) gate width of the switch bank. For the SC operation using: separate banks (red), bank sharing (yellow), and segment activation interleaving (green); multiphase (MP) operation using bank sharing (blue); and polar operation (purple). The dashed blue lines refer to the possibility of multiphase operation with two dedicated switch banks (small area) and segment activation interleaving (larger area).



Fig. 6. Switch bank activation (a) without IQ sharing, using separate banks, and (b) with IQ sharing, using one combined bank for I and Q data.

high-modulation bandwidths (e.g., beyond 100 MHz) when high linearity is required.

#### **III. SWITCH BANK UTILIZATION IMPROVEMENTS**

When using vector summation in Cartesian and multiphase architectures, a straightforward DTX implementation uses a separate switch bank for each vector, as indicated in Fig. 6(a) [22]. Assuming an output stage with a total of N unit cells/gate segments, with each single bank having N/2 unit cells, the maximum (current) output contour for this architecture is limited by  $A_{\text{max}} = N/2$  and  $B_{\text{max}} = N/2$ . This arrangement limits the maximum output power along the A- and B-axes by a factor 1/2 compared with a polar architecture that can activate all its N unit cells at any phase (see Fig. 5).

Next, two techniques to improve the switch bank utilization and, thereby, the output current/power utilization is discussed, namely, switch bank sharing and segment activation interleaving.

#### A. Switch Bank Segment Sharing

When using switch bank segment sharing, a single outputstage switch bank with N segments generates both the A and B(current) vectors. Each of its segments can be activated (within an RF cycle) at either the A or the B phase [see Fig. 6(b)]



Fig. 7. (a) Heatmap of the probability density of a resampled QAM signal centered around its carrier frequency, resulting in a square-like coverage of the IQ output plane. (b) Probability density of a 16-channel OFDM upconversion, resulting in a circular coverage of the IQ plane and is, thus, independent of the phase. (c) Probability density plot of the amplitude of the QAM and OFDM signals, the latter follows the Rayleigh distribution.

[23], [24], [25]. Assuming a shared bank with *N* segments in total, the maximum output contour is given by the relation  $A_{\text{max}} + B_{\text{max}} = N$ . The lowest maximum output power occurs when A = B, i.e.,  $\theta_{\text{BB}} = \phi_{AB}$ . Especially, for a multiphase architecture, this technique greatly improves the output power utilization, allowing its output power contour to approximate that of a polar architecture (Fig. 5).

To avoid one segment being activated for both the *A* and *B* data, the activation order typically starts from opposite sides of the switch bank, as shown in Fig. 6(b) [26]. Using segment sharing, the switch bank can be activated completely along the in-phase and quadrature axes. As a result, the maximum output is also achieved at these axes. The effective constellation area coverage of such a DTX is a  $45^{\circ}$  rotated square (Fig. 5). A snake-like activation pattern can be used to improve the differential nonlinearity (DNL) (Fig. 6) [24].

When using quadrature amplitude modulated (QAM) signals, which cover a (more or less) square area [Fig. 7(a)], in the SC operation, IQ-data mapping can be applied to maximize the usage of this output current/power capability. The related mapped values of I' and Q' are simply calculated as follows [25]:

$$I' = I + Q \quad Q' = I - Q.$$
 (1)

#### B. Switch Bank Segment Activation Interleaving

The output power utilization of the switch bank can be further improved by applying switch bank segment activation interleaving. In this activation technique, each unit cell/gate segment can be activated at both the A and the B vector phases within one RF cycle [4]. Therefore, the entire output switch bank is fully available for both vectors, yielding a maximum output contour given by the relation  $A_{\text{max}} = N$  and  $B_{\text{max}} = N$ , for a switch bank with a total of N segments. The lowest achievable peak output power occurs along the vector axes and is equal to the polar architecture. At off-axis constellation points, even higher output powers can be obtained, yielding a square-shaped output current contour in the SC operation. In multiphase operation, the constellation area becomes starlike, adding only a minimal increase of its effectively usable area coverage compared with bank sharing, as we will discuss further in Section IV-B.

Moreover, when using the activation interleaving technique, the waveforms created by the *A* and *B* vectors should be nonoverlapping (Fig. 3). Therefore, in this case, the maximum RF duty cycle  $d \le \phi_{AB}/360^\circ$  for the switch bank segments activation (Fig. 3).

Finally, the rise and fall times, in combination with the nonlinear  $v_{gs}$ - $i_{ds}$  relation of practical devices, will reduce the effective RF duty cycle of the activated switch bank segments. Using segment activation interleaving, a single segment can be activated twice within one RF cycle. In between the two consecutive activations, the active device possibly does not switch (completely) off due to nonzero rise and fall times. The resulting effective activation is not equal to the sum of two separate activations. In other words, the two vectors interact, which will compromise the spectral purity.

#### IV. DTX OUTPUT POWER COMPARISON

The previously described DTX architectures, with their various switch bank activations, provide different coverages of the IQ constellation plane with their TX signals. Therefore, meaningful normalization is required to compare their output power capability. Based on the normalized output powers, we compare the different architectures for their (maximum) output power, after which we discuss the effect of the RF duty cycle on the output power. Finally, some other performance considerations regarding  $w_g$  and the output matching strategy are discussed.

#### A. Normalization

The coverage of the IQ constellation plane is in this article considered in the current domain to stay close to the nature of the DTX current-scaling concept [2]. Furthermore, we assume class-B/C-like output matching, fundamental ohmic loading for the (combined) output of the switch bank(s), and short-circuit conditions for their higher harmonics. Like their analog counterparts, the drain voltage in these conditions is purely sinusoidal, with its maximum constrained by the breakdown voltage of the output stage technology. For a predefined TX peak output power level, a certain fundamental switch bank output current is needed, which we normalize as  $|\vec{i}_{out,peak}| = 1$ .

Modulated signals like QAM have square-like constellation diagrams; however, resampling and low-pass filtering yield a somewhat circular IQ plane coverage, as shown in Fig. 7(a). When the center frequency is shifted away from the carrier frequency, the frequency offset translates to a constantly increasing or decreasing phase, yielding a continuous rotation of the output constellation and, thus, a circular IQ plane coverage over time. Furthermore, in the widely used orthogonal frequency division multiplexing (OFDM) technique, the inverse FFT is applied in the modulation algorithm, which gives rise to a circular coverage of the IQ plane as well [Fig. 7(b)]. Consequently, to make a fair comparison between the various DTX concepts, we require  $|i_{out,max}| \ge 1$  for all phases, thus covering a unit circle in the IQ plane (Fig. 5). Hereby,  $|i_{out,max}|$  is the maximum current that can be provided for a certain phase and follows from the previously described output current contours.

#### B. Power Utilization of the Switch Bank(s)

Using the condition  $|\vec{i}_{out,max}| \ge 1$  for all phases, we can compare the current/power utilization of the different architectures by looking at their minima in  $|\vec{i}_{out,max}|$ . Since in a polar architecture, the maximum output current is available at any phase, we compare each architecture to this case. For now, we concentrate on the consequence of the vector summation of the RF fundamental at the output of the segmented output stage, which consists of the vectors  $\vec{A}$  and  $\vec{B}$ .

In architectures using a dedicated switch bank for A and B, the maximum achievable output current along the activation axes is limited to only half of the total current capability, yielding a required total  $w_g$  twice as large as the for a polar architecture, assuming equal duty cycles. Using switch bank segment activation interleaving, the achievable output current along the activation axes is equal to the total current capability of the power device, yielding the same required total  $w_g$  as the polar architecture (Fig. 4).

When the switch bank segment sharing utilization technique is used, the effective output current depends on the applied upconversion method (SC or multiphase, see Fig. 5). In all these cases, the magnitude of the fundamental RF output current results from the (complex) vector summation  $|\vec{A} + \vec{B}|$ , whereas the total number of activated segments is given by the summation of the vector magnitudes,  $|\vec{A}| + |\vec{B}|$  (Fig. 4). Consequently, we introduce the upconversion current utilization factor,  $F_{up}$ , which describes the ratio between this vector summation and summation of their absolute values as

$$F_{\rm up}(\theta_{\rm BB}, \phi_{AB}) = \frac{\left|\vec{A} + \vec{B}\right|}{\left|\vec{A}\right| + \left|\vec{B}\right|}$$
$$= \frac{\sin(\phi_{AB})}{\sin(\phi_{AB} - \theta_{\rm BB}) + \sin(\theta_{\rm BB})}$$
(2)

where  $\phi_{AB}$  is the phase angle between the two vectors, and  $\theta_{BB}$  is the phase of the baseband input data. The upconversion current utilization factor describes the maximum achievable output current (illustrated in Fig. 5) as a function of the input phase. The current utilization is limited to the value for which

TABLE III CURRENT UTILIZATION OF DIFFERENT DTX ARCHITECTURES AND REQUIRED GATE WIDTH TO GENERATE THE SAME RF SIGNAL, RELA-TIVE TO THE POLAR ARCHITECTURE

|              |                            | Output         | Required       |  |
|--------------|----------------------------|----------------|----------------|--|
| Bank usage   | Architecture               | current        | relative       |  |
|              |                            | utilization    | $w_{ m g,tot}$ |  |
| One bank     | Polar                      | 1              | 1              |  |
| Dedicated    | Signed Cartesian /         | 1/9            |                |  |
| banks        | multi-phase                | 1/2            |                |  |
| Switch bank  | Signed Cartesian           | $\sim 0.71$    | $\approx 1.41$ |  |
| sharing      | $\phi_{AB} = 90^{\circ}$   | $\sim 0.71$    |                |  |
| Switch bank  | 8-phases                   | $\sim 0.02$    | a, 1.09        |  |
| sharing      | $\phi_{AB} = 45^{\circ}$   | $\sim 0.92$    | $\approx 1.08$ |  |
| Switch bank  | 16-phases                  | $\sim 0.08$    | a, 1.02        |  |
| sharing      | $\phi_{AB} = 22.5^{\circ}$ | $\approx 0.96$ | $\approx 1.02$ |  |
| Segment      | Signed Cartesian /         |                | 1              |  |
| activation   | multi phasa                | 1              |                |  |
| interleaving | munu-phase                 |                |                |  |

 $F_{\rm up}$  is minimum, which occurs at  $\theta_{\rm BB} = \phi_{AB}/2$ 

$$F_{\rm up,min}(\phi_{AB}) = \cos\left(\frac{\phi_{AB}}{2}\right).$$
 (3)

Using this upconversion current utilization factor, the required total gate-width for a vector summing architecture with (shared) vector banks with a phase angle  $\phi_{AB}$  between the two vectors, relative to a polar architecture using the same RF duty cycle, is calculated to be

$$\frac{w_{g,\text{tot}}}{w_{g,\text{tot,polar}}} = \frac{1}{F_{\text{up,min}}(\phi_{AB})}$$
$$= \frac{1}{\cos(\frac{\phi_{AB}}{2})}.$$
(4)

Table III gives an overview of the output current utilization of different DTX architectures. Using dedicated vector banks drastically limits the current utilization, demanding a  $2\times$  higher total  $w_g$  to achieve the same output current at any phase compared with a polar architecture. Using switch bank sharing, this is reduced to  $1.41\times$  for SC operation and even to  $1.08\times$  and  $1.02\times$  for multiphase operation with eight and 16 phases, respectively. Using multiphase operation with a vector sharing bank gives only a minimal penalty in output current utilization compared with the polar operation. Using vector interleaved operation, this penalty is completely avoided. However, in practical DTX implementations, this yields other disadvantages that might affect the resulting spectral purity and enforces restrictions in the duty cycle  $(d \le \phi_{AB}/360^\circ)$ .

#### C. RF Duty Cycle Considerations

Section IV-B only considered the RF fundamental of the currents resulting from the ( $\vec{A}$  and  $\vec{B}$ ) activations. However, the current-scaling DTX concept uses a square/rectangular-shaped current waveform, of which the effective duty cycle is set by the time that the switch banks are conducting current. When using "class-B"/"class-C"-like output matching, the duty cycle can be reduced to improve its efficiency [2], [3]. However, when doing so, the fundamental content of the reduced duty

cycle rectangular wave also reduces, demanding a larger  $w_{g,tot}$  to maintain the same fundamental current drive capability.

The magnitude of the fundamental output current,  $|i_{out}|$ , resulting from the rectangular current waveform, is calculated using the first term of the Fourier series of the applied current waveform. In the vector summation architectures, the applied current waveform consists of two pulses. However, since summation is preserved when the Fourier transform is applied, we first derive the fundamental Fourier term of the single-rectangular waveforms, and then utilize the (vector) summation on resulting RF fundamentals, rather than applying the Fourier series on the composite waveform. We now only need to derive the first Fourier term of a single pulse. The fundamental output current of the rectangular wave relates to its current amplitude,  $I_{rect}$ , and the applied RF duty cycle (d) via

$$\left|\vec{i}_{\text{out}}\right| = I_{\text{rect}} \frac{2\sin(\pi d)}{\pi}.$$
(5)

For analog class-B, having a rectified half-sine wave current, the fundamental current follows from the maximum current via:  $|\vec{i}_{out}| = I_{half-sine,max}/2$ , where  $I_{half-sine,max}$  is the half-sine amplitude. Consequently, for a duty cycle of 50%,  $w_{g,tot}$  of a polar DTX can be  $\pi/4$  smaller than that of an analog class-B TX for the same fundamental current. For a duty cycle of 29%, the same  $w_{g,tot}$  as in analog operation can be used in the DTX polar output stage. The required relative  $w_{g,tot}$  of all discussed DTX architectures using different RF duty cycles are summarized in Table IV.

In practical situations, the current waveform has a nonzero rise and fall time, which degrades to some extent the output power for short duty cycles at high operating frequencies (e.g., less than 25% above 5 GHz [6]). When the switch bank activation becomes very short, practical switching speed constraints of the active devices can negatively affect the achievable DTX output powers since the rise and fall times start to dominate at these shorter duty cycles [1].

### D. Other (Practical) DTX Performance Considerations Concerning $w_g$

The total required  $w_g$  directly scales the input capacitance  $(C_{in})$  and output capacitance  $(C_{out})$  of the power switch bank. The required (maximum) driver power needed for the switch bank activation of the DTX is directly proportional to the input capacitance. Assuming a digital driver providing sufficient voltage swing to entirely switch the (power) FET devices in the output switch banks between "ON" and "OFF" at a given operating frequency. The required driver power to operate the DTX switch bank is

$$P_{\rm in} = (ACW_A + ACW_B) (C_{\rm gs,bank} + NC_{\rm driver}) V_{\rm gg}^2 f_0 \qquad (6)$$

with  $V_{gg}$  representing the driver output voltage swing (being the gate voltage of the power switch bank),  $C_{gs,bank}$  is the gate capacitance of the total bank,  $C_{driver}$  is the total driver capacitance per segment [1], and ACW<sub>A</sub> and ACW<sub>B</sub> the normalized switch bank activation, of which its sum (ACW<sub>A</sub>+ ACW<sub>B</sub>) is a value between 0 and 1. Following this equation, the required input power scales are inversely proportional to the upconversion current utilization factor. Furthermore, the shorter the required rise and fall times, the lower the required  $R_{on}$  of the driver, increasing the driver size and, thus,  $C_{driver}$ . Consequently, shorter duty cycles improve the drain efficiency but yield higher drive power, indicating an optimum when considering the overall line-up efficiency [1].

With a larger required  $w_{g,tot}$ , the output capacitance  $C_{out}$  also scales accordingly. Its value will limit, combined with the provided ohmic load and chosen matching strategy, the achievable RF bandwidth and the maximum (drain) efficiency of the DTX [27].

#### E. Matching Strategies for the Switch Bank Output

Many DTX implementations in literature use an output match tailored to the maximum achievable fundamental peak output current. This choice provides linear operation for the entire constellation area coverage, e.g., a square-shaped area in an SC architecture, which is suited for single-channel QAM modulation. In these cases, a lower load is applied, yielding an increased output power (factor  $(2)^{1/2}$  for SC operation). In this article, we aim to achieve optimal matching over a circle to reach the full voltage swing without clipping for the maximum of the modulated signal at any phase rotation. When a circular constellation coverage is required, the optimum load should be applied for any phase and  $|\vec{i}_{out,peak}| = 1$ , yielding

$$R_L = \frac{v_{\rm dd}}{\left|\vec{i}_{\rm out}\right|} = \frac{v_{\rm dd}\pi}{2I_{\rm rect}\sin(\pi d)} \cdot \frac{1}{F_{\rm up,min}(\theta_{\rm BB}, \phi_{AB})}$$
(7)

where  $R_L$  is the load resistance.

#### V. EFFICIENCY CONSIDERATIONS

In this section, we compare the efficiency of the various current-scaling DTX architectures. First, we will compute the DTX peak efficiencies by analyzing the impact of the RF duty cycle and the effects of vector summation on the efficiency. Second, we calculate the average efficiency when dealing with complex modulated signals and when applying a DTX in the Doherty efficiency enhancement configuration.

#### A. Impact of the RF Duty Cycle

The drain efficiency of the output stage(s)/switch bank(s) is defined as

$$\eta_d = \frac{P_{\text{out,RF}}}{P_{\text{DC}}} \tag{8}$$

where  $P_{\text{out,RF}}$  is the fundamental RF output power and  $P_{\text{DC}}$  the final stage(s) power consumption. We use the previous normalization:  $|\vec{i}_{\text{out,peak}}| = 1$ . Under the condition of a perfect ohmic output load and short-circuited conditions for all harmonics, the output voltage is a perfect sine wave, with normalized peak voltage,  $|\vec{v}_{\text{out,peak}}| = 1$ . Using (5), this yields

$$P_{\text{out,RF}} = \frac{R_L |\vec{t}_{\text{out}}|^2}{2} = R_L I_{\text{rect}}^2 \frac{2\sin^2(\pi d)}{\pi^2}.$$
 (9)

The dissipated dc power is given by

$$P_{\rm dc} = v_{\rm dd} \frac{1}{T} \int_{-\frac{d}{2}T}^{\frac{d}{2}T} I_A + I_B dt$$
$$= v_{\rm dd} \cdot (I_A + I_B) \cdot d \tag{10}$$

where *d* is the RF duty cycle, *T* is the period, and  $I_A$  and  $I_B$  are the amplitudes of the rectangular current waveforms of the *A* and *B* vector, respectively. Logically, in a polar architecture, there is only one rectangular current phase instead of *A* and *B*.

#### B. Efficiency of the Polar DTX Architecture

In a polar architecture, following from (5) and (10), the dc power is derived as a function of the duty cycle as

$$P_{\rm dc} = v_{\rm dd} \frac{\pi d}{2\sin(\pi d)} \left| \vec{i}_{\rm out} \right|. \tag{11}$$

Using (9) and (10), the drain efficiency for a current-scaling polar DTX is given by

$$\eta_{d,\text{polar}} = \frac{R_L}{v_{\text{dd}}} \cdot \frac{\sin(\pi d)}{\pi d} |\vec{i}_{\text{out}}|.$$
(12)

Fig. 8 gives the theoretically achievable drain efficiency versus RF output power for a polar DTX using a rectangular current wave for a varying duty cycle compared with analog polar from class-A to deep class-C conditions [2]. The polar DTX provides better output power and efficiency for the same  $w_g$ for duty cycle values below 50%. A 25% duty cycle for the switch bank activations is a favorable choice since it can offer higher efficiency (90% ideally) than analog class-B operation (78.5% ideally) while requiring only an 11% larger total  $w_{g}$ (Table IV). The 25% duty cycle can be synthesized digitally using simple divide-by-two circuits. Furthermore, the rising and falling edges of a 25% duty cycle clock coincide with the edges of the I and Q clocks of an SC DTX. By recombining the different phase clocks, 25% duty cycle clocks can easily be synthesized [8]. Similarly, by recombining the multiphase clocks, 12.5% or 37.5% duty cycle can be synthesized.

#### C. Efficiency of Vector Summation DTX Architectures

When the output waveform is synthesized using two rectangular current pulses, the dc power depends on the absolute sum of the current at the "A" and "B" activation. This sum can be expressed as a function of the fundamental RF output current  $i_{\text{fund}}$ , using (2), (5), and (10)

$$P_{\rm dc} = v_{\rm dd} \frac{\pi d}{2\sin(\pi d)} \left( \left| \vec{i}_A \right| + \left| \vec{i}_B \right| \right)$$
$$= v_{\rm dd} \frac{\pi d}{2\sin(\pi d)} \cdot \frac{\left| \vec{i}_{\rm out} \right|}{F_{\rm up}(\theta_{\rm BB}, \phi_{AB})}.$$
(13)

This yields a theoretical drain efficiency for multivector architectures of

$$\eta_{d,\text{vector-sum}} = \frac{R_L}{v_{\text{dd}}} \cdot \frac{\sin(\pi d)}{\pi d} F_{\text{up}}(\theta_{\text{BB}}, \phi_{AB}) \big| \vec{i}_{\text{out}} \big|.$$
(14)

From (4), the upconversion current utilization yields the largest degradation in efficiency at  $\theta_{BB} = \phi_{AB}/2$ , yielding  $1/(2)^{1/2}$ ,



Fig. 8. Theoretical peak drain efficiency averaged over the phase of the modulation ( $\theta_{BB}$ ) versus normalized output power for a DTX switch bank. Efficiencies are shown for an analog polar architecture (red), polar DTX (purple), SC DTX (green), and multiphase (MP) DTX (blue) [2].

and 0.92 for SC and eight-phase operation, respectively, compared with a polar system. Fig. 9 shows the efficiency per constellation point of the different architectures for a 25% RF duty cycle.

1) Average Efficiency Over Phase: Section IV-A concluded that the probability distribution of the modulated TX signal (e.g., OFDM) can be assumed independent over phase for multichannel signals. However, the dc power consumption of the vector summing DTX architectures will vary as a function of the phase angle of the modulated signal. It is, therefore, useful to calculate the average dc power consumption when changing the TX signal phase from vectors A to B by integrating  $1/F_{up}(\theta_{BB}, \phi_{AB})$  from 0 to  $\phi_{AB}$ 

$$\frac{1}{F_{\rm up,av}(\phi_{AB})} = \frac{1}{\phi_{AB}} \int_0^{\phi_{AB}} \frac{1}{F_{\rm up}(\theta_{\rm BB}, \phi_{AB})} d\theta_{\rm BB}$$
$$= \frac{1}{\phi_{AB}} \int_0^{\phi_{AB}} \frac{\sin(\phi_{AB} - \theta_{\rm BB}) + \sin(\theta_{\rm BB})}{\sin(\phi_{AB})} d\theta_{\rm BB}$$
$$= \frac{2}{\phi_{AB}} \tan\left(\frac{\phi_{AB}}{2}\right) \tag{15}$$

where  $F_{up}(\theta_{BB}, \phi_{AB})$  in (14) can be substituted by  $F_{up,av}(\phi_{AB})$  to calculate the average efficiency.

Using (15), we find the increase in average dc power consumption, relative to a polar system, of  $(4/\pi) = 1.27$  for SC and  $(8/\pi)((2)^{1/2} - 1) = 1.055$  for eight-phase operation. The resulting efficiencies averaged over phase and power utilization of the SC and multiphase architecture are given in Fig. 8, which shows that the output power and efficiency are compromised for SC architectures, whereas the multiphase architecture exhibits an output power/efficiency tradeoff which is much closer to the polar architecture. Multiphase operation with a 25% duty cycle seems to be particularly attractive since it offers higher efficiency than analog class-B operation, with only a limited output power penalty. Furthermore, its phase-coherent operation avoids the need for a (wideband) CORDIC and continuous phase modulator.

In Table IV, an overview is given of the required  $w_{g,tot}$  relative to an analog polar system, the maximum efficiency, the peak efficiency at  $\theta_{BB} = \phi_{AB}/2$ , and the peak efficiency



Fig. 9. Contour plots showing the theoretical drain efficiency for (a) SC, (b) multiphase, and (c) polar operation over the IQ plane, given a 25% RF duty cycle, illustrating the phase dependency of the efficiency for SC operation, and a more "polar-like" efficiency profile for eight-phase operation.

TABLE IV CURRENT-SCALING DTX ARCHITECTURES WITH THEIR  $w_{g,TOT}$  and Efficiency Performances Relative to an Analog Polar System

|              | Duty<br>Cycle | Required<br>relative<br>$w_{g,tot}$ | $\eta_{\mathrm{d,pk}}$<br>@ $\theta_{\mathrm{BB}} = 0$ | $\eta_{ m d,pk} = \phi_{ m AB}/2$ | $\eta_{ m d,pk,av}$ |
|--------------|---------------|-------------------------------------|--------------------------------------------------------|-----------------------------------|---------------------|
| Analog Polar |               |                                     |                                                        |                                   |                     |
| Class-B      |               | 1                                   | 78.5%                                                  | _                                 | _                   |
|              | 50%           | 0.79                                | 63.7%                                                  | —                                 | —                   |
| Polar        | 28.8%         | 1                                   | 86.7%                                                  | —                                 | —                   |
| DTX          | 25%           | 1.11                                | 90.0%                                                  | —                                 | —                   |
|              | 12.5%         | 2.05                                | 97.5%                                                  | —                                 | —                   |
| Signed       | 50%           | 1.57                                | 63.7%                                                  | 45.0%                             | 50.0%               |
| Cartesian    | 28.8%         | 2                                   | 86.7%                                                  | 61.3%                             | 68.4%               |
| dedicated    | 25%           | 2.22                                | 90.0%                                                  | 63.7%                             | 70.7%               |
| I&Q banks    | 12.5%         | 4.10                                | 97.5%                                                  | 68.9%                             | 76.5%               |
| Bank         | 50%           | 1.11                                | 63.7%                                                  | 45.0%                             | 50.0%               |
| sharing      | 28.8%         | 1.41                                | 86.7%                                                  | 61.3%                             | 68.1%               |
| Signed       | 25%           | 1.57                                | 90.0%                                                  | 63.7%                             | 70.7%               |
| Cartesian    | 12.5%         | 2.90                                | 97.5%                                                  | 68.9%                             | 76.5%               |
| Bank         | 50%           | 0.85                                | 63.7%                                                  | 58.8%                             | 60.4%               |
| sharing      | 28.8%         | 1.08                                | 86.7%                                                  | 80.1%                             | 82.2%               |
| 8-phase      | 25%           | 1.20                                | 90.0%                                                  | 83.2%                             | 85.4%               |
|              | 12.5%         | $  2.\overline{22}$                 | $97.\overline{4\%}$                                    | 90. $\overline{0\%}$              | 92.4%               |

averaged over the phase for the discussed TX architectures at various duty cycles.

#### D. DTX Doherty Operation

Due to their (close to) linear operation, the efficiency of the current/ $g_m$ -scaling DTX architectures will roll-off linearly with RF output current (quadratic with output power). To enhance DTX (deep) power back-off (PBO) efficiency when dealing with complex modulated signals (like OFDM) having a large PAPR, Doherty efficiency enhancement techniques are a favored approach. Namely, current-scaling DTX units are perfect replacements for the analog branches conventionally used in these Doherty designs. They can use similar or even identical output power combiners and output-matching arrangements. More importantly, in contrast to the analog Doherty, the branch driving profiles for the main and peaking branches in a DTX can be obtained almost effortlessly in the digital domain, rather than manipulating their bias points in

combination with a complicated analog input splitter design. Furthermore, their digital input drive is frequency-agile in nature. This improved branch control allows achieving close to perfect transfer functions and Doherty efficiency versus power back-off even for higher order *N*-way Doherty combiners over larger RF bandwidths [8], [28], [29].

Furthermore, all Doherty branches in the DTX, including the main branch, can be operated in their highest efficiency mode since the transfer function can be set independently of the duty cycle. Finally, all quiescent currents can be eliminated in DTX implementations, allowing significant power savings in low-traffic scenarios [30].

Next, we will evaluate the theoretically achievable average efficiencies of *N*-way DTX-based Doherty configurations. DTX-based Doherty configurations can be adapted for any kind of power combining networks with free-to-choose switch bank sizes and activation schemes. However, to limit the number of configurations considered in this work, we only focus on *N*-way Doherty combiners using a linear (nonsaturating) current profile [31] having equal switch bank sizes for the implementation of their branches. Using equal-sized switch banks relaxes the DTX design logistics and time. Consequently, the following DTX configurations are studied: single-line-up, symmetrical Doherty (1:1) [32], the three-way Doherty (1:1:1) [33], and the four-way Doherty (1:1:1:1) [34]. Moreover, the supply voltages used for all branches are considered to be identical.

1) Power Back-Off Efficiency Profiles: To find the power back-off efficiency profiles, we calculate the dc power as a function of the fundamental RF output current. When using vector summation operation, such as SC or multiphase, in a Doherty DTX, each branch needs to operate with the same vector ratio to ensure proper Doherty load modulation for the RF outputs of the switch banks at any phase. Having an equal RF duty cycle and the same  $v_{dd}$  for each branch, using (2), (11), we find

$$P_{\rm dc} = v_{\rm dd} \frac{\pi d}{2\sin(\pi d)} \cdot \frac{\left|\vec{i}_{m}\right| + \sum_{n=1}^{N-1} \left|\vec{i}_{p,n}\right|}{F_{\rm up}(\theta_{\rm BB}, \phi_{AB})}$$
(16)



Fig. 10. Efficiency profile of symmetric one-way to four-way Doherty architectures for (a) SC and (b) multiphase operation. The plotted lines represent the phase averaged efficiency, and the shaded area shows the phase dependency of each operation.

where  $|\vec{t}_m|$  and  $|\vec{t}_{p,n}|$  are the fundamental currents in the main and the *n*th peaking branch, respectively. For an *N*-way Doherty, the high-efficiency power back-off points follow from (30) and (31), derived in Appendix B. The fundamental drain currents in each branch are given in (32).

Although the efficiency peaks of this type of Doherty with equal bank sizes appear at different PBO points compared with those of the classical symmetric Doherty configurations, it can be shown that for a given set of power back-off points, the related efficiency versus output current/power function, is independent of the actual Doherty implementation used, under the condition of a lossless power combination and ideal current source behavior of the active devices. This facilitates the results of this analysis to be used in a broader context.

Using (16), the Doherty efficiency is defined as

$$\eta_{d,\text{Doh}}(\left|\vec{i}_{\text{out}}\right|) = \frac{R_L \left|i_{\text{out}}\right|}{v_{\text{dd}}} \cdot \frac{\sin(\pi d)}{\pi d} F_{\text{up}}(\theta_{\text{BB}}, \phi_{AB}) D_{\text{eff}}(\left|\vec{i}_{\text{out}}\right|)$$
(17)

where  $D_{\text{eff}}(|\vec{i}_{\text{out}}|)$  is defined as the Doherty efficiency factor for a given fundamental output current level, which can be used to describe the efficiency versus power back-off, normalized to 100% for its peak efficiency

$$D_{\rm eff}(|\vec{i}_{\rm out}|) = \frac{|\vec{i}_{\rm out}|}{|\vec{i}_m| + \sum_{n=1}^{N-1} |\vec{i}_{p,n}|}.$$
 (18)

The efficiency profile of the considered Doherty configurations versus the normalized RF output current for both SC and eight-phase operation are shown in Fig. 10. The solid line in the plot represents the efficiency averaged over phase, and the shaded area represents the range over which the instantaneous efficiency can vary as a function of its modulation phase ( $\theta_{BB}$ ).

2) Average Efficiency and Probability Density: Using the efficiency profiles derived in Section V-D.1, the overall average efficiency for a modulated signal with a given PAPR can be derived. For this purpose, the 2-D probability density must be considered. As shown in Fig. 7(b), the coverage of an OFDM signal is constant over phase, and thus, the amplitude can be considered independent of the phase in this calculation. Consequently, the average efficiency can be

calculated by evaluating the average upconversion utilization factor and the average dc current in power back-off. Assuming that an OFDM signal has a bivariate Gaussian distribution over the constellation, with equal variance for the in-phase and quadrature-phase, the probability density of the vector amplitude of such a distribution is described by the Rayleigh distribution [32], [35], as shown in Fig. 7(c)

$$f_{\text{Ray}}(\left|\vec{i}_{\text{out}}\right|, \sigma^2) = \frac{\left|\vec{i}_{\text{out}}\right|}{\sigma^2} e^{-\frac{\left|\vec{i}_{\text{out}}\right|^2}{2\sigma^2}}$$
(19)

where  $|\vec{i}_{out}|$  is normalized to:  $0 \le |\vec{i}_{out}| \le 1$  and  $\sigma^2$  represents the variance of I and Q. The average output power of such a distribution, based on the integral from 0 to  $\infty$ , is  $2\sigma^2$ . The switch bank output is only defined for the range  $0 \le |\vec{i}_{out}| \le 1$ . Consequently, the average output power is calculated using this interval, yielding an average output power of

$$P_{\text{out,RF,av}} = R_L \int_0^1 f_{\text{Ray}} (|\vec{i}_{\text{out}}|, \sigma^2) |\vec{i}_{\text{out}}|^2 d|\vec{i}_{\text{out}}|$$
$$= R_L \left( 2\sigma^2 \left( 1 - e^{-\frac{1}{2\sigma^2}} \right) - e^{-\frac{1}{2\sigma^2}} \right).$$
(20)

For a given PAPR, we can make the approximation: PAPR  $\approx$   $(1/2\sigma^2)$  (for PAPR = 7 dB, the deviation is 0.25 dB).

The same integral and (16) are used to calculate the average dc power

$$P_{\rm dc,av} = v_{\rm dd} \frac{d\pi}{2\sin(\pi d)} \cdot \frac{1}{F_{\rm up,av}(\phi_{AB})}$$
  
$$\cdots \int_{0}^{1} f_{\rm Ray}(|\vec{i}_{\rm out}|, \sigma^{2}) \left( |\vec{i}_{m}| + \sum_{n=1}^{N-1} |\vec{i}_{p,n}| \right) d|\vec{i}_{\rm out}|.$$
(21)

The average efficiency,  $\eta_{d,av}(\sigma^2)$  is defined as

$$\eta_{d,\mathrm{av}}(\sigma^{2}) = \frac{R_{L}}{v_{\mathrm{dd}}} \cdot \frac{\sin(\pi d)}{d\pi} \cdot F_{\mathrm{up,av}}(\phi_{AB})$$
  
$$\cdots \frac{\int_{0}^{1} f_{\mathrm{Ray}}(|\vec{i}_{\mathrm{out}}|) |\vec{i}_{\mathrm{out}}|^{2} d|\vec{i}_{\mathrm{out}}|}{\int_{0}^{1} f_{\mathrm{Ray}}(|\vec{i}_{\mathrm{out}}|) \left(|\vec{i}_{m}| + \sum_{n=1}^{N-1} |\vec{i}_{p,n}|\right) d|\vec{i}_{\mathrm{out}}|}$$
(22)



Fig. 11. Plot showing the average drain efficiency as a function of the PAPR of a single line-up and a symmetric two-way, three-way, and four-way Doherty. The solid line shows the Doherty efficiency factor,  $D_{\text{eff,av}}$  (efficiency normalized to  $\eta_{d,\text{peak}} = 100\%$ ). Additionally, the dashed-dotted lines, dashed lines, and dotted lines represent the theoretical average efficiency for a polar DTX, multiphase DTX, and SC DTX, respectively, using a 25% duty cycle.



Fig. 12. Picture of the CMOS-LDMOS power DTX [2].

which is plotted in Fig. 11, using a duty cycle of 25% for different variants of DTX architectures.

#### VI. VERIFICATIONS IN MEASUREMENT

To verify some of the theoretical findings and values derived in this article, additional measurements are performed on two already available DTX configurations, presented in [2] and [6], that excel in terms of output power, configurability, or Doherty order. Since all previously derived theory assumes lossless operation, some normalization is needed to allow a meaningful comparison and confirmation of the predicted trends.

#### A. Vector Summation Efficiency

The modulation-phase–efficiency relation for SC and multiphase operation is verified by performing additional measurements on the CMOS-LDMOS power DTX presented in [2] and [29] (Fig. 12) and the CMOS only DTX presented in [6]. The LDMOS output stage has two segmented banks that can be independently controlled and, thus, used as designated A and B banks in SC or multiphase operation. Fig. 13 shows the drain efficiency, measured for different phase inputs while ensuring the output power was equal for all different phases. The measured peak efficiency (54% at ~50% duty cycle with only one phase activated) has been normalized to allow comparison with the theory.



Fig. 13. Plot showing the normalized efficiency as a function of phase for SC and multiphase operation, verifying the upconversion current utilization factor ( $F_{up}$ ). The theoretical value is plotted, as well as (normalized) measured data from the DTX implementations presented in [2] and [6].



Fig. 14. Normalized efficiency and required activated  $w_g$  for a constant RF output power as a function of the duty cycle. With the applied duty cycle calibration, the measurements verify the predicted trends.

#### B. Duty Cycle Reduction

Also, the anticipated relation between the duty cycle and drain efficiency has been verified by measurements. While the effective LDMOS duty cycle could not be measured directly, the duty cycle can be derived using the duty-cycle-dependent dc value of the additional trigger pin at the output of the CMOS driver. The duty cycle of the CMOS driver is higher than that of the LDMOS drain current, which is caused by its nonzero threshold voltage  $(v_t)$  and nonlinear  $v_{gs} - i_{ds}$  relation. In the measurement, the duty cycle of the DTX CMOS driver is varied, while its ACW is continuously adjusted to keep the RF output power constant. Using the ACW increase at lower duty cycles, needed to keep the RF output power constant, the effective duty cycle of the LDMOS drain current can be estimated using similar dependencies, as shown in Fig. 8. Fig. 14 shows the expected theoretical and experimental results for two RF output power levels versus duty cycle, confirming the predicted trends.

#### VII. COMPARISON WITH LITERATURE

To verify our key conclusions for a current-scaling DTX with other experimental data, a comprehensive literature review has been performed. Alternative DTX architectures are included to put the current-scaling DTX concept in a better perspective. These include the following.

80

70

60

50

40

20

10

15

 $\dot{20}$ 

25

30

Peak Output Power (dBm)

(a)

35

40

45

Peak Efficiency (%)



ż

Fig. 15. Literature comparison of proposed DTXs over the last decade (a) in terms of peak efficiency versus peak output power and (b) in terms of average efficiency versus PAPR. The circles represent the drain efficiency, and the triangles represent the DTX system efficiency. The blue-colored shapes represent current-scaling architectures, the red-colored shapes represent switched capacitor architectures, and the green-colored shapes represent architectures that use switching to scale their on-resistance.

50

- 1) Switched capacitor DTX approaches, that rely on digital switch banks with class-D output stages to control the amount of (series) output capacitance that is actively switched between ground and supply, and as such, provides a means to control its RF output power via a resonant output match [79].
- 2) Switched resistor-based DTXs, that use switch banks composed out of small output transistors to toggle at the transmit frequency between off-state and  $R_{on}$ , of which the later depends on the number of activated output stage devices in the switch bank. Using this approach, its RF output power can be controlled even when performing a class-E-like operation [90].
- 3) Hybrid DTX architectures, which typically rely on a combination of current/capacitor/resistor and/or voltage scaling. Nonscaling DTXs, like (classical) out-phasing configuration, are not included in this comparison [38].

First, the DTX architectures reported in the literature are compared for their peak RF output power and efficiency [Fig. 15(a)]. Both peak drain efficiency (dot) and system efficiency (triangle) are provided when available. Most reported single-chip CMOS DTX implementations are found in the +20 and +30 - dBm range. This is caused by the limited breakdown of the advanced CMOS technologies used in the output stages, and thus, restricted output voltage swing(s). Targeting higher RF output powers using these high-speed CMOS technologies demands (very) low load impedance(s), or extensive power combiner network(s), which both make the circuitry very sensitive to output losses. Here, the use of an external PCB matching network can help to lower these losses [6], [83]. Current-scaling (blue symbols) and resistor-scaling (green symbols) DTX implementations tend to be more forgiving for losses in their output stages/networks than switched capacitor solutions (red symbols). As such, current-scaling and resistor-scaling approaches can reach peak efficiencies over 60%. Since resistor scaling/class-E-like approaches (green symbols) typically use a higher voltage swing for a given output power level, they seem to be more power restricted

when using the same process, or comparable technology, for their switch bank implementation [21], [91].

6

PAPR (dB)

(b)

ġ

10

11

Dual-chip DTX solutions, e.g., a CMOS DTX driver combined with a (gate) segmented output stage in high-voltage technology, can provide much higher RF output powers, e.g., over 40 dBm. A  $v_t$  down-shifted segmented LDMOS technology with a breakdown voltage of 65 V was used for the implementation of [1], [2], and [29]. In principle, a similar approach can also be used with a GaN power device. However, up to date, this has not been reported since achieving a sufficiently low  $v_t$  and a fine segmentation is less straightforward in GaN technology. In view of this, note that [43] uses a current-scaling CMOS driver followed by a singleinput common-gate GaN device that has no segmentation, so [43] is still relying on an analog PA output stage; however, it is capable of delivering +34.6 dBm.

The average DTX drain (circles) and system efficiency (triangles) versus PAPR are shown in Fig. 15(b). It indicates that current-scaling DTXs also report the highest average efficiencies [6], [8], [29], [44], [87].

Two-way Doherty efficiency enhancement yields the highest average efficiency for DTXs [6], [46] at moderate PAPRs (4.5 - 8 dB). For higher PAPRs, we find the best average efficiency performance with a four-way Doherty DTX configuration. When considering the switched-capacitor PA (SCPA) architectures, [57] stands out in average drain efficiency for a very high PAPR of 10.9 dB. This result is achieved using a combination of dual-supply class-G, Doherty load modulation, and time interleaving to create eight high-efficiency power back-off points.

#### VIII. CONCLUSION

This article benchmarks the Cartesian, multiphase, and polar current-scaling DTX architectures with their switch bank implementation (separate IQ banks, IQ bank sharing, and interleaving) against the performance of ideal classical analog class-B operation. To do so, the IQ baseband-to-RF-signal upconversion of the various DTX architectures is



Fig. 16. (a) Drain current and (b) voltage profile for a four-way Doherty architecture using a linear current profile.

expressed through the upconversion current utilization factor  $F_{up}$ . Also, the coverage of the current constellation plane for the different DTX architectures is evaluated. Using a realistic OFDM multichannel modulation scenario, the modulation phase dependency of Cartesian and multiphase DTX architectures can be eliminated as a variable, allowing a straightforward normalization of their output current capability. Next, the impact of the duty cycle for the current pulses/vectors is taken into account, enabling a straightforward comparison with the ideal analog class-B operation in terms of the required total gate width  $(w_{g,tot})$  and peak drain efficiency. Performing these steps, it is found that the use of lowduty-cycle current pulses provides a better power/efficiency tradeoff for DTX than its analog counterpart. Meanwhile, it does not show the gain expansion as is present in analog class-C. Although polar DTX operation provides by far the best efficiency and output power, Cartesian and multiphase DTX operation strongly benefit from their phase coherent upconversion, allowing straightforward retiming of their unit cells in the output stage and, as such, avoiding the bandwidth expansion of polar operation, while eliminating the need for a CORDIC or phase modulator. This makes Cartesian and multiphase operations better suited for handling wideband modulated signals.

Unfortunately, Cartesian DTX efficiency still suffers from the 90° phase difference between its (summing) current vectors. For this reason, the eight-phase operation with a 25% duty cycle seems to be an interesting candidate in terms of overall DTX performance. Namely, it approximates the efficiency of a polar DTX, ( $\eta_{pk,av,multiphase} = 85\%$ ) and requires only a relative  $w_{g,tot}$  of 1.2 in a bank sharing architecture, whereas being compatible with wideband modulation signals [6]. Meanwhile, the output match in all these considered architectures only requires shorts for higher harmonics, which are easy to implement and also compatible with wideband operation. Since current-scaling-based DTX operation provides superior control in terms of current amplitude and phase, it is perfectly suited to be used in N-way Doherty configurations that facilitate high average efficiency when handling modulation signals with high PAPR. Consequently, the theoretical achievable average efficiencies of a single DTX and two-way, three-way, and four-way Doherty DTX have been computed for polar, multiphase, and Cartesian operations assuming a 25% duty cycle for their current pulses. The theoretical trends found in terms of efficiency/output power versus duty cycle have been verified in measurements for the polar, Cartesian, and multiphase operations. Also, an extensive comparison with DTX results in the literature has been given, showing the recent DTX progress in peak efficiency versus output power and average efficiency versus PAPR.

#### APPENDIX A UPCONVERSION CURRENT UTILIZATION FACTOR

In this appendix, a closed-form equation is derived for the upconversion current utilization as a function of the phase angle between the two applied vectors and the phase of the input signal. The upconversion current utilization factor is defined as

$$F_{\rm up}(\theta_{\rm BB}, \phi_{AB}) = \frac{|I + j\mathbf{Q}|}{A + B}.$$
(23)

In which |I + jQ| represents the magnitude of the wanted fundamental current in the constellation plane, and A and B are the two current vectors used for its construction. To derive the closed-form equation for  $F_{up}(\theta_{BB}, \phi_{AB})$ , we first calculate the values of A and B as a function of the input amplitude  $(\rho_{BB})$  and phase  $(\theta_{BB})$ . For this purpose, the input IQ value is rewritten to polar notation

$$I = \Re(\rho_{BB} e^{j\theta_{BB}}) = \rho_{BB} \cos(\theta_{BB})$$
$$Q = \Im(\rho_{BB} e^{j\theta_{BB}}) = \rho_{BB} \sin(\theta_{BB}).$$
(24)

In this analysis, we only consider the first segment ( $0 \le \theta_{BB} \le \phi_{AB}$ ), utilizing the symmetries present in Figs. 5 and 9. In this segment, only the *B* vector contributes to the quadrature (imaginary) part of the input IQ value, yielding

$$Q = \Im \left( B e^{j \phi_{AB}} \right) = B \sin(\phi_{AB}) \tag{25}$$

and thus,

$$B = \rho_{\rm BB} \frac{\sin(\theta_{\rm BB})}{\sin(\phi_{AB})}.$$
 (26)

Furthermore, in this first segment, the A vector is in-phase and only contains a real value. The amplitude of the A vector is calculated by subtracting the real part of the B vector from the real (in-phase) part of the baseband input

$$A = \Re \left( \rho_{\rm BB} e^{j\theta_{\rm BB}} \right) - \Re \left( B e^{j\phi_{AB}} \right) \tag{27}$$

and thus,

$$A = \rho_{BB} \left( \cos(\theta_{BB}) - \frac{\sin(\theta_{BB})}{\sin(\phi_{AB})} \cos(\phi_{AB}) \right).$$
(28)

Using the equations earlier, the upconversion factor can now be written as

$$F_{\rm up}(\theta_{\rm BB}, \phi_{AB}) = \frac{\rho_{\rm BB}}{A+B}$$
$$= \frac{\sin(\phi_{AB})}{\sin(\phi_{AB} - \theta_{\rm BB}) + \sin(\theta_{\rm BB})}.$$
 (29)

#### APPENDIX B

#### DOHERTY DC POWER

The analysis provided is based on the current and voltage profiles, of which a four-way example is shown in Fig. 16 [31]. This configuration uses linear current relations for its branches, in contrast to the classical N-way Doherty. The linear relations avoid discontinuities, yielding higher linearity in practical designs [8], [92].

The maximum branch currents  $(|\vec{i}_{m,\max}|, |\vec{i}_{p1,\max}|, \ldots,)$  are directly proportional their relative size, in our case equal to  $|i_{out,max}|/4$  for each branch. As shown in Fig. 16, the main branch's current increases linear from 0 to  $|i_{m,max}|$  over the full input range ( $\rho_{BB}$ ), whereas the other branch currents are activated successively at higher drive levels, indicated by  $k_3$ ,  $k_2$ , and  $k_1$ . The already activated branches reach their maximum swing and, thus, efficiency at these points. Consequently,  $k_1$ ,  $k_2$ , and  $k_3$  are the high-efficiency PBO points.

To calculate the values of  $k_{1-3}$ , the output power is expressed as the sum of the powers provided by each branch, assuming a lossless Doherty power combiner is given as

$$|\vec{i}_{\text{out}}||\vec{v}_{\text{out}}| = |\vec{i}_{m}||\vec{v}_{m}| + \sum_{m=1}^{N-1} |\vec{i}_{p,m}||\vec{v}_{p,m}|.$$
(30)

Using the linear current profiles given in Fig. 16, the following branch current relations are derived

.→

$$\begin{aligned} |\vec{i}_{m}| &= \rho_{\mathrm{BB}} |\vec{i}_{m,\mathrm{max}}| \\ |\vec{i}_{p,m}| &= \begin{cases} \frac{\rho_{\mathrm{BB}} - k_{N-m}}{1 - k_{N-m}} |\vec{i}_{p,m,\mathrm{max}}|, & \text{for } k_{N-m} < \rho_{\mathrm{BB}} \leqslant 1 \\ 0, & \text{elsewhere} \end{cases} \end{aligned}$$
(31)

for an N-way Doherty configuration with  $m = \{1, 2, ..., N -$ 1]. The input values  $k_3$ ,  $k_2$ , and  $k_1$  are found at the points, where the drain voltage at each active branch reaches its maximum value,  $|\vec{v}_{out,max}| = 1$  and the currents in the nonactivated (peaking) branches are zero. Furthermore, we assume that  $|i_{out}|$ relates linearly to  $\rho_{\rm BB}$  and use  $|i_{\rm out}||\vec{v}_{\rm out}| = |i_{\rm out}|^2 R_l$ , whereas  $R_l = 1$  and  $v_{dd} = 1$ . As an example, using these conditions and substituting (31) into (30) yield the following high-efficiency power back-off points for the four-way symmetric Doherty:

$$k_3 = \frac{1}{4}, \quad k_2 = \frac{1}{3}, \ k_1 = \frac{3}{8}.$$

The drain currents as a function of the input amplitude  $\rho_{BB}$ and the high-efficiency PBO points are hereby

$$\begin{aligned} |\dot{i}_{m}| &= k_{3}\rho_{\text{BB}} \\ |\vec{i}_{p,1}| &= \begin{cases} k_{2}(\rho_{\text{BB}} - k_{3}), & \text{for } k_{3} < |\vec{i}_{\text{out}}| \leq 1 \\ 0, & \text{elsewhere} \end{cases} \\ |\vec{i}_{p,2}| &= \begin{cases} (k_{1} - k_{3})(\rho_{\text{BB}} - k_{2}), & \text{for } k_{2} < |\vec{i}_{\text{out}}| \leq 1 \\ 0, & \text{elsewhere} \end{cases} \\ |\vec{i}_{p,3}| &= \begin{cases} (1 - k_{2})(\rho_{\text{BB}} - k_{1}), & \text{for } k_{1} < |\vec{i}_{\text{out}}| \leq 1 \\ 0, & \text{elsewhere.} \end{cases} \end{aligned}$$
(32)

Similarly, this method can be used to analyze other Nway (e.g., two-way, three-way, and/or asymmetric) Doherty configurations.

#### ACKNOWLEDGMENT

The authors would like to thank Anil Kumaran for his technical assistance and the people from Ampleon, Nijmegen, The Netherlands, and Nokia Bell Labs, Stuttgart, Germany, for the helpful discussions.

#### REFERENCES

- [1] R. J. Bootsman et al., "High-power digital transmitters for wireless infrastructure applications (a feasibility study)," IEEE Trans. Microw. Theory Techn., vol. 70, no. 5, pp. 2835-2850, May 2022.
- [2] D. P. N. Mul et al., "Efficiency and linearity of digital 'class-C like' transmitters," in Proc. Eur. Microw. Conf. (EuMC), 2021, pp. 1-4.
- [3] S. C. Cripps, RF Power Amplifiers for Wireless Communications, 2nd ed. Norwood, MA, USA: Artech House, 2006.
- [4] M. Mehrpoo, M. Hashemi, Y. Shen, L. C. N. de Vreede, and M. S. Alavi, "A wideband linear I/Q-interleaving DDRM," IEEE J. Solid-State Circuits, vol. 53, no. 5, pp. 1361-1373, May 2018.
- [5] Y. Shen, R. Bootsman, M. S. Alavi, and L. C. N. de Vreede, "A wideband IQ-mapping direct-digital RF modulator for 5G transmitters," IEEE J. Solid-State Circuits, vol. 57, no. 5, pp. 1446-1456, May 2022.
- [6] M. Beikmirza, Y. Shen, L. C. N. de Vreede, and M. S. Alavi, "A wideband energy-efficient multi-mode CMOS digital transmitter," IEEE J. Solid-State Circuits, vol. 58, no. 3, pp. 677-690, Mar. 2023.
- [7] M. S. Alavi, A. Visweswaran, R. B. Staszewski, L. C. N. de Vreede, J. R. Long, and A. Akhnoukh, "A 2-GHz digital I/Q modulator in 65nm CMOS," in Proc. IEEE Asian Solid-State Circuits Conf., Nov. 2011, pp. 277-280.
- [8] M. Beikmirza, Y. Shen, L. C. N. de Vreede, and M. S. Alavi, "A wideband four-way Doherty bits-in RF-out CMOS transmitter," IEEE J. Solid-State Circuits, vol. 56, no. 12, pp. 3768-3783, Dec. 2021.
- [9] H. Wang, T. Matsuura, G. L. G. de Mercey, P. C.-P. Liang, K. Takinami, and R. W. Booth, "High-efficiency all-digital transmitter," U.S. Patent 8 385 469, Feb. 26, 2013.
- [10] E. W. McCune Jr., "Wideband phase modulation methods and apparatus," U.S. Patent 8 508 309, Aug. 13, 2013.
- [11] E. W. McCune, "Concurrent polar and quadrature modulator," in Proc. WAMICON, Jun. 2014, pp. 1-4.
- [12] W. Yuan and J. S. Walling, "A multiphase switched capacitor power amplifier in 130 nm CMOS," in Proc. IEEE Radio Freq. Integr. Circuits Symp. (RFIC), May 2016, pp. 210-213.
- W. Yuan and J. S. Walling, "A multiphase switched capacitor power [13] amplifier," IEEE J. Solid-State Circuits, vol. 52, no. 5, pp. 1320-1330, May 2017.
- [14] B. Khamaisi et al., "A 16 nm, +28 dBm dual-band all-digital polar transmitter based on 4-core digital PA for Wi-Fi6E applications," in IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. Papers, vol. 65, Feb. 2022, pp. 324-326.
- [15] A. Ben-Bassat et al., "A fully integrated 27-dBm dual-band all-digital polar transmitter supporting 160 MHz for Wi-Fi 6 applications," IEEE J. Solid-State Circuits, vol. 55, no. 12, pp. 3414-3425, Dec. 2020.
- [16] P. Madoglio et al., "A cellular multiband DTC-based digital polar transmitter with -153-dBc/Hz noise in 14-nm FinFET," IEEE J. Solid-State Circuits, vol. 55, no. 7, pp. 1830-1841, Jul. 2020.
- [17] Y. Shen et al., "A fully-integrated digital-intensive polar Doherty transmitter," in Proc. IEEE Radio Freq. Integr. Circuits Symp. (RFIC), Jun. 2017, pp. 196-199.
- [18] M. Hashemi, M. S. Alavi, and L. C. N. De Vreede, "Pushing the linearity limits of a digital polar transmitter," in Proc. 13th Eur. Microw. Integr. Circuits Conf. (EuMIC), Sep. 2018, pp. 174-177.
- [19] R. B. Staszewski and M. S. Alavi, "Digital I/Q RF transmitter using time-division duplexing," in Proc. IEEE Int. Symp. Radio-Frequency Integr. Technol., Nov. 2011, pp. 165-168.
- [20] M. S. Alavi, R. B. Staszewski, L. C. N. de Vreede, A. Visweswaran, and J. R. Long, "All-digital RF I/Q modulator," IEEE Trans. Microwave Theory Techniques, vol. 60, no. 11, pp. 3513-3526, Nov. 2012.
- [21] M. Hashemi et al., "An intrinsically linear wideband digital polar PA featuring AM-AM and AM-PM corrections through nonlinear sizing, overdrive-voltage control, and multiphase RF clocking," in IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. Papers, Feb. 2017, pp. 300-301.
- M. S. Alavi, R. B. Staszewski, L. C. N. de Vreede, and J. R. Long, [22] "A wideband 2×13-bit all-digital I/Q RF-DAC," IEEE Trans. Microwave Theory Techn., vol. 62, no. 4, pp. 732-752, Apr. 2014.

- [23] H. Jin et al., "Efficient digital quadrature transmitter based on IQ cell sharing," in *IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. Papers*, Feb. 2015, pp. 1–3.
- [24] D. Zheng et al., "A 15b quadrature digital power amplifier with transformer-based complex-domain power-efficiency enhancement," in *IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. Papers*, Feb. 2020, pp. 370–372.
- [25] Y. Shen, R. Bootsman, M. S. Alavi, and L. de Vreede, "A 0.5–3 GHz I/Q interleaved direct-digital RF modulator with up to 320 MHz modulation bandwidth in 40 nm CMOS," in *Proc. IEEE Custom Integr. Circuits Conf. (CICC)*, Mar. 2020, pp. 1–4.
- [26] T. Buckel et al., "A novel digital-intensive hybrid polar-I/Q RF transmitter architecture," *IEEE Trans. Circuits Syst. I, Reg. Papers*, vol. 65, no. 12, pp. 4390–4403, Dec. 2018.
- [27] R. M. Fano, "Theoretical limitations on the broadband matching of arbitrary impedances," J. Franklin Inst., vol. 249, no. 2, pp. 139–154, Feb. 1950.
- [28] M. J. Pelk, W. C. Neo, J. R. Gajadharsing, R. S. Pengelly, and L. C. N. de Vreede, "A high-efficiency 100-W GaN three-way Doherty amplifier for base-station applications," *IEEE Trans. Microw. Theory Techn.*, vol. 56, no. 7, pp. 1582–1591, Jul. 2008.
- [29] R. Bootsman et al., "A 39 W fully digital wideband inverted Doherty transmitter," in *IEEE MTT-S Int. Microw. Symp. Dig.*, Jun. 2022, pp. 979–982.
- [30] J. von Perner and V. Friderikos. (2021). Green Future Networks— Network Energy Efficiency. Next Generation Mobile Networks. [Online]. Available: https://www.ngmn.org/publications/green-futurenetworks-network-energy-efficiency.html
- [31] J. Gajadharsing, "N way Doherty amplifier," U.S. Patent 8 928 402 B2, Jun. 20, 2013.
- [32] F. H. Raab et al., "Power amplifiers and transmitters for RF and microwave," *IEEE Trans. Microw. Theory Techn.*, vol. 50, no. 3, pp. 814–826, Mar. 2002.
- [33] J. Gajadharsing, W. Neo, M. Pelk, L. de Vreede, and J. Zhao, "3-way Doherty amplifier with minimum output network," U.S. Patent 8 022 760 B2, Sep. 20, 2011.
- [34] X. Moronval and J. Gajadharsing, "A 100 W multi-band four-way integrated Doherty amplifier," in *IEEE MTT-S Int. Microw. Symp. Dig.*, May 2016, pp. 1–3.
- [35] A. Papoulis and S. U. Pillai, Probability, Random Variables, and Stochastic Processes. New York, NY, USA: McGraw-Hill, 2002.
- [36] A. Zhang, C. Yang, M. Ayesh, and M. S. Chen, "A 5-to-6 GHz current-mode subharmonic switching digital power amplifier for enhancing power back-off efficiency," in *IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. Papers*, vol. 64, Feb. 2021, pp. 364–366.
- [37] H. J. Qian, B. Yang, J. Zhou, H. Xu, and X. Luo, "A quadrature digital power amplifier with hybrid Doherty and impedance boosting for complex domain power back-off efficiency enhancement," *IEEE J. Solid-State Circuits*, vol. 56, no. 5, pp. 1487–1501, May 2021.
- [38] D. Jung, S. Li, J.-S. Park, T.-Y. Huang, H. Zhao, and H. Wang, "A CMOS 1.2-V hybrid current- and voltage-mode three-way digital Doherty PA with built-in phase nonlinearity compensation," *IEEE J. Solid-State Circuits*, vol. 55, no. 3, pp. 525–535, Mar. 2020.
- [39] J. Sheth and S. M. Bowers, "A differential digital 4-way Doherty power amplifier with 48% peak drain efficiency for low power applications," in *Proc. IEEE Radio Freq. Integr. Circuits Symp. (RFIC)*, Aug. 2020, pp. 119–122.
- [40] S. Hu, S. Kousai, and H. Wang, "A broadband mixed-signal CMOS power amplifier with a hybrid Class-G Doherty efficiency enhancement technique," *IEEE J. Solid-State Circuits*, vol. 51, no. 3, pp. 598–613, Mar. 2016.
- [41] M. Ingels, X. Zhang, K. Raczkowski, S. Cha, P. Palmers, and J. Craninckx, "A linear 28 nm CMOS digital transmitter with 2×12 bit up to LO baseband sampling and -58dBc C-IM3," in *Proc. 40th Eur. Solid State Circuits Conf. (ESSCIRC)*, Sep. 2014, pp. 379–382.
- [42] H. Wang et al., "A highly-efficient multi-band multi-mode all-digital quadrature transmitter," *IEEE Trans. Circuits Syst. I, Reg. Papers*, vol. 61, no. 5, pp. 1321–1330, May 2014.
- [43] V. Diddi, S. Sakata, S. Shinjo, V. Vorapipat, R. Eden, and P. Asbeck, "Broadband digitally-controlled power amplifier based on CMOS/GaN combination," in *Proc. IEEE Radio Freq. Integr. Circuits Symp. (RFIC)*, May 2016, pp. 258–261.

- [44] V. Diddi, H. Gheidi, Y. Liu, J. Buckwalter, and P. Asbeck, "A wattclass, high-efficiency, digitally-modulated polar power amplifier in SOI CMOS," in *Proc. IEEE Compound Semiconductor Integr. Circuit Symp.* (*CSICS*), Oct. 2015, pp. 1–4.
- [45] M. S. Alavi, G. Voicu, R. B. Staszewski, L. C. N. de Vreede, and J. R. Long, "A 2×13-bit all-digital I/Q RF-DAC in 65-nm CMOS," in *Proc. IEEE Radio Freq. Integr. Circuits Symp. (RFIC)*, Jun. 2013, pp. 167–170.
- [46] M. Beikmirza, Y. Shen, L. C. N. de Vreede, and M. S. Alavi, "A wideband two-way digital Doherty transmitter in 40nm CMOS," in *IEEE MTT-S Int. Microw. Symp. Dig.*, Jun. 2022, pp. 975–978.
- [47] J. S. Park, Y. Wang, S. Pellerano, C. Hull, and H. Wang, "A CMOS wideband current-mode digital polar power amplifier with built-in AM– PM distortion self-compensation," *IEEE J. Solid-State Circuits*, vol. 53, no. 2, pp. 340–356, Feb. 2018.
- [48] J. S. Park, S. Hu, Y. Wang, and H. Wang, "A highly linear dual-band mixed-mode polar power amplifier in CMOS with an ultracompact output network," *IEEE J. Solid-State Circuits*, vol. 51, no. 8, pp. 1756–1770, Aug. 2016.
- [49] S. Hu, S. Kousai, and H. Wang, "A compact broadband mixed-signal power amplifier in bulk CMOS with hybrid Class-G and dynamic load trajectory manipulation operations," in *Proc. IEEE Radio Freq. Integr. Circuits Symp. (RFIC)*, May 2016, pp. 202–205.
- [50] S. Hu, S. Kousai, J. S. Park, O. L. Chlieh, and H. Wang, "Design of a transformer-based reconfigurable digital polar Doherty power amplifier fully integrated in bulk CMOS," *IEEE J. Solid-State Circuits*, vol. 50, no. 5, pp. 1094–1106, May 2015.
- [51] J. Lee, D. Jung, D. Munzer, and H. Wang, "A compact single transformer footprint hybrid current-voltage digital Doherty power amplifier," in *Proc. IEEE Radio Freq. Integr. Circuits Symp. (RFIC)*, Jun. 2022, pp. 247–250.
- [52] W. M. Gaber, P. Wambacq, J. Craninckx, and M. Ingels, "A 21-dBm I/Q digital transmitter using stacked output stage in 28-nm bulk CMOS technology," *IEEE Trans. Microw. Theory Techn.*, vol. 65, no. 11, pp. 4744–4757, Nov. 2017.
- [53] Y. Shen, R. Bootsman, M. S. Alavi, and L. C. N. de Vreede, "A 1–3 GHz I/Q interleaved direct-digital RF modulator as a driver for a commongate PA in 40 nm CMOS," in *Proc. IEEE Radio Freq. Integr. Circuits Symp. (RFIC)*, Aug. 2020, pp. 287–290.
- [54] B. Yang, E. Y. Chang, A. M. Niknejad, B. Nikolic, and E. Alon, "A 65-nm CMOS I/Q RF power DAC with 24- to 42-dB third-harmonic cancellation and up to 18-dB mixed-signal filtering," *IEEE J. Solid-State Circuits*, vol. 53, no. 4, pp. 1127–1138, Apr. 2018.
- [55] C. Hu, Y. Yin, T. Li, Y. Liu, L. Xiong, and H. Xu, "A fullyintegrated wideband digital polar transmitter with 11-bit digital-to-phase converter in 40 nm CMOS," *IEEE J. Solid-State Circuits*, vol. 58, no. 2, pp. 462–473, Feb. 2023.
- [56] B. Yang, H. J. Qian, and X. Luo, "Quadrature switched/floated capacitor power amplifier with reconfigurable self-coupling canceling transformer for deep back-off efficiency enhancement," *IEEE J. Solid-State Circuits*, vol. 56, no. 12, pp. 3715–3727, Dec. 2021.
- [57] S.-W. Yoo, S.-C. Hung, and S.-M. Yoo, "A multimode multi-efficiencypeak digital power amplifier," *IEEE J. Solid-State Circuits*, vol. 55, no. 12, pp. 3322–3334, Dec. 2020.
- [58] A. Zhang and M. S. Chen, "A watt-level phase-interleaved multisubharmonic switching digital power amplifier," *IEEE J. Solid-State Circuits*, vol. 54, no. 12, pp. 3452–3465, Dec. 2019.
- [59] V. Vorapipat, C. Levy, and P. Asbeck, "A wideband voltage mode Doherty power amplifier," in *Proc. IEEE Radio Freq. Integr. Circuits Symp. (RFIC)*, May 2016, pp. 266–269.
- [60] J. Lee, D. Jung, D. Munzer, and H. Wang, "A compact wideband joint bidirectional class-G digital Doherty switched-capacitor transmitter and N-path quadrature receiver through capacitor bank sharing," in *Proc. IEEE Custom Integr. Circuits Conf. (CICC)*, Apr. 2022, pp. 1–2.
- [61] H. M. Nguyen, J. S. Walling, A. Zhu, and R. B. Staszewski, "A mmwave switched-capacitor RFDAC," *IEEE J. Solid-State Circuits*, vol. 57, no. 4, pp. 1224–1238, Apr. 2022.
- [62] B. Qiao, A. V. Kayyil, and D. J. Allstot, "An eight-core class-G switched-capacitor power amplifier with eight power backoff efficiency peaks," in *Proc. IEEE Radio Freq. Integr. Circuits Symp. (RFIC)*, Jun. 2022, pp. 251–254.
- [63] Z. Bai, A. Azam, and J. S. Walling, "A frequency tuneable switchedcapacitor PA in 65nm CMOS," in *Proc. IEEE Radio Freq. Integr. Circuits Symp. (RFIC)*, Jun. 2019, pp. 295–298.

- [64] Z. Bai, W. Yuan, A. Azam, and J. S. Walling, "4.3 a multiphase interpolating digital power amplifier for TX beamforming in 65nm CMOS," in *Proc. IEEE Int. Solid- State Circuits Conf.* - (*ISSCC*), Feb. 2019, pp. 78–80.
- [65] S.-W. Yoo, S.-C. Hung, and S.-M. Yoo, "A watt-level quadrature Class-G switched-capacitor power amplifier with linearization techniques," *IEEE J. Solid-State Circuits*, vol. 54, no. 5, pp. 1274–1287, May 2019.
- [66] A. Azam, Z. Bai, and J. S. Walling, "Leveraging programmable capacitor arrays for frequency-tunable digital power amplifiers," *IEEE Trans. Microw. Theory Techn.*, vol. 68, no. 6, pp. 1983–1994, Jun. 2020.
- [67] Z. Bai, W. Yuan, A. Azam, and J. S. Walling, "A split-array, C-2C switched-capacitor power amplifier in 65nm CMOS," in *Proc. IEEE Radio Freq. Integr. Circuits Symp. (RFIC)*, Jun. 2017, pp. 336–339.
- [68] Y. Yin, L. Xiong, Y. Zhu, B. Chen, H. Min, and H. Xu, "A compact dualband digital polar Doherty power amplifier using parallel-combining transformer," *IEEE J. Solid-State Circuits*, vol. 54, no. 6, pp. 1575–1585, Jun. 2019.
- [69] W. Yuan, V. Aparin, J. Dunworth, L. Seward, and J. S. Walling, "A quadrature switched capacitor power amplifier," *IEEE J. Solid-State Circuits*, vol. 51, no. 5, pp. 1200–1209, May 2016.
- [70] H. Jin, D. Kim, and B. Kim, "Efficient digital quadrature transmitter based on IQ cell sharing," *IEEE J. Solid-State Circuits*, vol. 52, no. 5, pp. 1345–1357, May 2017.
- [71] V. Vorapipat, C. S. Levy, and P. M. AsbeckIEEE, "A class-G voltagemode Doherty power amplifier," *IEEE J. Solid-State Circuits*, vol. 52, no. 12, pp. 3348–3360, Dec. 2017.
- [72] Y. Li et al., "A 15-bit quadrature digital power amplifier with transformer-based complex-domain efficiency enhancement," *IEEE J. Solid-State Circuits*, vol. 57, no. 6, pp. 1610–1622, Jun. 2022.
- [73] T. Li et al., "A wideband digital polar transmitter with integrated capacitor-DAC-based constant-envelope digital-to-phase converter," in *Proc. IEEE Radio Freq. Integr. Circuits Symp. (RFIC)*, Jun. 2019, pp. 83–86.
- [74] L. Xiong, T. Li, Y. Yin, H. Min, N. Yan, and H. Xu, "A broadband switched-transformer digital power amplifier for deep back-off efficiency enhancement," in *IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. Papers*, Feb. 2019, pp. 76–78.
- [75] Y. Yin et al., "A compact transformer-combined polar/quadrature reconfigurable digital power amplifier in 28-nm logic LP CMOS," *IEEE J. Solid-State Circuits*, vol. 54, no. 3, pp. 709–719, Mar. 2019.
- [76] T. Li et al., "A wideband efficiency-enhanced Class-G digital power amplifier for IoT applications," *IEEE Microw. Wireless Compon. Lett.*, vol. 28, no. 8, pp. 714–716, Aug. 2018.
- [77] Z. Bai, D. Johnson, A. Azam, A. Saha, W. Yuan, and J. S. Walling, "A 12 bit split-array switched capacitor power amplifier in 130nm CMOS," in *Proc. 29th IEEE Int. Syst.-Chip Conf. (SOCC)*, Sep. 2016, pp. 24–28.
- [78] S.-M. Yoo et al., "A class-G switched-capacitor RF power amplifier," IEEE J. Solid-State Circuits, vol. 48, no. 5, pp. 1212–1224, May 2013.
- [79] S.-M. Yoo, J. S. Walling, E. C. Woo, B. Jann, and D. J. Allstot, "A switched-capacitor RF power amplifier," *IEEE J. Solid-State Circuits*, vol. 46, no. 12, pp. 2977–2987, Dec. 2011.
- [80] J. Li et al., "A 4.1 W quadrature Doherty digital power amplifier with 33.6% peak PAE in 28 nm bulk CMOS," in *Proc. IEEE Int. Solid- State Circuits Conf. (ISSCC)*, Feb. 2023, pp. 1–3.
- [81] S.-C. Hung, S.-W. Yoo, and S.-M. Yoo, "A quadrature Class-G complexdomain Doherty digital power amplifier," in *Proc. IEEE Radio Freq. Integr. Circuits Symp. (RFIC)*, Jun. 2019, pp. 291–294.
- [82] M. Hashemi, L. Zhou, Y. Shen, M. Mehrpoo, and L. de Vreede, "Highly efficient and linear class-E CMOS digital power amplifier using a compensated Marchand balun and circuit-level linearization achieving 67% peak DE and -40dBc ACLR without DPD," in *IEEE MTT-S Int. Microw. Symp. Dig.*, Jun. 2017, pp. 2025–2028.
- [83] M. Hashemi, L. Zhou, Y. Shen, and L. C. N. de Vreede, "A highly linear wideband polar class-E CMOS digital Doherty power amplifier," *IEEE Trans. Microw. Theory Techn.*, vol. 67, no. 10, pp. 4232–4245, Oct. 2019.
- [84] M. Hashemi, Y. Shen, M. Mehrpoo, M. S. Alavi, and L. C. N. de Vreede, "An intrinsically linear wideband polar digital power amplifier," *IEEE J. Solid-State Circuits*, vol. 52, no. 12, pp. 3312–3328, Dec. 2017.
- [85] F. Zhang, P. Chen, J. S. Walling, A. Zhu, and R. B. Staszewski, "An active-under-coil RFDAC with analog linear interpolation in 28nm CMOS," *IEEE Trans. Circuits Syst. I, Reg. Papers*, vol. 68, no. 5, pp. 1855–1868, May 2021.

- [86] Z. Deng et al., "A dual-band digital-WiFi 802.11a/b/g/n transmitter SoC with digital I/Q combining and diamond profile mapping for compact die area and improved efficiency in 40 nm CMOS," in *IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. Papers*, Jan. 2016, pp. 172–173.
- [87] M. Beikmirza et al., "A 4-way Doherty digital transmitter featuring 50%-LO signed IQ interleave upconversion with more than 27 dBm peak power and 40% drain efficiency at 10 dB power back-off operating in the 5 GHz band," in *IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. Papers*, vol. 64, Feb. 2021, pp. 92–94.
- [88] H. Tang, H. J. Qian, B. Yang, T. Wang, and X. Luo, "A polar Doherty SCPA with 4.4 °AM-PM distortion using on-chip self-calibration supporting 64-/256-/1024-QAM," in *Proc. IEEE Radio Freq. Integr. Circuits Symp. (RFIC)*, Jun. 2022, pp. 243–246.
- [89] Y. Yin, L. Xiong, Y. Zhu, B. Chen, H. Min, and H. Xu, "A compact dual-band digital Doherty power amplifier using parallel-combining transformer for cellular NB-IoT applications," in *IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. Papers*, Feb. 2018, pp. 408–410.
- [90] R. B. Staszewski, C.-M. Hung, K. Maggio, J. Wallberg, D. Leipold, and P. T. Balsara, "All-digital phase-domain TX frequency synthesizer for Bluetooth radios in 0.13 μm CMOS," in *Proc. IEEE Int. Solid-State Circuits Conf.*, 2004, pp. 272–527.
- [91] R. J. Bootsman et al., "An 18.5 W fully-digital transmitter with 60.4% peak system efficiency," in *IEEE MTT-S Int. Microw. Symp. Dig.*, Aug. 2020, pp. 1113–1116.
- [92] L. C. N. de Vreede, R. Gajadharsing, and W. C. E. Neo, "On the bandwidth performance of Doherty amplifiers," in *Proc. IEEE Int. Wireless Symp. (IWS)*, Beijing, China, Apr. 2013, pp. 1–4.



**Dieuwert P. N. Mul** (Student Member, IEEE) was born in Heemstede, The Netherlands, in 1990. He received the B.Sc. and M.Sc. degrees in electrical engineering from the Delft University of Technology, Delft, The Netherlands, in 2012 and 2018, respectively, where he is currently pursuing the Ph.D. degree with the Electronic Circuits and Architectures (ELCA) Research Group.

His research interests include digital-intensive transmitters, RF-integrated circuit designs, and (high-speed) signal processing.



**Robert J. Bootsman** (Member, IEEE) was born in Nieuw-Vennep, The Netherlands, in 1992. He received the B.Sc. degree (cum laude) in electrical engineering and the M.Sc. degree in electrical engineering track microelectronics from the Delft University of Technology, Delft, The Netherlands, in 2014 and 2018, respectively, where he is currently pursuing the Ph.D. degree with the Electronic Circuits and Architectures (ELCA) Research Group.

His research interests include digital-intensive and energy-efficient RF power amplifiers and high-speed data conversion and processing.



Mohammadreza Beikmirza (Member, IEEE) received the B.Sc. and M.Sc. degrees in electrical engineering from the Amirkabir University of Technology (Tehran Polytechnic), Tehran, Iran, in 2014 and 2016, respectively, and the Ph.D. degree from the Delft University of Technology, Delft, The Netherlands, in 2023.

He is currently a Post-Doctoral Research Fellow with the Delft University of Technology. His current research interests include digital-intensive transmitters and RF/monolithic microwave

integrated circuit designs.

Dr. Beikmirza was a recipient of the Amirkabir University of Technology Best Undergraduate Thesis Award in 2015, the IEEE Iran Section Best Undergraduate Thesis Award in 2015, and the Platinum Award (First Place) of the Huawei RFIC Contest in 2021, and a Co-Recipient of the Institute of Semiconductor Engineers (ISE) President Best Paper Award in 2021.



**Morteza S. Alavi** (Member, IEEE) received the B.S.E.E. degree from the Iran University of Science and Technology, Tehran, Iran, in 2003, the M.S.E.E. degree from the University of Tehran, Tehran, in 2006, and the Ph.D. degree in electrical engineering from the Delft University of Technology (TU-Delft), Delft, The Netherlands, in 2014.

He was a Co-Founder and the Chief Executive Officer (CEO) of DitIQ B. V., Delft, a local company developing energy-efficient, wideband wireless transmitters for the next generation of the cellular

network. Since 2016, he has been with the Electronic Circuits and Architectures Research Group, TU-Delft, where he is currently a tenured Assistant Professor. He has coauthored *Radio Frequency Digital-to-Analog Converter* (Elsevier, 2016). His research interests include designing high-frequency and high-speed wireless/cellular communication and sensor systems and the field of wireline transceivers.

Dr. Alavi received the Best Paper Award from the 2011 IEEE International Symposium on Radio Frequency Integrated Technology. He received the Best Student Paper Award (Second Place) from the 2013 Radio Frequency Integrated Circuits Symposium. His Ph.D. student also received the Best Student Paper Award (First Place) from the 2017 RFIC Symposium in Honolulu, HI, USA. His research group recently received the 2021 Institute of Semiconductor Engineers President Best Paper Award from the International SoC Design Conference. One of his Ph.D. students also received the Platinum Award (First Place) from the 2021 Huawei Student Design Contest. He is also a Co-Recipient of the 2022 Best Paper Award of the honorable mentioned IEEE Brain and SSCS Joint Communities Paper.



Leo C. N. de Vreede (Senior Member, IEEE) received the Ph.D. degree (cum laude) from the Delft University of Technology, Delft, The Netherlands, in 1996.

In 1996, he was appointed as an Assistant Professor with the Delft University of Technology, where he worked on the nonlinear distortion behavior of active devices. In 1999 and 2015, he was appointed as an Associate Professor and a Full Professor with the Delft University of Technology, respectively, where he became responsible for the

Electronics Research Laboratory. During that period, he worked on solutions for improved linearity and RF performance at the device, circuit, and system levels. He is also a Co-Founder/Advisor of Anteverta-mw B. V., Eindhoven, The Netherlands, a company specializing in RF device characterization. He has (co)authored more than 120 IEEE-refereed conference papers and journal articles. He holds several patents. His current research interests include RF measurement systems, RF technology optimization, and (digital-intensive) energy-efficient/wideband circuit/system concepts for wireless applications.

Dr. de Vreede was a (co)recipient of the IEEE Microwave Prize in 2008, a Mentor of the Else Kooi Prize Awarded Ph.D. Work in 2010, and the Dow Energy Dissertation Prize Awarded Ph.D. Work in 2011. He was a recipient of the TUD Entrepreneurial Scientist Award in 2015. He (co)guided several students who received the (Best) Paper Award from the Bipolar/BiCMOS Circuits and Technology Meeting, the Program for Research on Integrated Systems and Circuits, the European Solid-State Circuits and Devices Conference, the International Microwave Symposium, the Radio Frequency Integration Technology, and the Radio Frequency Integrated Circuits Symposium.