# Class-C PA-VCO Cell for FSK and GFSK Transmitters

Chuanwei Li, Student Member, IEEE, and Antonio Liscidini, Senior Member, IEEE

Abstract-In this paper, a Class-C PA-VCO cell tailored to FSK/GFSK transmitters is presented. In the proposed solution, a Class-C VCO and a common-gate stage PA are stacked in a current-reuse architecture operating with 1.2 V power supply. The PA and the VCO efficiencies are maximized by adjusting their voltage headroom without the use of any DC-DC converters. The PA-VCO is inserted in a transmitter based on an openloop architecture. The presented prototype, fabricated in 0.13  $\mu$ m CMOS technology, occupies an active area of 0.2 mm<sup>2</sup>. A maximum TX efficiency of 17.5% is achieved while the TX is delivering an output power of -1 dBm at 2.45 GHz. A phase noise of -129 dBc/Hz at 2.5 MHz frequency offset results in a carrierfrequency drift below 7 Hz/s and an FSK error below 0.7%, which allows the transmitter to operate in open-loop while delivering long data-packets. The transmitter is also compliant to BLE specifications when FSK and GFSK modulations with index of 0.5 are applied.

*Index Terms*—Class-C, current-reuse, FSK, GFSK, high efficiency, low phase noise, low-power, open-loop transmitter, PA, VCO.

### I. INTRODUCTION

**I** N THE ERA of the Internet of Things (IoT), ultra-low power wireless transceivers will play an indispensable role in the creation of energy harvested wireless sensor networks (WSNs) [1]. Bluetooth Low Energy (BLE) and ZigBee are the most popular standards for ultra-low power applications [1]–[3]. In such systems, the transmitter represents the most power hungry section of the transceiver and the main bottleneck in the design of a battery-less device. Frequency shift keying (FSK) and Gaussian FSK (GFSK) are the main modulations adopted by ultra-low power transceivers since for modest data rate they can lead to high efficiency and a simplification of the radio architecture. In particular, the transmitter can be realized with a phase-locked loop (PLL) followed by a nonlinear power amplifier (PA) [4].

In ultra-low power applications, performance and transmitted power are sacrificed in favor of a longer-lasting battery life [3]. When the PA operates in such low-power conditions ( $P_{RF} < 0$  dBm), the overall efficiency is affected by power consumption of the other building blocks (e.g., the PLL). In particular, the overall efficiency of the TX ( $\eta_{TX}$ ) can be expressed as

$$\eta_{TX} = \frac{P_{\rm RF}}{P_{\rm DC,PA} + P_{\rm DC,MOD}} \tag{1}$$

Manuscript received November 23, 2015; revised February 11, 2016, and March 15, 2016; accepted April 11, 2016. Date of publication June 10, 2016; date of current version June 22, 2016. This paper was approved by Guest Editor Andrea Mazzanti.

The authors are with the Electrical and Computer Engineering Department, University of Toronto, Toronto, ON, Canada, M5S 3G4 (e-mail: chuanwei.li@mail.utoronto.ca).

Color versions of one or more of the figures in this paper are available online at http://ieeexplore.ieee.org.

Digital Object Identifier 10.1109/JSSC.2016.2554148

Vin PA Vin PA Var PA,ideal ·  $\eta_{M}$  $\eta_{PA,ideal}$  ·  $\eta_{M}$ 

Vout

VDD P

Fig. 1. TX output load and matching network.

where  $P_{\text{RF}}$  is RF power delivered by the TX,  $P_{\text{DC},\text{PA}}$  is power dissipated by the PA, and  $P_{\text{DC},\text{MOD}}$  is the power consumption of the TX excluding the PA. Equation (1) can be rewritten as a function of the PA efficiency ( $\eta_{\text{PA}}$ ) as

$$\eta_{\rm TX} = \frac{\eta_{\rm PA}}{1 + \eta_{\rm PA} \cdot \frac{P_{\rm DC,MOD}}{P_{\rm RE}}}.$$
 (2)

Vp

From (2), it is possible to verify that, for low output power levels, the overall TX efficiency drops, since  $P_{DC,MOD}$  does not typically scale with  $P_{RF}$ . Usually, the lower bound for  $P_{DC,MOD}$  is set by the voltage-controlled oscillator (VCO), which is the second most power-hungry block after the PA. The delivery of a small output power also sets a constraint in the design of the PA and its output matching network. In fact, since the maximum efficiency of a PA is typically reached when the amplifier operates rail-to-rail, a small output power demands a large resistive load at the output of the PA, becoming more sensitive to parasitic elements and decreasing the overall efficiency [5] (Fig. 1). For a given PA voltage supply  $V_{DD,PA}$  and output power  $P_{RF}$ , the required load to maximize the  $\eta_{PA}$  is given by

$$R_{\text{load}} = \frac{1}{2} \cdot \frac{V_{\text{p}}^2}{P_{\text{RF}}} = \frac{1}{2} \cdot \frac{V_{\text{DD,PA}}^2}{P_{\text{RF}}}$$
(3)

where the signal swing at the output  $(V_p)$  is assumed equal to  $V_{DD,PA}$ . As shown in [5], the efficiency of a simple matching network (i.e., with only one inductor) used to transform the impedance of the antenna  $R_s$  into  $R_{load}$  (with  $R_{load} > R_s$ ) is given by

$$\eta_M \approx 1 - \frac{\sqrt{\frac{R_{\text{load}}}{R_s} - 1}}{Q_L} = 1 - \frac{\sqrt{\frac{V_{\text{DD,PA}}^2}{V_{\text{RF}}^2} - 1}}{Q_L}$$
 (4)

where  $V_{\rm RF}$  corresponds to the voltage amplitude of the delivered power  $P_{\rm RF}$  at the antenna and  $Q_{\rm L}$  the quality factor of the inductor (quality factor of the capacitors is neglected since it is typically greater than  $Q_{\rm L}$ ). To give an idea of the

0018-9200 © 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications\_standards/publications/rights/index.html for more information.



Fig. 2. (a) Supplying the VCO and the PA using DC–DC converters and (b) current-reuse stacking structure.

relevance of this problem at low transmitted power, if  $V_{RF}$  is 0.3 V (roughly 0 dBm of power) adopting a power supply of 1.2 V and an inductor with  $Q_L = 10 - 15$ , the  $\eta_M$  would be between 61% and 74%. In the case of small output power, the impact of parasitic elements can be limited by adopting a multistage matching network [5] (demanding larger area) or a lower voltage supply to the PA, with the cost of an extra DC–DC converter [Fig. 2(a)] [6], [7]. Taking this effect into account, (2) can be rewritten as

$$\eta_{TX} = \frac{\eta_{\text{PA,ideal}} \cdot \eta_M}{1 + \eta_{\text{PA,ideal}} \cdot \eta_M \cdot \frac{P_{\text{DC,MOD}}}{P_{\text{RF}}}}$$
(5)

where  $\eta_{PA,ideal}$  corresponds to the efficiency of the PA by assuming an ideal matching network.

The work presented in this paper tries to solve the above issues by applying the current-reuse architecture as shown in Fig. 2(b), where the PA is stacked on top of a Class-C VCO [8]. When the power burned by the PA is of the same order as the one dissipated by the VCO, the proposed design offers four main advantages. First, a single  $V_{\rm DD}$  can be subdivided between the VCO and the PA without any DC-DC converters. Second, the targeted low output power can be delivered by the PA with a smaller load, which is less sensitive to parasitic elements and makes the design of the matching network easier. Third, the supply voltage ratio between the VCO and the PA can be adjusted, offering a degree of freedom for power optimization for different targeted output powers and phase noise (PN) specs. Finally, the Class-C VCO can operates at half of the carrier frequency, exploiting an intrinsic property of the Class-C VCO that can be used as a frequency doubler, if the output signal is sensed at the center tap of the inductor in the LC-tank. For testing purposes, the proposed PA-VCO cell is inserted in a fractional PLL for channel selection. Although, in the proposed prototype, the transmission occurs in open-loop by directly modulating the VCO (thanks to the low carrier-frequency drift and FSK error achieved), the structure is also fully compatible with closedloop direct modulators.

This paper is organized as follows. Section II is focused on the details of the Class-C VCO and the stacking structure. Section III describes in detail the boosted common-gate topology adopted for the PA. Section IV offers a design methodology to dimension the PA-VCO structure for a targeted output power and the PN, and Section V shows the details of a



Fig. 3. Class-C VCO and PA stacking.

prototype implemented in 0.13  $\mu$ m CMOS technology. Finally, measurement results are discussed in Section VI.

## II. STACKING CLASS-C VCO AND PA

The Class-C VCO is an oscillator topology originally proposed by [9], where the cross-coupled pair in the VCO operates in Class-C in order to maximize the amplitude of the current injected into the *LC*-tank at the oscillation frequency. According to the original design proposed by [9], two key elements are required to maintain the VCO in Class-C operation: a capacitor  $C_{\text{tail}}$  in parallel with the VCO's biasing current source  $I_{\text{tail}}$ , and a proper bias scheme to keep the cross-coupled pair M1-M2 in the saturation region during the conducting phase (Fig. 3) [9], [10].

## A. Class-C VCO as a Frequency Doubler

In a Class-C oscillator topology, each transistor of the crosscoupled pair (M1-M2) generates a train of pulses that feeds an LC tank (used to amplify the differential component at  $f_{\rm LO}$  by sustaining the oscillation). If the tank resonates only for differential components, as in the case shown in Fig. 3, common-mode signals are collected at the center tap of the inductor. The two trains of pulses collected at the center tap of the inductor form a train of pulses at  $2 \cdot f_{LO}$ , making the Class-C oscillator a frequency doubler. By feeding the pulse train to a common-gate PA as shown in Fig. 3, a complete transmitter front-end can be realized. In this structure, the oscillator operates at half of the carrier frequency of the transmitted signals, saving power in frequency generation and the dividers. Furthermore, a differential to single-end conversion is intrinsically performed simplifying the interface between the PA and the antenna. In addition to that, the VCO bias current is reused by the PA and no buffers are required to drive the PA avoiding loading the LC tank.

In WSN transceiver, the RX and the TX typically share the same VCO. This could be a problem for the proposed structure where the VCO operates at half of the carrier frequency. A possible solution is proposed in [11], where a receiver is



Fig. 4. PA-VCO stack equivalent circuits at  $2 \cdot \omega_{\text{LO}}$ . (a) PA-VCO stack. (b) Common-mode equivalent impedance  $R_{\text{eq}}$  at  $2 \cdot \omega_{\text{LO}}$ . (c) Transformation of  $R_{\text{eq}}$ .

realized by using a VCO also operating at half of the carrier frequency exploiting injection-locking properties.

#### B. Stacking the PA and the VCO

The common-gate stage M3 in Fig. 3 intrinsically operates in Class-C since it is fed by a train of narrow pulses. A resonant *LC* matching network is applied at the output of the M3 to provide a DC path to filter out higher order harmonics and to maximize the efficiency for a targeted output power, by scaling the impedance of the antenna.

Stacking of the PA with the Class-C VCO is less trivial than it appears since a high input impedance provided by the PA tends to compromise the Class-C operation of the VCO. The study of this phenomenon is complex since the PA operates in Class-C and a simple time-invariant linear model cannot be used. However, the same approach used in [9] to study the Class-C oscillator was adopted. According to [9], the time-variant transconductance of M3 can be described by the following Fourier series:

$$g_m(\omega_{\rm LO}t) = \sum_{\rm p} g_{\rm p} \cdot \cos\left(p \cdot \omega_{\rm LO}t\right) \tag{6}$$

where  $\omega_{\rm LO} = 2\pi \cdot f_{\rm LO}$  and the coefficient  $g_{\rm p}$  in the Fourier series can be determined by the large-signal regime and the MOSFET device parameters [9]. Notice that  $g_{\rm p}$  are different from the coefficients of the classic Taylor expansion around a DC bias point. For example,  $g_0$  is the transistor transconductance averaged in an oscillation period that would resemble the small-signal transconductance  $g_m$  in a timeinvariant situation. Since the PA works as a common-gate stage, its input resistance at  $2 \cdot f_{\rm LO}$  can be found by solving the following equation for the components  $i_{\rm d}$  at  $2 \cdot \omega_{\rm LO}$ :

$$i_{\rm d}(t) = g_m(t) \cdot V_x \cdot \cos\left(2 \cdot \omega_{\rm LO} t\right) \tag{7}$$

where  $i_d(t)$  is the current absorbed by the transistor in the presence of a sinusoidal voltage source  $V_x$  at its input. By substituting  $i_d(t) = I_x \cdot \cos(2 \cdot \omega_{\text{LO}} t)$  in (7), the input impedance at  $2 \cdot \omega_{\text{LO}}$  can be expressed as

$$R_{\rm eq} = \frac{V_x}{I_x} = \frac{1}{\left(g_0 + \frac{g_4}{2}\right)}.$$
 (8)

As shown in Fig. 4, for the signal at  $2 \cdot f_{LO}$ , this input impedance appears in series with the capacitor  $C_{tail}$  because the current source  $I_{tail}$  behaves as an open circuit for such



Fig. 5. Effects of having  $R_{eq}$  at the top and bottom of the stack.

component. Although in general it is wrong to move a resistor from drain to source of a transistor, in this case the transformation is possible because for common-mode components, the transistors M1 and M2 are diode connected [Fig. 4(b)]. The role of the capacitor  $C_{\text{tail}}$  in a Class-C oscillator is to provide a low-impedance path for the current when both transistors in the switching pair are off. An increment of such  $R_{eq}$ reduces the capability of the oscillator to operate in Class-C by decreasing the efficiency of the oscillator and the extraction of the second harmonic as shown by the simulation in Fig. 5. To prove the correctness of the transformation performed in Fig. 4, Fig. 5 shows the effect of having  $R_{eq}$  at the top and bottom of the stack. The slight difference between the two curves derives from the presence of parasitic capacitance of the transistors that mitigate the effect of an increment of  $R_{\rm eq}$  when it is placed in series to  $C_{\rm tail}$ .

In order to keep the VCO in Class-C operation, the input impedance of the PA must be lower than  $C_{tail}$ , leading to the following condition:

$$R_{\rm eq} \ll \frac{1}{2 \cdot \omega_{\rm LO} \cdot C_{\rm tail}}.$$
 (9)

Once condition (9) is satisfied, another critical element for the stack design is the subdivision of the supply voltage ( $V_{DD}$ ) between the VCO ( $V_{DD,VCO}$ ) and the PA ( $V_{DD,PA}$ ). The overall efficiency ( $\eta_{STACK}$ ) can be expressed as follows:

$$\eta_{\text{STACK}} = \frac{P_{\text{RF}}}{P_{\text{DC,STACK}}}$$
(10)

where  $P_{\text{DC,STACK}}$  is the power consumed by the overall stack. Equation (10) can be rewritten as a function of the PA efficiency ( $\eta_{\text{PA}}$ ) as

$$\eta_{\text{STACK}} = \eta_{\text{PA}} \cdot \frac{V_{\text{DD},\text{PA}}}{V_{\text{DD}}}.$$
(11)

For a given  $\eta_{PA}$ , increasing  $V_{DD,PA}$  improves the efficiency of the stack. However, when the losses of the matching network are also considered, (11) can be rewritten as:

$$\eta_{\text{STACK}} = \eta_{\text{PA,ideal}} \cdot \eta_M \frac{V_{\text{DD,PA}}}{V_{\text{DD}}}$$
$$= \eta_{\text{PA,ideal}} \cdot \frac{V_{\text{DD,PA}}}{V_{\text{DD}}} \cdot \left(1 - \frac{\sqrt{\frac{V_{\text{DD,PA}}^2}{V_{\text{RF}}^2} - 1}}{Q_{\text{L}}}\right). \quad (12)$$



Fig. 6. Boosted common-gate stage: (a) active implementation, (b) proposed solution, and (c) equivalent model.

In this case, it is possible to verify that higher  $V_{\text{DD,PA}}$ leads to more losses in the matching network, making (12) nonmonotonic. A more intuitive explanation of the behavior of (12) is that, for small V<sub>DD,PA</sub>, the overall efficiency diminishes because the majority of the power is used in the VCO, while for very large  $V_{DD,PA}$  the efficiency is limited since some power is dissipated in the matching network. The optimum value of  $V_{\text{DD,PA}}$  depends on many constraints such as  $Q_{\text{L}}$ ,  $V_{\rm DD}$ ,  $V_{\rm RF}$ , and the power required by the VCO to satisfy a certain PN spec. The flexible subdivision of the supply voltage between the VCO and the PA offers an additional degree of freedom for the design optimization without the need for any DC-DC converters. In fact, the optimization can be done by simply controlling the DC bias voltage at the gate of transistor M3, which, for low frequency components, operates as a voltage regulator for the VCO.

When a low PN is needed, as targeted in the presented prototype to achieve a small carrier-frequency drift and FSK error, more power must be consumed by the VCO requiring a  $V_{DD,VCO}$  larger than  $V_{DD,PA}$ . Although in this case  $\eta_{STACK}$  is inevitably reduced, the PA can be more efficient since it is less sensitive to the parasitic elements, as shown by (4) and (12). On the other hand, if the PN spec is relaxed, the subdivision of the supply between the VCO and the PA can be optimized as a function of the delivery output power and the overall efficiency of the stack.

### III. BOOSTED COMMON-GATE PA

As shown in the previous section, a low input impedance of the common-gate PA at the second harmonic of the oscillation frequency is fundamental to maintain the oscillator in Class-C operation. When the transmitted power is in the range of mW, a simple common-gate topology as reported in Fig. 3 is not sufficient to satisfy the condition expressed by (9) since the average transistor transconductance (i.e.,  $g_0$ ) becomes too small. In such cases, the input impedance of the common-gate PA can be further decreased by a boosted topology as shown in Fig. 6(a). The main problem of such an approach is the usage of an additional amplifier that would compromise the overall efficiency. An active stage can be avoided by exploiting the large differential signal produced by the VCO at  $\omega_{LO}$ . As shown in Fig. 6(b), the amplifier is split into two parts and each gate is driven by one of the phases of the differential LO signals produced by the VCO tank. The presence of a train of pulses feeding the PA makes it difficult to develop an analytical expression for the input impedance of the amplifier, which relies on a time-variant nonlinear operation of the transistors. However, the behavior of the proposed structure can be studied by considering the transformation reported in Fig. 6. Since each transistor works only for half of the oscillation period, the structure in Fig. 6(b) resembles the one reported in Fig. 6(c), where a single transistor is driven by a "rectified" sinusoid at  $\omega_{\rm LO}$ , which also contains the frequency component at  $2 \cdot \omega_{\rm LO}$ . This component, correlated to the one at the source of M3, is used to boost the transistor transconductance, lowering the input impedance. In order to guarantee a Class-C operation for the PA, the amplitude at the gate of the M3a and M3b must be limited to keep the transistors in the saturation region when turned on. This requires the introduction of a capacitor divider between the VCO tank and the PA that partially loads the VCO tank. However, since the amplitude required to lower the input impedance can be only a fraction of the one developed across the VCO tank, the capacitive divider can be sized to limit the loading effect introduced by the PA.

Simulations of currents and voltages highlighted in Fig. 6 are reported in Fig. 7. The large signals at  $\omega_{LO}$  provided at the gate turn the transistors on and off such that the input pulses flow alternatively into M3a and M3b. A plot of the input impedance (for  $2 \cdot f_{LO}$  components) as a function of the capacitive divider gain k is reported in Fig. 8. For k = 0, the structure behaves as a common-gate stage, while an increment of k decreases the level of the input impedance rises again since the transistors start to work partially in triode.

When stacking on the top of the VCO, the boosted commongate PA resembles a frequency doubler, which exploits the nonlinearity of the transistor to generate a second harmonic of output current. However, in the proposed solution, the output current is not set by the voltage driving the gate of the transistor but by the current feeding the source (Fig. 6). In this case, the stage operates as a boosted cascode where the additional voltage presented at the gate only reduces the swing at the source by lowering the input impedance. To work as a frequency doubler, the PA would require a large capacitor at



Fig. 8. Input impedance at  $2 \cdot f_{LO}$  versus capacitive divider gain k.

k

its source to act as a common-source stage for the RF signal. The proposed solution, not only avoids the presence of a large decoupling capacitor, but also takes advantage of the fact that a common-gate stage driven by a high impedance introduces less noise and distortion compared to a common-source stage. The reason for this is that the gate and the source tend to follow each other for noise and distortions due to high driving impedance.

#### **IV. DESIGN METHODOLOGY**

Here, a design methodology to dimension the most important elements in the PA-VCO cell from a given voltage supply, a targeted output power and the PN is demonstrated. Although several strategies can be adopted to obtain an optimized design, this strategy offers a useful start point especially for the design of the PA matching network and the resonant load of the VCO. In this design methodology, the first step is to set the voltage headroom for the two building blocks. The minimum voltage supply that can be dedicated to the Class-C VCO depends on several elements that can be found in [9] such as start-up constraints, transistor threshold and the PN spec. Once the minimum  $V_{\text{DD,VCO}}$  is determined,  $V_{\text{DD,PA}}$  is also obtained. The definition of the voltage headroom for the two building blocks sets an upper bound for the efficiency of the stack defined by (11).

Once  $V_{\text{DD,PA}}$  is determined, the PA load ( $R_{\text{load}}$ ) that maximizes the efficiency is only a function of the targeted output power  $P_{\text{RF}}$  and can be found in (3). With  $R_{\text{load}}$ , the current amplitude at 2· $f_{\text{LO}}$  delivered by the PA is also set. By assuming a conversion gain of the Class-C oscillator that is close to one [9], the VCO and the PA bias current ( $I_{\text{bias}}$ ) are determined by  $V_{\text{DD,PA}}/R_{\text{load}}$ . Finally,  $R_{\text{load}}$  also sets the impedance ratio that the matching network has to provide to the antenna (i.e.,  $R_{\text{load}}/R_s$  with  $R_s$  typically equal to 50  $\Omega$ ).

Once the VCO current ( $I_{\text{bias}}$ ) and its voltage headroom ( $V_{\text{DD,VCO}}$ ) have been determined, the VCO tank must be designed to maximize the oscillation amplitude at  $f_{\text{LO}}$ . The maximum peak-to-peak oscillation amplitude allowed for a Class-C VCO not to compromise the PN performance is roughly  $1.3 \cdot V_{\text{DD,VCO}}$  [12], which leads to a tank resistance at resonance as

$$R_{\text{tank}} = \frac{1.3 \cdot V_{\text{DD},\text{VCO}}}{2 \cdot I_{\text{bias}}}.$$
 (13)

## V. PROTOTYPE

The proposed current-reuse PA-VCO stack has been integrated in a fractional PLL to implement an open-loop transmitter tailored to an indoor localization application. The main goals for this prototype were a targeted output power below 0 dBm for the PA, low carrier-frequency drift, and low FSK error. While in a closed-loop architecture, carrier-frequency drift and the FSK error are mainly set by the PN produced in the PLL bandwidth, in an open-loop modulation they are determined only by the VCO. Since, beside the power efficiency, the cost of integration of a single technology node is crucial to allow a high-density of devices in the network, the technology chosen for the integration was 0.13  $\mu$ m CMOS technology with an ultrathick metal layer. From this point of view, the capability of the VCO to operate at half of the carrier frequency allowed us to limit the power dissipation despite the technology node adopted. The schematic of the integrated prototype is shown in Fig. 9. The PA-VCO stack has been inserted in an analog PLL, which is used for channel selection. An external clock is used as reference and the PLL was designed to achieve a bandwidth of around 1 MHz for a fast settling of the carrier frequency. Once the channel is selected, the PLL is opened and transmission occurs by directly modulating the VCO as proposed in [4]. This prevents the spurs of the PLL from affecting the transmission. A more detailed schematic of the VCO is shown in Fig. 10. The VCO tank is constituted by a fixed capacitor (400 fF), a coarse tuning capacitor bank to compensate process variation (seven elements of 100 fF each), a varactor to close the PLL, and a smaller varactor used for



Fig. 9. TX architecture.



Fig. 10. Class-C VCO and its biasing scheme.

the signal modulation. The varactor used for the modulation is driven by an external operational amplifier, which generates two logic values. This approach, proposed in [4], can shape the pulse in order to mimic a GFSK modulation. The inductor in the VCO LC-tank has a nominal value of 6.9 nH with a simulated quality factor of 15 at half of the carrier frequency (i.e., 1.2 GHz). Each of the VCO cross-coupled transistors has a size of 50  $\mu$ m/0.12  $\mu$ m and the tail current source  $M_{\text{tail}}$  has a size of 600  $\mu$ m/0.6  $\mu$ m. The tail capacitor used to maintain Class-C operation  $C_{\text{tail}}$  has a value of 5 pF.

The PA was implemented as shown in Fig. 6(b). Each of the transistors in the boosted common-gate stage (M3a and M3b in Fig. 6(b)) has a size of 400  $\mu$ m/0.18  $\mu$ m. To allow full characterization for a wide range of bias current, the capacitive divider was implemented with 3-bit resolution to adjust the amplitude of the LO signals at the gate of M3a/M3b upon the bias current magnitude. The divider attenuation can be programmed between 2/3 and 1. This flexibility unfortunately cost a degradation of the overall quality factor of the tank when the divider attenuation is 2/3 (i.e., all switches on).



Fig. 11. TX chip micrograph.



Fig. 12. Transmitted power and TX efficiency versus TX bias current.

The impedance transformation between the PA output and the antenna was realized on the evaluation board to have more flexibility during the testing phase. The L-matching network also provides a DC path for the bias current. The modulated signal is provided externally through an R&S SMA200W Vector Signal Generator while the transmitted signal is sensed by an R&S FSW26 Signal Analyser.

As previously reported, the VCO has to be biased properly to maintain its Class-C operation. The scheme proposed by [10] has been applied, as shown in Fig. 10. An operational amplifier is able to set the DC value at the drain of the current mirror  $M_{\text{tail}}$ ; thus, current flowing through the current mirror will be maintained constant during both start-up and steady state. By applying this solution, oscillation amplitude can be maximized while still keeping oscillation start-up robust.

## VI. MEASUREMENT RESULTS

The chip micrograph is reported in Fig. 11. The die measures 1.3 mm  $\times$  1.0 mm while the active area of the design is 0.2 mm<sup>2</sup> (dominated by the integrated inductor). The rest of the area is occupied by the pads and decoupling capacitors to better filter interferences coming from the external biases. The supply voltage is 1.2 V for the entire chip.  $V_{\text{DD,VCO}}$  was set to 0.9 V to minimize the PN and with it carrier-frequency drift and the FSK error.

The output power and the TX efficiency versus the TX bias current are plotted in Fig. 12. Note that the TX transmission



Fig. 13. PN and FoM versus VCO bias current.



Fig. 14. FTR under capacitive divider setting 1002.

occurs when the PLL is in open-loop and powered off. Thus, the only excess power consumption other than the PA-VCO stack is two operational amplifiers (one used for Class-C VCO biasing and another used as a buffer in the PLL loop). These op-amps consume 0.2 mA in total. The peak TX efficiency is equal to 17.5% when the TX consumes a total current of 3.8 mA. Since power dissipations of both the VCO and the PA scale during the sweep, the overall efficiency can be kept above 11% in the entire range from 2.0 mA to 3.8 mA. From (11), it is possible to extrapolate the efficiency of the PA to be approximately 70% when delivering an output power of -1 dBm. Notice that, in all of the measurements, the matching network (optimized for an output power around 0 dBm) has been kept constant.

The free running VCO achieves almost constant PN level through the entire ISM band. A plot of the PN and the Figure of Merit (FoM) versus the VCO bias current is reported in Fig. 13. The measurements were taken with a carrier of

|                                           | This work      | [14]         | [15]          | [16]          | [17]           | [4]              | [18]             | [19]         | [20]           |
|-------------------------------------------|----------------|--------------|---------------|---------------|----------------|------------------|------------------|--------------|----------------|
| Max. Transmitted<br>Power [dBm]           | -1             | +4           | -3            | +5            | -1             | +1.9             | +1.6             | +3           | +1             |
| Peak Efficiency (@<br>output power [dBm]) | 17.5%<br>(@-1) | 9.1%<br>(@0) | 9.8%<br>(@-3) | 10.5%<br>(@0) | 14.7%<br>(@-1) | 17.4%<br>(@+1.9) | 24.5%<br>(@+1.6) | 36%<br>(@+3) | 28.6%<br>(@+1) |
| Modulation Type                           | GFSK           | GFSK         | GFSK          | GFSK          | GFSK           | GFSK             | GFSK             | GFSK         | GFSK           |
| Data rate [Mb/s]                          | 1              | 1            | 1             | 1             | 1              | 1                | 1                | 1            | 1              |
| FSK Error [%]                             | 0.69           | N.A.         | N.A.          | N.A.          | 2.8            | N.A.             | N.A.             | N.A.         | 4.78           |
| Max. Carrier-<br>Frequency Drift [Hz/µs]  | 7              | N.A.         | N.A.          | N.A.          | N.A.           | 20               | 20               | N.A.         | 57.44          |
| PN [dBc/Hz]<br>@2.5MHz                    | -129           | N.A.         | -116*         | -125          | -119*          | -118.5           | -117             | -124*        | N.A.           |
| Technology                                | 130nm<br>CMOS  | 90nm<br>CMOS | 65nm<br>CMOS  | 65nm<br>CMOS  | 90nm<br>CMOS   | 130nm<br>CMOS    | 130nm<br>CMOS    | 28nm<br>CMOS | 40nm<br>CMOS   |
| Power dissipation [mW]                    | 4.56           | 11           | 5.1           | 9.5           | 5.4            | 8.9              | 5.9              | 5.5          | 4.4            |
| Supply Voltage [V]                        | 1.2            | 1.2          | 1.1           | 1.3           | 1.2            | 1.0              | 1.0              | 0.5/1.0      | 1              |
| Chip Area [mm <sup>2</sup> ]              | 1.3/0.2**      | 1.95         | 7.84          | 0.35**        | 3.61           | 1.54             | 2.1              | N.A.         | 1.3            |

TABLE I Chip Summary and Comparison With State of the Art

\* PN @2.5MHz was normalized from the one at 1MHz \*\* Core area

2.45 GHz at 2.5 MHz frequency offset. The minimum PN of -132 dBc/Hz is obtained for a TX bias current equal to 2.8 mA. After this point, the amplitude of the VCO exceeds the one required to operate efficiently in Class-C. The following oscillator FoM has been used in the calculation [13]:

$$FoM = -L \left(\Delta f\right) + 20 \cdot \log\left(\frac{f_{osc}}{\Delta f}\right) - 10 \cdot \log\left(\frac{P_{diss}}{1 \text{ mW}}\right) \quad (14)$$

where  $f_{\rm osc}$  is center frequency,  $\Delta f$  is the offset frequency from the center,  $L(\Delta f)$  is the PN at specific offset, and  $P_{\rm diss}$ is the DC power consumed by the VCO. With  $V_{\rm DD,VCO}$  equal to 0.9 V, the peak FoM achieved is 188 dBc/Hz. The VCO frequency tuning range (FTR) is reported in Fig. 14 when the 3-bit capacitive divider is set to binary value 100<sub>2</sub> (typical operative condition). By acting on the 3-bit capacitor bank and the varactor connected to the PLL, the output frequency sweeps from 2.26 GHz to 2.58 GHz, leading to a FTR of 13.2%.

The TX performance was evaluated by employing both FSK and GFSK modulation signals (with a modulation index of 0.5). The modulations were applied by calibrating the amplitude of the signal driving the varactor on-board. The spectrum plot is shown in Fig. 15; the emission mask is respected for both cases with excess margin since low power is transmitted. At a data rate of 1 Mb/s, a maximum carrier-frequency drift of 7 Hz/ $\mu$ s and an FSK error below 0.7% were obtained. A screenshot of the output spectrum between 0.5 GHz and 10 GHz while delivering a single tone at -1 dBm is reported in Fig. 16 (0.6 dB of cable/board losses are not de-embedded). All higher-order harmonics of the LO and the transmitted signal remain below -40 dBm.

A summary of the performance of the proposed design and other GFSK transmitters at 2.4 GHz is reported in Table I.



Fig. 15. Transmitted PSD with FSK and GFSK modulation index of 0.5 at 1 Mb/s data rate (5 MHz span and 10 kHz resolution bandwidth).

The comparison with different solutions is not simple since several TXs were designed to maximize their efficiency for  $P_{\rm RF}$  higher than 0 dBm, e.g., [16]. Comparing with other sub-mW output power TXs, a significant improvement of carrier-frequency drift and the FSK error was obtained without sacrificing efficiency. Since the TX is in open-loop, the oscillator PN affects not only the out-of-band emission performance but also carrier-frequency drift and the FSK error. Although the VCO burns the majority of the power, a comparable efficiency was obtained due to a higher PA efficiency thanks to the PA operation with a very low voltage supply. Although the prototype has an external matching network for testing purposes (i.e., a high Q inductor), its impact on the overall



Fig. 16. The 0.5 GHz-10 GHz output spectrum while delivering single tone at -1 dBm (0.6 dB of cable/board losses are not de-embedded).

efficiency is minimal since  $V_{\text{DD,PA}}$  resembles  $V_{\text{RF}}$  and, as shown by (4) and (12), in such case  $\eta_M$  is close to one regardless of the  $Q_{\text{L}}$  value.

#### VII. CONCLUSION

A current-reuse PA-VCO topology for low power FSK/GFSK transmitters has been presented. By stacking a Class-C VCO and a boosted common-gate PA, high TX efficiency at low output power (-1 dBm) was achieved. The proposed design demonstrated low carrier-frequency drift and FSK error without significantly sacrificing the TX efficiency due to the PA operation with a small voltage supply.

#### REFERENCES

- Bluetooth Low Energy, Bluetooth, 2015, [Online]. Available: http://www.bluetooth.com/what-is-bluetooth-technology/bluetoothtechnology-basics/low-energy
- [2] Network Specifications, ZigBee Alliance, 2015, [Online]. Available: http://www.zigbee.org/zigbee-for-developers/network-specifications/
- [3] A. Selvakumar et al., "Sub-mW current re-use receiver front-end for wireless sensor network applications," *IEEE J. Solid-State Circuits*, vol. 50, no. 12, pp. 2965–2974, Dec. 2015.
- [4] J. Masuch and M. Delgado-Restituto, "A sub-10 nJ/b 1.9-dBm output power FSK transmitter for body area network applications," *IEEE Trans. Microw. Theory Techn.*, vol. 60, no. 5, pp. 1413–1423, Apr. 2012.
- [5] Y. Han and D. J. Perreault, "Analysis and design of high efficiency matching networks," *IEEE Trans. Power Electron.*, vol. 21, no. 5, pp. 1484–1491, Sep. 2006.
- [6] H. Amir-Aslanzadeh et al., "Current-reused 2.4-GHz direct-modulation transmitter with on-chip automatic tuning," *IEEE Trans. Very Large Scale Integr. (VLSI) Syst.*, vol. 21, no. 4, pp. 732–746, Apr. 2013.

- [7] L. Zheng *et al.*, "Design and analysis of a current-reuse transmitter for ultra-low power applications," in *Proc. IEEE Int. Symp. Circuits Syst.*, May 2009, pp. 1317–1320.
- [8] C. L. A. Liscidini, "A current re-use PA-VCO cell for low-power BLE transmitters," in *Proc. 41st Eur. Solid-State Circuits Conf.*, Sep. 2015, pp. 172–175.
- [9] A. Mazzanti and P. Andreani, "Class-C harmonic CMOS VCOs, with a general result on phase noise," *IEEE J. Solid-State Circuits*, vol. 43, no. 12, pp. 2716–2729, Dec. 2008.
- [10] L. Fanori and P. Andreani, "Low-phase-noise 3.4–4.5 GHz dynamic-bias class-C CMOS VCOs with a FoM of 191 dBc/Hz," in *Proc. ESSCIRC*, Sep. 2011, pp. 406–409.
- [11] J. Bae *et al.*, "A low energy injection-locked FSK transceiver with frequency-to-amplitude conversion for body sensor applications," *IEEE J. Solid-State Circuits*, vol. 46, no. 4, pp. 928–937, Apr. 2011.
- [12] M. Garampazzi *et al.*, "An intuitive analysis of phase noise fundamental limits suitable for benchmarking LC oscillators," *IEEE J. Solid-State Circuits*, vol. 49, no. 3, pp. 635–645, Mar. 2014.
- [13] B. Razavi, "Figures of merit of VCOs," in *RF Microelectronics*, 2nd ed. New York, NJ, USA: Pearson Education Inc., 2012, ch. 8, sect. 7.6, p. 570.
- [14] F. Pengg et al., "A low power miniaturized 1.95 mm2 fully integrated transceiver with fastPLL mode for IEEE 802.15.4/bluetooth smart and proprietary 2.4 GHz applications," in Proc. IEEE Radio Frequency Integr. Circuits Symp., Jun. 2013, pp. 71–74.
- [15] G. Devita et al., "A 5 mW multi-standard Bluetooth LE/IEEE 802.15.6 SoC for WBAN applications," in Proc. 40th Eur. Solid State Circuits Conf., Sep. 2014, pp. 283–286.
- [16] S. Chakraborty et al., "A 1.9 nJ/bit, 5 Mbps multi-standard ISM band wireless transmitter using fully digital PLL," in Proc. IEEE Radio Frequency Integr. Circuits Symp., Jun. 2013, pp. 75–78.
- [17] L. Yao-Hong et al., "A 2.7 nJ/b multi-standard 2.3/2.4 GHz polar transmitter for wireless sensor networks," in *IEEE Int. Solid-State Circuits Conf. Dig. Tech. Papers*, Feb. 2012, pp. 448–450.
- [18] J. Masuch and M. Delgado-Restituto, "A 1.1-mW-RX -81.4 dBm sensitivity CMOS transceiver for bluetooth low energy," *IEEE Trans. Microw. Theory Techn.*, vol. 61, no. 4, pp. 1660–1673, Apr. 2013.

- [19] F. Kuo et al., "A fully integrated 28 nm Bluetooth Low-Energy transmitter with 36% system efficiency at 3 dBm," in Proc. 41st Eur. Solid-State Circuits Conf., Sep. 2015, pp. 356–359.
- [20] Y. Liu et al., "13.2 A 3.7 mW-RX 4.4 mW-TX fully integrated Bluetooth Low-Energy/IEEE802.15.4/proprietary SoC with an ADPLL-based fast frequency offset compensation in 40 nm CMOS," in *IEEE Int. Solid-State Circuits Conf.*, Feb. 2015, pp. 1–3.



Antonio Liscidini (S'99–M'06–SM'13) was born in Tirano, Italy, in 1977. He received the Laurea degree (*summa cum laude*) and Ph.D. degree in electrical engineering from the University of Pavia, Pavia, Italy, in 2002 and 2006, respectively.

He was a summer Intern with National Semiconductors, Santa Clara, CA, USA, in 2003, studying poly phase filters and CMOS LNAs. From 2008 to 2012, he was an Assistant Professor with the University of Pavia and a Consultant with Marvell Semiconductors in the area of integrated circuit

design. Since December 2012, he has been an Assistant Professor with the Edward S. Rogers Sr. Department of Electrical and Computer Engineering, University of Toronto, Toronto, ON, Canada. His research interests are in the implementations of transceivers and frequency synthesizers for cellular and ultra low power applications.

Prof. Liscidini was the recipient of the Best Student Paper Award at IEEE 2005 Symposium on VLSI Circuits and the Best Invited Paper Award at 2011 IEEE Custom Integrated Circuit Conference. From 2008 to 2011, he served as an associate editor of the IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: EXPRESS BRIEFS, and he served as a guest editor of the IEEE JOURNAL OF SOLID STATE CIRCUITS for the special issue on ESSCIRC Conference, July 2013. Since 2016, he has been a Distinguished Lecturer of the IEEE Solid State Circuit Society. Currently he is member of the TPC of the European Solid State Circuit Conference (ESSCIRC) and of the International Solid State Circuit Conference.



**Chuanwei Li** (S'13) received the B.A.Sc degree (with honors) in electrical engineering from the University of Toronto, Toronto, ON, Canada, in 2014, where he is currently working toward the M.A.Sc degree in electrical engineering/electronics under Prof. Antonio Liscidini.

He was an Intern IC Design Engineer with Analog Devices, inc., Toronto, ON, Canada, from 2012 to 2013. His research is focused on the implementation of transceivers for cellular and ultra-low power applications.