This is an author-formatted work. The archival version for citation is <S. Salehi, D. Fan, and R. F. DeMara, "Survey of STT-MRAM Cell Design Strategies: Taxonomy and Sense Amplifier Tradeoffs for Resiliency," ACM Journal on Emerging Technologies in Computing Systems, (2016), in-press.> which is available at <http://jetc.acm.org/index.cfm>."

# Survey of STT-MRAM Cell Design Strategies: Taxonomy and Sense Amplifier Tradeoffs for Resiliency

SOHEIL SALEHI, University of Central Florida DELIANG FAN, University of Central Florida RONALD F. DEMARA, University of Central Florida

Spin-Transfer Torque Random Access Memory (STT-MRAM) has been explored as a post-CMOS technology for embedded and data storage applications seeking non-volatility, near-zero standby energy, and high density. Towards attaining these objectives for practical implementations, various techniques to mitigate the specific reliability challenges associated with STT-MRAM elements are surveyed, classified, and assessed herein. Cost and suitability metrics assessed include the area of nanomagmetic and CMOS components per bit, access time and complexity, sense margin, and energy or power consumption costs versus resiliency benefits. Solutions to the reliability issues identified are addressed within a taxonomy created to categorize the current and future approaches to reliable STT-MRAM designs. A variety of destructive and non-destructive sensing schemes are assessed for process variation tolerance, read disturbance reduction, sense margin, and write polarization asymmetry compensation. The highest resiliency strategies deliver a sensing margin above 300mV while incurring low power and energy consumption on the order of picojoules and microwatts, respectively, and attaining read sense latency of a few nanoseconds down to hundreds of picoseconds for non-destructive and destructive sensing schemes, respectively.

# Hardware-Spintronics and magnetic technologies

Additional Key Words and Phrases: Spin-Transfer Torque storage elements, STT-MRAM, Magnetic Tunnel Junction (MTJ), Sense Amplifier, Process Variation, Read/Write Reliability, Design, Performance, Reliability

Authors' addresses: S. Salehi, D. Fan, and Ronald F. DeMara, Department of Electrical Engineering and Computer Science, University of Central Florida, Orlando, FL, USA; email: soheil.salehi@knights.ucf.edu.

## 1. INTRODUCTION

As technology scales down along with increased demands of greater on-chip integration for larger memory capacities, researchers and designers have responded to the resulting fabrication and operational challenges by embracing new device technologies along with new memory cell designs which leverage their unique advantages. A collection of innovative methods has been developed to increase their reliability and performance. In addition to addressing scalability to technologies beyond 10nm where traditional memory elements such as Static Random Access Memory (SRAM) and Dynamic Random Access Memory (DRAM) face significant scaling challenges [Sun, *et al.* 2012], innovations to mitigate the power wall and reduce leakage power consumption occupy the forefront of on-chip memory design considerations. [Bishnoi, *et al.* 2014; Ashraf, *et al.* 2015; Zhao, *et al.* 2009]. Power consumed by memory elements can become a significant portion of total power in active modes whereby the processing cores rely on these memory arrays that are significant contributors to standby mode power consumption [Kultursay, *et al.* 2013; Chen, *et al.* 2016].

To attain these goals and deliver the necessary operational characteristics, emerging memory devices such as Resistive RAM (RRAM), Phase Change Memory (PCM), and Magnetic RAM (MRAM) offer several potential advantages. Among promising devices, the 2014 ITRS Magnetism Roadmap identifies nanomagnetic devices such as Spin-Transfer Torque Magnetic Logic (NML), Domain Wall Motion (DWM), Spin-Transfer Torque (STT)-MTJ/Spin Hall Effect (SHE)-MTJ are considered feasibly-implemented [Zand, *et al.* 2016; Roohi, *et al.* 2016; Pyle, *et al.* 2016; Shamsi, *et al.* 2015] whose attributes are depicted in Figure 1. STT-MRAM can offer low read access time, near-zero standby power consumption, and small area requirement. STT-MRAM also offers integration with backend CMOS processes. To embrace their adoption in anticipated applications, a palette of cooperating reliability techniques is identified and compared at the bit-cell level.



Figure 1: Advantages and reliability challenges of STT-MRAM.

In this survey, the primary focus is on reliability issues that may affect the performance of the STT-MRAM. First, an overview of STT-MRAM functionality and technology aspects is provided in Section 2. In Section 3, reliability issues of the STT-MRAM have been identified and classified. In Section 4, some potential solutions have been presented in terms of a novel taxonomy to organize approaches and their advantages, along with a detailed discussion about previous works. Finally, this review concludes in Section 5 with summary of recommended techniques and rationale.



Figure 2: (a) 1T-1R STT-MRAM cell structure, (b) Right: Anti-parallel (high resistance), Left: Parallel (low resistance).

## 2. STT-MRAM OVERVIEW

Magnetic Tunnel Junction (MTJ) devices are constructed with layered pillars of ferromagnetic and insulating layers to leverage magnetic orientations that can be controlled and sensed in terms of electrical signal levels. Spin-Transfer Torque (STT) switching is one of the most promising alternatives for data storage. MTJ device which consists of two ferromagnetic layers called the reference layer and the free layer and one tunnel barrier oxide layer, is used in STT-MRAM cells to store data as binary values 0 or 1 [Sun, *et al.* 2012] as shown in Figure 2. Figure 2a shows an STT-MRAM cell which has an access transistor that connects the storage device and the bit-line which is the same as DRAM cell, however, an STT-MRAM cell differs from DRAM since the other end of the storage device is connected to the sense line or source line instead of ground. This STT-MRAM cell structure is being known as "one-transistor-one-MTJ (1T-1R)" [Kang, Zhang, Wang, *et al.* 2015; Fong, *et al.* 2016].

As mentioned the stored data in an MTJ can be represented as the magnetic orientation of the free layer compared to fixed layer which these two layers introduce a resistance that shows either 0 or 1. As shown in Figure 2b, identical magnetic orientation results in parallel configuration which introduces a low resistance that can be represented as logical "0" and opposite magnetic orientation results in anti-parallel configuration which introduces high resistance that can be represented as logical "1". Since in STT-MRAM, relative resistance values are used to determine the bit value, then reliable sensing schemes are utilized in order to read the stored data compared to DRAM in which the data is the charge stored in a capacitor and the data is sensed using voltage sensing circuitry. In order to read the stored data to an MTJ, requires significantly larger current flow. Large write amplifiers are used since writing a data to an MTJ, requires significantly larger current than reading the stored value. In order to write into an MTJ cell, we need to apply a large current through the MTJ to change the magnetic orientation of its free layer to form a parallel or an anti-parallel configuration compared to the fixed layer [Fong, *et al.* 2016].

Maintaining lower critical current while having faster speed as well as large energy barrier for high-density device, researchers have proposed data storage using Perpendicular Magnetic Anisotropy (PMA) instead of conventional In-plane Magnetic Anisotropy (IMA) in order to achieve high anisotropy field  $H_k$ . This will result in better integration of STT-MRAM into logic circuits [Zhao, *et al.* 2012]. Detailed information about STT-MRAM circuit design and operation can be found in [Sun, *et al.* 2012] and [Kultursay, *et al.* 2013].

STT-MRAM writing has three different operating regions for switching which include Thermal Activation Region where the write current is less than eighty percent of the critical current, Dynamic Reversal Region wherein the write current is greater than eighty percent of the critical current and less than the critical current itself, and Precessional Region where the write current is greater than the critical current [Kang, Zhang, Wang, *et al.* 2015]. We normally operate this in either the Thermal Activation Region or the Precessional Region. If fast switching is required Precessional Region is a better option compared to Thermal Activation Region since the latter will cause slower switching.

## 3. RELIABLITY ISSUES OF STT-MRAM

STT-MRAM has several advantages over other emerging memory technologies, however, it faces some distinct reliability challenges involving read and write failures [Bishnoi, *et al.* 2014; Kang, Li, Klein, *et al.* 2014] as listed in this Section. STT-MRAM scalability is greatly influenced and limited due to thermal fluctuations and issues such as MTJ process variations and the CMOS access transistor have had negative effects on STT-MRAM devices. Also, as a result of these issues, demand for an advanced sensing circuit which can provide required sensing margin along with low power operation has been increased as will be addressed in Section 4.

STT-MRAM bit errors can be significantly influenced due to process variations [Emre, *et al.*], which precipitate another important issue that STT-MRAM suffers from as well as suffering from its unique intrinsic thermal randomness. These variations include variation in the access transistor sizes, variation in threshold voltage ( $V_{th}$ ), MTJ geometric variation, and initial angle of the MTJ. Whereas the effect of variation involving the access transistor on system performance has been investigated in [Eken, *et al.* 2014; Alghareb, *et al.*], here we focus on the process variation of the MTJ cell. The difference between the sensed bit-line voltage and the reference voltage which is known as the Sense Margin will be small due to the wide distribution of MTJ resistance which can also result in a false detection scenario [Zhang, *et al.* 2013]. On the other hand, write speed can be affected and may vary due to the thermal fluctuations during MTJ switching in write operations and this will further aggravate by process variation-induced variability of the switching current [Eken, *et al.* 2014].

Errors due to the STT-MRAM physical nature's failures will be categorized into transient faults and permanent faults as depicted in Figure 3. Transient faults, which can also be described as an incorrect signal condition, is mostly caused by the parameters of free layer such as current density  $(J_c)$ and thermal stability factor denoted by  $\Delta$ . Permanent faults, which can be precipitated by destructive device damage, are initially caused by susceptibility to the sensitive parameters of oxide barrier such as barrier's thickness  $(t_{ox})$  and Tunnel Magneto-Resistance (TMR) ratio [Zhao, *et al.* 2012] which have been expanded with additional parameters in Table 1.



Figure 3: Taxonomy of STT-MRAM Device Failures.

#### 4. RELATED WORKS

Several studies have shown that exposures to permanent and transient faults can be mitigated in various device technologies by employing spatial hardware redundancy, temporal operational redundancy, error correcting codes, and resiliency via active reconfiguration. In order to make reliable STT-MRAM cells which are less vulnerable to alpha-particle-induced transient faults, a variety of design strategies and different considerations have been proposed in [Zhao, *et al.* 2009; Lakys, *et al.* 2012; Zhao, *et al.* 2011; Kawahara, *et al.* 2012; Li, *et al.* 2010]. For instance, in [Kawahara, *et al.* 2012], concept of '1'/0' dual-array equalized reference is proposed that reduces the error rate by lowering the read current which introduces a precise reference for stable read operation. Research studies have

shown that STT-MRAM memory devices are vulnerable to radiation induced transient faults due to their CMOS peripheral circuit used to read and write in the MTJ cells [Chabi, *et al.* 2014; Yang, *et al.* 2016; Kang, Zhao, *et al.* 2014]. It has been proven experimentally in [Kang, Zhao, *et al.* 2014] that the MTJ device itself is resilient to radiation induced transient faults. In order to reduce the vulnerability of the STT-MRAM devices to radiation induced transient faults caused by elements such as alpha particle, Kang, et al. 2013]. They have experimentally shown that their design increases the robustness of memory and logic devices using hybrid CMOS/STT-MRAM to radiation induced transient faults such as Single Event Upsets (SEUs) and Multiple Bit Upsets (MBUs) [Tsiligiannis, *et al.* 2013].

| T               |                      | Туре                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                     | Possible Solution(s)                                                                                                                                                                                                                                                                                                                                                                                                                                               |  |  |  |  |  |  |
|-----------------|----------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--|--|--|--|--|--|
|                 | liability Issues     | Write Failure                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           | <ul> <li>Due to stochastic nature of write process in STT-MRAM</li> <li>MTJ cell does not switch properly in order to store the required value within write time period</li> </ul>                                                                                                                                                                                                                                                                              | <ul> <li>Possible solution can be increasing the write duration</li> <li>Another possible solution can be increasing write current</li> <li>These solutions may cause significant amount of power dissipation and area overhead as well as speed degradation</li> </ul>                                                                                                                                                                                            |  |  |  |  |  |  |
|                 | Write Re             | Write<br>Polarization<br>Asymmetry                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      | <ul> <li>P to AP needs higher switching current and suffers<br/>from more error rate compared to AP to P switching</li> </ul>                                                                                                                                                                                                                                                                                                                                   | <ul> <li>Reversed MTJ Connection results in a larger <i>I<sub>MTJ</sub></i> for the<br/>P to AP switching which alleviates the effect of the <i>I<sub>c</sub></i><br/>asymmetry (<i>I<sub>c</sub></i> (P→AP)/<i>I<sub>c</sub></i> (AP→P) &gt; 1)</li> </ul>                                                                                                                                                                                                        |  |  |  |  |  |  |
| ransient Faults | ues                  | Failures due<br>to Read<br>Disturb                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      | <ul> <li>Read and Write share the same path</li> <li>Unwanted bit-flip during a read operation</li> <li>Growing with technology scaling</li> <li>&gt; thermal stability factor ∆ decreases</li> <li>&gt; critical switching current decreases</li> </ul>                                                                                                                                                                                                        | <ul> <li>Possible solution can be increasing the margin between read and write currents</li> <li>Increasing the write current is one of the solutions which may not be feasible since write current maintains a high value in STT-MRAM devices.</li> <li>Also can be done by decreasing the read current which will Increase the read latency and may result in another reliability issue called decision failure</li> <li>Error Correction Codes (ECC)</li> </ul> |  |  |  |  |  |  |
| 1               | Read Reliability Iss | Readability<br>Degradation<br>at Scaled<br>Technology<br>Node                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           | <ul> <li>Due to reduction in switching current which will<br/>become a greater concern in scaled technology<br/>node</li> <li>Reduction in switching current will limit the upper-<br/>bound of sensing current</li> </ul>                                                                                                                                                                                                                                      | <ul> <li>High read current is required in order to:</li> <li>&gt; Provide enough sense margin</li> <li>&gt; Ensure reliable sensing by excluding the device variation of the sense amplifier</li> <li>&gt; Maintain fast read and reduce read latency</li> <li>Low read current is required in order to:</li> <li>&gt; Prevent stored data from being upset</li> </ul>                                                                                             |  |  |  |  |  |  |
|                 |                      | Decision<br>Failure                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     | <ul> <li>When reading an MTJ cell, not being able to<br/>distinguish whether the stored bit is zero or one</li> </ul>                                                                                                                                                                                                                                                                                                                                           | <ul> <li>Possible solution can be increasing the read duration</li> <li>Another possible solution can be increasing read current</li> </ul>                                                                                                                                                                                                                                                                                                                        |  |  |  |  |  |  |
|                 |                      | Retention<br>Failure                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    | <ul> <li>STT-MRAM had an intrinsic thermal instability which<br/>can result to a bit-flip of an MTJ cell's content</li> </ul>                                                                                                                                                                                                                                                                                                                                   | <ul> <li>One solution at the device-level is exploiting the thermal stability factor <i>∆</i></li> <li>Increasing <i>∆</i> results in longer read duration, larger current amplitude and increase in number of bits per word during parallel reading</li> </ul>                                                                                                                                                                                                    |  |  |  |  |  |  |
| nont Fourtes    | nent rauns           | <ul> <li>Switching current and switching duration are inversely proportional to each other</li> <li>High current density J<sub>c</sub> is normally required in order to achieve high speed based on: V = R.A × J<sub>c</sub></li> <li>In order to have a better switching probability we have to provide a large sensing margin and in order to maintain high current density we can reduce the Resistance Area product (<i>R</i>.<i>A</i>) or thickness of the oxide or increase the bias voltage V</li> <li>Each of these solutions can result in the oxide barrier breakdown and shorten the MTJ lifetime</li> </ul> |                                                                                                                                                                                                                                                                                                                                                                                                                                                                 | <ul> <li>Using Modular Redundancy</li> <li>In order to prevent permanent faults, oxide thickness variation is required to be less than 5%</li> <li>Using a low bias voltage for sensing is suggested since</li> </ul>                                                                                                                                                                                                                                              |  |  |  |  |  |  |
| Downod          | Leuna                | Barrier<br>Thickness<br>Variability                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     | <ul> <li>Maintaining low <i>R.A</i> value, favorably ultra-thin insulator or oxide barrier is required</li> <li>MTJ's resistance is proportional to the oxide thickness exponentially. Increase in bias voltage will result in decrease in <i>TMR</i> ratio and <i>TMR</i> ratio may become less than the resistance Variation Ratio (VR)</li> <li>&gt; In this case, sensing margin will be upset by VR and permanent faults will occur as a result</li> </ul> | the real <i>TMR</i> ratio decreases during the sensing operation                                                                                                                                                                                                                                                                                                                                                                                                   |  |  |  |  |  |  |

### Table 1: STT-MRAM Reliability Issues.

Furthermore, in order to tolerate permanent faults, two solutions have been proposed in [Zhao, et al. 2012]. One option is Triple Modular Redundancy using a majority voter and the other option is to resize the active transistors. However, both options introduce area overheads whereas Triple Modular Redundancy requires two additional S.A.s along with a voting circuit, and meanwhile the second option utilizes a larger transistor. While Triple Modular Redundancy is a popular approach for masking soft errors and providing single-fault coverage in various circuits, it incurs roughly three-fold increases in area and energy. In a recent research study in [Kang, Zhao, Wang, et al. 2013], Kang et al. have proposed a novel area efficient and high speed Error Correcting Code (ECC) circuit utilizing Orthogonal Latin Square Code (OLSC) in order to increase the reliability of STT-MRAM with the option of adaptability that enables the system to adapt the error correction based on its needs. Moreover, in order to increase the yield of STT-MRAM devices, Kang et al. have introduce an innovative method to sustain permanent and transient faults using an integration of ECC along with Fault Masking (FM) methods which they address as sECC [Kang, Zhang, Zhao, et al. 2015]. In addition, in their study by combining sECC method with Redundancy Repair (RR) method, they have successfully managed to further improve and optimize the performance of the emerging STT-MRAM devices. Kang et al. in [Kang, Zhang, Zhao, et al. 2015] have managed to repair the permanent faults in the system using redundant elements so called RR, mask transient faults and Single Isolated Faults (SIFs) using sECC.

Alternatively, results of the study in [Zhao, *et al.* 2012] have shown that only resizing the transistors is sufficient for increasing the reliability of the conventional applications, however, for those applications which require extreme sensing reliability, Triple Modular Redundancy technique can be more useful. Considerable amount of research has been performed on improving reliability and performance of STT-MRAM memory devices. In [Xu, *et al.* 2011] the write and read performance of STT-MRAM as last-level on-chip cache have been estimated and analyzed over the processor performance. Moreover, in [Zhou, *et al.* 2009] an early-write-termination scheme was proposed in order to enhance STT-MRAM reliability and reduce STT-MRAM write energy. Furthermore, in [Wang, *et al.* 2008] an implementation of STT-MRAM under different technology nodes has been discussed in which the corresponding process technology and scaling parameters were presented. Furthermore, in order to reduce the probability of accidental bit-flipping and loss of data caused by the current applied during read period, a disturbance-free read scheme was proposed in Gigabit scale STT-MRAM design in [Ono, *et al.* 2009], which, due to process variations of MTJs, this scheme is unable to solve the read failures.

In order to reduce the read disturb probability, [Raychowdhury 2013] has proposed the pulsed read method and [Takemura, *et al.* 2010] has proposed the disruptive reading and restoring scheme. Furthermore, in [Bishnoi, *et al.* 2014], in order to alleviate the read disturb reliability issue, some bit-cell architectures are proposed. Moreover, it has been shown that by increasing the thermal stability factor we can reduce the read disturb rate. However, all of these techniques introduce large area overhead and/or large power dissipation and large delays as mentioned in [Bishnoi, *et al.* 2014].

In general, sensing schemes can be classified into two categories, Destructive and Non-Destructive [Motaman, *et al.* 2015]. Based on the definition presented in this research, Destructive Schemes are more vulnerable to read reliability failures. Non-Destructive Schemes are more tolerant to process variation of reference cell, however, Destructive Schemes, typically, provide smaller read/sense latency as described in the following Sections.

## A) Destructive Sensing Schemes

The first category of strategies to mitigate the cost of self-referencing is through consecutive accesses that restore the destroyed value once it has been reliably read. Several self-reference sensing schemes were proposed to overcome reliability concerns due to process variations of MTJs in STT-MRAM. In [Sun, *et al.* 2012] Sun, et al. have analyzed the Conventional Sensing Scheme (CSS) which compares the bit line voltage to a reference voltage to read the value of a memory bit cell under certain conditions. However, as technology shrinks process variation will be increased and will result in significant standard deviations of sense margin that will lead to large read failure probability. As a result, the chip yield in STT-MRAM design is highly limited due to poor robustness of CSS.

mentioned in [Tanizaki, et al. 2006; Jeong, et al. 2003], original value stored in an MTJ cell in Conventional Self-Reference Sensing Scheme (CSR), will be compared to a reference value which is stored in the same MTJ in a different write cycle. As it can be found in [Tanizaki, et al. 2006; Jeong, et al. 2003], CSR consumes a large amount of power and also introduces long latency. Sun, et al. also analyzed CSR in [Sun, et al. 2012], which needs two write operations that results in long latency and will lead to large power overhead and can also be destructive to the stored value. Comparing CSR to CSS we can conclude that CSR maintains a higher sense margin with the cost of sacrificing the reliability and performance.

One of the destructive sensing strategies that has been recently used in [Zand, et al. 2016] was proposed by Zhao, et al. called Pre-Charge Sensing in [Zhao, et al. 2009; Zhao, et al. 2012] which uses a Pre-Charge Sense Amplifier (PCSA) and minimizes the read current value and read duration compared with conventional static data sensing. As a result of this action, high reliability will be provided for the STT-MRAM while maintaining the same thermal stability factor. This method, which is called Dynamic Sensing Scheme (DSS) [Zhao, et al. 2011], has solved the sensing problem utilizing Dynamic Sensing [Zhao, et al. 2012]. In order to reduce the read current required, Lakys, et al. in [Lakys, et al. 2012] have used Dynamic Sensing Scheme using two word selection transistors for each MTJ cell in order to perform read operations and switching operations. Also with this method the size of the reading transistor can minimized which can lead to reduction of read current far below the disturb margin down to  $10\mu A$ . This method introduces some area overhead compared to conventional 1T-1R STT-MRAM cell designs, however, this area overhead is negligible to implement cross-point memories since in these designs the selection transistors are shared among several MTJs within the same word [Lakys, et al. 2012]. Lakys, et al. also suggest a method called Self-enable Switching Circuit (SSC) to decrease the impact of stochastic switching in [Lakys, et al. 2012] which operates based on relaxing the bias voltage stress on the oxide barrier and utilizing short duration write pulse sequence instead of fixed long writing pulse within switching and sensing operations. This method reduces the probability of oxide breakdown and allows short write pulse durations which will result in reduction in the number of switching operation and this will improve the oxide barrier lifetime [Zhao, et al. 2012]. Later in 2012, Ren, et al. have proposed Body Voltage Sensing Circuit (BVSC) in [Ren, et al. 2012], which like SSC utilizes a short pulse reading scheme that enables fast sensing operations. They have shown that their design provides improved speed at the cost of sacrificing sense margin and also improves reliability of the read operation against read disturbance.

In order to increase the sensing margin while reducing the latency and improving device variation tolerance of the STT-MRAM cell, Zhang, et al. in [Zhang, et al. 2013] have analyzed Regular Differential STT-MRAM Cell Structure (RDAMS). This method uses a differential STT-MRAM cell design including two separate 1T-1R cells which the resistance state of these two are always opposite. This design doubles the maximum sense margin compared to the one of 1T-1R cells (CSS), however, RDAMS capacity is half of the one of the two 1T-1R cells used in this design. In the same research publication, Asymmetric Differential Cell Structure (ADAMS) have been proposed in order to improve the read and write performance of STT-MRAM and also increase the transient fault tolerance compared to RDAMS. This method uses a differential STT-MRAM cell design like RDAMS including two separate 1T-1R cells that one of the MTJ cells is reversely connected to the NMOS transistor. Write latency of ADAMS is the same as RDAMS while maintaining smaller cell area compared to other previous sensing schemes such as CSS, CSR, etc. ADAMS has improved the read latency and has reduced the write error rate [Zhang, et al. 2013].

In an extension to a previous research on PCSA, Kang, et al. proposed Separated Pre-Charge Sense Amplifier (SPCA) [Kang, Deng, et al. 2014]. Based on their simulation result, SPCA has a similar performance in terms of latency and power and provides better reliability with a small area overhead compared to PCSA proposed in [Zhao, et al. 2009]. In another interesting research study proposed in 2014, Eken, et al. in [Eken, et al. 2014] introduce a novel strategy called Field-Assisted STT Self-Referencing Scheme (FA-STT) which utilizes an external magnetic field to generate the self-reference sensing signal. This method offers improved process variation resilience and thermal fluctuation tolerance in STT-MRAM and MTJ switching respectively. It also provides a much better read

|    |                                                             |                                                                      | Area per Cell<br>(device count) |      |     |                                                |                               | Cycle(s)       | Sense                                          | Read<br>Latency          | AVG. Energy                 | Simulation-based                |
|----|-------------------------------------------------------------|----------------------------------------------------------------------|---------------------------------|------|-----|------------------------------------------------|-------------------------------|----------------|------------------------------------------------|--------------------------|-----------------------------|---------------------------------|
| #  | Reference(s)                                                | Approach                                                             | MTJ                             | смоз | Сар | Other                                          | S.A. Type                     | or<br>Stage(s) | Margin<br>(V <sub>P/AP</sub> -V <sub>R</sub> ) | or<br>Sensing<br>Latency | or<br>Power<br>consumption  | or<br>Theoretical               |
| 1  | [Tanizaki, et al. 2006]                                     | Self-Reference Scheme (SRS)                                          | 1                               | 17   | 2   | 0                                              | Conventional                  | 2              | N/A                                            | Large<br>(130ns)         | N/A                         | Simulation-based<br>TSMC 240-nm |
| 2  | [Sun, et al. 2012]                                          | Conventional Sensing Scheme<br>(CSS)                                 | 1                               | 1    | 0   | 0                                              | Conventional                  | 2              | Small<br>(~20mv)                               | Small<br>(2.5ns)         | Low<br>(0.8937pJ)           | Theoretical                     |
| 3  | [Sun, et al. 2012]                                          | Conventional Self-Reference<br>Sensing Scheme (CSR)                  | 1                               | 1    | 2   | 0                                              | Conventional                  | 2              | Large<br>(76.6mv)                              | Large<br>(40ns)          | High<br>(22.05pJ)           | Theoretical                     |
| 4  | [Zhao, <i>et al.</i><br>2012],[Zhao, <i>et al.</i><br>2009] | Dynamic Sensing Scheme<br>Using Pre-Charge Sense<br>Amplifier (PCSA) | 2                               | 7    | 0   | 0                                              | Pre-Charge Sense<br>Amplifier | 2              | Large<br>(N/A)                                 | Small<br>(~164ps)        | Low<br>(~3.17fJ)            | Simulation-based<br>TSMC 65-nm  |
| 5  | [Ren, et al. 2012]                                          | Body Voltage Sensing Circuit<br>(BVSC)                               | 3                               | 21   | 2   | 1 Res                                          | Conventional                  | 1              | Large<br>(195mv)                               | Small<br>(1ns)           | Low<br>(195.5fJ<br>& 300uW) | Simulation-based<br>TSMC 65-nm  |
| 6  | [Lakys, et al. 2012]                                        | Self-enable Switching Circuit<br>(SSC)                               | 2                               | 13   | 0   | 0                                              | Pre-Charge Sense<br>Amplifier | 2              | N/A                                            | Small<br>(<200ps)        | Low<br>(N/A)                | Simulation-based<br>TSMC 65-nm  |
| 7  | [Zhang, et al. 2013]                                        | Regular Differential STT-MRAM<br>Cell Structure (RDAMS)              | 2                               | 4    | 0   | 0                                              | Conventional                  | 1              | Large<br>(N/A)                                 | Small<br>(321.8ps)       | N/A                         | Simulation-based<br>TSMC 45-nm  |
| 8  | [Zhang, et al. 2013]                                        | Asymmetric Differential STT-<br>MRAM Cell Structure (ADAMS)          | 2                               | 4    | 0   | 0                                              | Pre-Charge Sense<br>Amplifier | 1              | Large<br>(N/A)                                 | Small<br>(266.7ps)       | N/A                         | Simulation-based<br>TSMC 45-nm  |
| 9  | [Kang, Deng, et al.<br>2014]                                | Separated Pre-Charge Sense<br>Amplifier (SPCSA)                      | 2                               | 7    | 0   | 0                                              | Pre-Charge Sense<br>Amplifier | 2              | Large<br>(N/A)                                 | Small<br>(~187ps)        | Low<br>(~3.84fJ)            | Simulation-based<br>TSMC 65-nm  |
| 10 | [Eken, et al. 2014]                                         | Field-Assisted STT Self-<br>Referencing Scheme (FA-STT)              | 1                               | 5    | 2   | External<br>B-Field                            | Conventional                  | 2              | Large<br>(>20mv)                               | N/A                      | N/A                         | Simulation-based<br>TSMC 45-nm  |
| 11 | [Motaman, et al. 2015]                                      | Slope Detection Sensing<br>Scheme                                    | 1                               | 20+4 | 3   | 1 Current<br>Source<br>& 2 Switches<br>& 1 Res | Pre-Charge Sense<br>Amplifier | 2              | Large<br>(N/A)                                 | Large<br>(6.8ns)         | Low<br>(150uW)              | Simulation-based<br>TSMC 22-nm  |
| 12 | [Jun-Tae Choi 2016]                                         | Parallel Reading Sense Amplifier<br>(PRSA)                           | 1                               | 18+2 | 2   | 1 Switch                                       | Conventional                  | 2              | Large<br>(N/A)                                 | Large<br>(<20ns)         | N/A                         | Simulation-based<br>TSMC 180-nm |

Table 2: Destructive Sensing Schemes and their attributes.

reliability by improving read sense margin compared to conventional self-reference sensing schemes (CSS, CSR, NDSR, and VDRS) and it significantly reduces the write error rate. Furthermore, Slope Detection Sensing Scheme was suggested by Motaman, et al. in [Motaman, et al. 2015] which will also be categorized as destructive scheme in which they have claimed makes the STT-MRAM cell design more reliable against device mismatch and variations. They have discussed that their design has a high sensing robustness due to eliminating the reference comparison. Finally, in a recent research published in 2016, Parallel Reading Sense Amplifier (PRSA) is proposed [Jun-Tae Choi 2016]. This sense amplifier has two sensing steps that the first read step can be performed in parallel with the write operation which reduces the read latency. Authors in [Jun-Tae Choi 2016] have claimed that PRSA offers large sensing margin.

# B) Non-Destructive Sensing Schemes

The second category of strategies to mitigate the cost of destructive sensing are being discussed in this Section. Two different design methods that have been demonstrated in [Au, *et al.* 2004] are Low-Power Simple Sensing Circuit and High-Sensitivity Switched-Current Sensing Circuit. It has been exhibited that if low power operation is a priority, then Low-Power Simple Sensing Circuit can offer better performance. On the other hand, if a better magneto-resistance ratio and faster reading is required, then High-Sensitivity Switched-Current Sensing Circuit would be preferred. Even though, High-Sensitivity Switched-Current Sensing Circuit are to Low-Power Simple Sensing Circuit in terms of power consumption, it can provide better performance in terms of speed. Reducing the sensing latency and read disturbance faults are of significant importance when designing a sensing circuit. In order to maintain low read latency while improving the device performance, an Adequate-Reference Scheme is proposed in [Tsuchida, *et al.* 2010] that increases the sense margin of STT-MRAM cells which results in read latency reduction. However, the read disturbance error rate might be increased due to increase in the magnitude of the sensing current. Negative-Resistance Read Scheme (NRRS) which is a current-based sensing scheme has been proposed, fabricated, and tested in [Halupka, *et al.* 2010], and authors have analyzed the non-destructive read operation performed.

Moreover, in [Chen, *et al.* 2010] taking advantage of the different current dependences of the high and the low resistance states of an MTJ, another sensing scheme was proposed called Nondestructive Self-Reference Sensing Scheme (NDSR) [Chen, *et al.* 2010] NDSR utilizes a characteristic of MgO- based MTJ which is the difference between the current roll-off slope of high and low resistance states. Based on this characteristic we can see that if the MTJ has a high resistance state, the current roll-off slope will be steeper than the low resistance state. It has been proven in [Chen, *et al.* 2010] that although, this method has two read steps, NDSR has reduced the power consumption and significantly improved the read latency by removing the two write operation and performing one write operation instead. However, compared to CSR, NDSR has a smaller sensing margin. Furthermore, in [Chen, *et al.* 2012] they have improved the sensing margin of NDSR utilizing combined magnetic and circuit level enhancements. Researchers have introduced a dual-voltage row decoder with a charge sharing scheme in [Kim, *et al.* 2011]. This scheme has shown reduced read disturbance while providing short sensing latency, resulting in increase in the yield of STT-MRAM devices.

Device variation tolerance and large sensing margin are other important considerations in designing sensing circuits. In a recent research study published in 2012, a sensing scheme have been proposed and discussed [Kim, et al. 2012], which can tolerate the process variation in the scaled technology nodes. In order to overcome the issues due to process variation and asymmetry of Read Access Pass Yield (RAPY) of the memory cells, respectively they have developed a Source Degeneration Scheme (SDS) and a Balanced Reference Scheme (BRS). Furthermore, Non-Destructive Variability Tolerant Differential Read Scheme was proposed in [Das, et al. 2012] which is targeting the device variation while improving other reliability aspects of the device. This design has the advantage of complimentary inputs as well as providing large sense margin along with better reliability by reducing error rate as mentioned in [Das, et al. 2012]. Later in 2012, due to the small sensing margin of the NDSR proposed in [Chen, et al. 2010], Sun, et al. came up with a circuit to provide a better sensing margin [Sun, et al. 2012]. This method, which is called Voltage-Driven Non-Destructive Self-Referencing Sensing Scheme (VDRS), is more robust than NDSR and maintains a better tolerance on variation of MTJ devices as well as providing a high sense margin. VDSR has low read latency and low power consumption compared to NDSR. In [Sun, et al. 2012] authors shown that this method demonstrates the highest STT-MRAM array yield among all existing sensing schemes of STT-MRAM design. In another research to improve the STT-MRAM reliability, Offset-Tolerant Triple-Stage Sensing Circuit has been proposed by Kang, et al. in [Kang, Zhao, Klein, et al. 2013]. They have verified that their design can reduce the device errors due to process variation and read disturbance which increases the STT-MRAM reliability in scaled technology nodes while providing large sense margin.

In 2014, Kim, et al. in another research have introduced Split-Path Sensing Circuit (SPSC) [Kim, Na, et al. 2014] and have shown that using variable reference voltage, their design consumes less energy compared to Highly-Symmetric Cross-Coupled Current Mirror (HSCC) proposed in [Maffitt, et al. 2006], SDSC [Kim, et al. 2012], and BVSC [Ren, et al. 2012] while providing a large enough sensing margin. Another sensing scheme which can tolerate the process variation in the scaled technology nodes and improve the reliability of STT-MRAM, has been proposed and discussed in [Na, et al. 2014]. In this research study, authors have suggested an Offset-Canceling Triple-Stage Sensing Circuit (OCTS) in order to overcome the issues due to process variation and asymmetry of RAPY of the memory cells that can operate with low currents which will result in avoiding read disturbance. Moreover, in [Kim, Ryu, et al. 2014] Self-Body Biasing Sensing Circuit (Self-BB) is proposed, which is thought to also tolerate issues due to process variation and asymmetry of RAPY of the memory cells while providing better sensing margin compared to conventional sensing schemes and performing fast sensing operations.

Read disturb faults and device variation are highly relative to the scaling of the technology node which both can affect the device reliability in a negative way. The first step in order to prevent the read disturb faults effectively is to detect them. In [Bishnoi, *et al.* 2014], a circuit has been proposed that has the ability to detect the read disturb fault utilizing a self-test mechanism that is supposed to validate its behavior which is called Read Disturb Detection Scheme (RDD). It has been shown that using this method, up-to 95% of the total read disturb faults can be detected while maintaining negligible area and power overhead. In order to reduce device variation effect on STT-MRAM reliability, a variation-tolerant high-reliability sensing scheme has been introduced and designed which is shown to increase the sensing margin with the cost of more delay and loss of speed [Kang, Li,

|    | Reference(s)                                                                                               | Approach                                                                | Area per Cell<br>(device count) |             |     | ell<br>nt)          |                                                 | Cycle(s)       | Sanca                                          | Read              | AVG. Energy      | Simulation-                     |
|----|------------------------------------------------------------------------------------------------------------|-------------------------------------------------------------------------|---------------------------------|-------------|-----|---------------------|-------------------------------------------------|----------------|------------------------------------------------|-------------------|------------------|---------------------------------|
| #  |                                                                                                            |                                                                         |                                 | (device col |     |                     | S.A. Type                                       | or<br>Stage(s) | Margin<br>(V <sub>P/AP</sub> -V <sub>R</sub> ) | or                | or<br>Power      | based<br>or                     |
|    |                                                                                                            |                                                                         |                                 | CMOS        | Сар | Other               |                                                 |                |                                                | Latency           | consumption      | Theoretical                     |
| 1  | [Au, et al. 2004]                                                                                          | Low-Power Simple Sensing                                                |                                 | 15+         | 0   | 0                   | Conventional                                    | 2              | N/A                                            | Large<br>(28ns)   | High<br>(6.23mW) | Simulation-based<br>TSMC 600-nm |
| 2  | [Au, et al. 2004]                                                                                          |                                                                         |                                 | 2×SA        |     |                     |                                                 |                |                                                | Large<br>(16ns)   | High<br>(1.75mW) | Simulation-based<br>TSMC 180-nm |
| 3  | [Au, et al. 2004]                                                                                          | High-Sensitivity Switched-Current Sensing                               | 5                               | 31+         | 0   | 0                   | Conventional                                    | 2              | N/A                                            | Large<br>(16ns)   | High<br>(8.99mW) | Simulation-based<br>TSMC 600-nm |
| 4  | [Au, et al. 2004]                                                                                          | Circuit                                                                 |                                 | 2×SA        |     |                     |                                                 |                |                                                | Large<br>(12ns)   | High<br>(5.70mW) | Simulation-based<br>TSMC 180-nm |
| 5  | [Maffitt, et al. 2006],[Kim, Na,<br>et al. 2014]                                                           | Highly Symmetric Cross-Coupled Current<br>Mirror (HSCC)                 | 5                               | 16          | 0   | 0                   | Conventional                                    | 1              | N/A                                            | Small<br>(3ns)    | Low<br>(0.76pJ)  | Simulation-based<br>TSMC 45-nm  |
| 6  | [Tsuchida, <i>et al.</i><br>2010],[Chankyung Kim, <i>et al.</i><br>2015]                                   | Adequate-Reference Scheme (ARS)                                         | 6                               | 26          | 0   | 1 Current<br>Source | Conventional                                    | 1              | N/A                                            | Large<br>(11ns)   | N/A              | Simulation-based<br>TSMC 65-nm  |
| 7  | [Halupka, et al. 2010]                                                                                     | Negative-Resistance Read Scheme (NRRS)                                  | 1                               | 14+<br>2×SA | 0   | 1 Res               | Conventional                                    | 1              | N/A                                            | Large<br>(8ns)    | N/A              | Simulation-based<br>TSMC 130-nm |
| 8  | [Chen, et al. 2010],[Chen, et al. 2012]                                                                    | Nondestructive Self-Reference Sensing<br>Scheme (NDSR)                  | 1                               | 5           | 1   | 2 Res               | Conventional                                    | 2              | Large<br>(>20mv)                               | Large<br>(15ns)   | Low<br>(1.04pJ)  | Simulation-based<br>TSMC 130-nm |
| 9  | [Kim, et al. 2011]                                                                                         | Disturbance-Free Scheme                                                 | 3                               | 25+SA       | 0   | 0                   | Conventional                                    | 1              | N/A                                            | Large<br>(8ns)    | N/A              | Simulation-based<br>TSMC 45-nm  |
| 10 | [Kim, et al. 2012]                                                                                         | Source Degenerating Scheme and Balanced<br>Reference Scheme (SDSC)      | 3                               | 19          | 0   | 0                   | Conventional                                    | 1              | N/A                                            | Small<br>(1.9ns)  | Low<br>(134.4fJ) | Simulation-based<br>TSMC 65-nm  |
| 11 | [Das, et al. 2012]                                                                                         | Non-Destructive Variability Tolerant<br>Differential Read Scheme        | 4                               | 11+SA       | 0   | 0                   | Conventional                                    | 1              | N/A                                            | N/A               | N/A              | Simulation-based<br>TSMC 22-nm  |
| 12 | [Sun, et al. 2012]                                                                                         | Voltage-Driven Non-destructive Self-<br>Reference Sensing Scheme (VDRS) | 1                               | 9           | 2   | 0                   | Conventional                                    | 2              | Large<br>(>45mv)                               | Large<br>(15ns)   | Low<br>(12.08pJ) | Simulation-based<br>TSMC 130-nm |
| 13 | [Kang, Zhao, Klein <i>, et al.</i><br>2013]                                                                | Offset-Tolerant Triple-Stage Sensing Circuit                            | 5                               | 8+2×S<br>A  | 2   | 0                   | Conventional                                    | 3              | N/A                                            | Small<br>(4.3ns)  | Low<br>(40fJ)    | Simulation-based<br>TSMC 40-nm  |
| 14 | [Kim, Na, et al. 2014]                                                                                     | Split-Path Sensing Circuit (SPSC)                                       | 5                               | 16          | 0   | 0                   | Conventional                                    | 1              | N/A                                            | Small<br>(3ns)    | Low<br>(0.21pJ)  | Simulation-based<br>TSMC 45-nm  |
| 15 | [Kim, Ryu, <i>et al.</i> 2014],[Kang,<br>Li, Klein, <i>et al.</i> 2014]                                    | Offset-Canceling Triple-Stage Sensing Circuit<br>(OCTS)                 | 5                               | 15+14       | 3   | 7 Switches          | Conventional                                    | 3              | Large<br>~45.21mv                              | Large<br>(6.4ns)  | Low<br>(395.5fJ) | Simulation-based<br>TSMC 45-nm  |
| 16 | [Kim, Ryu, et al. 2014]                                                                                    | Self-Body Biasing Sensing Circuit (Self-BB)                             | 3                               | 15          | 0   | 0                   | Conventional                                    | 1              | N/A                                            | Small<br>(2ns)    | N/A              | Simulation-based<br>TSMC 45-nm  |
| 17 | [Bishnoi, et al. 2014]                                                                                     | Read Disturb Detection Scheme (RDD)                                     | 6                               | 37          | 0   | 0                   | Pre-Charge<br>Sense Amplifier                   | 1              | N/A                                            | Small<br>(1.2ns)  | Low<br>(N/A)     | Simulation-based<br>TSMC 65-nm  |
| 18 | [Kang, Li, Wang, et al. 2014]                                                                              | Variation-Tolerant High-Reliability Sensing<br>Scheme                   | 5                               | 17+14       | 3   | 7 Switches          | Conventional                                    | 3              | Large<br>(~227.54mv)                           | Small<br>(3ns)    | N/A              | Simulation-based<br>TSMC 40-nm  |
| 19 | [Kang, Li, Klein, et al. 2014]                                                                             | Variation-Tolerant and Disturbance-Free<br>Sensing Circuit              | 5                               | 17+14       | 3   | 7 Switches          | Conventional                                    | 3              | Large<br>(~102.14mv)                           | Small<br>(3ns)    | N/A              | Simulation-based<br>TSMC 40-nm  |
| 20 | [Chankyung Kim, <i>et al.</i><br>2015],[Yang, <i>et al.</i><br>2016],[Tsiligiannis, <i>et al.</i><br>2013] | Covalent-Bonded Cross-Coupled Current-<br>Mode Sense Amplifier (CBSA)   | 3                               | 31+SA       | 0   | 0                   | Conventional                                    | 2              | N/A                                            | Large<br>(9.1 ns) | N/A              | Simulation-based<br>TSMC 65-nm  |
| 21 | [Lee, et al. 2015]                                                                                         | Pre-Read and Write Sense Amplifier (PWSA)                               | 1                               | 46          | 0   | 1 Res               | Conventional                                    | 4              | Large<br>(360mv)                               | Small<br>(2ns)    | Low<br>(18uW)    | Simulation-based<br>TSMC 65-nm  |
| 22 | [Yang, et al. 2015]                                                                                        | Body-Biasing Feedback Circuit                                           | 3                               | 12+SA       | 0   | 0                   | Conventional                                    | 1              | Large<br>(N/A)                                 | Small<br>(N/A)    | N/A              | Simulation-based<br>TSMC 40-nm  |
| 23 | [Kang, Pang, et al. 2015]                                                                                  | Dynamic Referencing Sensing Scheme<br>(DRS)                             | 5                               | 11          | 0   | 0                   | Conventional                                    | 1              | Large<br>(~22.31mv)                            | N/A               | N/A              | Simulation-based<br>TSMC 40-nm  |
| 24 | [Ran <i>, et al.</i> 2015]                                                                                 | Read Disturb Detection Scheme (RDD)                                     | 5                               | 23          | 0   | 0                   | Latch-type<br>Voltage<br>Comparator             | 1              | N/A                                            | Small<br>(4ns)    | N/A              | Simulation-based<br>TSMC 28-nm  |
| 25 | [Kyungmin Kim, et al. 2015]                                                                                | Degenerated Cross-coupled Sensing Circuit<br>(DCCSC)                    | 3                               | 33          | 0   | 0                   | Degenerated<br>Cross-coupled<br>Sensing Circuit | 1              | Large<br>(>495.3mv)                            | Small<br>(2ns)    | Low<br>(0.195pJ) | Simulation-based<br>TSMC 65-nm  |

#### Table 3: Non-Destructive Sensing Schemes and their attributes.

Wang, *et al.* 2014]. This design includes three stages of sensing. One of these stages is a pre-amplifying stage which utilizes a charge transfer amplifier to amplify the voltage difference between the reference MTJ cell and main MTJ cell. In another research publication, Kang, et al. [Kang, Li, Klein, *et al.* 2014] utilized a modified charge transfer stage and a source follower amplifier which makes the design more reliable against device mismatch and variations, prevents read disturbance, and further improves the sense margin. In a recent research publication, Covalent-Bonded Cross-Coupled Current-Mode Sense Amplifier (CBSA) has been proposed and fabricated in [Chankyung Kim, *et al.* 2015]. In this design the source lines are merged throughout the whole memory array in order to make the design more compact in terms of area efficiency. Authors have shown that this design reduces the mismatch sensitivity of the cross-coupled latch in the sense amplifier design.

Providing reliable solutions are valuable, however if the power consumption of a reliable design is high, then that design cannot be a good alternative. In an effort for maintaining low power while having a reliable design, Lee, et al. have recently proposed Pre-Read and Write Sense Amplifier (PWSA) which they have shown that due to its pre-read stage, then the write error rate can be controlled. Based on their result shown in [Lee, *et al.* 2015], their design provides a fast reading circuit that consumes a small amount of power and increases the reliability through controlling and reducing the write error rate.

Another source of reliability exposure that results in read disturbance or read failure, is the thermal instability. In [Yang, et al. 2015], a Body-Biasing Feedback Circuit is proposed to improve the sensing margin and reliability of the STT-MRAM against thermal instability which will further reduce read failures and disturbance. In [Kang, Pang, et al. 2015], authors have introduced Dynamic Referencing Sensing (DRS) scheme in order to prevent read disturb reliability issue. Based on their design the area overhead is negligible due to the fact that the sensing circuitry is shared among the memory cells along the bit-line or word-line. In their design they manipulate the sensing circuit with regards to the sensed signal and adaptively configure the resistance of the load transistor's resistance. In a recent research [Ran, et al. 2015], another RDD circuit has been demonstrated and examined. Based on the fact that if a read disturb occurs, then the read current will have a sudden change due to change in the resistance of the MTJ and also knowing that this change will be in a unidirectional fashion either from P to AP or from AP to P, they have designed a circuit that can detect the read disturb fault. However, they have also mentioned that there exists a small latency before activation of the RDD circuit which if any read disturb fault occurs within the duration of the latency it will not be detected. They have also exhibited that the probability if detecting the read disturb will increase in their design if TMR ratio increases and/or if the detection time increases. The area overhead of this RDD design is negligible compared to the area of the chip as claimed in [Ran, et al. 2015]. Finally, a Degenerated Cross-Coupled Sensing Circuit (DCCSC) is been proposed in [Kyungmin Kim, et al. 2015] which is proven to have wide sense margin while consuming small amount of energy with a fast sensing time. They have designed a new reference cell that exhibited increased reliability against device mismatch and variations, and ameliorates read disturbances.

#### 5. CONCLUSION

Each of the solutions to the reliability problems of the STT-MRAM leverage different properties of the MTJ switching behavior. Figure 4 depicts the Read/Sense latency vs. Sensing Margin of STT-MRAM Destructive Sensing Schemes which have been proposed as time progressed from the initial designs on the left side of the plot to current designs on the right side of the plot. Recent preferred designs are able to achieve less than 5ns read sensing latency while maintaining wide sensing margins. Figure 5 depicts these values for Non-Destructive Sensing Schemes. Non-Destructive schemes have emphasized lower energy consumption as opposed to maintaining sensing margins. These offer a feasible guide to the circuit designer seeking to trade-off the range of approaches available based on these important parameters of reliability, performance, and energy.



Figure 4: Read Sense Latency vs. Sensing Margin of STT-MRAM Destructive Sensing Schemes and Circuit Designs (with the same order as listed in # column in Table 2).



Figure 5: Read Sense Latency vs. Sensing Margin of STT-MRAM Non-Destructive Sensing Schemes and Circuit Designs (with the same order as listed in # column in Table 3).

Table 2 and Table 3 list detailed numerical values for all of the schemes reviewed in this paper and also a description of qualitative attributes. Performance is seen to span in three different ranges across all proposed design strategies. The highest resiliency strategies deliver a sensing margin above 300mV while incurring low power and energy consumption on the order of picojoules and microwatts, respectively, with read sense latency of a few nanoseconds down to hundreds of picoseconds for non-destructive and destructive sensing schemes, respectively.

Some notable inflection points between reliability and performance occur based on whether tolerating process variation is of primary importance. In that case, destructive techniques such as [Eken, et al. 2014; Zhang, et al. 2013; Motaman, et al. 2015; Kim, et al. 2012; Das, et al. 2012; Na, et al. 2014; Kim, Ryu, et al. 2014; Kang, Li, Wang, et al. 2014; Chankyung Kim, et al. 2015; Kang, Li, Klein, et al. 2014; Kyungmin Kim, et al. 2015] are recommended. On the other hand, if tolerating read disturbance is a governing requirement then [Bishnoi, et al. 2014; Na, et al. 2014; Ren, et al. 2012; Lakys, et al. 2012; Kang, Li, Klein, et al. 2014; Halupka, et al. 2010; Kyungmin Kim, et al. 2015; Kim, et al. 2011; Kang, Zhao, Klein, et al. 2013; Yang, et al. 2015; Ran, et al. 2015] techniques are believed to be more promising. Furthermore, if wide sensing margin is required techniques such as [Eken, et al. 2015]

| Attribute                                 | References                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |  |  |  |  |  |  |
|-------------------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--|--|--|--|--|--|
| Process Variation Tolerant                | [Eken, <i>et al.</i> 2014],[Zhang, <i>et al.</i> 2013],[Motaman, <i>et al.</i> 2015],[Kim, <i>et al.</i> 2012],[Das, <i>et al.</i> 2012],[Na<br><i>et al.</i> 2014],[Kim, Ryu, <i>et al.</i> 2014],[Kang, Li, Wang, <i>et al.</i> 2014],[Kang, Li, Klein, <i>et al.</i><br>2014],[Chankyung Kim, <i>et al.</i> 2015],[Kyungmin Kim, <i>et al.</i> 2015]                                                                                                                              |  |  |  |  |  |  |
| Read Disturb Reduction                    | [Bishnoi, <i>et al.</i> 2014],[Lakys, <i>et al.</i> 2012],[Ren, <i>et al.</i> 2012],[Halupka, <i>et al.</i> 2010],[Kim, <i>et al.</i> 2012],[Xang, Zhao, Klein, <i>et al.</i> 2013],[Na, <i>et al.</i> 2014],[Kang, Li, Klein, <i>et al.</i> 2014],[Yang, <i>et al.</i> 2015],[Ran, <i>et al.</i> 2015]                                                                                                                                                                              |  |  |  |  |  |  |
| Wide Sense Margin                         | [Sun, <i>et al.</i> 2012],[Zhao, <i>et al.</i> 2012],[Eken, <i>et al.</i> 2014],[Zhang, <i>et al.</i> 2013],[Zhao, <i>et al.</i> 2009],[Motaman, <i>et al.</i> 2015],[Chabi, <i>et al.</i> 2014],[Ren, <i>et al.</i> 2012],[Kang, Deng, <i>et al.</i> 2014],[Jun-Tae Choi 2016],[Tsuchida, <i>et al.</i> 2010],[Chen, <i>et al.</i> 2012],[Das, <i>et al.</i> 2012],[Kang, Zhao, Klein, <i>et al.</i> 2013],[Kang, Li, Klein, <i>et al.</i> 2014],[Kyungmin Kim, <i>et al.</i> 2015] |  |  |  |  |  |  |
| Write Polarization Asymmetry<br>Reduction | [Eken, et al. 2014],[Zhang, et al. 2013],[Lee, et al. 2015]                                                                                                                                                                                                                                                                                                                                                                                                                          |  |  |  |  |  |  |
| Yield Increase                            | [Sun, et al. 2012],[Kim, et al. 2011],[Kim, et al. 2012]                                                                                                                                                                                                                                                                                                                                                                                                                             |  |  |  |  |  |  |

Table 4: Suggested Sensing Schemes based on their attributes.

*al.* 2014; Zhang, *et al.* 2013; Sun, *et al.* 2012; Motaman, *et al.* 2015; Zhao, *et al.* 2009; Zhao, *et al.* 2012; Das, *et al.* 2012; Ren, *et al.* 2012; Tsuchida, *et al.* 2010; Kang, Li, Klein, *et al.* 2014; Kyungmin Kim, *et al.* 2015; Kang, Deng, *et al.* 2014; Chen, *et al.* 2012; Jun-Tae Choi 2016; Kang, Zhao, Klein, *et al.* 2013] could be a preferable, despite increased energy dissipation of some of the approaches. In addition, for robust and reliable designs to reduce write polarization asymmetry, sensing schemes such as [Eken, *et al.* 2014; Zhang, *et al.* 2013; Lee, *et al.* 2015] can be good candidates. Finally, if increasing the yield is the main goal then [Sun, *et al.* 2012; Kim, *et al.* 2012; Kim, *et al.* 2011] techniques can be promising alternatives for conventional sensing schemes. These criteria are also summarized in Table 4 along with the approaches that correspond with each objective.

### REFERENCE

- E. Eken, Y. Zhang, W. Wen, R. Joshi, H. Li, et al., "A Novel Self-Reference Technique for STT-RAM Read and Write Reliability Enhancement," *IEEE Transactions on Magnetics*, vol. 50, pp.1-4, 2014.
- E.K. Au, W.-H. Ki, W.H. Mow, S.T. Hung, and C.Y. Wong, "A novel current-mode sensing scheme for magnetic tunnel junction MRAM," *IEEE Transactions on Magnetics*, vol.40, pp.483-488, 2004.
- H. Tanizaki, T. Tsuji, J. Otani, Y. Yamaguchi, Y. Murai, et al., "A high-density and high-speed 1T-4MTJ MRAM with Voltage Offset Self-Reference Sensing Scheme," In Proceedings of Asian Solid-State Circuits Conference (ASSCC), IEEE, pp.303-306, 2006.
- Y. Zhang, I. Bayram, Y. Wang, H. Li, and Y. Chen, "ADAMS: asymmetric differential STT-RAM cell structure for reliable and high-performance applications," In Proceedings of the International Conference on Computer-Aided Design (ICCAD), IEEE Press, pp.9-16, 2013.
- Z. Sun, H. Li, Y. Chen, and X. Wang, "Voltage driven nondestructive self-reference sensing scheme of spin-transfer torque memory," IEEE Transactions on Very Large Scale Integration (VLSI) Systems, vol.20, pp.2020-2030, 2012.
- S. Motaman, S. Ghosh, and J.P. Kulkarni, "A novel slope detection technique for robust STTRAM sensing," In Proceedings of International Symposium on Low Power Electronics and Design (ISLPED), IEEE/ACM, pp.7-12, 2015.
- J. Kim, K. Ryu, S.H. Kang, and S.-O. Jung, "A novel sensing circuit for deep submicron spin transfer torque MRAM (STT-MRAM)," IEEE Transactions on Very Large Scale Integration (VLSI) Systems, vol.20, pp.181-186, 2012.
- R. Bishnoi, M. Ebrahimi, F. Oboril, and M.B. Tahoori, "Read disturb fault detection in STT-MRAM," In Proceedings of International Test Conference (ITC), IEEE, pp.1-7, 2014.
- R.A. Ashraf, A. Al-Zahrani, N. Khoshavi, R. Zand, S. Salehi, et al., "Reactive rejuvenation of CMOS logic paths using selfactivating voltage domains," In Proceedings of International Symposium on Circuits and Systems (ISCAS), IEEE, pp.2944-2947, 2015.
- W. Zhao, C. Chappert, V. Javerliac, and J.-P. Nozière, "High speed, high stability and low power sensing amplifier for MTJ/CMOS hybrid logic circuits," *IEEE Transactions on Magnetics*, vol.45, pp.3784-3787, 2009.
- W. Zhao, Y. Zhang, T. Devolder, J.-O. Klein, D. Ravelosona, et al., "Failure and reliability analysis of STT-MRAM," *Microelectronics Reliability*, vol.52, pp.1848-1852, 2012.
- J. Das, S.M. Alam, and S. Bhanja, "Non-destructive variability tolerant differential read for non-volatile logic," In Proceedings of 55th International Midwest Symposium on Circuits and Systems (MWSCAS), IEEE, pp.178-181, 2012.
- T.M. Maffitt, J.K. DeBrosse, J. Gabric, and E.T. Gow, "Design considerations for MRAM," *IBM Journal of Research and Development*, vol.50, pp.25, 2006.
- T. Na, J. Kim, J.P. Kim, S.-H. Kang, and S.-O. Jung, "An offset-canceling triple-stage sensing circuit for deep submicrometer STT-RAM," *IEEE Transactions on Very Large Scale Integration (VLSI) Systems*, vol.22, pp.1620-1624, 2014.
- J. Kim, T. Na, J.P. Kim, S.H. Kang, and S.-O. Jung, "A split-path sensing circuit for spin torque transfer MRAM," IEEE Transactions on Circuits and Systems II: Express Briefs, vol.61, pp.193-197, 2014.
- E. Kultursay, M. Kandemir, A. Sivasubramaniam, and O. Mutlu, "Evaluating STT-RAM as an energy-efficient main memory alternative," In Proceedings of International Symposium on Performance Analysis of Systems and Software (ISPASS), IEEE, pp.256-267, 2013.
- X. Chen, N. Khoshavi, J. Zhou, D. Huang, R.F. DeMara, et al., "AOS: adaptive overwrite scheme for energy-efficient MLC STT-RAM cache," In Proceedings of 53rd Design Automation Conference (DAC), ACM, pp.170, 2016.
- J. Kim, K. Ryu, J.P. Kim, S.-H. Kang, and S.-O. Jung, "STT-MRAM sensing circuit with self-body biasing in deep submicron technologies," *IEEE Transactions on Very Large Scale Integration (VLSI) Systems*, vol.22, pp.1630-1634, 2014.
- F. Ren, H. Park, R. Dorrance, Y. Toriyama, C.-K.K. Yang, et al., "A body-voltage-sensing-based short pulse reading circuit for spin-torque transfer RAMs (STT-RAMs)," In Proceedings of 13th International Symposium on Quality Electronic Design (ISQED), IEEE, pp.275-282, 2012.
- R. Zand, A. Roohi, S. Salehi, and R. DeMara, "Scalable Adaptive Spintronic Reconfigurable Logic using Area-Matched MTJ Design," *IEEE Transactions on Circuits and Systems II: Express Briefs*, vol.63, pp.5, 2016.
- A. Roohi, R. Zand, and R. DeMara, "A Tunable Majority Gate based Full Adder using Current-Induced Domain Wall Nanomagnets," *IEEE Transactions on Magnetics*, vol.52, pp.7, 2016.
- S.D. Pyle, H. Li, and R.F. DeMara, "Compact low-power instant store and restore D flip-flop using a self-complementing spintronic device," *Electronics Letters*, 2016.
- K. Shamsi, Y. Bi, Y. Jin, P.-E. Gaillardon, M. Niemier, et al., "Reliable and high performance STT-MRAM architectures based on controllable-polarity devices," In Proceedings of 33rd International Conference on Computer Design (ICCD), IEEE, pp.343-350, 2015.

- K. Tsuchida, T. Inaba, K. Fujita, Y. Ueda, T. Shimizu, et al., "A 64Mb MRAM with Clamped-Reference and Adequate-Reference Schemes," In Proceedings of International Solid-State Circuits Conference Digest of Technical Papers (ISSCC), IEEE, pp.3, 2010.
- W. Kang, Z. Li, Z. Wang, E. Deng, J.-O. Klein, et al., "Variation-tolerant high-reliability sensing scheme for deep submicrometer STT-MRAM," *IEEE Transactions on Magnetics*, vol. 50, pp.1-4, 2014.
- Y. Lakys, W.S. Zhao, T. Devolder, Y. Zhang, J.-O. Klein, et al., "Self-enabled "error-free" switching circuit for spin transfer torque MRAM and logic," *IEEE Transactions on Magnetics*, vol.48, pp.2403-2406, 2012.
- C. Kim, K. Kwon, C. Park, S. Jang, and J. Choi, "A covalent-bonded cross-coupled current-mode sense amplifier for STT-MRAM with 1T1MTJ common source-line structure array," In Proceedings of International Solid-State Circuits Conference-(ISSCC), IEEE, pp.1-3, 2015.
- W. Kang, Z. Li, J.-O. Klein, Y. Chen, Y. Zhang, et al., "Variation-tolerant and disturbance-free sensing circuit for deep nanometer STT-MRAM," *IEEE Transactions on Nanotechnology*, vol.13, pp.1088-1092, 2014.
- D. Halupka, S. Huda, W. Song, A. Sheikholeslami, K. Tsunoda, et al., "Negative-resistance read and write schemes for STT-MRAM in 0.13µm CMOS," In Proceedings of International Solid-State Circuits Conference Digest of Technical Papers (ISSCC), IEEE, pp.256-257, 2010.
- W. Kang, Y. Zhang, Z. Wang, J.-O. Klein, C. Chappert, et al., "Spintronics: Emerging ultra-low-power circuits and systems beyond MOS technology," ACM Journal on Emerging Technologies in Computing Systems (JETC), vol.12, pp.16, 2015.
- X. Fong, Y. Kim, K. Yogendra, D. Fan, A. Sengupta, et al., "Spin-Transfer Torque Devices for Logic and Memory: Prospects and Perspectives," *IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems*, vol.35, pp.1-22, 2016.
- Y. Chen, H.H. Li, X. Wang, W. Zhu, W. Xu, et al., "A nondestructive self-reference scheme for spin-transfer torque random access memory (STT-RAM)," In Design, Automation & Test in Europe Conference & Exhibition (DATE), 2010, IEEE, pp.148-153, 2010.
- K. Kim, and C. Yoo, "Variation-Tolerant Sensing Circuit for Spin-Transfer Torque MRAM," IEEE Transactions on Circuits and Systems II: Express Briefs, vol.62, pp.1134-1138, 2015.
- W. Kang, E. Deng, J.-O. Klein, Y. Zhang, Y. Zhang, et al., "Separated precharge sensing amplifier for deep submicrometer MTJ/CMOS hybrid logic circuits," *IEEE Transactions on Magnetics*, vol.50, pp.1-5, 2014.
- Y. Chen, H. Li, X. Wang, W. Zhu, W. Xu, et al., "A 130 nm 1.2 V/3.3 V 16 Kb spin-transfer torque random access memory with nondestructive self-reference sensing scheme," *IEEE Journal of Solid-State Circuits*, vol.47, pp.560-573, 2012.
- J.P. Kim, T. Kim, W. Hao, H.M. Rao, K. Lee, et al., "A 45nm 1Mb embedded STT-MRAM with design techniques to minimize read-disturbance," In *Proceedings of Symposium on VLSI Circuits-Digest of Technical Papers*, 2011.
- G.-H.K. Jun-Tae Choi, Kyu-Boem Kim, and Yun-Heub Song, "Novel Self-Reference Sense Amplifier for Spin-Transfer-Torque Magneto-Resistive Random Access Memory," JOURNAL OF SEMICONDUCTOR TECHNOLOGY AND SCIENCE, vol.16, pp.8, 2016.
- W. Kang, W. Zhao, J.-O. Klein, Y. Zhang, C. Chappert, et al., "High reliability sensing circuit for deep submicron spin transfer torque magnetic random access memory," *Electronics Letters*, vol.49, pp.1283-1285, 2013.
- L. Yang, Y. Cheng, Y. Wang, H. Yu, W. Zhao, et al., "A body-biasing of readout circuit for STT-RAM with improved thermal reliability," In Proceedings of International Symposium on Circuits and Systems (ISCAS), IEEE, pp.1530-1533, 2015.
- Y. Ran, W. Kang, Y. Zhang, J.-O. Klein, and W. Zhao, "Read disturbance issue for nanoscale STT-MRAM," In Proceedings of Non-Volatile Memory System and Applications Symposium (NVMSA), IEEE, pp.1-6, 2015.
- Y. Emre, C. Yang, K. Sutaria, Y. Cao, and C. Chakrabarti, "Enhancing the reliability of STT-RAM through circuit and system level techniques," In *Proceedings of Workshop on Signal Processing Systems (SiPS)*, IEEE, pp.125-130, 2012.
- F. Alghareb, R. Ashraf, A. Alzahrani, and R. DeMara, "Energy and Delay Tradeoffs of Soft Error Masking for 16nm FinFET Logic Paths: Survey and Impact of Process Variation in Near Threshold Region," *IEEE Transactions on Circuits and* Systems II: Express Briefs, vol.PP, pp.5, 2016.
- D. Chabi, W. Zhao, J.-O. Klein, and C. Chappert, "Design and analysis of radiation hardened sensing circuits for spin transfer torque magnetic memory and logic," *IEEE Transactions on Nuclear Science*, vol.61, pp.3258-3264, 2014.
- J. Yang, P. Wang, Y. Zhang, Y. Cheng, W. Zhao, et al., "Radiation-Induced Soft Error Analysis of STT-MRAM: A Device to Circuit Approach," IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol.35, pp.380-393, 2016.
- W. Zhao, T. Devolder, Y. Lakys, J.-O. Klein, C. Chappert, et al., "Design considerations and strategies for high-reliable STT-MRAM," *Microelectronics Reliability*, vol.51, pp.1454-1458, 2011.
- T. Kawahara, K. Ito, R. Takemura, and H. Ohno, "Spin-transfer torque RAM technology: review and prospect," *Microelectronics Reliability*, vol.52, pp.613-627, 2012.
- J. Li, P. Ndai, A. Goel, S. Salahuddin, and K. Roy, "Design paradigm for robust spin-torque transfer magnetic RAM (STT MRAM) from circuit/architecture perspective," *IEEE Transactions on Very Large Scale Integration (VLSI) Systems*, vol.18, pp.1710-1723, 2010.
- G. Tsiligiannis, L. Dilillo, A. Bosio, P. Girard, A. Todri, et al., "Testing a commercial MRAM under neutron and alpha radiation in dynamic mode," *IEEE Transactions on Nuclear Science*, vol.60, pp.2617-2622, 2013.
- W. Kang, W. Zhao, E. Deng, J.-O. Klein, Y. Cheng, et al., "A radiation hardened hybrid spintronic/CMOS nonvolatile unit using magnetic tunnel junctions," *Journal of Physics D: Applied Physics*, vol.47, pp.405003, 2014.
- H. Lee, J.G. Alzate, R. Dorrance, X.Q. Cai, D. Markovic, et al., "Design of a Fast and Low-Power Sense Amplifier and Writing Circuit for High-Speed MRAM," *IEEE Transactions on Magnetics*, vol.51, pp.1-7, 2015.
- W. Kang, T. Pang, Y. Zhang, D. Ravelosona, and W. Zhao, "Dynamic Reference Sensing Scheme for Deeply Scaled STT-MRAM," In Proceedings of International Memory Workshop (IMW), IEEE, pp.1-4, 2015.
- W. Kang, W. Zhao, Z. Wang, Y. Zhang, J.-O. Klein, et al., "A low-cost built-in error correction circuit design for STT-MRAM reliability improvement," *Microelectronics Reliability*, vol.53, pp.1224-1229, 2013.
- W. Kang, L. Zhang, W. Zhao, J.-O. Klein, Y. Zhang, et al., "Yield and reliability improvement techniques for emerging nonvolatile STT-MRAM," *IEEE Journal on Emerging and Selected Topics in Circuits and Systems*, vol.5, pp.28-39, 2015.

- W. Xu, H. Sun, X. Wang, Y. Chen, and T. Zhang, "Design of last-level on-chip cache using spin-torque transfer RAM (STT RAM)," IEEE Transactions on Very Large Scale Integration (VLSI) Systems, vol.19, pp.483-493, 2011.
- P. Zhou, B. Zhao, J. Yang, and Y. Zhang, "Energy reduction for STT-RAM using early write termination," In Proceedings of the International Conference on Computer-Aided Design (ICCAD), ACM, pp.264-268, 2009.
- X. Wang, Y. Chen, H. Li, D. Dimitrov, and H. Liu, "Spin torque random access memory down to 22 nm technology," *IEEE Transactions on Magnetics*, vol.44, pp.2479-2482, 2008.
- K. Ono, T. Kawahara, R. Takemura, K. Miura, H. Yamamoto, et al., "A disturbance-free read scheme and a compact stochasticspin-dynamics-based MTJ circuit model for Gb-scale SPRAM," In *Proceedings of International Electron Devices Meeting* (*IEDM*), IEEE, pp.1-4, 2009.
- A. Raychowdhury, "Pulsed READ in spin transfer torque (STT) memory bitcell for lower READ disturb," In Proceedings of International Symposium on Nanoscale Architectures (NANOARCH), IEEE/ACM, pp.34-35, 2013.
- R. Takemura, T. Kawahara, K. Ono, K. Miura, H. Matsuoka, et al., "Highly-scalable disruptive reading scheme for Gb-scale SPRAM and beyond," In *Proceedings of International Memory Workshop (IMW)*, IEEE, pp.1-2, 2010.
- G. Jeong, W. Cho, S. Ahn, H. Jeong, G. Koh, et al., "A 0.24-µm 2.0-V 1T1MTJ 16-kb nonvolatile magnetoresistance RAM with self-reference sensing scheme," *IEEE Journal of Solid-State Circuits*, vol.38, pp.1906-1910, 2003.