University of Baghdad College of Engineering



## Journal of Engineering

journal homepage: www.joe.uobaghdad.edu.iq

Volume 29 Number 7 July 2023



## Comparative Reliability Analysis between Horizontal-Vertical-Diagonal Code and Code with Crosstalk Avoidance and Error Correction for NoC Interconnects

Hamdan Kadhim Abed <sup>1,\*</sup>, Wameedh Nazar Flayyih<sup>2</sup>

Department of Computer Engineering, College of Engineering, University of Baghdad, Baghdad, Iraq hamdan.abed2005m@coeng.uobaghdad.edu.iq<sup>1</sup>, wam.nazar@coeng.uobaghdad.edu.iq<sup>2</sup>

#### ABSTRACT

**E**nsuring reliable data transmission in Network on Chip (NoC) is one of the most challenging tasks, especially in noisy environments. As crosstalk, interference, and radiation were increased with manufacturers' increasing tendency to reduce the area, increase the frequencies, and reduce the voltages. So many Error Control Codes (ECC) were proposed with different error detection and correction capacities and various degrees of complexity. Code with Crosstalk Avoidance and Error Correction (CCAEC) for network-on-chip interconnects uses simple parity check bits as the main technique to get high error correction capacity. Per this work, this coding scheme corrects up to 12 random errors, representing a high correction capacity compared with many other code schemes. This candidate has high correction capability but with a high codeword size. In this work, the CCAEC code is compared to another well-known code scheme called Horizontal-Vertical-Diagonal (HVD) error detecting and correcting code through reliability analysis by deriving a new accurate mathematical model for the probability of residual error P<sub>res</sub> for both code schemes and confirming it by simulation results for both schemes. The results showed that the HVD code could correct all single, double, and triple errors and failed to correct only 3.3 % of states of quadric errors. In comparison, the CCAEC code can correct a single error and fails in 1.5%, 7.2%, and 16.4% cases of double, triple, and quadric errors, respectively. As a result, the HVD has better reliability than CCAEC and has lower overhead; making it a promising coding scheme to handle the reliability issues for NoC.

**Keywords:** Error Detection and Correction Codes, Network on Chip Noc, Reliability Analysis, Residual Error Probability.

Article received: 30/12/2022

<sup>\*</sup>Corresponding author

Peer review under the responsibility of University of Baghdad. https://doi.org/10.31026/i.eng.2023.07.08

This is an open access article under the CC BY 4 license (<u>http://creativecommons.org/licenses/by/4.0/)</u>.

Article accepted: 07/04/2023

Article published: 01/07/2023



# تحليل موثوقية مقارن بين ترميز افقي-عمودي-محوري وترميز تجنب الحث المتبادل وتصحيح الخطأ لروابط شبكة الرقاقة

حمدان كاظم عبد1، \*، وميض نزار فليح<sup>2</sup> قسم هندسة الحاسبات، كلية الهندسة، جامعة بغداد، بغداد، العراق

الخلاصة

يعد ضمان نقل البيانات الموثوق به في (NoC) Network on Chip (NoC) أحد أكثر المهام تحديًا ، خاصةً عندما زادت تأثيرات البيئات الصاخبة مثل الحث المتبادل والتداخل والإشعاع مع زيادة اتجاه الشركات المصنعة لتقليل المساحة وزيادة التر ددات وتقليل الفولتية. نتيجة لذلك ، تم اقتراح العديد من رموز التحكم في الأخطاء (ECC) بقدرات مختلفة للكشف عن الأخطاء وتصحيحها الفولتية. نتيجة لذلك ، تم اقتراح العديد من رموز التحكم في الأخطاء (ECC) بقدرات مختلفة للكشف عن الأخطاء وتصحيحها الفولتية. نتيجة لذلك ، تم اقتراح العديد من رموز التحكم في الأخطاء (ECC) بقدرات مختلفة للكشف عن الأخطاء وتصحيحها الورجات مختلفة من التعقيد. على سبيل المثال ، يستخدم الرمز مع تجنب الحث المتبادل وتصحيح الخطأ (CCAEC) للشبكة على ودرجات مختلفة من التعقيد. على سبيل المثال ، يستخدم الرمز مع تجنب الحث المتبادل وتصحيح الخطأ (CCAEC) للمخطط الترميز هذا وفقًا للمؤلفين تصحيح ما يصل إلى 12 خطأ عشوائي والذي يعتبر ذو قدرة تصحيح كبيرة مقارنة بالعديد من مغطط الترميز هذا وفقًا للمؤلفين تصحيح ما يصل إلى 12 خطأ عشوائي والذي يعتبر ذو قدرة تصحيح كبيرة مقارنة بالعديد من مغرز الأخرى. هذا الترميز لديه قدرة تصحيح عالية ولكن مع حجم كلمة مشفرة عالية. في هذا البحث ، تتم مقارنة ترميز (HVD) مع مخطط ترميز آخر معروف يسمى (HVD) الشتقاق نموذج رياضي دقيق جديد لاحتمال الخطأ المتبقي مقارنة ترميز على مع معنوا للا معرفي والذي يتم من خلال اشتقاق نموذج رياضي دقيق جديد لاحتمال الخطأ المتبقي مقارنة ترميز وتكيدها من خلال لتنانج المعاري المتوقية والذي يتم من خلال اشتقاق نموذج رياضي دقيق جديد الحمان الخطأ المتبقي مقارنة ونميز وتأكيدها من خلال نتائج المحاكة لكلا المخطوين. أظهرت النائج أن ترميز المؤالي الميز وينا مع مع الأخلية وفشل في تصحيح 3.5 فقط من حالات الأخطاء الرميز والماء الزميز ويأكيدها من خلال نتائج المحاكة لكلا المخطوين. أظهرت النائج أن ترميز Hvo الميني الخطاء الفردية والم في و5.1% و 5.6% من حالات الأخطاء المزدوجة والثلاثية وفشل في تصحيح 3.5% فقط من حالات الأخطاء الرباعية ، بينما يمكن الترميز الخر تصحيح خلية النائي ولكات الأخطاء المزدوجة والثلاثية وفشل في تصحيح ألاكا المخطاء المزدوجة والثلاثية وفشل في تصحيح ألاحا الأخطاء المزدوجة والثلاثية وعالم من حالات الأخطاء المزدوجة والثلاثية والرل من تصحيح مال

الكلمات الرئيسية: رموز اكتشاف الأخطاء وتصحيحها، الشبكة على الرقاقة، تحليل الموثوقية، احتمالية الخطأ المتبقى.

## **1. INTRODUCTION**

Traditional links, such as the standard bus, lack IP scalability and reusability. Therefore, NoC has been adopted to improve modularity, reliability, and scalability in on-chip communications for multi-core architectures. The requirements for reliable on-chip communication in the current network-on-chip (NoC) have increased concerning increasing the number of node blocks (Giovanni et al., 2002). The NoC comprises the Network Interface (NI), routers, and interconnect links (Mohammed and Flavvih, 2019). These interconnect links are the most affected by the noisy environment, such as crosstalk, radiation, and interference effects (Rahimipour et al., 2012; Rahimipour et al., 2020). In addition to the constraints, Such as shrunk area, increased frequencies, and decreased supply voltage (Murali et al., 2005; Flayyih et al., 2015). Many types of Error Detection And Correction Code (EDAC) techniques have been proposed to face these challenges, representing the most effective approach for supporting reliable on-chip communication. Single Error Detection (SED) is the simplest code using a simple parity check bit technique (Fu and Ampadu, 2012). Single Error Correction and Single Error Detection (SEC-SED) were achieved with hamming code (Singh, 2016). Single Error Correction and Double Error Detection (SEC-DED) were proposed with Extended Hamming code (Murali et al., 2005). Hamming product codes (HPC) (Fu and Ampadu, 2009) can correct up to 5 errors by using

two dimensions of extended hamming in a row and column with type II Hybrid Automatic Repeat Request (HARQ) to reduce transmitted codeword size. Horizontal-Vertical-Diagonal error detecting and correcting code (HVD) uses four direction simple parity bits to detect up to 7 errors or detect up to 4 error bits and correct up to three errors **(Kishani et al., 2011)**. In **(Shamshiri et al., 2011)**, an end-to-end error location-aware correction code is produced for 64-bit data which can correct up to 16 bursts and 2 random errors. For 32-bit data, the 14-bit burst error can be corrected by using Hamming code, forbidden pattern code (FPC), and overlapping, which was proposed in multiple continuous errors correct coding (MCECC) **(Wang, 2011)**.

In Multibit Error Correcting Coding with Reduced Link Bandwidth (MECCRLB) **(Vinodhini and Murty, 2018)**, 25 parity bits are added to the 32-bit input data. MECCRLB can correct an 11-bit random error, a 4-bit burst error, or the 4-bit random error and the built-in burst error. But, it is proved by **(Asaad et al., 2020)** that it can correct only one error.

After proposing some techniques for crosstalk avoidance, such as shielding (Kose et al., 2010), adding buffers or repeaters (Zangeneh and Masoumi, 2010), duplication (Ganguly et al., 2009), and Crosstalk Avoidance Code (CAC) (Shirmohammadi and Miremadi, 2017). Many EDAC techniques were combined with crosstalk avoidance techniques to enhance the reliability of NoC, where in Joint crosstalk avoidance and Triple Error Correction (JTEC) (Ganguly et al., 2009), a coding technique (77, 32), Hamming code and duplication are combined to correct 3 bits of error. This code is further enhanced for Triple Error Correction and Quadruple Error Detection (JTEC-SQED). The Hsiao code is used with triplication, giving a 117-bit codeword for 32-bit input data, 5-bit errors can be corrected or 6-bit errors detected upon decoding (Vinodhini et al., 2015). In Duplicated Two-Dimensional Parities (DTDP) (Flayyih et al., 2014), two-dimensional parity bits with codeword duplicated are used for 7-bit error detection. When the decoder detects the error, the receiver sends a request signal to the sender for retransmission. Later, this code was enhanced to correct a single error and detect 6 errors (Flayyih et al., 2020). In Joint Crosstalk Aware Multiple Error Correction (JMEC) (Gul, 2017), 32-bit data duplication and interleaving results in a 104-bit codeword that can correct up to 10 random errors or 9 burst errors. Finally, Code with Crosstalk Avoidance and Error Correction (CCAEC) code (Lakshmi et al., 2020) with an interesting error correction capacity of up to 12 errors was proposed by using two dimensions one of them is a horizontal simple parity vector where each parity bit is produced from one row of the input data block and another vertical parity check bits that are produced in a two-step. In the first step, the mask check bits for each row are calculated. In the second step, the vertical bits from the generated masked check bits are calculated. After duplication, the 104-bit codeword length is generated. According to the above techniques, error correction capacity is lower in techniques not joined with crosstalk avoidance. Although combining crosstalk avoidance with EDAC codes increased the errorcorrecting capacity, there was an increase in the bit overhead which led to an increase in the power consumption of the link.

This paper analyzes the reliability of the HVD code and CCAEC code by deriving a new accurate mathematical model for the probability of error residual for both coding schemes. This analysis compares the coding mentioned above schemes to evaluate the best scheme used in the NoC. Also, the simulation results are calculated using the Verilog code by Modelsim program to confirm the derived model.



## 2. METHODOLOGY

To analyze the HVD and CCAEC codes, we need first to clarify the mechanism of the encoder and decoder for each technique which can be introduced as follows:

## 2.1 Horizontal-Vertical-Diagonal Error Detecting and Correcting Code HVD.

The HVD code **(Kishani et al., 2011)** is a Hybrid Automatic Repeat Request HARQ that corrects specific errors and detects others without correcting according to the capability of the algorithm code scheme.

On the encoder side, the input data (M) is arranged in a matrix of (m x n) where m and n represent several rows and columns of the data block, respectively. From this matrix, four sets of parity check bits are derived, namely horizontal (H), vertical (V), slash diagonal (D), and backslash diagonal (D'), as shown in **Fig. 1** for 64-bit input data. In addition, a parity check bit is added for each of the four parity check vectors. The encoder algorithm produces a 114-bit codeword.



Figure 1. The structure of the HVD method (Rahman et al., 2015).

The decoder algorithm depends on the syndrome of parity check bits which are commonly used in many coding schemes such as BKLC, BCH, Golay, and Hamming codes **(Ahmed and Al-Hindawi, 2023)** to detect or correct errors that may affect the transmitted data. If all syndrome of check bits is equal to zeros, this means either no error or an undetectable state. In contrast, if not equal to zeros, the algorithm detects up to 7 errors, or corrects up to 3 errors and detects up to 4 errors depending on the intersection among parity check bits, as shown in the algorithm in **Fig. 2**.



Figure 2. Decoder algorithm of HVD code.

## 2.2 Code with Crosstalk Avoidance and Error Correction for NoC (CCAEC).

In the CCAEC code **(Lakshmi et al., 2020)**, the input data for M bits are arranged in an  $m \times n$  matrix where n = 4 and m = M/n. Here m and n are the numbers of rows and columns, respectively. The number of columns stays constant for any size of input data. Horizontal and vertical parity check bits are coded for each row. The number of horizontal parity bits (H) equals m, and the number of vertical parity bits (V) is  $3 \times m/2$ . For example, the 32-bit input data is arranged as  $8 \times 4$  with 8 rows and 4 columns. The number of horizontal and

vertical parity bits is 8 and 12, respectively. Horizontal parity check bits are obtained directly for each row. Also, the masked parity bit is encoded from each row using adaptive hamming code. These parity bits are known as masked parity because they are not added to the codeword nor transmitted. Vertical parity bits are derived from these masked parity bits. Finally, the codeword consists of only data bits, horizontal parity bits, and vertical parity bits, which the sender transmits. Masked parity J bits are obtained as shown in **Fig. 3** from the following equations **(Lakshmi et al., 2020)**:

$$J_i = M_i \oplus M_{i+24} \tag{1}$$

$$I_{i+1} = M_{i+8} \oplus M_{i+24} \tag{2}$$

$$J_{i+2} = M_{i+16} \oplus M_{i+24}$$
(3)

Where  $\oplus$  is XOR logic operation, *i*=3*n* and *n*=0,1,2,...,7.

The vertical parity bits are obtained by:

$$V_{Cj} = J_j \oplus J_{j+12} \tag{4}$$

Where *j* = 0, 1, ...,11.

| Data bits      |                |                 |                 | Horizontal<br>Check Bits | Masked Check Bits |                 |                 |                   |
|----------------|----------------|-----------------|-----------------|--------------------------|-------------------|-----------------|-----------------|-------------------|
| Mo             | M <sub>8</sub> | M16             | M <sub>24</sub> | HC <sub>0</sub>          | Jo                | J1              | J <sub>2</sub>  | 1                 |
| M1             | M9             | M17             | M <sub>25</sub> | HC1                      | J3                | J4              | Js              | 1                 |
| M <sub>2</sub> | M10            | M18             | M <sub>26</sub> | HC <sub>2</sub>          | J6                | J7              | J <sub>8</sub>  | ]                 |
| Мз             | M11            | M19             | M <sub>27</sub> | HC₃                      | Jg                | J10             | J <sub>11</sub> |                   |
| M4             | M12            | M <sub>20</sub> | M <sub>28</sub> | HC <sub>4</sub>          | J <sub>12</sub>   | J <sub>13</sub> | J <sub>14</sub> | ]                 |
| M <sub>5</sub> | M13            | M21             | M <sub>29</sub> | HC₅                      | J <sub>15</sub>   | J <sub>16</sub> | J <sub>17</sub> | ]                 |
| M <sub>6</sub> | M14            | M22             | M30             | HC <sub>6</sub>          | J <sub>18</sub>   | J19             | J <sub>20</sub> | ]                 |
| M <sub>7</sub> | M15            | M <sub>23</sub> | M31             | HC <sub>7</sub>          | J <sub>21</sub>   | J22             | J <sub>23</sub> |                   |
|                |                |                 |                 |                          | VC <sub>0</sub>   | VC1             | VC <sub>2</sub> |                   |
|                |                |                 |                 |                          | VC <sub>3</sub>   | VC4             | VC <sub>5</sub> | Vertical<br>Check |
|                |                |                 |                 |                          | VC <sub>6</sub>   | VC7             | VC <sub>8</sub> | Bits              |
|                |                |                 |                 |                          | VC <sub>9</sub>   | VC10            | VC11            |                   |

**Figure 3.** Matrix (8 × 4) used to calculate check and masked check bits for 32-bit Input data **(Lakshmi et al., 2020)**.

Hence, a codeword consisting of 52 bits consisting of 20 parity bits was added to the 32-bit input data, as shown in **Fig. 3.** Finally, to enhance crosstalk avoidance, the codeword was duplicated to become 104 bits before transmitting it. The decoder algorithm for this code is shown in **Fig. 4.** First, the received data is separated into two blocks. Calculating the syndrome for each type of parity check bit, comparing the syndrome of horizontal check bits to choose the copy with fewer errors, and then determining whether these errors are correct depends on the syndrome for both horizontal and vertical check bits.





#### **3. RELIABILITY ANALYSIS**

One of the metrics to determine reliability is calculating the probability of residual error ( $P_{res}$ ), which measures the reliability of the NoC of any EDAC technique **(Flayyih et al.,** 

**2013).**  $P_{res}$  is the probability of finding an error(s) in the received flit after completing the decoding process. That means the flit has an undetectable state which is out of the capability of the code scheme decoding algorithm, and this can occur in the first transmission or after retransmitting one or more times (Yu, 2009). In the following subsection, both HVD and CCAEC codes will be evaluated according to the calculation of the P<sub>res</sub> for random errors only because the burst errors are acceptable for both codes:

#### A. HVD Code.

Because it is a HARQ technique, this code type depends on correcting some errors and detecting others without correcting them according to the capability of scheme code; therefore, the  $P_{res}$  is given as in **(Flayyih et al., 2014)**:

 $P_{res} = P_{und} + P_{und} \times P_{ret} + P_{und} \times P_{ret}^2 + \dots + P_{und} \times P_{ret}^n$  (5) The  $P_{ret}$  is the probability of retransmission, and  $P_{und}$  is the probability of undetectable errors in the decoder. Eq. (5) can be simplified using geometric series reduction as:

$$P_{res} = \frac{P_{und}}{1 - P_{ret}} \tag{6}$$

where *P*<sub>und</sub> for HVD is derived using Eq. (7) by summating undetectable cases, which are very few cases the HVD decoder cannot detect. Where in the first case, when two of eight errors have happened in any one direction of HVD message bits matrix, row, column, slash, or back slash direction, which is represented in the first, second, and third term of Eq. (7), respectively, and other six errors are in parity check bits related to former two bits that make syndromes of them equal to zero as shown **Fig. 5 (a)**, and **(b)**. The second undetectable case, as shown in the fourth term of Eq. (7), is when four errors, each two of them are located in a different row in data bits. The other four bits have happened in parity check bits related to the former four errors to make all syndromes equal to zeros, as shown in **Fig. 5 (c)**. The other cases, such as shown in **Fig. 5 (d)**, where all eight errors occurring in message bits are neglected since there is little probability and to avoid complexity.

$$P_{\text{und}} = \left[\binom{n}{2}\binom{m}{1} + \binom{m}{2}\binom{n}{1} + \sum_{t=2}^{n-1}\binom{t}{2} \times 4 + \binom{n}{4}\binom{m/2}{1}\right] \varepsilon^{td+1}$$
In general,
$$(7)$$

$$\binom{y}{x} = \frac{y!}{x!(y-x)!} \tag{8}$$

Eq. (8) is a general mathematical form to calculate the possible combinations of x elements from a set of y, where x and y are any two integer numbers and  $t_d$  (for HVD  $t_d$  = 7) is the maximum error detection capacity. Such the HARQ technique can correct some errors and detect others according to the scope of code scheme capacity. Assuming that code can correct up to  $t_c$  and can detect up to  $t_d$  errors, then  $P_{ret}$  can be written as:

 $P_{ret} = \sum_{tc+1}^{td} P_{i\text{-}error}$ (9)  $P_{i\text{-}error} \text{ is the probability that an L-bit codeword (where L for HVD equal to m×n+H+V+D+D'+4 = 69 bits for 32-bit input message) has$ *i* $errors, and t<sub>c</sub> (for HVD t<sub>c</sub> = 3) is the maximum error correction capacity. Thus, <math>P_{i\text{-}error}$  is given by (Flayyih et al., 2018):



(10)

 $P_{i\text{-}error} = {L \choose i} \varepsilon^i (1 - \varepsilon)^{L-i}$ 

For small  $\varepsilon$ , the probability of (t<sub>c</sub> + 1) errors dominates, and the Eq. (9) can be rounded to:

 $P_{\rm ret} = {L \choose tc+1} \varepsilon^{tc+1}$ 

(11) Finally, by substituting Eq. (7) and Eq. (11) in Eq. (6), Pres can be found for the HVD code easily.



Figure 5. Samples of undetectable cases of 8 errors for HVD 32 message bits decoder (a) Double errors in one row (b) Double errors in one Diagonal (c) Quadruple of errors in data bits (d) Eight errors in data bits.



#### **B. CCAEC Code.**

The probability of residual error depends on the probability of retransmission of detected error  $P_{ret}$  and the probability of correction error concerning the technique type and capability of scheme code; the general form is given by **(Fu and Ampadu, 2009)** as:

$$P_{res} = P_{und} + P(e_{decoding}, e_{detecting})$$
(12)

P( $e_{decoding}$ ,  $e_{detecting}$ ) is the Probability of error after the retransmission and decoding process is completed. Since CCAEC is a Forward Error Correction (FEC) technique capable of correcting all errors detected, resulting in  $P_{ret} = 0$  and Probability of uncorrectable error  $(P_{unc})=P_{und}$ . So Eq. (5) and Eq. (12) become:

$$P_{res} = P_{unc} \tag{13}$$

Based on **(Lakshmi et al., 2020)**, the CCAEC code can correct up to 12 random errors for 32 bits of input data, and P<sub>res</sub> was given as:

$$P_{\rm res} = \begin{pmatrix} 2(k+m+\left(\frac{m+3}{2}\right))\\ 13 \end{pmatrix} \varepsilon^{13}$$
(14)

But this equation is inaccurate because the CCAEC decoding algorithm cannot correct all 12 random errors. We can prove that based on the minimum hamming distance  $d_{min}$ , which is the minimum number of bits that can be changed to jump from a valid codeword to another valid one. The  $d_{min}$  is used to determine the maximum detection and correction capacity for any linear coding scheme by using equation ( $(d_{min}-1)/2$ ). For the CCAEC code, the  $d_{min} = (d_{min}$  for horizontal simple parity check vector ×  $d_{min}$  for vertical simple parity check vector) (Asaad et al., 2020) produce as  $d_{min} = 2 \times 2 = 4$ . So, the maximum correcting capability for the CCAEC code is (4-1)/2 = 1. As well as for this theoretical limitation and based on the CCAEC algorithm (Lakshmi et al., 2020), Fig. 6 shows some cases of two, three, and four errors in which the CCAEC code fails to correct them.

So the new accurate estimated model was derived based on the decoding algorithm, as given in Fig. 4. After duplicated codeword is received, the decoder separates it into two copies. And then, select the copy with the least number of 1's in its syndrome horizontal check bits and when all syndrome of any of two copies equals zero. On the other side, if both copies are equal in the number of 1's, the decoder will select any of them as a default copy. To simplify, the two copies will be denoted as copies A and B; when both are equal in the number of 1's, copy A will be considered a default copy. As a result, Eq.(16) through Eq.(18) represent the uncorrectable error probability of CCAEC code where Eq.(16) expresses undetectable double error probability ( $P_{unc2}$ ) when two errors are located in one row in copy A as shown in **Fig. 6 (a)**. The following equation represents three undetectable errors probability (*P*<sub>unc3</sub>). In the first term, two errors are located in the same row in the default copy, and the third error is anywhere in the duplicated codeword except the message bits and horizontal check bits of copy A. The second term is when two errors are in one row in copy B, and the third error is in message bits and horizontal check bits of copy A. In the third term, when one error occurs in copy A, specifically in the last bit of any row in the message bits or in the horizontal check bit, the second error is in the vertical check bit which is related to the first error in

copy A, and the third error happens anywhere in message bits or horizontal check bits of copy B.



Figure 6. Samples of fail cases for (a) CCAEC decoder, (b) Double errors, (c) Triple errors, and (d) Quadruple errors.

In the last term, one error locates any row's first three bits. The second error is in the vertical check bit, related to the former first error in copy A. The third error happens anywhere in the message bits and horizontal check bits of copy B, examples of three undetectable errors, as shown in **Fig. 6 (b) and(c)**.

The last equation is related to four uncorrectable errors probability ( $P_{unc4}$ ). The first case is when two errors are in the same row in the message bits of the default copy, and the other two errors are located anywhere in the codeword except the message bits of the default copy. The second term is when two errors are in the same row in the message bits of the default copy, and the other two errors, one of them is located anywhere in the message bits or horizontal check bits of copy A, and the other error is anywhere in the message bits or horizontal check bits of copy B. Finally, the last term is if two errors are in the same row in the message bits or horizontal check bits of copy B. Finally, the last term is of copy A and the other error are in the same row in the message bits. Horizontal check bits of copy A and the other error are anywhere in the codeword except the message bits and horizontal check bits of copy B, as shown in **Fig. 6 (d)**. However, some cases are ignored, especially in the four error case, because it has a very small error bit rate and also to avoid the complexity of the equation. Finally, The P<sub>res</sub> for the upper bound of CCAEC can be written as:

$$P_{res} = \sum_{i=2}^{4} P_{unci} \tag{15}$$

Where *i* is given from *i*=2 up to 4 errors.

Where Punc can express by:

$$P_{unc2} = \binom{(m+1)}{2} \binom{n}{1} \varepsilon^2$$
(16)

$$P_{\text{unc3}} = \begin{bmatrix} \binom{(m+1)}{2} \binom{n}{1} \binom{n \times (m+1) + 2 \times vr \times vc}{1} + \binom{(m+1)}{2} \binom{n}{1} \binom{n \times (m+1)}{1} + \binom{vc}{1} \binom{2}{1} \binom{n}{1} \binom{n \times (m+1)}{1} + \binom{(vc)}{1} \binom{n}{1} \binom{n \times (m+1)}{1} \end{bmatrix} \varepsilon^{3}$$
(17)

$$P_{\text{unc4}} = \left[\binom{(m+1)}{2}\binom{n \times (m+1) + 2 \times vr \times vc}{1}\binom{n \times (m+1) + 2 \times vr \times vc}{1}\binom{n}{1} + \binom{(m+1)}{2}\binom{n \times (m+1)}{1}\binom{n \times (m+1)}{1}\binom{n}{1} + \binom{(m+1)}{2}\binom{n \times (m+1) + 2 \times vr \times vc}{1}\binom{n}{1}\right]\varepsilon^{4}$$
(18)

Where  $v_r$  and  $v_c$  are the number of rows and columns for the matrix of vertical parity check bits, respectively.

By substituting Eq.(16), Eq.(17), and Eq. (18) in Eq. (15), the P<sub>res</sub> of CCAEC can be found.

#### **4. RESULTS AND DISCUSSION**

For more analysis, simulation results were done by Verilog code under the Modelsim program for both code schemes, as shown in **Table 1.** Where 10<sup>6</sup> random samples of 32-bit input data are injected into both code schemes with different numbers of errors, and the failure percentage is given **(Asaad et al., 2020)**:

Failure percentage =  $\frac{\text{the number of uncorrected samples}}{\text{the total number of samples}} \times 100\%$  (19)

Injected samples are fed with one, two, three, four, and five errors randomly located in the transmitted codeword. As shown in **Table 1**, it is clear that for a single error, both schemes can correct all samples with a single error. Then, for samples with double and triple random errors, the HVD code can correct all of them, while the CCAEC code fails to correct 1.5% and 7.2% for double and triple errors, respectively. Similarly, the HVD code could not correct 3.3% of quadruple errors, and the other code could not correct 16.4% of the total samples. Since the HVD code is HARQ technique, the undetectable error probability  $P_{und}$  must be found because its important in  $P_{res}$  calculation where it appears in case of 8 errors, and its value was  $0.4 \times 10^{-7}$  in simulation results.

| Numbers of | HVD                    | CCAEC                  |  |  |
|------------|------------------------|------------------------|--|--|
| errors     | Failure Percentage (%) | Failure Percentage (%) |  |  |
| 1          | 0                      | 0                      |  |  |
| 2          | 0                      | 1.5                    |  |  |
| 3          | 0                      | 7.2                    |  |  |
| 4          | 3.3                    | 16.4                   |  |  |

**Table 1.** Simulation results.

**Fig. 7**. represents the simulation and estimation of the probability of residual error with respect to different values of bit error rate where the simulation is done by Verilog language under Modelsim program for both scheme codes and injected 10<sup>7</sup> samples of 32 data bits for each number of errors located in random position for each scheme algorithm and found failure rate for each of them and then multiply each of these rates by error bit rate with the power of the number of errors then make summation for them to find simulated P<sub>res</sub>. And we can observe very little difference between estimation and simulation results, which confirms the derived model's validity.



**Bit Error Rate** 

Figure 7.  $P_{res}$  with respect to the bit error rate.

As well as it is clear to notice as given in **Table 2.** The bit overhead for CCAEC is more than the HVD code due to duplicating the codeword to reduce the crosstalk effect. Also, as shown in the same table, the code rate of HVD is higher than CCAEC. According to the reliability analysis results and both of the last two previous features and correct on capacity, the HVD coding is considered a better choice than the CCAEC code.

Finally, when taking the same value of P<sub>res</sub> for both scheme codes with respect to the error bit rate, the HVD refers to an error bit rate more than CCAEC, which is proportional to voltage swing that leads to a reduction in the power consumption in HVD as according to Gaussian noise model which is discussed in more details in many research papers **(Rahimipour et al., 2020; Asaad et al., 2020)**. In **Table.2**, bit overhead and code rate was calculated, and it is clear that HVD has values better than CCAEC code. These rates affect the area and power consumption of the Network on the chip.

| Method | Input data | Redundant | Codeword      | Bit Overhead | Code Rate |
|--------|------------|-----------|---------------|--------------|-----------|
|        | (bits)     | (bits)    | length (bits) | (%)          | (%)       |
| HVD    | 32         | 37        | 69            | 115.63       | 45.71     |
| CCAEC  | 32         | 20        | 104           | 125          | 30.77     |
|        |            |           | (duplicated)  |              |           |

Table 2. Bit overhead and code rate results.

## **5. CONCLUSIONS**

In this paper, a new accurate mathematical model for the probability of residual error of HVD and CCAED codes was derived and used to analyze the reliability of these two codes. After comparing the reliability analysis results of the HVD and CCAEC codes, it was found that the CCAEC results of the new estimation model are very close to the simulation results, which confirms the newly derived model. This confirms the inaccuracy of the old estimation results and invalidates the claim that it can correct 12 errors as it fails to correct some patterns of two errors. The HVD method was found to have higher reliability than CCAEC due to correction and detection capacity, where HVD can correct all messages with two and three errors and 96.7 % of total messages with four errors. In contrast, the CCAEC code can correct 98.5%, 92.8%, and 83.6% of all messages with two, three, and four errors, respectively. As a result of the analysis of the results mentioned above, the CCAEC code has high power consumption due to its high voltage swing compared with the HVD code. Finally, the HVD code remains a more reliable code and still an efficient code to handle the reliability issues for NoC. It can improve its performance by using a simple crosstalk avoidance method, such as increasing link spacing to equal other codes in this important feature.

## NOMENCLATURE

 $d_{min}$ = minimum hamming distance.  $P_{res}$  = probability of residual error.  $P_{und}$  = probability of undetectable error.



#### REFERENCES

Ahmed, S.A., and Al-Hindawi, A.M.J., 2023. Double-Staged Syndrome Coding Scheme for Improving Information Transmission Security over the Wiretap Channel. Journal of Engineering, 29(2), pp.112-136. Doi: 10.31026/j.eng.2023.02.08

Asaad K. Chlaab, Flayyih, W.N., and Rokhani, F.Z., 2020. Lightweight hamming product code based multiple bit error correction coding scheme using shared resources for on chip interconnects. *Bulletin of Electrical Engineering and Informatics*, *9*(5), pp.1979-1989. Doi: 10.11591/eei.v9i5.1876.

Asaad K. Chlaab, Flayyih, W.N., and Rokhani, F.Z., 2020. Reliability Analysis of Multibit Error Correcting Coding and Comparison to Hamming Product Code for On-Chip Interconnect. *Journal of Engineering*, *26*(6), pp.94-106. Doi:10.31026/j.eng.2020.06.08.

Flayyih, W.N., 2018. Crosstalk aware multi-bit error detection with limited error correction coding for reliable on-chip communication. *International Journal of Computer Applications*, *179*(40), Doi: 10.5120/ijca2018916934.

Flayyih, W.N., Samsudin, K., Hashim, S.J., Ismail, Y.I., and Rokhani, F.Z., 2015. Adaptive multibit crosstalk-aware error control coding scheme for on-chip communication. *IEEE Transactions on Circuits and Systems II: Express Briefs*, 63(2), pp.166-170. Doi: 10.1109/TCSII.2015.2483379.

Flayyih, W.N., Samsudin, K., Hashim, S.J., Ismail, Y., and Rokhani, F.Z., 2020. Multi-bit error control coding with limited correction for high-performance and energy-efficient network on chip. *IET Circuits, Devices & Systems*, *14*(1), pp.7-16, Doi: 10.1049/iet-cds.2018.5282.

Flayyih, W.N., Samsudin, K., Hashim, S.J., Rokhani, F.Z., and Ismail, Y.I., 2013, September. Improved undetected error probability model for JTEC and JTEC-SQED coding schemes. In *2013 IEEE International Conference on Circuits and Systems (ICCAS)* (pp. 27-32). IEEE, Doi: 10.1109/CircuitsAndSystems.2013.6671640.

Flayyih, W.N., Samsudin, K., Hashim, S.J., Rokhani, F.Z., and Ismail, Y.I., 2014. Crosstalk-aware multiple error detection scheme based on two-dimensional parities for energy efficient network on chip. *IEEE Transactions on Circuits and Systems I: Regular Papers*, *61*(7), pp. 2034-2047. Doi: 10.1109/TCSI.2013.2295952.

Fu, B., and Ampadu, P., 2009. On hamming product codes with type-II hybrid ARQ for on-chip interconnects. *IEEE Transactions on Circuits and Systems I: Regular Papers*, *56*(9), pp. 2042-2054. Doi: 10.1109/TCSI.2009.2026679.

Fu, B., and Ampadu, P., 2012. Error control for network-on-chip links. *Springer*. Doi: 10.1007/978-1-4419-9313-7.

Ganguly, A., Pande, P.P., and Belzer, B., 2009. Crosstalk-aware channel coding schemes for energy efficient and reliable NOC interconnects. *IEEE Transactions on Very Large Scale Integration (VLSI) Systems*, *17*(11), pp. 1626-1639. Doi: 10.1109/TVLSI.2008.2005722.

Giovanni D., Benini, L., and Micheli, 2002. Networks on chips: A new soc paradigm. *IEEE Computer*, *35*(1), pp. 70-78. Doi: 10.1109/2.976921.

Gul, M., Chouikha, M., and Wade, M., 2017. Joint crosstalk aware burst error fault tolerance mechanism for reliable on-chip communication. *IEEE Transactions on Emerging Topics in Computing*, *8*(4), pp. 889-896. Doi: 10.1109/TETC.2017.2787549.

Kishani, M., Zarandi, H.R., Pedram, H., Tajary, A., Raji, M., and Ghavami, B., 2011. HVD: horizontal-vertical-diagonal error detecting and correcting code to protect against with soft errors. *Design Automation for Embedded Systems*, *15*, pp. 289-310, Doi: 10.1007/s10617-011-9078-2.

Kose, S., Salman, E., and Friedman, E.G., 2010. Shielding methodologies in the presence of power/ground noise. *IEEE Transactions on Very Large Scale Integration (VLSI) Systems*, *19*(8), pp. 1458-1468, Doi: 10.1109/TVLSI.2010.2054119.

Lakshmi, K.A.S.S., Keerthi, A.M., Sri, K.M., and Vinodhini, M., 2020, June. Code with crosstalk avoidance and error correction for network on chip interconnects. In *2020 4th International Conference on Trends in Electronics and Informatics (ICOEI)(48184)* (pp. 75-79). IEEE. Doi: 10.1109/ICOEI48184.2020.9143002.

Mohammed, H. J., Flayyih, W.N., and Rokhani, F.Z., 2019. Tolerating permanent faults in the input port of the network on chip router. *Journal of Low Power Electronics and Applications*, 9(1), P. 11. Doi: 10.3390/jlpea9010011.

Murali, S., Theocharides, T., Vijaykrishnan, N., Irwin, M.J., Benini, L., and De Micheli, G., 2005. Analysis of error recovery schemes for networks on chips. *IEEE Design & Test of Computers*, *22*(5), pp. 434-442. Doi: 10.1109/MDT.2005.104.

Rahimipour, S., Flayyih, W.N., El-Azhary, I., Shafie, S., and Rokhani, F.Z., 2012, October. A survey of onchip monitors. In *2012 IEEE International Conference on Circuits and Systems (ICCAS)* (pp. 243-248). IEEE. Doi: 10.1109/ICCircuitsAndSystems.2012.6408286.

Rahimipour, S., Flayyih, W.N., Kamsani, N.A., Hashim, S.J., Stan, M.R., and Rokhani, F.Z.B., 2020. Low-power, highly reliable dynamic thermal management by exploiting approximate computing. *IEEE Transactions on Very Large Scale Integration (VLSI) Systems*, *28*(10), pp. 2210-2222, Doi: 10.1109/TVLSI.2020.3012626.

Rahimipour, S., Flayyih, W.N., Kamsani, N.A., Stan, M., and Rohani, F.Z., 2015, September. Investigating the impact of on-chip interconnection noise on dynamic thermal management efficiency. In *2015 IEEE International Circuits and Systems Symposium (ICSyS)* (pp. 85-89). IEEE. Doi: 10.1109/CircuitsAndSystems.2015.7394070.

Rahman, M.S., Sadi, M.S., Ahamed, S., and Jurjens, J., 2015, May. Soft error tolerance using horizontalvertical-double-bit diagonal parity method. In *2015 International Conference on Electrical Engineering and Information Communication Technology (ICEEICT)* (pp. 1-6). IEEE, Doi: 10.1109/ICEEICT.2015.7307411.

Singh, A.K., 2016, December. Error detection and correction by hamming code. In *2016 International Conference on Global Trends in Signal Processing, Information Computing and Communication (ICGTSPICC)* (pp. 35-37). IEEE. Doi: 10.1109/ICGTSPICC.2016.7955265.

Shamshiri, S., Ghofrani, A., and Cheng, K.T., 2011, September. End-to-end error correction and online diagnosis for on-chip networks. In *2011 IEEE International Test Conference* (pp. 1-10). IEEE. Doi: 10.1109/TEST.2011.6139156.

Shirmohammadi, Z., and Miremadi, S.G., 2017, October. SDT-free: An efficient crosstalk avoidance coding mechanism considering inductance effects. In *2017 7th International Conference on Computer and Knowledge Engineering (ICCKE)* (pp. 293-297). IEEE, Doi: 10.1109/ICCKE.2017.8167894.

Vinodhini, M., and Murty, N.S., 2018. Reliable low power NoC interconnect. *Microprocessors and Microsystems*, *57*, pp. 15-22, Doi: 10.1016/j.micpro.2017.12.008.

Vinodhini, M., Lillygrace, K., and Murty, N.S., 2015, December. A fault tolerant NoC architecture with runtime adaptive double layer error control and crosstalk avoidance. In *2015 IEEE International Conference on Computational Intelligence and Computing Research (ICCIC)* (pp. 1-6). IEEE, Doi: 10.1109/ICCIC.2015.7435690.

Wang, B., Xie, J., Mao, Z., and Wang, Q., 2011, October. Multiple continuous error correct code for high performance network-on-chip. In *2011 Asia Pacific Conference on Postgraduate Research in Microelectronics & Electronics* (pp. 98-101), IEEE. Doi: 10.1109/PrimeAsia.2011.6075080.

Yu, Q., and Ampadu, P., 2009. Adaptive error control for nanometer scale network-on-chip links. *IET computers & digital techniques*, *3*(6), pp. 643-659. Doi: 10.1049/iet-cdt.2008.0132.

Zangeneh, M., and Masoumi, N., 2010, January. Throughput optimization for interleaved repeaterinserted interconnects in VLSI design. In *2010 3rd International Nanoelectronics Conference (INEC)* (pp. 1443-1444). IEEE. Doi: 10.1109/INEC.2010.5424802.