

# **G** OPEN ACCESS

**Citation:** Jiexian H, Khizar Y, Ali ZA, Hasan R, Pathan MS (2023) On the dynamic reconfigurable implementations of MISTY1 and KASUMI block ciphers. PLoS ONE 18(9): e0291429. [https://doi.](https://doi.org/10.1371/journal.pone.0291429) [org/10.1371/journal.pone.0291429](https://doi.org/10.1371/journal.pone.0291429)

**Editor:** Je Sen Teh, Universiti Sains Malaysia, MALAYSIA

**Received:** December 12, 2022

**Accepted:** August 29, 2023

**Published:** September 28, 2023

**Peer Review History:** PLOS recognizes the benefits of transparency in the peer review process; therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. The editorial history of this article is available here: <https://doi.org/10.1371/journal.pone.0291429>

**Copyright:** © 2023 Jiexian et al. This is an open access article distributed under the terms of the Creative Commons [Attribution](http://creativecommons.org/licenses/by/4.0/) License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

**Data Availability Statement:** All relevant data are within the paper.

**Funding:** This study was funded by Jiaying University, Minghai Industrial Robot Production RESEARCH ARTICLE

# On the dynamic reconfigurable implementations of MISTY1 and KASUMI block ciphers

#### **Huang Jiexian1 , Yasir Khizar2 , Zain Anwar Al[iID](https://orcid.org/0000-0002-2143-2879)1,3\*, Raza Hasan[ID4](https://orcid.org/0000-0002-8089-837X) , Muhammad Salman Pathan**<sup>5</sup>

**1** School of Physics and Electronic Engineering, JiaYing University, Meizhou, Guangdong, China, **2** School of Electronic Information, Nanjing University of Aeronautics & Astronautics, Nanjing, Jiangsu, China, **3** Electronic Engineering Department, Sir Syed University of Engineering & Technology, Karachi, Sindh, Pakistan, **4** Department of Science and Engineering, Solent University, Southampton, United Kingdom,

**5** Computer Science Department, Maynooth University, Co. Kildare, Ireland

\* zainanwar86@hotmail.com

# Abstract

Novel hardware architectures for dynamic reconfigurable implementation of 64-bit MISTY1 and KASUMI block ciphers are proposed to enhance the performance of cryptographic chips for secure IoT applications. The SRL32 primitive (Reconfigurable Look up Tables— RLUTs) and DPR (Dynamic Partial Reconfiguration) are employed to reconfigure single round MISTY1 / KASUMI algorithms on the run-time. The RLUT based architecture attains dynamic logic functionality without extra hardware resources by internally modifying the LUT contents. The proposed adaptive reconfiguration can be adopted as a productive countermeasure against malicious attacks with the added advantage of less reconfiguration time (RT). On the other hand, the block architecture reconfigures the core hardware by externally uploading the partial bit stream and has significant advantages in terms of low area implementation and power reduction. Implementation was carried out on FPGA, Xilinx Virtex 7. The results showed remarkable results with very low area of 668 / 514 CLB slices consuming 460 / 354 mW for RLUT and DPR architectures respectively. Moreover, the throughput obtained for RLUT architecture was found as 364 Mbps with very less RT of 445 nsec while DPR architecture achieved speed of 176 Mbps with RT of 1.1 msec. The novel architectures outperform the stand-alone existing hardware designs of MISTY1 and KASUMI implementations by adding the dynamic reconfigurability while at the same achieving high performance in terms of area and throughput. Design details of proposed unified architectures and comprehensive analysis is described.

# **Introduction**

To meet the IoT security application requirements of wireless networks, cryptographic algorithms have emerged as the key component in SoC designs. To date, various encryption algorithms SNOW, AES, KASUMI, ZUC, MISTY1, etc. have been developed to ensure data confidentiality / integrity. The usage / implementation of these algorithms, however, mainly depends upon the applications and security requirements. In recent times, the dynamic

<span id="page-1-0"></span>and Education Integrated Base in the form of a grant to HJ [422A0303].

**Competing interests:** The authors have declared that no competing interests exist.

reconfigurable cryptographic computations are proven to be of greater advantages against the hardware Trojans. This study investigates FPGA based reconfigurable hardware implementation of MISTY1 and KASUMI algorithms exploring SRL32 primitive and DPR technology. The proposed work has a broad range of applications including military equipment, sensor networks and wireless communications [\[1–6\]](#page-14-0).

FPGAs are the widely being adopted platforms for cryptographic circuit implementation due to their prospective features for high-end flexibility, sophisticated on-chip interconnects and run-time reconfigurable computing. In addition, the run-time reconfiguration of FPGAs with RLUT (SRL32 primitive) and DPR has certainly aided towards secure design package. The RLUT reconfigures the hardware core by internally modifying the pre-defined LUT contents. On the other hand, the DPR scheme allows a part of bit stream to be uploaded dynamically at predefined stages / set-timings without arbitration of the running FPGA operation. This allows the FPGAs for hardware implementations in hostile environments  $[7-10]$ .

MISTY1 / KASUMI algorithms are designed to encrypt 64-bit block of data. MISTY1 is an ISO standardized block UMTS cipher developed by Mitsubishi Electric and is widely being employed in Japanese ATM networks. Followed by MISTY1, KASUMI is a 3gpp core algorithm deployed for confidentiality / integrity of user in 3g Universal Mobile Telecommunications System (UMTS) networks. The two algorithms resemble in terms of Linear / Differential cryptanalysis, architectural characteristics, and the key requirements; nevertheless, they have several contrasting features / design constraints for round operations. The attacks against MISTY1 and 3gpp standardized KASUMI have identified the weaknesses; however, no serious drawbacks / shortcomings have been found against the full 8-rounds MISTY1 and 3gpp standardized KASUMI output feedback (OFB) and Cipher Block Chaining (CBC) modes [[11–14\]](#page-14-0).

A detailed investigation has been carried out on the optimization techniques of MISTY1 and KASUMI algorithms, reconfigurable hardware implementations, PUFs implementation, DPR methodology and SRL32 primitive. Regarding MISTY1 and KASUMI implementations, variety of high speed and low area hardware architectures have been proposed till date for compact ASIC applications and sensor networks. A deep insight reveals that the optimization techniques adopted on MISTY1 can be employed to its counterpart KASUMI with essential modifications. Moreover, the common logic functions can be utilized for reconfigurable hardware implementation. In this regard, a highly area-efficient implementation of reconfigurable MISTY1 / KASUMI is proposed in constituting 3481 NAND gates; however, the hardware is based on static reconfiguration. Thus, the entire bit-stream and logic functionality / selection methodology has to be uploaded on the initial-run of FPGA and is, therefore, more prone to attacks. Moreover, the repetitive-loop structure results in a low throughput value of 130.2 Mbps and in-turn the high-power consumption [[15–26\]](#page-15-0). Studies on static reconfigurable cryptographic modules have also been proposed for SNOW / ZUC algorithms and AES / KASUMI / SNOW / ZUC algorithms signifying flexible implementations [[14](#page-14-0), [27–36\]](#page-15-0).

Compared to static hardware circuits, dynamic reconfigurable implementations appear with the added advantages of security and power reduction. The SRL32 primitive has been reported for applications pertaining to constructive as well as destructive usages. The constructive applications include the dynamic operability of LUTs whereas the destructive application appears in the form of hardware Trojans insertion for the retrieval of secret keys. On the other hand, the DPR methodology has been adopted by designers to employ multiple algorithms. The study is proposed in employed DPR as the core hardware for implementation of PUF-- HELP and AES. The DPR technology employing AES, TDEA, MISTY1, CAMELLIA, SEED and CAST-128 has also been reported in representing power efficient design. In addition, the technique has also been employed to implement the variants of AES i.e., AES– 128 / 192 and 256 bits [\[27,](#page-15-0) [28,](#page-15-0) [30\]](#page-15-0).

<span id="page-2-0"></span>The studies reported to date have either employed the low area or high speed techniques for MISTY1/KASUMI implementations. No studies have been found which have identified the similarities of the two 64-bit MISTY1 and KASUMI algorithms and implement them for dynamic reconfigurability. The same has been implemented with while keeping in view the key advantages of SRL32 primitive and DPR architecture. Dynamic reconfigurable MISTY1 / KASUMI architectures have implemented with improved performance in terms of occupied area, power consumption, RT, and throughput [\[21–24\]](#page-15-0).

The key contributions of paper include:

- 1. Design and configuration of RLUT based MISTY1 / KASUMI architecture.
	- Implementation of MISTY1 / KASUMI transformation functions (FL, FI, FO, and 32-bit XOR) and S9 / S7 s-boxes by SRL32.
	- Novel methodology for dynamic reconfiguration of RLUT architecture by utilizing the counter data for compact implementation.
- 2. Implementation of DPR based reconfigurable hardware architecture.
	- Optimization / configuration of static part utilizing the common logic for single round MISTY1 / KASUMI algorithm.
	- Reconfigurable logic designing of MISTY1 / KASUMI logic functions to complement the static logic for low power implementation.

The paper is preceded by brief description to reconfiguration methodologies and covers the review on MISTY1 and KASUMI algorithms followed by the proposed RLUT and DPR based designs respectively. Finally, the results are summarized with conclusion.

# **RLUT, DPR and review of MISTY1 / KASUMI algorithms**

# **RLUT and DPR**

The RLUT and DPR methodology for FPGAs is depicted in Figs 1 and [2](#page-3-0) respectively.

The SRL32 consists of  $5-1$  LUT or  $4-2$  LUTs with reconfiguration data enable signal (En), clock, reconfiguration data input (RDI) and reconfiguration data output (RDO). The RDI is serially shifted to the LUT on clock edge and high En whereas the old value is depleted from LUT through RDO. The RDI of RLUTs may comprise of the output values of existing LUTs



**Fig** 1. RLUT (configured as  $1 \times$  SRL32 primitive or  $2 \times$  SRL16).

<span id="page-3-0"></span>

**[Fig](#page-2-0) 2. DPR methodology in FPGAs.**

and thus can be connected in cascaded form [\[25\]](#page-15-0). This methodology reconfigures the LUTs internally i.e. no external bit stream is uploaded to modify the hardware. The RT of RLUT architecture depends upon path delay and the number of clock cycles required for re-configuration of LUTs. On the other hand, DPR technology consists of static / dynamic modules with static part representing the logic circuit and is uploaded during the initial run of FPGA. The partial bit stream stored in external DDR (memory) can be dynamically loaded into FPGA core to update the hardware circuit. The DPR supports the uploading baud rate of 400 Mbps and therefore the RT depends upon the size of partial bit stream. The greater the size of partial bit stream, the higher is the RT [\[26\]](#page-15-0).

#### **Review of MISTY1 / KASUMI algorithms**

A brief review on the similarities and contrasting features of MISTY1 / KASUMI algorithms is presented in [Table](#page-4-0) 1. The algorithms and their transformation functions FL, FO, FI and S9 / S7 s-boxes differ in terms of functionality. In addition, the extended key generation and key

<https://doi.org/10.1371/journal.pone.0291429.g002>



#### <span id="page-4-0"></span>**[Table](#page-3-0) 1. Similarities and differences of MISTY1 and KASUMI.**

<https://doi.org/10.1371/journal.pone.0291429.t001>

scheduling of MISTY1 is entirely different to that of KASUMI. A detailed specification of MISTY1 / KASUMI algorithms is mentioned.

# **RLUT based reconfigurable MISTY1 / KASUMI**

The RLUT architecture is depicted in [Fig](#page-5-0) 3 consisting of single round MISTY1 / KASUMI algorithm, EK generation / scheduling and controller. The architecture is modeled with SRL32 primitive and dynamically configures the LUTs based on enable M1 / K and reconfiguration data (RD) signals. The RLUT architecture has been designed to implement MISTY1 / KASUMI logic functions by employing SRL32 such that no additional hardware resources are required. The proposed hardware design is instantiated for MISTY1 algorithm and is dynamically operated for KASUMI.

The input to the architectures includes 64-bit plain text, 128-bit secret key and control signals. The EKs required for 8-rounds MISTY1 / KASUMI algorithm are generated by single round algorithm (by FO function) before the round operations. In order to reconfigure the circuit, the SRL controller dynamically enables the configuration signals using pre-set timings. A 128-bit extended key and secret key is fed through the key scheduling scheme to RLUT based MISTY1 / KASUMI algorithm. A detailed explanation to single round algorithm, its transformation functions and key scheduling scheme is as under.

## **A single round MISTY1 / KASUMI algorithm**

A single round RLUT based reconfigurable MISTY1 / KASUMI algorithm is presented in [Fig](#page-5-0) [4.](#page-5-0) The repeated iterations of single round algorithm encipher the plain text and is saved in 2 × 32-bit registers R1 and R2 after 9 and 8 clock cycles for MISTY1 and KASUMI algorithms respectively. The additional clock cycle for the 8-round MISTY1 algorithm is due to the last (i.e. 9th) round consisting of  $2 \times$  FL functions. Besides,  $4 \times$  clock cycles are required to generate the extended for MISTY1 / KASUMI round operations.

<span id="page-5-0"></span>

#### **[Fig](#page-4-0) 3. RLUT architecture.**

<https://doi.org/10.1371/journal.pone.0291429.g003>







<span id="page-6-0"></span>

<https://doi.org/10.1371/journal.pone.0291429.t002>

The architectures characterize SRL32 primitive for FL function and 32-bit XOR implementation. The FL function consists of  $16 \times$  SRL32-1 and  $16 \times$  SRL32-2 for LSBs and MSBs respectively with initial values representing MISTY1 logic operation. The RLUTs are designed such that the re-configuration of LUTs can be carried out for FL function KASUMI operation without additional logic. This is obtained by deriving the relation for the input bits to formulate the RLUTs for MISTY1 and KASUMI FL function operations and is given in Table 2 as under.

For each output bit, the RLUTs input signals are applied based on the combination given in Tab. 2. The initial values stored in RLUTs comprise of MISTY1 FL function. Thus, in order to configure the circular left shift operation of FL function for KASUMI algorithm, the SRL32 values are updated by RDI1 and RDI2 connected to counter C0 signal. Thus, FL function can be implemented without additional hardware cost using the counter of RLUT architecture. In a similar way, a 32-bit XOR implementation is carried out by employing  $2 \times$  SRL32 sets i.e., SRL32-3 and SRL32-4 executing odd and even round XOR operations respectively. The SRL32 for XOR operation is implemented as  $2 \times$  SRL16 generating 2  $\times$  outputs. The RD<sub>I3</sub> and RD<sub>I4</sub> are connected to  $C_2$  signal of the counter and update the LUTs by retaining the 16-bit initial values. The use of RLUTs for 32-bit XOR implementation with SRL32 based controller's M1 / K also eliminates the use of input multiplexers. Hence, the logic functionality of MISTY1 and KASUMI odd / even rounds XOR operation is obtained.

Table 3 summarizes the initial values and the reconfigurable values of SRL32-1, 2, 3 and 4. The re-configuration delay of FL function and 32-bit XOR is 33  $\times$  clock cycles and 16  $\times$  clock cycles respectively. The additional delay of  $1 \times$  clock cycle for FL function is due to the reconfigurable values for which the data enable has to be de-asserted for one clock cycle. The RT of FL function thus dictates the delay for the proposed algorithm to dynamically modify the circuit operation from MISTY1 to KASUMI.

The FO function and the respective FI function for RLUT architecture are shown in [Fig](#page-7-0) 5a [and](#page-7-0) 5b respectively. The FO function performs round operations as well as EK generation for MISTY1 and KASUMI. The optimum operation of FO function is performed by reconfiguring the FI (S9 and S7 s-boxes) as well as enabling or disabling the KO<sub>i</sub> XORs by asserting the required secret key,  $C_1$ - $C_8$  constants and logic zero values. For MISTY1, the EKs are generated by nullifying the effect of  $KO<sub>i</sub> XORs$  using logic zeros to output the FI function. On the other

| Ftn        | <b>SRL32 Primitive</b> | <b>Initial Value</b> | Reconfigurable Value | RT                       |
|------------|------------------------|----------------------|----------------------|--------------------------|
| FL         | SRL32-1                | 6A6A6A6A             | AA555555             | $33 \times$ clock cycles |
|            | SRL32-2                | A6A6A6A6             | AAAAAA55             | $33 \times$ clock cycles |
| 32-bit XOR | SRL32-3                | 66660FF0             | C3C36666             | $16 \times$ clock cycles |
|            | SRI 32-4               | 6666CC33             | C3C36666             | $16 \times$ clock cycles |

**Table 3. RLUT contents and RT for FL and 32-bit XOR.**

<span id="page-7-0"></span>



hand, the KO<sub>i</sub> XORs constituting  $C_1$ - $C_8$  constant values are utilized for EK generation of KASUMI. The output  $EK_1$  and  $EK_2$  for MISTY1 and KASUMI are thus saved in EK register.

The FI function consists of  $4 \times$  Feistel rounds with the last round consisting of S7 s-box as buffer SRLs for MISTY1. In addition, the logic XOR appended with S7 function is cancelled out by the output of AND gates driven through low M1 / K signal reproducing the 7-bit MSB output. Therefore, the FI function operates for MISTY1 by initiating MISTY1 S9 and S7 sboxes for the first 3 Feistel rounds. Later, S9 and S7 s-boxes can be reconfigured for KASUMI by enabling the last Feistel round with M1 / K signal. The optimized S9 and S7 s-boxes are based on SRL32 primitive and are explained below.

#### **Implementation of reconfigurable S-boxes with RLUTs**

The S7 s-box of (2nd round) FI function is designed using 5-input functions with output logic consisting of XOR for each S7 expression. The input and output bits relationship for MISTY1 and KASUMI S7 s-boxes is presented in [Table](#page-8-0) 4. Based on input bits relationship for MISTY1 and KASUMI S7 s-box, a swapping circuit is required as depicted in [Fig](#page-8-0) 6a. The output logic for KASUMI operation, on the other hand, requires reconfiguration as well as inversion for  $y_3$ ,  $y_4$ ,  $y_5$  and  $y_6$ . In order to incorporate inversion, the output logic of S7 s-box is incorporated with additional M1 / K signal as shown in [Fig](#page-8-0)  $6b$  and therefore flips the respective bit for KASUMI operation.

The S7 s-box is designed with SRL32 consisting of initial contents for MISTY1. By inserting low M1 / K signal, the swapping and flipping of the input and output sides is discarded such that input and output circuits act as a buffer and thus S7 s-box operates for MISTY1. To upload the KASUMI S7 s-box contents, a cascaded reconfiguration data operation is performed. Thus, the contents of SRL32 for MISTY1 S7 s-box are re-utilized to configure  $2 \times S7$ 

| Input  |               | Output         |                |  |
|--------|---------------|----------------|----------------|--|
| MISTY1 | <b>KASUMI</b> | MISTY1         | <b>KASUMI</b>  |  |
| $x_0$  | $X_6$         | y <sub>0</sub> | Y <sub>6</sub> |  |
| $x_1$  | $x_5$         | y <sub>1</sub> | $y_2$          |  |
| $x_2$  | $x_4$         | Y <sub>2</sub> | Y <sub>5</sub> |  |
| $X_3$  | $X_3$         | Y <sub>3</sub> | y <sub>1</sub> |  |
| $x_4$  | $X_2$         | Y <sub>4</sub> | $V_3$          |  |
| $X_5$  | $X_1$         | y <sub>5</sub> | y <sub>0</sub> |  |
| $X_6$  | $x_0$         | Y <sub>6</sub> |                |  |

<span id="page-8-0"></span>**[Table](#page-7-0) 4. RLUT contents and reconfiguration time for FL and 32-bit XOR.**

<https://doi.org/10.1371/journal.pone.0291429.t004>

s-boxes (KASUMI has  $2 \times$  S7 s-boxes in FI) in 32 clock cycles for KASUMI. This methodology represents a significant reduction in area for S7 s-boxes implementation.

The flow diagram for implementation of S9 s-boxes is shown in [Fig](#page-9-0) 7. The S9 s-boxes are implemented by eliminating the Close Support Electronics (CSEs) and logically formulating the LUTs for MISTY1 and KASUMI algorithms. The LUTs are designed such that the output combination of MISTY1 / KASUMI S9 s-boxes drives the RLUTs with 4-bit input. Thus, SRL32 are employed at the output of each expression for S9 s-box to perform a MUX operation as well as XOR function for MISTY1 / KASUMI. The reconfiguration time of SRL32 for S9 s-box is  $16 \times$  clock cycles.

#### **Key scheduling**

í.

The key scheduling for RLUT architecture is presented in [Fig](#page-9-0) 8. Since the round operations require  $11 \times 16$ -bit keys, the input multiplexers select the required secret keys and extended



**[Fig](#page-7-0) 6. a) Circuit for swapping the inputs, b) S7 s-box.**

<span id="page-9-0"></span>

#### **[Fig](#page-8-0) 7. S9 s-box.**

<https://doi.org/10.1371/journal.pone.0291429.g007>

keys for respective rounds. However, the KASUMI secret keys differ from MISTY1 in terms of 1-bit, 5-bit, 8-bit and 13-bit circular left shift rotations.

The RLUTs are employed to act as buffers for MISTY1 and are reconfigured by Counter's  $C_2$  and  $C_3$  signals for KASUMI with RT of 16 clock cycles. The dynamic reconfigurable part also consists of  $2 \times 16$ -bit swapped keys for KASUMI FL Even function and is implemented



 $SK = \{ SK1 || SK2 || SK3 || ... || SK11 \}$ 

**[Fig](#page-8-0) 8. Key scheduling for RLUT architecture.**

the same way as [Fig](#page-8-0) 6a. Thus, the required 64-bit cipher text is generated for MISTY1 / KASUMI with the proposed key scheduling scheme.

## **DPR based reconfigurable MISTY1 / KASUMI**

A DPR based MISTY1 / KASUMI architecture is presented in Fig 9 and differs from RLUT hardware in terms of reconfiguration technique and EK generation / scheduling. The reconfiguration is carried out by dynamically uploading the partial bit stream stored in external hardware. For the proposed DPR architecture, the FPGA core is preset for MISTY1 algorithm (similar to RLUT architecture); however, the design can also be stipulated for vice versa implementation. This infers that the static logic only consists of common logic for MISTY1 and KASUMI and can be configured by uploading either of the partial bit streams for MISTY1 or KASUMI.

The primary advantage of this design is the power reduction compared to un-optimized equivalent parallel architectures for MISTY1 / KASUMI. The design is based on re-utilization of common logic and is configured as the static part whereas the partial bit streams consist of dynamic logic for the transformation functions to be uploaded on required MISTY1 /



<span id="page-11-0"></span>KASUMI operations. The varying logic functionality therefore results in reconfigurable architecture.

#### **A single round MISTY1 / KASUMI algorithm**

A single round MISTY1 / KASUMI algorithm for DPR architecture is presented in Fig 10a consisting of static and dynamic parts for  $2 \times FL$  functions and FO function (the dotted parts represent the dynamic logic). For FL functions, the logic combinations AND / OR / XORs are re-utilized; the reconfiguration can be carried out for KASUMI algorithm by implementing 2 × 1-bit circular left-shift. Similarly, 32-bit XOR can be implemented for single round MISTY1 / KASUMI by reconfiguring the input mixes.

Unlike RLUT architecture, a single round DPR algorithm employs FO (FI) function to generate EK (i.e.,  $EK_1$  and  $EK_2$ ) only for MISTY1 algorithm whereas the EK for KASUMI is generated by independent i.e., reconfigurable key generation module. In addition, the FO function consists of static / dynamic logic parts for FI function as shown in Fig 10b. The reconfigurable FI function has S9 / S7 s-boxes and 7-bit XOR as reconfigurable parts and thus the 4th Feistel round is configured only for KASUMI FI function operation. The s-boxes S9 and S7 s-boxes are implemented using CSE methodology. For S7 s-box, the combinational logic is re-utilized by configuring the input / output bits. On the other hand, the S9 s-box consists of combined CSEs for MISTY1 / KASUMI as static logic whereas additional logic of S9 s-box can be dynamically configured for MISTY1 / KASUMI implementation. The S9 s-box logic details for static part and dynamic parts are shown in [Table](#page-12-0) 5 showing 42% area reduction compared to nonoptimized equivalent parallel S9 s-boxes implementation.





<https://doi.org/10.1371/journal.pone.0291429.g010>



#### <span id="page-12-0"></span>**[Table](#page-11-0) 5. % area reduction (NAND gates) for S9 s-box implementation.**

<https://doi.org/10.1371/journal.pone.0291429.t005>

#### **Extended key generation and key scheduling**

The key scheduling scheme of the proposed DPR architecture is shown in Fig 11. It has been designed for dynamic operability such that the secret keys are applied through reconfigurable 1-bit, 5-bit, 8-bit, and 13-bit circular left shift operation whereas  $2 \times 16$ -bit keys of FL Even for MISTY1 are swapped in the dynamic region for KASUMI algorithm. The extended key of KASUMI is generated in 1 clock cycle by implementing  $8 \times 16$ -bit XORs in reconfigurable region for KASUMI algorithm. The 16-bit secret keys and extended keys are finally multiplexed to output the required  $11 \times 16$ -bit key for MISTY1 / KASUMI algorithm round operations.

#### **FPGA implementation—Results and analysis**

The proposed reconfigurable hardware architectures are implemented on Xilinx Vertex 7 FPGA, XC7VX690T. [Table](#page-13-0) 6 summarizes the analysis of the proposed designs based on the performance parameters including area utilization, power consumption, speed, and RT.

The RLUT architecture shows remarkable results with high throughput value and lesser RT of 364 Mbps and 445 nsec respectively compared to its counterpart DPR architecture with parametric values of 176 Mbps and 1.1 msec. Moreover, the occupied area and the power consumption of RLUT architecture is found to be 668 CLB slices and 460 mW respectively and are comparable to the proposed DPR architecture occupying 514 CLB slices with power dissipation of 354 mW. This signifies the advantages of RLUT architecture outperforming existing reconfigurable implementations.

The proposed DPR architecture, on the other hand, shows comparable performance with the DPR architectures of refs [\[27](#page-15-0), [28](#page-15-0), [30](#page-15-0)]. In terms of area, the proposed reconfigurable



#### **Fig 11. Key scheduling.**



<span id="page-13-0"></span>

<https://doi.org/10.1371/journal.pone.0291429.t006>

architectures have lesser CLB slices whereas reconfiguration time of our RLUT architecture is lesser as compared to ref [\[27\]](#page-15-0). It is worth mentioning that no DPR architectures have been proposed for MISTY1/KASUMI; the studies have been found only for implementation of AES with HELP algorithm. Contrary to DPR architecture, the novel RLUT architecture has never been proposed for implementation of two or more cryptographic circuits. Therefore, in terms of run-time reconfigurability, the RLUT architecture is novel and has been compared with DPR / stand-alone hardware architectures.

Finally, to obtain a fair comparison, a single round parallel MISTY1 and KASUMI architecture is also implemented for reference. The RLUT / DPR schemes are both area and power efficient compared to single round parallel MISTY1 / KASUMI whereas higher reconfiguration time of our architectures can be countered with the key advantage of dynamic reconfiguration. Thus, the proposed architectures outperform the straight-forward implementation and have core applications in sensor networks and military wireless gadgets.

## **Conclusion**

This paper presents state-of-the-art dynamically reconfigurable hardware architectures for MISTY1 and KASUMI block ciphers. The DPR and SRL based FPGA schemes are explored for run-time reconfiguration of MISTY1 and KASUMI algorithms by utilizing common logic. The RLUT architecture signifies core advantages of dynamic reconfiguration, high throughput, and lesser reconfiguration time with comparable parametric values of area / power of the proposed DPR architecture. The hardware designs are highly suitable for military applications and wireless sensor networks. The proposed methodologies can be extended to security architecture for UMTS networks employing multiple algorithms for confidentiality and integrity in 3g and 4g systems. To sum up, this work can be regarded as a significant development in circuit design and cryptography leading to future generation security architecture designs.

#### <span id="page-14-0"></span>**Author Contributions**

**Conceptualization:** Huang Jiexian.

**Data curation:** Yasir Khizar.

**Formal analysis:** Yasir Khizar.

**Investigation:** Yasir Khizar.

**Methodology:** Yasir Khizar.

**Project administration:** Zain Anwar Ali.

**Resources:** Raza Hasan, Muhammad Salman Pathan.

**Software:** Huang Jiexian, Raza Hasan, Muhammad Salman Pathan.

**Supervision:** Zain Anwar Ali.

**Writing – original draft:** Raza Hasan, Muhammad Salman Pathan.

**Writing – review & editing:** Muhammad Salman Pathan.

#### **References**

- **[1](#page-1-0).** Dang, Viet B., Farnoud Farahmand, Michal Andrzejczak, and Kris Gaj. "Implementing and benchmarking three lattice-based post-quantum cryptography algorithms using software/hardware codesign." In 2019 International Conference on Field-Programmable Technology (ICFPT), pp. 206–214. IEEE, 2019.
- **2.** Jasim, Khalid Fadhil, Kayhan Zrar Ghafoor, and Halgurd S. Maghdid. "Analysis of Encryption Algorithms Proposed for Data Security in 4G and 5G Generations." In ITM Web of Conferences, vol. 42, p. 01004. EDP Sciences, 2022.
- **3.** Bisht Neeraj, Pandey Bishwajeet, and Budhani Sandeep Kumar. "Comparative performance analysis of AES encryption algorithm for various LVCMOS on different FPGAs." World Journal of Engineering  $(2022)$
- **4.** Yazdeen Abdulmajeed Adil, Zeebaree Subhi RM, Sadeeq Mohammed Mohammed, Shakir Fattah Kak, Ahmed Omar M., and Zebari Rizgar R. "FPGA implementations for data encryption and decryption via concurrent and parallel computation: A review." Qubahan Academic Journal 1, no. 2 (2021): 8–16.
- **5.** Chen, Jingjing, and Fucheng You. "An image encryption algorithm based on SM4 and Base64." In Journal of Physics: Conference Series, vol. 1812, no. 1, p. 012041. IOP Publishing, 2021.
- **[6](#page-1-0).** Alkamil Arkan, and Perera Darshika G. "Towards dynamic and partial reconfigurable hardware architectures for cryptographic algorithms on embedded devices." IEEE Access 8 (2020): 221720–221742.
- **[7](#page-1-0).** Sala Della, Riccardo Davide Bellizia, and Scotti Giuseppe. "A novel ultra-compact fpga puf: The ddpuf." Cryptography 5, no. 3 (2021): 23.
- **8.** Zerrouki Fahem, Ouchani Samir, and Bouarfa Hafida. "A survey on silicon PUFs." Journal of Systems Architecture 127 (2022): 102514.
- **9.** Roy Debapriya Basu, Bhasin Shivam, Nikolić Ivica, and Mukhopadhyay Debdeep. "Combining puf with rluts: a two-party pay-per-device ip licensing scheme on fpgas." ACM Transactions on Embedded Computing Systems (TECS) 18, no. 2 (2019): 1–22.
- **[10](#page-1-0).** Aranda Luis Alberto, Sánchez-Macián Alfonso, and Maestro Juan Antonio. "ACME: A tool to improve configuration memory fault injection in SRAM-based FPGAs." IEEE Access 7 (2019): 128153–128161.
- **[11](#page-1-0).** Muthalagu Raja, and Jain Subeen. "A novel modified KASUMI block chiper for global system for mobile communications." International Journal of Computers and Applications 43, no. 8 (2021): 805–811.
- **[12](#page-13-0).** Yasir Ning Wu, Zhang. "Compact hardware implementations of MISTY1 block cipher." Journal of Circuits, Systems and Computers, 2018, vol. 27, no. 3, pp. 14. [https://doi.org/10.1142/](https://doi.org/10.1142/S0218126618500378) [S0218126618500378](https://doi.org/10.1142/S0218126618500378)
- **[13](#page-13-0).** Yasir Ning Wu, Chen Xin. "FPGA based highly efficient MISTY1 architecture." IEICE Electronics Express, 2017, vol. 14, no. 18, p. 2017084[1https://doi.org/10.1587/elex.14.20170841](https://doi.org/10.1587/elex.14.20170841)
- **[14](#page-13-0).** Yasir, Wu Ning, Ali Zain Anwar, Shaikh Muhammad Mujtaba, Yahya Muhammad Rehan, and Aamir Muhammad. "Compact and high speed architectures of KASUMI block cipher." Wireless Personal Communications 106 (2019): 1787–1800. <https://doi.org/10.1007/s11277-018-5606-8>
- <span id="page-15-0"></span>**[15](#page-1-0).** Yasir Ning Wu, Chen Xin. "Area-efficient hardware architectures of MISTY1 block cipher." Radioengineering, Vol 27, No. 2, June 2018.
- **[16](#page-13-0).** Yamamoto Itoh, Yajima. "Compact architecture for ASIC and FPGA Implementation of KASUMI Block Cipher." IEICE Transactions on fundamentals of Electronics, Communications and Computer Sciences. 2011, vol. E94-A, p. 2628–2638. <https://doi.org/10.1587/transfun.E94.A.2628>
- **[17](#page-13-0).** Yasir, Wu Ning, Zhang. "Highly optimized reconfigurable hardware architecture of 64-bit block ciphers MISTY1 and KASUMI." IET Electronics Letters, 2017, vol. 53, no. 1, p. 10–11. [https://doi.org/10.1049/](https://doi.org/10.1049/el.2016.3982) [el.2016.3982](https://doi.org/10.1049/el.2016.3982)
- **[18](#page-13-0).** Gupta Chattopadhyay, Khalid. "Designing Integrated Accelerator for Stream Ciphers with Structural Similarities." Cryptography and Communications-Discrete Structures Boolean Functions and Sequences, 2013, vol. 5, no. 1, p. 19–47. <https://doi.org/10.1007/s12095-012-0074-6>
- **19.** Yamashita Ryozo. "History of personal computers in Japan." International Journal of Parallel, Emergent and Distributed Systems 35, no. 2 (2020): 143–169.
- **20.** Manz Olaf. "Symmetric Ciphers." In Encrypt, Sign, Attack: A compact introduction to cryptography, pp. 19–51. Berlin, Heidelberg: Springer Berlin Heidelberg, 2022.
- **[21](#page-2-0).** Ntantogian Christoforos, Veroni Eleni, Karopoulos Georgios, and Xenakis Christos. "A survey of voice and communication protection solutions against wiretapping." Computers & Electrical Engineering 77 (2019): 163–178.
- **22.** Jayashri N., and Kalaiselvi K. "Cloud Cryptography for Cloud Data Analytics in IOT." Machine Learning Approach for Cloud Data Analytics in IoT (2021): 119–142.
- **23.** Stangherlin, Kleber, Zhuanhao Wu, Hiren Patel, and Manoj Sachdev. "Design exploration and security assessment of puf-on-puf implementations." arXiv preprint arXiv:2206.11840 (2022).
- **[24](#page-2-0).** Tripathi, Sayan, Jhilam Jana, Jagannath Samanta, and Jaydeb Bhaumik. "Fast and power efficient secded and sec-ded-daec codes in iot based wireless sensor networks." In TENCON 2019–2019 IEEE Region 10 Conference (TENCON), pp. 540–545. IEEE, 2019.
- **[25](#page-3-0).** Mileva Aleksandra, Dimitrova Vesna, Kara Orhun, and Mihaljević Miodrag J. "Catalog and Illustrative Examples of Lightweight Cryptographic Primitives." Security of Ubiquitous Computing Systems: Selected Topics (2021): 21–47.
- **[26](#page-3-0).** Hasan Raza, Khizar Yasir, Mahmood Salman, and Sheikh Muhammad Kashif. "Design Space Exploration for High-Speed Implementation of the MISTY1 Block Cipher." Mathematical Problems in Engineering 2021 (2021): 1–14.
- **[27](#page-1-0).** Pocklassery, Venkata, Plusquellic. "Physical Unclonable Functions and Dynamic Partial Reconfiguration for Security in Resource-Constrained Embedded Systems", In Proceedings 2017 IEEE International Symposium on Hardware Oriented Security and Trust, USA, 2017, p. 116–121, ISBN:978-1- 5386-3929-0.
- **[28](#page-13-0).** Hori, Kastashita, Kobara. "Energy and Area Saving Effect of Dynamic Partial Reconfiguration on a 28 nm Process FPGA." In 2013 IEEE 2nd Global Conference on Consumer Electronics, Japan, 2013, p. 217–218, ISBN: 978-1-4799-0892-9.
- **29.** Wang Zhu, Yao Yan, Tong Xiaojun, Luo Qinghua, and Chen Xiangyu. "Dynamically reconfigurable encryption and decryption system design for the internet of things information security." Sensors 19, no. 1 (2019): 143. <https://doi.org/10.3390/s19010143> PMID: [30609820](http://www.ncbi.nlm.nih.gov/pubmed/30609820)
- **[30](#page-12-0).** Burman, Rangababu, Datta. "Development of Dynamic Reconfiguration Implementation of AES on FPGA Platform." In Proceedings of 2nd International Conference on Devices for Integrated Circuit, India, 2017, p. 247–251, ISBN: 978-1-5090-4724-6.
- 31. Hettwer, Benjamin, Johannes Petersen, Stefan Gehrer, Heike Neumann, and Tim Güneysu. "Securing cryptographic circuits by exploiting implementation diversity and partial reconfiguration on FPGAs." In 2019 Design, Automation & Test in Europe Conference & Exhibition (DATE), pp. 260–263. IEEE, 2019.
- **32.** Heeger, Derek, and Jim Plusquellic. "Analysis of IoT authentication over LoRa." In 2020 16th International Conference on Distributed Computing in Sensor Systems (DCOSS), pp. 458–465. IEEE, 2020.
- **33.** Sevin Abdullah, and Mohammed Abdu Ahmed Osman. "A survey on software implementation of lightweight block ciphers for IoT devices." Journal of Ambient Intelligence and Humanized Computing (2021): 1–15.
- **34.** Shi Zhengyuan, Chen Cheng, Yang Gangqiang, Xiong Hailiang, Li Fudong, Hu Honggang, et al. "Design space exploration of galois and fibonacci configuration based on espresso stream cipher." ACM Transactions on Reconfigurable Technology and Systems (TRETS) (2022).
- **35.** Soliman Shady, Jaela Mohammed A., Abotaleb Abdelrhman M., Hassan Youssef, Abdelghany Mohamed A., Abdel-Hamid Amr T., et al. "FPGA implementation of dynamically reconfigurable IoT security module using algorithm hopping." Integration 68 (2019): 108–121.
- **[36](#page-1-0).** Derbez, Patrick. "Tools and Algorithms for Cryptanalysis." PhD diss., Université Rennes 1, 2022.