کتابداری و اطلاع‌رسانی (Mar 2021)

A Self-recovery Digital Watermarking Approach for Tamper Detection of Handwritten and Printed Electronic Documents

  • Azadeh Bastani,
  • Esmaeel Fatemi Behbahani

DOI
https://doi.org/10.30481/lis.2020.235473.1727
Journal volume & issue
Vol. 24, no. 1
pp. 174 – 193

Abstract

Read online

Objective: The aim of this study is to present a brief discussion of the digital image watermarking for texts in order to prove the documents authentication and copyright protection and to introduce a new method of digital watermarking of printed and handwritten Persian documents image for tamper detection and recovery. Methodology: This paper is based on an applied research.In this study, 20 Persian manuscripts and 35 printed texts were used to simulate the algorithm. In the proposed algorithm, using the information of the image pixels and combining it with a hidden key, we produced the digital watermark and embedded it in the text image. We then intentionally attacked the image and extracted the hidden watermark in the text. Finally, due to the self-recovery capability of the method, the tampered areas were detected and the document was restored to its original form. Findings: In all 55 test images, separate watermark were embedded according to the proposed algorithm. To evaluate proposed system the three types of attacks includes adding, removing and updating, 30 random images were selected. In 20 images, all three types of manipulation were applied. Then the watermark was extracted and the results were studied. The accuracy of proposed method in detecting and correcting tampered areas was calculated according to two metrics: Tamper Detection Accuracy (TDA) and Tamper Recognition Accuracy (TRA). The algorithm was able to identify manipulated areas with an accuracy of 93.82 percent and correct and recover changes with an accuracy of 92.27 percent. It is important to note that the results obtained are based on the number of correctly identified bits. Therefore, the accuracy of less than 100 percent does not mean inability to detect some manipulated areas; Rather, as can be deduced from the intuitive results, this means that despite the correct diagnosis of the tampered area, only some pixels of that area have not been correctly detected or recovered and therefore do not have a negative effect on the final tamper proofing. For imperceptibility test, PSNR value is estimated. The average value of PSNR in the suggested method is 40.18 dB. Conclusion: Determining the documents authenticity without the availability of the original documents is one of the main necessities. Digital watermarking is one of the most powerful tools for document authentication and protection of copyright. Our proposed method has two special features that are considered to be its advantages over other methods. The first feature is the ability to work with all kinds of Persian text images. In addition to the ability to run on printed texts, it is also possible to model in handwritten texts as well as the combination of text and image. Another advantage is good rate of tamper detection and the ability to self-recovery of tampered images. The numerical and intuitive results indicate the efficiency of proposed scheme and so can be implemented for all handwritten and electronic printed texts. Therefore, this method can be used to protect the security of printed and handwritten documents in digital libraries, e-learning, e-government and any electronic document exchange system.

Keywords