Preprint / Version 1

An Attempt at Japanese Natural Language Lint Using a Local Large Language Model

— Implementation of a Prompt-Engineering-Independent Natural Language Lint System —

##article.authors##

  • Tatsuo Kamitani The University of Fukuchiyama Accounting Faculty of Regional Management

DOI:

https://doi.org/10.51094/jxiv.2247

Keywords:

Natural Language Specification, Ambiguity Detection, JapaneseLint, LoRA-based Distillation, Specification Review Support

Abstract

This study proposes JapaneseLint, a natural language lint system designed to detect ambiguity in Japanese specifications and to automatically generate quantitatively constrained revision proposals. JapaneseLint adopts a lightweight fine-tuning architecture based on LoRA-based distillation, in which a task-specific reasoning style for specification review is embedded directly into a compact model using a general-purpose large language model (GPT-5.1) as a teacher. This design enables stable Lint-style inference without reliance on extensive prompt engineering, while minimizing context consumption at inference time.
For evaluation, twenty test specifications automatically generated by GPT-5.1 were used as inputs, and both inference time and textual output quality were assessed. The experimental results show that the average inference time was 1.803 s, with a maximum of 2.618 s, satisfying practical response-time requirements commonly reported in the HCI literature. Moreover, the generated diagnostic comments and revision proposals exhibited sufficient readability, objectivity, and quantitative rigor for practical use in real-world specification review tasks.
These results demonstrate that JapaneseLint can provide fast and practically usable support for ambiguity detection and quantitative revision of natural-language specifications under realistic computational constraints.

Conflicts of Interest Disclosure

There are no conflicts of interest regarding this paper.

Downloads *Displays the aggregated results up to the previous day.

Download data is not yet available.

References

(1) チョムスキー, N.(井上和子訳)『言語論』大修館書店 (1979)

(2) チョムスキー, N.・バーウィック, R. C.(渡会圭子訳)『チョムスキー言語学講義──言語はいかにして進化したか』ちくま学芸文庫, 筑摩書房 (2017)

(3) ソシュール, F. de(町田健訳)『新訳 ソシュール 一般言語学講義』研究社 (2016)

(4) 町田健『ソシュールと言語学──コトバはなぜ通じるのか』講談社現代新書 (2004)

(5) C. E. Shannon, W. Weaver:コミュニケーションの数学的理論,明治図書,pp.35-40 (1969)

(6) 美濃 導彦・西田 正吾 編. 情報メディア工学. 東京, オーム社, pp. 2-5 (1999)

(7) Daniel M. Berry; Erik Kamsties; Michael M. Krieger, From Contract Drafting to Software Specification: Linguistic Sources of Ambiguity, University of Waterloo Technical Report (2003)

(8) Alessandro Fantechi; Stefania Gnesi; Laura Semini, Rule-based NLP vs ChatGPT in Ambiguity Detection, a Preliminary Study, CEUR Workshop Proceedings (REFSQ 2023 NLP4RE), Vol.3378 (2023)

(9) Muhamad H. F. Muhamad; Nur Nasuha Mohd Noor; Nooraini Yusoff; Suriayati Chuprat S., Fault-Prone Software Requirements Specification Detection Using Ensemble Learning for Edge/Cloud Applications, Applied Sciences (MDPI), Vol.13, No.13 (2023)

(10) Stefania Gnesi; Gabriele Trentanni, QuARS: A NLP Tool for Requirements Analysis, CEUR Workshop Proceedings, NLP4RE 2019, Vol. 2376, pp. 55–64, Essen, Germany, (2019)

(11) Alessandro Fantechi; Stefania Gnesi; Laura Semini, Rule-based NLP vs ChatGPT in Ambiguity Detection, a Preliminary Study, CEUR Workshop Proceedings, REFSQ 2023 NLP4RE, Vol. 3378 (2023)

(12) Sarmad Bashir; Alessio Ferrari; Muhammad Abbas Khan; Per Erik Strandberg; Zulqarnain Haider; Mehrdad Saadatmand; Markus Bohlin, Requirements Ambiguity Detection and Explanation with LLMs: An Industrial Study, 41st International Conference on Software Maintenance and Evolution (ICSME), pp. 620–631 (2025)

(13) Alex Mendoza; Shromona Ghosh; Clark Barrett, NL2Spec: Interactively Translating Unstructured Natural Language to Temporal Logic with Large Language Models, arXivarXiv:2303.04864 (2023)

(1) Lubos Běhounek; Martin Hnětynka; Tomáš Vasilek; Pavel Král, Leveraging Large Language Models for the Quality Assurance of Software Requirements, IEEE International Requirements Engineering Conference (RE) (2024)

(15) Andrew J. Porter; Michael W. Godfrey; Ahmed E. Hassan, Requirements Specification Automated Quality Analysis, IEEE Computer, Vol.58, No.2, pp. 45-53 (2025)

(16) Malte Noller; Markus Borg; Gordon Fraser; Andreas Zeller, SpecFix: Repairing Natural Language Software Specifications, Proceedings of the ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE) (2024)

(17) Hyeong Jin Kim; Seungwon Park; Yejin Choi, Aligning Language Models to Explicitly Handle Ambiguity, arXiv:2404.119972 (2024)

(18) Alessandro Cimatti; Edmund M. Clarke; Enrico Giunchiglia; Marco Roveri, Industrial Adoption of Formal Methods: Barriers and Challenges, Formal Methods for Industrial Critical Systems (FMICS), Vol. 8187, pp.1-15 (2013)

(19) Andrea Ferrari; Paola Spoletini; Stefania Gnesi, Formal Requirements Engineering and Large Language Models, Information and Software Technology (Elsevier), Vol.181, Article 107697 (2025)

(20) Ben Shneiderman, Response Time and Display Rate in Human Performance with Computers, IEEE Transactions on Systems, Man, and Cybernetics, Vol. SMC-14, No. 2, pp. 265–270 (1984)

(21) Fiona Fui-Hoon Nah, A Study on Tolerable Waiting Time: How Long Are Web Users Willing to Wait?, Behaviour & Information Technology, Vol. 23, No. 3, pp. 153–163 (2004)

Posted


Submitted: 2025-12-12 05:05:56 UTC

Published: 2025-12-18 02:33:51 UTC
Section
Information Sciences