NORMSERVIS s.r.o.

IEEE P3168

IEEE Approved Draft Standard for Robustness Evaluation Test Methods for a Natural Language Processing Service that uses Machine Learning

STANDARD published on 19.6.2024

English -
electronic (protected pdf) - Immediate download (56.00 USD)

English -
CD-ROM (57.50 USD)

The information about the standard:

Designation standards: IEEE P3168
Publication date standards: 19.6.2024
Approximate weight : 300 g (0.66 lbs)
Country: International technical standard
Category: Technical standards IEEE

Annotation of standard text IEEE P3168 :

New IEEE Standard - Active - Draft.
The Natural Language Processing (NLP) services using machine learning have rich applications in solving various tasks, and have been widely deployed and used, usually accessible by API calls. The robustness of the NLP services is challenged by various well-known general corruptions and adversarial attacks. Examples of general corruptions include inadvertent or random deletion, addition, or repetition of characters or words. Adversarial attacks generate adversarial characters, words or sentence samples causing the models underpinning the NLP services to produce incorrect results. This standard proposes a method for quantitatively evaluating the robustness the NLP services. Under the method, different cases the evaluation needs to perform against are specified. Robustness metrics and their calculation are defined. With the standard, the service stakeholders including the service developer, service providers, and service users can develop understanding of the robustness of the services. The evaluation can be performed during various phases in the life cycle of the NLP services, the testing phase, in the validation phase, after deployment, etc.

ISBN: 979-8-8557-0478-5, 979-8-8557-0478-5
Number of Pages: 27
Product Code: STDUD26745, STDAPE26745
Keywords: robustness evaluation, artificial intelligence, natural language processing service, evaluation metrics
Category: 309
Draft Number: P3168/D3, Aug 2023 - UNAPPROVED DRAFT, P3168/D3, Aug 2023 - APPROVED DRAFT