人工智能论文英文版-LengClaro2023:A Dataset of Administrative Texts in Spanish with Plain Language adaptations.pdf
LengClaro2023:ADatasetofAdministrativeTextsinSpanish
withPlainLanguageadaptations
BelénAgüera-MarcoItziarGonzalez-Dios
UniversityoftheBasqueCountryUPV/EHUHiTZCenter-Ixa
baguera001@ikasle.ehu.eusUniversityoftheBasqueCountryUPV/EHU
itziar.gonzalezd@ehu.eus
Abstractretainingtheoriginalinformationcontentand
meaning(Siddharthan,2014).Thisinvolves
5Inthiswork,wepresentLengClaro2023,modifyingthecontentandstructureofthetext
adatasetoflegal-administrativetextsin
2inordertoimproveitsunderstandabilityand
0Spanish.Basedonthemostfrequentlyusedmakeiteasiertoread(AlvaManchegoetal.,
2proceduresfromtheSpanishSocialSecu-
nritywebsite,wehavecreatedforeachtext2020).Inthiswork,wehaveimplementedthese
utwosimplifiedequivalents.Thefirstver-modificationstoensurethatthetextsadhereto
JsionfollowstherecommendationsprovidedtheprinciplesofPlainLanguage.Asdefinedby
6byarTextclaro.Thesecondversionincor-theInternationalPlainLanguageFederation:
poratesadditionalrecommendationsfrom
]plainlanguageguidelinestoexplorefurtherAcommunicationisinplainlan-
L
Cpotentialimprovementsinthesystem.Theguageifitswording,structure,