Méthodologie d’harmonisation et de traitement des données orales du CÉFC / Christophe Benzitoun, Carole Etienne // Langages Nº 219 (3/2020)
France : Armand Colin, 2020
p. 39-52

The CÉFC corpus brings together data from several different sources in order to make the diversity of spoken French observable, at least in part. Solving the problems inherent in the heterogeneity of these data is therefore intrinsic to building this resource and is motivated by its objective. This article describes, step by step, the methodological approach that enables us to build a homogeneous resource by pooling these different sources, so as to provide coherent automatic annotations and to facilitate the analysis of an oral corpus of several million words.