Repository: common-voice/cv-dataset
Branch: main
Commit: d93bea708ce1
Files: 72
Total size: 5.3 MB

Directory structure:
gitextract_44_0gd_b/
├── CHANGELOG.md
├── LICENSE
├── README.md
├── datasets/
│   ├── code-switching/
│   │   └── README.md
│   ├── scripted-speech/
│   │   ├── CHANGELOG.md
│   │   ├── README.md
│   │   ├── cv-corpus-1.json
│   │   ├── cv-corpus-10.0-2022-07-04.json
│   │   ├── cv-corpus-10.0-delta-2022-07-04.json
│   │   ├── cv-corpus-11.0-2022-09-21.json
│   │   ├── cv-corpus-11.0-delta-2022-09-21.json
│   │   ├── cv-corpus-12.0-2022-12-07.json
│   │   ├── cv-corpus-12.0-delta-2022-12-07.json
│   │   ├── cv-corpus-13.0-2023-03-09.json
│   │   ├── cv-corpus-13.0-delta-2023-03-09.json
│   │   ├── cv-corpus-14.0-2023-06-23.json
│   │   ├── cv-corpus-14.0-delta-2023-06-23.json
│   │   ├── cv-corpus-15.0-2023-09-08.json
│   │   ├── cv-corpus-15.0-delta-2023-09-08.json
│   │   ├── cv-corpus-16.0-2023-12-06.json
│   │   ├── cv-corpus-16.0-delta-2023-12-06.json
│   │   ├── cv-corpus-16.1-2023-12-06.json
│   │   ├── cv-corpus-16.1-delta-2023-12-06.json
│   │   ├── cv-corpus-17.0-2024-03-15.json
│   │   ├── cv-corpus-17.0-delta-2024-03-15.json
│   │   ├── cv-corpus-18.0-2024-06-14.json
│   │   ├── cv-corpus-18.0-delta-2024-06-14.json
│   │   ├── cv-corpus-19.0-2024-09-13.json
│   │   ├── cv-corpus-19.0-delta-2024-09-13.json
│   │   ├── cv-corpus-2.json
│   │   ├── cv-corpus-20.0-2024-12-06.json
│   │   ├── cv-corpus-20.0-delta-2024-12-06.json
│   │   ├── cv-corpus-21.0-2025-03-14.json
│   │   ├── cv-corpus-21.0-delta-2025-03-14.json
│   │   ├── cv-corpus-22.0-2025-06-20.json
│   │   ├── cv-corpus-22.0-delta-2025-06-20.json
│   │   ├── cv-corpus-23.0-2025-09-05.json
│   │   ├── cv-corpus-23.0-delta-2025-09-05.json
│   │   ├── cv-corpus-24.0-2025-12-05.json
│   │   ├── cv-corpus-24.0-delta-2025-12-05.json
│   │   ├── cv-corpus-25.0-2026-03-09.json
│   │   ├── cv-corpus-25.0-delta-2026-03-09.json
│   │   ├── cv-corpus-3.json
│   │   ├── cv-corpus-4-2019-12-10.json
│   │   ├── cv-corpus-5-2020-06-22.json
│   │   ├── cv-corpus-5-singleword.json
│   │   ├── cv-corpus-5.1-2020-06-22.json
│   │   ├── cv-corpus-5.1-singleword.json
│   │   ├── cv-corpus-6.0-2020-12-11.json
│   │   ├── cv-corpus-6.0-singleword.json
│   │   ├── cv-corpus-6.1-2020-12-11.json
│   │   ├── cv-corpus-6.1-singleword.json
│   │   ├── cv-corpus-7.0-2021-07-21.json
│   │   ├── cv-corpus-7.0-singleword.json
│   │   ├── cv-corpus-8.0-2022-01-19.json
│   │   └── cv-corpus-9.0-2022-04-27.json
│   └── spontaneous-speech/
│       ├── .gitkeep
│       ├── CHANGELOG.md
│       ├── README.md
│       ├── sps-corpus-1.0-2025-09-05.json
│       ├── sps-corpus-2.0-2025-12-05.json
│       ├── sps-corpus-2.0-delta-2025-12-05.json
│       ├── sps-corpus-3.0-2026-03-09.json
│       └── sps-corpus-3.0-delta-2026-03-09.json
└── helpers/
    ├── .eslintrc.json
    ├── README.md
    ├── common.js
    ├── compareReleases.js
    ├── createDeltaStatistics.js
    ├── createStats.js
    ├── jsconfig.json
    └── recalculateStats.js

================================================
FILE CONTENTS
================================================

================================================
FILE: CHANGELOG.md
================================================
# Changelog

Changelogs are maintained per dataset type:

- [Scripted Speech (SCS)](datasets/scripted-speech/CHANGELOG.md) -- 25 releases (v1 through v25.0)
- [Spontaneous Speech (SPS)](datasets/spontaneous-speech/CHANGELOG.md) -- 3 releases (v1.0 through v3.0)
- [Code Switching (CS)](datasets/code-switching/README.md) -- planned, no releases yet

## Major Changes with March 2026 Releases

The March 2026 release cycle (SCS v25.0 / SPS v3.0) introduces significant infrastructure and tooling changes across the Common Voice dataset ecosystem. Below is a summary; see each dataset type's changelog for details relevant to dataset consumers.

- **Multi-modality dataset statistics.** This repository (`cv-dataset`) now tracks release statistics for all dataset types (SCS, SPS, CS). Helper scripts (`createStats.js`, `compareReleases.js`, `createDeltaStatistics.js`, `recalculateStats.js`) were refactored to handle both SCS and SPS data formats, with per-type handlers and recursive comparison for nested SPS objects.
- **SCS & SPS bundler changes.** The Scripted Speech bundler gains a new `variant` option and the ability to handle licensed datasets. The Spontaneous Speech bundler reached its first production release, matching its SCS counterpart where possible, with four release types (`full`, `delta`, `variants`, `statistics`). It also supports delta releases gracefully, passively skipping locales with zero new activity.
- **Embedded QA pipeline.** The SPS bundler now embeds the quality-control-data-pipeline as a `PostProcessCorpus` step. This step applies disfluency standardization and quality tagging, and generates a per-locale QA summary JSON that is included in each release archive.
- **Datasheets integration.** Both SCS and SPS bundlers now generate per-locale datasheets (Markdown documentation) as part of the release pipeline. Templates and community-contributed content are sourced from `cv-datasheets` (schema v2.0.0), and the bundler fills in auto-generated statistics at build time. Datasheets are included in full release archives and are also shown on the dataset pages of the MDC platform. They are designed to be human-readable summaries of the dataset for each locale.
- **SCS-SPS data bridge.** The SPS bundler can cross-reference the SCS database to provide demographics data. This enables accent, age, and gender data from SCS profiles to appear in SPS releases when available. Note that SPS was not connected to SCS user profiles at the start, so older data may be missing demographics.

================================================
FILE: LICENSE
================================================
Mozilla Public License Version 2.0 ================================== 1. Definitions -------------- 1.1. "Contributor" means each individual or legal entity that creates, contributes to the creation of, or owns Covered Software. 1.2.
"Contributor Version" means the combination of the Contributions of others (if any) used by a Contributor and that particular Contributor's Contribution. 1.3. "Contribution" means Covered Software of a particular Contributor. 1.4. "Covered Software" means Source Code Form to which the initial Contributor has attached the notice in Exhibit A, the Executable Form of such Source Code Form, and Modifications of such Source Code Form, in each case including portions thereof. 1.5. "Incompatible With Secondary Licenses" means (a) that the initial Contributor has attached the notice described in Exhibit B to the Covered Software; or (b) that the Covered Software was made available under the terms of version 1.1 or earlier of the License, but not also under the terms of a Secondary License. 1.6. "Executable Form" means any form of the work other than Source Code Form. 1.7. "Larger Work" means a work that combines Covered Software with other material, in a separate file or files, that is not Covered Software. 1.8. "License" means this document. 1.9. "Licensable" means having the right to grant, to the maximum extent possible, whether at the time of the initial grant or subsequently, any and all of the rights conveyed by this License. 1.10. "Modifications" means any of the following: (a) any file in Source Code Form that results from an addition to, deletion from, or modification of the contents of Covered Software; or (b) any new file in Source Code Form that contains any Covered Software. 1.11. "Patent Claims" of a Contributor means any patent claim(s), including without limitation, method, process, and apparatus claims, in any patent Licensable by such Contributor that would be infringed, but for the grant of the License, by the making, using, selling, offering for sale, having made, import, or transfer of either its Contributions or its Contributor Version. 1.12. 
"Secondary License" means either the GNU General Public License, Version 2.0, the GNU Lesser General Public License, Version 2.1, the GNU Affero General Public License, Version 3.0, or any later versions of those licenses. 1.13. "Source Code Form" means the form of the work preferred for making modifications. 1.14. "You" (or "Your") means an individual or a legal entity exercising rights under this License. For legal entities, "You" includes any entity that controls, is controlled by, or is under common control with You. For purposes of this definition, "control" means (a) the power, direct or indirect, to cause the direction or management of such entity, whether by contract or otherwise, or (b) ownership of more than fifty percent (50%) of the outstanding shares or beneficial ownership of such entity. 2. License Grants and Conditions -------------------------------- 2.1. Grants Each Contributor hereby grants You a world-wide, royalty-free, non-exclusive license: (a) under intellectual property rights (other than patent or trademark) Licensable by such Contributor to use, reproduce, make available, modify, display, perform, distribute, and otherwise exploit its Contributions, either on an unmodified basis, with Modifications, or as part of a Larger Work; and (b) under Patent Claims of such Contributor to make, use, sell, offer for sale, have made, import, and otherwise transfer either its Contributions or its Contributor Version. 2.2. Effective Date The licenses granted in Section 2.1 with respect to any Contribution become effective for each Contribution on the date the Contributor first distributes such Contribution. 2.3. Limitations on Grant Scope The licenses granted in this Section 2 are the only rights granted under this License. No additional rights or licenses will be implied from the distribution or licensing of Covered Software under this License. 
Notwithstanding Section 2.1(b) above, no patent license is granted by a Contributor: (a) for any code that a Contributor has removed from Covered Software; or (b) for infringements caused by: (i) Your and any other third party's modifications of Covered Software, or (ii) the combination of its Contributions with other software (except as part of its Contributor Version); or (c) under Patent Claims infringed by Covered Software in the absence of its Contributions. This License does not grant any rights in the trademarks, service marks, or logos of any Contributor (except as may be necessary to comply with the notice requirements in Section 3.4). 2.4. Subsequent Licenses No Contributor makes additional grants as a result of Your choice to distribute the Covered Software under a subsequent version of this License (see Section 10.2) or under the terms of a Secondary License (if permitted under the terms of Section 3.3). 2.5. Representation Each Contributor represents that the Contributor believes its Contributions are its original creation(s) or it has sufficient rights to grant the rights to its Contributions conveyed by this License. 2.6. Fair Use This License is not intended to limit any rights You have under applicable copyright doctrines of fair use, fair dealing, or other equivalents. 2.7. Conditions Sections 3.1, 3.2, 3.3, and 3.4 are conditions of the licenses granted in Section 2.1. 3. Responsibilities ------------------- 3.1. Distribution of Source Form All distribution of Covered Software in Source Code Form, including any Modifications that You create or to which You contribute, must be under the terms of this License. You must inform recipients that the Source Code Form of the Covered Software is governed by the terms of this License, and how they can obtain a copy of this License. You may not attempt to alter or restrict the recipients' rights in the Source Code Form. 3.2. 
Distribution of Executable Form If You distribute Covered Software in Executable Form then: (a) such Covered Software must also be made available in Source Code Form, as described in Section 3.1, and You must inform recipients of the Executable Form how they can obtain a copy of such Source Code Form by reasonable means in a timely manner, at a charge no more than the cost of distribution to the recipient; and (b) You may distribute such Executable Form under the terms of this License, or sublicense it under different terms, provided that the license for the Executable Form does not attempt to limit or alter the recipients' rights in the Source Code Form under this License. 3.3. Distribution of a Larger Work You may create and distribute a Larger Work under terms of Your choice, provided that You also comply with the requirements of this License for the Covered Software. If the Larger Work is a combination of Covered Software with a work governed by one or more Secondary Licenses, and the Covered Software is not Incompatible With Secondary Licenses, this License permits You to additionally distribute such Covered Software under the terms of such Secondary License(s), so that the recipient of the Larger Work may, at their option, further distribute the Covered Software under the terms of either this License or such Secondary License(s). 3.4. Notices You may not remove or alter the substance of any license notices (including copyright notices, patent notices, disclaimers of warranty, or limitations of liability) contained within the Source Code Form of the Covered Software, except that You may alter any license notices to the extent required to remedy known factual inaccuracies. 3.5. Application of Additional Terms You may choose to offer, and to charge a fee for, warranty, support, indemnity or liability obligations to one or more recipients of Covered Software. However, You may do so only on Your own behalf, and not on behalf of any Contributor. 
You must make it absolutely clear that any such warranty, support, indemnity, or liability obligation is offered by You alone, and You hereby agree to indemnify every Contributor for any liability incurred by such Contributor as a result of warranty, support, indemnity or liability terms You offer. You may include additional disclaimers of warranty and limitations of liability specific to any jurisdiction. 4. Inability to Comply Due to Statute or Regulation --------------------------------------------------- If it is impossible for You to comply with any of the terms of this License with respect to some or all of the Covered Software due to statute, judicial order, or regulation then You must: (a) comply with the terms of this License to the maximum extent possible; and (b) describe the limitations and the code they affect. Such description must be placed in a text file included with all distributions of the Covered Software under this License. Except to the extent prohibited by statute or regulation, such description must be sufficiently detailed for a recipient of ordinary skill to be able to understand it. 5. Termination -------------- 5.1. The rights granted under this License will terminate automatically if You fail to comply with any of its terms. However, if You become compliant, then the rights granted under this License from a particular Contributor are reinstated (a) provisionally, unless and until such Contributor explicitly and finally terminates Your grants, and (b) on an ongoing basis, if such Contributor fails to notify You of the non-compliance by some reasonable means prior to 60 days after You have come back into compliance. 
Moreover, Your grants from a particular Contributor are reinstated on an ongoing basis if such Contributor notifies You of the non-compliance by some reasonable means, this is the first time You have received notice of non-compliance with this License from such Contributor, and You become compliant prior to 30 days after Your receipt of the notice. 5.2. If You initiate litigation against any entity by asserting a patent infringement claim (excluding declaratory judgment actions, counter-claims, and cross-claims) alleging that a Contributor Version directly or indirectly infringes any patent, then the rights granted to You by any and all Contributors for the Covered Software under Section 2.1 of this License shall terminate. 5.3. In the event of termination under Sections 5.1 or 5.2 above, all end user license agreements (excluding distributors and resellers) which have been validly granted by You or Your distributors under this License prior to termination shall survive termination. ************************************************************************ * * * 6. Disclaimer of Warranty * * ------------------------- * * * * Covered Software is provided under this License on an "as is" * * basis, without warranty of any kind, either expressed, implied, or * * statutory, including, without limitation, warranties that the * * Covered Software is free of defects, merchantable, fit for a * * particular purpose or non-infringing. The entire risk as to the * * quality and performance of the Covered Software is with You. * * Should any Covered Software prove defective in any respect, You * * (not any Contributor) assume the cost of any necessary servicing, * * repair, or correction. This disclaimer of warranty constitutes an * * essential part of this License. No use of any Covered Software is * * authorized under this License except under this disclaimer. 
* * * ************************************************************************ ************************************************************************ * * * 7. Limitation of Liability * * -------------------------- * * * * Under no circumstances and under no legal theory, whether tort * * (including negligence), contract, or otherwise, shall any * * Contributor, or anyone who distributes Covered Software as * * permitted above, be liable to You for any direct, indirect, * * special, incidental, or consequential damages of any character * * including, without limitation, damages for lost profits, loss of * * goodwill, work stoppage, computer failure or malfunction, or any * * and all other commercial damages or losses, even if such party * * shall have been informed of the possibility of such damages. This * * limitation of liability shall not apply to liability for death or * * personal injury resulting from such party's negligence to the * * extent applicable law prohibits such limitation. Some * * jurisdictions do not allow the exclusion or limitation of * * incidental or consequential damages, so this exclusion and * * limitation may not apply to You. * * * ************************************************************************ 8. Litigation ------------- Any litigation relating to this License may be brought only in the courts of a jurisdiction where the defendant maintains its principal place of business and such litigation shall be governed by laws of that jurisdiction, without reference to its conflict-of-law provisions. Nothing in this Section shall prevent a party's ability to bring cross-claims or counter-claims. 9. Miscellaneous ---------------- This License represents the complete agreement concerning the subject matter hereof. If any provision of this License is held to be unenforceable, such provision shall be reformed only to the extent necessary to make it enforceable. 
Any law or regulation which provides that the language of a contract shall be construed against the drafter shall not be used to construe this License against a Contributor. 10. Versions of the License --------------------------- 10.1. New Versions Mozilla Foundation is the license steward. Except as provided in Section 10.3, no one other than the license steward has the right to modify or publish new versions of this License. Each version will be given a distinguishing version number. 10.2. Effect of New Versions You may distribute the Covered Software under the terms of the version of the License under which You originally received the Covered Software, or under the terms of any subsequent version published by the license steward. 10.3. Modified Versions If you create software not governed by this License, and you want to create a new license for such software, you may create and use a modified version of this License if you rename the license and remove any references to the name of the license steward (except to note that such modified license differs from this License). 10.4. Distributing Source Code Form that is Incompatible With Secondary Licenses If You choose to distribute Source Code Form that is Incompatible With Secondary Licenses under the terms of this version of the License, the notice described in Exhibit B of this License must be attached. Exhibit A - Source Code Form License Notice ------------------------------------------- This Source Code Form is subject to the terms of the Mozilla Public License, v. 2.0. If a copy of the MPL was not distributed with this file, You can obtain one at http://mozilla.org/MPL/2.0/. If it is not possible or desirable to put the notice in a particular file, then You may include the notice in a location (such as a LICENSE file in a relevant directory) where a recipient would be likely to look for such a notice. You may add additional accurate notices of copyright ownership. 
Exhibit B - "Incompatible With Secondary Licenses" Notice --------------------------------------------------------- This Source Code Form is "Incompatible With Secondary Licenses", as defined by the Mozilla Public License, v. 2.0.

================================================
FILE: README.md
================================================
# Common Voice Datasets

This repo contains release details and metadata for the [Common Voice](https://commonvoice.mozilla.org) datasets.

Please visit the [Mozilla Data Collective Common Voice section](https://mozilladatacollective.com/organization/cmfh0j9o10006ns07jq45h7xk) to download the latest datasets.

## Dataset Types

Common Voice collects voice data through multiple modalities. Each dataset type has its own release information, data structure, and documentation.

| Type                                               | Alias | Status | Releases | Latest (2026-03) | Languages |
| -------------------------------------------------- | ----- | ------ | -------: | :--------------: | --------: |
| [Scripted Speech](datasets/scripted-speech/)       | SCS   | Active |       25 |      v25.0       |       290 |
| [Spontaneous Speech](datasets/spontaneous-speech/) | SPS   | Active |        3 |       v3.0       |        72 |
| [Code Switching](datasets/code-switching/)         | CS    | Alpha  |       -- |        --        |        -- |

See each dataset type's documentation for detailed information about data structures, fields in metadata files (`.tsv`), archive contents, and release changelogs.

Note that the "date" in releases represents the cut-off date for data collection and validation, not the actual release date of the dataset.
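
The release statistics files under `datasets/` carry this cut-off date in their names. As an illustration only, the sketch below splits such a name into dataset type, version, kind, and cut-off date. The `parseReleaseFile` helper and its regular expression are hypothetical (they are not part of `helpers/`), the pattern is inferred from the file names listed in this repository rather than from a documented naming rule, and the `full` label for non-delta files is an assumption.

```javascript
// Hypothetical helper, not part of helpers/: the pattern below is inferred
// from file names such as "cv-corpus-25.0-delta-2026-03-09.json".
const RELEASE_FILE =
  /^(cv|sps)-corpus-(\d+(?:\.\d+)?)(?:-(delta|singleword))?(?:-(\d{4}-\d{2}-\d{2}))?\.json$/;

function parseReleaseFile(fileName) {
  const match = RELEASE_FILE.exec(fileName);
  if (!match) return null; // not a release statistics file

  const [, prefix, version, kind, cutoffDate] = match;
  return {
    // "cv-" files belong to Scripted Speech, "sps-" files to Spontaneous Speech
    type: prefix === 'cv' ? 'SCS' : 'SPS',
    version,
    // Assumption: files without a "delta"/"singleword" marker are labelled "full"
    kind: kind || 'full',
    // The date in the name is the data cut-off date, not the release date
    cutoffDate: cutoffDate || null,
  };
}

console.log(parseReleaseFile('cv-corpus-25.0-delta-2026-03-09.json'));
console.log(parseReleaseFile('sps-corpus-3.0-2026-03-09.json'));
```

Early releases such as `cv-corpus-1.json` have no date in the name, so `cutoffDate` comes back as `null` for them, and names that do not fit the pattern (for example `README.md`) yield `null`.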
## Data Pipeline

```mermaid
flowchart LR
    subgraph SCS["Scripted Speech (SCS)"]
        SCS_DB[("DB")]
        SCS_GCS["GCS"]
    end
    subgraph SCS_BUN["SCS Bundler"]
        CC["CorporaCreator"]
    end
    subgraph SCS_BUN2["SCS Bundler"]
        UP["Uploader"]
    end
    DSH["cv-datasheets"]
    subgraph SPS["Spontaneous Speech (SPS)"]
        SPS_DB[("DB")]
        SPS_GCS["GCS"]
    end
    subgraph SPS_BUN["SPS Bundler"]
        QA["QA Pipeline"]
    end
    BUN_GCS["GCS datasets datasheets stats"]
    MDC[["MDC downloads"]]
    CDS[["cv-dataset ◀"]]
    SCS_DB -->|data| SCS_BUN
    SCS_GCS -->|clips| SCS_BUN
    DSH -->|JSON| SCS_BUN
    DSH -->|JSON| SPS_BUN
    SPS_DB -->|data| SPS_BUN
    SPS_GCS -->|audio| SPS_BUN
    SCS_BUN --> BUN_GCS
    SPS_BUN --> BUN_GCS
    BUN_GCS -->|datasets| UP
    BUN_GCS -->|datasheets| UP
    UP -->|API| MDC
    BUN_GCS -->|stats| CDS
    style CDS fill:#1a73e8,color:#ffffff,stroke:#1558b0,stroke-width:2px
```

## Overview

### Scripted Speech (SCS)

```mermaid
---
config:
  xyChart:
    width: 900
    height: 400
---
xychart-beta
    title "Scripted Speech: Total & Validated Hours"
    x-axis ["1","2","3","4","5.1","6.1","7","8","9","10","11","12","13","14","15","16.1","17","18","19","20","21","22","23","24","25"]
    y-axis "Hours" 0 --> 42000
    bar [1368,2366,2454,4257,7226,9283,13905,18243,20217,20817,24231,26119,27141,28117,28750,30328,31175,32121,32584,33154,33534,33815,35921,38932,41792]
    bar [1096,1872,1979,3401,5671,7335,11192,14122,14973,15234,16429,17127,17689,18651,19159,19915,20408,20943,21593,22106,22344,22640,24600,25886,28377]
```

For details see: [Scripted Speech documentation](datasets/scripted-speech/)

### Spontaneous Speech (SPS)

```mermaid
---
config:
  xyChart:
    width: 600
    height: 350
---
xychart-beta
    title "Spontaneous Speech: Total vs Validated Hours"
    x-axis ["v1.0","v2.0","v3.0"]
    y-axis "Hours" 0 --> 600
    bar [428,454,508]
    bar [263,268,269]
```

For details see: [Spontaneous Speech documentation](datasets/spontaneous-speech/)

## Dataset Access

You can download the Common Voice datasets from the [Mozilla Data Collective](https://mozilladatacollective.com/) (MDC) platform:

- [Directly from the browser](https://mozilladatacollective.com/organization/cmfh0j9o10006ns07jq45h7xk)
- [Using the MDC API](https://mozilladatacollective.com/api-reference)
- [Using the MDC Python SDK](https://github.com/Mozilla-Data-Collective/datacollective-python) to load the datasets directly as a pandas DataFrame in your Python codebase

## Generating Dataset Statistics

Helper scripts are available in the [helpers/](helpers/) directory for processing bundler output into dataset statistics. See [helpers/README.md](helpers/README.md) for detailed usage and examples.

All helper scripts support multiple dataset types via the first argument:

```bash
node helpers/createStats.js
node helpers/compareReleases.js
node helpers/createDeltaStatistics.js
node helpers/recalculateStats.js
```

## Citation

If you use the data in a published academic work, we would appreciate it if you cite the following article:

- Ardila, R., Branson, M., Davis, K., Henretty, M., Kohler, M., Meyer, J., Morais, R., Saunders, L., Tyers, F. M. and Weber, G. (2020) "[Common Voice: A Massively-Multilingual Speech Corpus](https://arxiv.org/abs/1912.06670)". _Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020)._ pp. 4211--4215

```bibtex
@inproceedings{commonvoice:2020,
  author = {Ardila, R. and Branson, M. and Davis, K. and Henretty, M. and Kohler, M. and Meyer, J. and Morais, R. and Saunders, L. and Tyers, F. M. and Weber, G.},
  title = {Common Voice: A Massively-Multilingual Speech Corpus},
  booktitle = {Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020)},
  pages = {4211--4215},
  year = 2020
}
```

## Feedback

Please only use this repo to provide feedback on **technical issues** with the dataset, such as file corruptions, problems with the partitions, and so on. For more expansive discussions, please join us in [Discourse](https://discourse.mozilla.org/c/voice) or [Matrix](https://chat.mozilla.org/#/room/#common-voice:mozilla.org).

================================================
FILE: datasets/code-switching/README.md
================================================
# Code Switching (CS)

Code Switching is an upcoming Common Voice modality where contributors produce speech that naturally switches between two or more languages within a single utterance. This is a subproject within the [Spontaneous Speech](https://github.com/common-voice/spontaneous-speech) repository, currently gated to alpha testers.

**Status**: Alpha test phase -- no releases yet.

This directory will contain release statistics once the first Code Switching dataset is published.

================================================
FILE: datasets/scripted-speech/CHANGELOG.md
================================================
# Scripted Speech (SCS) Changelog

## Current Release

### [Corpus 25.0](cv-corpus-25.0-2026-03-09.json)

Regularly scheduled dataset release Q1 2026.

- **Date released**: 18 March 2026
- **Clip cut-off date**: 09 March 2026
- **Total hours**: 41,792
- **Total validated hours**: 28,377
- **Number of languages**: 290

**New languages since last major release**: Croatian (`hr`)

#### Dataset Changes in Corpus 25.0

- added `README.md` datasheet per locale -- a Markdown document with language description, dataset statistics, demographic breakdowns, and community context (generated from [cv-datasheets](https://github.com/common-voice/cv-datasheets), schema v2.0.0)
- added `variant` column to `validated_sentences.tsv` (after `sentence`), containing the language variant token for the sentence (empty if none)
- added `variant` column to `unvalidated_sentences.tsv` (after `sentence`)
- added `up_votes`, `down_votes`, and `status` columns to `unvalidated_sentences.tsv`
  - `status` is `pending` (not-yet-validated) or `rejected` (when `down_votes` >= 2 and `down_votes` > `up_votes`)
- the `unvalidated_sentences.tsv` description is corrected: it contains sentences that have not reached the validated threshold, not only sentences without any votes
- added `variant` and `locale` columns to [Corpora Creator](https://github.com/common-voice/CorporaCreator) clip files

## Past Releases

### [Corpus 24.0](cv-corpus-24.0-2025-12-05.json)

Regularly scheduled dataset release Q4 2025.

- **Date released**: 17 December 2025
- **Clip cut-off date**: 05 December 2025
- **Total hours**: 38,932
- **Total validated hours**: 25,886
- **Number of languages**: 289

**New languages since last major release**: Sorbian, Lower (`dsb`), Alsatian (`gsw`), Laz (`lzz`)

### [Corpus 23.0](cv-corpus-23.0-2025-09-05.json)

Regularly scheduled dataset release Q3 2025.

- **Date released**: 17 September 2025
- **Clip cut-off date**: 05 September 2025
- **Total hours**: 35,921
- **Total validated hours**: 24,600
- **Number of languages**: 286

**New languages since last major release**: Adamawa Fulfulde (`fub`), Adja (`ajg`), Adyghe (`ady`), Aragonese (`an`), Asheninka Perene, Asheninka South Ucayali, Atayal (`tay`), Baatonum (`bba`), Bafia (`ksf`), Bafut (`bfd`), Bakoko, Balti (`bft`), Bamun (`bax`), Bamvele (`beb`), Bankon (`abb`), Baoule (`bci`), Batanga (`bnm`), Bateri (`btv`), Borgu Fulfulde (`fue`), Brahui, Brushaski (`bsk`), Bulu (`bum`), Bunun (`bnn`), Cameroon Pidgin (`wes`), Central Alaskan Yup'ik (`esu`), Central Puebla Nahuatl (`ncx`), Central Tarahumara, Chokwe, Copainalá Zoque (`zoc`), Cornish (`kw`), Dagbani (`dag`), Dameli (`dml`), Dargwa (`dar`), Dawoodi (`dmk`), Dhatki, Duala (`dua`), Eastern Balochi (`bgp`), Ebrie (`ebr`), Ekoti, Eton (`eto`), Ewondo (`ewo`), Fang (`fan`), Fe'efe'e (`fmp`), Gawarbaiti (`gwt`), Gawri (`gwc`), Ghomala (`bbj`), Goaria, Guidar, Guiziga, Gujari (`gju`), Gurgula (`ggg`), Hazargi, Huarijio (`var`), Huautla Mazatec (`mau`), Ibibio (`ibb`), Indus Kohistani (`mvy`), Iñupiaq (`ipk`), Jaqaru (`jqr`), Kabardian (`kbd`), Kachhi, Kalasha (`kls`), Kalkoti (`xka`), Kateviri (`bsh`), Khetrani (`xhe`), Khowar (`khw`), Kichwa (`qvi`), Kihemba, Kirombo, Kohistani Shina (`plk`), Kom (`bkm`),
Kotokoli, Kunabembe, Kwasio, Lassi (`lss`), Loarki, Loja Highland Kichwa, Losso, Mada (`mxu`), Malay (`ms`), Manx (`gv`), Massa, Matses, Mbo (`mbo`), Mbum, Medumba (`byv`), Mengambo, Mina, Mingrelian (`xmf`), Mokpwe (`bri`), Moussey, Mpiemo, Mundang, Mungaka, Musgum, Ngiembon (`nnh`), Ngomba, Ngombale, Nigerian Pidgin English (`pcm`), Northern Hindko (`hno`), Northwest Gbaya (`gya`), Nuasue, Nyungwe, Nüpode Huitoto, Oadki, Orizaba Nahuatl, Ormuri (`oru`), Ouldémé, Pahari-Pothwari, Paiwan (`pwn`), Pakistani Marwari, Palula (`phl`), Parkari Koli, Puno Quechua (`qxp`), Quechua Ambo-Pasco (`qva`), Quechua Arequipa-La Unión (`qxu`), Quechua Cajatambo (`qvl`), Quechua Chiquián (`qxa`), Quechua Corongo Ancash (`qwa`), Quechua Jauja Wanka (`qxw`), Quechua Pasco Santa Ana de Tusi (`qxt`), Quechua Santiago del Estero, Quechua Sihuas Ancash (`qws`), Quechua Yanahuanca, Quechua Yauyos (`qux`), Rukai (`dru`), Sakizaya (`szy`), Sansi, Seediq (`trv`), Seri (`sei`), Shina (`scl`), Sindhi Bhili, Siswati (`ss`), Southern Pastaza Quechua (`qup`), Svan (`sva`), Tepeuxila Cuicatec (`cux`), Teutila Cuicatec (`cut`), Tlingit, Torwali (`trw`), Tshiluba, Tuki, Tunen (`tvu`), Tupuri (`tui`), Tush (`bbl`), Ushojo (`ush`), Wadiyara Koli, Wakhi (`wbl`), Western Highland Purepecha (`pua`), Yadgha, Yaqui (`yaq`)

### [Corpus 22.0](cv-corpus-22.0-2025-06-20.json)

Regularly scheduled dataset release Q2 2025.

- **Date released**: 25 June 2025
- **Clip cut-off date**: 20 June 2025
- **Total hours**: 33,815
- **Total validated hours**: 22,640
- **Number of languages**: 137

**New languages since last major release**: Aromanian (`rup`), Tajik (`tg`), Tshivenda (`ve`)

### [Corpus 21.0](cv-corpus-21.0-2025-03-14.json)

Regularly scheduled dataset release Q1 2025.
- **Date released**: 19 March 2025
- **Clip cut-off date**: 14 March 2025
- **Total hours**: 33,534
- **Total validated hours**: 22,344
- **Number of languages**: 134

**New languages since last major release**: Norwegian Bokmål (`nb-NO`)

### [Corpus 20.0](cv-corpus-20.0-2024-12-06.json)

Regularly scheduled dataset release Q4 2024.

- **Date released**: 11 December 2024
- **Clip cut-off date**: 06 December 2024
- **Total hours**: 33,154
- **Total validated hours**: 22,106
- **Number of languages**: 133

**New languages since last major release**: IsiNdebele (South) (`nr`), Southern Sotho (`st`)

### [Corpus 19.0](cv-corpus-19.0-2024-09-13.json)

Regularly scheduled dataset release Q3 2024.

- **Date released**: 18 September 2024
- **Clip cut-off date**: 13 September 2024
- **Total hours**: 32,584
- **Total validated hours**: 21,593
- **Number of languages**: 131

**New languages since last major release**: Sindhi (`sd`), Xitsonga (`ts`)

### [Corpus 18.0](cv-corpus-18.0-2024-06-14.json)

#### Dataset Changes in Corpus 18.0

- the `sentence_domain` column now contains up to three domains separated by commas, e.g. `general,finance,news_current_affairs`
- the domains `agriculture`, `automotive` and `food_service_retail` have been renamed to `agriculture_food`, `automotive_transport` and `service_retail` respectively

### [Corpus 17.0](cv-corpus-17.0-2024-03-15.json)

#### Dataset Changes in Corpus 17.0

- added `unvalidated_sentences.tsv` and `validated_sentences.tsv`
  - `unvalidated_sentences.tsv` contains sentences that have not reached the validated threshold (including those with votes but insufficient up votes); its columns are `sentence_id`, `sentence`, `sentence_domain` and `source`
  - `validated_sentences.tsv` contains sentences that have two or more up votes; it has two additional columns, `is_used` and `clips_count`
    - `is_used`: indicates whether or not the sentence is used on the speak page
    - `clips_count`: the number of clips that are associated with the sentence
- added `sentence_id` and `sentence_domain` to the [Corpora Creator](https://github.com/common-voice/CorporaCreator) files
- the following [sentence domains](https://github.com/common-voice/common-voice/blob/f820e0fa3ec00fc6d49dae7e31bcebf9eb24878b/common/taxonomies.ts#L35) are supported

### [Corpus 16.1](cv-corpus-16.1-2023-12-06.json)

#### Dataset Changes in Corpus 16.1

- changed `times.txt` to `clip_durations.tsv` for consistency
- `clip_durations.tsv` contains two columns: `clip` and `duration[ms]`

### [Corpus 14.0](cv-corpus-14.0-2023-06-23.json)

#### Dataset Changes in Corpus 14.0

- added `times.txt` containing mp3 filename and duration in ms

### [Corpus 13.0](cv-corpus-13.0-2023-03-09.json)

#### Dataset Changes in Corpus 13.0

- added `variant` column to [Corpora Creator](https://github.com/common-voice/CorporaCreator) files

### [Corpus 10.0](cv-corpus-10.0-2022-07-04.json)

#### Dataset Changes in Corpus 10.0

- introduced delta segments
- delta segment tar file naming is `cv-corpus-{version}-delta-{YYYY-MM-DD}-{locale}.tar.gz`
- delta segments contain the same files except for the training splits, i.e. `dev.tsv`, `test.tsv`, `train.tsv`

### [Corpus 9.0](cv-corpus-9.0-2022-04-27.json)

Regularly scheduled dataset release Q1 2022.

- **Date released**: 27 April 2022
- **Clip cut-off date**: 07 April 2022
- **Total hours**: 20,217
- **Total validated hours**: 14,973
- **Number of languages**: 93

**New languages since last major release**: Tigre (`tig`), Taiwanese (Minnan) (`nan-tw`), Meadow Mari (`mhr`), Bengali (`bn`), Toki Pona (`tok`), Cantonese (`yue`)

### [Corpus 8.0](cv-corpus-8.0-2022-01-19.json)

Regularly scheduled dataset release.

- **Date released**: 26 January 2022
- **Clip cut-off date**: 19 January 2022
- **Total hours**: 18,243
- **Total validated hours**: 14,122
- **Number of languages**: 87

**New languages since last major release**: Igbo (`ig`), Marathi (`mr`), Danish (`da`), Norwegian Nynorsk (`nn-NO`), Central Kurdish (`ckb`), Malayalam (`ml`), Swahili (`sw`), Erzya (`myv`), Moksha (`mdf`), Macedonian (`mk`), Santali (Ol Chiki) (`sat`)

Note: minor variations in the validated hours of dot releases reflect the fact that labeling/validation happens on a different schedule than recording. In the timespan between dot releases, the community will usually have performed additional validations, even if the clip cut-off date remains the same.

### [Corpus 7.0](cv-corpus-7.0-2021-07-21.json)

Regularly scheduled dataset release for H1 of 2021.

- **Date released**: 28 July 2021
- **Clip cut-off date**: 21 July 2021
- **Total hours**: 13,905
- **Total validated hours**: 11,192
- **Number of languages**: 76

**New languages since last major release**: Basaa (`bas`), Slovak (`sk`), Kurmanji Kurdish (`kmr`), Bulgarian (`bg`), Kazakh (`kk`), Bashkir (`ba`), Galician (`gl`), Uyghur (`ug`), Armenian (`hy-AM`), Belarusian (`be`), Urdu (`ur`), Guarani (`gn`), Serbian (`sr`), Uzbek (`uz`), Azerbaijani (`az`), Hausa (`ha`)

#### Dataset Changes in Corpus 7.0

- changed tar file naming to `cv-corpus-{version}-{YYYY-MM-DD}-{locale}.tar.gz`

### [Singleword Segment 7.0](cv-corpus-7.0-singleword.json)

Update to Singleword Segment 6.1.

- **Date released**: 28 July 2021
- **Clip cut-off date**: 21 July 2021
- **Total hours**: 141
- **Total validated hours**: 82
- **Number of languages**: 34

### [Corpus 6.1](cv-corpus-6.1-2020-12-11.json)

Correction to Corpus 6.0, which had a bug that did not properly attribute demographics information.

- **Date released**: 22 Dec 2020
- **Clip cut-off date**: 11 Dec 2020
- **Total hours**: 9,283
- **Total validated hours**: 7,335
- **Number of languages**: 60

### [Singleword Segment 6.1](cv-corpus-6.1-singleword.json)

Correction to Singleword Segment 6.0, which had a bug that did not properly attribute demographics information.

- **Date released**: 22 Dec 2020
- **Clip cut-off date**: 11 Dec 2020
- **Total hours**: 131
- **Total validated hours**: 77
- **Number of languages**: 31

### [Corpus 6.0](cv-corpus-6.0-2020-12-11.json)

Regularly scheduled dataset release for H2 of 2020.

- **Date released**: 22 Dec 2020
- **Clip cut-off date**: 11 Dec 2020
- **Total hours**: 9,261
- **Total validated hours**: 7,327
- **Number of languages**: 60

**New languages since last major release**: Hindi (`hi`), Lithuanian (`lt`), Luganda (`lg`), Thai (`th`), Finnish (`fi`), Hungarian (`hu`)

### [Singleword Segment 6.0](cv-corpus-6.0-singleword.json)

Update to Singleword Segment 5.1.

- **Date released**: 22 Dec 2020
- **Clip cut-off date**: 11 Dec 2020
- **Total hours**: 131
- **Total validated hours**: 77
- **Number of languages**: 31

### [Corpus 5.1](cv-corpus-5.1-2020-06-22.json)

Correction to Corpus 5.0, which unintentionally altered the column order of the test/train/dev sets, and included some redundant metadata entries for clips that didn't actually have valid audio.

- **Date released**: 14 July 2020
- **Clip cut-off date**: 22 June 2020
- **Total hours**: 7,226
- **Total validated hours**: 5,671
- **Number of languages**: 54

### [Singleword Segment 5.1](cv-corpus-5.1-singleword.json)

Correction to Singleword Segment 5.0, which was still optimizing for no repeated sentences during segmentation and thus resulted in disproportionately small test/dev/train sets.

- **Date released**: 16 September 2020
- **Clip cut-off date**: 22 June 2020
- **Total hours**: 120
- **Total validated hours**: 64
- **Number of languages**: 18

### [Corpus 5.0](cv-corpus-5-2020-06-22.json)

Regularly scheduled dataset release for H1 of 2020. This release introduced sha256 checksum values for each dataset.

- **Date released**: 30 June 2020
- **Clip cut-off date**: 22 June 2020
- **Total hours**: 7,226
- **Total validated hours**: 5,591
- **Number of languages**: 54

**New languages since last major release**: Sorbian, Upper (`hsb`), Romanian (`ro`), Frisian (`fy-NL`), Czech (`cs`), Greek (`el`), Romansh Vallader (`rm-vallader`), Polish (`pl`), Assamese (`as`), Ukrainian (`uk`), Maltese (`mt`), Georgian (`ka`), Punjabi (`pa-IN`), Odia (`or`), Vietnamese (`vi`)

#### Dataset Changes in Corpus 5.0

- changed archive folder structure: dataset release archive now contains a locale folder

  ```txt
  cv-corpus-5.1-2020-06-22/
  └── tr/
      ├── clips/
      ├── dev.tsv
      ├── invalidated.tsv
      ├── other.tsv
      ├── reported.tsv
      ├── test.tsv
      ├── train.tsv
      └── validated.tsv
  ```

- added `reported.tsv` containing sentences that have been reported by the community
- added `locale` and `segment` columns to the [Corpora Creator](https://github.com/common-voice/CorporaCreator) files

### [Singleword Segment 5.0](cv-corpus-5-singleword.json)

This contains all of the voice data collected as part of the Common Voice pilot target segment effort collecting single-word utterances for a benchmark experiment.

- **Date released**: 30 June 2020
- **Clip cut-off date**: 22 June 2020
- **Total hours**: 120
- **Total validated hours**: 64
- **Number of languages**: 18

### [Corpus 4](cv-corpus-4-2019-12-10.json)

Regularly scheduled dataset release for H2 of 2019.

- **Date released**: 14 Jan 2020
- **Clip cut-off date**: 10 Dec 2019
- **Total hours**: 4,257
- **Total validated hours**: 3,401
- **Number of languages**: 40

**New languages since last major release**: Abkhaz (`ab`), Arabic (`ar`), Chinese (Hong Kong) (`zh-HK`), Indonesian (`id`), Interlingua (`ia`), Japanese (`ja`), Latvian (`lv`), Portuguese (`pt`), Romansh Sursilvan (`rm-sursilv`), Tamil (`ta`), Votic (`vot`)

#### Dataset Changes in Corpus 4.0

- changed tar file naming from `cv-corpus-{version}_{locale}.tar.tar` to `cv-corpus-{version}-{YYYY-MM-DD}_{locale}.tar.tar`

### [Corpus 3](cv-corpus-3.json)

Minor update to Corpus 2 to correct an issue with file-naming.

- **Date released**: 24 June 2019
- **Clip cut-off date**: 24 June 2019 (est)
- **Total hours**: 2,454
- **Total validated hours**: 1,979
- **Number of languages**: 29

**New languages since last major release**: Persian (`fa`)

### [Corpus 2](cv-corpus-2.json)

Regularly scheduled dataset release for H1 of 2019.

- **Date released**: 11 June 2019
- **Clip cut-off date**: 11 June 2019 (est)
- **Total hours**: 2,366
- **Total validated hours**: 1,872
- **Number of languages**: 28

**New languages since last major release**: Basque (`eu`), Spanish (`es`), Chinese (China) (`zh-CN`), Mongolian (`mn`), Sakha/Yakut (`sah`), Dhivehi (`dv`), Kinyarwanda (`rw`), Swedish (`sv-SE`), Russian (`ru`)

### [Corpus 1](cv-corpus-1.json)

First multilingual release.

- **Date released**: 25 February 2019
- **Clip cut-off date**: 25 February 2019 (est)
- **Total hours**: 1,368
- **Total validated hours**: 1,096
- **Number of languages**: 19

**New languages**: German (`de`), French (`fr`), Welsh (`cy`), Breton (`br`), Chuvash (`cv`), Turkish (`tr`), Tatar (`tt`), Kyrgyz (`ky`), Irish (`ga-IE`), Kabyle (`kab`), Catalan (`ca`), Chinese (Taiwan) (`zh-TW`), Slovenian (`sl`), Italian (`it`), Dutch (`nl`), Hakha Chin (`cnh`), Esperanto (`eo`), Estonian (`et`)

#### Initial Dataset Structure

- the initial dataset release folder structure:

  ```txt
  cv-corpus-1_{locale}/
  ├── clips/
  ├── dev.tsv
  ├── invalidated.tsv
  ├── other.tsv
  ├── test.tsv
  ├── train.tsv
  └── validated.tsv
  ```

- to get more information about the files included in the dataset release, please see [Corpora Creator](https://github.com/common-voice/CorporaCreator)
- columns: `client_id`, `path`, `sentence`, `up_votes`, `down_votes`, `age`, `gender`, `accent`

================================================
FILE: datasets/scripted-speech/README.md
================================================
# Scripted Speech (SCS)

Scripted Speech is the classic Common Voice dataset. Contributors read pre-written sentences aloud, and the community validates the recordings. New datasets are released approximately every quarter.

All voice contributions are released as part of datasets, regardless of validation status. From v25.0 on, clips that fail quality checks (over-length, corrupted, or missing audio files) are excluded during bundling; a per-locale problem clip report is included with each release for transparency.

The clips are currently bundled using the embedded bundler in the public repo [Common Voice - Bundler](https://github.com/common-voice/common-voice/tree/main/bundler).

## Release History

See the full [Changelog](CHANGELOG.md) for detailed release notes and new languages per release.

### Total and Validated Hours

```mermaid
---
config:
  xyChart:
    width: 900
    height: 400
---
xychart-beta
  title "Scripted Speech: Total & Validated Hours"
  x-axis ["1","2","3","4","5.1","6.1","7","8","9","10","11","12","13","14","15","16.1","17","18","19","20","21","22","23","24","25"]
  y-axis "Hours" 0 --> 42000
  bar [1368,2366,2454,4257,7226,9283,13905,18243,20217,20817,24231,26119,27141,28117,28750,30328,31175,32121,32584,33154,33534,33815,35921,38932,41792]
  bar [1096,1872,1979,3401,5671,7335,11192,14122,14973,15234,16429,17127,17689,18651,19159,19915,20408,20943,21593,22106,22344,22640,24600,25886,28377]
```

### Contributors

```mermaid
---
config:
  xyChart:
    width: 900
    height: 400
---
xychart-beta
  title "Scripted Speech: Total Contributors"
  x-axis ["1","2","3","4","5.1","6.1","7","8","9","10","11","12","13","14","15","16.1","17","18","19","20","21","22","23","24","25"]
  y-axis "Users" 0 --> 500000
  bar [42109,56059,57420,95798,138225,151434,191622,207602,252576,263879,271817,281069,288617,298724,302232,319703,330323,335780,338378,345996,350098,356074,361614,371058,375673]
```

_Counts are summed per language; contributors active in multiple languages are counted once per language._

### Language Count

```mermaid
---
config:
  xyChart:
    width: 900
    height: 400
---
xychart-beta
  title "Scripted Speech: Languages per Release"
  x-axis ["1","2","3","4","5.1","6.1","7","8","9","10","11","12","13","14","15","16.1","17","18","19","20","21","22","23","24","25"]
  y-axis "Languages" 0 --> 310
  line [19,28,29,40,54,60,76,87,93,96,100,104,108,112,114,120,124,129,131,133,134,137,286,289,290]
```

### Release Summary

| Release | Date       | Languages | Total Hours | Validated Hours |
| ------- | ---------- | --------: | ----------: | --------------: |
| v1      | 2019-02-25 |        19 |       1,368 |           1,096 |
| v2      | 2019-06-11 |        28 |       2,366 |           1,872 |
| v3      | 2019-06-24 |        29 |       2,454 |           1,979 |
| v4      | 2019-12-10 |        40 |       4,257 |           3,401 |
| v5.1    | 2020-06-22 |        54 |       7,226 |           5,671 |
| v6.1    | 2020-12-11 |        60 |       9,283 |           7,335 |
| v7.0    | 2021-07-21 |        76 |      13,905 |          11,192 |
| v8.0    | 2022-01-19 |        87 |      18,243 |          14,122 |
| v9.0    | 2022-04-27 |        93 |      20,217 |          14,973 |
| v10.0   | 2022-07-04 |        96 |      20,817 |          15,234 |
| v11.0   | 2022-09-21 |       100 |      24,231 |          16,429 |
| v12.0   | 2022-12-07 |       104 |      26,119 |          17,127 |
| v13.0   | 2023-03-09 |       108 |      27,141 |          17,689 |
| v14.0   | 2023-06-23 |       112 |      28,117 |          18,651 |
| v15.0   | 2023-09-08 |       114 |      28,750 |          19,159 |
| v16.1   | 2023-12-06 |       120 |      30,328 |          19,915 |
| v17.0   | 2024-03-15 |       124 |      31,175 |          20,408 |
| v18.0   | 2024-06-14 |       129 |      32,121 |          20,943 |
| v19.0   | 2024-09-13 |       131 |      32,584 |          21,593 |
| v20.0   | 2024-12-06 |       133 |      33,154 |          22,106 |
| v21.0   | 2025-03-14 |       134 |      33,534 |          22,344 |
| v22.0   | 2025-06-20 |       137 |      33,815 |          22,640 |
| v23.0   | 2025-09-05 |       286 |      35,921 |          24,600 |
| v24.0   | 2025-12-05 |       289 |      38,932 |          25,886 |
| v25.0   | 2026-03-09 |       290 |      41,792 |          28,377 |

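The hour figures in this summary can be related to the raw durations stored in the per-release JSON files. The sketch below is only an illustration, not the repository's helper code: `hours_from_ms` is a hypothetical name, and truncating (rather than rounding) is an assumption that happens to agree with the sample values copied from `cv-corpus-1.json` and `cv-corpus-10.0-2022-07-04.json`.

```python
MS_PER_HOUR = 3_600_000


def hours_from_ms(duration_ms: int, decimals: int = 0) -> float:
    """Convert integer milliseconds to hours, truncated to `decimals` places.

    Integer arithmetic is used before the final division so that the
    truncation is not affected by floating-point rounding.
    """
    factor = 10 ** decimals
    return (duration_ms * factor // MS_PER_HOUR) / factor


# Top-level values copied from cv-corpus-1.json.
total_duration_ms = 4_925_868_978      # "totalDuration"
total_valid_duration_secs = 3_946_252  # "totalValidDurationSecs"

total_hrs = hours_from_ms(total_duration_ms)
valid_hrs = hours_from_ms(total_valid_duration_secs * 1000)

# Per-locale value copied from cv-corpus-10.0-2022-07-04.json ("cy").
cy_hrs = hours_from_ms(531_427_401, decimals=2)

print(total_hrs)  # 1368.0, matching "totalHrs": 1368
print(valid_hrs)  # 1096.0, matching "totalValidHrs": 1096
print(cy_hrs)     # 147.61, matching the locale's "totalHrs": 147.61
```

Plain rounding would give 147.62 for the `cy` value, which is why the sketch truncates; other releases may have been computed differently.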
## About the Statistics

Statistics for each release are stored as JSON files in this directory. The JSON structure may have changed slightly from release to release, so if you plan on doing any comparisons you may need to normalize them between versions.

Any demographic split (i.e. gender, age, accent) is applied to **the entire dataset**, not just the validated set. Unless otherwise indicated, durations are measured in milliseconds, and file sizes are measured in bytes.

## Archive Structure

Each downloaded `.tar.gz` file has the following structure, where `{lang}` represents the [BCP 47](https://en.wikipedia.org/wiki/IETF_language_tag) locale code for that language:

```txt
cv-corpus-{version}-{YYYY-MM-DD}-{lang}.tar.gz/
  cv-corpus-{version}-{YYYY-MM-DD}/
  └── {lang}/
      ├── README.md              (datasheet, since Corpus 25.0)
      ├── clips/
      │   └── *.mp3
      ├── dev.tsv
      ├── invalidated.tsv
      ├── other.tsv
      ├── test.tsv
      ├── train.tsv
      ├── validated.tsv
      ├── reported.tsv
      ├── clip_durations.tsv
      ├── validated_sentences.tsv
      └── unvalidated_sentences.tsv
```

## TSV Fields

Each row of a clip TSV file (`validated.tsv`, `invalidated.tsv`, `other.tsv`, `train.tsv`, `dev.tsv`, `test.tsv`) represents a single audio clip:

- `client_id` -- hashed UUID of a given user
- `path` -- relative path of the audio file
- `sentence` -- the text the contributor was asked to read aloud (the intended transcription of the clip)
- `sentence_id` -- unique identifier for the sentence (since Corpus 17.0)
- `sentence_domain` -- domain classification(s) of the sentence (since Corpus 17.0)
- `up_votes` -- number of people who said audio matches the sentence
- `down_votes` -- number of people who said audio does not match the sentence
- `age` -- age bracket of the speaker, if provided\*
- `gender` -- gender of the speaker, if provided\*
- `accents` -- accent(s) of the speaker, if provided\* (previously named `accent` but renamed to reflect multiple selections, since Corpus 17.0)
- `variant` -- language variant, if provided (since Corpus 13.0)
- `locale` -- locale code of the language (since Corpus 5.0)
- `segment` -- custom dataset segment, if applicable (since Corpus 5.0)

The `train.tsv`, `dev.tsv`, and `test.tsv` splits are produced by [CorporaCreator](https://github.com/common-voice/CorporaCreator) and contain the same columns as `validated.tsv`.

\*For a full list of age, gender, and accent options, see the [demographics spec](https://github.com/common-voice/common-voice/blob/main/web/src/stores/demographics.ts). These are only reported if the speaker opted in.

### Additional TSV Files

**`clip_durations.tsv`** (since Corpus 16.1) -- clip filename and duration:

- `clip` -- clip filename
- `duration[ms]` -- duration of the clip in milliseconds

**`validated_sentences.tsv`** (since Corpus 17.0) -- sentences that have reached the validated threshold (two or more up votes):

- `sentence_id` -- unique identifier for the sentence
- `sentence` -- the sentence itself
- `variant` -- language variant token for the sentence, if provided (since Corpus 25.0)
- `sentence_domain` -- domain classification(s) of the sentence, if provided
- `source` -- origin of the sentence (user provided or from old files under server/data)
- `is_used` -- whether the sentence is still eligible for recording (sentences may be retired if they are incorrect, outdated, too similar to other sentences, or for other reasons via database migrations)
- `clips_count` -- number of clips recorded for this sentence

**`unvalidated_sentences.tsv`** (since Corpus 17.0) -- sentences that have not reached the validated threshold or have been rejected:

- `sentence_id` -- unique identifier for the sentence
- `sentence` -- the sentence itself
- `variant` -- language variant token for the sentence, if provided (since Corpus 25.0)
- `sentence_domain` -- domain classification(s) of the sentence, if provided
- `source` -- origin of the sentence (user provided or from old files under server/data)
- `up_votes` -- number of approving votes (since Corpus 25.0)
- `down_votes` -- number of rejecting votes (since Corpus 25.0)
- `status` -- `pending` (not yet decided) or `rejected` (2+ down votes exceeding up votes) (since Corpus 25.0)

### Validation Categories

- `validated` -- clips with two or more validations where `up_votes` > `down_votes`
- `invalidated` -- clips with two or more validations where `down_votes` > `up_votes`, or three or more where `down_votes` = `up_votes`
- `other` -- clips without sufficient validations to determine their status

**`reported.tsv`** (since Corpus 5.0) -- sentences flagged by contributors:

- `sentence` -- text of the reported sentence
- `sentence_id` -- unique identifier for the sentence
- `locale` -- locale code
- `reason` -- report reason: `offensive-language`, `grammar-or-spelling`, `different-language`, `difficult-pronounce`

Note: reporting a sentence does not remove it from circulation. Reported sentences remain available for recording and validation. The `reported.tsv` file is provided for post-processing by dataset consumers.

## Use for Machine Learning

We use the [Corpora Creator](https://github.com/common-voice/CorporaCreator) tool to parse the metadata and generate [train, dev, and test](https://en.wikipedia.org/wiki/Training,_validation,_and_test_sets) sets. The Corpora Creator eliminates duplication in clips and maximizes speaker diversity.

The train/dev/test sets are generated non-deterministically, meaning they will vary from release to release even for minor updates. This is to avoid reproducing and perpetuating any demographic skews in each subsequent set.

Note that the clips in these sets will most likely not add up to the total number of validated clips, because duplicate recordings of a sentence are removed. See the Corpora Creator repository for how to include multiple recordings per sentence (the `-s` flag) if you want to get as close as possible to the total validated clips.
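
Because reported sentences stay in circulation, consumers who want to exclude them from training data have to filter them out themselves. The following is a minimal sketch of such post-processing, assuming a release from Corpus 17.0 or later (so that clip files carry `sentence_id`). `read_tsv` and `drop_reported` are hypothetical helper names, and the rows are made-up sample data rather than real dataset content.

```python
import csv
import io
from typing import Iterable, List, Optional, Set


def read_tsv(lines: Iterable[str]) -> csv.DictReader:
    """Parse tab-separated lines; quote characters are kept as literal text."""
    return csv.DictReader(lines, delimiter="\t", quoting=csv.QUOTE_NONE)


def drop_reported(
    clip_lines: Iterable[str],
    reported_lines: Iterable[str],
    reasons: Optional[Set[str]] = None,
) -> List[dict]:
    """Return clip rows whose sentence_id does not appear in reported.tsv.

    If `reasons` is given, only reports with one of those reasons count.
    """
    reported_ids = {
        row["sentence_id"]
        for row in read_tsv(reported_lines)
        if reasons is None or row["reason"] in reasons
    }
    return [
        row for row in read_tsv(clip_lines)
        if row["sentence_id"] not in reported_ids
    ]


# Made-up sample rows using a subset of the documented columns.
validated = io.StringIO(
    "client_id\tpath\tsentence\tsentence_id\tup_votes\tdown_votes\n"
    "a1\tclip_1.mp3\tHello there.\ts1\t2\t0\n"
    "b2\tclip_2.mp3\tHelo their.\ts2\t3\t1\n"
)
reported = io.StringIO(
    "sentence\tsentence_id\tlocale\treason\n"
    "Helo their.\ts2\ten\tgrammar-or-spelling\n"
)

kept = drop_reported(validated, reported)
print([row["path"] for row in kept])  # ['clip_1.mp3']
```

With real files, the same function can be called on open file handles, e.g. `open("validated.tsv", encoding="utf-8", newline="")`. Whether to drop every reported sentence or only certain reasons is a judgment call for each project.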
================================================ FILE: datasets/scripted-speech/cv-corpus-1.json ================================================ { "date": "2019-02-25", "locales": { "en": { "clips": 677020, "splits": { "accent": { "": 0.55, "canada": 0.02, "england": 0.08, "us": 0.21, "indian": 0.04, "australia": 0.02, "malaysia": 0, "newzealand": 0.01, "african": 0.01, "ireland": 0.01, "philippines": 0, "singapore": 0, "scotland": 0.01, "hongkong": 0, "bermuda": 0, "southatlandtic": 0, "wales": 0, "other": 0.03 }, "age": { "": 0.48, "twenties": 0.18, "sixties": 0.02, "thirties": 0.14, "teens": 0.04, "seventies": 0.01, "fourties": 0.08, "fifties": 0.05, "eighties": 0, "nineties": 0 }, "gender": { "": 0.48, "male": 0.41, "female": 0.1, "other": 0.01 } }, "users": 33541, "duration": 2893916688, "buckets": { "dev": 7016, "invalidated": 61200, "other": 125337, "test": 7016, "train": 12135, "validated": 490483 }, "size": 22487893709 }, "de": { "clips": 133646, "splits": { "accent": { "germany": 0.71, "": 0.23, "austria": 0.02, "liechtenstein": 0, "switzerland": 0.03, "france": 0, "other": 0.01, "poland": 0, "united_kingdom": 0, "hungary": 0, "netherlands": 0, "namibia": 0 }, "age": { "twenties": 0.31, "fourties": 0.17, "": 0.19, "thirties": 0.14, "teens": 0.04, "sixties": 0.03, "fifties": 0.12, "seventies": 0 }, "gender": { "male": 0.76, "": 0.19, "female": 0.05, "other": 0 } }, "users": 2249, "duration": 526772160, "buckets": { "dev": 2269, "invalidated": 5487, "other": 0, "test": 2269, "train": 2629, "validated": 128159 }, "size": 4151335731 }, "fr": { "clips": 75022, "splits": { "accent": { "": 0.22, "france": 0.74, "germany": 0, "belgium": 0.02, "switzerland": 0.01, "guadeloupe": 0, "reunion": 0, "monaco": 0, "tunisia": 0, "canada": 0.01, "other": 0, "mayotte": 0, "algeria": 0, "netherlands": 0, "senegal": 0, "martinique": 0, "portugal": 0, "united_states": 0, "cote_d_ivoire": 0, "st_pierre_et_miquelon": 0 }, "age": { "twenties": 0.23, "thirties": 0.2, "": 0.21, 
"teens": 0.06, "fourties": 0.25, "fifties": 0.02, "sixties": 0.02, "seventies": 0 }, "gender": { "male": 0.72, "": 0.21, "female": 0.07, "other": 0 } }, "users": 1697, "duration": 284516280, "buckets": { "dev": 8857, "invalidated": 4770, "other": 0, "test": 8858, "train": 18941, "validated": 70252 }, "size": 2245754155 }, "cy": { "clips": 19412, "splits": { "accent": { "united_kingdom": 0.6, "": 0.36, "other": 0.03 }, "age": { "fourties": 0.16, "twenties": 0.13, "sixties": 0.18, "fifties": 0.09, "": 0.32, "thirties": 0.11, "seventies": 0.01, "eighties": 0 }, "gender": { "male": 0.43, "female": 0.26, "": 0.31 } }, "users": 365, "duration": 79378296, "buckets": { "dev": 484, "invalidated": 672, "other": 0, "test": 484, "train": 500, "validated": 18731 }, "size": 622806292 }, "br": { "clips": 9306, "splits": { "accent": { "other": 0.01, "": 0.99 }, "age": { "twenties": 0.16, "": 0.56, "fifties": 0.01, "fourties": 0.13, "thirties": 0.14, "sixties": 0.01 }, "gender": { "male": 0.43, "": 0.56, "female": 0.02 } }, "users": 82, "duration": 26068056, "buckets": { "dev": 1022, "invalidated": 364, "other": 5147, "test": 1054, "train": 1458, "validated": 3795 }, "size": 201554829 }, "cv": { "clips": 2299, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.03, "": 0.54, "fourties": 0.39, "thirties": 0.03, "teens": 0.01 }, "gender": { "male": 0.46, "": 0.54 } }, "users": 33, "duration": 9802458, "buckets": { "dev": 49, "invalidated": 628, "other": 915, "test": 187, "train": 414, "validated": 756 }, "size": 77597058 }, "tr": { "clips": 6226, "splits": { "accent": { "": 0.87, "other": 0.13 }, "age": { "": 0.18, "thirties": 0.41, "twenties": 0.37, "teens": 0.02, "fourties": 0.01, "fifties": 0 }, "gender": { "": 0.18, "male": 0.75, "female": 0.07 } }, "users": 203, "duration": 23086560, "buckets": { "dev": 1039, "invalidated": 843, "other": 331, "test": 1112, "train": 1265, "validated": 5052 }, "size": 182107529 }, "tt": { "clips": 20663, "splits": { "accent": { "": 1 }, "age": 
{ "": 0.17, "thirties": 0.78, "twenties": 0.04, "sixties": 0, "fifties": 0.01, "teens": 0, "fourties": 0 }, "gender": { "": 0.17, "male": 0.8, "female": 0.02 } }, "users": 117, "duration": 73813776, "buckets": { "dev": 1882, "invalidated": 199, "other": 17, "test": 3437, "train": 7425, "validated": 20447 }, "size": 555266991 }, "ky": { "clips": 4766, "splits": { "accent": { "other": 0.11, "": 0.89 }, "age": { "thirties": 0.19, "": 0.13, "fourties": 0.05, "twenties": 0.64 }, "gender": { "male": 0.8, "": 0.15, "female": 0.06 } }, "users": 63, "duration": 22124544, "buckets": { "dev": 782, "invalidated": 249, "other": 0, "test": 1206, "train": 1888, "validated": 4517 }, "size": 152729320 }, "ga-IE": { "clips": 2007, "splits": { "accent": { "": 0.24, "connachta": 0.45, "other": 0.18, "ulaidh": 0.13 }, "age": { "twenties": 0.03, "": 0.21, "thirties": 0.57, "fourties": 0.13, "sixties": 0.01, "teens": 0.04 }, "gender": { "male": 0.57, "": 0.21, "female": 0.22 } }, "users": 30, "duration": 6320352, "buckets": { "dev": 284, "invalidated": 134, "other": 0, "test": 415, "train": 644, "validated": 1873 }, "size": 48777677 }, "kab": { "clips": 101313, "splits": { "accent": { "": 0.73, "other": 0.27 }, "age": { "fourties": 0.16, "thirties": 0.21, "": 0.33, "twenties": 0.17, "fifties": 0.04, "eighties": 0.08, "teens": 0, "sixties": 0 }, "gender": { "male": 0.53, "": 0.29, "female": 0.17, "other": 0.01 } }, "users": 382, "duration": 353173920, "buckets": { "dev": 4073, "invalidated": 6111, "other": 0, "test": 4073, "train": 5395, "validated": 95202 }, "size": 2787486601 }, "ca": { "clips": 77137, "splits": { "accent": { "valencian": 0.07, "central": 0.69, "": 0.18, "other": 0.01, "balearic": 0.01, "northwestern": 0.04, "northern": 0 }, "age": { "thirties": 0.13, "fifties": 0.26, "fourties": 0.28, "twenties": 0.08, "": 0.18, "sixties": 0.05, "teens": 0.02, "seventies": 0, "eighties": 0 }, "gender": { "male": 0.38, "": 0.18, "female": 0.44, "other": 0 } }, "users": 1639, "duration": 
353712432, "buckets": { "dev": 8375, "invalidated": 4302, "other": 21, "test": 8374, "train": 16870, "validated": 72814 }, "size": 2780768799 }, "zh-TW": { "clips": 36369, "splits": { "accent": { "": 0.79, "other": 0.21 }, "age": { "thirties": 0.3, "twenties": 0.43, "teens": 0.01, "": 0.24, "fifties": 0.01, "seventies": 0, "fourties": 0.02 }, "gender": { "male": 0.38, "": 0.23, "female": 0.35, "other": 0.04 } }, "users": 695, "duration": 101606832, "buckets": { "dev": 1154, "invalidated": 1765, "other": 9315, "test": 1154, "train": 1240, "validated": 25289 }, "size": 800988779 }, "sl": { "clips": 3286, "splits": { "accent": { "other": 0.02, "": 0.98 }, "age": { "twenties": 0.83, "teens": 0.01, "": 0.01, "sixties": 0, "fifties": 0.15 }, "gender": { "female": 0.17, "male": 0.82, "": 0.01 } }, "users": 18, "duration": 12475584, "buckets": { "dev": 291, "invalidated": 97, "other": 1399, "test": 320, "train": 762, "validated": 1790 }, "size": 98867237 }, "it": { "clips": 16048, "splits": { "accent": { "": 0.7, "other": 0.3 }, "age": { "thirties": 0.13, "twenties": 0.37, "": 0.34, "fifties": 0.08, "fourties": 0.06, "seventies": 0, "sixties": 0, "teens": 0.02 }, "gender": { "female": 0.07, "male": 0.67, "": 0.26 } }, "users": 313, "duration": 70795560, "buckets": { "dev": 3085, "invalidated": 3061, "other": 2, "test": 3082, "train": 3812, "validated": 12985 }, "size": 556736370 }, "nl": { "clips": 13385, "splits": { "accent": { "": 0.22, "netherlands": 0.68, "belgium": 0.1, "other": 0 }, "age": { "": 0.18, "twenties": 0.35, "fourties": 0.05, "thirties": 0.13, "teens": 0.02, "fifties": 0.26, "sixties": 0 }, "gender": { "": 0.24, "male": 0.74, "female": 0.02, "other": 0 } }, "users": 373, "duration": 48954768, "buckets": { "dev": 1542, "invalidated": 700, "other": 243, "test": 1542, "train": 1701, "validated": 12442 }, "size": 382910541 }, "cnh": { "clips": 4289, "splits": { "accent": { "": 0.8, "other": 0.2 }, "age": { "": 0.52, "twenties": 0.3, "fourties": 0.02, "teens": 
0.02, "thirties": 0.11, "fifties": 0.03 }, "gender": { "": 0.53, "male": 0.26, "female": 0.22 } }, "users": 253, "duration": 15737520, "buckets": { "dev": 641, "invalidated": 452, "other": 1689, "test": 659, "train": 733, "validated": 2148 }, "size": 124559394 }, "eo": { "clips": 5882, "splits": { "accent": { "": 0.76, "internacia": 0.19, "other": 0.05 }, "age": { "twenties": 0.78, "thirties": 0.04, "": 0.1, "fourties": 0.05, "fifties": 0.01, "seventies": 0, "teens": 0.02 }, "gender": { "male": 0.21, "": 0.68, "female": 0.1, "other": 0.01 } }, "users": 53, "duration": 23382864, "buckets": { "dev": 526, "invalidated": 238, "other": 1872, "test": 1057, "train": 1992, "validated": 3772 }, "size": 184514284 }, "et": { "clips": 35, "splits": { "accent": { "": 1 }, "age": { "": 0.14, "thirties": 0.86 }, "gender": { "": 0.14, "male": 0.86 } }, "users": 3, "duration": 230328, "buckets": { "dev": 5, "invalidated": 12, "other": 0, "test": 3, "train": 15, "validated": 23 }, "size": 1822156 } }, "totalDuration": 4925868978, "totalValidDurationSecs": 3946252, "totalHrs": 1368, "totalValidHrs": 1096, "totalClips": 1208111 } ================================================ FILE: datasets/scripted-speech/cv-corpus-10.0-2022-07-04.json ================================================ { "date": "2022-07-04", "locales": { "en": { "duration": 10981597567, "buckets": { "dev": 16345, "invalidated": 248337, "other": 293021, "reported": 4158, "test": 16345, "train": 921404, "validated": 1589008 }, "reportedSentences": 4095, "clips": 2130366, "splits": { "accent": { "": 1 }, "age": { "": 0.37, "twenties": 0.24, "sixties": 0.04, "thirties": 0.13, "teens": 0.06, "seventies": 0.01, "fourties": 0.1, "fifties": 0.05, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male": 0.45, "female": 0.16, "other": 0.02 } }, "users": 83790, "size": 78797545779, "checksum": "b82354bf4ff7a62568e071dbba3a48160f7368ed94890fd57f466a85c27e0511", "avgDurationSecs": 5.155, "validDurationSecs": 8191008.675, 
"totalHrs": 3050.44, "validHrs": 2275.28 }, "fa": { "duration": 1371060000, "buckets": { "dev": 9937, "invalidated": 13320, "other": 34431, "reported": 2117, "test": 9937, "train": 24672, "validated": 294426 }, "reportedSentences": 2109, "clips": 342177, "splits": { "accent": { "": 1 }, "age": { "": 0.24, "twenties": 0.31, "thirties": 0.37, "fifties": 0.02, "fourties": 0.03, "teens": 0.03, "sixties": 0 }, "gender": { "": 0.21, "male": 0.72, "female": 0.07, "other": 0 } }, "users": 4088, "size": 10118159391, "checksum": "c298ceacbe35edbc0ed948c068afbd87c2076768a5e70ebe7f4166f7e053e4a8", "avgDurationSecs": 4.0, "validDurationSecs": 1177704, "totalHrs": 380.85, "validHrs": 327.14 }, "fr": { "buckets": { "dev": 16058, "invalidated": 55708, "other": 10156, "reported": 6384, "test": 16058, "train": 458935, "validated": 625586 }, "reportedSentences": 6308, "duration": 3455188498, "clips": 691450, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.18, "thirties": 0.17, "": 0.35, "teens": 0.03, "fourties": 0.14, "fifties": 0.1, "sixties": 0.03, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "male": 0.62, "": 0.27, "female": 0.1, "other": 0.01 } }, "users": 16510, "size": 24964187263, "checksum": "a7f6596f3f679fca1a9ae24f319b5feda67bcea5d1a514c6f6f32ae65f88aa27", "avgDurationSecs": 4.997, "validDurationSecs": 3126064.866, "totalHrs": 959.77, "validHrs": 868.35 }, "es": { "buckets": { "dev": 15459, "invalidated": 48321, "other": 211249, "reported": 1773, "test": 15459, "train": 217774, "validated": 293025 }, "reportedSentences": 1759, "duration": 2792421414, "clips": 552595, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.11, "": 0.32, "fifties": 0.08, "twenties": 0.25, "teens": 0.03, "fourties": 0.07, "sixties": 0.15, "eighties": 0, "seventies": 0, "nineties": 0 }, "gender": { "male": 0.51, "": 0.32, "other": 0.01, "female": 0.17 } }, "users": 23627, "size": 20310025597, "checksum": 
"29921567c0b8f98953295ff53d69bd3c0b6c6beb746791472daba1c87199ae66", "avgDurationSecs": 5.053, "validDurationSecs": 1480739.574, "totalHrs": 775.67, "validHrs": 411.31 }, "sl": { "buckets": { "dev": 1143, "invalidated": 247, "other": 1596, "reported": 34, "test": 1233, "train": 1460, "validated": 9525 }, "reportedSentences": 35, "duration": 43362226, "clips": 11368, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.51, "teens": 0.08, "": 0.2, "sixties": 0.07, "fifties": 0.07, "fourties": 0.02, "thirties": 0.05 }, "gender": { "female": 0.16, "male": 0.64, "": 0.2, "other": 0 } }, "users": 137, "size": 308301224, "checksum": "a461623bfc1deed48ce7ef2ec4b64fd724006472b1e481e8c3dd47e3290c23f3", "avgDurationSecs": 3.814, "validDurationSecs": 36332.266, "totalHrs": 12.04, "validHrs": 10.09 }, "kab": { "buckets": { "dev": 14889, "invalidated": 19217, "other": 102012, "reported": 4844, "test": 14891, "train": 141620, "validated": 597994 }, "reportedSentences": 4838, "duration": 2393581624, "clips": 719223, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.09, "thirties": 0.3, "": 0.27, "fifties": 0.19, "twenties": 0.12, "eighties": 0, "teens": 0, "sixties": 0.03, "seventies": 0 }, "gender": { "male": 0.54, "": 0.25, "female": 0.2, "other": 0 } }, "users": 1451, "size": 18030751474, "checksum": "c0b4cec3a040eaf0abf4ab5cc434aef152939f901cb8fd5e7a46f0dc03d280cd", "avgDurationSecs": 3.328, "validDurationSecs": 1990130.251, "totalHrs": 664.88, "validHrs": 552.81 }, "cy": { "buckets": { "dev": 5200, "invalidated": 4294, "other": 18134, "reported": 152, "test": 5209, "train": 7594, "validated": 87294 }, "reportedSentences": 153, "duration": 531427401, "clips": 109722, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.16, "twenties": 0.13, "sixties": 0.06, "fifties": 0.09, "": 0.42, "thirties": 0.09, "seventies": 0.01, "eighties": 0, "teens": 0.02 }, "gender": { "male": 0.33, "female": 0.25, "": 0.41, "other": 0.01 } }, "users": 1715, "size": 3921370932, 
"checksum": "83f81318cf77b1f9835762b0f4dc06af083d387bf0524200b8dce8492a16fb56", "avgDurationSecs": 4.843, "validDurationSecs": 422799.653, "totalHrs": 147.61, "validHrs": 117.44 }, "ca": { "buckets": { "dev": 16277, "invalidated": 65015, "other": 560080, "reported": 4809, "test": 16277, "train": 744272, "validated": 907704 }, "reportedSentences": 4761, "duration": 8449005306, "clips": 1532799, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.07, "fifties": 0.17, "fourties": 0.1, "twenties": 0.06, "": 0.34, "sixties": 0.22, "teens": 0.01, "seventies": 0.03, "eighties": 0, "nineties": 0 }, "gender": { "male": 0.42, "": 0.35, "female": 0.23, "other": 0 } }, "users": 28648, "size": 52904759182, "checksum": "e6df7a73ffa2f9b61615c4f8199b44fb7a599cbbbd74a56789758842a6d77f54", "avgDurationSecs": 5.512, "validDurationSecs": 5003393.082, "totalHrs": 2346.94, "validHrs": 1389.83 }, "de": { "buckets": { "dev": 16067, "invalidated": 47081, "other": 5646, "reported": 7953, "test": 16067, "train": 466189, "validated": 793068 }, "reportedSentences": 7929, "duration": 4363152833, "clips": 845795, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.19, "fourties": 0.17, "": 0.32, "thirties": 0.15, "teens": 0.03, "sixties": 0.03, "fifties": 0.1, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "male": 0.59, "": 0.32, "female": 0.09, "other": 0.01 } }, "users": 16944, "size": 31181120747, "checksum": "a293c3e341f6aaf25019bc852d6475d0ef2f85c2835f8481f7cc65dbd0bde2fa", "avgDurationSecs": 5.159, "validDurationSecs": 4091153.165, "totalHrs": 1211.98, "validHrs": 1136.43 }, "tt": { "duration": 108365246, "buckets": { "dev": 3062, "invalidated": 385, "other": 7, "reported": 3, "test": 5119, "train": 9783, "validated": 28531 }, "reportedSentences": 4, "clips": 28923, "splits": { "accent": { "": 1 }, "age": { "": 0.2, "thirties": 0.73, "twenties": 0.05, "sixties": 0, "fifties": 0.01, "teens": 0, "fourties": 0, "seventies": 0.01 }, "gender": { "": 0.2, "male": 0.79, 
"female": 0.02 } }, "users": 219, "size": 802652172, "checksum": "e5b1670372444451dbd146a2bb911144eca59a843f59c347c98f75ee5c5ac507", "avgDurationSecs": 3.747, "validDurationSecs": 106896.547, "totalHrs": 30.1, "validHrs": 29.69 }, "ta": { "duration": 1370209560, "buckets": { "dev": 11781, "invalidated": 5557, "other": 85369, "reported": 3296, "test": 11820, "train": 40987, "validated": 129693 }, "reportedSentences": 3296, "clips": 220619, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.08, "thirties": 0.09, "": 0.72, "fourties": 0.03, "seventies": 0.02, "fifties": 0.03, "teens": 0.03, "sixties": 0, "eighties": 0 }, "gender": { "male": 0.16, "": 0.71, "other": 0, "female": 0.13 } }, "users": 761, "size": 8212452630, "checksum": "8d8427ca7d2735131f5b77afc4ecd01342f617487f76a64c5eb2a597bc74f9a2", "avgDurationSecs": 6.211, "validDurationSecs": 805490.862, "totalHrs": 380.61, "validHrs": 223.74 }, "ru": { "duration": 753692040, "buckets": { "dev": 9495, "invalidated": 6751, "other": 20572, "reported": 320, "test": 9494, "train": 22117, "validated": 118707 }, "reportedSentences": 314, "clips": 146030, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.4, "teens": 0.09, "": 0.21, "fourties": 0.14, "thirties": 0.13, "fifties": 0.03, "sixties": 0, "seventies": 0 }, "gender": { "male": 0.62, "": 0.21, "other": 0, "female": 0.16 } }, "users": 2688, "size": 5297836761, "checksum": "78ea4fa2de776edc8ecae52440d0e4fc4669eb962ae8150f91078c907c5d049a", "avgDurationSecs": 5.161, "validDurationSecs": 612672.197, "totalHrs": 209.35, "validHrs": 170.18 }, "nl": { "duration": 394176457, "buckets": { "dev": 10634, "invalidated": 5098, "other": 2412, "reported": 318, "test": 10641, "train": 29521, "validated": 83821 }, "reportedSentences": 318, "clips": 91331, "splits": { "accent": { "": 1 }, "age": { "": 0.41, "twenties": 0.22, "fourties": 0.14, "thirties": 0.11, "teens": 0.02, "fifties": 0.08, "sixties": 0.01, "nineties": 0, "eighties": 0, "seventies": 0 }, "gender": { "": 
0.42, "male": 0.47, "female": 0.11, "other": 0 } }, "users": 1520, "size": 2713214861, "checksum": "d9053d64a7e5fc2d3853c0d920c73229650cb4d88130c9c9b9939f1fa582fe4a", "avgDurationSecs": 4.316, "validDurationSecs": 361763.966, "totalHrs": 109.49, "validHrs": 100.48 }, "it": { "duration": 1249550976, "buckets": { "dev": 14964, "invalidated": 17188, "other": 27, "reported": 5279, "test": 14973, "train": 149590, "validated": 216074 }, "reportedSentences": 5275, "clips": 233289, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.16, "twenties": 0.21, "": 0.3, "fifties": 0.16, "fourties": 0.14, "seventies": 0, "sixties": 0.02, "teens": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "female": 0.12, "male": 0.59, "": 0.29, "other": 0 } }, "users": 6735, "size": 8670197349, "checksum": "ae01f1b6fd93a65d964c274c4aee182ea00d861f0a4f9e98bc5c06fe55a4a1b5", "avgDurationSecs": 5.356, "validDurationSecs": 1157343.371, "totalHrs": 347.09, "validHrs": 321.48 }, "eu": { "duration": 528391635, "buckets": { "dev": 6560, "invalidated": 5790, "other": 26855, "reported": 61, "test": 6560, "train": 10829, "validated": 69142 }, "reportedSentences": 61, "clips": 101787, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.13, "thirties": 0.07, "fifties": 0.14, "twenties": 0.35, "": 0.25, "teens": 0.03, "sixties": 0.02, "seventies": 0 }, "gender": { "male": 0.47, "female": 0.26, "": 0.25, "other": 0.02 } }, "users": 1209, "size": 3983306491, "checksum": "a0b8c72bdcce0d23e9a58ba71681e5a1c5093504159c5f69d699a4b9da42cb85", "avgDurationSecs": 5.191, "validDurationSecs": 358926.527, "totalHrs": 146.77, "validHrs": 99.7 }, "tr": { "duration": 285540831, "buckets": { "dev": 9095, "invalidated": 3503, "other": 151, "reported": 332, "test": 9124, "train": 20228, "validated": 74486 }, "reportedSentences": 333, "clips": 78140, "splits": { "accent": { "": 1 }, "age": { "": 0.33, "thirties": 0.09, "twenties": 0.27, "teens": 0.02, "fourties": 0.04, "fifties": 0.09, "sixties": 0.12, "eighties": 
0, "seventies": 0.03 }, "gender": { "": 0.33, "male": 0.46, "female": 0.21, "other": 0 } }, "users": 1299, "size": 1777072023, "checksum": "2d12424877b65b3b39e6031c368a3d60579e347d31593558c57e0a3dc11b3791", "avgDurationSecs": 3.654, "validDurationSecs": 272188.307, "totalHrs": 79.31, "validHrs": 75.6 }, "ar": { "duration": 523410000, "buckets": { "dev": 10354, "invalidated": 14919, "other": 34918, "reported": 2062, "test": 10435, "train": 28078, "validated": 75862 }, "reportedSentences": 2049, "clips": 125699, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.11, "": 0.56, "twenties": 0.28, "fourties": 0.01, "teens": 0.03, "fifties": 0, "sixties": 0, "nineties": 0 }, "gender": { "female": 0.18, "": 0.56, "male": 0.27, "other": 0 } }, "users": 1272, "size": 3110063691, "checksum": "4f81fec5272134b6e7de8195fc94e629975c3142691587ad3be961c3cb12a686", "avgDurationSecs": 4.164, "validDurationSecs": 315889.368, "totalHrs": 145.4, "validHrs": 87.747 }, "zh-TW": { "duration": 396217642, "buckets": { "dev": 4670, "invalidated": 4556, "other": 39955, "reported": 139, "test": 4670, "train": 6494, "validated": 76358 }, "reportedSentences": 140, "clips": 120869, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.2, "twenties": 0.33, "teens": 0.05, "": 0.27, "fifties": 0.04, "seventies": 0, "fourties": 0.1, "sixties": 0 }, "gender": { "male": 0.47, "": 0.27, "female": 0.25, "other": 0.02 } }, "users": 2061, "size": 2797955208, "checksum": "c4c488f69eadae226b056396e3f6dc40baf89674240afb101e19d01ee2922614", "avgDurationSecs": 3.278, "validDurationSecs": 250307.248, "totalHrs": 110.06, "validHrs": 69.52 }, "br": { "duration": 72258667, "buckets": { "dev": 2158, "invalidated": 759, "other": 11499, "reported": 221, "test": 2157, "train": 2559, "validated": 11168 }, "reportedSentences": 221, "clips": 23426, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.25, "": 0.33, "fifties": 0.06, "fourties": 0.07, "thirties": 0.08, "sixties": 0.17, "seventies": 0.02, "teens": 
0.01 }, "gender": { "male": 0.64, "": 0.33, "female": 0.02, "other": 0 } }, "users": 177, "size": 531272372, "checksum": "6275bfb2f17e9857a61d25185e47d83f9486f3b5a08e67219f269e0559410f3a", "avgDurationSecs": 3.085, "validDurationSecs": 34448.254, "totalHrs": 20.07, "validHrs": 9.56 }, "pt": { "duration": 521906587, "buckets": { "dev": 8606, "invalidated": 4634, "other": 16276, "reported": 2390, "test": 8611, "train": 17852, "validated": 103678 }, "reportedSentences": 2385, "clips": 124588, "splits": { "accent": { "": 1 }, "age": { "": 0.2, "twenties": 0.41, "teens": 0.03, "thirties": 0.22, "fourties": 0.1, "sixties": 0.01, "fifties": 0.03, "seventies": 0 }, "gender": { "": 0.2, "male": 0.74, "female": 0.04, "other": 0.02 } }, "users": 2562, "size": 3400999932, "checksum": "3ceeb91b12a07cbf8cf983b367573a35fdb89b39d925c086b595d1d32ca807cb", "avgDurationSecs": 4.189, "validDurationSecs": 434313.346, "totalHrs": 144.97, "validHrs": 120.64 }, "eo": { "duration": 6740710000, "buckets": { "dev": 14907, "invalidated": 127293, "other": 135058, "reported": 2127, "test": 14907, "train": 143988, "validated": 848511 }, "reportedSentences": 2126, "clips": 1110862, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.56, "thirties": 0.12, "": 0.2, "fourties": 0.04, "fifties": 0.02, "seventies": 0, "teens": 0.05, "sixties": 0, "eighties": 0 }, "gender": { "male": 0.69, "": 0.2, "female": 0.11, "other": 0 } }, "users": 1541, "size": 40260737095, "checksum": "2179bad54bb2b69cd12964bc2f6533b9538b7a3f943f9e65f8f9a463796fd901", "avgDurationSecs": 6.068, "validDurationSecs": 5148764, "totalHrs": 1872.42, "validHrs": 1430.21 }, "zh-CN": { "duration": 1614763392, "buckets": { "dev": 9760, "invalidated": 6889, "other": 293461, "reported": 510, "test": 9783, "train": 23764, "validated": 50432 }, "reportedSentences": 501, "clips": 350782, "splits": { "accent": { "": 1 }, "age": { "": 0.87, "teens": 0.02, "twenties": 0.08, "thirties": 0.02, "fourties": 0.01, "nineties": 0, "fifties": 0, 
"sixties": 0 }, "gender": { "": 0.87, "male": 0.11, "female": 0.02, "other": 0 } }, "users": 5061, "size": 9873490748, "checksum": "5862d3e55aaa507b62c6e81343bb17ae784a32c1a36e9facf4b2713ecb5da4ce", "avgDurationSecs": 4.603, "validDurationSecs": 232154.864, "totalHrs": 448.54, "validHrs": 64.48 }, "id": { "duration": 200152524, "buckets": { "dev": 3219, "invalidated": 2460, "other": 23682, "reported": 268, "test": 3621, "train": 5046, "validated": 23219 }, "reportedSentences": 269, "clips": 49361, "splits": { "accent": { "": 1 }, "age": { "": 0.26, "twenties": 0.39, "thirties": 0.07, "teens": 0.26, "fifties": 0, "fourties": 0.02 }, "gender": { "": 0.26, "male": 0.41, "female": 0.29, "other": 0.04 } }, "users": 438, "size": 1273536247, "checksum": "33d49848b4e5341d166642ce175dc7c6128114307c4f3cab881be4e34b0703f7", "avgDurationSecs": 4.055, "validDurationSecs": 94150.067, "totalHrs": 55.59, "validHrs": 26.15 }, "ia": { "duration": 60377448, "buckets": { "dev": 1789, "invalidated": 328, "other": 2744, "reported": 264, "test": 1731, "train": 5049, "validated": 11361 }, "reportedSentences": 260, "clips": 14433, "splits": { "accent": { "": 1 }, "age": { "seventies": 0.22, "fourties": 0.3, "": 0.39, "twenties": 0.05, "thirties": 0.02, "teens": 0, "fifties": 0.03, "sixties": 0 }, "gender": { "male": 0.61, "": 0.39, "female": 0.01 } }, "users": 60, "size": 409324563, "checksum": "581ed384b3498194710ae266a83bfc220df575893f57db49907db29cd8fcfbdf", "avgDurationSecs": 4.183, "validDurationSecs": 47526.376, "totalHrs": 16.77, "validHrs": 13.2 }, "lv": { "duration": 30849209, "buckets": { "dev": 1857, "invalidated": 167, "other": 1267, "reported": 27, "test": 2161, "train": 3103, "validated": 7608 }, "reportedSentences": 28, "clips": 9042, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.48, "fourties": 0.03, "": 0.18, "twenties": 0.28, "teens": 0.03, "fifties": 0 }, "gender": { "male": 0.7, "female": 0.13, "": 0.17 } }, "users": 117, "size": 226830766, "checksum": 
"6b245bc30f4a09415c0234545068084692a919b657df7281b414931e331fdf2f", "avgDurationSecs": 3.412, "validDurationSecs": 25956.733, "totalHrs": 8.56, "validHrs": 7.21 }, "ja": { "duration": 185458929, "buckets": { "dev": 4312, "invalidated": 2262, "other": 370, "reported": 153, "test": 4489, "train": 6352, "validated": 36021 }, "reportedSentences": 153, "clips": 38653, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.32, "": 0.23, "teens": 0.04, "fifties": 0.01, "thirties": 0.1, "fourties": 0.29, "sixties": 0, "seventies": 0 }, "gender": { "male": 0.53, "": 0.21, "female": 0.25, "other": 0 } }, "users": 652, "size": 1125802593, "checksum": "7196d23c02058a545c921539aa553bc6655c692274bca88cf0941f7e30018826", "avgDurationSecs": 4.798, "validDurationSecs": 172830.468, "totalHrs": 51.51, "validHrs": 48 }, "rw": { "duration": 8580574229, "buckets": { "dev": 15987, "invalidated": 227746, "other": 47302, "reported": 629, "test": 16213, "train": 1003021, "validated": 1438408 }, "reportedSentences": 630, "clips": 1713456, "splits": { "accent": { "": 1 }, "age": { "": 0.05, "twenties": 0.61, "thirties": 0.12, "teens": 0.2, "fourties": 0.02, "fifties": 0 }, "gender": { "": 0.1, "male": 0.57, "female": 0.33, "other": 0 } }, "users": 1076, "size": 60998084828, "checksum": "824d4a62cc4ce8a5e3fe0b4c24bd5a191a286dff50d4f4ccc4c724b342413a4b", "avgDurationSecs": 5.008, "validDurationSecs": 7203200.208, "totalHrs": 2383.49, "validHrs": 2000.88 }, "sv-SE": { "duration": 180457431, "buckets": { "dev": 5055, "invalidated": 1338, "other": 5701, "reported": 575, "test": 5057, "train": 7275, "validated": 38581 }, "reportedSentences": 576, "clips": 45620, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.25, "": 0.18, "teens": 0.03, "fifties": 0.03, "twenties": 0.12, "fourties": 0.38, "sixties": 0, "seventies": 0 }, "gender": { "male": 0.48, "": 0.18, "female": 0.33, "other": 0 } }, "users": 752, "size": 1145881820, "checksum": 
"ee19ce93f376d4980e4afb32f7b7ac04b74fd257547718a43110d530789a1e95", "avgDurationSecs": 3.956, "validDurationSecs": 152613.506, "totalHrs": 50.12, "validHrs": 42.39 }, "cnh": { "duration": 20675832, "buckets": { "dev": 761, "invalidated": 436, "other": 2908, "reported": 8, "test": 763, "train": 817, "validated": 2458 }, "reportedSentences": 9, "clips": 5802, "splits": { "accent": { "": 1 }, "age": { "": 0.51, "twenties": 0.36, "fourties": 0.01, "teens": 0.02, "thirties": 0.08, "fifties": 0.02 }, "gender": { "": 0.51, "male": 0.33, "female": 0.16 } }, "users": 299, "size": 161394167, "checksum": "76a3e555e9503e94799077f48b1ef84acfef3f1f19fd9cf1f6a30dc7c10b48fa", "avgDurationSecs": 3.564, "validDurationSecs": 8759.255, "totalHrs": 5.74, "validHrs": 2.43 }, "et": { "duration": 195218498, "buckets": { "dev": 2637, "invalidated": 6610, "other": 651, "reported": 476, "test": 2637, "train": 3136, "validated": 21633 }, "reportedSentences": 473, "clips": 28894, "splits": { "accent": { "": 1 }, "age": { "": 0.2, "thirties": 0.08, "twenties": 0.68, "fourties": 0.04, "fifties": 0, "seventies": 0, "teens": 0 }, "gender": { "": 0.2, "male": 0.54, "female": 0.26, "other": 0 } }, "users": 808, "size": 1332576701, "checksum": "895f05b75825d23da787a223cd1c84f44b07bba18bec1e34a8a46b7afc642e56", "avgDurationSecs": 6.756, "validDurationSecs": 146160.51, "totalHrs": 54.22, "validHrs": 40.6 }, "ky": { "duration": 161722308, "buckets": { "dev": 1613, "invalidated": 5588, "other": 309, "reported": 36, "test": 1613, "train": 1787, "validated": 29711 }, "reportedSentences": 37, "clips": 35608, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.08, "": 0.07, "fourties": 0.01, "twenties": 0.66, "teens": 0.18 }, "gender": { "male": 0.54, "": 0.11, "female": 0.35, "other": 0 } }, "users": 247, "size": 1045163001, "checksum": "b23c82a73e969b2bb26b7ffe4e4dc327b4bea59514fcc054be492192bc8ea493", "avgDurationSecs": 4.542, "validDurationSecs": 134939.662, "totalHrs": 44.92, "validHrs": 37.48 }, 
"ro": { "duration": 138589267, "buckets": { "dev": 3792, "invalidated": 860, "other": 19079, "reported": 304, "test": 3841, "train": 5080, "validated": 14777 }, "reportedSentences": 305, "clips": 34716, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.14, "teens": 0.02, "": 0.11, "fourties": 0.06, "sixties": 0, "twenties": 0.66, "fifties": 0.01, "eighties": 0 }, "gender": { "male": 0.73, "": 0.11, "female": 0.15, "other": 0.01 } }, "users": 361, "size": 870416684, "checksum": "c9da340807a83058beea735c8e3290a327886933ac6b158744de9ecf6d44c87f", "avgDurationSecs": 3.992, "validDurationSecs": 58991.059, "totalHrs": 38.49, "validHrs": 16.38 }, "hsb": { "duration": 10207332, "buckets": { "dev": 172, "invalidated": 243, "other": 8, "reported": 71, "test": 440, "train": 808, "validated": 1420 }, "reportedSentences": 72, "clips": 1671, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.55, "": 0.18, "thirties": 0.1, "sixties": 0, "seventies": 0.03, "twenties": 0.11, "fifties": 0.03 }, "gender": { "male": 0.82, "": 0.18, "other": 0 } }, "users": 20, "size": 79769846, "checksum": "f3cb738b99ef8700809e4787c7877dbd90942b6b197375d212c3c5951ad0b32b", "avgDurationSecs": 6.109, "validDurationSecs": 8674.094, "totalHrs": 2.83, "validHrs": 2.4 }, "el": { "duration": 103011982, "buckets": { "dev": 1704, "invalidated": 792, "other": 9222, "reported": 64, "test": 1695, "train": 1910, "validated": 14919 }, "reportedSentences": 65, "clips": 24933, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.38, "fourties": 0.13, "": 0.33, "twenties": 0.13, "fifties": 0.03, "teens": 0.01, "sixties": 0 }, "gender": { "male": 0.63, "": 0.32, "other": 0.02, "female": 0.03 } }, "users": 341, "size": 694147628, "checksum": "19d90367ab2be112a8400b5c39bdc718d45c07c7fc311ca29966145bb75931bd", "avgDurationSecs": 4.132, "validDurationSecs": 61638.622, "totalHrs": 28.61, "validHrs": 17.12 }, "cs": { "duration": 253963263, "buckets": { "dev": 7257, "invalidated": 1275, "other": 9169, 
"reported": 700, "test": 7585, "train": 14413, "validated": 48252 }, "reportedSentences": 697, "clips": 58696, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.19, "": 0.36, "thirties": 0.14, "teens": 0.01, "twenties": 0.27, "fifties": 0.02, "sixties": 0, "seventies": 0 }, "gender": { "male": 0.62, "": 0.35, "female": 0.02 } }, "users": 567, "size": 1788385110, "checksum": "f3e5120b45c0c1a469cfea6008fcee7b102c245b3e772128e2ae189a2799feca", "avgDurationSecs": 4.327, "validDurationSecs": 208774.625, "totalHrs": 70.54, "validHrs": 57.99 }, "pl": { "duration": 599130506, "buckets": { "dev": 8223, "invalidated": 6061, "other": 5576, "reported": 535, "test": 8223, "train": 16261, "validated": 122188 }, "reportedSentences": 535, "clips": 133825, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.28, "": 0.24, "teens": 0.02, "thirties": 0.32, "fourties": 0.12, "fifties": 0.01, "nineties": 0.01, "sixties": 0 }, "gender": { "male": 0.6, "": 0.25, "female": 0.14, "other": 0.01 } }, "users": 3100, "size": 4305564674, "checksum": "f9d491272e90ef9a10451779b9a4ceed52be45e9ceee80e6a3fc2b9689dc348c", "avgDurationSecs": 4.477, "validDurationSecs": 547032.006, "totalHrs": 166.42, "validHrs": 151.95 }, "rm-sursilv": { "duration": 38747237, "buckets": { "dev": 1344, "invalidated": 674, "other": 2177, "reported": 11, "test": 1329, "train": 1544, "validated": 4220 }, "reportedSentences": 12, "clips": 7071, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.03, "twenties": 0.1, "": 0.64, "teens": 0.06, "fourties": 0.17 }, "gender": { "male": 0.17, "female": 0.19, "": 0.64, "other": 0 } }, "users": 85, "size": 292787901, "checksum": "e3bfa984c4cd61b2b9cf3dca1051ae64e49815b5e5786acfbf9a56242bfc9f1a", "avgDurationSecs": 5.48, "validDurationSecs": 23124.5, "totalHrs": 10.76, "validHrs": 6.42 }, "rm-vallader": { "duration": 15074402, "buckets": { "dev": 376, "invalidated": 392, "other": 720, "reported": 31, "test": 437, "train": 664, "validated": 1484 }, "reportedSentences": 
30, "clips": 2596, "splits": { "accent": { "": 1 }, "age": { "": 0.36, "fourties": 0.41, "twenties": 0.14, "thirties": 0.06, "fifties": 0, "sixties": 0.03 }, "gender": { "": 0.36, "male": 0.44, "female": 0.19, "other": 0.01 } }, "users": 51, "size": 115234506, "checksum": "1e12ab4b075f336e5937333ef9b48bd61b0bbdb81b19ad1c43f94b73aee1c693", "avgDurationSecs": 5.807, "validDurationSecs": 8617.262, "totalHrs": 4.18, "validHrs": 2.39 }, "mn": { "duration": 68212232, "buckets": { "dev": 1853, "invalidated": 754, "other": 3449, "reported": 18, "test": 1881, "train": 2170, "validated": 8258 }, "reportedSentences": 19, "clips": 12461, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.23, "": 0.27, "twenties": 0.41, "fourties": 0.01, "teens": 0.02, "nineties": 0.06, "fifties": 0 }, "gender": { "male": 0.36, "": 0.27, "female": 0.31, "other": 0.06 } }, "users": 472, "size": 519169015, "checksum": "7b8861f86e4414d444b8c1e2a64db45da8b82eeeac034ec426615ad4cdbbd822", "avgDurationSecs": 5.474, "validDurationSecs": 45204.768, "totalHrs": 18.94, "validHrs": 12.55 }, "zh-HK": { "duration": 464720136, "buckets": { "dev": 5587, "invalidated": 4166, "other": 17036, "reported": 638, "test": 5587, "train": 8414, "validated": 89103 }, "reportedSentences": 627, "clips": 110305, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.12, "thirties": 0.11, "": 0.4, "teens": 0.02, "fifties": 0.02, "seventies": 0, "sixties": 0, "twenties": 0.31 }, "gender": { "male": 0.42, "": 0.36, "female": 0.21, "other": 0.01 } }, "users": 2907, "size": 3364974920, "checksum": "dbd0a4254447b5de319be1be5d4262ec9bfe68da0d7235430469c33d9298c985", "avgDurationSecs": 4.213, "validDurationSecs": 375395.116, "totalHrs": 129.08, "validHrs": 104.27 }, "ab": { "duration": 301806756, "buckets": { "dev": 9152, "invalidated": 5271, "other": 11662, "reported": 220, "test": 9122, "train": 21027, "validated": 41930 }, "reportedSentences": 219, "clips": 58863, "splits": { "accent": { "": 1 }, "age": { "seventies": 
0.01, "thirties": 0.13, "": 0.19, "teens": 0.28, "twenties": 0.18, "fifties": 0.06, "sixties": 0.05, "fourties": 0.09, "eighties": 0.01 }, "gender": { "male": 0.18, "female": 0.64, "": 0.18 } }, "users": 397, "size": 1723981967, "checksum": "404ea029bc6cfca120fe9c1b181cee4ad23957621ab18c0401a8dc732877b053", "avgDurationSecs": 5.127, "validDurationSecs": 214986.618, "totalHrs": 83.83, "validHrs": 59.71 }, "cv": { "duration": 88743504, "buckets": { "dev": 1140, "invalidated": 1960, "other": 990, "reported": 143, "test": 1267, "train": 1566, "validated": 14664 }, "reportedSentences": 139, "clips": 17614, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.49, "": 0.21, "fourties": 0.06, "thirties": 0.01, "teens": 0.21, "fifties": 0.01 }, "gender": { "male": 0.52, "": 0.19, "female": 0.29 } }, "users": 104, "size": 613669627, "checksum": "689f89b2db0b97c1fbf5cae853b438d22d151f66fe2b1c9b88dd187462f5df28", "avgDurationSecs": 5.038, "validDurationSecs": 73880.705, "totalHrs": 24.65, "validHrs": 20.52 }, "uk": { "duration": 302901648, "buckets": { "dev": 6786, "invalidated": 2410, "other": 8616, "reported": 587, "test": 6785, "train": 11463, "validated": 52269 }, "reportedSentences": 588, "clips": 63295, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.25, "teens": 0.1, "": 0.26, "fourties": 0.13, "thirties": 0.26, "fifties": 0, "sixties": 0 }, "gender": { "male": 0.58, "female": 0.16, "": 0.26 } }, "users": 734, "size": 2061567072, "checksum": "b5bd9d8d49d96e5ef865a59e5ea00e16473be254cbbcc3874e8b58d7162600cb", "avgDurationSecs": 4.786, "validDurationSecs": 250136.128, "totalHrs": 84.13, "validHrs": 69.48 }, "mt": { "duration": 61216920, "buckets": { "dev": 1594, "invalidated": 320, "other": 6252, "reported": 9, "test": 1636, "train": 1948, "validated": 6350 }, "reportedSentences": 10, "clips": 12922, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.19, "": 0.26, "fourties": 0.17, "thirties": 0.09, "teens": 0.03, "fifties": 0.26, "sixties": 0.01 }, 
"gender": { "male": 0.25, "": 0.26, "female": 0.48, "other": 0.01 } }, "users": 205, "size": 455461817, "checksum": "b5ef5b1715cc505e3202161a72ed16371856e133d8d61ee17d996bb3a29ff3f6", "avgDurationSecs": 4.737, "validDurationSecs": 30082.607, "totalHrs": 17, "validHrs": 8.35 }, "as": { "duration": 11681021, "buckets": { "dev": 448, "invalidated": 92, "other": 605, "reported": 9, "test": 307, "train": 604, "validated": 1359 }, "reportedSentences": 10, "clips": 2056, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.37, "": 0.59, "thirties": 0.04, "teens": 0 }, "gender": { "male": 0.41, "": 0.59, "female": 0 } }, "users": 42, "size": 73049488, "checksum": "4c6eca577436845cf0fe990dfc5a396c5ccfb9df89c0dd3a17825604d6ee320c", "avgDurationSecs": 5.681, "validDurationSecs": 7721.064, "totalHrs": 3.24, "validHrs": 2.14 }, "ka": { "duration": 29847816, "buckets": { "dev": 1353, "invalidated": 367, "other": 5, "reported": 40, "test": 1365, "train": 1686, "validated": 5232 }, "reportedSentences": 41, "clips": 5604, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.38, "thirties": 0.24, "": 0.34, "fourties": 0.02, "fifties": 0, "teens": 0.01 }, "gender": { "male": 0.46, "female": 0.2, "": 0.35 } }, "users": 136, "size": 196989034, "checksum": "042994d0aad43cd28261476019f9c00aa704c9d6738102beeda3345e9741b04f", "avgDurationSecs": 5.326, "validDurationSecs": 27866.483, "totalHrs": 8.29, "validHrs": 7.74 }, "fy-NL": { "duration": 459512019, "buckets": { "dev": 3025, "invalidated": 2913, "other": 53343, "reported": 425, "test": 3025, "train": 3700, "validated": 36057 }, "reportedSentences": 423, "clips": 92313, "splits": { "accent": { "": 1 }, "age": { "": 0.59, "fifties": 0.12, "thirties": 0.04, "twenties": 0.02, "fourties": 0.07, "sixties": 0.15, "seventies": 0.01, "teens": 0, "eighties": 0 }, "gender": { "": 0.6, "male": 0.1, "female": 0.3 } }, "users": 1143, "size": 2876553843, "checksum": "93281e617fbfe22a4f677bc2039ac91b343b43fc4fb952fb0e6d8a477878820d", 
"avgDurationSecs": 4.978, "validDurationSecs": 179483.116, "totalHrs": 127.64, "validHrs": 49.85 }, "dv": { "duration": 212224635, "buckets": { "dev": 2253, "invalidated": 1545, "other": 14616, "reported": 49, "test": 2249, "train": 2611, "validated": 25883 }, "reportedSentences": 50, "clips": 42044, "splits": { "accent": { "": 1 }, "age": { "": 0.22, "twenties": 0.18, "thirties": 0.36, "fourties": 0.23, "teens": 0.01, "nineties": 0 }, "gender": { "": 0.22, "male": 0.28, "female": 0.5 } }, "users": 315, "size": 1362758958, "checksum": "467a1a1cf204e4b8a2f713e694f83db9705c35131f1ef66f0a90f597858be0c7", "avgDurationSecs": 5.048, "validDurationSecs": 130649.087, "totalHrs": 58.95, "validHrs": 36.29 }, "pa-IN": { "duration": 13144634, "buckets": { "dev": 280, "invalidated": 75, "other": 1285, "reported": 249, "test": 399, "train": 685, "validated": 1364 }, "reportedSentences": 244, "clips": 2724, "splits": { "accent": { "": 1 }, "age": { "": 0.25, "fourties": 0.04, "fifties": 0.05, "thirties": 0.4, "twenties": 0.25, "sixties": 0, "teens": 0 }, "gender": { "": 0.25, "male": 0.75, "female": 0 } }, "users": 58, "size": 94525450, "checksum": "f397560fdf6a0d61d756dc94c2ebe03f4b9f02336561e5e80f3117f1a8a6c8a3", "avgDurationSecs": 4.825, "validDurationSecs": 6581.968, "totalHrs": 3.65, "validHrs": 1.82 }, "vi": { "duration": 63903104, "buckets": { "dev": 227, "invalidated": 334, "other": 11313, "reported": 179, "test": 1195, "train": 2559, "validated": 4460 }, "reportedSentences": 178, "clips": 16107, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.02, "twenties": 0.18, "": 0.25, "teens": 0.21, "seventies": 0, "fourties": 0.02, "sixties": 0.31 }, "gender": { "male": 0.52, "": 0.25, "female": 0.21, "other": 0.02 } }, "users": 229, "size": 371346215, "checksum": "a4a0f2d2dfc35ef4317c334713762aaa51b53f7b26e1c9f0e860b5ed0cc1f31a", "avgDurationSecs": 3.967, "validDurationSecs": 17694.657, "totalHrs": 17.75, "validHrs": 4.91 }, "or": { "duration": 35312604, "buckets": { 
"dev": 309, "invalidated": 163, "other": 5689, "reported": 9, "test": 218, "train": 477, "validated": 1143 }, "reportedSentences": 10, "clips": 6995, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.16, "": 0.08, "thirties": 0.75, "fourties": 0, "teens": 0 }, "gender": { "male": 0.92, "": 0.08, "female": 0 } }, "users": 86, "size": 255371109, "checksum": "00672631bee5854227d87b0f84ba5f19b8263477d34028dc086d9eeb2514e6a8", "avgDurationSecs": 5.048, "validDurationSecs": 5770.165, "totalHrs": 9.8, "validHrs": 1.6 }, "ga-IE": { "duration": 33018939, "buckets": { "dev": 511, "invalidated": 815, "other": 3899, "reported": 14, "test": 512, "train": 535, "validated": 4608 }, "reportedSentences": 15, "clips": 9322, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.25, "": 0.37, "thirties": 0.26, "fourties": 0.04, "sixties": 0.01, "teens": 0.02, "fifties": 0.05 }, "gender": { "male": 0.49, "": 0.37, "female": 0.13, "other": 0 } }, "users": 162, "size": 229204889, "checksum": "e25bcd05f22041bbd2e8ebb4136f1c7a38ba3a55d59d6f9698ec2f97e7b7fc18", "avgDurationSecs": 3.542, "validDurationSecs": 16321.741, "totalHrs": 9.17, "validHrs": 4.53 }, "fi": { "duration": 60106935, "buckets": { "dev": 1584, "invalidated": 194, "other": 5679, "reported": 45, "test": 1726, "train": 2205, "validated": 7232 }, "reportedSentences": 46, "clips": 13105, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.15, "": 0.37, "twenties": 0.1, "fourties": 0.34, "teens": 0.01, "fifties": 0.04, "seventies": 0 }, "gender": { "male": 0.26, "": 0.37, "female": 0.37, "other": 0 } }, "users": 192, "size": 358878983, "checksum": "9ca4fb4ca2bfb9eb0d10dd46469dd3a5ce0cc1a3595e61a82dfd44be67c0e971", "avgDurationSecs": 4.587, "validDurationSecs": 33170.038, "totalHrs": 16.69, "validHrs": 9.21 }, "hu": { "duration": 93883429, "buckets": { "dev": 4634, "invalidated": 831, "other": 2080, "reported": 95, "test": 4627, "train": 6870, "validated": 16172 }, "reportedSentences": 96, "clips": 19083, "splits": { 
"accent": { "": 1 }, "age": { "teens": 0.08, "": 0.29, "thirties": 0.15, "twenties": 0.39, "fifties": 0.06, "fourties": 0.02, "sixties": 0.01 }, "gender": { "male": 0.6, "": 0.29, "female": 0.11 } }, "users": 223, "size": 606662040, "checksum": "5ea1a62667d68a8e33a6758083a3fdb722fb22781996d5a7826f23a4d69c89e4", "avgDurationSecs": 4.92, "validDurationSecs": 79562.061, "totalHrs": 26.07, "validHrs": 22.1 }, "th": { "duration": 1394333760, "buckets": { "dev": 10868, "invalidated": 8420, "other": 196481, "reported": 3943, "test": 10868, "train": 31331, "validated": 129415 }, "reportedSentences": 3943, "clips": 334316, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.21, "": 0.43, "thirties": 0.07, "fourties": 0.04, "teens": 0.05, "fifties": 0.21, "eighties": 0, "sixties": 0 }, "gender": { "male": 0.38, "": 0.43, "female": 0.18, "other": 0.01 } }, "users": 7616, "size": 8111064630, "checksum": "b56ce794693feb2a79ef294a25f9def1bda407a06e0fe209d46a58e13621212d", "avgDurationSecs": 4.171, "validDurationSecs": 539751.922, "totalHrs": 387.31, "validHrs": 149.93 }, "lt": { "duration": 73757780, "buckets": { "dev": 3505, "invalidated": 554, "other": 1423, "reported": 128, "test": 3667, "train": 5157, "validated": 12332 }, "reportedSentences": 128, "clips": 14309, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.33, "": 0.24, "thirties": 0.28, "fifties": 0.05, "sixties": 0.01, "teens": 0.03, "fourties": 0.05 }, "gender": { "male": 0.62, "": 0.24, "female": 0.14 } }, "users": 260, "size": 453243619, "checksum": "10a1fe7a63972122c308ffb4cb37e6d1fc7a888ffb667aad9fa9e9993fe79bcc", "avgDurationSecs": 5.155, "validDurationSecs": 63567.052, "totalHrs": 20.48, "validHrs": 17.65 }, "lg": { "duration": 1720048347, "buckets": { "dev": 12660, "invalidated": 38089, "other": 5922, "reported": 6039, "test": 12717, "train": 55020, "validated": 252222 }, "reportedSentences": 6034, "clips": 296233, "splits": { "accent": { "": 1 }, "age": { "": 0.27, "thirties": 0.22, "twenties": 
0.41, "fourties": 0.05, "fifties": 0.03, "teens": 0.01, "nineties": 0, "sixties": 0.01 }, "gender": { "": 0.27, "female": 0.4, "male": 0.34 } }, "users": 487, "size": 10083327282, "checksum": "48451a611b86562bda710fd350b5fcf767921ea9bb1e3d6e78e51caf42a6efa5", "avgDurationSecs": 5.806, "validDurationSecs": 1464502.72, "totalHrs": 477.79, "validHrs": 406.8 }, "hi": { "duration": 63918132, "buckets": { "dev": 2178, "invalidated": 670, "other": 3280, "reported": 110, "test": 2839, "train": 4321, "validated": 9367 }, "reportedSentences": 111, "clips": 13317, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.34, "fourties": 0.03, "": 0.32, "thirties": 0.27, "teens": 0.01, "fifties": 0.01, "sixties": 0 }, "gender": { "male": 0.64, "female": 0.04, "": 0.33 } }, "users": 318, "size": 377781828, "checksum": "21364b7526eb32b58502f3ba897009127b5375beda9a1b8e4558a9aa0bbdb08e", "avgDurationSecs": 4.8, "validDurationSecs": 44959.161, "totalHrs": 17.75, "validHrs": 12.48 }, "bas": { "duration": 9991980, "buckets": { "dev": 457, "invalidated": 483, "other": 109, "reported": 7, "test": 444, "train": 763, "validated": 1664 }, "reportedSentences": 8, "clips": 2256, "splits": { "accent": { "": 1 }, "age": { "": 0.98, "fourties": 0.01, "teens": 0.01 }, "gender": { "": 0.98, "female": 0.02 } }, "users": 32, "size": 55578662, "checksum": "b7c92d2be66bbe18fc49a4e64e22c643a62483d25644208c9b6d45c2216e076d", "avgDurationSecs": 4.429, "validDurationSecs": 7369.971, "totalHrs": 2.77, "validHrs": 2.04 }, "sk": { "duration": 69713676, "buckets": { "dev": 2240, "invalidated": 713, "other": 183, "reported": 30, "test": 2241, "train": 3029, "validated": 16544 }, "reportedSentences": 31, "clips": 17440, "splits": { "accent": { "": 1 }, "age": { "": 0.53, "thirties": 0.22, "twenties": 0.04, "fourties": 0.1, "teens": 0.11 }, "gender": { "": 0.52, "male": 0.37, "female": 0.09, "other": 0.01 } }, "users": 143, "size": 390501164, "checksum": 
"119e70bd5b08d7ead030f9a50cdf70122df080975d53bf1794137b030328a6ff", "avgDurationSecs": 3.997, "validDurationSecs": 66132.056, "totalHrs": 19.36, "validHrs": 18.37 }, "kmr": { "duration": 194365584, "buckets": { "dev": 2375, "invalidated": 1570, "other": 2728, "reported": 637, "test": 2398, "train": 2838, "validated": 39641 }, "reportedSentences": 638, "clips": 43939, "splits": { "accent": { "": 1 }, "age": { "": 0.5, "twenties": 0.3, "thirties": 0.06, "fourties": 0.04, "fifties": 0.09, "teens": 0.02, "sixties": 0 }, "gender": { "": 0.5, "male": 0.34, "female": 0.17 } }, "users": 309, "size": 1036731522, "checksum": "134ca34be64ae928ac451ca33d09342227b0ba6ff8b8e5499f51c5f31b41ebc3", "avgDurationSecs": 4.424, "validDurationSecs": 175353.242, "totalHrs": 53.99, "validHrs": 48.7 }, "bg": { "duration": 46476000, "buckets": { "dev": 915, "invalidated": 390, "other": 2062, "reported": 145, "test": 1900, "train": 3161, "validated": 5987 }, "reportedSentences": 146, "clips": 8439, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.33, "thirties": 0.07, "": 0.42, "twenties": 0.17, "teens": 0.01, "sixties": 0 }, "gender": { "male": 0.52, "female": 0.06, "": 0.42 } }, "users": 69, "size": 271306749, "checksum": "f532e435bd3df3422db84b13311068a819dc4685e4973ea847c2c8b41fa0d968", "avgDurationSecs": 5.507, "validDurationSecs": 32972.131, "totalHrs": 12.91, "validHrs": 9.15 }, "kk": { "duration": 6733260, "buckets": { "dev": 379, "invalidated": 195, "other": 0, "reported": 22, "test": 384, "train": 401, "validated": 1169 }, "reportedSentences": 23, "clips": 1364, "splits": { "accent": { "": 1 }, "age": { "": 0.51, "thirties": 0.03, "twenties": 0.31, "teens": 0.06, "fifties": 0.1 }, "gender": { "": 0.52, "male": 0.46, "female": 0.03 } }, "users": 80, "size": 38650960, "checksum": "6cc9593b426d0ed39a7e8e10e576031ee3436b6f00bd8327a57da2653fa32fdc", "avgDurationSecs": 4.936, "validDurationSecs": 5770.661, "totalHrs": 1.87, "validHrs": 1.6 }, "ba": { "duration": 958413996, 
"buckets": { "dev": 14559, "invalidated": 7892, "other": 45, "reported": 866, "test": 14526, "train": 118983, "validated": 208602 }, "reportedSentences": 863, "clips": 216539, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.17, "": 0.3, "fourties": 0.06, "fifties": 0.05, "twenties": 0.17, "sixties": 0.2, "seventies": 0, "teens": 0.04 }, "gender": { "male": 0.3, "": 0.3, "female": 0.4 } }, "users": 888, "size": 5376316576, "checksum": "834d2433e1fb541a7b3310f5f1cde1161bd7653e5f7d88338fbc4757b06f2818", "avgDurationSecs": 4.426, "validDurationSecs": 923284.38, "totalHrs": 266.22, "validHrs": 256.46 }, "gl": { "duration": 60356232, "buckets": { "dev": 2396, "invalidated": 317, "other": 3748, "reported": 190, "test": 2556, "train": 3402, "validated": 8413 }, "reportedSentences": 191, "clips": 12478, "splits": { "accent": { "": 1 }, "age": { "": 0.38, "thirties": 0.39, "fifties": 0.09, "twenties": 0.07, "fourties": 0.06, "teens": 0, "sixties": 0.01 }, "gender": { "": 0.39, "male": 0.4, "female": 0.21, "other": 0.01 } }, "users": 162, "size": 347443062, "checksum": "05d320fa03a7003f26952392d9a17da6ea3252679d0c08e4110c1268157d0f98", "avgDurationSecs": 4.837, "validDurationSecs": 40693.779, "totalHrs": 16.76, "validHrs": 11.3 }, "ug": { "duration": 261275040, "buckets": { "dev": 2748, "invalidated": 1969, "other": 2907, "reported": 184, "test": 2747, "train": 3293, "validated": 38878 }, "reportedSentences": 185, "clips": 43754, "splits": { "accent": { "": 1 }, "age": { "": 0.58, "fifties": 0.02, "twenties": 0.13, "thirties": 0.11, "fourties": 0.15, "teens": 0, "eighties": 0.01 }, "gender": { "": 0.58, "male": 0.33, "female": 0.08, "other": 0 } }, "users": 396, "size": 1521167832, "checksum": "de56839d05aab1d73099bbebb8ec54c11e88bb54191d97397f7618bf70276f1a", "avgDurationSecs": 5.971, "validDurationSecs": 232158.226, "totalHrs": 72.57, "validHrs": 64.48 }, "hy-AM": { "duration": 16193592, "buckets": { "dev": 352, "invalidated": 91, "other": 1224, "reported": 27, 
"test": 382, "train": 599, "validated": 1334 }, "reportedSentences": 28, "clips": 2649, "splits": { "accent": { "": 1 }, "age": { "": 0.37, "thirties": 0.14, "twenties": 0.36, "fifties": 0.04, "teens": 0.08 }, "gender": { "": 0.37, "male": 0.23, "female": 0.4 } }, "users": 60, "size": 95058888, "checksum": "37aa8f887538d7bfe54f1ba6554b27c7f1996d4d96e8e74c891aedf1c8598b4a", "avgDurationSecs": 6.113, "validDurationSecs": 8154.87, "totalHrs": 4.49, "validHrs": 2.26 }, "be": { "duration": 4045811760, "buckets": { "dev": 15875, "invalidated": 26057, "other": 27, "reported": 3124, "test": 15879, "train": 347012, "validated": 824875 }, "reportedSentences": 3123, "clips": 850959, "splits": { "accent": { "": 1 }, "age": { "": 0.8, "fourties": 0.06, "thirties": 0.07, "twenties": 0.05, "teens": 0.01, "fifties": 0, "sixties": 0, "seventies": 0 }, "gender": { "": 0.79, "male": 0.09, "female": 0.12, "other": 0 } }, "users": 6408, "size": 22942027342, "checksum": "96f071638a5360172908b6573907284bceaa25b3bde66cb950e2e63a5a61488a", "avgDurationSecs": 4.754, "validDurationSecs": 3921797.614, "totalHrs": 1123.83, "validHrs": 1089.38 }, "ur": { "duration": 500892012, "buckets": { "dev": 3303, "invalidated": 3185, "other": 84521, "reported": 48, "test": 3298, "train": 4128, "validated": 41591 }, "reportedSentences": 48, "clips": 129297, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.89, "": 0.1, "fourties": 0.01, "thirties": 0, "teens": 0, "fifties": 0 }, "gender": { "male": 0.69, "": 0.1, "female": 0.2 } }, "users": 183, "size": 2912285969, "checksum": "72c662684967be9aafb3c1f42231fcdad4926b23bee7898d2ca8e5a5afcfb316", "avgDurationSecs": 3.874, "validDurationSecs": 161122.065, "totalHrs": 139.13, "validHrs": 44.75 }, "gn": { "duration": 11959812, "buckets": { "dev": 201, "invalidated": 82, "other": 1815, "reported": 25, "test": 267, "train": 356, "validated": 824 }, "reportedSentences": 26, "clips": 2721, "splits": { "accent": { "": 1 }, "age": { "": 0.52, "twenties": 0.35, 
"thirties": 0.12, "sixties": 0.01 }, "gender": { "": 0.52, "male": 0.35, "female": 0.13 } }, "users": 69, "size": 66523649, "checksum": "752ec145f7ea7a6f7b676c0c0cfd24336fe2628f24b85113419961280ebfecd0", "avgDurationSecs": 4.395, "validDurationSecs": 3621.788, "totalHrs": 3.32, "validHrs": 1 }, "sr": { "duration": 6711228, "buckets": { "dev": 623, "invalidated": 40, "other": 14, "reported": 18, "test": 659, "train": 1037, "validated": 2321 }, "reportedSentences": 19, "clips": 2375, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.68, "": 0.15, "fifties": 0.01, "fourties": 0.11, "thirties": 0.04, "teens": 0 }, "gender": { "male": 0.39, "": 0.15, "female": 0.46 } }, "users": 56, "size": 37169416, "checksum": "9fc5a4dd4885de3044303e0d356d727831da46cc4c2f770bd98c4bea3dbe4410", "avgDurationSecs": 2.826, "validDurationSecs": 6558.636, "totalHrs": 1.86, "validHrs": 1.82 }, "uz": { "duration": 904201128, "buckets": { "dev": 11570, "invalidated": 13134, "other": 123659, "reported": 1750, "test": 12242, "train": 47082, "validated": 83316 }, "reportedSentences": 1733, "clips": 220109, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.39, "": 0.41, "thirties": 0.01, "teens": 0.18, "fifties": 0, "fourties": 0.01, "nineties": 0 }, "gender": { "male": 0.44, "": 0.41, "female": 0.15, "other": 0 } }, "users": 1932, "size": 5040011949, "checksum": "5d7887f2d36f891e02bd70858549752e8556c70b98855f759e94596572321253", "avgDurationSecs": 4.108, "validDurationSecs": 342259.613, "totalHrs": 251.16, "validHrs": 95.07 }, "mr": { "duration": 95987052, "buckets": { "dev": 1689, "invalidated": 2204, "other": 2649, "reported": 43, "test": 1761, "train": 2284, "validated": 10670 }, "reportedSentences": 44, "clips": 15523, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.13, "sixties": 0, "twenties": 0.28, "": 0.05, "teens": 0.54 }, "gender": { "male": 0.19, "female": 0.76, "": 0.05 } }, "users": 77, "size": 562491867, "checksum": 
"2ceb9169d113c3354a3cc97907b3f36acdbc43c8ebe78c24f77a73afac216787", "avgDurationSecs": 6.184, "validDurationSecs": 65978.345, "totalHrs": 26.66, "validHrs": 18.32 }, "da": { "duration": 35201952, "buckets": { "dev": 1905, "invalidated": 310, "other": 301, "reported": 208, "test": 1912, "train": 2273, "validated": 7645 }, "reportedSentences": 208, "clips": 8256, "splits": { "accent": { "": 1 }, "age": { "": 0.34, "thirties": 0.31, "twenties": 0.2, "sixties": 0, "fourties": 0.11, "fifties": 0.04, "teens": 0 }, "gender": { "": 0.34, "female": 0.08, "male": 0.58 } }, "users": 194, "size": 201862714, "checksum": "6a3338dd0bfec1e945c8f0ea5cf92fee517c48004dbf7389076d20a5abb4dcc4", "avgDurationSecs": 4.264, "validDurationSecs": 32596.769, "totalHrs": 9.77, "validHrs": 9.05 }, "myv": { "duration": 11097324, "buckets": { "dev": 498, "invalidated": 18, "other": 236, "reported": 19, "test": 488, "train": 684, "validated": 1676 }, "reportedSentences": 20, "clips": 1930, "splits": { "accent": { "": 1 }, "age": { "sixties": 0.27, "": 0.38, "thirties": 0.25, "twenties": 0.09, "teens": 0.01 }, "gender": { "male": 0.55, "": 0.38, "female": 0.08 } }, "users": 12, "size": 64916998, "checksum": "be220f8186d52f8c866c84bb6fec0c2094333dfb525ad44bd607159b6b702100", "avgDurationSecs": 5.75, "validDurationSecs": 9636.847, "totalHrs": 3.08, "validHrs": 2.67 }, "nn-NO": { "duration": 3267000, "buckets": { "dev": 193, "invalidated": 13, "other": 79, "reported": 14, "test": 195, "train": 240, "validated": 633 }, "reportedSentences": 15, "clips": 725, "splits": { "accent": { "": 1 }, "age": { "": 0.4, "thirties": 0.36, "twenties": 0.21, "fourties": 0.04 }, "gender": { "": 0.4, "female": 0.2, "male": 0.37, "other": 0.03 } }, "users": 25, "size": 18486398, "checksum": "d848f0d5bdedb577c8b56aab1f396bdf896bfcc7ce687052aa3fa4ce25163a61", "avgDurationSecs": 4.506, "validDurationSecs": 2852.429, "totalHrs": 0.9, "validHrs": 0.79 }, "ha": { "duration": 39436776, "buckets": { "dev": 532, "invalidated": 
161, "other": 5936, "reported": 17, "test": 469, "train": 1945, "validated": 2973 }, "reportedSentences": 17, "clips": 9070, "splits": { "accent": { "": 1 }, "age": { "": 0.18, "thirties": 0.75, "twenties": 0.04, "fourties": 0, "fifties": 0.03 }, "gender": { "": 0.18, "male": 0.55, "female": 0.28 } }, "users": 30, "size": 230742490, "checksum": "484e9ec6bdbe21875f5b91c74bdfabb38ad90d5a4d6453c7579b2a3ed56b2232", "avgDurationSecs": 4.348, "validDurationSecs": 12926.74, "totalHrs": 10.95, "validHrs": 3.59 }, "ckb": { "duration": 405826848, "buckets": { "dev": 4524, "invalidated": 4973, "other": 17429, "reported": 2261, "test": 4526, "train": 6225, "validated": 81567 }, "reportedSentences": 2261, "clips": 103969, "splits": { "accent": { "": 1 }, "age": { "": 0.37, "thirties": 0.13, "twenties": 0.44, "fourties": 0.03, "teens": 0.02, "fifties": 0.02 }, "gender": { "": 0.35, "male": 0.58, "female": 0.07, "other": 0 } }, "users": 1151, "size": 2190261738, "checksum": "742eace3283d4d50f771862f4366e72df5a626493a7bcb0efc38f87bf5d43af2", "avgDurationSecs": 3.903, "validDurationSecs": 318384.119, "totalHrs": 112.72, "validHrs": 88.44 }, "ml": { "duration": 10157040, "buckets": { "dev": 0, "invalidated": 6, "other": 1964, "reported": 112, "test": 80, "train": 414, "validated": 494 }, "reportedSentences": 113, "clips": 2464, "splits": { "accent": { "": 1 }, "age": { "": 0.45, "twenties": 0.49, "thirties": 0.05, "fourties": 0 }, "gender": { "": 0.45, "male": 0.55 } }, "users": 22, "size": 58792967, "checksum": "cf6a3595a571d780db0c9e6b986e519ab047b56bd8b097fb9094dd8fd197682a", "avgDurationSecs": 4.122, "validDurationSecs": 2036.355, "totalHrs": 2.82, "validHrs": 0.56 }, "mdf": { "duration": 1791720, "buckets": { "dev": 48, "invalidated": 6, "other": 77, "reported": 9, "test": 78, "train": 130, "validated": 256 }, "reportedSentences": 10, "clips": 339, "splits": { "accent": { "": 1 }, "age": { "sixties": 0.06, "": 0.59, "fourties": 0.35 }, "gender": { "male": 0.06, "": 0.59, 
"female": 0.35 } }, "users": 10, "size": 10526142, "checksum": "936191d697b2820af25c3a7b71baf1716090510467a45c4c286167f3dde8094c", "avgDurationSecs": 5.285, "validDurationSecs": 1353.039, "totalHrs": 0.49, "validHrs": 0.37 }, "sw": { "duration": 2636057412, "buckets": { "dev": 9196, "invalidated": 10371, "other": 340061, "reported": 1832, "test": 9288, "train": 21141, "validated": 144219 }, "reportedSentences": 1827, "clips": 494651, "splits": { "accent": { "": 1 }, "age": { "": 0.3, "twenties": 0.44, "thirties": 0.14, "teens": 0, "fifties": 0.06, "fourties": 0.05, "sixties": 0.01 }, "gender": { "": 0.27, "male": 0.39, "female": 0.34, "other": 0 } }, "users": 663, "size": 15409263795, "checksum": "fbd27fae537fd3c843635a4518979c678c523b1bda5f109e050ec483af3a66fc", "avgDurationSecs": 5.329, "validDurationSecs": 768561.195, "totalHrs": 732.23, "validHrs": 213.48 }, "sat": { "duration": 3025764, "buckets": { "dev": 0, "invalidated": 11, "other": 281, "reported": 6, "test": 118, "train": 275, "validated": 393 }, "reportedSentences": 7, "clips": 685, "splits": { "accent": { "": 1 }, "age": { "": 0.44, "twenties": 0.41, "fourties": 0.01, "fifties": 0.01, "teens": 0.01, "thirties": 0.11 }, "gender": { "": 0.42, "male": 0.57, "female": 0.01 } }, "users": 9, "size": 16534167, "checksum": "37ea92dc8d330ab09c7f0ce8aedabd867d81530741b5be98a577aaa8c4e08d45", "avgDurationSecs": 4.417, "validDurationSecs": 1735.949, "totalHrs": 0.84, "validHrs": 0.48 }, "tig": { "duration": 103284, "buckets": { "dev": 0, "invalidated": 8, "other": 5, "reported": 0, "test": 0, "train": 10, "validated": 10 }, "reportedSentences": 1, "clips": 23, "splits": { "accent": { "": 1 }, "age": { "": 0.78, "twenties": 0.22 }, "gender": { "": 0.78, "male": 0.22 } }, "users": 5, "size": 603415, "checksum": "17d9f9d54f00aa556bd4cc4daf4cdda80c345fee78b53a54befdfc640b027b8d", "avgDurationSecs": 4.491, "validDurationSecs": 44.906, "totalHrs": 0.02, "validHrs": 0.01 }, "ig": { "duration": 31109796, "buckets": { 
"dev": 2, "invalidated": 2, "other": 5673, "reported": 6, "test": 4, "train": 8, "validated": 14 }, "reportedSentences": 6, "clips": 5689, "splits": { "accent": { "": 1 }, "age": { "": 0.56, "twenties": 0.32, "teens": 0.05, "eighties": 0, "thirties": 0.04, "sixties": 0.02, "fourties": 0 }, "gender": { "": 0.56, "male": 0.13, "female": 0.3 } }, "users": 104, "size": 181718397, "checksum": "23a28c2c5ab68230a17f531213e7010df1263108ae813ccf99503985ecce4813", "avgDurationSecs": 5.468, "validDurationSecs": 76.558, "totalHrs": 8.64, "validHrs": 0.02 }, "nan-tw": { "duration": 33604848, "buckets": { "dev": 936, "invalidated": 257, "other": 9422, "reported": 118, "test": 892, "train": 1040, "validated": 2875 }, "reportedSentences": 119, "clips": 12554, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.2, "": 0.12, "twenties": 0.4, "fourties": 0.23, "teens": 0.03, "fifties": 0.01, "sixties": 0 }, "gender": { "male": 0.53, "": 0.12, "other": 0.12, "female": 0.23 } }, "users": 97, "size": 188163531, "checksum": "fb567620a6f4b449503f2a46a278f858cf2c62c9f817c769620f6e0000f06a4e", "avgDurationSecs": 2.677, "validDurationSecs": 7695.869, "totalHrs": 9.33, "validHrs": 2.13 }, "mhr": { "duration": 424034172, "buckets": { "dev": 12583, "invalidated": 2701, "other": 0, "reported": 35, "test": 12797, "train": 59242, "validated": 86370 }, "reportedSentences": 36, "clips": 89071, "splits": { "accent": { "": 1 }, "age": { "fifties": 0.09, "": 0.11, "sixties": 0.08, "thirties": 0.22, "fourties": 0.18, "twenties": 0.26, "teens": 0.03, "seventies": 0.01 }, "gender": { "male": 0.18, "": 0.11, "female": 0.7 } }, "users": 239, "size": 2402678566, "checksum": "2291217507995a8741511c3e90fbbe69954046442210d50442e8e95aa462ae1e", "avgDurationSecs": 4.761, "validDurationSecs": 411175.707, "totalHrs": 117.78, "validHrs": 114.21 }, "bn": { "duration": 1608861096, "buckets": { "dev": 8226, "invalidated": 6312, "other": 218729, "reported": 970, "test": 8226, "train": 16271, "validated": 35981 }, 
"reportedSentences": 965, "clips": 261022, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.02, "twenties": 0.24, "": 0.69, "teens": 0.04, "fourties": 0 }, "gender": { "male": 0.26, "": 0.69, "female": 0.05, "other": 0 } }, "users": 20630, "size": 9255007781, "checksum": "1430962f663e16ed42869096afca6378cf6194f96979b8194f5b170c2474e5f2", "avgDurationSecs": 6.164, "validDurationSecs": 221776.061, "totalHrs": 446.9, "validHrs": 61.6 }, "tok": { "buckets": { "dev": 1756, "invalidated": 177, "other": 2203, "reported": 89, "test": 1616, "train": 2214, "validated": 6782 }, "reportedSentences": 90, "duration": 33801120, "clips": 9162, "splits": { "accent": { "": 1 }, "age": { "": 0.44, "twenties": 0.19, "teens": 0.26, "thirties": 0.12, "fourties": 0 }, "gender": { "": 0.44, "male": 0.45, "other": 0.03, "female": 0.08 } }, "users": 73, "size": 196707194, "checksum": "7c1e45005bcdc3a63e25cb4cfb9809ea2263868eeb2051435449c451dd50251f", "avgDurationSecs": 3.689, "validDurationSecs": 25020.65, "totalHrs": 9.38, "validHrs": 6.95 }, "yue": { "duration": 139717368, "buckets": { "dev": 2157, "invalidated": 1332, "other": 17354, "reported": 601, "test": 2168, "train": 2547, "validated": 14772 }, "reportedSentences": 594, "clips": 33458, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.18, "": 0.42, "twenties": 0.32, "fourties": 0.04, "sixties": 0, "fifties": 0, "teens": 0.03 }, "gender": { "male": 0.34, "": 0.48, "female": 0.16, "other": 0.02 } }, "users": 584, "size": 790089657, "checksum": "10b86810d19f041c13f22b188872df968443aa382e8bf86a6a6a53ca9ecded17", "avgDurationSecs": 4.176, "validDurationSecs": 61686.442, "totalHrs": 38.81, "validHrs": 17.13 }, "sah": { "duration": 24460932, "buckets": { "dev": 1083, "invalidated": 101, "other": 1, "reported": 2, "test": 1249, "train": 1575, "validated": 3975 }, "reportedSentences": 3, "clips": 4077, "splits": { "accent": { "": 1 }, "age": { "": 0.37, "twenties": 0.03, "fourties": 0.07, "thirties": 0.43, "teens": 0.1, 
"fifties": 0 }, "gender": { "": 0.37, "male": 0.53, "female": 0.1 } }, "users": 53, "size": 186022863, "checksum": "0d424943facc126253edf6a8a459a52f5b3986a4ddec251eebd46a8777a16cb9", "avgDurationSecs": 6, "validDurationSecs": 23848.959, "totalHrs": 6.79, "validHrs": 6.62 }, "mk": { "duration": 979452, "buckets": { "dev": 0, "invalidated": 7, "other": 48, "reported": 4, "test": 15, "train": 114, "validated": 129 }, "reportedSentences": 5, "clips": 184, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.48, "": 0.33, "twenties": 0.11, "teens": 0.08 }, "gender": { "male": 0.67, "": 0.33 } }, "users": 6, "size": 5763649, "checksum": "c319ef45575aaad82e1d9d5629da486dbf789f21dbc04fe0ad43b44d9fbd1af5", "avgDurationSecs": 5.323, "validDurationSecs": 686.681, "totalHrs": 0.27, "validHrs": 0.19 }, "sc": { "duration": 3140316, "buckets": { "dev": 80, "invalidated": 21, "other": 227, "reported": 0, "test": 99, "train": 201, "validated": 388 }, "reportedSentences": 1, "clips": 636, "splits": { "accent": { "": 1 }, "age": { "": 0.29, "thirties": 0.57, "twenties": 0.14 }, "gender": { "": 0.29, "female": 0.65, "male": 0.06 } }, "users": 8, "size": 17925900, "checksum": "6d87934eface17484f592b21ac0944ff5eb08c07c8606041f92771e044a61dbb", "avgDurationSecs": 4.938, "validDurationSecs": 1915.79, "totalHrs": 0.87, "validHrs": 0.53 }, "vot": { "duration": 1025976, "buckets": { "dev": 0, "invalidated": 324, "other": 0, "test": 6, "train": 96, "validated": 102 }, "clips": 426, "splits": { "accent": { "": 1 }, "age": { "": 0.25, "twenties": 0.73, "teens": 0.01 }, "gender": { "": 0.25, "male": 0.75 } }, "users": 5, "size": 7892427, "checksum": "97ab1ad8cfa2c0526abe359ae53d94d873af374755366b430959faafe62967b4", "avgDurationSecs": 2.408, "validDurationSecs": 245.656, "totalHrs": 0.28, "validHrs": 0.06 }, "az": { "duration": 611820, "buckets": { "dev": 20, "invalidated": 28, "other": 1, "test": 22, "train": 39, "validated": 81 }, "clips": 110, "splits": { "accent": { "": 1 }, "age": { "": 
0.42, "twenties": 0.55, "fourties": 0.03 }, "gender": { "": 0.42, "male": 0.58 } }, "users": 14, "size": 3573693, "checksum": "3e571071448c82c67b14a3a8fd9ddacb67152542f3b40159fac1da25a9c48fcd", "avgDurationSecs": 5.562, "validDurationSecs": 450.522, "totalHrs": 0.16, "validHrs": 0.12 }, "ast": { "duration": 921168, "buckets": { "dev": 0, "invalidated": 0, "other": 214, "test": 0, "train": 0, "validated": 0 }, "clips": 214, "splits": { "accent": { "": 1 }, "age": { "": 1 }, "gender": { "": 1 } }, "users": 2, "size": 5399531, "checksum": "2e13bf7a4f09d4b89ef269360faf1a30fa918c8dd81f28ee7e935e668177decb", "avgDurationSecs": 4.305, "validDurationSecs": 0, "totalHrs": 0.25, "validHrs": 0 }, "ne-NP": { "duration": 515232, "buckets": { "dev": 0, "invalidated": 1, "other": 122, "test": 1, "train": 4, "validated": 5 }, "clips": 128, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.04, "": 0.96 }, "gender": { "male": 0.04, "": 0.96 } }, "users": 4, "size": 2927867, "checksum": "5c571f22daeb4c27cb4c668790d769bb47f4a5dd45d18abf08c4667458ef77dc", "avgDurationSecs": 4.025, "validDurationSecs": 20.126, "totalHrs": 0.14, "validHrs": 0 } }, "totalDuration": 74941200000, "totalValidDurationSecs": 54841011, "totalHrs": 20817, "totalValidHrs": 15234 } ================================================ FILE: datasets/scripted-speech/cv-corpus-10.0-delta-2022-07-04.json ================================================ { "locales": { "en": { "duration": 348687468, "reportedSentences": 253, "clips": 63939, "users": 2705, "size": 2029487644, "checksum": "b82354bf4ff7a62568e071dbba3a48160f7368ed94890fd57f466a85c27e0511", "avgDurationSecs": 5.155, "validDurationSecs": 183220.9, "totalHrs": 96.86, "validHrs": 50.9 }, "fa": { "duration": 14904748, "reportedSentences": 39, "clips": 3957, "users": 30, "size": 85189844, "checksum": "c298ceacbe35edbc0ed948c068afbd87c2076768a5e70ebe7f4166f7e053e4a8", "avgDurationSecs": 4, "validDurationSecs": 14033.62, "totalHrs": 4.15, "validHrs": 3.9 }, 
"fr": { "reportedSentences": 179, "duration": 120206124, "clips": 22871, "users": 219, "size": 703389081, "checksum": "a7f6596f3f679fca1a9ae24f319b5feda67bcea5d1a514c6f6f32ae65f88aa27", "avgDurationSecs": 4.997, "validDurationSecs": 72958.71, "totalHrs": 33.39, "validHrs": 20.27 }, "es": { "reportedSentences": 82, "duration": 87761484, "clips": 16163, "users": 614, "size": 509558554, "checksum": "29921567c0b8f98953295ff53d69bd3c0b6c6beb746791472daba1c87199ae66", "avgDurationSecs": 5.053, "validDurationSecs": 13675.14, "totalHrs": 24.38, "validHrs": 3.8 }, "sl": { "reportedSentences": 6, "duration": 3233196, "clips": 947, "users": 9, "size": 18764069, "checksum": "a461623bfc1deed48ce7ef2ec4b64fd724006472b1e481e8c3dd47e3290c23f3", "avgDurationSecs": 3.814, "validDurationSecs": 554.62, "totalHrs": 0.9, "validHrs": 0.16 }, "kab": { "reportedSentences": 36, "duration": 10593576, "clips": 2494, "users": 10, "size": 55516139, "checksum": "c0b4cec3a040eaf0abf4ab5cc434aef152939f901cb8fd5e7a46f0dc03d280cd", "avgDurationSecs": 3.328, "validDurationSecs": 7479.3, "totalHrs": 2.94, "validHrs": 2.08 }, "cy": { "reportedSentences": 0, "duration": 2239596, "clips": 373, "users": 10, "size": 13145485, "checksum": "83f81318cf77b1f9835762b0f4dc06af083d387bf0524200b8dce8492a16fb56", "avgDurationSecs": 4.843, "validDurationSecs": 1255.49, "totalHrs": 0.62, "validHrs": 0.35 }, "ca": { "reportedSentences": 1075, "duration": 1086014664, "clips": 216035, "users": 2595, "size": 5809675725, "checksum": "e6df7a73ffa2f9b61615c4f8199b44fb7a599cbbbd74a56789758842a6d77f54", "avgDurationSecs": 5.512, "validDurationSecs": 510777.98, "totalHrs": 301.67, "validHrs": 141.89 }, "de": { "reportedSentences": 717, "duration": 165497976, "clips": 29099, "users": 299, "size": 960858837, "checksum": "a293c3e341f6aaf25019bc852d6475d0ef2f85c2835f8481f7cc65dbd0bde2fa", "avgDurationSecs": 5.159, "validDurationSecs": 155145.01, "totalHrs": 45.97, "validHrs": 43.1 }, "tt": { "duration": 422100, 
"reportedSentences": 0, "clips": 93, "users": 4, "size": 2408354, "checksum": "e5b1670372444451dbd146a2bb911144eca59a843f59c347c98f75ee5c5ac507", "avgDurationSecs": 3.747, "validDurationSecs": 368.68, "totalHrs": 0.12, "validHrs": 0.1 }, "ta": { "duration": 68076324, "reportedSentences": 67, "clips": 10547, "users": 31, "size": 399769191, "checksum": "8d8427ca7d2735131f5b77afc4ecd01342f617487f76a64c5eb2a597bc74f9a2", "avgDurationSecs": 6.211, "validDurationSecs": 11889.52, "totalHrs": 18.91, "validHrs": 3.3 }, "ru": { "duration": 28183392, "reportedSentences": 30, "clips": 5741, "users": 103, "size": 165197914, "checksum": "78ea4fa2de776edc8ecae52440d0e4fc4669eb962ae8150f91078c907c5d049a", "avgDurationSecs": 5.161, "validDurationSecs": 17910.15, "totalHrs": 7.82, "validHrs": 4.97 }, "nl": { "duration": 5887440, "reportedSentences": 7, "clips": 1127, "users": 32, "size": 34580925, "checksum": "d9053d64a7e5fc2d3853c0d920c73229650cb4d88130c9c9b9939f1fa582fe4a", "avgDurationSecs": 4.316, "validDurationSecs": 4588.35, "totalHrs": 1.64, "validHrs": 1.27 }, "it": { "duration": 22953780, "reportedSentences": 75, "clips": 4010, "users": 95, "size": 133391324, "checksum": "ae01f1b6fd93a65d964c274c4aee182ea00d861f0a4f9e98bc5c06fe55a4a1b5", "avgDurationSecs": 5.356, "validDurationSecs": 21773.05, "totalHrs": 6.37, "validHrs": 6.05 }, "eu": { "duration": 1302084, "reportedSentences": 17, "clips": 223, "users": 8, "size": 7592880, "checksum": "a0b8c72bdcce0d23e9a58ba71681e5a1c5093504159c5f69d699a4b9da42cb85", "avgDurationSecs": 5.191, "validDurationSecs": 259.22, "totalHrs": 0.36, "validHrs": 0.08 }, "tr": { "duration": 28867104, "reportedSentences": 27, "clips": 8649, "users": 35, "size": 168127776, "checksum": "2d12424877b65b3b39e6031c368a3d60579e347d31593558c57e0a3dc11b3791", "avgDurationSecs": 3.654, "validDurationSecs": 28120.92, "totalHrs": 8.02, "validHrs": 7.81 }, "ar": { "duration": 71301762451, "reportedSentences": 16, "clips": 5202, "users": 35, "size": 114366388, 
"checksum": "4f81fec5272134b6e7de8195fc94e629975c3142691587ad3be961c3cb12a686", "avgDurationSecs": 4.164, "validDurationSecs": 1872.37, "totalHrs": 5.87, "validHrs": 0.52 }, "zh-TW": { "duration": 16178400, "reportedSentences": 1, "clips": 3900, "users": 59, "size": 78208489, "checksum": "c4c488f69eadae226b056396e3f6dc40baf89674240afb101e19d01ee2922614", "avgDurationSecs": 3.278, "validDurationSecs": 6972.18, "totalHrs": 4.5, "validHrs": 1.93 }, "br": { "duration": 933840, "reportedSentences": 38, "clips": 235, "users": 5, "size": 5392101, "checksum": "6275bfb2f17e9857a61d25185e47d83f9486f3b5a08e67219f269e0559410f3a", "avgDurationSecs": 3.085, "validDurationSecs": 149.84, "totalHrs": 0.26, "validHrs": 0.04 }, "pt": { "duration": 37053540, "reportedSentences": 80, "clips": 9099, "users": 109, "size": 213021717, "checksum": "3ceeb91b12a07cbf8cf983b367573a35fdb89b39d925c086b595d1d32ca807cb", "avgDurationSecs": 4.189, "validDurationSecs": 17959.14, "totalHrs": 10.29, "validHrs": 4.99 }, "eo": { "duration": -6705988902, "reportedSentences": 13, "clips": 4593, "users": 43, "size": 173814790, "checksum": "2179bad54bb2b69cd12964bc2f6533b9538b7a3f943f9e65f8f9a463796fd901", "avgDurationSecs": 6.068, "validDurationSecs": 467, "totalHrs": -434.44, "validHrs": 442.34 }, "zh-CN": { "duration": 1255890492, "reportedSentences": 56, "clips": 281511, "users": 828, "size": 7236025294, "checksum": "5862d3e55aaa507b62c6e81343bb17ae784a32c1a36e9facf4b2713ecb5da4ce", "avgDurationSecs": 4.603, "validDurationSecs": -19192.42, "totalHrs": 348.86, "validHrs": -5.33 }, "id": { "duration": 3512736, "reportedSentences": 8, "clips": 988, "users": 22, "size": 20488039, "checksum": "33d49848b4e5341d166642ce175dc7c6128114307c4f3cab881be4e34b0703f7", "avgDurationSecs": 4.055, "validDurationSecs": 116.79, "totalHrs": 0.97, "validHrs": 0.03 }, "ia": { "duration": 327852, "reportedSentences": 1, "clips": 60, "users": 1, "size": 1932757, "checksum": 
"581ed384b3498194710ae266a83bfc220df575893f57db49907db29cd8fcfbdf", "avgDurationSecs": 4.183, "validDurationSecs": 156.84, "totalHrs": 0.09, "validHrs": 0.05 }, "lv": { "duration": 360684, "reportedSentences": 2, "clips": 102, "users": 2, "size": 2103521, "checksum": "6b245bc30f4a09415c0234545068084692a919b657df7281b414931e331fdf2f", "avgDurationSecs": 3.412, "validDurationSecs": 140.39, "totalHrs": 0.1, "validHrs": 0.04 }, "ja": { "duration": 7523784, "reportedSentences": 24, "clips": 1433, "users": 33, "size": 44204310, "checksum": "7196d23c02058a545c921539aa553bc6655c692274bca88cf0941f7e30018826", "avgDurationSecs": 4.798, "validDurationSecs": 6870.81, "totalHrs": 2.09, "validHrs": 1.91 }, "rw": { "duration": 189180, "reportedSentences": 6, "clips": 58, "users": 8, "size": 1206683, "checksum": "824d4a62cc4ce8a5e3fe0b4c24bd5a191a286dff50d4f4ccc4c724b342413a4b", "avgDurationSecs": 5.008, "validDurationSecs": 240.49, "totalHrs": 0.05, "validHrs": 0.06 }, "sv-SE": { "duration": 2524896, "reportedSentences": 5, "clips": 530, "users": 14, "size": 14829390, "checksum": "ee19ce93f376d4980e4afb32f7b7ac04b74fd257547718a43110d530789a1e95", "avgDurationSecs": 3.956, "validDurationSecs": 1088.7, "totalHrs": 0.7, "validHrs": 0.3 }, "cnh": { "duration": 2700, "reportedSentences": 0, "clips": 1, "users": 1, "size": 18315, "checksum": "76a3e555e9503e94799077f48b1ef84acfef3f1f19fd9cf1f6a30dc7c10b48fa", "avgDurationSecs": 3.564, "validDurationSecs": -0.37, "totalHrs": 0, "validHrs": 0 }, "et": { "duration": 17878644, "reportedSentences": 12, "clips": 2658, "users": 15, "size": 105537453, "checksum": "895f05b75825d23da787a223cd1c84f44b07bba18bec1e34a8a46b7afc642e56", "avgDurationSecs": 6.756, "validDurationSecs": 10884.46, "totalHrs": 4.96, "validHrs": 3.03 }, "ky": { "duration": 442404, "reportedSentences": 2, "clips": 102, "users": 8, "size": 2545815, "checksum": "b23c82a73e969b2bb26b7ffe4e4dc327b4bea59514fcc054be492192bc8ea493", "avgDurationSecs": 4.542, "validDurationSecs": 
205.12, "totalHrs": 0.13, "validHrs": 0.06 }, "ro": { "duration": 4459500, "reportedSentences": 7, "clips": 855, "users": 15, "size": 26220871, "checksum": "c9da340807a83058beea735c8e3290a327886933ac6b158744de9ecf6d44c87f", "avgDurationSecs": 3.992, "validDurationSecs": 947.78, "totalHrs": 1.24, "validHrs": 0.26 }, "hsb": { "duration": 104004, "reportedSentences": 15, "clips": 15, "users": 1, "size": 413576, "checksum": "f3cb738b99ef8700809e4787c7877dbd90942b6b197375d212c3c5951ad0b32b", "avgDurationSecs": 6.109, "validDurationSecs": 65.52, "totalHrs": 0.03, "validHrs": 0.01 }, "el": { "duration": 7146972, "reportedSentences": 0, "clips": 1697, "users": 16, "size": 41609522, "checksum": "19d90367ab2be112a8400b5c39bdc718d45c07c7fc311ca29966145bb75931bd", "avgDurationSecs": 4.132, "validDurationSecs": 2958.64, "totalHrs": 1.99, "validHrs": 0.83 }, "cs": { "duration": 3449736, "reportedSentences": 4, "clips": 871, "users": 33, "size": 20010919, "checksum": "f3e5120b45c0c1a469cfea6008fcee7b102c245b3e772128e2ae189a2799feca", "avgDurationSecs": 4.327, "validDurationSecs": 3061.11, "totalHrs": 0.96, "validHrs": 0.85 }, "pl": { "duration": 5775120, "reportedSentences": 7, "clips": 1059, "users": 38, "size": 33484538, "checksum": "f9d491272e90ef9a10451779b9a4ceed52be45e9ceee80e6a3fc2b9689dc348c", "avgDurationSecs": 4.477, "validDurationSecs": 12307.82, "totalHrs": 1.6, "validHrs": 3.42 }, "rm-sursilv": { "duration": 326844, "reportedSentences": 2, "clips": 65, "users": 0, "size": 1930334, "checksum": "e3bfa984c4cd61b2b9cf3dca1051ae64e49815b5e5786acfbf9a56242bfc9f1a", "avgDurationSecs": 5.48, "validDurationSecs": 223.62, "totalHrs": 0.09, "validHrs": 0.06 }, "rm-vallader": { "duration": 0, "reportedSentences": 1, "clips": 0, "users": 0, "size": 1717, "checksum": "1e12ab4b075f336e5937333ef9b48bd61b0bbdb81b19ad1c43f94b73aee1c693", "avgDurationSecs": 5.807, "validDurationSecs": 17.42, "totalHrs": 0, "validHrs": 0.01 }, "mn": { "duration": 529920, "reportedSentences": 0, "clips": 
100, "users": 7, "size": 2996787, "checksum": "7b8861f86e4414d444b8c1e2a64db45da8b82eeeac034ec426615ad4cdbbd822", "avgDurationSecs": 5.474, "validDurationSecs": 262.09, "totalHrs": 0.14, "validHrs": 0.07 }, "zh-HK": { "duration": 13566780, "reportedSentences": 12, "clips": 3504, "users": 38, "size": 77800236, "checksum": "dbd0a4254447b5de319be1be5d4262ec9bfe68da0d7235430469c33d9298c985", "avgDurationSecs": 4.213, "validDurationSecs": 6238.54, "totalHrs": 3.76, "validHrs": 1.73 }, "ab": { "duration": 669816, "reportedSentences": 1, "clips": 109, "users": 0, "size": 3971809, "checksum": "404ea029bc6cfca120fe9c1b181cee4ad23957621ab18c0401a8dc732877b053", "avgDurationSecs": 5.127, "validDurationSecs": 217.56, "totalHrs": 0.19, "validHrs": 0.06 }, "cv": { "duration": 3923136, "reportedSentences": 1, "clips": 671, "users": 2, "size": 22877938, "checksum": "689f89b2db0b97c1fbf5cae853b438d22d151f66fe2b1c9b88dd187462f5df28", "avgDurationSecs": 5.038, "validDurationSecs": 4829.93, "totalHrs": 1.09, "validHrs": 1.34 }, "uk": { "duration": 15494112, "reportedSentences": 9, "clips": 3750, "users": 38, "size": 90538999, "checksum": "b5bd9d8d49d96e5ef865a59e5ea00e16473be254cbbcc3874e8b58d7162600cb", "avgDurationSecs": 4.786, "validDurationSecs": 10889.69, "totalHrs": 4.3, "validHrs": 3.03 }, "mt": { "duration": 171108, "reportedSentences": 2, "clips": 35, "users": 1, "size": 1006833, "checksum": "b5ef5b1715cc505e3202161a72ed16371856e133d8d61ee17d996bb3a29ff3f6", "avgDurationSecs": 4.737, "validDurationSecs": 92.61, "totalHrs": 0.05, "validHrs": 0.02 }, "as": { "duration": 6479856, "reportedSentences": 0, "clips": 1076, "users": 1, "size": 37214046, "checksum": "4c6eca577436845cf0fe990dfc5a396c5ccfb9df89c0dd3a17825604d6ee320c", "avgDurationSecs": 5.681, "validDurationSecs": 2822.42, "totalHrs": 1.8, "validHrs": 0.78 }, "ka": { "duration": 120132, "reportedSentences": 4, "clips": 35, "users": 4, "size": 705711, "checksum": 
"042994d0aad43cd28261476019f9c00aa704c9d6738102beeda3345e9741b04f", "avgDurationSecs": 5.326, "validDurationSecs": 71.18, "totalHrs": 0.04, "validHrs": 0.02 }, "fy-NL": { "duration": 3485916, "reportedSentences": 23, "clips": 688, "users": 5, "size": 20094683, "checksum": "93281e617fbfe22a4f677bc2039ac91b343b43fc4fb952fb0e6d8a477878820d", "avgDurationSecs": 4.978, "validDurationSecs": 128.61, "totalHrs": 0.97, "validHrs": 0.03 }, "dv": { "duration": 1411884, "reportedSentences": 0, "clips": 236, "users": 7, "size": 7113857, "checksum": "467a1a1cf204e4b8a2f713e694f83db9705c35131f1ef66f0a90f597858be0c7", "avgDurationSecs": 5.048, "validDurationSecs": 282.82, "totalHrs": 0.4, "validHrs": 0.08 }, "pa-IN": { "duration": 332316, "reportedSentences": 12, "clips": 69, "users": 7, "size": 1927149, "checksum": "f397560fdf6a0d61d756dc94c2ebe03f4b9f02336561e5e80f3117f1a8a6c8a3", "avgDurationSecs": 4.825, "validDurationSecs": 125.14, "totalHrs": 0.1, "validHrs": 0.03 }, "vi": { "duration": 1050624, "reportedSentences": 0, "clips": 277, "users": 10, "size": 6133120, "checksum": "a4a0f2d2dfc35ef4317c334713762aaa51b53f7b26e1c9f0e860b5ed0cc1f31a", "avgDurationSecs": 3.967, "validDurationSecs": 752.68, "totalHrs": 0.3, "validHrs": 0.21 }, "or": { "duration": 1341288, "reportedSentences": 5, "clips": 264, "users": 1, "size": 7830367, "checksum": "00672631bee5854227d87b0f84ba5f19b8263477d34028dc086d9eeb2514e6a8", "avgDurationSecs": 5.048, "validDurationSecs": 36.78, "totalHrs": 0.37, "validHrs": 0.01 }, "ga-IE": { "duration": 795492, "reportedSentences": 5, "clips": 186, "users": 5, "size": 4672055, "checksum": "e25bcd05f22041bbd2e8ebb4136f1c7a38ba3a55d59d6f9698ec2f97e7b7fc18", "avgDurationSecs": 3.542, "validDurationSecs": 358.16, "totalHrs": 0.22, "validHrs": 0.1 }, "fi": { "duration": 4769604, "reportedSentences": 7, "clips": 941, "users": 14, "size": 28009942, "checksum": "9ca4fb4ca2bfb9eb0d10dd46469dd3a5ce0cc1a3595e61a82dfd44be67c0e971", "avgDurationSecs": 4.587, 
"validDurationSecs": 1648.14, "totalHrs": 1.32, "validHrs": 0.46 }, "hu": { "duration": 2222676, "reportedSentences": 9, "clips": 411, "users": 14, "size": 13054968, "checksum": "5ea1a62667d68a8e33a6758083a3fdb722fb22781996d5a7826f23a4d69c89e4", "avgDurationSecs": 4.92, "validDurationSecs": 3021.01, "totalHrs": 0.61, "validHrs": 0.84 }, "th": { "duration": 90219312, "reportedSentences": 82, "clips": 22522, "users": 156, "size": 520392367, "checksum": "b56ce794693feb2a79ef294a25f9def1bda407a06e0fe209d46a58e13621212d", "avgDurationSecs": 4.171, "validDurationSecs": 15975.52, "totalHrs": 25.06, "validHrs": 4.44 }, "lt": { "duration": 273888, "reportedSentences": 6, "clips": 57, "users": 8, "size": 1591278, "checksum": "10a1fe7a63972122c308ffb4cb37e6d1fc7a888ffb667aad9fa9e9993fe79bcc", "avgDurationSecs": 5.155, "validDurationSecs": 720.07, "totalHrs": 0.07, "validHrs": 0.2 }, "lg": { "duration": 24624, "reportedSentences": 0, "clips": 5, "users": 0, "size": 290226, "checksum": "48451a611b86562bda710fd350b5fcf767921ea9bb1e3d6e78e51caf42a6efa5", "avgDurationSecs": 5.806, "validDurationSecs": 83.34, "totalHrs": 0.01, "validHrs": 0.02 }, "hi": { "duration": 1342548, "reportedSentences": 7, "clips": 244, "users": 19, "size": 7905284, "checksum": "21364b7526eb32b58502f3ba897009127b5375beda9a1b8e4558a9aa0bbdb08e", "avgDurationSecs": 4.8, "validDurationSecs": 1209.38, "totalHrs": 0.37, "validHrs": 0.33 }, "bas": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": -893, "checksum": "b7c92d2be66bbe18fc49a4e64e22c643a62483d25644208c9b6d45c2216e076d", "avgDurationSecs": 4.429, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "sk": { "duration": 1360404, "reportedSentences": 0, "clips": 358, "users": 7, "size": 7875631, "checksum": "119e70bd5b08d7ead030f9a50cdf70122df080975d53bf1794137b030328a6ff", "avgDurationSecs": 3.997, "validDurationSecs": 1388.12, "totalHrs": 0.38, "validHrs": 0.39 }, "kmr": { "duration": 2453760, "reportedSentences": 60, "clips": 
546, "users": 10, "size": 13959328, "checksum": "134ca34be64ae928ac451ca33d09342227b0ba6ff8b8e5499f51c5f31b41ebc3", "avgDurationSecs": 4.424, "validDurationSecs": 5311.4, "totalHrs": 0.69, "validHrs": 1.47 }, "bg": { "duration": 464940, "reportedSentences": 0, "clips": 80, "users": 4, "size": 2745094, "checksum": "f532e435bd3df3422db84b13311068a819dc4685e4973ea847c2c8b41fa0d968", "avgDurationSecs": 5.507, "validDurationSecs": 633.94, "totalHrs": 0.13, "validHrs": 0.17 }, "kk": { "duration": 177444, "reportedSentences": 15, "clips": 54, "users": 1, "size": 1036939, "checksum": "6cc9593b426d0ed39a7e8e10e576031ee3436b6f00bd8327a57da2653fa32fdc", "avgDurationSecs": 4.936, "validDurationSecs": 205.72, "totalHrs": 0.05, "validHrs": 0.06 }, "ba": { "duration": 208260, "reportedSentences": 0, "clips": 36, "users": 0, "size": 1275103, "checksum": "834d2433e1fb541a7b3310f5f1cde1161bd7653e5f7d88338fbc4757b06f2818", "avgDurationSecs": 4.426, "validDurationSecs": 237.45, "totalHrs": 0.06, "validHrs": 0.06 }, "gl": { "duration": 3843756, "reportedSentences": 25, "clips": 763, "users": 22, "size": 22456454, "checksum": "05d320fa03a7003f26952392d9a17da6ea3252679d0c08e4110c1268157d0f98", "avgDurationSecs": 4.837, "validDurationSecs": 1590.91, "totalHrs": 1.07, "validHrs": 0.44 }, "ug": { "duration": 25854804, "reportedSentences": 4, "clips": 4722, "users": 11, "size": 148822135, "checksum": "de56839d05aab1d73099bbebb8ec54c11e88bb54191d97397f7618bf70276f1a", "avgDurationSecs": 5.971, "validDurationSecs": 8764.74, "totalHrs": 7.18, "validHrs": 2.43 }, "hy-AM": { "duration": 31536, "reportedSentences": 2, "clips": 5, "users": 0, "size": 192363, "checksum": "37aa8f887538d7bfe54f1ba6554b27c7f1996d4d96e8e74c891aedf1c8598b4a", "avgDurationSecs": 6.113, "validDurationSecs": 153.31, "totalHrs": 0.01, "validHrs": 0.04 }, "be": { "duration": 247822380, "reportedSentences": 26, "clips": 53268, "users": 92, "size": 1411816938, "checksum": 
"96f071638a5360172908b6573907284bceaa25b3bde66cb950e2e63a5a61488a", "avgDurationSecs": 4.754, "validDurationSecs": 247319.27, "totalHrs": 68.84, "validHrs": 68.7 }, "ur": { "duration": 452521440, "reportedSentences": 13, "clips": 117862, "users": 75, "size": 2629186798, "checksum": "72c662684967be9aafb3c1f42231fcdad4926b23bee7898d2ca8e5a5afcfb316", "avgDurationSecs": 3.874, "validDurationSecs": 117083.06, "totalHrs": 125.7, "validHrs": 32.52 }, "gn": { "duration": 2981592, "reportedSentences": 5, "clips": 671, "users": 7, "size": 17087849, "checksum": "752ec145f7ea7a6f7b676c0c0cfd24336fe2628f24b85113419961280ebfecd0", "avgDurationSecs": 4.395, "validDurationSecs": 1177.96, "totalHrs": 0.83, "validHrs": 0.33 }, "sr": { "duration": 94248, "reportedSentences": 0, "clips": 40, "users": 3, "size": 540976, "checksum": "9fc5a4dd4885de3044303e0d356d727831da46cc4c2f770bd98c4bea3dbe4410", "avgDurationSecs": 2.826, "validDurationSecs": 49.34, "totalHrs": 0.03, "validHrs": 0.02 }, "uz": { "duration": 70874604, "reportedSentences": 69, "clips": 13865, "users": 429, "size": 395971356, "checksum": "5d7887f2d36f891e02bd70858549752e8556c70b98855f759e94596572321253", "avgDurationSecs": 4.108, "validDurationSecs": 25335.81, "totalHrs": 19.69, "validHrs": 7.04 }, "mr": { "duration": 7931412, "reportedSentences": 6, "clips": 1202, "users": 7, "size": 46288478, "checksum": "2ceb9169d113c3354a3cc97907b3f36acdbc43c8ebe78c24f77a73afac216787", "avgDurationSecs": 6.184, "validDurationSecs": 2456.05, "totalHrs": 2.21, "validHrs": 0.68 }, "da": { "duration": 4251636, "reportedSentences": 94, "clips": 1096, "users": 25, "size": 24689961, "checksum": "6a3338dd0bfec1e945c8f0ea5cf92fee517c48004dbf7389076d20a5abb4dcc4", "avgDurationSecs": 4.264, "validDurationSecs": 4460.51, "totalHrs": 1.18, "validHrs": 1.24 }, "myv": { "duration": 2514744, "reportedSentences": 5, "clips": 429, "users": 7, "size": 14753085, "checksum": "be220f8186d52f8c866c84bb6fec0c2094333dfb525ad44bd607159b6b702100", 
"avgDurationSecs": 5.75, "validDurationSecs": 2929.74, "totalHrs": 0.7, "validHrs": 0.81 }, "nn-NO": { "duration": 947808, "reportedSentences": 10, "clips": 211, "users": 7, "size": 4891549, "checksum": "d848f0d5bdedb577c8b56aab1f396bdf896bfcc7ce687052aa3fa4ce25163a61", "avgDurationSecs": 4.506, "validDurationSecs": 664.09, "totalHrs": 0.26, "validHrs": 0.19 }, "ha": { "duration": 964692, "reportedSentences": 3, "clips": 204, "users": 3, "size": 5667061, "checksum": "484e9ec6bdbe21875f5b91c74bdfabb38ad90d5a4d6453c7579b2a3ed56b2232", "avgDurationSecs": 4.348, "validDurationSecs": 616.19, "totalHrs": 0.27, "validHrs": 0.18 }, "ckb": { "duration": 216675072, "reportedSentences": 511, "clips": 54073, "users": 901, "size": 1174860843, "checksum": "742eace3283d4d50f771862f4366e72df5a626493a7bcb0efc38f87bf5d43af2", "avgDurationSecs": 3.903, "validDurationSecs": 147610.73, "totalHrs": 60.18, "validHrs": 41.01 }, "ml": { "duration": 712368, "reportedSentences": 6, "clips": 159, "users": 6, "size": 4039995, "checksum": "cf6a3595a571d780db0c9e6b986e519ab047b56bd8b097fb9094dd8fd197682a", "avgDurationSecs": 4.122, "validDurationSecs": 438.34, "totalHrs": 0.2, "validHrs": 0.12 }, "mdf": { "duration": 0, "reportedSentences": 4, "clips": 0, "users": 0, "size": 580, "checksum": "936191d697b2820af25c3a7b71baf1716090510467a45c4c286167f3dde8094c", "avgDurationSecs": 5.285, "validDurationSecs": 105.71, "totalHrs": 0, "validHrs": 0.03 }, "sw": { "duration": 46233252, "reportedSentences": 1464, "clips": 10616, "users": 94, "size": 269705348, "checksum": "fbd27fae537fd3c843635a4518979c678c523b1bda5f109e050ec483af3a66fc", "avgDurationSecs": 5.329, "validDurationSecs": 125962.06, "totalHrs": 12.84, "validHrs": 34.99 }, "sat": { "duration": 982836, "reportedSentences": 0, "clips": 275, "users": 1, "size": 5722323, "checksum": "37ea92dc8d330ab09c7f0ce8aedabd867d81530741b5be98a577aaa8c4e08d45", "avgDurationSecs": 4.417, "validDurationSecs": 958.64, "totalHrs": 0.28, "validHrs": 0.27 }, "tig": 
{ "duration": 29268, "reportedSentences": 0, "clips": 5, "users": 1, "size": 167260, "checksum": "17d9f9d54f00aa556bd4cc4daf4cdda80c345fee78b53a54befdfc640b027b8d", "avgDurationSecs": 4.491, "validDurationSecs": 3.79, "totalHrs": 0, "validHrs": 0 }, "ig": { "duration": 1185372, "reportedSentences": 1, "clips": 200, "users": 8, "size": 6926095, "checksum": "23a28c2c5ab68230a17f531213e7010df1263108ae813ccf99503985ecce4813", "avgDurationSecs": 5.468, "validDurationSecs": 38.4, "totalHrs": 0.33, "validHrs": 0.01 }, "nan-tw": { "duration": 22239108, "reportedSentences": 99, "clips": 8311, "users": 42, "size": 122814958, "checksum": "fb567620a6f4b449503f2a46a278f858cf2c62c9f817c769620f6e0000f06a4e", "avgDurationSecs": 2.677, "validDurationSecs": 5151.1, "totalHrs": 6.18, "validHrs": 1.43 }, "mhr": { "duration": 187157268, "reportedSentences": 4, "clips": 39849, "users": 59, "size": 1060765720, "checksum": "2291217507995a8741511c3e90fbbe69954046442210d50442e8e95aa462ae1e", "avgDurationSecs": 4.761, "validDurationSecs": 213414.15, "totalHrs": 51.99, "validHrs": 59.28 }, "bn": { "duration": 170748288, "reportedSentences": 272, "clips": 29902, "users": 767, "size": 992617275, "checksum": "1430962f663e16ed42869096afca6378cf6194f96979b8194f5b170c2474e5f2", "avgDurationSecs": 6.164, "validDurationSecs": 17968.74, "totalHrs": 47.43, "validHrs": 4.99 }, "tok": { "reportedSentences": 2, "duration": 8310852, "clips": 1996, "users": 23, "size": 48459960, "checksum": "7c1e45005bcdc3a63e25cb4cfb9809ea2263868eeb2051435449c451dd50251f", "avgDurationSecs": 3.689, "validDurationSecs": 6345.81, "totalHrs": 2.3, "validHrs": 1.77 }, "yue": { "duration": 81398376, "reportedSentences": 498, "clips": 19828, "users": 252, "size": 458600224, "checksum": "10b86810d19f041c13f22b188872df968443aa382e8bf86a6a6a53ca9ecded17", "avgDurationSecs": 4.176, "validDurationSecs": 32240.27, "totalHrs": 22.62, "validHrs": 8.96 }, "sah": { "duration": 253188, "reportedSentences": 2, "clips": 40, "users": 1, 
"size": 1503446, "checksum": "0d424943facc126253edf6a8a459a52f5b3986a4ddec251eebd46a8777a16cb9", "avgDurationSecs": 6, "validDurationSecs": 390.77, "totalHrs": 0.07, "validHrs": 0.11 }, "mk": { "duration": 250452, "clips": 39, "users": 1, "size": 1477107, "checksum": "c319ef45575aaad82e1d9d5629da486dbf789f21dbc04fe0ad43b44d9fbd1af5", "avgDurationSecs": 5.323, "validDurationSecs": 38.12, "totalHrs": 0.07, "validHrs": 0.01 }, "vot": { "duration": 0, "clips": 0, "users": 0, "size": 121, "checksum": "97ab1ad8cfa2c0526abe359ae53d94d873af374755366b430959faafe62967b4", "avgDurationSecs": 2.408, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "az": { "duration": 7344, "clips": 2, "users": 1, "size": 43779, "checksum": "3e571071448c82c67b14a3a8fd9ddacb67152542f3b40159fac1da25a9c48fcd", "avgDurationSecs": 5.562, "validDurationSecs": 2.76, "totalHrs": 0, "validHrs": 0 } }, "totalDuration": 2159111903, "totalValidDurationSecs": 936568, "totalHrs": 600, "totalValidHrs": 261 } ================================================ FILE: datasets/scripted-speech/cv-corpus-11.0-2022-09-21.json ================================================ { "locales": { "en": { "duration": 11152496587, "buckets": { "dev": 16354, "invalidated": 252599, "other": 290846, "reported": 4366, "test": 16354, "train": 948736, "validated": 1618225 }, "reportedSentences": 4294, "clips": 2161670, "splits": { "accent": { "": 1 }, "age": { "": 0.37, "twenties": 0.24, "sixties": 0.04, "thirties": 0.13, "teens": 0.06, "seventies": 0.01, "fourties": 0.1, "fifties": 0.05, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male": 0.45, "female": 0.15, "other": 0.02 } }, "users": 84673, "size": 79751937788, "checksum": "0efd86ca6b40641b55d1411b7d3b1f1ab8626de4b207504953706df201d198a5", "avgDurationSecs": 5.159, "validDurationSecs": 8348752.95, "totalHrs": 3097.91, "validHrs": 2319.09 }, "fa": { "buckets": { "dev": 10288, "invalidated": 13793, "other": 24401, "reported": 2168, "test": 10288, "train": 26951, 
"validated": 309996 }, "reportedSentences": 2159, "duration": 1392397316, "clips": 348190, "splits": { "accent": { "": 1 }, "age": { "": 0.25, "twenties": 0.31, "thirties": 0.37, "fifties": 0.02, "fourties": 0.03, "teens": 0.03, "sixties": 0 }, "gender": { "": 0.22, "male": 0.71, "female": 0.07, "other": 0 } }, "users": 4124, "size": 10237548588, "checksum": "e40247da130302d1dd71e5f25742a0f2f61e8627e7b674c13294967a23f6cf47", "avgDurationSecs": 3.999, "validDurationSecs": 1239661.1, "totalHrs": 386.77, "validHrs": 344.35 }, "fr": { "buckets": { "dev": 16089, "invalidated": 57607, "other": 14359, "reported": 6586, "test": 16089, "train": 485034, "validated": 652051 }, "reportedSentences": 6510, "duration": 3623103766, "clips": 724017, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.17, "thirties": 0.16, "": 0.36, "teens": 0.03, "fourties": 0.14, "fifties": 0.1, "sixties": 0.03, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "male": 0.61, "": 0.28, "female": 0.1, "other": 0.01 } }, "users": 16785, "size": 25947906602, "checksum": "f763f9b1817280cd37db4d4161a9afc76257024d5bb54951a5987464e1e2ebb4", "avgDurationSecs": 5.004, "validDurationSecs": 3262973.706, "totalHrs": 1006.41, "validHrs": 906.38 }, "es": { "buckets": { "dev": 15520, "invalidated": 52095, "other": 1180383, "reported": 2033, "test": 15520, "train": 230467, "validated": 305875 }, "reportedSentences": 2019, "duration": 7475216238, "clips": 1538353, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.1, "": 0.13, "fifties": 0.04, "twenties": 0.56, "teens": 0.08, "fourties": 0.03, "sixties": 0.06, "eighties": 0, "seventies": 0, "nineties": 0 }, "gender": { "male": 0.54, "": 0.13, "other": 0, "female": 0.33 } }, "users": 24516, "size": 47288212406, "checksum": "319ae22d17dc2158322bb189a4938faba0debe653b611a325ee80c672be277a1", "avgDurationSecs": 4.859, "validDurationSecs": 1486318.008, "totalHrs": 2076.44, "validHrs": 412.86 }, "sl": { "buckets": { "dev": 1206, "invalidated": 251, 
"other": 1562, "reported": 34, "test": 1207, "train": 1423, "validated": 9590 }, "reportedSentences": 35, "duration": 43522354, "clips": 11403, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.51, "teens": 0.08, "": 0.2, "sixties": 0.07, "fifties": 0.07, "fourties": 0.02, "thirties": 0.05 }, "gender": { "female": 0.16, "male": 0.64, "": 0.2, "other": 0 } }, "users": 138, "size": 309245843, "checksum": "acbbccd20450efbfb100ab1f5fd0484756d761a3300d1e5eaf8fd403a56f5bbf", "avgDurationSecs": 3.817, "validDurationSecs": 36602.594, "totalHrs": 12.08, "validHrs": 10.16 }, "kab": { "buckets": { "dev": 14994, "invalidated": 19492, "other": 110003, "reported": 8947, "test": 14994, "train": 151534, "validated": 608713 }, "reportedSentences": 8942, "duration": 2462033464, "clips": 738208, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.09, "thirties": 0.29, "": 0.28, "fifties": 0.19, "twenties": 0.12, "eighties": 0, "teens": 0, "sixties": 0.03, "seventies": 0 }, "gender": { "male": 0.54, "": 0.26, "female": 0.2, "other": 0 } }, "users": 1496, "size": 18395506381, "checksum": "09e61eddb933a73606af153a5ed9394390f531093d79b004d27635ee79ecd95b", "avgDurationSecs": 3.335, "validDurationSecs": 2030148.381, "totalHrs": 683.89, "validHrs": 563.93 }, "cy": { "buckets": { "dev": 5247, "invalidated": 4337, "other": 18730, "reported": 156, "test": 5266, "train": 7726, "validated": 88378 }, "reportedSentences": 157, "duration": 541540989, "clips": 111445, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.16, "twenties": 0.13, "sixties": 0.06, "fifties": 0.09, "": 0.43, "thirties": 0.09, "seventies": 0.01, "eighties": 0, "teens": 0.02 }, "gender": { "male": 0.33, "female": 0.24, "": 0.41, "other": 0.01 } }, "users": 1723, "size": 3980577087, "checksum": "b1a5d115e0b65bcab23e1fbbc170ed3b61d74aeba506720202d8b732089136cd", "avgDurationSecs": 4.859, "validDurationSecs": 429452.282, "totalHrs": 150.42, "validHrs": 119.29 }, "ca": { "buckets": { "dev": 16340, "invalidated": 
76690, "other": 481402, "reported": 5357, "test": 16340, "train": 905243, "validated": 1111949 }, "reportedSentences": 5312, "duration": 9194234502, "clips": 1670041, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.06, "fifties": 0.17, "fourties": 0.1, "twenties": 0.05, "": 0.35, "sixties": 0.22, "teens": 0.01, "seventies": 0.04, "eighties": 0, "nineties": 0 }, "gender": { "male": 0.42, "": 0.35, "female": 0.23, "other": 0 } }, "users": 30225, "size": 56931368598, "checksum": "3ae9f3c82dee5102dfd8a3319b4339262980236f1b85336700ba5e7d3dcb4aae", "avgDurationSecs": 5.505, "validDurationSecs": 6121717.886, "totalHrs": 2553.95, "validHrs": 1700.47 }, "de": { "buckets": { "dev": 16082, "invalidated": 47953, "other": 5329, "reported": 8204, "test": 16082, "train": 479008, "validated": 805962 }, "reportedSentences": 8180, "duration": 4439964557, "clips": 859244, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.19, "fourties": 0.17, "": 0.32, "thirties": 0.15, "teens": 0.03, "sixties": 0.03, "fifties": 0.11, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "male": 0.59, "": 0.32, "female": 0.09, "other": 0.01 } }, "users": 17226, "size": 31626133806, "checksum": "94a0c7aeb0d18a280380e5a568d21251ed421f093bc164c9f67d8b28dfbecaaf", "avgDurationSecs": 5.167, "validDurationSecs": 4164640.91, "totalHrs": 1233.32, "validHrs": 1156.84 }, "tt": { "buckets": { "dev": 3062, "invalidated": 388, "other": 252, "reported": 3, "test": 5124, "train": 9778, "validated": 28538 }, "reportedSentences": 4, "duration": 109538738, "clips": 29178, "splits": { "accent": { "": 1 }, "age": { "": 0.2, "thirties": 0.73, "twenties": 0.05, "sixties": 0, "fifties": 0.01, "teens": 0, "fourties": 0, "seventies": 0.01 }, "gender": { "": 0.2, "male": 0.78, "female": 0.02 } }, "users": 223, "size": 809543535, "checksum": "e56d549c0aa66f6df596350347dd38d76778da732d3393d4b2d0281ff68cc8dc", "avgDurationSecs": 3.754, "validDurationSecs": 107136.079, "totalHrs": 30.42, "validHrs": 29.76 }, 
"ta": { "buckets": { "dev": 11758, "invalidated": 5575, "other": 87993, "reported": 3315, "test": 11815, "train": 41710, "validated": 130461 }, "reportedSentences": 3315, "duration": 1392279684, "clips": 224029, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.08, "thirties": 0.09, "": 0.72, "fourties": 0.03, "seventies": 0.02, "fifties": 0.03, "teens": 0.03, "sixties": 0, "eighties": 0 }, "gender": { "male": 0.16, "": 0.71, "other": 0, "female": 0.13 } }, "users": 792, "size": 8341540337, "checksum": "d23d087efee1ba3c0c9ce93789d77f8a659e0469643a0de73a0b6586735adccc", "avgDurationSecs": 6.215, "validDurationSecs": 810779.854, "totalHrs": 386.74, "validHrs": 225.21 }, "ru": { "buckets": { "dev": 9629, "invalidated": 7159, "other": 16865, "reported": 356, "test": 9630, "train": 22862, "validated": 125553 }, "reportedSentences": 350, "duration": 771816312, "clips": 149577, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.39, "teens": 0.09, "": 0.21, "fourties": 0.15, "thirties": 0.13, "fifties": 0.03, "sixties": 0, "seventies": 0 }, "gender": { "male": 0.62, "": 0.21, "other": 0, "female": 0.16 } }, "users": 2731, "size": 5403479932, "checksum": "c5e32c22b2bda21dbded3f20fbcf77910e7a63932da8138058c2e71c13ffd5bd", "avgDurationSecs": 5.16, "validDurationSecs": 647852.634, "totalHrs": 214.39, "validHrs": 179.95 }, "nl": { "buckets": { "dev": 10736, "invalidated": 5161, "other": 2157, "reported": 328, "test": 10743, "train": 30318, "validated": 84823 }, "reportedSentences": 328, "duration": 397871425, "clips": 92141, "splits": { "accent": { "": 1 }, "age": { "": 0.41, "twenties": 0.21, "fourties": 0.14, "thirties": 0.11, "teens": 0.02, "fifties": 0.08, "sixties": 0.01, "nineties": 0, "eighties": 0, "seventies": 0 }, "gender": { "": 0.42, "male": 0.47, "female": 0.11, "other": 0 } }, "users": 1530, "size": 2734842015, "checksum": "03c65fb0d4964d23286337aca8200dfbec44e4c63361bedb0e0adc1b7f1f5758", "avgDurationSecs": 4.318, "validDurationSecs": 366271.778, 
"totalHrs": 110.51, "validHrs": 101.74 }, "it": { "buckets": { "dev": 14997, "invalidated": 17476, "other": 27, "reported": 5329, "test": 15003, "train": 152609, "validated": 219211 }, "reportedSentences": 5325, "duration": 1268981904, "clips": 236714, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.17, "twenties": 0.2, "": 0.29, "fifties": 0.16, "fourties": 0.14, "seventies": 0, "sixties": 0.03, "teens": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "female": 0.11, "male": 0.59, "": 0.29, "other": 0 } }, "users": 6767, "size": 8784083327, "checksum": "a976f2e7ab10c7dbee95f1271c1221bb2d42ab52589bfefd28cf63b3a4fae520", "avgDurationSecs": 5.361, "validDurationSecs": 1175151.415, "totalHrs": 352.49, "validHrs": 326.43 }, "eu": { "buckets": { "dev": 6561, "invalidated": 5791, "other": 26899, "reported": 72, "test": 6561, "train": 10832, "validated": 69159 }, "reportedSentences": 72, "duration": 528767151, "clips": 101849, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.13, "thirties": 0.07, "fifties": 0.14, "twenties": 0.35, "": 0.25, "teens": 0.03, "sixties": 0.02, "seventies": 0 }, "gender": { "male": 0.47, "female": 0.26, "": 0.26, "other": 0.02 } }, "users": 1213, "size": 3985536335, "checksum": "58fc92fc7c4e2c8874c4e6ae9f58cbd418740e1573c181ac284b0782acc977b0", "avgDurationSecs": 5.192, "validDurationSecs": 359051.217, "totalHrs": 146.87, "validHrs": 99.73 }, "tr": { "buckets": { "dev": 10127, "invalidated": 3593, "other": 151, "reported": 339, "test": 10143, "train": 25998, "validated": 82351 }, "reportedSentences": 340, "duration": 314932815, "clips": 86095, "splits": { "accent": { "": 1 }, "age": { "": 0.32, "thirties": 0.09, "twenties": 0.25, "teens": 0.02, "fourties": 0.04, "fifties": 0.09, "sixties": 0.15, "eighties": 0.02, "seventies": 0.03 }, "gender": { "": 0.32, "male": 0.43, "female": 0.25, "other": 0 } }, "users": 1328, "size": 1948927227, "checksum": "1e9499bf233e6668d5e34802f3ec704c3fffc271380eef16733629e673092610", 
"avgDurationSecs": 3.658, "validDurationSecs": 301237.38, "totalHrs": 87.48, "validHrs": 83.67 }, "ar": { "buckets": { "dev": 10438, "invalidated": 14959, "other": 35514, "reported": 2074, "test": 10440, "train": 28043, "validated": 76208 }, "reportedSentences": 2066, "clips": 126681, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.11, "": 0.56, "twenties": 0.28, "fourties": 0.01, "teens": 0.03, "fifties": 0, "sixties": 0, "nineties": 0 }, "gender": { "female": 0.18, "": 0.55, "male": 0.27, "other": 0 } }, "users": 1309, "duration": 528133089, "size": 3133096686, "checksum": "c5122a5fcb393f6091e81ef60ae8cfc8e1a80451eabee0c31fa600f7c92e99f2", "avgDurationSecs": 4.169, "validDurationSecs": 317711.152, "totalHrs": 146.704, "validHrs": 88.253 }, "zh-TW": { "buckets": { "dev": 4709, "invalidated": 4596, "other": 40630, "reported": 139, "test": 4709, "train": 6568, "validated": 77357 }, "reportedSentences": 140, "duration": 404438206, "clips": 122583, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.2, "twenties": 0.32, "teens": 0.05, "": 0.27, "fifties": 0.05, "seventies": 0, "fourties": 0.1, "sixties": 0 }, "gender": { "male": 0.47, "": 0.26, "female": 0.24, "other": 0.02 } }, "users": 2082, "size": 2830133882, "checksum": "6666d38ac9095833ee88da1ec2df7917ee43a6b7a4e1d60e2148e9fbf2f36c37", "avgDurationSecs": 3.299, "validDurationSecs": 255224.022, "totalHrs": 112.34, "validHrs": 70.89 }, "br": { "buckets": { "dev": 2122, "invalidated": 785, "other": 12352, "reported": 267, "test": 2119, "train": 2645, "validated": 11334 }, "reportedSentences": 267, "duration": 76458895, "clips": 24471, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.25, "": 0.34, "fifties": 0.06, "fourties": 0.06, "thirties": 0.08, "sixties": 0.17, "seventies": 0.02, "teens": 0.01 }, "gender": { "male": 0.63, "": 0.35, "female": 0.02, "other": 0 } }, "users": 180, "size": 555823615, "checksum": "e5dda67bebcf968fd81e43fb0c0a5789deae2574504b4d74446cac9cd3565559", "avgDurationSecs": 
3.124, "validDurationSecs": 35412.738, "totalHrs": 21.23, "validHrs": 9.83 }, "pt": { "buckets": { "dev": 8688, "invalidated": 4870, "other": 16751, "reported": 2400, "test": 8693, "train": 18211, "validated": 108100 }, "reportedSentences": 2395, "duration": 541049911, "clips": 129721, "splits": { "accent": { "": 1 }, "age": { "": 0.21, "twenties": 0.4, "teens": 0.03, "thirties": 0.22, "fourties": 0.1, "sixties": 0.01, "fifties": 0.03, "seventies": 0 }, "gender": { "": 0.21, "male": 0.73, "female": 0.04, "other": 0.02 } }, "users": 2621, "size": 3512588399, "checksum": "96ee3666ad3409e642c1407b0c948949569665b957b3b41a5e0f9247ed438c5e", "avgDurationSecs": 4.171, "validDurationSecs": 450871.45, "totalHrs": 150.29, "validHrs": 125.24 }, "eo": { "buckets": { "dev": 14912, "invalidated": 127311, "other": 138344, "reported": 2151, "test": 14915, "train": 143978, "validated": 848681 }, "reportedSentences": 2150, "clips": 1114336, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.56, "thirties": 0.12, "": 0.2, "fourties": 0.04, "fifties": 0.02, "seventies": 0, "teens": 0.05, "sixties": 0, "eighties": 0 }, "gender": { "male": 0.69, "": 0.2, "female": 0.11, "other": 0 } }, "users": 1581, "size": 40389394457, "checksum": "1346a519fa7f5e158122a1c76c66c7342c5e3b87a8a8549fbaf69b1c78d7f936", "avgDurationSecs": 6.068, "validDurationSecs": 5149796, "duration": 6761790848, "totalHrs": 1878.28, "validHrs": 1430.499 }, "zh-CN": { "buckets": { "dev": 10581, "invalidated": 21302, "other": 698486, "reported": 637, "test": 10581, "train": 29056, "validated": 102525 }, "reportedSentences": 634, "duration": 3742248732, "clips": 822313, "splits": { "accent": { "": 1 }, "age": { "": 0.94, "teens": 0.01, "twenties": 0.04, "thirties": 0.01, "fourties": 0, "nineties": 0, "fifties": 0, "sixties": 0 }, "gender": { "": 0.94, "male": 0.05, "female": 0.01, "other": 0 } }, "users": 6400, "size": 22132786891, "checksum": "d1168d8cf2fd2654fa74056f3b7a0ade9ccb46777efb52d27ccf4d3b3a1841c5", 
"avgDurationSecs": 4.551, "validDurationSecs": 466579.09, "totalHrs": 1039.51, "validHrs": 129.6 }, "id": { "duration": 202841724, "buckets": { "dev": 3226, "invalidated": 2466, "other": 24238, "reported": 280, "test": 3618, "train": 5048, "validated": 23417 }, "reportedSentences": 281, "clips": 50121, "splits": { "accent": { "": 1 }, "age": { "": 0.26, "twenties": 0.39, "thirties": 0.07, "teens": 0.26, "fifties": 0, "fourties": 0.02 }, "gender": { "": 0.26, "male": 0.41, "female": 0.29, "other": 0.04 } }, "users": 445, "size": 1289214557, "checksum": "f10b1dad4071a8bb1a8723ae5ab1cff2a6f32f8e145fbf6126e6afb3beb85906", "avgDurationSecs": 4.047, "validDurationSecs": 94769.551, "totalHrs": 56.34, "validHrs": 26.32 }, "ia": { "duration": 60448296, "buckets": { "dev": 1787, "invalidated": 329, "other": 2690, "reported": 267, "test": 1748, "train": 5041, "validated": 11431 }, "reportedSentences": 263, "clips": 14450, "splits": { "accent": { "": 1 }, "age": { "seventies": 0.22, "fourties": 0.3, "": 0.39, "twenties": 0.05, "thirties": 0.02, "teens": 0, "fifties": 0.03, "sixties": 0 }, "gender": { "male": 0.61, "": 0.39, "female": 0.01 } }, "users": 62, "size": 409737134, "checksum": "9e4d39a1285aee039fc16661d7f65560801ffdc157dc0737b1644ef6c39e95c9", "avgDurationSecs": 4.183, "validDurationSecs": 47818.995, "totalHrs": 16.79, "validHrs": 13.28 }, "lv": { "duration": 30869261, "buckets": { "dev": 1902, "invalidated": 171, "other": 1125, "reported": 31, "test": 2218, "train": 3017, "validated": 7751 }, "reportedSentences": 32, "clips": 9047, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.48, "fourties": 0.03, "": 0.18, "twenties": 0.28, "teens": 0.03, "fifties": 0 }, "gender": { "male": 0.7, "female": 0.13, "": 0.17 } }, "users": 118, "size": 226907582, "checksum": "ca9ef194ccedf357a976cda6315352f0d379b312599305f6f1ff353f3622b292", "avgDurationSecs": 3.412, "validDurationSecs": 26447.181, "totalHrs": 8.57, "validHrs": 7.34 }, "ja": { "duration": 198018825, "buckets": 
{ "dev": 4485, "invalidated": 2464, "other": 1146, "reported": 185, "test": 4604, "train": 6505, "validated": 37710 }, "reportedSentences": 185, "clips": 41320, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.33, "": 0.23, "teens": 0.04, "fifties": 0.01, "thirties": 0.09, "fourties": 0.29, "sixties": 0, "seventies": 0 }, "gender": { "male": 0.54, "": 0.21, "female": 0.25, "other": 0 } }, "users": 688, "size": 1199309960, "checksum": "5200ab82dc91927beadf946cd7db877f85494823b12e17ade9241ff703ae2352", "avgDurationSecs": 4.792, "validDurationSecs": 180718.536, "totalHrs": 55, "validHrs": 50.19 }, "rw": { "duration": 8580641153, "buckets": { "dev": 15987, "invalidated": 227748, "other": 47295, "reported": 629, "test": 16213, "train": 1003021, "validated": 1438429 }, "reportedSentences": 630, "clips": 1713472, "splits": { "accent": { "": 1 }, "age": { "": 0.05, "twenties": 0.61, "thirties": 0.12, "teens": 0.2, "fourties": 0.02, "fifties": 0 }, "gender": { "": 0.1, "male": 0.57, "female": 0.33, "other": 0 } }, "users": 1080, "size": 60998364053, "checksum": "a377be2f63f0606228396644d9bc23d81ff105397866f7ecbabccffad6dbfd81", "avgDurationSecs": 5.008, "validDurationSecs": 7203294.29, "totalHrs": 2383.51, "validHrs": 2000.91 }, "sv-SE": { "duration": 181162455, "buckets": { "dev": 5052, "invalidated": 1346, "other": 5699, "reported": 576, "test": 5069, "train": 7308, "validated": 38736 }, "reportedSentences": 577, "clips": 45781, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.25, "": 0.18, "teens": 0.03, "fifties": 0.03, "twenties": 0.12, "fourties": 0.38, "sixties": 0, "seventies": 0 }, "gender": { "male": 0.48, "": 0.19, "female": 0.33, "other": 0 } }, "users": 755, "size": 1149905955, "checksum": "a5dbc058b1746308d9eab31bc925c0b0c1315ad3726adfce0511c00161fcbc49", "avgDurationSecs": 3.957, "validDurationSecs": 153284.307, "totalHrs": 50.32, "validHrs": 42.57 }, "cnh": { "duration": 20675832, "buckets": { "dev": 761, "invalidated": 436, "other": 2908, 
"reported": 8, "test": 763, "train": 817, "validated": 2458 }, "reportedSentences": 9, "clips": 5802, "splits": { "accent": { "": 1 }, "age": { "": 0.51, "twenties": 0.36, "fourties": 0.01, "teens": 0.02, "thirties": 0.08, "fifties": 0.02 }, "gender": { "": 0.51, "male": 0.33, "female": 0.16 } }, "users": 299, "size": 161393367, "checksum": "2be89cd4a8bd37309c6cdc7a524cb5b667a51499af1abf1caeb32acb7d0743d8", "avgDurationSecs": 3.564, "validDurationSecs": 8759.255, "totalHrs": 5.74, "validHrs": 2.43 }, "et": { "duration": 195951710, "buckets": { "dev": 2638, "invalidated": 6650, "other": 663, "reported": 487, "test": 2638, "train": 3137, "validated": 21681 }, "reportedSentences": 484, "clips": 28994, "splits": { "accent": { "": 1 }, "age": { "": 0.2, "thirties": 0.08, "twenties": 0.68, "fourties": 0.04, "fifties": 0, "seventies": 0, "teens": 0 }, "gender": { "": 0.2, "male": 0.54, "female": 0.26, "other": 0 } }, "users": 808, "size": 1336904754, "checksum": "d3e0be25b4b756e991ca597295ae7d9dda2203d2af0cb691feda80fba4608054", "avgDurationSecs": 6.758, "validDurationSecs": 146527.869, "totalHrs": 54.43, "validHrs": 40.7 }, "ky": { "duration": 162152040, "buckets": { "dev": 1613, "invalidated": 5599, "other": 325, "reported": 36, "test": 1613, "train": 1787, "validated": 29798 }, "reportedSentences": 37, "clips": 35722, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.08, "": 0.07, "fourties": 0.01, "twenties": 0.66, "teens": 0.18, "fifties": 0 }, "gender": { "male": 0.54, "": 0.11, "female": 0.35, "other": 0 } }, "users": 250, "size": 1047564926, "checksum": "3943d7cc30ee725552ff98d10b53ec0ab61033cb29704769750838be2313d211", "avgDurationSecs": 4.539, "validDurationSecs": 135261.365, "totalHrs": 45.04, "validHrs": 37.57 }, "ro": { "duration": 141653551, "buckets": { "dev": 3703, "invalidated": 882, "other": 19267, "reported": 333, "test": 3859, "train": 5187, "validated": 15295 }, "reportedSentences": 334, "clips": 35444, "splits": { "accent": { "": 1 }, "age": { 
"thirties": 0.14, "teens": 0.02, "": 0.11, "fourties": 0.06, "sixties": 0, "twenties": 0.66, "fifties": 0.01, "eighties": 0 }, "gender": { "male": 0.73, "": 0.1, "female": 0.15, "other": 0.01 } }, "users": 369, "size": 888395480, "checksum": "0bb1458db358dc5110aabc056c8a25007608ef817b1938965afd6137e816b563", "avgDurationSecs": 3.997, "validDurationSecs": 61127.16, "totalHrs": 39.34, "validHrs": 16.97 }, "hsb": { "duration": 10207332, "buckets": { "dev": 172, "invalidated": 247, "other": 0, "reported": 84, "test": 444, "train": 808, "validated": 1424 }, "reportedSentences": 85, "clips": 1671, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.55, "": 0.18, "thirties": 0.1, "sixties": 0, "seventies": 0.03, "twenties": 0.11, "fifties": 0.03 }, "gender": { "male": 0.82, "": 0.18, "other": 0 } }, "users": 20, "size": 79770563, "checksum": "77d8350910f2eea3d7813f47fb93cea8108c5aa37d12fb456def557bb238724b", "avgDurationSecs": 6.109, "validDurationSecs": 8698.528, "totalHrs": 2.83, "validHrs": 2.41 }, "el": { "duration": 103273162, "buckets": { "dev": 1701, "invalidated": 797, "other": 9072, "reported": 64, "test": 1696, "train": 1914, "validated": 15119 }, "reportedSentences": 65, "clips": 24988, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.37, "fourties": 0.13, "": 0.33, "twenties": 0.13, "fifties": 0.03, "teens": 0.01, "sixties": 0 }, "gender": { "male": 0.63, "": 0.32, "other": 0.02, "female": 0.03 } }, "users": 346, "size": 695652047, "checksum": "a38515243d120d0e466e02a723a69101135054b757685cf29019e1cc29ab81c8", "avgDurationSecs": 4.133, "validDurationSecs": 62485.47, "totalHrs": 28.68, "validHrs": 17.35 }, "cs": { "duration": 255212499, "buckets": { "dev": 7543, "invalidated": 1319, "other": 8538, "reported": 701, "test": 7714, "train": 14612, "validated": 49130 }, "reportedSentences": 698, "clips": 58987, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.19, "": 0.36, "thirties": 0.14, "teens": 0.02, "twenties": 0.27, "fifties": 0.02, 
"sixties": 0, "seventies": 0 }, "gender": { "male": 0.62, "": 0.35, "female": 0.02, "other": 0 } }, "users": 578, "size": 1795704611, "checksum": "65b2cc61d58e100db51099a90365376f0e46794e2f2cd3a55940a9c870d7bf3a", "avgDurationSecs": 4.327, "validDurationSecs": 212565.312, "totalHrs": 70.89, "validHrs": 59.04 }, "pl": { "duration": 604034894, "buckets": { "dev": 8294, "invalidated": 6136, "other": 4686, "reported": 537, "test": 8294, "train": 16539, "validated": 123870 }, "reportedSentences": 537, "clips": 134692, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.28, "": 0.24, "teens": 0.02, "thirties": 0.33, "fourties": 0.12, "fifties": 0.01, "nineties": 0.01, "sixties": 0 }, "gender": { "male": 0.6, "": 0.25, "female": 0.14, "other": 0.01 } }, "users": 3123, "size": 4334476289, "checksum": "dacae20342ecbbcdf8292859c9c086c3cf9e4124a811a8473f1930bb9fa9d784", "avgDurationSecs": 4.485, "validDurationSecs": 555502.942, "totalHrs": 167.78, "validHrs": 154.3 }, "rm-sursilv": { "duration": 38781365, "buckets": { "dev": 1345, "invalidated": 675, "other": 2176, "reported": 14, "test": 1331, "train": 1546, "validated": 4225 }, "reportedSentences": 15, "clips": 7076, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.03, "twenties": 0.1, "": 0.64, "teens": 0.06, "fourties": 0.17 }, "gender": { "male": 0.17, "female": 0.19, "": 0.64, "other": 0 } }, "users": 86, "size": 292992052, "checksum": "6abb81458703971ae3a75753d39f09ca58c92e1091031a5e5e7f70bbc6d081c9", "avgDurationSecs": 5.481, "validDurationSecs": 23155.917, "totalHrs": 10.77, "validHrs": 6.43 }, "rm-vallader": { "duration": 15115550, "buckets": { "dev": 377, "invalidated": 392, "other": 720, "reported": 34, "test": 439, "train": 666, "validated": 1489 }, "reportedSentences": 33, "clips": 2601, "splits": { "accent": { "": 1 }, "age": { "": 0.36, "fourties": 0.41, "twenties": 0.14, "thirties": 0.06, "fifties": 0, "sixties": 0.03 }, "gender": { "": 0.36, "male": 0.44, "female": 0.19, "other": 0.01 } }, 
"users": 52, "size": 115478749, "checksum": "f703ec8a0021dc9b79b9fbbb4b1a476f8c40d6ecba3f5a533c885f74b796f753", "avgDurationSecs": 5.811, "validDurationSecs": 8653.231, "totalHrs": 4.19, "validHrs": 2.4 }, "mn": { "duration": 68433164, "buckets": { "dev": 1859, "invalidated": 755, "other": 3444, "reported": 21, "test": 1886, "train": 2160, "validated": 8300 }, "reportedSentences": 22, "clips": 12499, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.22, "": 0.27, "twenties": 0.41, "fourties": 0.01, "teens": 0.02, "nineties": 0.06, "fifties": 0 }, "gender": { "male": 0.36, "": 0.27, "female": 0.31, "other": 0.06 } }, "users": 478, "size": 520448499, "checksum": "6576938cb5562ce78fbd88095ba628e337a9b0a514599f8dfaa8406365d82435", "avgDurationSecs": 5.475, "validDurationSecs": 45443.256, "totalHrs": 19, "validHrs": 12.62 }, "zh-HK": { "duration": 487386312, "buckets": { "dev": 5591, "invalidated": 4289, "other": 21310, "reported": 653, "test": 5591, "train": 8423, "validated": 90698 }, "reportedSentences": 642, "clips": 116297, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.13, "thirties": 0.11, "": 0.39, "teens": 0.02, "fifties": 0.02, "seventies": 0, "sixties": 0.01, "twenties": 0.32 }, "gender": { "male": 0.42, "": 0.35, "female": 0.22, "other": 0.01 } }, "users": 2943, "size": 3495790077, "checksum": "cb7b78f3152bf8612a71a1c2256b6c9c1b529e1dbec29356444f102527879ea7", "avgDurationSecs": 4.191, "validDurationSecs": 380104.076, "totalHrs": 135.38, "validHrs": 105.58 }, "ab": { "duration": 303837264, "buckets": { "dev": 9152, "invalidated": 5274, "other": 11965, "reported": 220, "test": 9124, "train": 21027, "validated": 41955 }, "reportedSentences": 219, "clips": 59194, "splits": { "accent": { "": 1 }, "age": { "seventies": 0.01, "thirties": 0.13, "": 0.18, "teens": 0.28, "twenties": 0.18, "fifties": 0.06, "sixties": 0.05, "fourties": 0.09, "eighties": 0.01 }, "gender": { "male": 0.18, "female": 0.64, "": 0.18 } }, "users": 399, "size": 1735941482, 
"checksum": "465528bf6238587d6d6d4ec99bac71e41d4fa3ed1b7b7f0a3a2a61a11d1a556d", "avgDurationSecs": 5.133, "validDurationSecs": 215351.09, "totalHrs": 84.39, "validHrs": 59.81 }, "cv": { "duration": 94204992, "buckets": { "dev": 1142, "invalidated": 2038, "other": 1329, "reported": 143, "test": 1285, "train": 1546, "validated": 15216 }, "reportedSentences": 139, "clips": 18583, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.52, "": 0.2, "fourties": 0.06, "thirties": 0.01, "teens": 0.2, "fifties": 0.01 }, "gender": { "male": 0.54, "": 0.18, "female": 0.28 } }, "users": 104, "size": 644876766, "checksum": "865ab51ef224de462bcbe30408bbbeddfc2915e25f85283831a66c79ad2aa2fa", "avgDurationSecs": 5.069, "validDurationSecs": 77136.262, "totalHrs": 26.16, "validHrs": 21.42 }, "uk": { "duration": 310843140, "buckets": { "dev": 7181, "invalidated": 2505, "other": 7907, "reported": 593, "test": 7184, "train": 12650, "validated": 54839 }, "reportedSentences": 594, "clips": 65251, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.25, "teens": 0.1, "": 0.26, "fourties": 0.12, "thirties": 0.27, "fifties": 0, "sixties": 0 }, "gender": { "male": 0.59, "female": 0.15, "": 0.26 } }, "users": 759, "size": 2107776630, "checksum": "94df9e01e8a774ad590274c090f6683e8698108993b95a81b1c69d7c7c53c234", "avgDurationSecs": 4.764, "validDurationSecs": 261242.386, "totalHrs": 86.34, "validHrs": 72.56 }, "mt": { "duration": 61421004, "buckets": { "dev": 1592, "invalidated": 320, "other": 6267, "reported": 9, "test": 1640, "train": 1946, "validated": 6370 }, "reportedSentences": 10, "clips": 12957, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.19, "": 0.26, "fourties": 0.17, "thirties": 0.09, "teens": 0.03, "fifties": 0.26, "sixties": 0.01 }, "gender": { "male": 0.25, "": 0.26, "female": 0.48, "other": 0.01 } }, "users": 209, "size": 456665746, "checksum": "3371481ec5d72b9ebbd3bda69f0fb50d25e76e18193c27e1dc7e2f2a9ad3cf52", "avgDurationSecs": 4.74, "validDurationSecs": 
30196.172, "totalHrs": 17.06, "validHrs": 8.38 }, "as": { "duration": 11715113, "buckets": { "dev": 469, "invalidated": 163, "other": 297, "reported": 9, "test": 308, "train": 824, "validated": 1601 }, "reportedSentences": 10, "clips": 2061, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.36, "": 0.59, "thirties": 0.04, "teens": 0 }, "gender": { "male": 0.41, "": 0.59, "female": 0 } }, "users": 43, "size": 73260231, "checksum": "291ed568bef3b2b7c1491e082e8661482a73dad5f6f7aa296c52b3cda960a928", "avgDurationSecs": 5.684, "validDurationSecs": 9100.386, "totalHrs": 3.25, "validHrs": 2.52 }, "ka": { "duration": 61768404, "buckets": { "dev": 2309, "invalidated": 680, "other": 1033, "reported": 50, "test": 2499, "train": 3192, "validated": 10236 }, "reportedSentences": 51, "clips": 11949, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.29, "thirties": 0.23, "": 0.4, "fourties": 0.07, "fifties": 0.01, "teens": 0.01 }, "gender": { "male": 0.38, "female": 0.22, "": 0.4 } }, "users": 324, "size": 381797966, "checksum": "b3110f9dfe3b3f74eb7d8904a36ba7ec60148317f3d67e37f3966d2d20274c12", "avgDurationSecs": 5.169, "validDurationSecs": 52913.33, "totalHrs": 17.15, "validHrs": 14.69 }, "fy-NL": { "duration": 464544675, "buckets": { "dev": 3026, "invalidated": 2913, "other": 54421, "reported": 427, "test": 3026, "train": 3701, "validated": 36094 }, "reportedSentences": 425, "clips": 93428, "splits": { "accent": { "": 1 }, "age": { "": 0.59, "fifties": 0.12, "thirties": 0.04, "twenties": 0.02, "fourties": 0.07, "sixties": 0.16, "seventies": 0.01, "teens": 0, "eighties": 0 }, "gender": { "": 0.59, "male": 0.1, "female": 0.31 } }, "users": 1146, "size": 2906008186, "checksum": "fb4980c6dc30c19cee43beef27cbe3dd8bd7ce136576008740eae03625219765", "avgDurationSecs": 4.972, "validDurationSecs": 179467.349, "totalHrs": 129.04, "validHrs": 49.85 }, "dv": { "duration": 226308267, "buckets": { "dev": 2176, "invalidated": 1647, "other": 16211, "reported": 55, "test": 2253, 
"train": 2687, "validated": 26823 }, "reportedSentences": 56, "clips": 44681, "splits": { "accent": { "": 1 }, "age": { "": 0.22, "twenties": 0.17, "thirties": 0.36, "fourties": 0.22, "teens": 0.01, "nineties": 0, "fifties": 0.02 }, "gender": { "": 0.21, "male": 0.28, "female": 0.5 } }, "users": 321, "size": 1434192033, "checksum": "7aed6401d1453fc44f880da86697e4bd0283e28f723a07a70fc792d000236b5f", "avgDurationSecs": 5.065, "validDurationSecs": 135857.896, "totalHrs": 62.86, "validHrs": 37.73 }, "pa-IN": { "duration": 13144634, "buckets": { "dev": 280, "invalidated": 75, "other": 1285, "reported": 249, "test": 399, "train": 685, "validated": 1364 }, "reportedSentences": 244, "clips": 2724, "splits": { "accent": { "": 1 }, "age": { "": 0.25, "fourties": 0.04, "fifties": 0.05, "thirties": 0.4, "twenties": 0.25, "sixties": 0, "teens": 0 }, "gender": { "": 0.25, "male": 0.75, "female": 0 } }, "users": 58, "size": 94525896, "checksum": "b1cbc71ceb2062e0fedab43c68729da3e148c0673ccb47d370cc8b6150c549dd", "avgDurationSecs": 4.825, "validDurationSecs": 6581.968, "totalHrs": 3.65, "validHrs": 1.82 }, "vi": { "duration": 64896128, "buckets": { "dev": 248, "invalidated": 337, "other": 11476, "reported": 181, "test": 1237, "train": 2525, "validated": 4543 }, "reportedSentences": 180, "clips": 16356, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.02, "twenties": 0.19, "": 0.25, "teens": 0.22, "seventies": 0, "fourties": 0.02, "sixties": 0.31 }, "gender": { "male": 0.53, "": 0.25, "female": 0.2, "other": 0.02 } }, "users": 236, "size": 377152685, "checksum": "d932231ed7fca7e3dfbca761f142a2c3d3ef37f806e4c0feada5bfb40a399952", "avgDurationSecs": 3.968, "validDurationSecs": 18025.38, "totalHrs": 18.02, "validHrs": 5 }, "or": { "duration": 35899224, "buckets": { "dev": 309, "invalidated": 167, "other": 5803, "reported": 11, "test": 220, "train": 477, "validated": 1148 }, "reportedSentences": 12, "clips": 7118, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.16, "": 
0.08, "thirties": 0.76, "fourties": 0, "teens": 0 }, "gender": { "male": 0.92, "": 0.08, "female": 0 } }, "users": 88, "size": 258750215, "checksum": "c4bb1953da17e06096536db58931665a2df74624c4692342f0f57ec61918d5a1", "avgDurationSecs": 5.043, "validDurationSecs": 5789.872, "totalHrs": 9.97, "validHrs": 1.6 }, "ga-IE": { "duration": 33116895, "buckets": { "dev": 513, "invalidated": 827, "other": 3857, "reported": 22, "test": 514, "train": 532, "validated": 4668 }, "reportedSentences": 23, "clips": 9352, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.24, "": 0.37, "thirties": 0.26, "fourties": 0.04, "sixties": 0.01, "teens": 0.02, "fifties": 0.05 }, "gender": { "male": 0.49, "": 0.37, "female": 0.13, "other": 0 } }, "users": 164, "size": 229775200, "checksum": "fe2302a8e2088a3f131545e979ea40ea82f673d73aa9ebaefcd13d9604fcdf37", "avgDurationSecs": 3.541, "validDurationSecs": 16530.118, "totalHrs": 9.19, "validHrs": 4.59 }, "fi": { "duration": 61257459, "buckets": { "dev": 1650, "invalidated": 197, "other": 5779, "reported": 46, "test": 1704, "train": 2165, "validated": 7387 }, "reportedSentences": 47, "clips": 13363, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.15, "": 0.36, "twenties": 0.11, "fourties": 0.33, "teens": 0.01, "fifties": 0.04, "seventies": 0 }, "gender": { "male": 0.27, "": 0.36, "female": 0.37, "other": 0 } }, "users": 197, "size": 365601331, "checksum": "73b9915ddfbcc23c6c6407712e35c86064f3f16486a2fe81d66c2fff662e001c", "avgDurationSecs": 4.584, "validDurationSecs": 33862.819, "totalHrs": 17.01, "validHrs": 9.4 }, "hu": { "duration": 95230009, "buckets": { "dev": 4269, "invalidated": 867, "other": 1937, "reported": 101, "test": 4631, "train": 7633, "validated": 16581 }, "reportedSentences": 102, "clips": 19385, "splits": { "accent": { "": 1 }, "age": { "teens": 0.08, "": 0.3, "thirties": 0.15, "twenties": 0.38, "fifties": 0.06, "fourties": 0.02, "sixties": 0.01 }, "gender": { "male": 0.6, "": 0.3, "female": 0.11 } }, "users": 234, 
"size": 614380360, "checksum": "9f26068dcf71c0fef9924065766e3705137217891c3efb5670e702bfd63ade41", "avgDurationSecs": 4.913, "validDurationSecs": 81455.186, "totalHrs": 26.45, "validHrs": 22.62 }, "th": { "duration": 1418710008, "buckets": { "dev": 10930, "invalidated": 8771, "other": 195162, "reported": 4141, "test": 10930, "train": 31849, "validated": 135897 }, "reportedSentences": 4141, "clips": 339830, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.21, "": 0.42, "thirties": 0.07, "fourties": 0.04, "teens": 0.04, "fifties": 0.21, "eighties": 0, "sixties": 0 }, "gender": { "male": 0.39, "": 0.42, "female": 0.18, "other": 0.01 } }, "users": 7678, "size": 8253279343, "checksum": "f0e9108e331e1151d54aac6e541619ee04209a1c887439f9f44d7748366be6eb", "avgDurationSecs": 4.175, "validDurationSecs": 567337.886, "totalHrs": 394.08, "validHrs": 157.59 }, "lt": { "duration": 74319884, "buckets": { "dev": 3690, "invalidated": 559, "other": 1225, "reported": 137, "test": 3749, "train": 5194, "validated": 12646 }, "reportedSentences": 137, "clips": 14430, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.34, "": 0.24, "thirties": 0.28, "fifties": 0.05, "sixties": 0.01, "teens": 0.03, "fourties": 0.05 }, "gender": { "male": 0.62, "": 0.24, "female": 0.14 } }, "users": 265, "size": 456552653, "checksum": "dad7fa591f293f412e9b252ef519ee643ed62fc0f4c08dbb17ffde310e054d2d", "avgDurationSecs": 5.15, "validDurationSecs": 65131.618, "totalHrs": 20.64, "validHrs": 18.09 }, "lg": { "duration": 1748791683, "buckets": { "dev": 12660, "invalidated": 38102, "other": 9787, "reported": 6174, "test": 12773, "train": 55169, "validated": 252479 }, "reportedSentences": 6169, "clips": 300368, "splits": { "accent": { "": 1 }, "age": { "": 0.27, "thirties": 0.22, "twenties": 0.41, "fourties": 0.05, "fifties": 0.04, "teens": 0.01, "nineties": 0, "sixties": 0.01, "seventies": 0, "eighties": 0 }, "gender": { "": 0.27, "female": 0.4, "male": 0.33 } }, "users": 502, "size": 10250752352, 
"checksum": "18c3b9b33439cba516d5aa0307cbf299550fcf32507eacd9bfc3ac94957a9b72", "avgDurationSecs": 5.822, "validDurationSecs": 1469974.083, "totalHrs": 485.77, "validHrs": 408.32 }, "hi": { "duration": 64934772, "buckets": { "dev": 2179, "invalidated": 680, "other": 3328, "reported": 118, "test": 2894, "train": 4361, "validated": 9468 }, "reportedSentences": 119, "clips": 13476, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.34, "fourties": 0.03, "": 0.32, "thirties": 0.27, "teens": 0.01, "fifties": 0.01, "sixties": 0 }, "gender": { "male": 0.63, "female": 0.04, "": 0.33, "other": 0 } }, "users": 333, "size": 383717355, "checksum": "97bda013c6583bfd133168c2e165e37f151219f784e9a38689f54075c4f75cf6", "avgDurationSecs": 4.819, "validDurationSecs": 45622.026, "totalHrs": 18.03, "validHrs": 12.67 }, "bas": { "duration": 9991980, "buckets": { "dev": 457, "invalidated": 483, "other": 109, "reported": 7, "test": 444, "train": 763, "validated": 1664 }, "reportedSentences": 8, "clips": 2256, "splits": { "accent": { "": 1 }, "age": { "": 0.98, "fourties": 0.01, "teens": 0.01 }, "gender": { "": 0.98, "female": 0.02 } }, "users": 32, "size": 55577917, "checksum": "09d698f61da142afbe1fd8d16ea6bb5f528a1038f5455c191071f1643d3fc61a", "avgDurationSecs": 4.429, "validDurationSecs": 7369.971, "totalHrs": 2.77, "validHrs": 2.04 }, "sk": { "duration": 70677972, "buckets": { "dev": 2239, "invalidated": 714, "other": 290, "reported": 30, "test": 2267, "train": 3007, "validated": 16699 }, "reportedSentences": 31, "clips": 17703, "splits": { "accent": { "": 1 }, "age": { "": 0.53, "thirties": 0.22, "twenties": 0.04, "fourties": 0.1, "teens": 0.1 }, "gender": { "": 0.53, "male": 0.37, "female": 0.09, "other": 0.01 } }, "users": 145, "size": 396085360, "checksum": "572f6eb7e485d428b97a43575648af7dd1b6a1e43b702ced27b84094aefdb37f", "avgDurationSecs": 3.992, "validDurationSecs": 66669.573, "totalHrs": 19.63, "validHrs": 18.51 }, "kmr": { "duration": 229331916, "buckets": { "dev": 2833, 
"invalidated": 1879, "other": 7756, "reported": 780, "test": 2837, "train": 3448, "validated": 43772 }, "reportedSentences": 780, "clips": 53407, "splits": { "accent": { "": 1 }, "age": { "": 0.51, "twenties": 0.3, "thirties": 0.08, "fourties": 0.03, "fifties": 0.07, "teens": 0.01, "sixties": 0 }, "gender": { "": 0.5, "male": 0.35, "female": 0.14 } }, "users": 421, "size": 1221004179, "checksum": "20102f351b5134cc376d24cdc20c337ba6616bdbd146467c5654103eddb93a81", "avgDurationSecs": 4.294, "validDurationSecs": 187958.819, "totalHrs": 63.7, "validHrs": 52.21 }, "bg": { "duration": 47173572, "buckets": { "dev": 1175, "invalidated": 412, "other": 1954, "reported": 145, "test": 1818, "train": 3180, "validated": 6184 }, "reportedSentences": 146, "clips": 8550, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.34, "thirties": 0.07, "": 0.41, "twenties": 0.17, "teens": 0.01, "sixties": 0 }, "gender": { "male": 0.52, "female": 0.06, "": 0.41 } }, "users": 72, "size": 275428863, "checksum": "7dd61cc78dcfe256f843ff96564fb4a8622a990d01dfe181d59871178dae6b2b", "avgDurationSecs": 5.517, "validDurationSecs": 34119.458, "totalHrs": 13.1, "validHrs": 9.47 }, "kk": { "duration": 6755904, "buckets": { "dev": 341, "invalidated": 195, "other": 0, "reported": 30, "test": 375, "train": 453, "validated": 1174 }, "reportedSentences": 31, "clips": 1369, "splits": { "accent": { "": 1 }, "age": { "": 0.51, "thirties": 0.03, "twenties": 0.31, "teens": 0.06, "fifties": 0.1 }, "gender": { "": 0.52, "male": 0.45, "female": 0.03 } }, "users": 81, "size": 38782745, "checksum": "17d246cde024bf626662ceba6c757d8aa1465054d9e570943082d7e1c7a03316", "avgDurationSecs": 4.935, "validDurationSecs": 5793.595, "totalHrs": 1.87, "validHrs": 1.6 }, "ba": { "duration": 958413996, "buckets": { "dev": 14557, "invalidated": 7892, "other": 26, "reported": 866, "test": 14542, "train": 118971, "validated": 208621 }, "reportedSentences": 863, "clips": 216539, "splits": { "accent": { "": 1 }, "age": { "thirties": 
0.17, "": 0.3, "fourties": 0.06, "fifties": 0.05, "twenties": 0.17, "sixties": 0.2, "seventies": 0, "teens": 0.04 }, "gender": { "male": 0.3, "": 0.3, "female": 0.4 } }, "users": 888, "size": 5376324683, "checksum": "f3f066725689592c8f0142a518e96b07d929aa71042f19ee89a969410cd5c927", "avgDurationSecs": 4.426, "validDurationSecs": 923368.475, "totalHrs": 266.22, "validHrs": 256.49 }, "gl": { "duration": 65124252, "buckets": { "dev": 2474, "invalidated": 327, "other": 4469, "reported": 206, "test": 2651, "train": 3488, "validated": 8676 }, "reportedSentences": 207, "clips": 13472, "splits": { "accent": { "": 1 }, "age": { "": 0.41, "thirties": 0.37, "fifties": 0.09, "twenties": 0.07, "fourties": 0.05, "teens": 0, "sixties": 0.01 }, "gender": { "": 0.42, "male": 0.38, "female": 0.19, "other": 0.01 } }, "users": 168, "size": 375303345, "checksum": "d17ca8baf6dd617f49a57dde654360ed605e3859b1641a9cf9f29e138dcce2de", "avgDurationSecs": 4.834, "validDurationSecs": 41940.173, "totalHrs": 18.09, "validHrs": 11.65 }, "ug": { "duration": 404340444, "buckets": { "dev": 3270, "invalidated": 3525, "other": 6, "reported": 275, "test": 3270, "train": 4072, "validated": 63934 }, "reportedSentences": 276, "clips": 67465, "splits": { "accent": { "": 1 }, "age": { "": 0.58, "fifties": 0.01, "twenties": 0.12, "thirties": 0.16, "fourties": 0.12, "teens": 0.01, "eighties": 0 }, "gender": { "": 0.58, "male": 0.33, "female": 0.08, "other": 0 } }, "users": 747, "size": 2350804361, "checksum": "ba19ae7b142ab2efac62777929b2f4d281c860cb89896b37d4b87b502aa4f4ac", "avgDurationSecs": 5.993, "validDurationSecs": 383177.973, "totalHrs": 112.31, "validHrs": 106.43 }, "hy-AM": { "duration": 16732692, "buckets": { "dev": 356, "invalidated": 94, "other": 1282, "reported": 38, "test": 397, "train": 607, "validated": 1361 }, "reportedSentences": 39, "clips": 2737, "splits": { "accent": { "": 1 }, "age": { "": 0.36, "thirties": 0.14, "twenties": 0.37, "fifties": 0.04, "teens": 0.09 }, "gender": { "": 0.36, 
"male": 0.22, "female": 0.42 } }, "users": 62, "size": 98241701, "checksum": "bd4b084e56ef01f7efbe69bcee0b865173ac0bd5483c3abfd857db4e7fbccae8", "avgDurationSecs": 6.114, "validDurationSecs": 8320.495, "totalHrs": 4.64, "validHrs": 2.31 }, "be": { "duration": 4381134876, "buckets": { "dev": 15872, "invalidated": 27726, "other": 16153, "reported": 3128, "test": 15877, "train": 347158, "validated": 877088 }, "reportedSentences": 3127, "clips": 920967, "splits": { "accent": { "": 1 }, "age": { "": 0.81, "fourties": 0.06, "thirties": 0.07, "twenties": 0.05, "teens": 0.01, "fifties": 0, "sixties": 0, "seventies": 0 }, "gender": { "": 0.8, "male": 0.08, "female": 0.11, "other": 0 } }, "users": 6965, "size": 24844853632, "checksum": "a7ccc4301a2dfdf9becd4fb61da754accc34e6fc9f3c96d644f6b6ea21235e00", "avgDurationSecs": 4.757, "validDurationSecs": 4172397.954, "totalHrs": 1216.98, "validHrs": 1158.99 }, "ur": { "duration": 505257948, "buckets": { "dev": 3303, "invalidated": 3275, "other": 85123, "reported": 48, "test": 3302, "train": 4129, "validated": 41814 }, "reportedSentences": 48, "clips": 130212, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.88, "": 0.1, "fourties": 0.01, "thirties": 0, "teens": 0.01, "fifties": 0 }, "gender": { "male": 0.7, "": 0.1, "female": 0.2 } }, "users": 193, "size": 2937277928, "checksum": "f1d465a4280b22e81d2734fe168ababc729509bc7b9f57c58343f21afc5b816b", "avgDurationSecs": 3.88, "validDurationSecs": 162249.684, "totalHrs": 140.34, "validHrs": 45.06 }, "gn": { "duration": 24481764, "buckets": { "dev": 489, "invalidated": 108, "other": 3952, "reported": 26, "test": 544, "train": 654, "validated": 1688 }, "reportedSentences": 27, "clips": 5748, "splits": { "accent": { "": 1 }, "age": { "": 0.27, "twenties": 0.16, "thirties": 0.56, "sixties": 0.01 }, "gender": { "": 0.27, "male": 0.67, "female": 0.06 } }, "users": 80, "size": 139759193, "checksum": "4c098f64ac849cbfdeabbbb005985ba71c3da36462a7d66b463060356bf86db8", "avgDurationSecs": 
4.259, "validDurationSecs": 7189.495, "totalHrs": 6.8, "validHrs": 1.99 }, "sr": { "duration": 6972588, "buckets": { "dev": 623, "invalidated": 42, "other": 61, "reported": 18, "test": 677, "train": 1045, "validated": 2347 }, "reportedSentences": 19, "clips": 2450, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.67, "": 0.15, "fifties": 0.03, "fourties": 0.11, "thirties": 0.04, "teens": 0 }, "gender": { "male": 0.38, "": 0.15, "female": 0.47 } }, "users": 56, "size": 38695031, "checksum": "8afbbcc62115a5bed217eccf693839bfed8a80ed7ea412142349452994a1da11", "avgDurationSecs": 2.846, "validDurationSecs": 6679.455, "totalHrs": 1.93, "validHrs": 1.85 }, "uz": { "duration": 927212976, "buckets": { "dev": 11727, "invalidated": 13448, "other": 126570, "reported": 1784, "test": 12290, "train": 47606, "validated": 84406 }, "reportedSentences": 1767, "clips": 224424, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.39, "": 0.41, "thirties": 0.01, "teens": 0.18, "fifties": 0, "fourties": 0.01, "nineties": 0 }, "gender": { "male": 0.44, "": 0.41, "female": 0.15, "other": 0 } }, "users": 2025, "size": 5174452258, "checksum": "a24b838552c121bfe27a585aec5fab4a8fa8aad8b38e4b95277bb583badf41fa", "avgDurationSecs": 4.132, "validDurationSecs": 348725.352, "totalHrs": 257.55, "validHrs": 96.86 }, "mr": { "duration": 98196084, "buckets": { "dev": 1682, "invalidated": 2237, "other": 2819, "reported": 56, "test": 1816, "train": 2245, "validated": 10804 }, "reportedSentences": 57, "clips": 15860, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.13, "sixties": 0, "twenties": 0.29, "": 0.05, "teens": 0.53 }, "gender": { "male": 0.19, "female": 0.76, "": 0.05 } }, "users": 78, "size": 575538502, "checksum": "18fb6bbeb05feb5a525e8147e561bb651930f2c15ed7b50b2b9ef72064a1b881", "avgDurationSecs": 6.191, "validDurationSecs": 66892.213, "totalHrs": 27.27, "validHrs": 18.58 }, "da": { "duration": 37796976, "buckets": { "dev": 2105, "invalidated": 336, "other": 111, "reported": 
311, "test": 2133, "train": 2544, "validated": 8489 }, "reportedSentences": 311, "clips": 8936, "splits": { "accent": { "": 1 }, "age": { "": 0.33, "thirties": 0.29, "twenties": 0.2, "sixties": 0, "fourties": 0.13, "fifties": 0.03, "teens": 0 }, "gender": { "": 0.33, "female": 0.08, "male": 0.59 } }, "users": 209, "size": 216858799, "checksum": "bd8c850acb743492894629ed104e2fcfed7d2591c0367cf07adfc70e822cc67f", "avgDurationSecs": 4.23, "validDurationSecs": 35906.281, "totalHrs": 10.49, "validHrs": 9.97 }, "myv": { "duration": 11274696, "buckets": { "dev": 500, "invalidated": 19, "other": 220, "reported": 25, "test": 525, "train": 690, "validated": 1721 }, "reportedSentences": 26, "clips": 1960, "splits": { "accent": { "": 1 }, "age": { "sixties": 0.26, "": 0.39, "thirties": 0.25, "twenties": 0.09, "teens": 0.01 }, "gender": { "male": 0.54, "": 0.39, "female": 0.07 } }, "users": 12, "size": 65965092, "checksum": "c7a086a97d4b0879822e58aa76c6dda7200fdae41ee3bf290b7051c1b3b7fced", "avgDurationSecs": 5.752, "validDurationSecs": 9899.873, "totalHrs": 3.13, "validHrs": 2.74 }, "nn-NO": { "duration": 3267000, "buckets": { "dev": 168, "invalidated": 16, "other": 0, "reported": 14, "test": 222, "train": 314, "validated": 709 }, "reportedSentences": 15, "clips": 725, "splits": { "accent": { "": 1 }, "age": { "": 0.4, "thirties": 0.36, "twenties": 0.21, "fourties": 0.04 }, "gender": { "": 0.4, "female": 0.2, "male": 0.37, "other": 0.03 } }, "users": 25, "size": 18488246, "checksum": "3b4dd645ee96251810da56b43638d28eb1eea98f3b6cb2e6f151f5e1a074ea9f", "avgDurationSecs": 4.506, "validDurationSecs": 3194.901, "totalHrs": 0.9, "validHrs": 0.88 }, "ha": { "duration": 39648924, "buckets": { "dev": 551, "invalidated": 165, "other": 5865, "reported": 27, "test": 542, "train": 1939, "validated": 3093 }, "reportedSentences": 27, "clips": 9123, "splits": { "accent": { "": 1 }, "age": { "": 0.19, "thirties": 0.75, "twenties": 0.04, "fourties": 0, "fifties": 0.03 }, "gender": { "": 0.18, 
"male": 0.55, "female": 0.27 } }, "users": 33, "size": 231981708, "checksum": "2bf4813c32cb56fd8a3b1b1be2fbf7b198ef0fea9865475f486d70a3fd9665d7", "avgDurationSecs": 4.346, "validDurationSecs": 13442.302, "totalHrs": 11.01, "validHrs": 3.73 }, "ckb": { "duration": 440906112, "buckets": { "dev": 4857, "invalidated": 6840, "other": 9620, "reported": 2343, "test": 4862, "train": 6874, "validated": 96618 }, "reportedSentences": 2343, "clips": 113078, "splits": { "accent": { "": 1 }, "age": { "": 0.35, "thirties": 0.12, "twenties": 0.47, "fourties": 0.03, "teens": 0.02, "fifties": 0.02 }, "gender": { "": 0.33, "male": 0.6, "female": 0.07, "other": 0 } }, "users": 1201, "size": 2383952233, "checksum": "f04fa532f9d9dc1e99ed188a9821da5462f728c27ec29a2e976341884607f2a5", "avgDurationSecs": 3.899, "validDurationSecs": 376726.39, "totalHrs": 122.47, "validHrs": 104.64 }, "ml": { "duration": 10261764, "buckets": { "dev": 0, "invalidated": 11, "other": 1939, "reported": 115, "test": 112, "train": 430, "validated": 542 }, "reportedSentences": 116, "clips": 2492, "splits": { "accent": { "": 1 }, "age": { "": 0.45, "twenties": 0.49, "thirties": 0.05, "fourties": 0 }, "gender": { "": 0.45, "male": 0.55 } }, "users": 25, "size": 59401132, "checksum": "1cae82127fffd138b6c546b1128b6c22246e46f02272c2a72bca6e896674d9e4", "avgDurationSecs": 4.118, "validDurationSecs": 2231.892, "totalHrs": 2.85, "validHrs": 0.61 }, "mdf": { "duration": 1791720, "buckets": { "dev": 48, "invalidated": 7, "other": 73, "reported": 12, "test": 81, "train": 130, "validated": 259 }, "reportedSentences": 13, "clips": 339, "splits": { "accent": { "": 1 }, "age": { "sixties": 0.06, "": 0.59, "fourties": 0.35 }, "gender": { "male": 0.06, "": 0.59, "female": 0.35 } }, "users": 10, "size": 10526456, "checksum": "0f54942589d6caa8497c9143c1a25771a3463c074230e40e7f9675f786f9081e", "avgDurationSecs": 5.285, "validDurationSecs": 1368.895, "totalHrs": 0.49, "validHrs": 0.38 }, "sw": { "duration": 2710469520, "buckets": { 
"dev": 10233, "invalidated": 47470, "other": 240109, "reported": 1861, "test": 10238, "train": 26614, "validated": 219381 }, "reportedSentences": 1856, "clips": 506960, "splits": { "accent": { "": 1 }, "age": { "": 0.31, "twenties": 0.44, "thirties": 0.14, "teens": 0, "fifties": 0.06, "fourties": 0.05, "sixties": 0.01 }, "gender": { "": 0.28, "male": 0.38, "female": 0.34, "other": 0 } }, "users": 705, "size": 15844049912, "checksum": "83b6c3a375b085320cce997bdecfece1c002b193d45f389f42fa61887ed6dfa6", "avgDurationSecs": 5.347, "validDurationSecs": 1172923.926, "totalHrs": 752.9, "validHrs": 325.81 }, "sat": { "duration": 3217608, "buckets": { "dev": 0, "invalidated": 10, "other": 316, "reported": 6, "test": 118, "train": 277, "validated": 395 }, "reportedSentences": 7, "clips": 721, "splits": { "accent": { "": 1 }, "age": { "": 0.43, "twenties": 0.42, "fourties": 0.01, "fifties": 0.01, "teens": 0.01, "thirties": 0.12 }, "gender": { "": 0.4, "male": 0.58, "female": 0.01 } }, "users": 11, "size": 17630034, "checksum": "e93b571ca6cc790730045c01a0e7588875f08bd3e21cce0d1e8d51e5b45397eb", "avgDurationSecs": 4.463, "validDurationSecs": 1762.767, "totalHrs": 0.89, "validHrs": 0.48 }, "tig": { "duration": 103284, "buckets": { "dev": 0, "invalidated": 12, "other": 0, "reported": 0, "test": 1, "train": 10, "validated": 11 }, "reportedSentences": 1, "clips": 23, "splits": { "accent": { "": 1 }, "age": { "": 0.78, "twenties": 0.22 }, "gender": { "": 0.78, "male": 0.22 } }, "users": 5, "size": 603433, "checksum": "6e9b5f27882feb0a335a290a1ed40c6a38644909e27b6806070b1915606975c4", "avgDurationSecs": 4.491, "validDurationSecs": 49.397, "totalHrs": 0.02, "validHrs": 0.01 }, "ig": { "duration": 31132332, "buckets": { "dev": 2, "invalidated": 2, "other": 5678, "reported": 13, "test": 4, "train": 8, "validated": 14 }, "reportedSentences": 13, "clips": 5694, "splits": { "accent": { "": 1 }, "age": { "": 0.56, "twenties": 0.32, "teens": 0.05, "eighties": 0, "thirties": 0.04, "sixties": 
0.02, "fourties": 0 }, "gender": { "": 0.56, "male": 0.13, "female": 0.3 } }, "users": 105, "size": 181851495, "checksum": "ffde860516825ed5a96b3b4b4cf71203e18930b270afee687aed64470be6bd34", "avgDurationSecs": 5.468, "validDurationSecs": 76.546, "totalHrs": 8.64, "validHrs": 0.02 }, "nan-tw": { "duration": 36004176, "buckets": { "dev": 617, "invalidated": 292, "other": 9970, "reported": 129, "test": 986, "train": 1577, "validated": 3188 }, "reportedSentences": 130, "clips": 13450, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.19, "": 0.14, "twenties": 0.39, "fourties": 0.23, "teens": 0.02, "fifties": 0.01, "sixties": 0 }, "gender": { "male": 0.5, "": 0.14, "other": 0.12, "female": 0.23 } }, "users": 112, "size": 200271996, "checksum": "e01f84635aad1133d66a8e7eef7ebf2ab1012977c48299b0c42f2cc51cc6813f", "avgDurationSecs": 2.677, "validDurationSecs": 8533.927, "totalHrs": 10, "validHrs": 2.37 }, "mhr": { "duration": 496853028, "buckets": { "dev": 13122, "invalidated": 3286, "other": 343, "reported": 41, "test": 13201, "train": 73573, "validated": 101683 }, "reportedSentences": 42, "clips": 105312, "splits": { "accent": { "": 1 }, "age": { "fifties": 0.08, "": 0.15, "sixties": 0.07, "thirties": 0.24, "fourties": 0.16, "twenties": 0.25, "teens": 0.03, "seventies": 0.01 }, "gender": { "male": 0.19, "": 0.15, "female": 0.66 } }, "users": 268, "size": 2795859878, "checksum": "7a03c3bea72483d9ab5ab39d0d39a833f6a787e1e9ae5682a7bd1a08c70cca76", "avgDurationSecs": 4.718, "validDurationSecs": 479731.716, "totalHrs": 138.01, "validHrs": 133.25 }, "bn": { "duration": 1656201528, "buckets": { "dev": 8353, "invalidated": 6447, "other": 225826, "reported": 1345, "test": 8353, "train": 16777, "validated": 36903 }, "reportedSentences": 1340, "clips": 269176, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.02, "twenties": 0.25, "": 0.68, "teens": 0.04, "fourties": 0 }, "gender": { "male": 0.27, "": 0.68, "female": 0.05, "other": 0 } }, "users": 20866, "size": 
9531062265, "checksum": "363bea11b09b5e839031c026214a1f9a544cc60d5711d65527af3f17696aa728", "avgDurationSecs": 6.153, "validDurationSecs": 227058.895, "totalHrs": 460.05, "validHrs": 63.07 }, "tok": { "duration": 35868456, "buckets": { "dev": 1807, "invalidated": 187, "other": 2303, "reported": 89, "test": 1866, "train": 2173, "validated": 7209 }, "reportedSentences": 90, "clips": 9699, "splits": { "accent": { "": 1 }, "age": { "": 0.42, "twenties": 0.18, "teens": 0.29, "thirties": 0.11, "fourties": 0 }, "gender": { "": 0.42, "male": 0.46, "other": 0.04, "female": 0.08 } }, "users": 82, "size": 208629893, "checksum": "7f2255c6f5d3897111e1a7647e00da926ce300cf93b91fe6ee7bbb681bce3c62", "avgDurationSecs": 3.698, "validDurationSecs": 26660.037, "totalHrs": 9.96, "validHrs": 7.4 }, "yue": { "duration": 188105256, "buckets": { "dev": 2419, "invalidated": 1559, "other": 25100, "reported": 770, "test": 2438, "train": 2877, "validated": 18761 }, "reportedSentences": 771, "clips": 45420, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.16, "": 0.42, "twenties": 0.34, "fourties": 0.03, "sixties": 0.01, "fifties": 0, "teens": 0.04 }, "gender": { "male": 0.33, "": 0.47, "female": 0.19, "other": 0.02 } }, "users": 743, "size": 1056711049, "checksum": "d66f0107e838c04d73dfac0d759cd8464d6bf50c80ac745dd360f9c27503c1e7", "avgDurationSecs": 4.141, "validDurationSecs": 77697.99, "totalHrs": 52.25, "validHrs": 21.58 }, "sah": { "duration": 24542328, "buckets": { "dev": 1083, "invalidated": 102, "other": 5, "reported": 2, "test": 1249, "train": 1585, "validated": 3985 }, "reportedSentences": 3, "clips": 4092, "splits": { "accent": { "": 1 }, "age": { "": 0.36, "twenties": 0.03, "fourties": 0.07, "thirties": 0.43, "teens": 0.1, "fifties": 0 }, "gender": { "": 0.36, "male": 0.53, "female": 0.1 } }, "users": 53, "size": 186503130, "checksum": "a23f35ad90c9eb440dc7258a452190284b9c73a6bb506a5ab67304efb6f113d9", "avgDurationSecs": 5.998, "validDurationSecs": 23900.581, "totalHrs": 
6.81, "validHrs": 6.63 }, "mk": { "duration": 979452, "buckets": { "dev": 0, "invalidated": 8, "other": 46, "reported": 7, "test": 15, "train": 115, "validated": 130 }, "reportedSentences": 8, "clips": 184, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.48, "": 0.33, "twenties": 0.11, "teens": 0.08 }, "gender": { "male": 0.67, "": 0.33 } }, "users": 6, "size": 5763939, "checksum": "e254aa9e6585fb14662a77418292030ec09747d910550dc9afb516f2aca1d684", "avgDurationSecs": 5.323, "validDurationSecs": 692.004, "totalHrs": 0.27, "validHrs": 0.19 }, "sc": { "duration": 5172660, "buckets": { "dev": 201, "invalidated": 23, "other": 218, "reported": 2, "test": 165, "train": 439, "validated": 936 }, "reportedSentences": 3, "clips": 1177, "splits": { "accent": { "": 1 }, "age": { "": 0.6, "thirties": 0.31, "twenties": 0.09 }, "gender": { "": 0.6, "female": 0.35, "male": 0.04 } }, "users": 12, "size": 29824198, "checksum": "56bb1cf602f92b62c02a8b62229df9bd48ddd0831066b15d3c77f396ecab245f", "avgDurationSecs": 4.395, "validDurationSecs": 4113.517, "totalHrs": 1.43, "validHrs": 1.14 }, "skr": { "duration": 19023912, "buckets": { "dev": 183, "invalidated": 372, "other": 2301, "reported": 16, "test": 641, "train": 1201, "validated": 2029 }, "reportedSentences": 2, "clips": 4702, "splits": { "accent": { "": 1 }, "age": { "": 0.32, "twenties": 0.23, "fifties": 0.43, "teens": 0, "fourties": 0, "thirties": 0.01 }, "gender": { "": 0.32, "male": 0.68 } }, "users": 37, "size": 110889142, "checksum": "08b2c06adc70b54095bdd5e3a5997506d31bd6e00963896d87c56c7d21053d1a", "avgDurationSecs": 4.046, "validDurationSecs": 8209.17, "totalHrs": 5.28, "validHrs": 2.28 }, "ti": { "buckets": { "dev": 0, "invalidated": 0, "other": 7, "reported": 0, "test": 1, "train": 5, "validated": 6 }, "reportedSentences": 1, "duration": 77904, "clips": 13, "splits": { "accent": { "": 1 }, "age": { "": 1 }, "gender": { "": 1 } }, "users": 3, "size": 460760, "checksum": 
"de3abadad6e2b61e738ca5c13624869dce419b2bdd6acc217f11e7dc39ef76b5", "avgDurationSecs": 5.993, "validDurationSecs": 35.956, "totalHrs": 0.02, "validHrs": 0 }, "mrj": { "duration": 59447412, "buckets": { "dev": 3279, "invalidated": 199, "other": 176, "reported": 34, "test": 3818, "train": 5710, "validated": 12935 }, "reportedSentences": 35, "clips": 13310, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.36, "": 0.11, "thirties": 0.04, "sixties": 0.08, "fourties": 0.23, "fifties": 0.17, "teens": 0 }, "gender": { "male": 0.32, "female": 0.57, "": 0.11 } }, "users": 27, "size": 344883390, "checksum": "758cc264f68160cd1c0cfb1cae52554ac7c37175b5684fd8f5eb56af39756cd2", "avgDurationSecs": 4.466, "validDurationSecs": 57772.522, "totalHrs": 16.51, "validHrs": 16.04 }, "tw": { "duration": 5580, "buckets": { "dev": 0, "invalidated": 0, "other": 0, "reported": 1, "test": 0, "train": 1, "validated": 1 }, "reportedSentences": 2, "clips": 1, "splits": { "accent": { "": 1 }, "age": { "": 1 }, "gender": { "": 1 } }, "users": 1, "size": 33844, "checksum": "12fc0220ad3fb5d9522aef10186d42273918d29ad58456fc5a73310bb4bf58c3", "avgDurationSecs": 5.58, "validDurationSecs": 5.58, "totalHrs": 0, "validHrs": 0 }, "vot": { "duration": 1025976, "buckets": { "dev": 0, "invalidated": 324, "other": 0, "test": 6, "train": 96, "validated": 102 }, "clips": 426, "splits": { "accent": { "": 1 }, "age": { "": 0.25, "twenties": 0.73, "teens": 0.01 }, "gender": { "": 0.25, "male": 0.75 } }, "users": 5, "size": 7892459, "checksum": "4e559d4e5b3fb342cddc86cb470e0f27bfb8fce2f48a4351e13543534dbb64af", "avgDurationSecs": 2.408, "validDurationSecs": 245.656, "totalHrs": 0.28, "validHrs": 0.06 }, "az": { "duration": 630828, "buckets": { "dev": 20, "invalidated": 30, "other": 2, "reported": 0, "test": 22, "train": 39, "validated": 81 }, "clips": 113, "splits": { "accent": { "": 1 }, "age": { "": 0.43, "twenties": 0.54, "fourties": 0.03 }, "gender": { "": 0.43, "male": 0.57 } }, "users": 16, "size": 
3685781, "checksum": "8b51750a0cbcccfa5f081b0c3b358030d9bf83e21a351bfe5e618c0bfc125655", "avgDurationSecs": 5.583, "validDurationSecs": 452.186, "totalHrs": 0.17, "validHrs": 0.12 }, "ast": { "duration": 921168, "buckets": { "dev": 0, "invalidated": 0, "other": 203, "test": 2, "train": 9, "validated": 11 }, "clips": 214, "splits": { "accent": { "": 1 }, "age": { "": 1 }, "gender": { "": 1 } }, "users": 2, "size": 5399239, "checksum": "5d486544bfd98347d78fbfdd3a091efbef12e34e34bc86800c79bfa21b356c3e", "avgDurationSecs": 4.305, "validDurationSecs": 47.35, "totalHrs": 0.25, "validHrs": 0.01 }, "ne-NP": { "duration": 827172, "buckets": { "dev": 2, "invalidated": 2, "other": 186, "test": 2, "train": 5, "validated": 9 }, "clips": 197, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.03, "": 0.87, "twenties": 0.1 }, "gender": { "male": 0.13, "": 0.87 } }, "users": 7, "size": 4759225, "checksum": "f18c6290ac79cd301a48cc228ee1d667d66322edf634b28d2ba2d12a730a4500", "avgDurationSecs": 4.199, "validDurationSecs": 37.79, "totalHrs": 0.22, "validHrs": 0.01 } }, "totalDuration": 87231600000, "totalValidDurationSecs": 59145166000, "totalHrs": 24231, "totalValidHrs": 16429 } ================================================ FILE: datasets/scripted-speech/cv-corpus-11.0-delta-2022-09-21.json ================================================ { "locales": { "en": { "duration": 170899020, "reportedSentences": 199, "clips": 31304, "users": 883, "size": 954392009, "checksum": "0efd86ca6b40641b55d1411b7d3b1f1ab8626de4b207504953706df201d198a5", "avgDurationSecs": 5.159, "validDurationSecs": 157744.28, "totalHrs": 47.47, "validHrs": 43.81 }, "fa": { "reportedSentences": 50, "duration": 21337316, "clips": 6013, "users": 36, "size": 119389197, "checksum": "e40247da130302d1dd71e5f25742a0f2f61e8627e7b674c13294967a23f6cf47", "avgDurationSecs": 3.999, "validDurationSecs": 61957.1, "totalHrs": 5.92, "validHrs": 17.21 }, "fr": { "reportedSentences": 202, "duration": 167915268, "clips": 32567, 
"users": 275, "size": 983719339, "checksum": "f763f9b1817280cd37db4d4161a9afc76257024d5bb54951a5987464e1e2ebb4", "avgDurationSecs": 5.004, "validDurationSecs": 136908.84, "totalHrs": 46.64, "validHrs": 38.03 }, "es": { "reportedSentences": 260, "duration": 4682794824, "clips": 985758, "users": 889, "size": 26978186809, "checksum": "319ae22d17dc2158322bb189a4938faba0debe653b611a325ee80c672be277a1", "avgDurationSecs": 4.859, "validDurationSecs": 5578.43, "totalHrs": 1300.77, "validHrs": 1.55 }, "sl": { "reportedSentences": 0, "duration": 160128, "clips": 35, "users": 1, "size": 944619, "checksum": "acbbccd20450efbfb100ab1f5fd0484756d761a3300d1e5eaf8fd403a56f5bbf", "avgDurationSecs": 3.817, "validDurationSecs": 270.33, "totalHrs": 0.04, "validHrs": 0.07 }, "kab": { "reportedSentences": 4104, "duration": 68451840, "clips": 18985, "users": 45, "size": 364754907, "checksum": "09e61eddb933a73606af153a5ed9394390f531093d79b004d27635ee79ecd95b", "avgDurationSecs": 3.335, "validDurationSecs": 40018.13, "totalHrs": 19.01, "validHrs": 11.12 }, "cy": { "reportedSentences": 4, "duration": 10113588, "clips": 1723, "users": 8, "size": 59206155, "checksum": "b1a5d115e0b65bcab23e1fbbc170ed3b61d74aeba506720202d8b732089136cd", "avgDurationSecs": 4.859, "validDurationSecs": 6652.63, "totalHrs": 2.81, "validHrs": 1.85 }, "ca": { "reportedSentences": 551, "duration": 745229196, "clips": 137242, "users": 1577, "size": 4026609416, "checksum": "3ae9f3c82dee5102dfd8a3319b4339262980236f1b85336700ba5e7d3dcb4aae", "avgDurationSecs": 5.505, "validDurationSecs": 1118324.8, "totalHrs": 207.01, "validHrs": 310.64 }, "de": { "reportedSentences": 251, "duration": 76811724, "clips": 13449, "users": 282, "size": 445013059, "checksum": "94a0c7aeb0d18a280380e5a568d21251ed421f093bc164c9f67d8b28dfbecaaf", "avgDurationSecs": 5.167, "validDurationSecs": 73487.75, "totalHrs": 21.34, "validHrs": 20.41 }, "tt": { "reportedSentences": 0, "duration": 1173492, "clips": 255, "users": 4, "size": 6891363, "checksum": 
"e56d549c0aa66f6df596350347dd38d76778da732d3393d4b2d0281ff68cc8dc", "avgDurationSecs": 3.754, "validDurationSecs": 239.53, "totalHrs": 0.32, "validHrs": 0.07 }, "ta": { "reportedSentences": 19, "duration": 22070124, "clips": 3410, "users": 31, "size": 129087707, "checksum": "d23d087efee1ba3c0c9ce93789d77f8a659e0469643a0de73a0b6586735adccc", "avgDurationSecs": 6.215, "validDurationSecs": 5288.99, "totalHrs": 6.13, "validHrs": 1.47 }, "ru": { "reportedSentences": 36, "duration": 18124272, "clips": 3547, "users": 43, "size": 105643171, "checksum": "c5e32c22b2bda21dbded3f20fbcf77910e7a63932da8138058c2e71c13ffd5bd", "avgDurationSecs": 5.16, "validDurationSecs": 35180.44, "totalHrs": 5.04, "validHrs": 9.77 }, "nl": { "reportedSentences": 10, "duration": 3694968, "clips": 810, "users": 10, "size": 21627154, "checksum": "03c65fb0d4964d23286337aca8200dfbec44e4c63361bedb0e0adc1b7f1f5758", "avgDurationSecs": 4.318, "validDurationSecs": 4507.81, "totalHrs": 1.02, "validHrs": 1.26 }, "it": { "reportedSentences": 50, "duration": 19430928, "clips": 3425, "users": 32, "size": 113885978, "checksum": "a976f2e7ab10c7dbee95f1271c1221bb2d42ab52589bfefd28cf63b3a4fae520", "avgDurationSecs": 5.361, "validDurationSecs": 17808.04, "totalHrs": 5.4, "validHrs": 4.95 }, "eu": { "reportedSentences": 11, "duration": 375516, "clips": 62, "users": 4, "size": 2229844, "checksum": "58fc92fc7c4e2c8874c4e6ae9f58cbd418740e1573c181ac284b0782acc977b0", "avgDurationSecs": 5.192, "validDurationSecs": 124.69, "totalHrs": 0.1, "validHrs": 0.03 }, "tr": { "reportedSentences": 7, "duration": 29391984, "clips": 7955, "users": 29, "size": 171855204, "checksum": "1e9499bf233e6668d5e34802f3ec704c3fffc271380eef16733629e673092610", "avgDurationSecs": 3.658, "validDurationSecs": 29049.07, "totalHrs": 8.17, "validHrs": 8.07 }, "ar": { "reportedSentences": 17, "clips": 982, "users": 37, "duration": -71275979065, "size": 23032995, "checksum": "c5122a5fcb393f6091e81ef60ae8cfc8e1a80451eabee0c31fa600f7c92e99f2", 
"avgDurationSecs": 4.169, "validDurationSecs": 1821.78, "totalHrs": 1.29, "validHrs": 0.51 }, "zh-TW": { "reportedSentences": 0, "duration": 8220564, "clips": 1714, "users": 21, "size": 32178674, "checksum": "6666d38ac9095833ee88da1ec2df7917ee43a6b7a4e1d60e2148e9fbf2f36c37", "avgDurationSecs": 3.299, "validDurationSecs": 4916.77, "totalHrs": 2.28, "validHrs": 1.37 }, "br": { "reportedSentences": 46, "duration": 4200228, "clips": 1045, "users": 3, "size": 24551243, "checksum": "e5dda67bebcf968fd81e43fb0c0a5789deae2574504b4d74446cac9cd3565559", "avgDurationSecs": 3.124, "validDurationSecs": 964.48, "totalHrs": 1.16, "validHrs": 0.27 }, "pt": { "reportedSentences": 10, "duration": 19143324, "clips": 5133, "users": 59, "size": 111588467, "checksum": "96ee3666ad3409e642c1407b0c948949569665b957b3b41a5e0f9247ed438c5e", "avgDurationSecs": 4.171, "validDurationSecs": 16558.1, "totalHrs": 5.32, "validHrs": 4.6 }, "eo": { "reportedSentences": 24, "clips": 3474, "users": 40, "size": 128657362, "checksum": "1346a519fa7f5e158122a1c76c66c7342c5e3b87a8a8549fbaf69b1c78d7f936", "avgDurationSecs": 6.068, "validDurationSecs": 1032, "duration": 21080848, "totalHrs": 5.86, "validHrs": 0.29 }, "zh-CN": { "reportedSentences": 133, "duration": 2127485340, "clips": 471531, "users": 1339, "size": 12259296143, "checksum": "d1168d8cf2fd2654fa74056f3b7a0ade9ccb46777efb52d27ccf4d3b3a1841c5", "avgDurationSecs": 4.551, "validDurationSecs": 234424.23, "totalHrs": 590.97, "validHrs": 65.12 }, "id": { "duration": 2689200, "reportedSentences": 12, "clips": 760, "users": 7, "size": 15678310, "checksum": "f10b1dad4071a8bb1a8723ae5ab1cff2a6f32f8e145fbf6126e6afb3beb85906", "avgDurationSecs": 4.047, "validDurationSecs": 619.48, "totalHrs": 0.75, "validHrs": 0.17 }, "ia": { "duration": 70848, "reportedSentences": 3, "clips": 17, "users": 2, "size": 412571, "checksum": "9e4d39a1285aee039fc16661d7f65560801ffdc157dc0737b1644ef6c39e95c9", "avgDurationSecs": 4.183, "validDurationSecs": 292.62, "totalHrs": 0.02, 
"validHrs": 0.08 }, "lv": { "duration": 20052, "reportedSentences": 4, "clips": 5, "users": 1, "size": 76816, "checksum": "ca9ef194ccedf357a976cda6315352f0d379b312599305f6f1ff353f3622b292", "avgDurationSecs": 3.412, "validDurationSecs": 490.45, "totalHrs": 0.01, "validHrs": 0.13 }, "ja": { "duration": 12559896, "reportedSentences": 32, "clips": 2667, "users": 36, "size": 73507367, "checksum": "5200ab82dc91927beadf946cd7db877f85494823b12e17ade9241ff703ae2352", "avgDurationSecs": 4.792, "validDurationSecs": 7888.07, "totalHrs": 3.49, "validHrs": 2.19 }, "rw": { "duration": 66924, "reportedSentences": 0, "clips": 16, "users": 4, "size": 279225, "checksum": "a377be2f63f0606228396644d9bc23d81ff105397866f7ecbabccffad6dbfd81", "avgDurationSecs": 5.008, "validDurationSecs": 94.08, "totalHrs": 0.02, "validHrs": 0.03 }, "sv-SE": { "duration": 705024, "reportedSentences": 1, "clips": 161, "users": 3, "size": 4024135, "checksum": "a5dbc058b1746308d9eab31bc925c0b0c1315ad3726adfce0511c00161fcbc49", "avgDurationSecs": 3.957, "validDurationSecs": 670.8, "totalHrs": 0.2, "validHrs": 0.18 }, "cnh": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": -800, "checksum": "2be89cd4a8bd37309c6cdc7a524cb5b667a51499af1abf1caeb32acb7d0743d8", "avgDurationSecs": 3.564, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "et": { "duration": 733212, "reportedSentences": 11, "clips": 100, "users": 0, "size": 4328053, "checksum": "d3e0be25b4b756e991ca597295ae7d9dda2203d2af0cb691feda80fba4608054", "avgDurationSecs": 6.758, "validDurationSecs": 367.36, "totalHrs": 0.21, "validHrs": 0.1 }, "ky": { "duration": 429732, "reportedSentences": 0, "clips": 114, "users": 3, "size": 2401925, "checksum": "3943d7cc30ee725552ff98d10b53ec0ab61033cb29704769750838be2313d211", "avgDurationSecs": 4.539, "validDurationSecs": 321.7, "totalHrs": 0.12, "validHrs": 0.09 }, "ro": { "duration": 3064284, "reportedSentences": 29, "clips": 728, "users": 8, "size": 17978796, "checksum": 
"0bb1458db358dc5110aabc056c8a25007608ef817b1938965afd6137e816b563", "avgDurationSecs": 3.997, "validDurationSecs": 2136.1, "totalHrs": 0.85, "validHrs": 0.59 }, "hsb": { "duration": 0, "reportedSentences": 13, "clips": 0, "users": 0, "size": 717, "checksum": "77d8350910f2eea3d7813f47fb93cea8108c5aa37d12fb456def557bb238724b", "avgDurationSecs": 6.109, "validDurationSecs": 24.43, "totalHrs": 0, "validHrs": 0.01 }, "el": { "duration": 261180, "reportedSentences": 0, "clips": 55, "users": 5, "size": 1504419, "checksum": "a38515243d120d0e466e02a723a69101135054b757685cf29019e1cc29ab81c8", "avgDurationSecs": 4.133, "validDurationSecs": 846.85, "totalHrs": 0.07, "validHrs": 0.23 }, "cs": { "duration": 1249236, "reportedSentences": 1, "clips": 291, "users": 11, "size": 7319501, "checksum": "65b2cc61d58e100db51099a90365376f0e46794e2f2cd3a55940a9c870d7bf3a", "avgDurationSecs": 4.327, "validDurationSecs": 3790.69, "totalHrs": 0.35, "validHrs": 1.05 }, "pl": { "duration": 4904388, "reportedSentences": 2, "clips": 867, "users": 23, "size": 28911615, "checksum": "dacae20342ecbbcdf8292859c9c086c3cf9e4124a811a8473f1930bb9fa9d784", "avgDurationSecs": 4.485, "validDurationSecs": 8470.94, "totalHrs": 1.36, "validHrs": 2.35 }, "rm-sursilv": { "duration": 34128, "reportedSentences": 3, "clips": 5, "users": 1, "size": 204151, "checksum": "6abb81458703971ae3a75753d39f09ca58c92e1091031a5e5e7f70bbc6d081c9", "avgDurationSecs": 5.481, "validDurationSecs": 31.42, "totalHrs": 0.01, "validHrs": 0.01 }, "rm-vallader": { "duration": 41148, "reportedSentences": 3, "clips": 5, "users": 1, "size": 244243, "checksum": "f703ec8a0021dc9b79b9fbbb4b1a476f8c40d6ecba3f5a533c885f74b796f753", "avgDurationSecs": 5.811, "validDurationSecs": 35.97, "totalHrs": 0.01, "validHrs": 0.01 }, "mn": { "duration": 220932, "reportedSentences": 3, "clips": 38, "users": 6, "size": 1279484, "checksum": "6576938cb5562ce78fbd88095ba628e337a9b0a514599f8dfaa8406365d82435", "avgDurationSecs": 5.475, "validDurationSecs": 238.49, 
"totalHrs": 0.06, "validHrs": 0.07 }, "zh-HK": { "duration": 22666176, "reportedSentences": 15, "clips": 5992, "users": 36, "size": 130815157, "checksum": "cb7b78f3152bf8612a71a1c2256b6c9c1b529e1dbec29356444f102527879ea7", "avgDurationSecs": 4.191, "validDurationSecs": 4708.96, "totalHrs": 6.3, "validHrs": 1.31 }, "ab": { "duration": 2030508, "reportedSentences": 0, "clips": 331, "users": 2, "size": 11959515, "checksum": "465528bf6238587d6d6d4ec99bac71e41d4fa3ed1b7b7f0a3a2a61a11d1a556d", "avgDurationSecs": 5.133, "validDurationSecs": 364.47, "totalHrs": 0.56, "validHrs": 0.1 }, "cv": { "duration": 5461488, "reportedSentences": 0, "clips": 969, "users": 0, "size": 31207139, "checksum": "865ab51ef224de462bcbe30408bbbeddfc2915e25f85283831a66c79ad2aa2fa", "avgDurationSecs": 5.069, "validDurationSecs": 3255.56, "totalHrs": 1.51, "validHrs": 0.9 }, "uk": { "duration": 7941492, "reportedSentences": 6, "clips": 1956, "users": 25, "size": 46209558, "checksum": "94df9e01e8a774ad590274c090f6683e8698108993b95a81b1c69d7c7c53c234", "avgDurationSecs": 4.764, "validDurationSecs": 11106.26, "totalHrs": 2.21, "validHrs": 3.08 }, "mt": { "duration": 204084, "reportedSentences": 0, "clips": 35, "users": 4, "size": 1203929, "checksum": "3371481ec5d72b9ebbd3bda69f0fb50d25e76e18193c27e1dc7e2f2a9ad3cf52", "avgDurationSecs": 4.74, "validDurationSecs": 113.56, "totalHrs": 0.06, "validHrs": 0.03 }, "as": { "duration": 34092, "reportedSentences": 0, "clips": 5, "users": 1, "size": 210743, "checksum": "291ed568bef3b2b7c1491e082e8661482a73dad5f6f7aa296c52b3cda960a928", "avgDurationSecs": 5.684, "validDurationSecs": 1379.32, "totalHrs": 0.01, "validHrs": 0.38 }, "ka": { "duration": 31920588, "reportedSentences": 10, "clips": 6345, "users": 188, "size": 184808932, "checksum": "b3110f9dfe3b3f74eb7d8904a36ba7ec60148317f3d67e37f3966d2d20274c12", "avgDurationSecs": 5.169, "validDurationSecs": 25046.85, "totalHrs": 8.86, "validHrs": 6.95 }, "fy-NL": { "duration": 5032656, "reportedSentences": 2, 
"clips": 1115, "users": 3, "size": 29454343, "checksum": "fb4980c6dc30c19cee43beef27cbe3dd8bd7ce136576008740eae03625219765", "avgDurationSecs": 4.972, "validDurationSecs": -15.77, "totalHrs": 1.4, "validHrs": 0 }, "dv": { "duration": 14083632, "reportedSentences": 6, "clips": 2637, "users": 6, "size": 71433075, "checksum": "7aed6401d1453fc44f880da86697e4bd0283e28f723a07a70fc792d000236b5f", "avgDurationSecs": 5.065, "validDurationSecs": 5208.81, "totalHrs": 3.91, "validHrs": 1.44 }, "pa-IN": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 446, "checksum": "b1cbc71ceb2062e0fedab43c68729da3e148c0673ccb47d370cc8b6150c549dd", "avgDurationSecs": 4.825, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "vi": { "duration": 993024, "reportedSentences": 2, "clips": 249, "users": 7, "size": 5806470, "checksum": "d932231ed7fca7e3dfbca761f142a2c3d3ef37f806e4c0feada5bfb40a399952", "avgDurationSecs": 3.968, "validDurationSecs": 330.72, "totalHrs": 0.27, "validHrs": 0.09 }, "or": { "duration": 586620, "reportedSentences": 2, "clips": 123, "users": 2, "size": 3379106, "checksum": "c4bb1953da17e06096536db58931665a2df74624c4692342f0f57ec61918d5a1", "avgDurationSecs": 5.043, "validDurationSecs": 19.71, "totalHrs": 0.17, "validHrs": 0 }, "ga-IE": { "duration": 97956, "reportedSentences": 8, "clips": 30, "users": 2, "size": 570311, "checksum": "fe2302a8e2088a3f131545e979ea40ea82f673d73aa9ebaefcd13d9604fcdf37", "avgDurationSecs": 3.541, "validDurationSecs": 208.38, "totalHrs": 0.02, "validHrs": 0.06 }, "fi": { "duration": 1150524, "reportedSentences": 1, "clips": 258, "users": 5, "size": 6722348, "checksum": "73b9915ddfbcc23c6c6407712e35c86064f3f16486a2fe81d66c2fff662e001c", "avgDurationSecs": 4.584, "validDurationSecs": 692.78, "totalHrs": 0.32, "validHrs": 0.19 }, "hu": { "duration": 1346580, "reportedSentences": 6, "clips": 302, "users": 11, "size": 7718320, "checksum": "9f26068dcf71c0fef9924065766e3705137217891c3efb5670e702bfd63ade41", 
"avgDurationSecs": 4.913, "validDurationSecs": 1893.13, "totalHrs": 0.38, "validHrs": 0.52 }, "th": { "duration": 24376248, "reportedSentences": 198, "clips": 5514, "users": 62, "size": 142214713, "checksum": "f0e9108e331e1151d54aac6e541619ee04209a1c887439f9f44d7748366be6eb", "avgDurationSecs": 4.175, "validDurationSecs": 27585.96, "totalHrs": 6.77, "validHrs": 7.66 }, "lt": { "duration": 562104, "reportedSentences": 9, "clips": 121, "users": 5, "size": 3309034, "checksum": "dad7fa591f293f412e9b252ef519ee643ed62fc0f4c08dbb17ffde310e054d2d", "avgDurationSecs": 5.15, "validDurationSecs": 1564.57, "totalHrs": 0.16, "validHrs": 0.44 }, "lg": { "duration": 28743336, "reportedSentences": 135, "clips": 4135, "users": 15, "size": 167425070, "checksum": "18c3b9b33439cba516d5aa0307cbf299550fcf32507eacd9bfc3ac94957a9b72", "avgDurationSecs": 5.822, "validDurationSecs": 5471.36, "totalHrs": 7.98, "validHrs": 1.52 }, "hi": { "duration": 1016640, "reportedSentences": 8, "clips": 159, "users": 15, "size": 5935527, "checksum": "97bda013c6583bfd133168c2e165e37f151219f784e9a38689f54075c4f75cf6", "avgDurationSecs": 4.819, "validDurationSecs": 662.86, "totalHrs": 0.28, "validHrs": 0.19 }, "bas": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": -745, "checksum": "09d698f61da142afbe1fd8d16ea6bb5f528a1038f5455c191071f1643d3fc61a", "avgDurationSecs": 4.429, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "sk": { "duration": 964296, "reportedSentences": 0, "clips": 263, "users": 2, "size": 5584196, "checksum": "572f6eb7e485d428b97a43575648af7dd1b6a1e43b702ced27b84094aefdb37f", "avgDurationSecs": 3.992, "validDurationSecs": 537.52, "totalHrs": 0.27, "validHrs": 0.14 }, "kmr": { "duration": 34966332, "reportedSentences": 142, "clips": 9468, "users": 112, "size": 184272657, "checksum": "20102f351b5134cc376d24cdc20c337ba6616bdbd146467c5654103eddb93a81", "avgDurationSecs": 4.294, "validDurationSecs": 12605.58, "totalHrs": 9.71, "validHrs": 3.51 }, "bg": { 
"duration": 697572, "reportedSentences": 0, "clips": 111, "users": 3, "size": 4122114, "checksum": "7dd61cc78dcfe256f843ff96564fb4a8622a990d01dfe181d59871178dae6b2b", "avgDurationSecs": 5.517, "validDurationSecs": 1147.33, "totalHrs": 0.19, "validHrs": 0.32 }, "kk": { "duration": 22644, "reportedSentences": 8, "clips": 5, "users": 1, "size": 131785, "checksum": "17d246cde024bf626662ceba6c757d8aa1465054d9e570943082d7e1c7a03316", "avgDurationSecs": 4.935, "validDurationSecs": 22.93, "totalHrs": 0, "validHrs": 0 }, "ba": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 8107, "checksum": "f3f066725689592c8f0142a518e96b07d929aa71042f19ee89a969410cd5c927", "avgDurationSecs": 4.426, "validDurationSecs": 84.09, "totalHrs": 0, "validHrs": 0.03 }, "gl": { "duration": 4768020, "reportedSentences": 16, "clips": 994, "users": 6, "size": 27860283, "checksum": "d17ca8baf6dd617f49a57dde654360ed605e3859b1641a9cf9f29e138dcce2de", "avgDurationSecs": 4.834, "validDurationSecs": 1246.39, "totalHrs": 1.33, "validHrs": 0.35 }, "ug": { "duration": 143065404, "reportedSentences": 91, "clips": 23711, "users": 351, "size": 829636529, "checksum": "ba19ae7b142ab2efac62777929b2f4d281c860cb89896b37d4b87b502aa4f4ac", "avgDurationSecs": 5.993, "validDurationSecs": 151019.75, "totalHrs": 39.74, "validHrs": 41.95 }, "hy-AM": { "duration": 539100, "reportedSentences": 11, "clips": 88, "users": 2, "size": 3182813, "checksum": "bd4b084e56ef01f7efbe69bcee0b865173ac0bd5483c3abfd857db4e7fbccae8", "avgDurationSecs": 6.114, "validDurationSecs": 165.63, "totalHrs": 0.15, "validHrs": 0.05 }, "be": { "duration": 335323116, "reportedSentences": 4, "clips": 70008, "users": 557, "size": 1902826290, "checksum": "a7ccc4301a2dfdf9becd4fb61da754accc34e6fc9f3c96d644f6b6ea21235e00", "avgDurationSecs": 4.757, "validDurationSecs": 250600.34, "totalHrs": 93.15, "validHrs": 69.61 }, "ur": { "duration": 4365936, "reportedSentences": 0, "clips": 915, "users": 10, "size": 24991959, "checksum": 
"f1d465a4280b22e81d2734fe168ababc729509bc7b9f57c58343f21afc5b816b", "avgDurationSecs": 3.88, "validDurationSecs": 1127.62, "totalHrs": 1.21, "validHrs": 0.31 }, "gn": { "duration": 12521952, "reportedSentences": 1, "clips": 3027, "users": 11, "size": 73235544, "checksum": "4c098f64ac849cbfdeabbbb005985ba71c3da36462a7d66b463060356bf86db8", "avgDurationSecs": 4.259, "validDurationSecs": 3567.71, "totalHrs": 3.48, "validHrs": 0.99 }, "sr": { "duration": 261360, "reportedSentences": 0, "clips": 75, "users": 0, "size": 1525615, "checksum": "8afbbcc62115a5bed217eccf693839bfed8a80ed7ea412142349452994a1da11", "avgDurationSecs": 2.846, "validDurationSecs": 120.82, "totalHrs": 0.07, "validHrs": 0.03 }, "uz": { "duration": 23011848, "reportedSentences": 34, "clips": 4315, "users": 93, "size": 134440309, "checksum": "a24b838552c121bfe27a585aec5fab4a8fa8aad8b38e4b95277bb583badf41fa", "avgDurationSecs": 4.132, "validDurationSecs": 6465.74, "totalHrs": 6.39, "validHrs": 1.79 }, "mr": { "duration": 2209032, "reportedSentences": 13, "clips": 337, "users": 1, "size": 13046635, "checksum": "18fb6bbeb05feb5a525e8147e561bb651930f2c15ed7b50b2b9ef72064a1b881", "avgDurationSecs": 6.191, "validDurationSecs": 913.87, "totalHrs": 0.61, "validHrs": 0.26 }, "da": { "duration": 2595024, "reportedSentences": 103, "clips": 680, "users": 15, "size": 14996085, "checksum": "bd8c850acb743492894629ed104e2fcfed7d2591c0367cf07adfc70e822cc67f", "avgDurationSecs": 4.23, "validDurationSecs": 3309.51, "totalHrs": 0.72, "validHrs": 0.92 }, "myv": { "duration": 177372, "reportedSentences": 6, "clips": 30, "users": 0, "size": 1048094, "checksum": "c7a086a97d4b0879822e58aa76c6dda7200fdae41ee3bf290b7051c1b3b7fced", "avgDurationSecs": 5.752, "validDurationSecs": 263.03, "totalHrs": 0.05, "validHrs": 0.07 }, "nn-NO": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 1848, "checksum": "3b4dd645ee96251810da56b43638d28eb1eea98f3b6cb2e6f151f5e1a074ea9f", "avgDurationSecs": 4.506, 
"validDurationSecs": 342.47, "totalHrs": 0, "validHrs": 0.09 }, "ha": { "duration": 212148, "reportedSentences": 10, "clips": 53, "users": 3, "size": 1239218, "checksum": "2bf4813c32cb56fd8a3b1b1be2fbf7b198ef0fea9865475f486d70a3fd9665d7", "avgDurationSecs": 4.346, "validDurationSecs": 515.56, "totalHrs": 0.06, "validHrs": 0.14 }, "ckb": { "duration": 35079264, "reportedSentences": 82, "clips": 9109, "users": 50, "size": 193690495, "checksum": "f04fa532f9d9dc1e99ed188a9821da5462f728c27ec29a2e976341884607f2a5", "avgDurationSecs": 3.899, "validDurationSecs": 58342.27, "totalHrs": 9.75, "validHrs": 16.2 }, "ml": { "duration": 104724, "reportedSentences": 3, "clips": 28, "users": 3, "size": 608165, "checksum": "1cae82127fffd138b6c546b1128b6c22246e46f02272c2a72bca6e896674d9e4", "avgDurationSecs": 4.118, "validDurationSecs": 195.54, "totalHrs": 0.03, "validHrs": 0.05 }, "mdf": { "duration": 0, "reportedSentences": 3, "clips": 0, "users": 0, "size": 314, "checksum": "0f54942589d6caa8497c9143c1a25771a3463c074230e40e7f9675f786f9081e", "avgDurationSecs": 5.285, "validDurationSecs": 15.86, "totalHrs": 0, "validHrs": 0.01 }, "sw": { "duration": 74412108, "reportedSentences": 29, "clips": 12309, "users": 42, "size": 434786117, "checksum": "83b6c3a375b085320cce997bdecfece1c002b193d45f389f42fa61887ed6dfa6", "avgDurationSecs": 5.347, "validDurationSecs": 404362.73, "totalHrs": 20.67, "validHrs": 112.33 }, "sat": { "duration": 191844, "reportedSentences": 0, "clips": 36, "users": 2, "size": 1095867, "checksum": "e93b571ca6cc790730045c01a0e7588875f08bd3e21cce0d1e8d51e5b45397eb", "avgDurationSecs": 4.463, "validDurationSecs": 26.82, "totalHrs": 0.05, "validHrs": 0 }, "tig": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 18, "checksum": "6e9b5f27882feb0a335a290a1ed40c6a38644909e27b6806070b1915606975c4", "avgDurationSecs": 4.491, "validDurationSecs": 4.49, "totalHrs": 0, "validHrs": 0 }, "ig": { "duration": 22536, "reportedSentences": 7, "clips": 5, "users": 1, 
"size": 133098, "checksum": "ffde860516825ed5a96b3b4b4cf71203e18930b270afee687aed64470be6bd34", "avgDurationSecs": 5.468, "validDurationSecs": -0.01, "totalHrs": 0, "validHrs": 0 }, "nan-tw": { "duration": 2399328, "reportedSentences": 11, "clips": 896, "users": 15, "size": 12108465, "checksum": "e01f84635aad1133d66a8e7eef7ebf2ab1012977c48299b0c42f2cc51cc6813f", "avgDurationSecs": 2.677, "validDurationSecs": 838.06, "totalHrs": 0.67, "validHrs": 0.24 }, "mhr": { "duration": 72818856, "reportedSentences": 6, "clips": 16241, "users": 29, "size": 393181312, "checksum": "7a03c3bea72483d9ab5ab39d0d39a833f6a787e1e9ae5682a7bd1a08c70cca76", "avgDurationSecs": 4.718, "validDurationSecs": 68556.01, "totalHrs": 20.23, "validHrs": 19.04 }, "bn": { "duration": 47340432, "reportedSentences": 375, "clips": 8154, "users": 236, "size": 276054484, "checksum": "363bea11b09b5e839031c026214a1f9a544cc60d5711d65527af3f17696aa728", "avgDurationSecs": 6.153, "validDurationSecs": 5282.83, "totalHrs": 13.15, "validHrs": 1.47 }, "tok": { "duration": 2067336, "reportedSentences": 0, "clips": 537, "users": 9, "size": 11922699, "checksum": "7f2255c6f5d3897111e1a7647e00da926ce300cf93b91fe6ee7bbb681bce3c62", "avgDurationSecs": 3.698, "validDurationSecs": 1639.39, "totalHrs": 0.58, "validHrs": 0.45 }, "yue": { "duration": 48387888, "reportedSentences": 177, "clips": 11962, "users": 159, "size": 266621392, "checksum": "d66f0107e838c04d73dfac0d759cd8464d6bf50c80ac745dd360f9c27503c1e7", "avgDurationSecs": 4.141, "validDurationSecs": 16011.55, "totalHrs": 13.44, "validHrs": 4.45 }, "sah": { "duration": 81396, "reportedSentences": 0, "clips": 15, "users": 0, "size": 480267, "checksum": "a23f35ad90c9eb440dc7258a452190284b9c73a6bb506a5ab67304efb6f113d9", "avgDurationSecs": 5.998, "validDurationSecs": 51.62, "totalHrs": 0.02, "validHrs": 0.01 }, "mk": { "duration": 0, "reportedSentences": 3, "clips": 0, "users": 0, "size": 290, "checksum": "e254aa9e6585fb14662a77418292030ec09747d910550dc9afb516f2aca1d684", 
"avgDurationSecs": 5.323, "validDurationSecs": 5.32, "totalHrs": 0, "validHrs": 0 }, "sc": { "duration": 2032344, "reportedSentences": 2, "clips": 541, "users": 4, "size": 11898298, "checksum": "56bb1cf602f92b62c02a8b62229df9bd48ddd0831066b15d3c77f396ecab245f", "avgDurationSecs": 4.395, "validDurationSecs": 2197.73, "totalHrs": 0.56, "validHrs": 0.61 }, "vot": { "duration": 0, "clips": 0, "users": 0, "size": 32, "checksum": "4e559d4e5b3fb342cddc86cb470e0f27bfb8fce2f48a4351e13543534dbb64af", "avgDurationSecs": 2.408, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "az": { "duration": 19008, "clips": 3, "users": 2, "size": 112088, "checksum": "8b51750a0cbcccfa5f081b0c3b358030d9bf83e21a351bfe5e618c0bfc125655", "avgDurationSecs": 5.583, "validDurationSecs": 1.66, "totalHrs": 0.01, "validHrs": 0 }, "ast": { "duration": 0, "clips": 0, "users": 0, "size": -292, "checksum": "5d486544bfd98347d78fbfdd3a091efbef12e34e34bc86800c79bfa21b356c3e", "avgDurationSecs": 4.305, "validDurationSecs": 47.35, "totalHrs": 0, "validHrs": 0.01 }, "ne-NP": { "duration": 311940, "clips": 69, "users": 3, "size": 1831358, "checksum": "f18c6290ac79cd301a48cc228ee1d667d66322edf634b28d2ba2d12a730a4500", "avgDurationSecs": 4.199, "validDurationSecs": 17.66, "totalHrs": 0.08, "validHrs": 0.01 } }, "totalDuration": 12290400000, "totalValidDurationSecs": 4304155, "totalHrs": 3414, "totalValidHrs": 1195 } ================================================ FILE: datasets/scripted-speech/cv-corpus-12.0-2022-12-07.json ================================================ { "locales": { "en": { "duration": 11378329699, "buckets": { "dev": 16365, "invalidated": 259242, "other": 283258, "reported": 4577, "test": 16365, "train": 986897, "validated": 1659877 }, "reportedSentences": 4486, "clips": 2202377, "splits": { "accent": { "": 1 }, "age": { "": 0.37, "twenties": 0.25, "sixties": 0.04, "thirties": 0.13, "teens": 0.06, "seventies": 0.01, "fourties": 0.1, "fifties": 0.05, "eighties": 0, "nineties": 0 }, 
"gender": { "": 0.37, "male": 0.45, "female": 0.16, "other": 0.02 } }, "users": 85825, "size": 79751937788, "checksum": "0efd86ca6b40641b55d1411b7d3b1f1ab8626de4b207504953706df201d198a5", "avgDurationSecs": 5.166, "validDurationSecs": 8575565.294, "totalHrs": 3160.64, "validHrs": 2382.1 }, "fa": { "buckets": { "dev": 10400, "invalidated": 13969, "other": 22128, "reported": 2232, "test": 10400, "train": 27735, "validated": 316602 }, "reportedSentences": 2223, "duration": 1407903848, "clips": 352699, "splits": { "accent": { "": 1 }, "age": { "": 0.25, "twenties": 0.31, "thirties": 0.37, "fifties": 0.02, "fourties": 0.02, "teens": 0.03, "sixties": 0 }, "gender": { "": 0.22, "male": 0.71, "female": 0.07, "other": 0 } }, "users": 4151, "size": 10326546053, "checksum": "4b9635a8f2f14198ae8bc6a31245b5a0611d90edd8df850d0f67554b27c1a94e", "avgDurationSecs": 3.992, "validDurationSecs": 1263811.845, "totalHrs": 391.08, "validHrs": 351.05 }, "fr": { "buckets": { "dev": 16104, "invalidated": 58418, "other": 34255, "reported": 6831, "test": 16104, "train": 499535, "validated": 666754 }, "reportedSentences": 6755, "duration": 3798200674, "clips": 759427, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.17, "thirties": 0.16, "": 0.37, "teens": 0.03, "fourties": 0.14, "fifties": 0.1, "sixties": 0.03, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "male": 0.59, "": 0.3, "female": 0.1, "other": 0.01 } }, "users": 17130, "size": 26968765675, "checksum": "00afc519d48d749a4724386dc203b8a0286060efe4ccb46963555794fef216eb", "avgDurationSecs": 5.001, "validDurationSecs": 3334705.63, "totalHrs": 1055.05, "validHrs": 926.3 }, "es": { "buckets": { "dev": 15683, "invalidated": 66516, "other": 1159857, "reported": 2057, "test": 15683, "train": 272465, "validated": 348635 }, "reportedSentences": 2043, "duration": 7659961902, "clips": 1575008, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.1, "": 0.13, "fifties": 0.04, "twenties": 0.55, "teens": 0.08, "fourties": 
0.03, "sixties": 0.08, "eighties": 0, "seventies": 0, "nineties": 0 }, "gender": { "male": 0.54, "": 0.13, "other": 0, "female": 0.33 } }, "users": 24889, "size": 48368058973, "checksum": "04e90a4bc52d1f0f9997f1b56ec3c636af09c411dc0437ca50df3c67a137c07a", "avgDurationSecs": 4.863, "validDurationSecs": 1695566.51, "totalHrs": 2127.76, "validHrs": 470.99 }, "sl": { "buckets": { "dev": 1140, "invalidated": 261, "other": 2196, "reported": 37, "test": 1237, "train": 1461, "validated": 9852 }, "reportedSentences": 38, "duration": 46572670, "clips": 12309, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.47, "teens": 0.07, "": 0.18, "sixties": 0.07, "fifties": 0.07, "fourties": 0.08, "thirties": 0.05 }, "gender": { "female": 0.15, "male": 0.66, "": 0.18, "other": 0 } }, "users": 139, "size": 326984569, "checksum": "fe68f6c28f38918388128bba5c6999ec478e508d8b7af5cbfa9e51c339bba67e", "avgDurationSecs": 3.784, "validDurationSecs": 37276.297, "totalHrs": 12.93, "validHrs": 10.35 }, "kab": { "buckets": { "dev": 14994, "invalidated": 19497, "other": 111486, "reported": 8978, "test": 14995, "train": 151641, "validated": 608855 }, "reportedSentences": 8971, "duration": 2468343796, "clips": 739838, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.09, "thirties": 0.29, "": 0.28, "fifties": 0.19, "twenties": 0.12, "eighties": 0, "teens": 0, "sixties": 0.03, "seventies": 0 }, "gender": { "male": 0.54, "": 0.26, "female": 0.2, "other": 0 } }, "users": 1507, "size": 18431732539, "checksum": "683057ee2860eada3a5ed5874ccb55ee5cd664f1e95f9876282f54404f7cc955", "avgDurationSecs": 3.336, "validDurationSecs": 2031341.269, "totalHrs": 685.65, "validHrs": 564.26 }, "cy": { "buckets": { "dev": 5283, "invalidated": 4362, "other": 18596, "reported": 161, "test": 5292, "train": 7769, "validated": 88794 }, "reportedSentences": 162, "duration": 543448269, "clips": 111752, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.16, "twenties": 0.13, "sixties": 0.06, "fifties": 0.09, "": 
0.43, "thirties": 0.09, "seventies": 0.01, "eighties": 0, "teens": 0.02 }, "gender": { "male": 0.33, "female": 0.24, "": 0.41, "other": 0.01 } }, "users": 1733, "size": 3991381337, "checksum": "1bb1e1bba095aa4d022ef949dda8b2a806ef2a20d3f135f37b5d12723290db14", "avgDurationSecs": 4.863, "validDurationSecs": 431803.866, "totalHrs": 150.95, "validHrs": 119.94 }, "ca": { "buckets": { "dev": 16363, "invalidated": 81433, "other": 479083, "reported": 5503, "test": 16363, "train": 982969, "validated": 1223086 }, "reportedSentences": 5455, "duration": 9799111110, "clips": 1783602, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.06, "fifties": 0.17, "fourties": 0.1, "twenties": 0.05, "": 0.34, "sixties": 0.23, "teens": 0.01, "seventies": 0.04, "eighties": 0, "nineties": 0 }, "gender": { "male": 0.43, "": 0.35, "female": 0.22, "other": 0 } }, "users": 30888, "size": 60339302819, "checksum": "c9b0f4916532cd00782f1e155bd376e3e6f6f4829f73a8536c8aa9be5e773afa", "avgDurationSecs": 5.494, "validDurationSecs": 6719635.665, "totalHrs": 2721.97, "validHrs": 1866.56 }, "de": { "buckets": { "dev": 16113, "invalidated": 49375, "other": 5436, "reported": 8606, "test": 16113, "train": 508449, "validated": 835999 }, "reportedSentences": 8582, "duration": 4618755785, "clips": 890810, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.19, "fourties": 0.17, "": 0.32, "thirties": 0.16, "teens": 0.03, "sixties": 0.02, "fifties": 0.11, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "male": 0.59, "": 0.32, "female": 0.09, "other": 0.01 } }, "users": 17498, "size": 32667867182, "checksum": "2b8f54b28022d483f22c6a203a998d50fe946911b347f7e901113219ce5e9ab6", "avgDurationSecs": 5.185, "validDurationSecs": 4334566.538, "totalHrs": 1282.98, "validHrs": 1204.04 }, "tt": { "buckets": { "dev": 3295, "invalidated": 396, "other": 11, "reported": 3, "test": 5105, "train": 9565, "validated": 28829 }, "reportedSentences": 4, "duration": 109776950, "clips": 29236, "splits": { "accent": 
{ "": 1 }, "age": { "": 0.2, "thirties": 0.72, "twenties": 0.05, "sixties": 0, "fifties": 0.01, "teens": 0, "fourties": 0, "seventies": 0.01 }, "gender": { "": 0.2, "male": 0.78, "female": 0.02 } }, "users": 229, "size": 810939093, "checksum": "34edaf782e34e67177710eb44ac85cf14b488ad177c7f883996f42334939840b", "avgDurationSecs": 3.755, "validDurationSecs": 108248.724, "totalHrs": 30.49, "validHrs": 30.06 }, "ta": { "buckets": { "dev": 11905, "invalidated": 5596, "other": 87248, "reported": 3334, "test": 11955, "train": 42594, "validated": 131756 }, "reportedSentences": 3333, "duration": 1395830364, "clips": 224600, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.08, "thirties": 0.09, "": 0.72, "fourties": 0.03, "seventies": 0.02, "fifties": 0.03, "teens": 0.03, "sixties": 0, "eighties": 0 }, "gender": { "male": 0.16, "": 0.71, "other": 0, "female": 0.13 } }, "users": 800, "size": 8361980031, "checksum": "20835f24c7f3bf8cb7cc60e6c4b368606a6f8988a1395657ca4f64395ca6a001", "avgDurationSecs": 6.215, "validDurationSecs": 818829.143, "totalHrs": 387.73, "validHrs": 227.45 }, "ru": { "buckets": { "dev": 10153, "invalidated": 8379, "other": 8641, "reported": 381, "test": 10153, "train": 26043, "validated": 142279 }, "reportedSentences": 375, "duration": 822704832, "clips": 159299, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.37, "teens": 0.09, "": 0.24, "fourties": 0.14, "thirties": 0.12, "fifties": 0.03, "sixties": 0, "seventies": 0 }, "gender": { "male": 0.6, "": 0.24, "other": 0, "female": 0.16 } }, "users": 2815, "size": 5701465200, "checksum": "b7c55379cbb5a7fae18a8343bb5c6512587165c9a7b258de921b48484aef82c2", "avgDurationSecs": 5.165, "validDurationSecs": 734804.492, "totalHrs": 228.52, "validHrs": 204.11 }, "nl": { "buckets": { "dev": 10798, "invalidated": 5189, "other": 2227, "reported": 333, "test": 10806, "train": 30827, "validated": 85457 }, "reportedSentences": 333, "duration": 401587705, "clips": 92873, "splits": { "accent": { "": 1 }, 
"age": { "": 0.42, "twenties": 0.21, "fourties": 0.14, "thirties": 0.11, "teens": 0.02, "fifties": 0.08, "sixties": 0.01, "nineties": 0, "eighties": 0, "seventies": 0 }, "gender": { "": 0.42, "male": 0.47, "female": 0.11, "other": 0 } }, "users": 1557, "size": 2756366032, "checksum": "674cf7146187d88957fa5592a53ce05025fba5c79353a06227694a13be998af0", "avgDurationSecs": 4.324, "validDurationSecs": 369520.533, "totalHrs": 111.55, "validHrs": 102.64 }, "it": { "buckets": { "dev": 15074, "invalidated": 17646, "other": 316, "reported": 5710, "test": 15074, "train": 160160, "validated": 226954 }, "reportedSentences": 5706, "duration": 1313352984, "clips": 244916, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.17, "twenties": 0.22, "": 0.29, "fifties": 0.15, "fourties": 0.14, "seventies": 0, "sixties": 0.03, "teens": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "female": 0.11, "male": 0.6, "": 0.29, "other": 0 } }, "users": 6832, "size": 9040574665, "checksum": "00e4954197195361dbf0a919afbe99c735ac3128ce66e085037b13eba24a8d41", "avgDurationSecs": 5.362, "validDurationSecs": 1217032.424, "totalHrs": 364.82, "validHrs": 338.06 }, "eu": { "buckets": { "dev": 6586, "invalidated": 5812, "other": 27842, "reported": 91, "test": 6586, "train": 10904, "validated": 69493 }, "reportedSentences": 91, "duration": 535065711, "clips": 103147, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.13, "thirties": 0.07, "fifties": 0.14, "twenties": 0.35, "": 0.26, "teens": 0.03, "sixties": 0.02, "seventies": 0 }, "gender": { "male": 0.46, "female": 0.26, "": 0.26, "other": 0.02 } }, "users": 1234, "size": 4021718361, "checksum": "a669cbc38edd57d54441b1db84a4becad117c99c75f6ea973318741e7e66fd74", "avgDurationSecs": 5.187, "validDurationSecs": 360488.637, "totalHrs": 148.62, "validHrs": 100.13 }, "tr": { "buckets": { "dev": 10817, "invalidated": 3885, "other": 156, "reported": 347, "test": 10817, "train": 30915, "validated": 90344 }, "reportedSentences": 348, "duration": 
341891379, "clips": 94385, "splits": { "accent": { "": 1 }, "age": { "": 0.3, "thirties": 0.08, "twenties": 0.28, "teens": 0.02, "fourties": 0.03, "fifties": 0.09, "sixties": 0.15, "eighties": 0.02, "seventies": 0.03 }, "gender": { "": 0.3, "male": 0.46, "female": 0.23, "other": 0 } }, "users": 1392, "size": 2086520104, "checksum": "778825e1dc4a29f4fa5201e7bac51740a205763c6055632636dffc6aea669342", "avgDurationSecs": 3.622, "validDurationSecs": 327253.639, "totalHrs": 94.96, "validHrs": 90.9 }, "ar": { "buckets": { "dev": 10353, "invalidated": 14974, "other": 36219, "reported": 2088, "test": 10433, "train": 28195, "validated": 76482 }, "reportedSentences": 2080, "clips": 127675, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.11, "": 0.56, "twenties": 0.28, "fourties": 0.01, "teens": 0.03, "fifties": 0, "sixties": 0, "nineties": 0 }, "gender": { "female": 0.18, "": 0.56, "male": 0.27, "other": 0 } }, "users": 1386, "size": 3157168023, "checksum": "b2a7db7b89bc919d365c56ed31d78733eb4270abb59cddae29bde215093ae6e1", "duration": 532277075, "avgDurationSecs": 4.169, "validDurationSecs": 318853.458, "totalHrs": 147.85, "validHrs": 88.57 }, "zh-TW": { "buckets": { "dev": 4752, "invalidated": 4624, "other": 42929, "reported": 141, "test": 4752, "train": 6655, "validated": 78203 }, "reportedSentences": 142, "duration": 416786530, "clips": 125756, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.2, "twenties": 0.31, "teens": 0.06, "": 0.27, "fifties": 0.06, "seventies": 0, "fourties": 0.1, "sixties": 0 }, "gender": { "male": 0.48, "": 0.26, "female": 0.24, "other": 0.02 } }, "users": 2099, "size": 2877557966, "checksum": "76d1ec2365724dbaae27b99bc69e526ad006b8edbcacb535f0f30979eb352014", "avgDurationSecs": 3.314, "validDurationSecs": 259184.111, "totalHrs": 115.77, "validHrs": 71.99 }, "br": { "buckets": { "dev": 2123, "invalidated": 839, "other": 14968, "reported": 270, "test": 2122, "train": 2644, "validated": 11348 }, "reportedSentences": 270, "duration": 
85109299, "clips": 27155, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.33, "": 0.31, "fifties": 0.05, "fourties": 0.06, "thirties": 0.07, "sixties": 0.15, "seventies": 0.02, "teens": 0.01 }, "gender": { "male": 0.67, "": 0.31, "female": 0.02, "other": 0 } }, "users": 180, "size": 605988650, "checksum": "22dcc7f11a79437446cac99e9b4406661343a44ab95ba888599a9719f6dff5e3", "avgDurationSecs": 3.134, "validDurationSecs": 35566.943, "totalHrs": 23.64, "validHrs": 9.87 }, "pt": { "buckets": { "dev": 8924, "invalidated": 6004, "other": 29778, "reported": 2457, "test": 8930, "train": 19282, "validated": 122596 }, "reportedSentences": 2450, "duration": 648361735, "clips": 158378, "splits": { "accent": { "": 1 }, "age": { "": 0.22, "twenties": 0.37, "teens": 0.02, "thirties": 0.19, "fourties": 0.14, "sixties": 0.02, "fifties": 0.03, "seventies": 0 }, "gender": { "": 0.22, "male": 0.7, "female": 0.06, "other": 0.02 } }, "users": 2967, "size": 4132456159, "checksum": "c2dda2673732a414967a609cbbcdda384ce29be8b81f510a98deb5163f3fd13b", "avgDurationSecs": 4.094, "validDurationSecs": 501878.766, "totalHrs": 180.1, "validHrs": 139.41 }, "eo": { "buckets": { "dev": 14904, "invalidated": 127334, "other": 139922, "reported": 2200, "test": 14903, "train": 143998, "validated": 848786 }, "reportedSentences": 2199, "duration": 6696252000, "clips": 1116042, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.56, "thirties": 0.12, "": 0.2, "fourties": 0.04, "fifties": 0.02, "seventies": 0, "teens": 0.05, "sixties": 0, "eighties": 0 }, "gender": { "male": 0.69, "": 0.2, "female": 0.11, "other": 0 } }, "users": 1598, "size": 40451476104, "checksum": "5e19e7bc2c690ccdef66860b0a7ddb6cb721a44cc785d1a1887fd25be5f6b6d3", "avgDurationSecs": 6.0, "validDurationSecs": 5092704, "totalHrs": 1860.07, "validHrs": 1414.64 }, "zh-CN": { "buckets": { "dev": 10623, "invalidated": 55704, "other": 593295, "reported": 657, "test": 10623, "train": 29375, "validated": 177063 }, "reportedSentences": 
654, "duration": 3758155260, "clips": 826062, "splits": { "accent": { "": 1 }, "age": { "": 0.94, "teens": 0.01, "twenties": 0.04, "thirties": 0.01, "fourties": 0, "nineties": 0, "fifties": 0, "sixties": 0 }, "gender": { "": 0.94, "male": 0.05, "female": 0.01, "other": 0 } }, "users": 6589, "size": 22221976419, "checksum": "20c88b6413084e8762cff0d27953d5912ba8d88b2c662b707f047b3269209610", "avgDurationSecs": 4.549, "validDurationSecs": 805545.159, "totalHrs": 1043.93, "validHrs": 223.76 }, "id": { "buckets": { "dev": 3288, "invalidated": 2546, "other": 28212, "reported": 342, "test": 3647, "train": 5040, "validated": 24877 }, "reportedSentences": 343, "duration": 222696192, "clips": 55635, "splits": { "accent": { "": 1 }, "age": { "": 0.25, "twenties": 0.4, "thirties": 0.1, "teens": 0.23, "fifties": 0, "fourties": 0.02 }, "gender": { "": 0.25, "male": 0.43, "female": 0.27, "other": 0.04 } }, "users": 482, "size": 1404008599, "checksum": "9567242d629cd7788efe19bc8b88576c6810d37d5ceb3f3248428d062c4ceb5e", "avgDurationSecs": 4.003, "validDurationSecs": 99577.841, "totalHrs": 61.86, "validHrs": 27.66 }, "ia": { "buckets": { "dev": 1790, "invalidated": 330, "other": 2672, "reported": 269, "test": 1761, "train": 5030, "validated": 11522 }, "reportedSentences": 265, "duration": 60726972, "clips": 14524, "splits": { "accent": { "": 1 }, "age": { "seventies": 0.22, "fourties": 0.3, "": 0.38, "twenties": 0.05, "thirties": 0.02, "teens": 0, "fifties": 0.03, "sixties": 0 }, "gender": { "male": 0.61, "": 0.38, "female": 0.01 } }, "users": 62, "size": 411364433, "checksum": "d64e20bc8fd084bc00cf7782ce392c9734c25ce648a73d5544505d891d0b7bd3", "avgDurationSecs": 4.181, "validDurationSecs": 48175.17, "totalHrs": 16.86, "validHrs": 13.38 }, "lv": { "buckets": { "dev": 2174, "invalidated": 184, "other": 2225, "reported": 37, "test": 2263, "train": 2734, "validated": 8220 }, "reportedSentences": 38, "duration": 36636353, "clips": 10629, "splits": { "accent": { "": 1 }, "age": { 
"thirties": 0.49, "fourties": 0.03, "": 0.21, "twenties": 0.24, "teens": 0.02, "fifties": 0 }, "gender": { "male": 0.67, "female": 0.12, "": 0.2 } }, "users": 127, "size": 259873352, "checksum": "a9168e15dd608ab50f1de913d0df1453bea9471ddeb117c0aa4ae5e329f740b4", "avgDurationSecs": 3.447, "validDurationSecs": 28332.94, "totalHrs": 10.17, "validHrs": 7.87 }, "ja": { "buckets": { "dev": 5030, "invalidated": 5927, "other": 56196, "reported": 197, "test": 5030, "train": 7211, "validated": 57293 }, "reportedSentences": 197, "duration": 522536469, "clips": 119416, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.55, "": 0.23, "teens": 0.06, "fifties": 0.01, "thirties": 0.04, "fourties": 0.11, "sixties": 0, "seventies": 0, "nineties": 0 }, "gender": { "male": 0.51, "": 0.23, "female": 0.25, "other": 0.01 } }, "users": 1570, "size": 3097693238, "checksum": "f573fa5aff77257374832592f374b16b1eeb37d882211b2cf05d820bcebd8fd7", "avgDurationSecs": 4.376, "validDurationSecs": 250700.76, "totalHrs": 145.14, "validHrs": 69.63 }, "rw": { "buckets": { "dev": 15987, "invalidated": 227757, "other": 47242, "reported": 629, "test": 16213, "train": 1003021, "validated": 1438483 }, "reportedSentences": 630, "duration": 8580674957, "clips": 1713482, "splits": { "accent": { "": 1 }, "age": { "": 0.05, "twenties": 0.61, "thirties": 0.12, "teens": 0.2, "fourties": 0.02, "fifties": 0 }, "gender": { "": 0.1, "male": 0.57, "female": 0.33, "other": 0 } }, "users": 1082, "size": 60997982287, "checksum": "07e467fd9c658db093dd33b1479abc52393df99dc475e154ed9eaebc215206ea", "avgDurationSecs": 5.008, "validDurationSecs": 7203551.046, "totalHrs": 2383.52, "validHrs": 2000.98 }, "sv-SE": { "buckets": { "dev": 5012, "invalidated": 1367, "other": 5859, "reported": 578, "test": 5091, "train": 7421, "validated": 39091 }, "reportedSentences": 579, "duration": 183619455, "clips": 46317, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.25, "": 0.18, "teens": 0.03, "fifties": 0.03, "twenties": 0.12, 
"fourties": 0.38, "sixties": 0, "seventies": 0 }, "gender": { "male": 0.48, "": 0.19, "female": 0.33, "other": 0 } }, "users": 769, "size": 1164306935, "checksum": "0512fa3a82e12e8e6d40125dde29a9e76be1fdfda0c3b7c88a1548bee9053edb", "avgDurationSecs": 3.964, "validDurationSecs": 154972.648, "totalHrs": 51, "validHrs": 43.04 }, "cnh": { "buckets": { "dev": 761, "invalidated": 436, "other": 2908, "reported": 8, "test": 763, "train": 817, "validated": 2458 }, "reportedSentences": 9, "duration": 20675832, "clips": 5802, "splits": { "accent": { "": 1 }, "age": { "": 0.51, "twenties": 0.36, "fourties": 0.01, "teens": 0.02, "thirties": 0.08, "fifties": 0.02 }, "gender": { "": 0.51, "male": 0.33, "female": 0.16 } }, "users": 299, "size": 161395357, "checksum": "af57810dd1ad121bc016c57ece58d47d9f3d0701e4f0a0297b69a71ac943c5ee", "avgDurationSecs": 3.564, "validDurationSecs": 8759.255, "totalHrs": 5.74, "validHrs": 2.43 }, "et": { "buckets": { "dev": 2638, "invalidated": 6682, "other": 551, "reported": 489, "test": 2638, "train": 3137, "validated": 21766 }, "reportedSentences": 486, "duration": 195989330, "clips": 28999, "splits": { "accent": { "": 1 }, "age": { "": 0.2, "thirties": 0.08, "twenties": 0.68, "fourties": 0.04, "fifties": 0, "seventies": 0, "teens": 0 }, "gender": { "": 0.2, "male": 0.54, "female": 0.26, "other": 0 } }, "users": 809, "size": 1337115941, "checksum": "1c809f7c0f95b30a892cbe09bbb65d687b69b97718b636b19246e19d82a95538", "avgDurationSecs": 6.758, "validDurationSecs": 147105.202, "totalHrs": 54.44, "validHrs": 40.86 }, "ky": { "buckets": { "dev": 1613, "invalidated": 5614, "other": 80, "reported": 36, "test": 1613, "train": 1787, "validated": 30083 }, "reportedSentences": 37, "duration": 162346656, "clips": 35777, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.08, "": 0.07, "fourties": 0.01, "twenties": 0.66, "teens": 0.18, "fifties": 0 }, "gender": { "male": 0.54, "": 0.11, "female": 0.35, "other": 0 } }, "users": 255, "size": 1048670893, 
"checksum": "b0372b6d83b152a9c58aea685a6be8df570da3e3b3326928f2d493d6cfb1f3d5", "avgDurationSecs": 4.538, "validDurationSecs": 136508.775, "totalHrs": 45.09, "validHrs": 37.91 }, "ro": { "buckets": { "dev": 3792, "invalidated": 898, "other": 20880, "reported": 359, "test": 3861, "train": 5113, "validated": 15495 }, "reportedSentences": 360, "duration": 148622935, "clips": 37273, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.13, "teens": 0.02, "": 0.1, "fourties": 0.06, "sixties": 0, "twenties": 0.68, "fifties": 0.01, "eighties": 0 }, "gender": { "male": 0.74, "": 0.1, "female": 0.15, "other": 0.01 } }, "users": 374, "size": 928458462, "checksum": "916003804f2e760ffef496451dbb04a8feae1b0701a829f36280132ead8097ea", "avgDurationSecs": 3.987, "validDurationSecs": 61785.002, "totalHrs": 41.28, "validHrs": 17.16 }, "hsb": { "buckets": { "dev": 172, "invalidated": 247, "other": 0, "reported": 86, "test": 444, "train": 808, "validated": 1424 }, "reportedSentences": 87, "duration": 10207332, "clips": 1671, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.55, "": 0.18, "thirties": 0.1, "sixties": 0, "seventies": 0.03, "twenties": 0.11, "fifties": 0.03 }, "gender": { "male": 0.82, "": 0.18, "other": 0 } }, "users": 20, "size": 79768999, "checksum": "0ae100f50c2d4077ea4ed540f232f248726e778a3ef0a6101d796570336b1b5f", "avgDurationSecs": 6.109, "validDurationSecs": 8698.528, "totalHrs": 2.83, "validHrs": 2.41 }, "el": { "buckets": { "dev": 1695, "invalidated": 803, "other": 9262, "reported": 64, "test": 1697, "train": 1919, "validated": 15312 }, "reportedSentences": 65, "duration": 104774038, "clips": 25377, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.38, "fourties": 0.14, "": 0.32, "twenties": 0.13, "fifties": 0.03, "teens": 0.01, "sixties": 0 }, "gender": { "male": 0.63, "": 0.32, "other": 0.02, "female": 0.03 } }, "users": 351, "size": 704339159, "checksum": "7835c6c85ae509e498610e30e7c89474fc35f6d14e8a75c290170e2213ec66f8", "avgDurationSecs": 
4.129, "validDurationSecs": 63218.665, "totalHrs": 29.1, "validHrs": 17.56 }, "cs": { "buckets": { "dev": 7739, "invalidated": 1344, "other": 8782, "reported": 706, "test": 7758, "train": 14815, "validated": 49799 }, "reportedSentences": 703, "duration": 259489335, "clips": 59925, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.19, "": 0.36, "thirties": 0.14, "teens": 0.02, "twenties": 0.27, "fifties": 0.02, "sixties": 0, "seventies": 0 }, "gender": { "male": 0.62, "": 0.35, "female": 0.02, "other": 0 } }, "users": 602, "size": 1819654605, "checksum": "65f7c995a5d9a9ab24b02390ceea535782631347ea7ed0958ba20885fdfe07b8", "avgDurationSecs": 4.33, "validDurationSecs": 215641.375, "totalHrs": 72.08, "validHrs": 59.9 }, "pl": { "buckets": { "dev": 8387, "invalidated": 6178, "other": 4184, "reported": 548, "test": 8388, "train": 16916, "validated": 125319 }, "reportedSentences": 548, "duration": 609201650, "clips": 135681, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.28, "": 0.24, "teens": 0.02, "thirties": 0.33, "fourties": 0.12, "fifties": 0.01, "nineties": 0.01, "sixties": 0 }, "gender": { "male": 0.6, "": 0.25, "female": 0.14, "other": 0.01 } }, "users": 3149, "size": 4364825121, "checksum": "14e2419c3b8692cc4e36a68b2b981d6525c4e3712f4d224f58ec579bf3107c06", "avgDurationSecs": 4.49, "validDurationSecs": 562676.731, "totalHrs": 169.22, "validHrs": 156.29 }, "rm-sursilv": { "buckets": { "dev": 1353, "invalidated": 677, "other": 2142, "reported": 16, "test": 1346, "train": 1552, "validated": 4258 }, "reportedSentences": 17, "duration": 38788025, "clips": 7077, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.03, "twenties": 0.1, "": 0.64, "teens": 0.06, "fourties": 0.17 }, "gender": { "male": 0.17, "female": 0.19, "": 0.64, "other": 0 } }, "users": 87, "size": 293027201, "checksum": "021875aa984bc25c3ffd88991b84d24bd8c1e60523f6b8b6a469524160227183", "avgDurationSecs": 5.481, "validDurationSecs": 23337.489, "totalHrs": 10.77, "validHrs": 6.48 }, 
"rm-vallader": { "buckets": { "dev": 377, "invalidated": 393, "other": 711, "reported": 34, "test": 442, "train": 671, "validated": 1497 }, "reportedSentences": 33, "duration": 15115550, "clips": 2601, "splits": { "accent": { "": 1 }, "age": { "": 0.36, "fourties": 0.41, "twenties": 0.14, "thirties": 0.06, "fifties": 0, "sixties": 0.03 }, "gender": { "": 0.36, "male": 0.44, "female": 0.19, "other": 0.01 } }, "users": 52, "size": 115477883, "checksum": "d403a6a235255cd0a4fa69e57240a66f10bd62dc4f325c8779da77919aaaba8c", "avgDurationSecs": 5.811, "validDurationSecs": 8699.723, "totalHrs": 4.19, "validHrs": 2.41 }, "mn": { "buckets": { "dev": 1879, "invalidated": 763, "other": 3455, "reported": 24, "test": 1891, "train": 2149, "validated": 8366 }, "reportedSentences": 25, "duration": 68918012, "clips": 12584, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.22, "": 0.28, "twenties": 0.41, "fourties": 0.01, "teens": 0.02, "nineties": 0.06, "fifties": 0 }, "gender": { "male": 0.36, "": 0.28, "female": 0.31, "other": 0.06 } }, "users": 484, "size": 523303849, "checksum": "8454570d8ebe14c5215b8c487f9481791f331aa0b43fc81c644895295dd26a52", "avgDurationSecs": 5.477, "validDurationSecs": 45817.553, "totalHrs": 19.14, "validHrs": 12.72 }, "zh-HK": { "buckets": { "dev": 5592, "invalidated": 4324, "other": 21464, "reported": 662, "test": 5592, "train": 8425, "validated": 91239 }, "reportedSentences": 651, "duration": 489793020, "clips": 117027, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.13, "thirties": 0.11, "": 0.39, "teens": 0.02, "fifties": 0.02, "seventies": 0, "sixties": 0.01, "twenties": 0.33 }, "gender": { "male": 0.42, "": 0.35, "female": 0.22, "other": 0.01 } }, "users": 2959, "size": 3509410261, "checksum": "9177786ff68dc9439aac656cbb69e6754708f87913aa4bb1ecea82d0f29425eb", "avgDurationSecs": 4.185, "validDurationSecs": 381862.522, "totalHrs": 136.05, "validHrs": 106.07 }, "ab": { "buckets": { "dev": 9152, "invalidated": 5275, "other": 11953, 
"reported": 220, "test": 9124, "train": 21027, "validated": 41966 }, "reportedSentences": 219, "duration": 303837264, "clips": 59194, "splits": { "accent": { "": 1 }, "age": { "seventies": 0.01, "thirties": 0.13, "": 0.18, "teens": 0.28, "twenties": 0.18, "fifties": 0.06, "sixties": 0.05, "fourties": 0.09, "eighties": 0.01 }, "gender": { "male": 0.18, "female": 0.64, "": 0.18 } }, "users": 399, "size": 1735910765, "checksum": "8dfcf8a2d3396a5d1414a97f4f38594fc1e61781aa9192a5d6ddf1587f530b15", "avgDurationSecs": 5.133, "validDurationSecs": 215407.552, "totalHrs": 84.39, "validHrs": 59.83 }, "cv": { "buckets": { "dev": 1160, "invalidated": 2115, "other": 434, "reported": 143, "test": 1288, "train": 1538, "validated": 16803 }, "reportedSentences": 139, "duration": 98016600, "clips": 19352, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.51, "": 0.21, "fourties": 0.06, "thirties": 0.01, "teens": 0.19, "fifties": 0.01 }, "gender": { "male": 0.54, "": 0.19, "female": 0.27 } }, "users": 105, "size": 666776712, "checksum": "9661d6d9b09b505f206f9fe4a8ec214bd6d1f548662016e9e70df04c3023ba8a", "avgDurationSecs": 5.065, "validDurationSecs": 85106.084, "totalHrs": 27.22, "validHrs": 23.64 }, "uk": { "buckets": { "dev": 8085, "invalidated": 2478, "other": 1525, "reported": 593, "test": 8085, "train": 15749, "validated": 61578 }, "reportedSentences": 594, "duration": 312392004, "clips": 65581, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.25, "teens": 0.1, "": 0.26, "fourties": 0.12, "thirties": 0.27, "fifties": 0, "sixties": 0 }, "gender": { "male": 0.58, "female": 0.15, "": 0.26 } }, "users": 780, "size": 2115944701, "checksum": "3a49dab5846681ca5d4a44da8fede4098e07d921bb12c514efac9a7e297597f1", "avgDurationSecs": 4.763, "validDurationSecs": 293323.902, "totalHrs": 86.77, "validHrs": 81.47 }, "mt": { "buckets": { "dev": 1592, "invalidated": 320, "other": 6267, "reported": 9, "test": 1642, "train": 1944, "validated": 6376 }, "reportedSentences": 10, "duration": 
61454772, "clips": 12963, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.19, "": 0.26, "fourties": 0.17, "thirties": 0.09, "teens": 0.03, "fifties": 0.25, "sixties": 0.01 }, "gender": { "male": 0.25, "": 0.26, "female": 0.48, "other": 0.01 } }, "users": 211, "size": 456857354, "checksum": "fcd5ba501be158baefa5b54543e6860fdfdc49119d64e129516ee3ace9e5eeb7", "avgDurationSecs": 4.741, "validDurationSecs": 30227.233, "totalHrs": 17.07, "validHrs": 8.39 }, "as": { "buckets": { "dev": 471, "invalidated": 163, "other": 293, "reported": 9, "test": 308, "train": 831, "validated": 1610 }, "reportedSentences": 10, "duration": 11755685, "clips": 2066, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.36, "": 0.59, "thirties": 0.04, "teens": 0 }, "gender": { "male": 0.41, "": 0.59, "female": 0 } }, "users": 44, "size": 73491923, "checksum": "3a78dfece9c53ab3f7c51a42150c0e9870b47e439671151656ccb8cff6ab26ad", "avgDurationSecs": 5.69, "validDurationSecs": 9161.013, "totalHrs": 3.26, "validHrs": 2.54 }, "ka": { "buckets": { "dev": 3120, "invalidated": 964, "other": 6264, "reported": 161, "test": 3158, "train": 3944, "validated": 12976 }, "reportedSentences": 153, "duration": 107276148, "clips": 20204, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.31, "thirties": 0.18, "": 0.31, "fourties": 0.04, "fifties": 0.11, "teens": 0.01, "sixties": 0.03 }, "gender": { "male": 0.43, "female": 0.26, "": 0.31 } }, "users": 407, "size": 647527803, "checksum": "43680f0e73b0ff83c673529d71020a04980fcd66fea38950ae32e3a941bf7058", "avgDurationSecs": 5.31, "validDurationSecs": 68898.005, "totalHrs": 29.79, "validHrs": 19.13 }, "fy-NL": { "buckets": { "dev": 3089, "invalidated": 3133, "other": 65829, "reported": 476, "test": 3089, "train": 3799, "validated": 39211 }, "reportedSentences": 467, "duration": 538296975, "clips": 108173, "splits": { "accent": { "": 1 }, "age": { "": 0.57, "fifties": 0.13, "thirties": 0.06, "twenties": 0.02, "fourties": 0.07, "sixties": 0.15, 
"seventies": 0, "teens": 0, "eighties": 0 }, "gender": { "": 0.57, "male": 0.1, "female": 0.32 } }, "users": 1422, "size": 3295528319, "checksum": "78d0b64630747fb937c65bff12d8365ba6ce45e14c47adaec91cabaf20bc156d", "avgDurationSecs": 4.976, "validDurationSecs": 195124.132, "totalHrs": 149.52, "validHrs": 54.2 }, "dv": { "buckets": { "dev": 2227, "invalidated": 1650, "other": 16366, "reported": 55, "test": 2207, "train": 2682, "validated": 26895 }, "reportedSentences": 56, "duration": 227260323, "clips": 44911, "splits": { "accent": { "": 1 }, "age": { "": 0.22, "twenties": 0.18, "thirties": 0.36, "fourties": 0.22, "teens": 0.01, "nineties": 0, "fifties": 0.02 }, "gender": { "": 0.21, "male": 0.29, "female": 0.5 } }, "users": 327, "size": 1439750005, "checksum": "8035637c87a8a5842f937e2b56232992e3f2f55e4e0c1762efc58a3e8fa311ce", "avgDurationSecs": 5.06, "validDurationSecs": 136095.086, "totalHrs": 63.12, "validHrs": 37.8 }, "pa-IN": { "buckets": { "dev": 282, "invalidated": 76, "other": 1420, "reported": 249, "test": 414, "train": 695, "validated": 1391 }, "reportedSentences": 244, "duration": 13950674, "clips": 2887, "splits": { "accent": { "": 1 }, "age": { "": 0.29, "fourties": 0.04, "fifties": 0.05, "thirties": 0.38, "twenties": 0.24, "sixties": 0, "teens": 0 }, "gender": { "": 0.29, "male": 0.71, "female": 0 } }, "users": 60, "size": 98594156, "checksum": "c2dd9fdc789651dd9c79cbe7c074dff5e1f6ff49cda28c50940f2c563129bd5b", "avgDurationSecs": 4.832, "validDurationSecs": 6721.644, "totalHrs": 3.87, "validHrs": 1.86 }, "vi": { "buckets": { "dev": 285, "invalidated": 349, "other": 11417, "reported": 186, "test": 1302, "train": 2475, "validated": 4681 }, "reportedSentences": 185, "duration": 65255012, "clips": 16447, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.02, "twenties": 0.19, "": 0.25, "teens": 0.21, "seventies": 0, "fourties": 0.02, "sixties": 0.31 }, "gender": { "male": 0.53, "": 0.25, "female": 0.2, "other": 0.02 } }, "users": 244, "size": 
379202663, "checksum": "c64e2958fa792eae2a13ec6603c5b0ef61a298baa7564a8d8b01ea32bf8613c9", "avgDurationSecs": 3.968, "validDurationSecs": 18572.306, "totalHrs": 18.12, "validHrs": 5.15 }, "or": { "buckets": { "dev": 342, "invalidated": 174, "other": 6123, "reported": 21, "test": 226, "train": 482, "validated": 1217 }, "reportedSentences": 22, "duration": 37845672, "clips": 7514, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.19, "": 0.08, "thirties": 0.72, "fourties": 0, "teens": 0 }, "gender": { "male": 0.88, "": 0.08, "female": 0.04 } }, "users": 92, "size": 270042628, "checksum": "fd75c2bfb9f6b3ceee9651fc6141240b2d70408eb256281283b18609875c3abf", "avgDurationSecs": 5.037, "validDurationSecs": 6129.649, "totalHrs": 10.51, "validHrs": 1.7 }, "ga-IE": { "buckets": { "dev": 514, "invalidated": 862, "other": 3890, "reported": 27, "test": 513, "train": 537, "validated": 4843 }, "reportedSentences": 28, "duration": 34059015, "clips": 9595, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.24, "": 0.37, "thirties": 0.26, "fourties": 0.06, "sixties": 0.01, "teens": 0.02, "fifties": 0.05 }, "gender": { "male": 0.5, "": 0.37, "female": 0.13, "other": 0 } }, "users": 168, "size": 235201624, "checksum": "35338eb00d3f88053516d182ec03a80ad61912fafe196e122813896f7b62abb8", "avgDurationSecs": 3.55, "validDurationSecs": 17191.017, "totalHrs": 9.46, "validHrs": 4.77 }, "fi": { "buckets": { "dev": 1651, "invalidated": 203, "other": 6227, "reported": 50, "test": 1753, "train": 2121, "validated": 7540 }, "reportedSentences": 51, "duration": 64206435, "clips": 13970, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.18, "": 0.35, "twenties": 0.11, "fourties": 0.32, "teens": 0.01, "fifties": 0.04, "seventies": 0 }, "gender": { "male": 0.29, "": 0.35, "female": 0.35, "other": 0 } }, "users": 201, "size": 382843013, "checksum": "405ca875072acac664033af14693b8d9ac5f13b1121fdfff8d01d9bcdc0aebc5", "avgDurationSecs": 4.596, "validDurationSecs": 34654.01, "totalHrs": 
17.83, "validHrs": 9.62 }, "hu": { "buckets": { "dev": 4580, "invalidated": 935, "other": 2326, "reported": 105, "test": 4871, "train": 7744, "validated": 17245 }, "reportedSentences": 106, "duration": 101285317, "clips": 20506, "splits": { "accent": { "": 1 }, "age": { "teens": 0.08, "": 0.28, "thirties": 0.18, "twenties": 0.37, "fifties": 0.06, "fourties": 0.02, "sixties": 0.01 }, "gender": { "male": 0.57, "": 0.28, "female": 0.14 } }, "users": 251, "size": 649364981, "checksum": "11d04b0076a86966fcfbf0106bd972937915de6cfe2281ffea66a851833c0032", "avgDurationSecs": 4.939, "validDurationSecs": 85178.255, "totalHrs": 28.13, "validHrs": 23.66 }, "th": { "buckets": { "dev": 10964, "invalidated": 8948, "other": 196059, "reported": 4144, "test": 10964, "train": 32148, "validated": 139710 }, "reportedSentences": 4144, "duration": 1440985512, "clips": 344717, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.21, "": 0.42, "thirties": 0.07, "fourties": 0.04, "teens": 0.04, "fifties": 0.21, "eighties": 0, "sixties": 0 }, "gender": { "male": 0.4, "": 0.42, "female": 0.18, "other": 0.01 } }, "users": 7755, "size": 8380070534, "checksum": "4b8307f8b106afc87fce830a33784cf98505fc4275c3dec01441343983d3febb", "avgDurationSecs": 4.18, "validDurationSecs": 584015.543, "totalHrs": 400.27, "validHrs": 162.22 }, "lt": { "buckets": { "dev": 3701, "invalidated": 562, "other": 1200, "reported": 147, "test": 3785, "train": 5196, "validated": 12698 }, "reportedSentences": 147, "duration": 74465432, "clips": 14460, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.34, "": 0.24, "thirties": 0.28, "fifties": 0.05, "sixties": 0.01, "teens": 0.03, "fourties": 0.05 }, "gender": { "male": 0.62, "": 0.24, "female": 0.14 } }, "users": 267, "size": 457408146, "checksum": "3fc9878dc6c9657a35054212286f05654dff2b407e1c3e067f6e4ebbc0d3e418", "avgDurationSecs": 5.15, "validDurationSecs": 65391.567, "totalHrs": 20.68, "validHrs": 18.16 }, "lg": { "buckets": { "dev": 13291, "invalidated": 
38909, "other": 42484, "reported": 6448, "test": 13288, "train": 66273, "validated": 266078 }, "reportedSentences": 6443, "duration": 2009919807, "clips": 347471, "splits": { "accent": { "": 1 }, "age": { "": 0.26, "thirties": 0.22, "twenties": 0.38, "fourties": 0.07, "fifties": 0.05, "teens": 0.01, "nineties": 0, "sixties": 0.02, "seventies": 0, "eighties": 0 }, "gender": { "": 0.25, "female": 0.39, "male": 0.36 } }, "users": 643, "size": 11773327076, "checksum": "23ecc3417a4b97e19d8a8cf8ec986a5a9ef9f373011ac198a1cf153dbb37629f", "avgDurationSecs": 5.784, "validDurationSecs": 1539108.134, "totalHrs": 558.31, "validHrs": 427.53 }, "hi": { "buckets": { "dev": 2179, "invalidated": 692, "other": 3493, "reported": 122, "test": 2962, "train": 4437, "validated": 9622 }, "reportedSentences": 123, "duration": 66799860, "clips": 13807, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.34, "fourties": 0.03, "": 0.33, "thirties": 0.27, "teens": 0.01, "fifties": 0.01, "sixties": 0.01 }, "gender": { "male": 0.62, "female": 0.04, "": 0.34, "other": 0 } }, "users": 348, "size": 394566082, "checksum": "d79f1a1163b3603948b165069a719cced3ec83f97c24e484dd4450c96bf14020", "avgDurationSecs": 4.838, "validDurationSecs": 46552.347, "totalHrs": 18.55, "validHrs": 12.93 }, "bas": { "duration": 9991980, "buckets": { "dev": 457, "invalidated": 485, "other": 93, "reported": 7, "test": 458, "train": 763, "validated": 1678 }, "reportedSentences": 8, "clips": 2256, "splits": { "accent": { "": 1 }, "age": { "": 0.98, "fourties": 0.01, "teens": 0.01 }, "gender": { "": 0.98, "female": 0.02 } }, "users": 32, "size": 55578888, "checksum": "68461ede6419acfe13d92d956bc3a793fae7d3ea94429a886b35215d4d208719", "avgDurationSecs": 4.429, "validDurationSecs": 7431.978, "totalHrs": 2.77, "validHrs": 2.06 }, "sk": { "duration": 73060164, "buckets": { "dev": 2345, "invalidated": 729, "other": 287, "reported": 30, "test": 2330, "train": 2967, "validated": 17221 }, "reportedSentences": 31, "clips": 18237, 
"splits": { "accent": { "": 1 }, "age": { "": 0.52, "thirties": 0.22, "twenties": 0.06, "fourties": 0.1, "teens": 0.1 }, "gender": { "": 0.52, "male": 0.38, "female": 0.09, "other": 0.01 } }, "users": 150, "size": 408451086, "checksum": "a9e1f44d89f62f4ce6f0b7eb27481117dd52cfe64beb743ec39ca1329c778fe5", "avgDurationSecs": 4.006, "validDurationSecs": 68989.915, "totalHrs": 20.29, "validHrs": 19.16 }, "kmr": { "duration": 338145300, "buckets": { "dev": 3475, "invalidated": 2351, "other": 29453, "reported": 934, "test": 3484, "train": 4426, "validated": 52117 }, "reportedSentences": 934, "clips": 83921, "splits": { "accent": { "": 1 }, "age": { "": 0.49, "twenties": 0.35, "thirties": 0.07, "fourties": 0.03, "fifties": 0.05, "teens": 0.01, "sixties": 0 }, "gender": { "": 0.49, "male": 0.41, "female": 0.1, "other": 0 } }, "users": 533, "size": 1778365638, "checksum": "523182fa9217f44322ad9059e9290d621369a6f14856c4dd722482ce5b1458c8", "avgDurationSecs": 4.029, "validDurationSecs": 209996.528, "totalHrs": 93.92, "validHrs": 58.33 }, "bg": { "duration": 50306004, "buckets": { "dev": 1246, "invalidated": 422, "other": 2238, "reported": 156, "test": 1979, "train": 3212, "validated": 6448 }, "reportedSentences": 157, "clips": 9108, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.33, "thirties": 0.09, "": 0.41, "twenties": 0.16, "teens": 0.01, "sixties": 0 }, "gender": { "male": 0.53, "female": 0.06, "": 0.41 } }, "users": 74, "size": 293796811, "checksum": "b26b390025d8b701d0a44bf7c1feeedebe59d860e80df2e1ef94c78d0f790efa", "avgDurationSecs": 5.523, "validDurationSecs": 35614.088, "totalHrs": 13.97, "validHrs": 9.89 }, "kk": { "duration": 6894468, "buckets": { "dev": 356, "invalidated": 196, "other": 10, "reported": 38, "test": 380, "train": 453, "validated": 1194 }, "reportedSentences": 39, "clips": 1400, "splits": { "accent": { "": 1 }, "age": { "": 0.5, "thirties": 0.03, "twenties": 0.32, "teens": 0.06, "fifties": 0.1 }, "gender": { "": 0.51, "male": 0.46, "female": 
0.03 } }, "users": 85, "size": 39595254, "checksum": "751ed8770e259cf4377264e2585a243171d4989b0f00aeccc7cbbd4892026c27", "avgDurationSecs": 4.925, "validDurationSecs": 5879.996, "totalHrs": 1.91, "validHrs": 1.63 }, "ba": { "duration": 959352156, "buckets": { "dev": 14585, "invalidated": 7896, "other": 26, "reported": 866, "test": 14570, "train": 118945, "validated": 208802 }, "reportedSentences": 863, "clips": 216724, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.17, "": 0.3, "fourties": 0.06, "fifties": 0.05, "twenties": 0.17, "sixties": 0.2, "seventies": 0, "teens": 0.04 }, "gender": { "male": 0.3, "": 0.3, "female": 0.4 } }, "users": 891, "size": 5380966696, "checksum": "1744abaff7a497e011f3b1fbfdbd07bf10cbdda04d2ccf974b953f42d9bf3a42", "avgDurationSecs": 4.427, "validDurationSecs": 924284.569, "totalHrs": 266.48, "validHrs": 256.74 }, "gl": { "duration": 102669948, "buckets": { "dev": 3329, "invalidated": 597, "other": 8304, "reported": 235, "test": 3598, "train": 5021, "validated": 12056 }, "reportedSentences": 236, "clips": 20957, "splits": { "accent": { "": 1 }, "age": { "": 0.29, "thirties": 0.24, "fifties": 0.1, "twenties": 0.25, "fourties": 0.1, "teens": 0.02, "sixties": 0.01 }, "gender": { "": 0.31, "male": 0.35, "female": 0.34, "other": 0 } }, "users": 366, "size": 592911481, "checksum": "075f0dce6d595d075c79419f954d5447feea4a9b810ac544769d38b32f4a6844", "avgDurationSecs": 4.899, "validDurationSecs": 59063.267, "totalHrs": 28.51, "validHrs": 16.4 }, "ug": { "duration": 424477584, "buckets": { "dev": 3491, "invalidated": 3735, "other": 3, "reported": 281, "test": 3491, "train": 4421, "validated": 66794 }, "reportedSentences": 282, "clips": 70532, "splits": { "accent": { "": 1 }, "age": { "": 0.59, "fifties": 0.01, "twenties": 0.11, "thirties": 0.15, "fourties": 0.12, "teens": 0.01, "eighties": 0 }, "gender": { "": 0.59, "male": 0.32, "female": 0.08, "other": 0 } }, "users": 840, "size": 2468559258, "checksum": 
"294a53ea0a774aeb7e32dc58ded213b91f67d44a11162240e6ff8a26bf7b44e4", "avgDurationSecs": 6.018, "validDurationSecs": 401981.452, "totalHrs": 117.91, "validHrs": 111.66 }, "hy-AM": { "duration": 16876800, "buckets": { "dev": 359, "invalidated": 99, "other": 1274, "reported": 42, "test": 412, "train": 617, "validated": 1389 }, "reportedSentences": 43, "clips": 2762, "splits": { "accent": { "": 1 }, "age": { "": 0.37, "thirties": 0.13, "twenties": 0.37, "fifties": 0.04, "teens": 0.09 }, "gender": { "": 0.37, "male": 0.22, "female": 0.41 } }, "users": 65, "size": 99090030, "checksum": "9ce56a955233ccb2902a1f54aef399c7777a8a5ab217cee5cd7ef66908c8e6f9", "avgDurationSecs": 6.11, "validDurationSecs": 8487.283, "totalHrs": 4.68, "validHrs": 2.35 }, "be": { "duration": 5105023020, "buckets": { "dev": 15876, "invalidated": 29331, "other": 104334, "reported": 3132, "test": 15876, "train": 347261, "validated": 937345 }, "reportedSentences": 3131, "clips": 1071010, "splits": { "accent": { "": 1 }, "age": { "": 0.84, "fourties": 0.05, "thirties": 0.06, "twenties": 0.04, "teens": 0.01, "fifties": 0, "sixties": 0, "seventies": 0 }, "gender": { "": 0.83, "male": 0.07, "female": 0.09, "other": 0 } }, "users": 7903, "size": 28885754954, "checksum": "bcdb153eb0de0528452003a71f88de3998bf5640ca1ee7b18c59f42a502e1171", "avgDurationSecs": 4.767, "validDurationSecs": 4467902.076, "totalHrs": 1418.06, "validHrs": 1241.08 }, "ur": { "buckets": { "dev": 3303, "invalidated": 3304, "other": 85329, "reported": 50, "test": 3301, "train": 4130, "validated": 41863 }, "reportedSentences": 50, "duration": 506836512, "clips": 130496, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.88, "": 0.1, "fourties": 0.01, "thirties": 0, "teens": 0.01, "fifties": 0, "sixties": 0 }, "gender": { "male": 0.7, "": 0.1, "female": 0.2 } }, "users": 199, "size": 2946100085, "checksum": "b972c21ccbb81a9ce201e5b314fb7576207c9b26b902156944491c5ff018becf", "avgDurationSecs": 3.884, "validDurationSecs": 162592.699, 
"totalHrs": 140.78, "validHrs": 45.16 }, "gn": { "buckets": { "dev": 352, "invalidated": 137, "other": 5165, "reported": 33, "test": 808, "train": 1393, "validated": 2629 }, "reportedSentences": 34, "duration": 33540480, "clips": 7931, "splits": { "accent": { "": 1 }, "age": { "": 0.21, "twenties": 0.19, "thirties": 0.43, "sixties": 0, "fourties": 0.14, "teens": 0.02 }, "gender": { "": 0.21, "male": 0.59, "female": 0.19 } }, "users": 90, "size": 192661922, "checksum": "1254a51d19e9d93bf8eb24aa30beaef2ca408437ed3f6d03475b33a1ddc57b2e", "avgDurationSecs": 4.229, "validDurationSecs": 11118.134, "totalHrs": 9.31, "validHrs": 3.08 }, "sr": { "buckets": { "dev": 1037, "invalidated": 88, "other": 118, "reported": 27, "test": 1112, "train": 1380, "validated": 3552 }, "reportedSentences": 28, "duration": 10588968, "clips": 3758, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.49, "": 0.25, "fifties": 0.02, "fourties": 0.08, "thirties": 0.16, "teens": 0 }, "gender": { "male": 0.43, "": 0.25, "female": 0.32 } }, "users": 108, "size": 58901938, "checksum": "990f3583ce23c22bd2663a8d43911e52c193547ac6da186a4c46abdedc36122d", "avgDurationSecs": 2.818, "validDurationSecs": 10008.519, "totalHrs": 2.94, "validHrs": 2.78 }, "uz": { "buckets": { "dev": 11994, "invalidated": 13770, "other": 127652, "reported": 1823, "test": 12334, "train": 48186, "validated": 85770 }, "reportedSentences": 1805, "duration": 941558436, "clips": 227192, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.4, "": 0.41, "thirties": 0.01, "teens": 0.18, "fifties": 0, "fourties": 0.01, "nineties": 0 }, "gender": { "male": 0.44, "": 0.41, "female": 0.15, "other": 0 } }, "users": 2092, "size": 5251668625, "checksum": "deaacaac5077e41cc67e219ea0d59ad78281e515bfe48d107c327535775da6d3", "avgDurationSecs": 4.144, "validDurationSecs": 355459.114, "totalHrs": 261.54, "validHrs": 98.73 }, "mr": { "buckets": { "dev": 1678, "invalidated": 2241, "other": 2803, "reported": 57, "test": 1827, "train": 2238, 
"validated": 10826 }, "reportedSentences": 58, "duration": 98264520, "clips": 15870, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.13, "sixties": 0, "twenties": 0.29, "": 0.05, "teens": 0.53 }, "gender": { "male": 0.19, "female": 0.76, "": 0.05 } }, "users": 79, "size": 575937746, "checksum": "a36ae0bebf5348a9c70e77f785411344f07b008aa186eae6ea27c81b9bda1b08", "avgDurationSecs": 6.192, "validDurationSecs": 67032.873, "totalHrs": 27.29, "validHrs": 18.62 }, "da": { "buckets": { "dev": 2105, "invalidated": 343, "other": 66, "reported": 326, "test": 2153, "train": 2734, "validated": 8756 }, "reportedSentences": 326, "duration": 38640996, "clips": 9165, "splits": { "accent": { "": 1 }, "age": { "": 0.33, "thirties": 0.29, "twenties": 0.2, "sixties": 0, "fourties": 0.14, "fifties": 0.03, "teens": 0 }, "gender": { "": 0.33, "female": 0.07, "male": 0.59 } }, "users": 217, "size": 221769760, "checksum": "4aa13fd019c3f6e47d975b1c604a643ee39eea3287531d4af0c614222dbfa16c", "avgDurationSecs": 4.216, "validDurationSecs": 36916.591, "totalHrs": 10.73, "validHrs": 10.25 }, "myv": { "buckets": { "dev": 239, "invalidated": 21, "other": 0, "reported": 30, "test": 449, "train": 1241, "validated": 1939 }, "reportedSentences": 31, "duration": 11274696, "clips": 1960, "splits": { "accent": { "": 1 }, "age": { "sixties": 0.26, "": 0.39, "thirties": 0.25, "twenties": 0.09, "teens": 0.01 }, "gender": { "male": 0.54, "": 0.39, "female": 0.07 } }, "users": 12, "size": 65970315, "checksum": "8319a9a7b2183855e657d954bab009e96f35dc7abe83ac639d19b4fa43529d0e", "avgDurationSecs": 5.752, "validDurationSecs": 11153.896, "totalHrs": 3.13, "validHrs": 3.09 }, "nn-NO": { "duration": 3312612, "buckets": { "dev": 168, "invalidated": 15, "other": 10, "reported": 14, "test": 223, "train": 314, "validated": 710 }, "reportedSentences": 15, "clips": 735, "splits": { "accent": { "": 1 }, "age": { "": 0.39, "thirties": 0.35, "twenties": 0.2, "fourties": 0.04, "fifties": 0.01 }, "gender": { "": 0.39, 
"female": 0.2, "male": 0.37, "other": 0.03 } }, "users": 26, "size": 18754220, "checksum": "39c5eb3a6f41d5b3f7aed4d36a1ca9bbf9ad73883ed4bd2989ea4fab341e6e23", "avgDurationSecs": 4.507, "validDurationSecs": 3199.938, "totalHrs": 0.92, "validHrs": 0.88 }, "ha": { "duration": 43750080, "buckets": { "dev": 579, "invalidated": 169, "other": 6660, "reported": 32, "test": 650, "train": 1930, "validated": 3266 }, "reportedSentences": 32, "clips": 10095, "splits": { "accent": { "": 1 }, "age": { "": 0.17, "thirties": 0.68, "twenties": 0.12, "fourties": 0, "fifties": 0.02 }, "gender": { "": 0.16, "male": 0.59, "female": 0.25 } }, "users": 36, "size": 255945833, "checksum": "57868f512cd681a137a0b73eda1cd3a5da21800bf6e9f8e406092878d38ae985", "avgDurationSecs": 4.334, "validDurationSecs": 14154.31, "totalHrs": 12.15, "validHrs": 3.93 }, "ckb": { "duration": 451110708, "buckets": { "dev": 4924, "invalidated": 7242, "other": 7897, "reported": 2358, "test": 4929, "train": 7010, "validated": 100425 }, "reportedSentences": 2358, "clips": 115564, "splits": { "accent": { "": 1 }, "age": { "": 0.35, "thirties": 0.12, "twenties": 0.47, "fourties": 0.03, "teens": 0.02, "fifties": 0.02 }, "gender": { "": 0.33, "male": 0.6, "female": 0.07, "other": 0 } }, "users": 1227, "size": 2442625830, "checksum": "04c67dd99441c062a12938a9ec8ce974b033b8c57c0a065fc6f311bf271251cd", "avgDurationSecs": 3.904, "validDurationSecs": 392014.752, "totalHrs": 125.3, "validHrs": 108.89 }, "ml": { "duration": 10411416, "buckets": { "dev": 0, "invalidated": 14, "other": 1903, "reported": 115, "test": 146, "train": 459, "validated": 605 }, "reportedSentences": 116, "clips": 2522, "splits": { "accent": { "": 1 }, "age": { "": 0.46, "twenties": 0.49, "thirties": 0.05, "fourties": 0 }, "gender": { "": 0.46, "male": 0.54 } }, "users": 31, "size": 60278242, "checksum": "45d67f6793a80ce63d46afeebcaae44aa1475d06fbeb114c93352ef6f59aaf02", "avgDurationSecs": 4.128, "validDurationSecs": 2497.584, "totalHrs": 2.89, 
"validHrs": 0.69 }, "mdf": { "duration": 1811340, "buckets": { "dev": 54, "invalidated": 8, "other": 9, "reported": 14, "test": 101, "train": 173, "validated": 328 }, "reportedSentences": 15, "clips": 345, "splits": { "accent": { "": 1 }, "age": { "sixties": 0.06, "": 0.58, "fourties": 0.34, "twenties": 0.02 }, "gender": { "male": 0.08, "": 0.58, "female": 0.34 } }, "users": 11, "size": 10642651, "checksum": "6655d3c62187fedda01c2c3e9d22f80b254265c6323cc14ca2f4bf9921d0ba11", "avgDurationSecs": 5.25, "validDurationSecs": 1722.086, "totalHrs": 0.5, "validHrs": 0.47 }, "sw": { "duration": 3088677492, "buckets": { "dev": 10977, "invalidated": 60533, "other": 293496, "reported": 1865, "test": 10978, "train": 32277, "validated": 227042 }, "reportedSentences": 1860, "clips": 581071, "splits": { "accent": { "": 1 }, "age": { "": 0.35, "twenties": 0.42, "thirties": 0.13, "teens": 0, "fifties": 0.05, "fourties": 0.04, "sixties": 0.01 }, "gender": { "": 0.32, "male": 0.35, "female": 0.33, "other": 0 } }, "users": 978, "size": 18027758464, "checksum": "367f6cf3ae5aebf698642a08a6a075da84b1439985ff46e3bf43b17ffcfb3cde", "avgDurationSecs": 5.315, "validDurationSecs": 1206839.638, "totalHrs": 857.96, "validHrs": 335.23 }, "sat": { "buckets": { "dev": 0, "invalidated": 10, "other": 373, "reported": 6, "test": 126, "train": 279, "validated": 405 }, "reportedSentences": 7, "duration": 3531996, "clips": 788, "splits": { "accent": { "": 1 }, "age": { "": 0.42, "twenties": 0.44, "fourties": 0.01, "fifties": 0.01, "teens": 0.01, "thirties": 0.11 }, "gender": { "": 0.4, "male": 0.59, "female": 0.01 } }, "users": 12, "size": 19459847, "checksum": "dc415e84c9ac25d4536ddb4213ea99382f335995178b3a9ea7a062a4a2112807", "avgDurationSecs": 4.482, "validDurationSecs": 1815.303, "totalHrs": 0.98, "validHrs": 0.5 }, "tig": { "buckets": { "dev": 0, "invalidated": 12, "other": 0, "reported": 0, "test": 1, "train": 10, "validated": 11 }, "reportedSentences": 1, "duration": 103284, "clips": 23, "splits": 
{ "accent": { "": 1 }, "age": { "": 0.78, "twenties": 0.22 }, "gender": { "": 0.78, "male": 0.22 } }, "users": 5, "size": 603426, "checksum": "dccaf792dfa26edc61cb84c12068a9493e8d611f338c6a655c5587aabaf50849", "avgDurationSecs": 4.491, "validDurationSecs": 49.397, "totalHrs": 0.02, "validHrs": 0.01 }, "ig": { "buckets": { "dev": 2, "invalidated": 2, "other": 5708, "reported": 14, "test": 4, "train": 8, "validated": 14 }, "reportedSentences": 14, "duration": 31307832, "clips": 5724, "splits": { "accent": { "": 1 }, "age": { "": 0.56, "twenties": 0.32, "teens": 0.05, "eighties": 0, "thirties": 0.04, "sixties": 0.02, "fourties": 0 }, "gender": { "": 0.56, "male": 0.13, "female": 0.3 } }, "users": 108, "size": 182819142, "checksum": "ef428d7f3099020060e3933903414a27dad453871924873f3855991814b6191a", "avgDurationSecs": 5.47, "validDurationSecs": 76.574, "totalHrs": 8.69, "validHrs": 0.02 }, "nan-tw": { "buckets": { "dev": 654, "invalidated": 304, "other": 10164, "reported": 140, "test": 1032, "train": 1646, "validated": 3340 }, "reportedSentences": 141, "duration": 36867924, "clips": 13808, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.2, "": 0.14, "twenties": 0.39, "fourties": 0.23, "teens": 0.02, "fifties": 0.01, "sixties": 0 }, "gender": { "male": 0.51, "": 0.14, "other": 0.12, "female": 0.23 } }, "users": 117, "size": 205255675, "checksum": "4643ba4dcc7a658e3d09d8c5ea8bc144a94f2707bb51a6be8778efc1a6e51348", "avgDurationSecs": 2.67, "validDurationSecs": 8917.936, "totalHrs": 10.24, "validHrs": 2.47 }, "mhr": { "duration": 651511116, "buckets": { "dev": 12905, "invalidated": 4587, "other": 24002, "reported": 65, "test": 13557, "train": 82828, "validated": 111101 }, "reportedSentences": 64, "clips": 139690, "splits": { "accent": { "": 1 }, "age": { "fifties": 0.06, "": 0.17, "sixties": 0.05, "thirties": 0.28, "fourties": 0.16, "twenties": 0.22, "teens": 0.06, "seventies": 0.01 }, "gender": { "male": 0.21, "": 0.16, "female": 0.62 } }, "users": 318, "size": 
3658505779, "checksum": "e7b89f49ae49e710d811e2eb5c1342db6ed680c498236888b0850ec98fb1a698", "avgDurationSecs": 4.664, "validDurationSecs": 518172.643, "totalHrs": 180.97, "validHrs": 143.93 }, "bn": { "duration": 4503387024, "buckets": { "dev": 9168, "invalidated": 7558, "other": 985005, "reported": 1811, "test": 9168, "train": 20422, "validated": 42900 }, "reportedSentences": 1805, "clips": 1035463, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.03, "twenties": 0.67, "": 0.22, "teens": 0.07, "fourties": 0.01, "fifties": 0 }, "gender": { "male": 0.54, "": 0.22, "female": 0.24, "other": 0 } }, "users": 22647, "size": 25940826446, "checksum": "545c840317cd3ed7c0720bad8bbad010674fd1428d01c192e5d87c449c7c46b8", "avgDurationSecs": 4.349, "validDurationSecs": 253139.848, "totalHrs": 1250.94, "validHrs": 70.31 }, "tok": { "duration": 37083996, "buckets": { "dev": 1934, "invalidated": 202, "other": 1619, "reported": 90, "test": 1889, "train": 2450, "validated": 8180 }, "reportedSentences": 91, "clips": 10001, "splits": { "accent": { "": 1 }, "age": { "": 0.42, "twenties": 0.19, "teens": 0.28, "thirties": 0.11, "fourties": 0 }, "gender": { "": 0.42, "male": 0.45, "other": 0.05, "female": 0.08 } }, "users": 94, "size": 215735669, "checksum": "c82e6d9e3e2edcdc05883c26d941cbc834cd513174dc76a0e825af68b85a0346", "avgDurationSecs": 3.708, "validDurationSecs": 30331.676, "totalHrs": 10.3, "validHrs": 8.42 }, "yue": { "duration": 208784772, "buckets": { "dev": 2476, "invalidated": 1596, "other": 29736, "reported": 785, "test": 2494, "train": 2959, "validated": 19579 }, "reportedSentences": 786, "clips": 50911, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.14, "": 0.38, "twenties": 0.39, "fourties": 0.03, "sixties": 0.01, "fifties": 0, "teens": 0.04 }, "gender": { "male": 0.31, "": 0.43, "female": 0.24, "other": 0.02 } }, "users": 788, "size": 1161443871, "checksum": "4d1684c3a2603d4e1a8118b8e04eaf42d441347199161df549e2e804d7c8b813", "avgDurationSecs": 4.101, 
"validDurationSecs": 80293.003, "totalHrs": 57.99, "validHrs": 22.3 }, "sah": { "duration": 24576564, "buckets": { "dev": 1083, "invalidated": 102, "other": 1, "reported": 2, "test": 1250, "train": 1594, "validated": 3995 }, "reportedSentences": 3, "clips": 4098, "splits": { "accent": { "": 1 }, "age": { "": 0.36, "twenties": 0.03, "fourties": 0.07, "thirties": 0.43, "teens": 0.1, "fifties": 0 }, "gender": { "": 0.36, "male": 0.53, "female": 0.1 } }, "users": 54, "size": 186698156, "checksum": "6c6dac95bba33d77ec4d6e8ae8f6ce7ac2dba66ac38269b1ea3e4bd20a316af6", "avgDurationSecs": 5.997, "validDurationSecs": 23958.851, "totalHrs": 6.82, "validHrs": 6.65 }, "mk": { "duration": 979452, "buckets": { "dev": 0, "invalidated": 11, "other": 41, "reported": 8, "test": 17, "train": 115, "validated": 132 }, "reportedSentences": 9, "clips": 184, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.48, "": 0.33, "twenties": 0.11, "teens": 0.08 }, "gender": { "male": 0.67, "": 0.33 } }, "users": 6, "size": 5764067, "checksum": "58de04279d5676acd329a8ed4f44e656b69e2e473bb7c4504a0cca9489127858", "avgDurationSecs": 5.323, "validDurationSecs": 702.65, "totalHrs": 0.27, "validHrs": 0.19 }, "sc": { "duration": 5936256, "buckets": { "dev": 230, "invalidated": 25, "other": 320, "reported": 2, "test": 170, "train": 458, "validated": 1000 }, "reportedSentences": 3, "clips": 1345, "splits": { "accent": { "": 1 }, "age": { "": 0.62, "thirties": 0.31, "twenties": 0.08 }, "gender": { "": 0.62, "female": 0.31, "male": 0.08 } }, "users": 12, "size": 34304699, "checksum": "a5298b8da4367be2a2c45e44c3df88826466c30f9108b9889d6d248313ff64dc", "avgDurationSecs": 4.414, "validDurationSecs": 4413.573, "totalHrs": 1.64, "validHrs": 1.22 }, "skr": { "duration": 21367800, "buckets": { "dev": 982, "invalidated": 427, "other": 1728, "reported": 16, "test": 825, "train": 1256, "validated": 3068 }, "reportedSentences": 2, "clips": 5223, "splits": { "accent": { "": 1 }, "age": { "": 0.37, "twenties": 0.21, 
"fifties": 0.41, "teens": 0, "fourties": 0, "thirties": 0.01 }, "gender": { "": 0.37, "male": 0.63 } }, "users": 40, "size": 122716244, "checksum": "9351a40b7de22a00d63f049b48d3a9360c7e94001613cf36a2d82a43d68a1fb9", "avgDurationSecs": 4.091, "validDurationSecs": 12551.486, "totalHrs": 5.93, "validHrs": 3.48 }, "ti": { "duration": 132912, "buckets": { "dev": 0, "invalidated": 0, "other": 10, "reported": 1, "test": 3, "train": 10, "validated": 13 }, "reportedSentences": 2, "clips": 23, "splits": { "accent": { "": 1 }, "age": { "": 0.35, "fourties": 0.22, "thirties": 0.43 }, "gender": { "": 0.35, "male": 0.65 } }, "users": 4, "size": 785220, "checksum": "428863f3968718e42946382b054a74092c23d96390a6e85183f3d076b841a063", "avgDurationSecs": 5.779, "validDurationSecs": 75.124, "totalHrs": 0.03, "validHrs": 0.02 }, "mrj": { "duration": 85568148, "buckets": { "dev": 3284, "invalidated": 258, "other": 6087, "reported": 49, "test": 4137, "train": 7173, "validated": 14722 }, "reportedSentences": 50, "clips": 21067, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.23, "": 0.12, "thirties": 0.25, "sixties": 0.05, "fourties": 0.16, "fifties": 0.17, "teens": 0.01 }, "gender": { "male": 0.22, "female": 0.65, "": 0.12 } }, "users": 35, "size": 483615482, "checksum": "9b408b04da3d532c0e4b26cbdaf8f34e4802c078672e427c1db4ae42d16eae4f", "avgDurationSecs": 4.062, "validDurationSecs": 59796.567, "totalHrs": 23.76, "validHrs": 16.61 }, "tw": { "duration": 915372, "buckets": { "dev": 0, "invalidated": 0, "other": 217, "reported": 2, "test": 3, "train": 12, "validated": 15 }, "reportedSentences": 3, "clips": 232, "splits": { "accent": { "": 1 }, "age": { "": 0.05, "twenties": 0.02, "thirties": 0.93 }, "gender": { "": 0.05, "male": 0.95 } }, "users": 4, "size": 5360520, "checksum": "57646a1ce316fd4c2e9aafd62a0fc8655fbb29f154004c82ab0ad68f089aca2c", "avgDurationSecs": 3.946, "validDurationSecs": 59.184, "totalHrs": 0.25, "validHrs": 0.01 }, "ko": { "duration": 7663320, "buckets": { 
"dev": 30, "invalidated": 38, "other": 1009, "reported": 1, "test": 29, "train": 94, "validated": 154 }, "reportedSentences": 2, "clips": 1201, "splits": { "accent": { "": 1 }, "age": { "": 0.13, "thirties": 0.49, "twenties": 0.3, "teens": 0.03, "fourties": 0.04 }, "gender": { "": 0.13, "male": 0.68, "female": 0.18 } }, "users": 20, "size": 44544935, "checksum": "3fa0d3ca796efd169472b291c1af461058edec00a47eb6bc8803afdd87bf86f9", "avgDurationSecs": 6.381, "validDurationSecs": 982.641, "totalHrs": 2.12, "validHrs": 0.27 }, "yo": { "duration": 741384, "buckets": { "dev": 26, "invalidated": 20, "other": 7, "reported": 3, "test": 27, "train": 39, "validated": 92 }, "reportedSentences": 1, "clips": 119, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.13, "twenties": 0.26, "": 0.61 }, "gender": { "male": 0.3, "": 0.61, "female": 0.08 } }, "users": 16, "size": 3998282, "checksum": "4c5656e106018b0455d74b22e7c65d313e751c43fe20b67931e88ac591bd16da", "avgDurationSecs": 6.23, "validDurationSecs": 573.171, "totalHrs": 0.2, "validHrs": 0.15 }, "vot": { "duration": 1025976, "buckets": { "dev": 0, "invalidated": 324, "other": 0, "test": 6, "train": 96, "validated": 102 }, "clips": 426, "splits": { "accent": { "": 1 }, "age": { "": 0.25, "twenties": 0.73, "teens": 0.01 }, "gender": { "": 0.25, "male": 0.75 } }, "users": 5, "size": 7892370, "checksum": "ff4dcc62db098cc0400a7ae781a852aef363ba6f16b93c14c411403d3256c58b", "avgDurationSecs": 2.408, "validDurationSecs": 245.656, "totalHrs": 0.28, "validHrs": 0.06 }, "az": { "duration": 1019304, "buckets": { "dev": 20, "invalidated": 31, "other": 76, "reported": 0, "test": 22, "train": 39, "validated": 81 }, "clips": 188, "splits": { "accent": { "": 1 }, "age": { "": 0.53, "twenties": 0.46, "fourties": 0.02 }, "gender": { "": 0.53, "male": 0.47 } }, "users": 18, "size": 5966004, "checksum": "0763b38b5b64fbcccff69d47ad2a9958c416c0ab37f20472af8eaa299200b0d2", "avgDurationSecs": 5.422, "validDurationSecs": 439.168, "totalHrs": 0.28, 
"validHrs": 0.12 }, "ast": { "duration": 921168, "buckets": { "dev": 0, "invalidated": 0, "other": 96, "test": 0, "train": 118, "validated": 118 }, "clips": 214, "splits": { "accent": { "": 1 }, "age": { "": 1 }, "gender": { "": 1 } }, "users": 2, "size": 5399433, "checksum": "20f097653d3648a27c99df690cd14adb0ea63646b3270d31b1ff6c9f085b6768", "avgDurationSecs": 4.305, "validDurationSecs": 507.934, "totalHrs": 0.25, "validHrs": 0.14 }, "ne-NP": { "duration": 2129328, "buckets": { "dev": 47, "invalidated": 32, "other": 229, "test": 66, "train": 167, "validated": 280 }, "clips": 541, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.31, "": 0.48, "twenties": 0.21 }, "gender": { "male": 0.52, "": 0.48 } }, "users": 13, "size": 12365944, "checksum": "b8fd5ae4a948fc403e487da22ea47f21829a750277b1f3ef2ade3035a143065c", "avgDurationSecs": 3.936, "validDurationSecs": 1102.055, "totalHrs": 0.59, "validHrs": 0.3 }, "quy": { "duration": 4320, "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 1, "validated": 1 }, "clips": 1, "splits": { "accent": { "": 1 }, "age": { "": 1 }, "gender": { "": 1 } }, "users": 1, "size": 25957, "checksum": "c4994b7c44605ff7129011993cd0d0acb127b7102d40a186a000c2ca7a5fab5a", "avgDurationSecs": 4.32, "validDurationSecs": 4.32, "totalHrs": 0, "validHrs": 0 }, "oc": { "duration": 658872, "buckets": { "dev": 20, "invalidated": 19, "other": 23, "test": 17, "train": 41, "validated": 78 }, "clips": 120, "splits": { "accent": { "": 1 }, "age": { "": 0.92, "fifties": 0.08 }, "gender": { "": 0.92, "male": 0.08 } }, "users": 10, "size": 3783472, "checksum": "4aa7ea6618b08881bc05c1e90096c0d456e80752f32ee4fe02dd40761c476b8e", "avgDurationSecs": 5.491, "validDurationSecs": 428.267, "totalHrs": 0.18, "validHrs": 0.11 } }, "totalDuration": 94028400000, "totalValidDurationSecs": 61656878, "totalHrs": 26119, "totalValidHrs": 17127 } ================================================ FILE: 
datasets/scripted-speech/cv-corpus-12.0-delta-2022-12-07.json ================================================ { "locales": { "en": { "duration": 225833112, "reportedSentences": 192, "clips": 40707, "users": 1152, "size": 1314755828, "checksum": "0efd86ca6b40641b55d1411b7d3b1f1ab8626de4b207504953706df201d198a5", "avgDurationSecs": 5.166, "validDurationSecs": 226812.34, "totalHrs": 62.73, "validHrs": 63.01 }, "fa": { "reportedSentences": 64, "duration": 15506532, "clips": 4509, "users": 27, "size": 88997465, "checksum": "4b9635a8f2f14198ae8bc6a31245b5a0611d90edd8df850d0f67554b27c1a94e", "avgDurationSecs": 3.992, "validDurationSecs": 24150.74, "totalHrs": 4.31, "validHrs": 6.7 }, "fr": { "reportedSentences": 245, "duration": 175096908, "clips": 35410, "users": 345, "size": 1020859073, "checksum": "00afc519d48d749a4724386dc203b8a0286060efe4ccb46963555794fef216eb", "avgDurationSecs": 5.001, "validDurationSecs": 71731.92, "totalHrs": 48.64, "validHrs": 19.92 }, "es": { "reportedSentences": 24, "duration": 184745664, "clips": 36655, "users": 373, "size": 1079846567, "checksum": "04e90a4bc52d1f0f9997f1b56ec3c636af09c411dc0437ca50df3c67a137c07a", "avgDurationSecs": 4.863, "validDurationSecs": 209248.5, "totalHrs": 51.32, "validHrs": 58.13 }, "sl": { "reportedSentences": 3, "duration": 3050316, "clips": 906, "users": 1, "size": 17738726, "checksum": "fe68f6c28f38918388128bba5c6999ec478e508d8b7af5cbfa9e51c339bba67e", "avgDurationSecs": 3.784, "validDurationSecs": 673.7, "totalHrs": 0.85, "validHrs": 0.19 }, "kab": { "reportedSentences": 29, "duration": 6310332, "clips": 1630, "users": 11, "size": 36226158, "checksum": "683057ee2860eada3a5ed5874ccb55ee5cd664f1e95f9876282f54404f7cc955", "avgDurationSecs": 3.336, "validDurationSecs": 1192.89, "totalHrs": 1.76, "validHrs": 0.33 }, "cy": { "reportedSentences": 5, "duration": 1907280, "clips": 307, "users": 10, "size": 10804250, "checksum": "1bb1e1bba095aa4d022ef949dda8b2a806ef2a20d3f135f37b5d12723290db14", "avgDurationSecs": 
4.863, "validDurationSecs": 2351.58, "totalHrs": 0.53, "validHrs": 0.65 }, "ca": { "reportedSentences": 143, "duration": 604876608, "clips": 113561, "users": 663, "size": 3407934221, "checksum": "c9b0f4916532cd00782f1e155bd376e3e6f6f4829f73a8536c8aa9be5e773afa", "avgDurationSecs": 5.494, "validDurationSecs": 597917.78, "totalHrs": 168.02, "validHrs": 166.09 }, "de": { "reportedSentences": 402, "duration": 178791228, "clips": 31566, "users": 272, "size": 1041733376, "checksum": "2b8f54b28022d483f22c6a203a998d50fe946911b347f7e901113219ce5e9ab6", "avgDurationSecs": 5.185, "validDurationSecs": 169925.63, "totalHrs": 49.66, "validHrs": 47.2 }, "tt": { "reportedSentences": 0, "duration": 238212, "clips": 58, "users": 6, "size": 1395558, "checksum": "34edaf782e34e67177710eb44ac85cf14b488ad177c7f883996f42334939840b", "avgDurationSecs": 3.755, "validDurationSecs": 1112.65, "totalHrs": 0.07, "validHrs": 0.3 }, "ta": { "reportedSentences": 18, "duration": 3550680, "clips": 571, "users": 8, "size": 20439694, "checksum": "20835f24c7f3bf8cb7cc60e6c4b368606a6f8988a1395657ca4f64395ca6a001", "avgDurationSecs": 6.215, "validDurationSecs": 8049.29, "totalHrs": 0.99, "validHrs": 2.24 }, "ru": { "reportedSentences": 25, "duration": 50888520, "clips": 9722, "users": 84, "size": 297985268, "checksum": "b7c55379cbb5a7fae18a8343bb5c6512587165c9a7b258de921b48484aef82c2", "avgDurationSecs": 5.165, "validDurationSecs": 86951.86, "totalHrs": 14.13, "validHrs": 24.16 }, "nl": { "reportedSentences": 5, "duration": 3716280, "clips": 732, "users": 27, "size": 21524017, "checksum": "674cf7146187d88957fa5592a53ce05025fba5c79353a06227694a13be998af0", "avgDurationSecs": 4.324, "validDurationSecs": 3248.76, "totalHrs": 1.04, "validHrs": 0.9 }, "it": { "reportedSentences": 381, "duration": 44371080, "clips": 8202, "users": 65, "size": 256491338, "checksum": "00e4954197195361dbf0a919afbe99c735ac3128ce66e085037b13eba24a8d41", "avgDurationSecs": 5.362, "validDurationSecs": 41881.01, "totalHrs": 12.33, 
"validHrs": 11.63 }, "eu": { "reportedSentences": 19, "duration": 6298560, "clips": 1298, "users": 21, "size": 36182026, "checksum": "a669cbc38edd57d54441b1db84a4becad117c99c75f6ea973318741e7e66fd74", "avgDurationSecs": 5.187, "validDurationSecs": 1437.42, "totalHrs": 1.75, "validHrs": 0.4 }, "tr": { "reportedSentences": 8, "duration": 26958564, "clips": 8290, "users": 64, "size": 137592877, "checksum": "778825e1dc4a29f4fa5201e7bac51740a205763c6055632636dffc6aea669342", "avgDurationSecs": 3.622, "validDurationSecs": 26016.26, "totalHrs": 7.48, "validHrs": 7.23 }, "ar": { "reportedSentences": 14, "clips": 994, "users": 77, "size": 24071337, "checksum": "b2a7db7b89bc919d365c56ed31d78733eb4270abb59cddae29bde215093ae6e1", "duration": 4143986, "avgDurationSecs": 4.169, "validDurationSecs": 1142.31, "totalHrs": 1.15, "validHrs": 0.32 }, "zh-TW": { "reportedSentences": 2, "duration": 12348324, "clips": 3173, "users": 17, "size": 47424084, "checksum": "76d1ec2365724dbaae27b99bc69e526ad006b8edbcacb535f0f30979eb352014", "avgDurationSecs": 3.314, "validDurationSecs": 3960.09, "totalHrs": 3.43, "validHrs": 1.1 }, "br": { "reportedSentences": 3, "duration": 8650404, "clips": 2684, "users": 0, "size": 50165035, "checksum": "22dcc7f11a79437446cac99e9b4406661343a44ab95ba888599a9719f6dff5e3", "avgDurationSecs": 3.134, "validDurationSecs": 154.21, "totalHrs": 2.41, "validHrs": 0.04 }, "pt": { "reportedSentences": 55, "duration": 107311824, "clips": 28657, "users": 346, "size": 619867760, "checksum": "c2dda2673732a414967a609cbbcdda384ce29be8b81f510a98deb5163f3fd13b", "avgDurationSecs": 4.094, "validDurationSecs": 51007.32, "totalHrs": 29.81, "validHrs": 14.17 }, "eo": { "reportedSentences": 49, "duration": 10352008, "clips": 1706, "users": 17, "size": 62081647, "checksum": "5e19e7bc2c690ccdef66860b0a7ddb6cb721a44cc785d1a1887fd25be5f6b6d3", "avgDurationSecs": 6.068, "validDurationSecs": 637.14, "totalHrs": 2.87, "validHrs": 0.18 }, "zh-CN": { "reportedSentences": 20, "duration": 
15906528, "clips": 3749, "users": 189, "size": 89189528, "checksum": "20c88b6413084e8762cff0d27953d5912ba8d88b2c662b707f047b3269209610", "avgDurationSecs": 4.549, "validDurationSecs": 338966.07, "totalHrs": 4.42, "validHrs": 94.16 }, "id": { "reportedSentences": 62, "duration": 19854468, "clips": 5514, "users": 37, "size": 114794042, "checksum": "9567242d629cd7788efe19bc8b88576c6810d37d5ceb3f3248428d062c4ceb5e", "avgDurationSecs": 4.003, "validDurationSecs": 4808.29, "totalHrs": 5.52, "validHrs": 1.34 }, "ia": { "reportedSentences": 2, "duration": 278676, "clips": 74, "users": 0, "size": 1627299, "checksum": "d64e20bc8fd084bc00cf7782ce392c9734c25ce648a73d5544505d891d0b7bd3", "avgDurationSecs": 4.181, "validDurationSecs": 356.17, "totalHrs": 0.07, "validHrs": 0.1 }, "lv": { "reportedSentences": 6, "duration": 5767092, "clips": 1582, "users": 9, "size": 32965770, "checksum": "a9168e15dd608ab50f1de913d0df1453bea9471ddeb117c0aa4ae5e329f740b4", "avgDurationSecs": 3.447, "validDurationSecs": 1885.76, "totalHrs": 1.6, "validHrs": 0.53 }, "ja": { "reportedSentences": 12, "duration": 324517644, "clips": 78096, "users": 882, "size": 1898383278, "checksum": "f573fa5aff77257374832592f374b16b1eeb37d882211b2cf05d820bcebd8fd7", "avgDurationSecs": 4.376, "validDurationSecs": 69982.22, "totalHrs": 90.14, "validHrs": 19.44 }, "rw": { "reportedSentences": 0, "duration": 33804, "clips": 10, "users": 2, "size": -381766, "checksum": "07e467fd9c658db093dd33b1479abc52393df99dc475e154ed9eaebc215206ea", "avgDurationSecs": 5.008, "validDurationSecs": 256.76, "totalHrs": 0.01, "validHrs": 0.07 }, "sv-SE": { "reportedSentences": 2, "duration": 2457000, "clips": 536, "users": 14, "size": 14400980, "checksum": "0512fa3a82e12e8e6d40125dde29a9e76be1fdfda0c3b7c88a1548bee9053edb", "avgDurationSecs": 3.964, "validDurationSecs": 1688.34, "totalHrs": 0.68, "validHrs": 0.47 }, "cnh": { "reportedSentences": 0, "duration": 0, "clips": 0, "users": 0, "size": 1990, "checksum": 
"af57810dd1ad121bc016c57ece58d47d9f3d0701e4f0a0297b69a71ac943c5ee", "avgDurationSecs": 3.564, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "et": { "reportedSentences": 2, "duration": 37620, "clips": 5, "users": 1, "size": 211187, "checksum": "1c809f7c0f95b30a892cbe09bbb65d687b69b97718b636b19246e19d82a95538", "avgDurationSecs": 6.758, "validDurationSecs": 577.33, "totalHrs": 0.01, "validHrs": 0.16 }, "ky": { "reportedSentences": 0, "duration": 194616, "clips": 55, "users": 5, "size": 1105967, "checksum": "b0372b6d83b152a9c58aea685a6be8df570da3e3b3326928f2d493d6cfb1f3d5", "avgDurationSecs": 4.538, "validDurationSecs": 1247.41, "totalHrs": 0.05, "validHrs": 0.34 }, "ro": { "reportedSentences": 26, "duration": 6969384, "clips": 1829, "users": 5, "size": 40062982, "checksum": "916003804f2e760ffef496451dbb04a8feae1b0701a829f36280132ead8097ea", "avgDurationSecs": 3.987, "validDurationSecs": 657.84, "totalHrs": 1.94, "validHrs": 0.19 }, "hsb": { "reportedSentences": 2, "duration": 0, "clips": 0, "users": 0, "size": -1564, "checksum": "0ae100f50c2d4077ea4ed540f232f248726e778a3ef0a6101d796570336b1b5f", "avgDurationSecs": 6.109, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "el": { "reportedSentences": 0, "duration": 1500876, "clips": 389, "users": 5, "size": 8687112, "checksum": "7835c6c85ae509e498610e30e7c89474fc35f6d14e8a75c290170e2213ec66f8", "avgDurationSecs": 4.129, "validDurationSecs": 733.19, "totalHrs": 0.42, "validHrs": 0.21 }, "cs": { "reportedSentences": 5, "duration": 4276836, "clips": 938, "users": 24, "size": 23949994, "checksum": "65f7c995a5d9a9ab24b02390ceea535782631347ea7ed0958ba20885fdfe07b8", "avgDurationSecs": 4.33, "validDurationSecs": 3076.06, "totalHrs": 1.19, "validHrs": 0.86 }, "pl": { "reportedSentences": 11, "duration": 5166756, "clips": 989, "users": 26, "size": 30348832, "checksum": "14e2419c3b8692cc4e36a68b2b981d6525c4e3712f4d224f58ec579bf3107c06", "avgDurationSecs": 4.49, "validDurationSecs": 7173.79, "totalHrs": 1.44, 
"validHrs": 1.99 }, "rm-sursilv": { "reportedSentences": 2, "duration": 6660, "clips": 1, "users": 1, "size": 35149, "checksum": "021875aa984bc25c3ffd88991b84d24bd8c1e60523f6b8b6a469524160227183", "avgDurationSecs": 5.481, "validDurationSecs": 181.57, "totalHrs": 0, "validHrs": 0.05 }, "rm-vallader": { "reportedSentences": 0, "duration": 0, "clips": 0, "users": 0, "size": -866, "checksum": "d403a6a235255cd0a4fa69e57240a66f10bd62dc4f325c8779da77919aaaba8c", "avgDurationSecs": 5.811, "validDurationSecs": 46.49, "totalHrs": 0, "validHrs": 0.01 }, "mn": { "reportedSentences": 3, "duration": 484848, "clips": 85, "users": 6, "size": 2855350, "checksum": "8454570d8ebe14c5215b8c487f9481791f331aa0b43fc81c644895295dd26a52", "avgDurationSecs": 5.477, "validDurationSecs": 374.3, "totalHrs": 0.14, "validHrs": 0.1 }, "zh-HK": { "reportedSentences": 9, "duration": 2406708, "clips": 730, "users": 16, "size": 13620184, "checksum": "9177786ff68dc9439aac656cbb69e6754708f87913aa4bb1ecea82d0f29425eb", "avgDurationSecs": 4.185, "validDurationSecs": 1758.45, "totalHrs": 0.67, "validHrs": 0.49 }, "ab": { "reportedSentences": 0, "duration": 0, "clips": 0, "users": 0, "size": -30717, "checksum": "8dfcf8a2d3396a5d1414a97f4f38594fc1e61781aa9192a5d6ddf1587f530b15", "avgDurationSecs": 5.133, "validDurationSecs": 56.46, "totalHrs": 0, "validHrs": 0.02 }, "cv": { "reportedSentences": 0, "duration": 3811608, "clips": 769, "users": 1, "size": 21899946, "checksum": "9661d6d9b09b505f206f9fe4a8ec214bd6d1f548662016e9e70df04c3023ba8a", "avgDurationSecs": 5.065, "validDurationSecs": 7969.82, "totalHrs": 1.06, "validHrs": 2.22 }, "uk": { "reportedSentences": 0, "duration": 1548864, "clips": 330, "users": 21, "size": 8168071, "checksum": "3a49dab5846681ca5d4a44da8fede4098e07d921bb12c514efac9a7e297597f1", "avgDurationSecs": 4.763, "validDurationSecs": 32081.52, "totalHrs": 0.43, "validHrs": 8.91 }, "mt": { "reportedSentences": 0, "duration": 33768, "clips": 6, "users": 2, "size": 191608, "checksum": 
"fcd5ba501be158baefa5b54543e6860fdfdc49119d64e129516ee3ace9e5eeb7", "avgDurationSecs": 4.741, "validDurationSecs": 31.06, "totalHrs": 0.01, "validHrs": 0.01 }, "as": { "reportedSentences": 0, "duration": 40572, "clips": 5, "users": 1, "size": 231692, "checksum": "3a78dfece9c53ab3f7c51a42150c0e9870b47e439671151656ccb8cff6ab26ad", "avgDurationSecs": 5.69, "validDurationSecs": 60.63, "totalHrs": 0.01, "validHrs": 0.02 }, "ka": { "reportedSentences": 102, "duration": 45507744, "clips": 8255, "users": 83, "size": 265729837, "checksum": "43680f0e73b0ff83c673529d71020a04980fcd66fea38950ae32e3a941bf7058", "avgDurationSecs": 5.31, "validDurationSecs": 15984.68, "totalHrs": 12.64, "validHrs": 4.44 }, "fy-NL": { "reportedSentences": 42, "duration": 73752300, "clips": 14745, "users": 276, "size": 389520133, "checksum": "78d0b64630747fb937c65bff12d8365ba6ce45e14c47adaec91cabaf20bc156d", "avgDurationSecs": 4.976, "validDurationSecs": 15656.78, "totalHrs": 20.48, "validHrs": 4.35 }, "dv": { "reportedSentences": 0, "duration": 952056, "clips": 230, "users": 6, "size": 5557972, "checksum": "8035637c87a8a5842f937e2b56232992e3f2f55e4e0c1762efc58a3e8fa311ce", "avgDurationSecs": 5.06, "validDurationSecs": 237.19, "totalHrs": 0.26, "validHrs": 0.07 }, "pa-IN": { "reportedSentences": 0, "duration": 806040, "clips": 163, "users": 2, "size": 4068260, "checksum": "c2dd9fdc789651dd9c79cbe7c074dff5e1f6ff49cda28c50940f2c563129bd5b", "avgDurationSecs": 4.832, "validDurationSecs": 139.68, "totalHrs": 0.22, "validHrs": 0.04 }, "vi": { "reportedSentences": 5, "duration": 358884, "clips": 91, "users": 8, "size": 2049978, "checksum": "c64e2958fa792eae2a13ec6603c5b0ef61a298baa7564a8d8b01ea32bf8613c9", "avgDurationSecs": 3.968, "validDurationSecs": 546.93, "totalHrs": 0.1, "validHrs": 0.15 }, "or": { "reportedSentences": 10, "duration": 1946448, "clips": 396, "users": 4, "size": 11292413, "checksum": "fd75c2bfb9f6b3ceee9651fc6141240b2d70408eb256281283b18609875c3abf", "avgDurationSecs": 5.037, 
"validDurationSecs": 339.78, "totalHrs": 0.54, "validHrs": 0.1 }, "ga-IE": { "reportedSentences": 5, "duration": 942120, "clips": 243, "users": 4, "size": 5426424, "checksum": "35338eb00d3f88053516d182ec03a80ad61912fafe196e122813896f7b62abb8", "avgDurationSecs": 3.55, "validDurationSecs": 660.9, "totalHrs": 0.27, "validHrs": 0.18 }, "fi": { "reportedSentences": 4, "duration": 2948976, "clips": 607, "users": 4, "size": 17241682, "checksum": "405ca875072acac664033af14693b8d9ac5f13b1121fdfff8d01d9bcdc0aebc5", "avgDurationSecs": 4.596, "validDurationSecs": 791.19, "totalHrs": 0.82, "validHrs": 0.22 }, "hu": { "reportedSentences": 4, "duration": 6055308, "clips": 1121, "users": 17, "size": 34984621, "checksum": "11d04b0076a86966fcfbf0106bd972937915de6cfe2281ffea66a851833c0032", "avgDurationSecs": 4.939, "validDurationSecs": 3723.07, "totalHrs": 1.68, "validHrs": 1.04 }, "th": { "reportedSentences": 3, "duration": 22275504, "clips": 4887, "users": 77, "size": 126791191, "checksum": "4b8307f8b106afc87fce830a33784cf98505fc4275c3dec01441343983d3febb", "avgDurationSecs": 4.18, "validDurationSecs": 16677.66, "totalHrs": 6.19, "validHrs": 4.63 }, "lt": { "reportedSentences": 10, "duration": 145548, "clips": 30, "users": 2, "size": 855493, "checksum": "3fc9878dc6c9657a35054212286f05654dff2b407e1c3e067f6e4ebbc0d3e418", "avgDurationSecs": 5.15, "validDurationSecs": 259.95, "totalHrs": 0.04, "validHrs": 0.07 }, "lg": { "reportedSentences": 274, "duration": 261128124, "clips": 47103, "users": 141, "size": 1522574724, "checksum": "23ecc3417a4b97e19d8a8cf8ec986a5a9ef9f373011ac198a1cf153dbb37629f", "avgDurationSecs": 5.784, "validDurationSecs": 69134.05, "totalHrs": 72.54, "validHrs": 19.21 }, "hi": { "reportedSentences": 4, "duration": 1865088, "clips": 331, "users": 15, "size": 10848727, "checksum": "d79f1a1163b3603948b165069a719cced3ec83f97c24e484dd4450c96bf14020", "avgDurationSecs": 4.838, "validDurationSecs": 930.32, "totalHrs": 0.52, "validHrs": 0.26 }, "bas": { "duration": 0, 
"reportedSentences": 0, "clips": 0, "users": 0, "size": 971, "checksum": "68461ede6419acfe13d92d956bc3a793fae7d3ea94429a886b35215d4d208719", "avgDurationSecs": 4.429, "validDurationSecs": 62.01, "totalHrs": 0, "validHrs": 0.02 }, "sk": { "duration": 2382192, "reportedSentences": 0, "clips": 534, "users": 5, "size": 12365726, "checksum": "a9e1f44d89f62f4ce6f0b7eb27481117dd52cfe64beb743ec39ca1329c778fe5", "avgDurationSecs": 4.006, "validDurationSecs": 2320.34, "totalHrs": 0.66, "validHrs": 0.65 }, "kmr": { "duration": 108813384, "reportedSentences": 154, "clips": 30514, "users": 112, "size": 557361459, "checksum": "523182fa9217f44322ad9059e9290d621369a6f14856c4dd722482ce5b1458c8", "avgDurationSecs": 4.029, "validDurationSecs": 22037.71, "totalHrs": 30.22, "validHrs": 6.12 }, "bg": { "duration": 3132432, "reportedSentences": 11, "clips": 558, "users": 2, "size": 18367948, "checksum": "b26b390025d8b701d0a44bf7c1feeedebe59d860e80df2e1ef94c78d0f790efa", "avgDurationSecs": 5.523, "validDurationSecs": 1494.63, "totalHrs": 0.87, "validHrs": 0.42 }, "kk": { "duration": 138564, "reportedSentences": 8, "clips": 31, "users": 4, "size": 812509, "checksum": "751ed8770e259cf4377264e2585a243171d4989b0f00aeccc7cbbd4892026c27", "avgDurationSecs": 4.925, "validDurationSecs": 86.4, "totalHrs": 0.04, "validHrs": 0.03 }, "ba": { "duration": 938160, "reportedSentences": 0, "clips": 185, "users": 3, "size": 4642013, "checksum": "1744abaff7a497e011f3b1fbfdbd07bf10cbdda04d2ccf974b953f42d9bf3a42", "avgDurationSecs": 4.427, "validDurationSecs": 916.09, "totalHrs": 0.26, "validHrs": 0.25 }, "gl": { "duration": 37545696, "reportedSentences": 29, "clips": 7485, "users": 198, "size": 217608136, "checksum": "075f0dce6d595d075c79419f954d5447feea4a9b810ac544769d38b32f4a6844", "avgDurationSecs": 4.899, "validDurationSecs": 17123.09, "totalHrs": 10.42, "validHrs": 4.75 }, "ug": { "duration": 20137140, "reportedSentences": 6, "clips": 3067, "users": 93, "size": 117754897, "checksum": 
"294a53ea0a774aeb7e32dc58ded213b91f67d44a11162240e6ff8a26bf7b44e4", "avgDurationSecs": 6.018, "validDurationSecs": 18803.48, "totalHrs": 5.6, "validHrs": 5.23 }, "hy-AM": { "duration": 144108, "reportedSentences": 4, "clips": 25, "users": 3, "size": 848329, "checksum": "9ce56a955233ccb2902a1f54aef399c7777a8a5ab217cee5cd7ef66908c8e6f9", "avgDurationSecs": 6.11, "validDurationSecs": 166.79, "totalHrs": 0.04, "validHrs": 0.04 }, "be": { "duration": 723888144, "reportedSentences": 4, "clips": 150043, "users": 938, "size": 4040901322, "checksum": "bcdb153eb0de0528452003a71f88de3998bf5640ca1ee7b18c59f42a502e1171", "avgDurationSecs": 4.767, "validDurationSecs": 295504.12, "totalHrs": 201.08, "validHrs": 82.09 }, "ur": { "reportedSentences": 2, "duration": 1578564, "clips": 284, "users": 6, "size": 8822157, "checksum": "b972c21ccbb81a9ce201e5b314fb7576207c9b26b902156944491c5ff018becf", "avgDurationSecs": 3.884, "validDurationSecs": 343.01, "totalHrs": 0.44, "validHrs": 0.1 }, "gn": { "reportedSentences": 7, "duration": 9058716, "clips": 2183, "users": 10, "size": 52902729, "checksum": "1254a51d19e9d93bf8eb24aa30beaef2ca408437ed3f6d03475b33a1ddc57b2e", "avgDurationSecs": 4.229, "validDurationSecs": 3928.64, "totalHrs": 2.51, "validHrs": 1.09 }, "sr": { "reportedSentences": 9, "duration": 3616380, "clips": 1308, "users": 52, "size": 20206907, "checksum": "990f3583ce23c22bd2663a8d43911e52c193547ac6da186a4c46abdedc36122d", "avgDurationSecs": 2.818, "validDurationSecs": 3329.06, "totalHrs": 1.01, "validHrs": 0.93 }, "uz": { "reportedSentences": 38, "duration": 14345460, "clips": 2768, "users": 67, "size": 77216367, "checksum": "deaacaac5077e41cc67e219ea0d59ad78281e515bfe48d107c327535775da6d3", "avgDurationSecs": 4.144, "validDurationSecs": 6733.76, "totalHrs": 3.99, "validHrs": 1.87 }, "mr": { "reportedSentences": 1, "duration": 68436, "clips": 10, "users": 1, "size": 399244, "checksum": "a36ae0bebf5348a9c70e77f785411344f07b008aa186eae6ea27c81b9bda1b08", "avgDurationSecs": 
6.192, "validDurationSecs": 140.66, "totalHrs": 0.02, "validHrs": 0.04 }, "da": { "reportedSentences": 15, "duration": 844020, "clips": 229, "users": 8, "size": 4910961, "checksum": "4aa13fd019c3f6e47d975b1c604a643ee39eea3287531d4af0c614222dbfa16c", "avgDurationSecs": 4.216, "validDurationSecs": 1010.31, "totalHrs": 0.24, "validHrs": 0.28 }, "myv": { "reportedSentences": 5, "duration": 0, "clips": 0, "users": 0, "size": 5223, "checksum": "8319a9a7b2183855e657d954bab009e96f35dc7abe83ac639d19b4fa43529d0e", "avgDurationSecs": 5.752, "validDurationSecs": 1254.02, "totalHrs": 0, "validHrs": 0.35 }, "nn-NO": { "duration": 45612, "reportedSentences": 0, "clips": 10, "users": 1, "size": 265974, "checksum": "39c5eb3a6f41d5b3f7aed4d36a1ca9bbf9ad73883ed4bd2989ea4fab341e6e23", "avgDurationSecs": 4.507, "validDurationSecs": 5.04, "totalHrs": 0.02, "validHrs": 0 }, "ha": { "duration": 4101156, "reportedSentences": 5, "clips": 972, "users": 3, "size": 23964125, "checksum": "57868f512cd681a137a0b73eda1cd3a5da21800bf6e9f8e406092878d38ae985", "avgDurationSecs": 4.334, "validDurationSecs": 712.01, "totalHrs": 1.14, "validHrs": 0.2 }, "ckb": { "duration": 10204596, "reportedSentences": 15, "clips": 2486, "users": 26, "size": 58673597, "checksum": "04c67dd99441c062a12938a9ec8ce974b033b8c57c0a065fc6f311bf271251cd", "avgDurationSecs": 3.904, "validDurationSecs": 15288.36, "totalHrs": 2.83, "validHrs": 4.25 }, "ml": { "duration": 149652, "reportedSentences": 0, "clips": 30, "users": 6, "size": 877110, "checksum": "45d67f6793a80ce63d46afeebcaae44aa1475d06fbeb114c93352ef6f59aaf02", "avgDurationSecs": 4.128, "validDurationSecs": 265.69, "totalHrs": 0.04, "validHrs": 0.08 }, "mdf": { "duration": 19620, "reportedSentences": 2, "clips": 6, "users": 1, "size": 116195, "checksum": "6655d3c62187fedda01c2c3e9d22f80b254265c6323cc14ca2f4bf9921d0ba11", "avgDurationSecs": 5.25, "validDurationSecs": 353.19, "totalHrs": 0.01, "validHrs": 0.09 }, "sw": { "duration": 378207972, "reportedSentences": 4, 
"clips": 74111, "users": 273, "size": 2183708552, "checksum": "367f6cf3ae5aebf698642a08a6a075da84b1439985ff46e3bf43b17ffcfb3cde", "avgDurationSecs": 5.315, "validDurationSecs": 33915.71, "totalHrs": 105.06, "validHrs": 9.42 }, "sat": { "reportedSentences": 0, "duration": 314388, "clips": 67, "users": 1, "size": 1829813, "checksum": "dc415e84c9ac25d4536ddb4213ea99382f335995178b3a9ea7a062a4a2112807", "avgDurationSecs": 4.482, "validDurationSecs": 52.54, "totalHrs": 0.09, "validHrs": 0.02 }, "tig": { "reportedSentences": 0, "duration": 0, "clips": 0, "users": 0, "size": -7, "checksum": "dccaf792dfa26edc61cb84c12068a9493e8d611f338c6a655c5587aabaf50849", "avgDurationSecs": 4.491, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ig": { "reportedSentences": 1, "duration": 175500, "clips": 30, "users": 3, "size": 967647, "checksum": "ef428d7f3099020060e3933903414a27dad453871924873f3855991814b6191a", "avgDurationSecs": 5.47, "validDurationSecs": 0.03, "totalHrs": 0.05, "validHrs": 0 }, "nan-tw": { "reportedSentences": 11, "duration": 863748, "clips": 358, "users": 5, "size": 4983679, "checksum": "4643ba4dcc7a658e3d09d8c5ea8bc144a94f2707bb51a6be8778efc1a6e51348", "avgDurationSecs": 2.67, "validDurationSecs": 384.01, "totalHrs": 0.24, "validHrs": 0.1 }, "mhr": { "duration": 154658088, "reportedSentences": 22, "clips": 34378, "users": 50, "size": 862645901, "checksum": "e7b89f49ae49e710d811e2eb5c1342db6ed680c498236888b0850ec98fb1a698", "avgDurationSecs": 4.664, "validDurationSecs": 38440.93, "totalHrs": 42.96, "validHrs": 10.68 }, "bn": { "duration": 2847185496, "reportedSentences": 465, "clips": 766287, "users": 1781, "size": 16409764181, "checksum": "545c840317cd3ed7c0720bad8bbad010674fd1428d01c192e5d87c449c7c46b8", "avgDurationSecs": 4.349, "validDurationSecs": 26080.953, "totalHrs": 790.89, "validHrs": 7.24 }, "tok": { "duration": 1215540, "reportedSentences": 1, "clips": 302, "users": 12, "size": 7105776, "checksum": 
"c82e6d9e3e2edcdc05883c26d941cbc834cd513174dc76a0e825af68b85a0346", "avgDurationSecs": 3.708, "validDurationSecs": 3671.64, "totalHrs": 0.34, "validHrs": 1.02 }, "yue": { "duration": 20679516, "reportedSentences": 15, "clips": 5491, "users": 45, "size": 104732822, "checksum": "4d1684c3a2603d4e1a8118b8e04eaf42d441347199161df549e2e804d7c8b813", "avgDurationSecs": 4.101, "validDurationSecs": 2595.01, "totalHrs": 5.74, "validHrs": 0.72 }, "sah": { "duration": 34236, "reportedSentences": 0, "clips": 6, "users": 1, "size": 195026, "checksum": "6c6dac95bba33d77ec4d6e8ae8f6ce7ac2dba66ac38269b1ea3e4bd20a316af6", "avgDurationSecs": 5.997, "validDurationSecs": 58.27, "totalHrs": 0.01, "validHrs": 0.02 }, "mk": { "duration": 0, "reportedSentences": 1, "clips": 0, "users": 0, "size": 128, "checksum": "58de04279d5676acd329a8ed4f44e656b69e2e473bb7c4504a0cca9489127858", "avgDurationSecs": 5.323, "validDurationSecs": 10.65, "totalHrs": 0, "validHrs": 0 }, "sc": { "duration": 763596, "reportedSentences": 0, "clips": 168, "users": 0, "size": 4480501, "checksum": "a5298b8da4367be2a2c45e44c3df88826466c30f9108b9889d6d248313ff64dc", "avgDurationSecs": 4.414, "validDurationSecs": 300.06, "totalHrs": 0.21, "validHrs": 0.08 }, "skr": { "duration": 2343888, "reportedSentences": 0, "clips": 521, "users": 3, "size": 11827102, "checksum": "9351a40b7de22a00d63f049b48d3a9360c7e94001613cf36a2d82a43d68a1fb9", "avgDurationSecs": 4.091, "validDurationSecs": 4342.32, "totalHrs": 0.65, "validHrs": 1.2 }, "ti": { "duration": 55008, "reportedSentences": 1, "clips": 10, "users": 1, "size": 324460, "checksum": "428863f3968718e42946382b054a74092c23d96390a6e85183f3d076b841a063", "avgDurationSecs": 5.779, "validDurationSecs": 39.17, "totalHrs": 0.01, "validHrs": 0.02 }, "mrj": { "duration": 26120736, "reportedSentences": 15, "clips": 7757, "users": 8, "size": 138732092, "checksum": "9b408b04da3d532c0e4b26cbdaf8f34e4802c078672e427c1db4ae42d16eae4f", "avgDurationSecs": 4.062, "validDurationSecs": 2024.05, 
"totalHrs": 7.25, "validHrs": 0.57 }, "tw": { "duration": 909792, "reportedSentences": 1, "clips": 231, "users": 3, "size": 5326676, "checksum": "57646a1ce316fd4c2e9aafd62a0fc8655fbb29f154004c82ab0ad68f089aca2c", "avgDurationSecs": 3.946, "validDurationSecs": 53.6, "totalHrs": 0.25, "validHrs": 0.01 }, "vot": { "duration": 0, "clips": 0, "users": 0, "size": -89, "checksum": "ff4dcc62db098cc0400a7ae781a852aef363ba6f16b93c14c411403d3256c58b", "avgDurationSecs": 2.408, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "az": { "duration": 388476, "clips": 75, "users": 2, "size": 2280223, "checksum": "0763b38b5b64fbcccff69d47ad2a9958c416c0ab37f20472af8eaa299200b0d2", "avgDurationSecs": 5.422, "validDurationSecs": -13.02, "totalHrs": 0.11, "validHrs": 0 }, "ast": { "duration": 0, "clips": 0, "users": 0, "size": 194, "checksum": "20f097653d3648a27c99df690cd14adb0ea63646b3270d31b1ff6c9f085b6768", "avgDurationSecs": 4.305, "validDurationSecs": 460.58, "totalHrs": 0, "validHrs": 0.13 }, "ne-NP": { "duration": 1302156, "clips": 344, "users": 6, "size": 7606719, "checksum": "b8fd5ae4a948fc403e487da22ea47f21829a750277b1f3ef2ade3035a143065c", "avgDurationSecs": 3.936, "validDurationSecs": 1064.27, "totalHrs": 0.37, "validHrs": 0.29 } }, "totalDuration": 6865295058, "totalValidDurationSecs": 2634014, "totalHrs": 1907, "totalValidHrs": 732 } ================================================ FILE: datasets/scripted-speech/cv-corpus-13.0-2023-03-09.json ================================================ { "locales": { "de": { "duration": 4821107393, "buckets": { "dev": 16143, "invalidated": 50705, "other": 6381, "reported": 9131, "test": 16143, "train": 540437, "validated": 868264 }, "reportedSentences": 9100, "clips": 925350, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.18, "fourties": 0.17, "": 0.32, "thirties": 0.16, "teens": 0.03, "sixties": 0.02, "fifties": 0.11, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "male": 0.59, "": 0.32, "female": 0.08, 
"other": 0.01 } }, "users": 17867, "size": 33828262029, "checksum": "71664fadd4189922f3c814889f640111e925fb511b290242e10e7a768bd7b1bb", "avgDurationSecs": 5.21, "validDurationSecs": 4523687.242, "totalHrs": 1339.19, "validHrs": 1256.57 }, "en": { "buckets": { "dev": 16372, "invalidated": 264713, "other": 278333, "reported": 4732, "test": 16372, "train": 1013968, "validated": 1689599 }, "reportedSentences": 4657, "duration": 11550150103, "clips": 2232645, "splits": { "accent": { "": 1 }, "age": { "": 0.37, "twenties": 0.24, "sixties": 0.04, "thirties": 0.13, "teens": 0.06, "seventies": 0.01, "fourties": 0.1, "fifties": 0.05, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male": 0.46, "female": 0.16, "other": 0.02 } }, "users": 86942, "size": 82019442116, "checksum": "e9af8ba157ea45d94b98490004b8e7e1b8432414e27be396b27b94e99e284421", "avgDurationSecs": 5.173, "validDurationSecs": 8740808.352, "totalHrs": 3208.37, "validHrs": 2428 }, "fa": { "buckets": { "dev": 10440, "invalidated": 14071, "other": 20673, "reported": 2268, "test": 10440, "train": 28024, "validated": 320143 }, "reportedSentences": 2259, "duration": 1415098376, "clips": 354887, "splits": { "accent": { "": 1 }, "age": { "": 0.25, "twenties": 0.31, "thirties": 0.37, "fifties": 0.02, "fourties": 0.02, "teens": 0.03, "sixties": 0 }, "gender": { "": 0.22, "male": 0.71, "female": 0.07, "other": 0 } }, "users": 4188, "size": 10368977174, "checksum": "921ff70850b58468bcc232f1d6f8e7c5bf58aff2ee1efdd4f26de19e75f7ed2a", "avgDurationSecs": 3.987, "validDurationSecs": 1276558.001, "totalHrs": 393.08, "validHrs": 354.59 }, "fr": { "buckets": { "dev": 16114, "invalidated": 58926, "other": 31371, "reported": 6922, "test": 16114, "train": 509300, "validated": 676602 }, "reportedSentences": 6846, "duration": 3839056354, "clips": 766899, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.17, "thirties": 0.16, "": 0.37, "teens": 0.03, "fourties": 0.14, "fifties": 0.1, "sixties": 0.03, "seventies": 0.01, 
"eighties": 0, "nineties": 0 }, "gender": { "male": 0.59, "": 0.3, "female": 0.1, "other": 0.01 } }, "users": 17428, "size": 27191480970, "checksum": "a44e73aa19a6805838a1302c1d311b704e90de7463da187b4b4ac3bcfba053fd", "avgDurationSecs": 5.006, "validDurationSecs": 3387034.286, "totalHrs": 1066.4, "validHrs": 940.84 }, "es": { "buckets": { "dev": 15708, "invalidated": 68143, "other": 1167766, "reported": 2095, "test": 15708, "train": 280329, "validated": 356713 }, "reportedSentences": 2078, "duration": 7746327810, "clips": 1592622, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.1, "": 0.13, "fifties": 0.04, "twenties": 0.55, "teens": 0.08, "fourties": 0.03, "sixties": 0.08, "eighties": 0, "seventies": 0, "nineties": 0 }, "gender": { "male": 0.55, "": 0.13, "other": 0, "female": 0.33 } }, "users": 25096, "size": 48860618846, "checksum": "4d2b6be24bc13b4dce65bb8f63531225ed68b9590ef768c4245decfe319bc7cf", "avgDurationSecs": 4.864, "validDurationSecs": 1735010.462, "totalHrs": 2151.75, "validHrs": 481.94 }, "sl": { "buckets": { "dev": 1162, "invalidated": 261, "other": 2302, "reported": 37, "test": 1240, "train": 1436, "validated": 9909 }, "reportedSentences": 38, "duration": 47128294, "clips": 12472, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.47, "teens": 0.07, "": 0.18, "sixties": 0.07, "fifties": 0.06, "fourties": 0.09, "thirties": 0.05 }, "gender": { "female": 0.15, "male": 0.67, "": 0.18, "other": 0 } }, "users": 141, "size": 330113132, "checksum": "cbd0b177478b176d3b3162b9a48e54ad3240f6be46b35308b45d6f7d5788c289", "avgDurationSecs": 3.779, "validDurationSecs": 37443.414, "totalHrs": 13.09, "validHrs": 10.4 }, "kab": { "buckets": { "dev": 14996, "invalidated": 19513, "other": 111915, "reported": 9012, "test": 14996, "train": 151730, "validated": 609062 }, "reportedSentences": 9007, "duration": 2470790068, "clips": 740490, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.09, "thirties": 0.29, "": 0.28, "fifties": 0.19, "twenties": 0.12, 
"eighties": 0, "teens": 0, "sixties": 0.03, "seventies": 0 }, "gender": { "male": 0.53, "": 0.26, "female": 0.2, "other": 0 } }, "users": 1514, "size": 18439964913, "checksum": "9461233ae95d9850048bacf0c2d5d3102384e6bf8e542978cdd11732053dcf42", "avgDurationSecs": 3.337, "validDurationSecs": 2032254.778, "totalHrs": 686.33, "validHrs": 564.51 }, "cy": { "buckets": { "dev": 5293, "invalidated": 4390, "other": 18680, "reported": 162, "test": 5307, "train": 7810, "validated": 89159 }, "reportedSentences": 162, "duration": 546587361, "clips": 112229, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.16, "twenties": 0.13, "sixties": 0.06, "fifties": 0.09, "": 0.43, "thirties": 0.09, "seventies": 0.01, "eighties": 0, "teens": 0.02 }, "gender": { "male": 0.33, "female": 0.24, "": 0.41, "other": 0.01 } }, "users": 1750, "size": 4009004534, "checksum": "c54adb86f51b50eb5813d793e3a7b21ddd69e9a69df8f73eb7a147e47957ca68", "avgDurationSecs": 4.87, "validDurationSecs": 434229.856, "totalHrs": 151.82, "validHrs": 120.61 }, "ca": { "duration": 10415935674, "buckets": { "dev": 16380, "invalidated": 86112, "other": 493015, "reported": 5738, "test": 16380, "train": 1046965, "validated": 1325430 }, "reportedSentences": 5690, "clips": 1904557, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.06, "fifties": 0.17, "fourties": 0.11, "twenties": 0.05, "": 0.34, "sixties": 0.23, "teens": 0.01, "seventies": 0.04, "eighties": 0, "nineties": 0 }, "gender": { "male": 0.44, "": 0.34, "female": 0.21, "other": 0 } }, "users": 31941, "size": 63732227938, "checksum": "7a7f050ea8e98ad8aa8082758ec2065d16024ebcc5300c6e8baa47a5dfdeecf2", "avgDurationSecs": 5.469, "validDurationSecs": 7248716.431, "totalHrs": 2893.31, "validHrs": 2013.53 }, "tt": { "duration": 110016998, "buckets": { "dev": 3296, "invalidated": 404, "other": 34, "reported": 4, "test": 5117, "train": 9553, "validated": 28851 }, "reportedSentences": 5, "clips": 29289, "splits": { "accent": { "": 1 }, "age": { "": 0.2, 
"thirties": 0.72, "twenties": 0.05, "sixties": 0, "fifties": 0.01, "teens": 0, "fourties": 0, "seventies": 0.01 }, "gender": { "": 0.2, "male": 0.78, "female": 0.02 } }, "users": 234, "size": 811793536, "checksum": "88f304cc84abff09d809698f58de875663516f9a2b7e85ed351f525dcec520ce", "avgDurationSecs": 3.756, "validDurationSecs": 108371.758, "totalHrs": 30.56, "validHrs": 30.1 }, "ta": { "duration": 1404253896, "buckets": { "dev": 11937, "invalidated": 5621, "other": 87707, "reported": 3343, "test": 11973, "train": 43350, "validated": 132638 }, "reportedSentences": 3343, "clips": 225966, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.08, "thirties": 0.09, "": 0.72, "fourties": 0.03, "seventies": 0.02, "fifties": 0.03, "teens": 0.03, "sixties": 0, "eighties": 0 }, "gender": { "male": 0.16, "": 0.71, "other": 0, "female": 0.13 } }, "users": 850, "size": 8408601882, "checksum": "51c38e4ce57c21b1352f9d981dbe19702d01a093f230fc6271fb8bf97f8665d1", "avgDurationSecs": 6.214, "validDurationSecs": 824271.918, "totalHrs": 390.07, "validHrs": 228.96 }, "ru": { "duration": 883818540, "buckets": { "dev": 10185, "invalidated": 9523, "other": 13491, "reported": 391, "test": 10186, "train": 26269, "validated": 148642 }, "reportedSentences": 385, "clips": 171656, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.35, "teens": 0.09, "": 0.24, "fourties": 0.14, "thirties": 0.15, "fifties": 0.03, "sixties": 0, "seventies": 0 }, "gender": { "male": 0.6, "": 0.24, "other": 0, "female": 0.15 } }, "users": 2901, "size": 6058265022, "checksum": "0992166fab13d7a8a8df539b0667c4cd947de88f2b61f107c3805c04ceae0a43", "avgDurationSecs": 5.149, "validDurationSecs": 765324.576, "totalHrs": 245.5, "validHrs": 212.59 }, "nl": { "duration": 411681817, "buckets": { "dev": 10930, "invalidated": 5331, "other": 2723, "reported": 334, "test": 10936, "train": 31906, "validated": 86798 }, "reportedSentences": 334, "clips": 94852, "splits": { "accent": { "": 1 }, "age": { "": 0.41, "twenties": 
0.21, "fourties": 0.15, "thirties": 0.11, "teens": 0.02, "fifties": 0.08, "sixties": 0.02, "nineties": 0, "eighties": 0, "seventies": 0 }, "gender": { "": 0.42, "male": 0.47, "female": 0.11, "other": 0 } }, "users": 1610, "size": 2808697434, "checksum": "2a8edc9005bbc8a3623ce25bfe95979bc9144e49a09468e8fd574ea76de30d94", "avgDurationSecs": 4.34, "validDurationSecs": 376725.407, "totalHrs": 114.35, "validHrs": 104.64 }, "it": { "duration": 1327694592, "buckets": { "dev": 15086, "invalidated": 17764, "other": 209, "reported": 5752, "test": 15096, "train": 162637, "validated": 229467 }, "reportedSentences": 5748, "clips": 247440, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.17, "twenties": 0.22, "": 0.29, "fifties": 0.15, "fourties": 0.14, "seventies": 0, "sixties": 0.03, "teens": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "female": 0.11, "male": 0.6, "": 0.28, "other": 0 } }, "users": 6881, "size": 9117088260, "checksum": "38d09ee666b8b813415e37d2c149d804c25f1d4dc149e602325554c7ed5f2580", "avgDurationSecs": 5.366, "validDurationSecs": 1231256.446, "totalHrs": 368.8, "validHrs": 342.01 }, "eu": { "duration": 538424295, "buckets": { "dev": 6591, "invalidated": 5819, "other": 28376, "reported": 97, "test": 6591, "train": 10918, "validated": 69567 }, "reportedSentences": 97, "clips": 103762, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.13, "thirties": 0.07, "fifties": 0.13, "twenties": 0.35, "": 0.26, "teens": 0.03, "sixties": 0.02, "seventies": 0 }, "gender": { "male": 0.46, "female": 0.25, "": 0.27, "other": 0.02 } }, "users": 1252, "size": 4040542343, "checksum": "07e2c8846ad8307b9790f402fa45a66ce71a2f1c95df8ec8c82d6fb3e9214a33", "avgDurationSecs": 5.189, "validDurationSecs": 360985.36, "totalHrs": 149.56, "validHrs": 100.27 }, "tr": { "duration": 353861415, "buckets": { "dev": 10837, "invalidated": 4090, "other": 156, "reported": 356, "test": 10839, "train": 31091, "validated": 93160 }, "reportedSentences": 357, "clips": 97406, "splits": { 
"accent": { "": 1 }, "age": { "": 0.3, "thirties": 0.08, "twenties": 0.28, "teens": 0.02, "fourties": 0.03, "fifties": 0.09, "sixties": 0.16, "eighties": 0.02, "seventies": 0.03 }, "gender": { "": 0.3, "male": 0.47, "female": 0.22, "other": 0 } }, "users": 1435, "size": 2154735517, "checksum": "c1a9adf7e17d82022f0ef16d9143560f8fdb426eba15dea385d1dcf64432a1b1", "avgDurationSecs": 3.633, "validDurationSecs": 338436.333, "totalHrs": 98.29, "validHrs": 94.01 }, "ar": { "duration": 533060847, "buckets": { "dev": 10409, "invalidated": 14994, "other": 36192, "reported": 2093, "test": 10445, "train": 28167, "validated": 76677 }, "reportedSentences": 2085, "clips": 127863, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.11, "": 0.56, "twenties": 0.28, "fourties": 0.01, "teens": 0.03, "fifties": 0, "sixties": 0, "nineties": 0 }, "gender": { "female": 0.18, "": 0.56, "male": 0.27, "other": 0 } }, "users": 1407, "size": 3160282600, "checksum": "b51f50ecee1a7323ecf6d5f0dae5c1140df0c583d978cb8b3cf05fe6cc1e2d38", "avgDurationSecs": 4.169, "validDurationSecs": 319666.413, "totalHrs": 148.07, "validHrs": 88.79 }, "zh-TW": { "duration": 429109942, "buckets": { "dev": 4825, "invalidated": 4657, "other": 44705, "reported": 142, "test": 4825, "train": 6799, "validated": 79531 }, "reportedSentences": 143, "clips": 128893, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.2, "twenties": 0.31, "teens": 0.06, "": 0.27, "fifties": 0.07, "seventies": 0, "fourties": 0.1, "sixties": 0 }, "gender": { "male": 0.49, "": 0.26, "female": 0.23, "other": 0.02 } }, "users": 2121, "size": 2930301331, "checksum": "2bfa61c1c915713204aa8875dfa8dd2dc1aa6fc56245cf90c2ed13fded6a7253", "avgDurationSecs": 3.329, "validDurationSecs": 264774.214, "totalHrs": 119.19, "validHrs": 73.54 }, "br": { "duration": 86037091, "buckets": { "dev": 2142, "invalidated": 846, "other": 15235, "reported": 274, "test": 2129, "train": 2618, "validated": 11369 }, "reportedSentences": 274, "clips": 27450, "splits": { 
"accent": { "": 1 }, "age": { "twenties": 0.33, "": 0.31, "fifties": 0.05, "fourties": 0.06, "thirties": 0.07, "sixties": 0.15, "seventies": 0.02, "teens": 0.01 }, "gender": { "male": 0.67, "": 0.31, "female": 0.02, "other": 0 } }, "users": 181, "size": 611265443, "checksum": "ca761f982405be3e675b6d11e42938cf72f2ec1d1622f3da3d1c4df2e1758e45", "avgDurationSecs": 3.134, "validDurationSecs": 35634.087, "totalHrs": 23.89, "validHrs": 9.89 }, "pt": { "duration": 707442559, "buckets": { "dev": 9072, "invalidated": 6465, "other": 33792, "reported": 2544, "test": 9072, "train": 19948, "validated": 133154 }, "reportedSentences": 2537, "clips": 173411, "splits": { "accent": { "": 1 }, "age": { "": 0.23, "twenties": 0.38, "teens": 0.02, "thirties": 0.19, "fourties": 0.14, "sixties": 0.02, "fifties": 0.02, "seventies": 0 }, "gender": { "": 0.23, "male": 0.69, "female": 0.06, "other": 0.02 } }, "users": 3099, "size": 4475741482, "checksum": "7dc2dbf1ad261825ebca28ec594a3adc9b9d14d33a64e5f05e0fccad9ed44bd1", "avgDurationSecs": 4.08, "validDurationSecs": 543211.252, "totalHrs": 196.51, "validHrs": 150.89 }, "eo": { "duration": 6787549508, "buckets": { "dev": 14909, "invalidated": 127338, "other": 142397, "reported": 2210, "test": 14913, "train": 143984, "validated": 848846 }, "reportedSentences": 2209, "clips": 1118581, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.56, "thirties": 0.12, "": 0.2, "fourties": 0.04, "fifties": 0.02, "seventies": 0, "teens": 0.05, "sixties": 0, "eighties": 0 }, "gender": { "male": 0.69, "": 0.2, "female": 0.11, "other": 0 } }, "users": 1616, "size": 40520544051, "checksum": "df26401578f024d2d1748438bc2bb8da1f4fcd066d26b5b1f1fb7dbc88ca4537", "avgDurationSecs": 6.068, "validDurationSecs": 5150797.528, "totalHrs": 1885.43, "validHrs": 1430.77 }, "zh-CN": { "duration": 3779771748, "buckets": { "dev": 10624, "invalidated": 56661, "other": 594422, "reported": 681, "test": 10624, "train": 29383, "validated": 179473 }, "reportedSentences": 675, 
"clips": 830556, "splits": { "accent": { "": 1 }, "age": { "": 0.94, "teens": 0.01, "twenties": 0.04, "thirties": 0.01, "fourties": 0, "nineties": 0, "fifties": 0, "sixties": 0 }, "gender": { "": 0.94, "male": 0.05, "female": 0.01, "other": 0 } }, "users": 6667, "size": 22348267124, "checksum": "c160d6c83e20424692ffacbbf2fc6480c6613c0913ae5cb1d2a2ddf7d1c239cc", "avgDurationSecs": 4.551, "validDurationSecs": 816762.476, "totalHrs": 1049.93, "validHrs": 226.87 }, "id": { "duration": 226821756, "buckets": { "dev": 3292, "invalidated": 2553, "other": 29260, "reported": 345, "test": 3649, "train": 5041, "validated": 25027 }, "reportedSentences": 346, "clips": 56840, "splits": { "accent": { "": 1 }, "age": { "": 0.25, "twenties": 0.41, "thirties": 0.1, "teens": 0.23, "fifties": 0, "fourties": 0.02 }, "gender": { "": 0.25, "male": 0.44, "female": 0.27, "other": 0.04 } }, "users": 492, "size": 1427745911, "checksum": "35e1c00612c9a306be8597d9544c032bafac35d892456f6a6186e53117a0933c", "avgDurationSecs": 3.991, "validDurationSecs": 99871.008, "totalHrs": 63, "validHrs": 27.74 }, "ia": { "duration": 60756600, "buckets": { "dev": 1796, "invalidated": 331, "other": 2615, "reported": 275, "test": 1777, "train": 5018, "validated": 11583 }, "reportedSentences": 271, "clips": 14529, "splits": { "accent": { "": 1 }, "age": { "seventies": 0.22, "fourties": 0.3, "": 0.38, "twenties": 0.05, "thirties": 0.02, "teens": 0, "fifties": 0.03, "sixties": 0 }, "gender": { "male": 0.61, "": 0.38, "female": 0.01 } }, "users": 62, "size": 411366463, "checksum": "cae091b1e805fb1bc4621b5547ca755dc8e655b872bf19c2d06c2218ef12ab13", "avgDurationSecs": 4.182, "validDurationSecs": 48437.174, "totalHrs": 16.87, "validHrs": 13.45 }, "lv": { "duration": 61503893, "buckets": { "dev": 2246, "invalidated": 448, "other": 3247, "reported": 37, "test": 2295, "train": 2756, "validated": 13277 }, "reportedSentences": 38, "clips": 16972, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.43, "fourties": 0.04, 
"": 0.3, "twenties": 0.18, "teens": 0.02, "fifties": 0.03 }, "gender": { "male": 0.58, "female": 0.13, "": 0.3 } }, "users": 321, "size": 398577312, "checksum": "717c7e08b7ea15f95a4629b7bdacfa416c74b66e1d0ea491ee8f1cf16715b359", "avgDurationSecs": 3.624, "validDurationSecs": 48113.787, "totalHrs": 17.08, "validHrs": 13.36 }, "ja": { "duration": 621734109, "buckets": { "dev": 4961, "invalidated": 5993, "other": 73033, "reported": 193, "test": 4961, "train": 7071, "validated": 57527 }, "reportedSentences": 193, "clips": 136553, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.5, "": 0.21, "teens": 0.05, "fifties": 0.01, "thirties": 0.09, "fourties": 0.14, "sixties": 0, "seventies": 0, "nineties": 0 }, "gender": { "male": 0.47, "": 0.21, "female": 0.32, "other": 0.01 } }, "users": 1630, "size": 3675036058, "checksum": "14188e016cc264e7d4381e4835d1efd54623f35f0ffb85562004d1ae906fe5c4", "avgDurationSecs": 4.553, "validDurationSecs": 261923.928, "totalHrs": 172.7, "validHrs": 72.75 }, "rw": { "duration": 8581498601, "buckets": { "dev": 15987, "invalidated": 227795, "other": 47246, "reported": 629, "test": 16213, "train": 1003023, "validated": 1438618 }, "reportedSentences": 630, "clips": 1713659, "splits": { "accent": { "": 1 }, "age": { "": 0.05, "twenties": 0.61, "thirties": 0.12, "teens": 0.2, "fourties": 0.02, "fifties": 0 }, "gender": { "": 0.1, "male": 0.57, "female": 0.33, "other": 0 } }, "users": 1103, "size": 60972955216, "checksum": "9570b365918a227121b1111de3f2404a6624b51e0374683da7c2701a4327fb99", "avgDurationSecs": 5.008, "validDurationSecs": 7204174.433, "totalHrs": 2383.74, "validHrs": 2001.15 }, "sv-SE": { "duration": 187757835, "buckets": { "dev": 5114, "invalidated": 1381, "other": 6313, "reported": 580, "test": 5120, "train": 7407, "validated": 39506 }, "reportedSentences": 581, "clips": 47200, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.25, "": 0.18, "teens": 0.03, "fifties": 0.03, "twenties": 0.12, "fourties": 0.38, "sixties": 0, 
"seventies": 0 }, "gender": { "male": 0.48, "": 0.19, "female": 0.32, "other": 0.01 } }, "users": 791, "size": 1188005367, "checksum": "760816a9eb17ede1738f08fc44ff87d6e9f9e26a6e6f4fe4f979084aba8de708", "avgDurationSecs": 3.978, "validDurationSecs": 157151.717, "totalHrs": 52.15, "validHrs": 43.65 }, "cnh": { "duration": 20675832, "buckets": { "dev": 761, "invalidated": 436, "other": 2908, "reported": 8, "test": 763, "train": 817, "validated": 2458 }, "reportedSentences": 9, "clips": 5802, "splits": { "accent": { "": 1 }, "age": { "": 0.51, "twenties": 0.36, "fourties": 0.01, "teens": 0.02, "thirties": 0.08, "fifties": 0.02 }, "gender": { "": 0.51, "male": 0.33, "female": 0.16 } }, "users": 299, "size": 161331878, "checksum": "38066864bff36a1ee125d901be9602f6d25061ef9a5f30fcb8f0e953e3a37208", "avgDurationSecs": 3.564, "validDurationSecs": 8759.255, "totalHrs": 5.74, "validHrs": 2.43 }, "et": { "duration": 195989330, "buckets": { "dev": 2638, "invalidated": 6697, "other": 506, "reported": 490, "test": 2638, "train": 3138, "validated": 21796 }, "reportedSentences": 487, "clips": 28999, "splits": { "accent": { "": 1 }, "age": { "": 0.2, "thirties": 0.08, "twenties": 0.68, "fourties": 0.04, "fifties": 0, "seventies": 0, "teens": 0 }, "gender": { "": 0.2, "male": 0.54, "female": 0.26, "other": 0 } }, "users": 809, "size": 1336581339, "checksum": "dfce8875a0c27ab49fa5167ab20b264d1e5773c052bb45c3524d1e2c3e400454", "avgDurationSecs": 6.758, "validDurationSecs": 147307.957, "totalHrs": 54.44, "validHrs": 40.91 }, "ky": { "duration": 162502536, "buckets": { "dev": 1612, "invalidated": 5618, "other": 76, "reported": 37, "test": 1613, "train": 1788, "validated": 30117 }, "reportedSentences": 38, "clips": 35811, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.08, "": 0.07, "fourties": 0.01, "twenties": 0.66, "teens": 0.18, "fifties": 0 }, "gender": { "male": 0.54, "": 0.11, "female": 0.35, "other": 0 } }, "users": 257, "size": 1049223157, "checksum": 
"d964f43dacf1b4dd69ec8e2ac32ee6ab745d46c760c4c23632752b00abaebc51", "avgDurationSecs": 4.538, "validDurationSecs": 136664.401, "totalHrs": 45.13, "validHrs": 37.96 }, "ro": { "duration": 151497031, "buckets": { "dev": 3743, "invalidated": 922, "other": 21057, "reported": 380, "test": 3861, "train": 5206, "validated": 15960 }, "reportedSentences": 381, "clips": 37939, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.13, "teens": 0.02, "": 0.11, "fourties": 0.06, "sixties": 0, "twenties": 0.67, "fifties": 0.01, "eighties": 0 }, "gender": { "male": 0.74, "": 0.1, "female": 0.15, "other": 0.01 } }, "users": 391, "size": 944768429, "checksum": "d4e66e8267946831ffb6af6e6e53957cbfa6dbf6621af23be0197482f25328f5", "avgDurationSecs": 3.993, "validDurationSecs": 63731.058, "totalHrs": 42.08, "validHrs": 17.7 }, "hsb": { "duration": 10318356, "buckets": { "dev": 172, "invalidated": 247, "other": 15, "reported": 86, "test": 444, "train": 808, "validated": 1424 }, "reportedSentences": 87, "clips": 1686, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.54, "": 0.18, "thirties": 0.1, "sixties": 0, "seventies": 0.03, "twenties": 0.11, "fifties": 0.03 }, "gender": { "male": 0.81, "": 0.18, "other": 0 } }, "users": 21, "size": 80367147, "checksum": "3e9666e60a5da9bfd80879d8b1f015cce61d59729a15eee0912162b0bb5d93ce", "avgDurationSecs": 6.12, "validDurationSecs": 8714.91, "totalHrs": 2.86, "validHrs": 2.42 }, "el": { "duration": 110402278, "buckets": { "dev": 1663, "invalidated": 818, "other": 10261, "reported": 69, "test": 1704, "train": 1944, "validated": 15637 }, "reportedSentences": 70, "clips": 26716, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.37, "fourties": 0.16, "": 0.31, "twenties": 0.12, "fifties": 0.03, "teens": 0.01, "sixties": 0 }, "gender": { "male": 0.64, "": 0.31, "other": 0.02, "female": 0.03 } }, "users": 360, "size": 736695676, "checksum": "77499bae45f5a2308de48f71e20fec3ed92a54a7a7583b255f1c0a80c1d027df", "avgDurationSecs": 4.132, 
"validDurationSecs": 64618.971, "totalHrs": 30.66, "validHrs": 17.94 }, "cs": { "duration": 920669619, "buckets": { "dev": 8769, "invalidated": 1970, "other": 145750, "reported": 883, "test": 8829, "train": 18877, "validated": 58629 }, "reportedSentences": 877, "clips": 206349, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.07, "": 0.23, "thirties": 0.48, "teens": 0.02, "twenties": 0.18, "fifties": 0.01, "sixties": 0, "seventies": 0 }, "gender": { "male": 0.54, "": 0.23, "female": 0.23, "other": 0 } }, "users": 876, "size": 5640424340, "checksum": "68e53a82eb28d99cc10a90d7ec32a7a98930d7a7f764d2e321ae413eb0924558", "avgDurationSecs": 4.462, "validDurationSecs": 261585.659, "totalHrs": 255.74, "validHrs": 72.66 }, "pl": { "duration": 617355218, "buckets": { "dev": 8534, "invalidated": 6266, "other": 3824, "reported": 553, "test": 8534, "train": 17518, "validated": 126997 }, "reportedSentences": 553, "clips": 137087, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.28, "": 0.24, "teens": 0.02, "thirties": 0.33, "fourties": 0.12, "fifties": 0.01, "nineties": 0.01, "sixties": 0 }, "gender": { "male": 0.6, "": 0.25, "female": 0.14, "other": 0.01 } }, "users": 3183, "size": 4411276400, "checksum": "017a81af2eb81a533410e03f7d0b67cce4e5bcfa40683852d261a3421ec35264", "avgDurationSecs": 4.503, "validDurationSecs": 571916.087, "totalHrs": 171.48, "validHrs": 158.86 }, "rm-sursilv": { "duration": 38788025, "buckets": { "dev": 1352, "invalidated": 677, "other": 2140, "reported": 16, "test": 1347, "train": 1554, "validated": 4260 }, "reportedSentences": 17, "clips": 7077, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.03, "twenties": 0.1, "": 0.64, "teens": 0.06, "fourties": 0.17 }, "gender": { "male": 0.17, "female": 0.19, "": 0.64, "other": 0 } }, "users": 87, "size": 292892713, "checksum": "adef75ee647c88a0ec20a1b8977b4f89ae3bafee1d491862e92e77f869ae3140", "avgDurationSecs": 5.481, "validDurationSecs": 23348.451, "totalHrs": 10.77, "validHrs": 6.48 }, 
"rm-vallader": { "duration": 15115550, "buckets": { "dev": 377, "invalidated": 394, "other": 709, "reported": 36, "test": 442, "train": 672, "validated": 1498 }, "reportedSentences": 35, "clips": 2601, "splits": { "accent": { "": 1 }, "age": { "": 0.36, "fourties": 0.41, "twenties": 0.14, "thirties": 0.06, "fifties": 0, "sixties": 0.03 }, "gender": { "": 0.36, "male": 0.44, "female": 0.19, "other": 0.01 } }, "users": 52, "size": 115424323, "checksum": "f303f525cf8ee884a1538c895b433f62b99fcfdfaeeaea5e81312f06ddc0c969", "avgDurationSecs": 5.811, "validDurationSecs": 8705.534, "totalHrs": 4.19, "validHrs": 2.41 }, "mn": { "duration": 69547436, "buckets": { "dev": 1767, "invalidated": 767, "other": 3522, "reported": 31, "test": 1877, "train": 2275, "validated": 8396 }, "reportedSentences": 32, "clips": 12685, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.23, "": 0.28, "twenties": 0.41, "fourties": 0.01, "teens": 0.02, "nineties": 0.06, "fifties": 0 }, "gender": { "male": 0.36, "": 0.28, "female": 0.31, "other": 0.06 } }, "users": 488, "size": 526725245, "checksum": "919edfde38ca1b875cfda0da52157b3074f323e010acf6c8b0ae28ba534f3a87", "avgDurationSecs": 5.483, "validDurationSecs": 46032.343, "totalHrs": 19.31, "validHrs": 12.78 }, "zh-HK": { "duration": 491515152, "buckets": { "dev": 5593, "invalidated": 4345, "other": 21594, "reported": 671, "test": 5593, "train": 8425, "validated": 91509 }, "reportedSentences": 660, "clips": 117448, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.13, "thirties": 0.11, "": 0.39, "teens": 0.02, "fifties": 0.02, "seventies": 0, "sixties": 0.01, "twenties": 0.33 }, "gender": { "male": 0.42, "": 0.35, "female": 0.22, "other": 0.01 } }, "users": 2971, "size": 3518822638, "checksum": "d1e4e28bc38aa02d32919a4bd4bee55dc1a195963a7e2487da1057095a3b94d0", "avgDurationSecs": 4.185, "validDurationSecs": 382961.481, "totalHrs": 136.53, "validHrs": 106.37 }, "ab": { "duration": 303841224, "buckets": { "dev": 9160, "invalidated": 5275, 
"other": 11948, "reported": 220, "test": 9116, "train": 21027, "validated": 41972 }, "reportedSentences": 219, "clips": 59195, "splits": { "accent": { "": 1 }, "age": { "seventies": 0.01, "thirties": 0.13, "": 0.18, "teens": 0.28, "twenties": 0.18, "fifties": 0.06, "sixties": 0.05, "fourties": 0.09, "eighties": 0.01 }, "gender": { "male": 0.18, "female": 0.64, "": 0.18 } }, "users": 400, "size": 1734407911, "checksum": "f50ed6c61818d641ecc81d9fb6eeb2b713adf2ac0ad5eca60c7e9869b83247fd", "avgDurationSecs": 5.133, "validDurationSecs": 215437.518, "totalHrs": 84.4, "validHrs": 59.84 }, "cv": { "duration": 98308740, "buckets": { "dev": 1158, "invalidated": 2120, "other": 457, "reported": 143, "test": 1288, "train": 1540, "validated": 16840 }, "reportedSentences": 139, "clips": 19417, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.51, "": 0.22, "fourties": 0.06, "thirties": 0.01, "teens": 0.19, "fifties": 0.01 }, "gender": { "male": 0.54, "": 0.19, "female": 0.27 } }, "users": 108, "size": 668315995, "checksum": "75c84a1c2b8cef32a2eed9a692c156c55e4cc16f511f482832a1ea53768debb5", "avgDurationSecs": 5.063, "validDurationSecs": 85261.327, "totalHrs": 27.3, "validHrs": 23.68 }, "uk": { "duration": 316242528, "buckets": { "dev": 8377, "invalidated": 2531, "other": 0, "reported": 606, "test": 8383, "train": 16911, "validated": 63928 }, "reportedSentences": 607, "clips": 66459, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.25, "teens": 0.1, "": 0.26, "fourties": 0.12, "thirties": 0.27, "fifties": 0, "sixties": 0 }, "gender": { "male": 0.58, "female": 0.15, "": 0.26 } }, "users": 818, "size": 2137250020, "checksum": "a00db74b6941d8f735ec7be3e84154eac0972707c523063f52126174d23fc341", "avgDurationSecs": 4.758, "validDurationSecs": 304198.864, "totalHrs": 87.84, "validHrs": 84.49 }, "mt": { "duration": 61454772, "buckets": { "dev": 1593, "invalidated": 321, "other": 6259, "reported": 9, "test": 1644, "train": 1943, "validated": 6383 }, "reportedSentences": 10, 
"clips": 12963, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.19, "": 0.26, "fourties": 0.17, "thirties": 0.09, "teens": 0.03, "fifties": 0.25, "sixties": 0.01 }, "gender": { "male": 0.25, "": 0.26, "female": 0.48, "other": 0.01 } }, "users": 211, "size": 456683345, "checksum": "7f8c9084a18a7789b9d1b227df6cbf27bcbd8576f999dfbf238c9be9b18165ef", "avgDurationSecs": 4.741, "validDurationSecs": 30260.419, "totalHrs": 17.07, "validHrs": 8.4 }, "as": { "duration": 11755685, "buckets": { "dev": 508, "invalidated": 203, "other": 195, "reported": 9, "test": 548, "train": 612, "validated": 1668 }, "reportedSentences": 10, "clips": 2066, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.36, "": 0.59, "thirties": 0.04, "teens": 0 }, "gender": { "male": 0.41, "": 0.59, "female": 0 } }, "users": 44, "size": 73430340, "checksum": "a536305b88375d09560d5a30b2d99c6ac90dcc38f480e3da7c4b499d7419225e", "avgDurationSecs": 5.69, "validDurationSecs": 9491.037, "totalHrs": 3.26, "validHrs": 2.63 }, "ka": { "duration": 160786692, "buckets": { "dev": 4424, "invalidated": 1495, "other": 9739, "reported": 243, "test": 4532, "train": 6379, "validated": 18962 }, "reportedSentences": 243, "clips": 30196, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.28, "thirties": 0.17, "": 0.27, "fourties": 0.05, "fifties": 0.17, "teens": 0.01, "sixties": 0.06 }, "gender": { "male": 0.45, "female": 0.28, "": 0.27 } }, "users": 523, "size": 959259578, "checksum": "bf45a832f2765d39f0d83c1d9c09649723f82d00def1cab320cccc719bd89a64", "avgDurationSecs": 5.325, "validDurationSecs": 100968.249, "totalHrs": 44.66, "validHrs": 28.04 }, "fy-NL": { "duration": 751674303, "buckets": { "dev": 3170, "invalidated": 3773, "other": 102007, "reported": 535, "test": 3169, "train": 3917, "validated": 49112 }, "reportedSentences": 533, "clips": 154892, "splits": { "accent": { "": 1 }, "age": { "": 0.57, "fifties": 0.1, "thirties": 0.11, "twenties": 0.04, "fourties": 0.06, "sixties": 0.13, "seventies": 0, 
"teens": 0, "eighties": 0 }, "gender": { "": 0.57, "male": 0.12, "female": 0.31 } }, "users": 1969, "size": 4506898320, "checksum": "9324a2ee92e9651ddb0630d0ea534960288f2fa4534903b8644cff661ce2baac", "avgDurationSecs": 4.853, "validDurationSecs": 238335.281, "totalHrs": 208.79, "validHrs": 66.2 }, "dv": { "duration": 227748735, "buckets": { "dev": 2227, "invalidated": 1653, "other": 16395, "reported": 55, "test": 2212, "train": 2677, "validated": 26964 }, "reportedSentences": 56, "clips": 45012, "splits": { "accent": { "": 1 }, "age": { "": 0.22, "twenties": 0.18, "thirties": 0.36, "fourties": 0.22, "teens": 0.01, "nineties": 0, "fifties": 0.02 }, "gender": { "": 0.21, "male": 0.29, "female": 0.5 } }, "users": 331, "size": 1442355118, "checksum": "942e0269c339009b7d7d6ac17337ae5f593249926926fee6c37d64aa5c25a2b5", "avgDurationSecs": 5.06, "validDurationSecs": 136430.661, "totalHrs": 63.26, "validHrs": 37.89 }, "pa-IN": { "duration": 14215922, "buckets": { "dev": 285, "invalidated": 77, "other": 1414, "reported": 249, "test": 459, "train": 712, "validated": 1456 }, "reportedSentences": 244, "clips": 2947, "splits": { "accent": { "": 1 }, "age": { "": 0.3, "fourties": 0.04, "fifties": 0.05, "thirties": 0.38, "twenties": 0.23, "sixties": 0, "teens": 0 }, "gender": { "": 0.3, "male": 0.69, "female": 0.01 } }, "users": 62, "size": 100051978, "checksum": "b6f1ba247777341e72d91a83f34e9114b739110b3ad4f60fb05e63eec8b1b825", "avgDurationSecs": 4.824, "validDurationSecs": 7023.543, "totalHrs": 3.94, "validHrs": 1.95 }, "vi": { "duration": 65730176, "buckets": { "dev": 392, "invalidated": 350, "other": 11486, "reported": 186, "test": 1225, "train": 2462, "validated": 4732 }, "reportedSentences": 185, "clips": 16568, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.02, "twenties": 0.19, "": 0.25, "teens": 0.21, "seventies": 0, "fourties": 0.02, "sixties": 0.3 }, "gender": { "male": 0.53, "": 0.25, "female": 0.2, "other": 0.02 } }, "users": 252, "size": 381782157, 
"checksum": "0c012f9c8aa77bf5de9c5263fe30a64218afb2c48534b868d55f100583239746", "avgDurationSecs": 3.967, "validDurationSecs": 18773.249, "totalHrs": 18.25, "validHrs": 5.21 }, "or": { "duration": 38655420, "buckets": { "dev": 342, "invalidated": 176, "other": 6284, "reported": 23, "test": 227, "train": 482, "validated": 1227 }, "reportedSentences": 24, "clips": 7687, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.19, "": 0.08, "thirties": 0.73, "fourties": 0, "teens": 0 }, "gender": { "male": 0.88, "": 0.08, "female": 0.05 } }, "users": 94, "size": 274647208, "checksum": "0df8ef262d2b0e69b4ed0601620e6c581fa283632fb3f39efbd8fb8902217baa", "avgDurationSecs": 5.029, "validDurationSecs": 6170.183, "totalHrs": 10.73, "validHrs": 1.71 }, "ga-IE": { "duration": 36168975, "buckets": { "dev": 506, "invalidated": 882, "other": 4226, "reported": 27, "test": 511, "train": 549, "validated": 5022 }, "reportedSentences": 28, "clips": 10130, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.26, "": 0.37, "thirties": 0.25, "fourties": 0.06, "sixties": 0.01, "teens": 0.01, "fifties": 0.05 }, "gender": { "male": 0.5, "": 0.37, "female": 0.13, "other": 0 } }, "users": 175, "size": 247174669, "checksum": "3cf9de0620073155dab06d73c89b6200f33b08f9bf410089ec4abe0fdb10d5e6", "avgDurationSecs": 3.57, "validDurationSecs": 17930.957, "totalHrs": 10.04, "validHrs": 4.98 }, "fi": { "duration": 66664227, "buckets": { "dev": 1694, "invalidated": 235, "other": 5900, "reported": 54, "test": 1748, "train": 2105, "validated": 8356 }, "reportedSentences": 55, "clips": 14491, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.18, "": 0.34, "twenties": 0.11, "fourties": 0.33, "teens": 0.01, "fifties": 0.04, "seventies": 0 }, "gender": { "male": 0.32, "": 0.34, "female": 0.34, "other": 0 } }, "users": 208, "size": 397079516, "checksum": "b78d40a194095854d5682d6714ea25a3f311bb824f4e351cef9a1f32562e6ec2", "avgDurationSecs": 4.6, "validDurationSecs": 38440.845, "totalHrs": 18.51, 
"validHrs": 10.67 }, "hu": { "duration": 297940429, "buckets": { "dev": 7688, "invalidated": 1838, "other": 23308, "reported": 338, "test": 7875, "train": 15516, "validated": 31212 }, "reportedSentences": 339, "clips": 56358, "splits": { "accent": { "": 1 }, "age": { "teens": 0.03, "": 0.29, "thirties": 0.16, "twenties": 0.17, "fifties": 0.17, "fourties": 0.14, "sixties": 0.02, "seventies": 0.01 }, "gender": { "male": 0.35, "": 0.3, "female": 0.34, "other": 0.01 } }, "users": 920, "size": 1769841289, "checksum": "01ab3d2fa87729761f250cbd4c1304f1e3e2c38a9fc050a88a584114ad126d6a", "avgDurationSecs": 5.287, "validDurationSecs": 165004.377, "totalHrs": 82.76, "validHrs": 45.83 }, "th": { "duration": 1497189504, "buckets": { "dev": 11002, "invalidated": 9117, "other": 204730, "reported": 4168, "test": 11002, "train": 32477, "validated": 143263 }, "reportedSentences": 4168, "clips": 357110, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.23, "": 0.41, "thirties": 0.07, "fourties": 0.04, "teens": 0.04, "fifties": 0.2, "eighties": 0, "sixties": 0 }, "gender": { "male": 0.42, "": 0.41, "female": 0.17, "other": 0.01 } }, "users": 7784, "size": 8707281050, "checksum": "3b3bc4c1bad6ff2590a7e764941592fa89435a827cadd4e2c5371980adefc31b", "avgDurationSecs": 4.193, "validDurationSecs": 600632.466, "totalHrs": 415.88, "validHrs": 166.84 }, "lt": { "duration": 88824932, "buckets": { "dev": 4215, "invalidated": 651, "other": 1704, "reported": 185, "test": 4319, "train": 6205, "validated": 14917 }, "reportedSentences": 185, "clips": 17272, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.36, "": 0.21, "thirties": 0.31, "fifties": 0.04, "sixties": 0.01, "teens": 0.03, "fourties": 0.04 }, "gender": { "male": 0.56, "": 0.21, "female": 0.22 } }, "users": 273, "size": 541352467, "checksum": "5b9308a384488b8964896bfdf654400825a1ec4bd973c7357269f4a11d16bab5", "avgDurationSecs": 5.143, "validDurationSecs": 76713.844, "totalHrs": 24.67, "validHrs": 21.3 }, "lg": { "duration": 
2011525119, "buckets": { "dev": 13389, "invalidated": 39159, "other": 36922, "reported": 6450, "test": 13420, "train": 70813, "validated": 271640 }, "reportedSentences": 6445, "clips": 347721, "splits": { "accent": { "": 1 }, "age": { "": 0.26, "thirties": 0.22, "twenties": 0.38, "fourties": 0.07, "fifties": 0.05, "teens": 0.01, "nineties": 0, "sixties": 0.02, "seventies": 0, "eighties": 0 }, "gender": { "": 0.25, "female": 0.39, "male": 0.36 } }, "users": 646, "size": 11779660819, "checksum": "ce391d09f447bdd33da0b3249362580179ddf3d7cc522f3902a706ce7b35669d", "avgDurationSecs": 5.785, "validDurationSecs": 1571405.475, "totalHrs": 558.75, "validHrs": 436.5 }, "hi": { "duration": 67634340, "buckets": { "dev": 2281, "invalidated": 706, "other": 3487, "reported": 132, "test": 2947, "train": 4479, "validated": 9751 }, "reportedSentences": 133, "clips": 13944, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.33, "fourties": 0.03, "": 0.34, "thirties": 0.26, "teens": 0.01, "fifties": 0.01, "sixties": 0.01 }, "gender": { "male": 0.62, "female": 0.04, "": 0.34, "other": 0 } }, "users": 365, "size": 398945100, "checksum": "d8df02f35f6fb84ec14bfa3ca491e8f8090cfc52475b84f4e8b79ce0d66d3765", "avgDurationSecs": 4.85, "validDurationSecs": 47296.504, "totalHrs": 18.78, "validHrs": 13.13 }, "bas": { "duration": 10040580, "buckets": { "dev": 457, "invalidated": 508, "other": 10, "reported": 7, "test": 528, "train": 763, "validated": 1748 }, "reportedSentences": 8, "clips": 2266, "splits": { "accent": { "": 1 }, "age": { "": 0.98, "fourties": 0.01, "teens": 0.01 }, "gender": { "": 0.98, "female": 0.02 } }, "users": 33, "size": 55827418, "checksum": "83f512a6347ebac5df13162cee85ad2986fb99d3fecc6488c1d734051d505002", "avgDurationSecs": 4.431, "validDurationSecs": 7745.337, "totalHrs": 2.78, "validHrs": 2.15 }, "sk": { "duration": 87440076, "buckets": { "dev": 2492, "invalidated": 793, "other": 2329, "reported": 47, "test": 2552, "train": 3227, "validated": 18222 }, 
"reportedSentences": 48, "clips": 21344, "splits": { "accent": { "": 1 }, "age": { "": 0.46, "thirties": 0.22, "twenties": 0.08, "fourties": 0.16, "teens": 0.09 }, "gender": { "": 0.45, "male": 0.46, "female": 0.07, "other": 0.01 } }, "users": 185, "size": 492582779, "checksum": "5d4904324f11e6a17b4d6fe1aec4278ac58ed71ae84fcb86b67dd7339b583bb6", "avgDurationSecs": 4.097, "validDurationSecs": 74650.162, "totalHrs": 24.28, "validHrs": 20.73 }, "kmr": { "duration": 347753052, "buckets": { "dev": 3491, "invalidated": 2368, "other": 31131, "reported": 1030, "test": 3485, "train": 4451, "validated": 52411 }, "reportedSentences": 1025, "clips": 85910, "splits": { "accent": { "": 1 }, "age": { "": 0.49, "twenties": 0.35, "thirties": 0.07, "fourties": 0.03, "fifties": 0.04, "teens": 0.01, "sixties": 0 }, "gender": { "": 0.49, "male": 0.41, "female": 0.1, "other": 0 } }, "users": 545, "size": 1831680486, "checksum": "af6a3a2865cca9d90c2e377c6bd1fe2d80f3865845c3e5a780d588b1c242571d", "avgDurationSecs": 4.048, "validDurationSecs": 212153.244, "totalHrs": 96.59, "validHrs": 58.93 }, "bg": { "duration": 66152520, "buckets": { "dev": 2358, "invalidated": 586, "other": 3272, "reported": 181, "test": 2463, "train": 3385, "validated": 8222 }, "reportedSentences": 182, "clips": 12080, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.25, "thirties": 0.22, "": 0.39, "twenties": 0.12, "teens": 0, "sixties": 0, "fifties": 0 }, "gender": { "male": 0.56, "female": 0.05, "": 0.39 } }, "users": 96, "size": 386104092, "checksum": "69f56f3125411a9d9c1c7d907d0cf7b0116316113b3d273805415fc9185ba823", "avgDurationSecs": 5.476, "validDurationSecs": 45025.333, "totalHrs": 18.37, "validHrs": 12.5 }, "kk": { "duration": 7002288, "buckets": { "dev": 369, "invalidated": 197, "other": 1, "reported": 41, "test": 396, "train": 453, "validated": 1223 }, "reportedSentences": 42, "clips": 1421, "splits": { "accent": { "": 1 }, "age": { "": 0.51, "thirties": 0.03, "twenties": 0.31, "teens": 0.05, 
"fifties": 0.09 }, "gender": { "": 0.52, "male": 0.46, "female": 0.02 } }, "users": 90, "size": 40191047, "checksum": "088026aa813183a4d5a61ba8788c079b2f9253df77cef68546d426ee5868e777", "avgDurationSecs": 4.928, "validDurationSecs": 6026.6, "totalHrs": 1.94, "validHrs": 1.67 }, "ba": { "duration": 960572412, "buckets": { "dev": 14507, "invalidated": 7913, "other": 131, "reported": 866, "test": 14580, "train": 119038, "validated": 208963 }, "reportedSentences": 863, "clips": 217007, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.17, "": 0.3, "fourties": 0.06, "fifties": 0.05, "twenties": 0.17, "sixties": 0.2, "seventies": 0, "teens": 0.04 }, "gender": { "male": 0.3, "": 0.3, "female": 0.4 } }, "users": 904, "size": 5383162213, "checksum": "95a8b68a7b66484e7726fea9abbb0b5fc7773a01da36fcc58a75242b197d9cd8", "avgDurationSecs": 4.426, "validDurationSecs": 924965.982, "totalHrs": 266.82, "validHrs": 256.93 }, "gl": { "duration": 213131016, "buckets": { "dev": 6397, "invalidated": 1871, "other": 17121, "reported": 355, "test": 6546, "train": 10951, "validated": 24179 }, "reportedSentences": 355, "clips": 43171, "splits": { "accent": { "": 1 }, "age": { "": 0.31, "thirties": 0.18, "fifties": 0.07, "twenties": 0.21, "fourties": 0.19, "teens": 0.02, "sixties": 0.01, "seventies": 0 }, "gender": { "": 0.36, "male": 0.35, "female": 0.3, "other": 0 } }, "users": 997, "size": 1222354918, "checksum": "433a9dd928cb2f0d9f92c60e0d10e8a7d1aca1005e9cdd092f6a5bdfe9df7e37", "avgDurationSecs": 4.937, "validDurationSecs": 119369.365, "totalHrs": 59.2, "validHrs": 33.15 }, "ug": { "duration": 435382452, "buckets": { "dev": 3555, "invalidated": 3927, "other": 0, "reported": 284, "test": 3555, "train": 4521, "validated": 68366 }, "reportedSentences": 285, "clips": 72293, "splits": { "accent": { "": 1 }, "age": { "": 0.6, "fifties": 0.01, "twenties": 0.11, "thirties": 0.15, "fourties": 0.12, "teens": 0.01, "eighties": 0 }, "gender": { "": 0.6, "male": 0.32, "female": 0.08, "other": 0 
} }, "users": 887, "size": 2530721785, "checksum": "b09aff4bec1cda909cfdf10f021bba6421206d13da6d9037d281b657960b2bdf", "avgDurationSecs": 6.022, "validDurationSecs": 411732.211, "totalHrs": 120.93, "validHrs": 114.37 }, "hy-AM": { "duration": 16940484, "buckets": { "dev": 364, "invalidated": 105, "other": 1230, "reported": 44, "test": 442, "train": 631, "validated": 1438 }, "reportedSentences": 45, "clips": 2773, "splits": { "accent": { "": 1 }, "age": { "": 0.37, "thirties": 0.13, "twenties": 0.37, "fifties": 0.04, "teens": 0.09 }, "gender": { "": 0.37, "male": 0.22, "female": 0.41 } }, "users": 66, "size": 99409773, "checksum": "109b4522d04e14184142ac022527346abe18c7bb6200d5ea98aedaea240e7f08", "avgDurationSecs": 6.109, "validDurationSecs": 8784.86, "totalHrs": 4.7, "validHrs": 2.44 }, "be": { "duration": 5334186312, "buckets": { "dev": 15880, "invalidated": 30692, "other": 75973, "reported": 3134, "test": 15879, "train": 347355, "validated": 1013522 }, "reportedSentences": 3133, "clips": 1120187, "splits": { "accent": { "": 1 }, "age": { "": 0.84, "fourties": 0.05, "thirties": 0.05, "twenties": 0.04, "teens": 0.01, "fifties": 0, "sixties": 0, "seventies": 0 }, "gender": { "": 0.84, "male": 0.07, "female": 0.09, "other": 0 } }, "users": 8052, "size": 30184769413, "checksum": "00a53691d285774e98f5d62a7f9f513b3e237ee580f5665083e009b9430f208f", "avgDurationSecs": 4.762, "validDurationSecs": 4826261.311, "totalHrs": 1481.71, "validHrs": 1340.62 }, "ur": { "duration": 507179196, "buckets": { "dev": 3302, "invalidated": 3312, "other": 85366, "reported": 50, "test": 3304, "train": 4129, "validated": 41882 }, "reportedSentences": 50, "clips": 130560, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.88, "": 0.1, "fourties": 0.01, "thirties": 0, "teens": 0.01, "fifties": 0, "sixties": 0 }, "gender": { "male": 0.7, "": 0.1, "female": 0.2 } }, "users": 202, "size": 2947911505, "checksum": "22e848a2ad547205dbb78c929ef0e21b02ce82e5c926ddfe805b7e03fcc0cee9", 
"avgDurationSecs": 3.885, "validDurationSecs": 162696.684, "totalHrs": 140.88, "validHrs": 45.19 }, "gn": { "duration": 34785252, "buckets": { "dev": 352, "invalidated": 142, "other": 5426, "reported": 35, "test": 811, "train": 1414, "validated": 2655 }, "reportedSentences": 36, "clips": 8223, "splits": { "accent": { "": 1 }, "age": { "": 0.2, "twenties": 0.21, "thirties": 0.41, "sixties": 0, "fourties": 0.15, "teens": 0.02 }, "gender": { "": 0.2, "male": 0.6, "female": 0.19 } }, "users": 90, "size": 199897821, "checksum": "4a2932b1ea10d2b47fbf26a46d3ddc77e7b1a68cd4fdbbb24319218779fa5fa6", "avgDurationSecs": 4.23, "validDurationSecs": 11231.283, "totalHrs": 9.66, "validHrs": 3.11 }, "sr": { "duration": 16845480, "buckets": { "dev": 1276, "invalidated": 119, "other": 1529, "reported": 28, "test": 1263, "train": 1499, "validated": 4330 }, "reportedSentences": 29, "clips": 5978, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.36, "": 0.24, "fifties": 0.2, "fourties": 0.07, "thirties": 0.13, "teens": 0 }, "gender": { "male": 0.51, "": 0.24, "female": 0.25 } }, "users": 134, "size": 92911009, "checksum": "768c88391cc827b95c05bba2377ec5564cf19935e9b56e5016d765ab56fab0c3", "avgDurationSecs": 2.818, "validDurationSecs": 12201.56, "totalHrs": 4.67, "validHrs": 3.38 }, "uz": { "duration": 943613712, "buckets": { "dev": 12061, "invalidated": 13811, "other": 127766, "reported": 1823, "test": 12321, "train": 48286, "validated": 85995 }, "reportedSentences": 1806, "clips": 227572, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.4, "": 0.41, "thirties": 0.01, "teens": 0.18, "fifties": 0, "fourties": 0.01, "nineties": 0 }, "gender": { "male": 0.44, "": 0.41, "female": 0.15, "other": 0 } }, "users": 2109, "size": 5261185727, "checksum": "64d120e4adf234f10b244d6f3dcf7a0ac889a6e669458fd16c7b57f4c049e93b", "avgDurationSecs": 4.146, "validDurationSecs": 356573.134, "totalHrs": 262.11, "validHrs": 99.04 }, "mr": { "duration": 98491032, "buckets": { "dev": 1783, 
"invalidated": 2251, "other": 2799, "reported": 58, "test": 1735, "train": 2226, "validated": 10854 }, "reportedSentences": 59, "clips": 15904, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.13, "sixties": 0, "twenties": 0.29, "": 0.05, "teens": 0.53 }, "gender": { "male": 0.19, "female": 0.76, "": 0.05 } }, "users": 83, "size": 577008222, "checksum": "a71e722038eb3690641f30d8a8046b65c4c8d33210ad716e0286d2751d1e8498", "avgDurationSecs": 6.193, "validDurationSecs": 67217.157, "totalHrs": 27.35, "validHrs": 18.67 }, "da": { "duration": 42516504, "buckets": { "dev": 2222, "invalidated": 354, "other": 841, "reported": 379, "test": 2160, "train": 2746, "validated": 9015 }, "reportedSentences": 379, "clips": 10210, "splits": { "accent": { "": 1 }, "age": { "": 0.32, "thirties": 0.26, "twenties": 0.18, "sixties": 0, "fourties": 0.21, "fifties": 0.03, "teens": 0 }, "gender": { "": 0.32, "female": 0.08, "male": 0.61 } }, "users": 228, "size": 243857551, "checksum": "6c85261bcf8dffe5c06ad29c82760cda5cd1fdc7d9c1c99b6285a425f11d105e", "avgDurationSecs": 4.164, "validDurationSecs": 37540.282, "totalHrs": 11.81, "validHrs": 10.42 }, "myv": { "duration": 11274696, "buckets": { "dev": 239, "invalidated": 21, "other": 0, "reported": 30, "test": 449, "train": 1241, "validated": 1939 }, "reportedSentences": 31, "clips": 1960, "splits": { "accent": { "": 1 }, "age": { "sixties": 0.26, "": 0.39, "thirties": 0.25, "twenties": 0.09, "teens": 0.01 }, "gender": { "male": 0.54, "": 0.39, "female": 0.07 } }, "users": 12, "size": 65894328, "checksum": "741e516b505e2552aa54adf74c10811a3230d2084edb02fd4d6783b2b20f4e72", "avgDurationSecs": 5.752, "validDurationSecs": 11153.896, "totalHrs": 3.13, "validHrs": 3.09 }, "nn-NO": { "duration": 3677832, "buckets": { "dev": 197, "invalidated": 42, "other": 16, "reported": 22, "test": 230, "train": 314, "validated": 746 }, "reportedSentences": 23, "clips": 804, "splits": { "accent": { "": 1 }, "age": { "": 0.37, "thirties": 0.32, "twenties": 
0.23, "fourties": 0.04, "fifties": 0.01, "teens": 0.03 }, "gender": { "": 0.37, "female": 0.23, "male": 0.37, "other": 0.03 } }, "users": 29, "size": 20884241, "checksum": "f39f03129e67eb69534db439c127e62bdc2d39a02ef92e5e2daddc3358208512", "avgDurationSecs": 4.574, "validDurationSecs": 3412.516, "totalHrs": 1.02, "validHrs": 0.94 }, "ha": { "duration": 43793136, "buckets": { "dev": 580, "invalidated": 169, "other": 6660, "reported": 32, "test": 659, "train": 1926, "validated": 3277 }, "reportedSentences": 32, "clips": 10106, "splits": { "accent": { "": 1 }, "age": { "": 0.18, "thirties": 0.68, "twenties": 0.12, "fourties": 0, "fifties": 0.02 }, "gender": { "": 0.16, "male": 0.59, "female": 0.25 } }, "users": 39, "size": 256132468, "checksum": "dde8d350a4f0a109108adfc8e55a53b2d7d26a08ec537ecc4060836213ded990", "avgDurationSecs": 4.333, "validDurationSecs": 14200.486, "totalHrs": 12.16, "validHrs": 3.94 }, "ckb": { "duration": 456040944, "buckets": { "dev": 4940, "invalidated": 7379, "other": 7350, "reported": 2380, "test": 4940, "train": 7035, "validated": 102012 }, "reportedSentences": 2380, "clips": 116741, "splits": { "accent": { "": 1 }, "age": { "": 0.35, "thirties": 0.12, "twenties": 0.48, "fourties": 0.03, "teens": 0.02, "fifties": 0.02 }, "gender": { "": 0.33, "male": 0.61, "female": 0.07, "other": 0 } }, "users": 1269, "size": 2470127013, "checksum": "6592784e044be63d085d9b288db2dfa61d82a227e70803022c48b9758d68c840", "avgDurationSecs": 3.906, "validDurationSecs": 398503.086, "totalHrs": 126.67, "validHrs": 110.69 }, "ml": { "duration": 14479308, "buckets": { "dev": 0, "invalidated": 16, "other": 2817, "reported": 115, "test": 215, "train": 509, "validated": 724 }, "reportedSentences": 116, "clips": 3557, "splits": { "accent": { "": 1 }, "age": { "": 0.33, "twenties": 0.35, "thirties": 0.04, "fourties": 0.29 }, "gender": { "": 0.33, "male": 0.67 } }, "users": 33, "size": 84013164, "checksum": 
"0c53537225d238bcbf6ae11d8e3ecbffbaf8f74922bbd2a4989ea3f5931e3321", "avgDurationSecs": 4.071, "validDurationSecs": 2947.152, "totalHrs": 4.02, "validHrs": 0.81 }, "mdf": { "duration": 1811340, "buckets": { "dev": 54, "invalidated": 8, "other": 1, "reported": 14, "test": 107, "train": 175, "validated": 336 }, "reportedSentences": 15, "clips": 345, "splits": { "accent": { "": 1 }, "age": { "sixties": 0.06, "": 0.58, "fourties": 0.34, "twenties": 0.02 }, "gender": { "male": 0.08, "": 0.58, "female": 0.34 } }, "users": 11, "size": 10631887, "checksum": "33f38dce47f710bef75e57e5b40e98e61f1877cd136775e6e3e7f1dc45895413", "avgDurationSecs": 5.25, "validDurationSecs": 1764.088, "totalHrs": 0.5, "validHrs": 0.49 }, "sw": { "duration": 3264314328, "buckets": { "dev": 11274, "invalidated": 64047, "other": 324868, "reported": 1866, "test": 11271, "train": 34980, "validated": 231468 }, "reportedSentences": 1861, "clips": 620383, "splits": { "accent": { "": 1 }, "age": { "": 0.34, "twenties": 0.44, "thirties": 0.13, "teens": 0, "fifties": 0.05, "fourties": 0.04, "sixties": 0.01 }, "gender": { "": 0.32, "male": 0.36, "female": 0.32, "other": 0 } }, "users": 1103, "size": 19044510370, "checksum": "a04acaa50b6fe75d5f8ece78660b2827716a91d99e93600d78c09ebecce42e72", "avgDurationSecs": 5.262, "validDurationSecs": 1217932.001, "totalHrs": 906.75, "validHrs": 338.31 }, "sat": { "duration": 3624516, "buckets": { "dev": 0, "invalidated": 9, "other": 364, "reported": 6, "test": 144, "train": 292, "validated": 436 }, "reportedSentences": 7, "clips": 809, "splits": { "accent": { "": 1 }, "age": { "": 0.43, "twenties": 0.43, "fourties": 0.01, "fifties": 0.01, "teens": 0.01, "thirties": 0.1 }, "gender": { "": 0.41, "male": 0.58, "female": 0.01 } }, "users": 14, "size": 19983401, "checksum": "c58f2434a2974518e727897fceae8030be7ea5403b2dc18652e4921c046ba2cc", "avgDurationSecs": 4.48, "validDurationSecs": 1953.386, "totalHrs": 1, "validHrs": 0.54 }, "tig": { "duration": 103284, "buckets": { 
"dev": 0, "invalidated": 12, "other": 0, "reported": 0, "test": 1, "train": 10, "validated": 11 }, "reportedSentences": 1, "clips": 23, "splits": { "accent": { "": 1 }, "age": { "": 0.78, "twenties": 0.22 }, "gender": { "": 0.78, "male": 0.22 } }, "users": 5, "size": 602992, "checksum": "b0a1697ef65f1f5f30476c789aac7e6f54acc7dc99c82112642e2c87cd5665f6", "avgDurationSecs": 4.491, "validDurationSecs": 49.397, "totalHrs": 0.02, "validHrs": 0.01 }, "ig": { "duration": 31531356, "buckets": { "dev": 2, "invalidated": 3, "other": 5754, "reported": 15, "test": 4, "train": 8, "validated": 14 }, "reportedSentences": 15, "clips": 5771, "splits": { "accent": { "": 1 }, "age": { "": 0.56, "twenties": 0.32, "teens": 0.05, "eighties": 0, "thirties": 0.04, "sixties": 0.02, "fourties": 0 }, "gender": { "": 0.56, "male": 0.14, "female": 0.3 } }, "users": 112, "size": 184140356, "checksum": "7a580953242f8f1072d1b816859e9b9a00fa2f34311a9af0ebfda6120ae0806a", "avgDurationSecs": 5.464, "validDurationSecs": 76.493, "totalHrs": 8.75, "validHrs": 0.02 }, "nan-tw": { "duration": 38958552, "buckets": { "dev": 755, "invalidated": 294, "other": 10569, "reported": 140, "test": 1139, "train": 1833, "validated": 3737 }, "reportedSentences": 141, "clips": 14600, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.25, "": 0.13, "twenties": 0.36, "fourties": 0.22, "teens": 0.02, "fifties": 0.01, "sixties": 0 }, "gender": { "male": 0.53, "": 0.13, "other": 0.12, "female": 0.22 } }, "users": 120, "size": 214575545, "checksum": "c253dbed21c6e4100dc9e3f3ebbde78e32ca38b7d47128d596732f60fdb5ab37", "avgDurationSecs": 2.668, "validDurationSecs": 9971.788, "totalHrs": 10.82, "validHrs": 2.76 }, "mhr": { "duration": 745755624, "buckets": { "dev": 13132, "invalidated": 4762, "other": 41051, "reported": 77, "test": 13752, "train": 86150, "validated": 114848 }, "reportedSentences": 76, "clips": 160661, "splits": { "accent": { "": 1 }, "age": { "fifties": 0.05, "": 0.16, "sixties": 0.05, "thirties": 0.29, 
"fourties": 0.15, "twenties": 0.22, "teens": 0.07, "seventies": 0.01 }, "gender": { "male": 0.23, "": 0.16, "female": 0.61 } }, "users": 388, "size": 4162732159, "checksum": "fdc30f70750bb13213f0d5c699f31be429445533a4acc29e7aacbd4ba438f332", "avgDurationSecs": 4.642, "validDurationSecs": 533101.013, "totalHrs": 207.15, "validHrs": 148.08 }, "bn": { "duration": 4559272416, "buckets": { "dev": 9230, "invalidated": 7683, "other": 994175, "reported": 1862, "test": 9230, "train": 20729, "validated": 43362 }, "reportedSentences": 1856, "clips": 1045220, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.03, "twenties": 0.67, "": 0.22, "teens": 0.07, "fourties": 0.01, "fifties": 0 }, "gender": { "male": 0.54, "": 0.22, "female": 0.23, "other": 0 } }, "users": 22817, "size": 26255160682, "checksum": "fa3f3cfc75b68741dac50f0bb271a7e83ca28a3191e920042d881a6f8badf112", "avgDurationSecs": 4.362, "validDurationSecs": 189145.989, "totalHrs": 1266.46, "validHrs": 52.54 }, "tok": { "duration": 43560792, "buckets": { "dev": 1883, "invalidated": 234, "other": 2261, "reported": 129, "test": 1907, "train": 2732, "validated": 9033 }, "reportedSentences": 130, "clips": 11528, "splits": { "accent": { "": 1 }, "age": { "": 0.41, "twenties": 0.18, "teens": 0.3, "thirties": 0.1, "fourties": 0 }, "gender": { "": 0.41, "male": 0.4, "other": 0.12, "female": 0.07 } }, "users": 119, "size": 253192772, "checksum": "1de25832d6417466aa16561594ced6d4a2b72da694910e1198f69f1c22e3ff5e", "avgDurationSecs": 3.779, "validDurationSecs": 34132.949, "totalHrs": 12.1, "validHrs": 9.48 }, "yue": { "duration": 251348868, "buckets": { "dev": 2485, "invalidated": 1629, "other": 39882, "reported": 868, "test": 2526, "train": 3030, "validated": 20081 }, "reportedSentences": 864, "clips": 61592, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.21, "": 0.32, "twenties": 0.4, "fourties": 0.03, "sixties": 0.01, "fifties": 0, "teens": 0.03 }, "gender": { "male": 0.26, "": 0.36, "female": 0.37, "other": 0.01 
} }, "users": 827, "size": 1406690648, "checksum": "1d51745b0e7b4dd79f9eb25f8129d9f90dcc7346ec2ff3aaa4ab9a69f5aaff66", "avgDurationSecs": 4.081, "validDurationSecs": 81947.925, "totalHrs": 69.81, "validHrs": 22.76 }, "sah": { "duration": 24741156, "buckets": { "dev": 1083, "invalidated": 102, "other": 1, "reported": 2, "test": 1270, "train": 1594, "validated": 4015 }, "reportedSentences": 3, "clips": 4118, "splits": { "accent": { "": 1 }, "age": { "": 0.36, "twenties": 0.03, "fourties": 0.07, "thirties": 0.43, "teens": 0.1, "fifties": 0 }, "gender": { "": 0.36, "male": 0.53, "female": 0.11 } }, "users": 55, "size": 187502644, "checksum": "e6970ac845dd0f3377404eaba252c0407f3c64b1d931aabd75ec7c6f7aed132d", "avgDurationSecs": 6.008, "validDurationSecs": 24122.327, "totalHrs": 6.87, "validHrs": 6.7 }, "mk": { "duration": 1110348, "buckets": { "dev": 0, "invalidated": 11, "other": 57, "reported": 8, "test": 26, "train": 115, "validated": 141 }, "reportedSentences": 9, "clips": 209, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.43, "": 0.33, "twenties": 0.17, "teens": 0.07 }, "gender": { "male": 0.67, "": 0.33 } }, "users": 7, "size": 6527029, "checksum": "c2275e2901850f3a54cef485a8099714fc38c48fab148838f7d89c37296a4c8f", "avgDurationSecs": 5.313, "validDurationSecs": 749.086, "totalHrs": 0.3, "validHrs": 0.2 }, "sc": { "duration": 5936256, "buckets": { "dev": 234, "invalidated": 25, "other": 284, "reported": 2, "test": 177, "train": 476, "validated": 1036 }, "reportedSentences": 3, "clips": 1345, "splits": { "accent": { "": 1 }, "age": { "": 0.62, "thirties": 0.31, "twenties": 0.08 }, "gender": { "": 0.62, "female": 0.31, "male": 0.08 } }, "users": 12, "size": 34282030, "checksum": "5848ec3b17ae8c47e3e15ee762d930e7757701a1f91c8f195b0384c5a5db2193", "avgDurationSecs": 4.414, "validDurationSecs": 4572.462, "totalHrs": 1.64, "validHrs": 1.27 }, "skr": { "duration": 23051304, "buckets": { "dev": 1026, "invalidated": 428, "other": 1967, "reported": 16, "test": 834, 
"train": 1327, "validated": 3192 }, "reportedSentences": 2, "clips": 5587, "splits": { "accent": { "": 1 }, "age": { "": 0.35, "twenties": 0.25, "fifties": 0.39, "teens": 0, "fourties": 0, "thirties": 0.01 }, "gender": { "": 0.35, "male": 0.65 } }, "users": 41, "size": 131854098, "checksum": "b6c3c9ee1e9661de973298e331efe26f523589e300eddaa1e23b74969f2e4f42", "avgDurationSecs": 4.126, "validDurationSecs": 13169.816, "totalHrs": 6.4, "validHrs": 3.65 }, "ti": { "duration": 132912, "buckets": { "dev": 0, "invalidated": 0, "other": 10, "reported": 1, "test": 3, "train": 10, "validated": 13 }, "reportedSentences": 2, "clips": 23, "splits": { "accent": { "": 1 }, "age": { "": 1 }, "gender": { "": 1 } }, "users": 4, "size": 784757, "checksum": "317936e0faac748f4a73ab5f3b17d4fb6909d553e6bc67c456666b3d11d415ab", "avgDurationSecs": 5.779, "validDurationSecs": 75.124, "totalHrs": 0.03, "validHrs": 0.02 }, "mrj": { "duration": 104076468, "buckets": { "dev": 3297, "invalidated": 272, "other": 10180, "reported": 49, "test": 4428, "train": 7272, "validated": 15125 }, "reportedSentences": 50, "clips": 25577, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.29, "": 0.12, "thirties": 0.24, "sixties": 0.04, "fourties": 0.15, "fifties": 0.14, "teens": 0.01 }, "gender": { "male": 0.26, "female": 0.63, "": 0.12 } }, "users": 48, "size": 575489473, "checksum": "42877608f77aa7a1a548bce9fea0d5aee628a75d2688340923b008648a976273", "avgDurationSecs": 4.069, "validDurationSecs": 61545.786, "totalHrs": 28.91, "validHrs": 17.09 }, "tw": { "duration": 943884, "buckets": { "dev": 0, "invalidated": 0, "other": 202, "reported": 2, "test": 3, "train": 32, "validated": 35 }, "reportedSentences": 3, "clips": 237, "splits": { "accent": { "": 1 }, "age": { "": 0.07, "twenties": 0.02, "thirties": 0.91 }, "gender": { "": 0.07, "male": 0.93 } }, "users": 5, "size": 5529362, "checksum": "2c8e68f83c5ef0926855a6ede4a24bcca6f806293fffd952c6adf9a00782ffdc", "avgDurationSecs": 3.983, "validDurationSecs": 
139.392, "totalHrs": 0.26, "validHrs": 0.03 }, "ko": { "duration": 14055624, "buckets": { "dev": 105, "invalidated": 97, "other": 1746, "reported": 13, "test": 131, "train": 192, "validated": 429 }, "reportedSentences": 14, "clips": 2272, "splits": { "accent": { "": 1 }, "age": { "": 0.14, "thirties": 0.43, "twenties": 0.35, "teens": 0.04, "fourties": 0.04 }, "gender": { "": 0.14, "male": 0.72, "female": 0.14, "other": 0 } }, "users": 42, "size": 81829460, "checksum": "a308f2a8b7991716417c12aedba39dd4a93d2410a5a148531ddd5ecc89434af4", "avgDurationSecs": 6.186, "validDurationSecs": 2653.989, "totalHrs": 3.9, "validHrs": 0.73 }, "yo": { "duration": 24984468, "buckets": { "dev": 554, "invalidated": 151, "other": 2082, "reported": 17, "test": 638, "train": 797, "validated": 1990 }, "reportedSentences": 18, "clips": 4223, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.02, "twenties": 0.65, "": 0.3, "teens": 0.03, "sixties": 0 }, "gender": { "male": 0.33, "": 0.3, "female": 0.37 } }, "users": 93, "size": 145667521, "checksum": "9d10c538a7d0237e5f21d2c516e1c07957e0ae318c25203d33b5d5fedb2ac2c6", "avgDurationSecs": 5.916, "validDurationSecs": 11773.405, "totalHrs": 6.94, "validHrs": 3.27 }, "oc": { "duration": 37154340, "buckets": { "dev": 173, "invalidated": 129, "other": 6647, "reported": 8, "test": 184, "train": 208, "validated": 840 }, "reportedSentences": 9, "clips": 7616, "splits": { "accent": { "": 1 }, "age": { "": 0.37, "fifties": 0.2, "twenties": 0.01, "thirties": 0.04, "seventies": 0, "fourties": 0.22, "teens": 0.01, "sixties": 0.14 }, "gender": { "": 0.39, "male": 0.09, "female": 0.52 } }, "users": 138, "size": 208779711, "checksum": "e241c12159ac7b3d880f41d5e91d804775da188a3ac413c775341eef3406001b", "avgDurationSecs": 4.878, "validDurationSecs": 4097.905, "totalHrs": 10.32, "validHrs": 1.13 }, "tk": { "duration": 6197616, "buckets": { "dev": 232, "invalidated": 53, "other": 270, "reported": 5, "test": 281, "train": 383, "validated": 896 }, 
"reportedSentences": 6, "clips": 1219, "splits": { "accent": { "": 1 }, "age": { "": 0.08, "twenties": 0.75, "thirties": 0.18 }, "gender": { "": 0.08, "male": 0.29, "female": 0.63 } }, "users": 14, "size": 36062710, "checksum": "5dcb82753e5192d731761b364836bb1fbc70b82e234a55f8e4a2168cca8746a8", "avgDurationSecs": 5.084, "validDurationSecs": 4555.426, "totalHrs": 1.72, "validHrs": 1.26 }, "vot": { "duration": 1025976, "buckets": { "dev": 0, "invalidated": 324, "other": 0, "test": 6, "train": 96, "validated": 102 }, "clips": 426, "splits": { "accent": { "": 1 }, "age": { "": 0.25, "twenties": 0.73, "teens": 0.01 }, "gender": { "": 0.25, "male": 0.75 } }, "users": 5, "size": 7891176, "checksum": "5ec41e90ea98ece390981fd44ca5b51728791988a2c116b52b2b3ad3b216c35f", "avgDurationSecs": 2.408, "validDurationSecs": 245.656, "totalHrs": 0.28, "validHrs": 0.06 }, "az": { "duration": 1441620, "buckets": { "dev": 21, "invalidated": 34, "other": 149, "reported": 0, "test": 27, "train": 39, "validated": 87 }, "clips": 270, "splits": { "accent": { "": 1 }, "age": { "": 0.38, "twenties": 0.37, "fourties": 0.01, "thirties": 0.24 }, "gender": { "": 0.38, "male": 0.62 } }, "users": 21, "size": 8439871, "checksum": "dab12e4b9250a93cfdb89b97a038a616f5fc95c28b91bc0cfa966e313503bd9a", "avgDurationSecs": 5.339, "validDurationSecs": 464.522, "totalHrs": 0.4, "validHrs": 0.12 }, "ast": { "duration": 4813164, "buckets": { "dev": 81, "invalidated": 12, "other": 684, "test": 79, "train": 205, "validated": 365 }, "clips": 1061, "splits": { "accent": { "": 1 }, "age": { "": 0.25, "fourties": 0.42, "fifties": 0.33 }, "gender": { "": 0.25, "male": 0.75 } }, "users": 12, "size": 27001785, "checksum": "c40efb1033e7ced0168637485bda22b2e60c53cab4cfe880b09b1ec0d80634bc", "avgDurationSecs": 4.536, "validDurationSecs": 1655.801, "totalHrs": 1.33, "validHrs": 0.45 }, "ne-NP": { "duration": 2276280, "buckets": { "dev": 49, "invalidated": 35, "other": 245, "test": 74, "train": 173, "validated": 296 }, 
"clips": 576, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.29, "": 0.46, "twenties": 0.24 }, "gender": { "male": 0.54, "": 0.46 } }, "users": 15, "size": 13220758, "checksum": "637db7008c47c43e3d0d10c097d7dd25a3e3a00a1849c9b8f1f06a56e06cde8b", "avgDurationSecs": 3.952, "validDurationSecs": 1169.755, "totalHrs": 0.63, "validHrs": 0.32 }, "quy": { "duration": 4320, "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 1, "validated": 1 }, "clips": 1, "splits": { "accent": { "": 1 }, "age": { "": 1 }, "gender": { "": 1 } }, "users": 1, "size": 25833, "checksum": "f7b24f25235e87f58a0b89362e541255dc3bfb55f19f94d6a0caa352ef86b6eb", "avgDurationSecs": 4.32, "validDurationSecs": 4.32, "totalHrs": 0, "validHrs": 0 }, "lo": { "duration": 294552, "buckets": { "dev": 0, "invalidated": 5, "other": 1, "test": 10, "train": 28, "validated": 38 }, "clips": 44, "splits": { "accent": { "": 1 }, "age": { "": 0.25, "twenties": 0.05, "fourties": 0.02, "thirties": 0.68 }, "gender": { "": 0.25, "male": 0.75 } }, "users": 6, "size": 1736458, "checksum": "3e1cd64e6bcca689dce85e9b6c1f04a467c1e255041921d9563c99d5265b2c6b", "avgDurationSecs": 6.694, "validDurationSecs": 254.386, "totalHrs": 0.08, "validHrs": 0.07 }, "dyu": { "duration": 6228, "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 1, "validated": 1 }, "clips": 1, "splits": { "accent": { "": 1 }, "age": { "": 1 }, "gender": { "": 1 } }, "users": 1, "size": 37380, "checksum": "767317573e21e6d5279d5499f3ec3402a24d7547218283a36c2148767a120422", "avgDurationSecs": 6.228, "validDurationSecs": 6.228, "totalHrs": 0, "validHrs": 0 }, "is": { "duration": 70416, "buckets": { "dev": 0, "invalidated": 0, "other": 10, "test": 0, "train": 0, "validated": 0 }, "clips": 10, "splits": { "accent": { "": 1 }, "age": { "": 1 }, "gender": { "": 1 } }, "users": 1, "size": 410396, "checksum": "48db6e809f5b6eb0c00b077e6b736aeeee5d544ee3f2fdd059244da88926c040", "avgDurationSecs": 7.042, 
"validDurationSecs": 0, "totalHrs": 0.01, "validHrs": 0 } }, "totalDuration": 97709611853, "totalValidDurationSecs": 63681475, "totalHrs": 27141, "totalValidHrs": 17689 } ================================================ FILE: datasets/scripted-speech/cv-corpus-13.0-delta-2023-03-09.json ================================================ { "locales": { "de": { "duration": 202351608, "reportedSentences": 518, "clips": 34540, "users": 369, "size": 1160394847, "checksum": "71664fadd4189922f3c814889f640111e925fb511b290242e10e7a768bd7b1bb", "avgDurationSecs": 5.21, "validDurationSecs": 189120.7, "totalHrs": 56.21, "validHrs": 52.53 }, "en": { "reportedSentences": 171, "duration": 171820404, "clips": 30268, "users": 1117, "size": 2267504328, "checksum": "e9af8ba157ea45d94b98490004b8e7e1b8432414e27be396b27b94e99e284421", "avgDurationSecs": 5.173, "validDurationSecs": 165243.06, "totalHrs": 47.73, "validHrs": 45.9 }, "fa": { "reportedSentences": 36, "duration": 7194528, "clips": 2188, "users": 37, "size": 42431121, "checksum": "921ff70850b58468bcc232f1d6f8e7c5bf58aff2ee1efdd4f26de19e75f7ed2a", "avgDurationSecs": 3.987, "validDurationSecs": 12746.16, "totalHrs": 2, "validHrs": 3.54 }, "fr": { "reportedSentences": 91, "duration": 40855680, "clips": 7472, "users": 298, "size": 222715295, "checksum": "a44e73aa19a6805838a1302c1d311b704e90de7463da187b4b4ac3bcfba053fd", "avgDurationSecs": 5.006, "validDurationSecs": 52328.66, "totalHrs": 11.35, "validHrs": 14.54 }, "es": { "reportedSentences": 35, "duration": 86365908, "clips": 17614, "users": 207, "size": 492559873, "checksum": "4d2b6be24bc13b4dce65bb8f63531225ed68b9590ef768c4245decfe319bc7cf", "avgDurationSecs": 4.864, "validDurationSecs": 39443.95, "totalHrs": 23.99, "validHrs": 10.95 }, "sl": { "reportedSentences": 0, "duration": 555624, "clips": 163, "users": 2, "size": 3128563, "checksum": "cbd0b177478b176d3b3162b9a48e54ad3240f6be46b35308b45d6f7d5788c289", "avgDurationSecs": 3.779, "validDurationSecs": 167.12, "totalHrs": 
0.16, "validHrs": 0.05 }, "kab": { "reportedSentences": 36, "duration": 2446272, "clips": 652, "users": 7, "size": 8232374, "checksum": "9461233ae95d9850048bacf0c2d5d3102384e6bf8e542978cdd11732053dcf42", "avgDurationSecs": 3.337, "validDurationSecs": 913.51, "totalHrs": 0.68, "validHrs": 0.25 }, "cy": { "reportedSentences": 0, "duration": 3139092, "clips": 477, "users": 17, "size": 17623197, "checksum": "c54adb86f51b50eb5813d793e3a7b21ddd69e9a69df8f73eb7a147e47957ca68", "avgDurationSecs": 4.87, "validDurationSecs": 2425.99, "totalHrs": 0.87, "validHrs": 0.67 }, "ca": { "duration": 616824564, "reportedSentences": 235, "clips": 120955, "users": 1053, "size": 3392925119, "checksum": "7a7f050ea8e98ad8aa8082758ec2065d16024ebcc5300c6e8baa47a5dfdeecf2", "avgDurationSecs": 5.469, "validDurationSecs": 529080.77, "totalHrs": 171.34, "validHrs": 146.97 }, "tt": { "duration": 240048, "reportedSentences": 1, "clips": 53, "users": 5, "size": 854443, "checksum": "88f304cc84abff09d809698f58de875663516f9a2b7e85ed351f525dcec520ce", "avgDurationSecs": 3.756, "validDurationSecs": 123.03, "totalHrs": 0.07, "validHrs": 0.04 }, "ta": { "duration": 8423532, "reportedSentences": 10, "clips": 1366, "users": 50, "size": 46621851, "checksum": "51c38e4ce57c21b1352f9d981dbe19702d01a093f230fc6271fb8bf97f8665d1", "avgDurationSecs": 6.214, "validDurationSecs": 5442.77, "totalHrs": 2.34, "validHrs": 1.51 }, "ru": { "duration": 61113708, "reportedSentences": 10, "clips": 12357, "users": 86, "size": 356799822, "checksum": "0992166fab13d7a8a8df539b0667c4cd947de88f2b61f107c3805c04ceae0a43", "avgDurationSecs": 5.149, "validDurationSecs": 30520.08, "totalHrs": 16.98, "validHrs": 8.48 }, "nl": { "duration": 10094112, "reportedSentences": 1, "clips": 1979, "users": 53, "size": 52331402, "checksum": "2a8edc9005bbc8a3623ce25bfe95979bc9144e49a09468e8fd574ea76de30d94", "avgDurationSecs": 4.34, "validDurationSecs": 7204.87, "totalHrs": 2.8, "validHrs": 2 }, "it": { "duration": 14341608, "reportedSentences": 42, 
"clips": 2524, "users": 49, "size": 76513595, "checksum": "38d09ee666b8b813415e37d2c149d804c25f1d4dc149e602325554c7ed5f2580", "avgDurationSecs": 5.366, "validDurationSecs": 14224.02, "totalHrs": 3.98, "validHrs": 3.95 }, "eu": { "duration": 3358584, "reportedSentences": 6, "clips": 615, "users": 18, "size": 18823982, "checksum": "07e2c8846ad8307b9790f402fa45a66ce71a2f1c95df8ec8c82d6fb3e9214a33", "avgDurationSecs": 5.189, "validDurationSecs": 496.72, "totalHrs": 0.94, "validHrs": 0.14 }, "tr": { "duration": 11970036, "reportedSentences": 9, "clips": 3021, "users": 43, "size": 68215413, "checksum": "c1a9adf7e17d82022f0ef16d9143560f8fdb426eba15dea385d1dcf64432a1b1", "avgDurationSecs": 3.633, "validDurationSecs": 11182.69, "totalHrs": 3.33, "validHrs": 3.11 }, "ar": { "duration": 783772, "reportedSentences": 5, "clips": 188, "users": 21, "size": 3114577, "checksum": "b51f50ecee1a7323ecf6d5f0dae5c1140df0c583d978cb8b3cf05fe6cc1e2d38", "avgDurationSecs": 4.169, "validDurationSecs": 812.96, "totalHrs": 0.22, "validHrs": 0.22 }, "zh-TW": { "duration": 12323412, "reportedSentences": 1, "clips": 3137, "users": 22, "size": 52743365, "checksum": "2bfa61c1c915713204aa8875dfa8dd2dc1aa6fc56245cf90c2ed13fded6a7253", "avgDurationSecs": 3.329, "validDurationSecs": 5590.1, "totalHrs": 3.42, "validHrs": 1.55 }, "br": { "duration": 927792, "reportedSentences": 4, "clips": 295, "users": 1, "size": 5276793, "checksum": "ca761f982405be3e675b6d11e42938cf72f2ec1d1622f3da3d1c4df2e1758e45", "avgDurationSecs": 3.134, "validDurationSecs": 67.14, "totalHrs": 0.25, "validHrs": 0.02 }, "pt": { "duration": 59080824, "reportedSentences": 87, "clips": 15033, "users": 132, "size": 343285323, "checksum": "7dc2dbf1ad261825ebca28ec594a3adc9b9d14d33a64e5f05e0fccad9ed44bd1", "avgDurationSecs": 4.08, "validDurationSecs": 41332.49, "totalHrs": 16.41, "validHrs": 11.48 }, "eo": { "duration": 91297508, "reportedSentences": 10, "clips": 2539, "users": 18, "size": 69067947, "checksum": 
"df26401578f024d2d1748438bc2bb8da1f4fcd066d26b5b1f1fb7dbc88ca4537", "avgDurationSecs": 6.068, "validDurationSecs": 58093.53, "totalHrs": 25.36, "validHrs": 16.13 }, "zh-CN": { "duration": 21616488, "reportedSentences": 21, "clips": 4494, "users": 78, "size": 126290705, "checksum": "c160d6c83e20424692ffacbbf2fc6480c6613c0913ae5cb1d2a2ddf7d1c239cc", "avgDurationSecs": 4.551, "validDurationSecs": 11217.32, "totalHrs": 6, "validHrs": 3.11 }, "id": { "duration": 4125564, "reportedSentences": 3, "clips": 1205, "users": 10, "size": 23737312, "checksum": "35e1c00612c9a306be8597d9544c032bafac35d892456f6a6186e53117a0933c", "avgDurationSecs": 3.991, "validDurationSecs": 293.17, "totalHrs": 1.14, "validHrs": 0.08 }, "ia": { "duration": 29628, "reportedSentences": 6, "clips": 5, "users": 0, "size": 2030, "checksum": "cae091b1e805fb1bc4621b5547ca755dc8e655b872bf19c2d06c2218ef12ab13", "avgDurationSecs": 4.182, "validDurationSecs": 262, "totalHrs": 0.01, "validHrs": 0.07 }, "lv": { "duration": 24867540, "reportedSentences": 0, "clips": 6343, "users": 194, "size": 138703960, "checksum": "717c7e08b7ea15f95a4629b7bdacfa416c74b66e1d0ea491ee8f1cf16715b359", "avgDurationSecs": 3.624, "validDurationSecs": 19780.85, "totalHrs": 6.91, "validHrs": 5.49 }, "ja": { "duration": 99197640, "reportedSentences": -4, "clips": 17137, "users": 60, "size": 577342820, "checksum": "14188e016cc264e7d4381e4835d1efd54623f35f0ffb85562004d1ae906fe5c4", "avgDurationSecs": 4.553, "validDurationSecs": 11223.17, "totalHrs": 27.56, "validHrs": 3.12 }, "rw": { "duration": 823644, "reportedSentences": 0, "clips": 177, "users": 21, "size": -25027071, "checksum": "9570b365918a227121b1111de3f2404a6624b51e0374683da7c2701a4327fb99", "avgDurationSecs": 5.008, "validDurationSecs": 623.39, "totalHrs": 0.22, "validHrs": 0.17 }, "sv-SE": { "duration": 4138380, "reportedSentences": 2, "clips": 883, "users": 22, "size": 23698432, "checksum": "760816a9eb17ede1738f08fc44ff87d6e9f9e26a6e6f4fe4f979084aba8de708", "avgDurationSecs": 
3.978, "validDurationSecs": 2179.07, "totalHrs": 1.15, "validHrs": 0.61 }, "cnh": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": -63479, "checksum": "38066864bff36a1ee125d901be9602f6d25061ef9a5f30fcb8f0e953e3a37208", "avgDurationSecs": 3.564, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "et": { "duration": 0, "reportedSentences": 1, "clips": 0, "users": 0, "size": -534602, "checksum": "dfce8875a0c27ab49fa5167ab20b264d1e5773c052bb45c3524d1e2c3e400454", "avgDurationSecs": 6.758, "validDurationSecs": 202.76, "totalHrs": 0, "validHrs": 0.05 }, "ky": { "duration": 155880, "reportedSentences": 1, "clips": 34, "users": 2, "size": 552264, "checksum": "d964f43dacf1b4dd69ec8e2ac32ee6ab745d46c760c4c23632752b00abaebc51", "avgDurationSecs": 4.538, "validDurationSecs": 155.63, "totalHrs": 0.04, "validHrs": 0.05 }, "ro": { "duration": 2874096, "reportedSentences": 21, "clips": 666, "users": 17, "size": 16309967, "checksum": "d4e66e8267946831ffb6af6e6e53957cbfa6dbf6621af23be0197482f25328f5", "avgDurationSecs": 3.993, "validDurationSecs": 1946.06, "totalHrs": 0.8, "validHrs": 0.54 }, "hsb": { "duration": 111024, "reportedSentences": 0, "clips": 15, "users": 1, "size": 598148, "checksum": "3e9666e60a5da9bfd80879d8b1f015cce61d59729a15eee0912162b0bb5d93ce", "avgDurationSecs": 6.12, "validDurationSecs": 16.38, "totalHrs": 0.03, "validHrs": 0.01 }, "el": { "duration": 5628240, "reportedSentences": 5, "clips": 1339, "users": 9, "size": 32356517, "checksum": "77499bae45f5a2308de48f71e20fec3ed92a54a7a7583b255f1c0a80c1d027df", "avgDurationSecs": 4.132, "validDurationSecs": 1400.31, "totalHrs": 1.56, "validHrs": 0.38 }, "cs": { "duration": 661180284, "reportedSentences": 174, "clips": 146424, "users": 274, "size": 3820769735, "checksum": "68e53a82eb28d99cc10a90d7ec32a7a98930d7a7f764d2e321ae413eb0924558", "avgDurationSecs": 4.462, "validDurationSecs": 45944.28, "totalHrs": 183.66, "validHrs": 12.76 }, "pl": { "duration": 8153568, "reportedSentences": 5, 
"clips": 1406, "users": 34, "size": 46451279, "checksum": "017a81af2eb81a533410e03f7d0b67cce4e5bcfa40683852d261a3421ec35264", "avgDurationSecs": 4.503, "validDurationSecs": 9239.36, "totalHrs": 2.26, "validHrs": 2.57 }, "rm-sursilv": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": -134488, "checksum": "adef75ee647c88a0ec20a1b8977b4f89ae3bafee1d491862e92e77f869ae3140", "avgDurationSecs": 5.481, "validDurationSecs": 10.96, "totalHrs": 0, "validHrs": 0 }, "rm-vallader": { "duration": 0, "reportedSentences": 2, "clips": 0, "users": 0, "size": -53560, "checksum": "f303f525cf8ee884a1538c895b433f62b99fcfdfaeeaea5e81312f06ddc0c969", "avgDurationSecs": 5.811, "validDurationSecs": 5.81, "totalHrs": 0, "validHrs": 0 }, "mn": { "duration": 629424, "reportedSentences": 7, "clips": 101, "users": 4, "size": 3421396, "checksum": "919edfde38ca1b875cfda0da52157b3074f323e010acf6c8b0ae28ba534f3a87", "avgDurationSecs": 5.483, "validDurationSecs": 214.79, "totalHrs": 0.17, "validHrs": 0.06 }, "zh-HK": { "duration": 1722132, "reportedSentences": 9, "clips": 421, "users": 12, "size": 9412377, "checksum": "d1e4e28bc38aa02d32919a4bd4bee55dc1a195963a7e2487da1057095a3b94d0", "avgDurationSecs": 4.185, "validDurationSecs": 1098.96, "totalHrs": 0.48, "validHrs": 0.3 }, "ab": { "duration": 3960, "reportedSentences": 0, "clips": 1, "users": 1, "size": -1502854, "checksum": "f50ed6c61818d641ecc81d9fb6eeb2b713adf2ac0ad5eca60c7e9869b83247fd", "avgDurationSecs": 5.133, "validDurationSecs": 29.97, "totalHrs": 0.01, "validHrs": 0.01 }, "cv": { "duration": 292140, "reportedSentences": 0, "clips": 65, "users": 3, "size": 1539283, "checksum": "75c84a1c2b8cef32a2eed9a692c156c55e4cc16f511f482832a1ea53768debb5", "avgDurationSecs": 5.063, "validDurationSecs": 155.24, "totalHrs": 0.08, "validHrs": 0.04 }, "uk": { "duration": 3850524, "reportedSentences": 13, "clips": 878, "users": 38, "size": 21305319, "checksum": "a00db74b6941d8f735ec7be3e84154eac0972707c523063f52126174d23fc341", 
"avgDurationSecs": 4.758, "validDurationSecs": 10874.96, "totalHrs": 1.07, "validHrs": 3.02 }, "mt": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": -174009, "checksum": "7f8c9084a18a7789b9d1b227df6cbf27bcbd8576f999dfbf238c9be9b18165ef", "avgDurationSecs": 4.741, "validDurationSecs": 33.19, "totalHrs": 0, "validHrs": 0.01 }, "as": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": -61583, "checksum": "a536305b88375d09560d5a30b2d99c6ac90dcc38f480e3da7c4b499d7419225e", "avgDurationSecs": 5.69, "validDurationSecs": 330.02, "totalHrs": 0, "validHrs": 0.09 }, "ka": { "duration": 53510544, "reportedSentences": 90, "clips": 9992, "users": 116, "size": 311731775, "checksum": "bf45a832f2765d39f0d83c1d9c09649723f82d00def1cab320cccc719bd89a64", "avgDurationSecs": 5.325, "validDurationSecs": 32070.24, "totalHrs": 14.87, "validHrs": 8.91 }, "fy-NL": { "duration": 213377328, "reportedSentences": 66, "clips": 46719, "users": 547, "size": 1211370001, "checksum": "9324a2ee92e9651ddb0630d0ea534960288f2fa4534903b8644cff661ce2baac", "avgDurationSecs": 4.853, "validDurationSecs": 43211.15, "totalHrs": 59.27, "validHrs": 12 }, "dv": { "duration": 488412, "reportedSentences": 0, "clips": 101, "users": 4, "size": 2605113, "checksum": "942e0269c339009b7d7d6ac17337ae5f593249926926fee6c37d64aa5c25a2b5", "avgDurationSecs": 5.06, "validDurationSecs": 335.57, "totalHrs": 0.14, "validHrs": 0.09 }, "pa-IN": { "duration": 265248, "reportedSentences": 0, "clips": 60, "users": 2, "size": 1457822, "checksum": "b6f1ba247777341e72d91a83f34e9114b739110b3ad4f60fb05e63eec8b1b825", "avgDurationSecs": 4.824, "validDurationSecs": 301.9, "totalHrs": 0.07, "validHrs": 0.09 }, "vi": { "duration": 475164, "reportedSentences": 0, "clips": 121, "users": 8, "size": 2579494, "checksum": "0c012f9c8aa77bf5de9c5263fe30a64218afb2c48534b868d55f100583239746", "avgDurationSecs": 3.967, "validDurationSecs": 200.94, "totalHrs": 0.13, "validHrs": 0.06 }, "or": { "duration": 
809748, "reportedSentences": 2, "clips": 173, "users": 2, "size": 4604580, "checksum": "0df8ef262d2b0e69b4ed0601620e6c581fa283632fb3f39efbd8fb8902217baa", "avgDurationSecs": 5.029, "validDurationSecs": 40.53, "totalHrs": 0.22, "validHrs": 0.01 }, "ga-IE": { "duration": 2109960, "reportedSentences": 0, "clips": 535, "users": 7, "size": 11973045, "checksum": "3cf9de0620073155dab06d73c89b6200f33b08f9bf410089ec4abe0fdb10d5e6", "avgDurationSecs": 3.57, "validDurationSecs": 739.94, "totalHrs": 0.58, "validHrs": 0.21 }, "fi": { "duration": 2457792, "reportedSentences": 4, "clips": 521, "users": 7, "size": 14236503, "checksum": "b78d40a194095854d5682d6714ea25a3f311bb824f4e351cef9a1f32562e6ec2", "avgDurationSecs": 4.6, "validDurationSecs": 3786.83, "totalHrs": 0.68, "validHrs": 1.05 }, "hu": { "duration": 196655112, "reportedSentences": 233, "clips": 35852, "users": 669, "size": 1120476308, "checksum": "01ab3d2fa87729761f250cbd4c1304f1e3e2c38a9fc050a88a584114ad126d6a", "avgDurationSecs": 5.287, "validDurationSecs": 79826.12, "totalHrs": 54.63, "validHrs": 22.17 }, "th": { "duration": 56203992, "reportedSentences": 24, "clips": 12393, "users": 29, "size": 327210516, "checksum": "3b3bc4c1bad6ff2590a7e764941592fa89435a827cadd4e2c5371980adefc31b", "avgDurationSecs": 4.193, "validDurationSecs": 16616.92, "totalHrs": 15.61, "validHrs": 4.62 }, "lt": { "duration": 14359500, "reportedSentences": 38, "clips": 2812, "users": 6, "size": 83944321, "checksum": "5b9308a384488b8964896bfdf654400825a1ec4bd973c7357269f4a11d16bab5", "avgDurationSecs": 5.143, "validDurationSecs": 11322.28, "totalHrs": 3.99, "validHrs": 3.14 }, "lg": { "duration": 1605312, "reportedSentences": 2, "clips": 250, "users": 3, "size": 6333743, "checksum": "ce391d09f447bdd33da0b3249362580179ddf3d7cc522f3902a706ce7b35669d", "avgDurationSecs": 5.785, "validDurationSecs": 32297.34, "totalHrs": 0.44, "validHrs": 8.97 }, "hi": { "duration": 834480, "reportedSentences": 10, "clips": 137, "users": 17, "size": 4379018, 
"checksum": "d8df02f35f6fb84ec14bfa3ca491e8f8090cfc52475b84f4e8b79ce0d66d3765", "avgDurationSecs": 4.85, "validDurationSecs": 744.16, "totalHrs": 0.23, "validHrs": 0.2 }, "bas": { "duration": 48600, "reportedSentences": 0, "clips": 10, "users": 1, "size": 248530, "checksum": "83f512a6347ebac5df13162cee85ad2986fb99d3fecc6488c1d734051d505002", "avgDurationSecs": 4.431, "validDurationSecs": 313.36, "totalHrs": 0.01, "validHrs": 0.09 }, "sk": { "duration": 14379912, "reportedSentences": 17, "clips": 3107, "users": 35, "size": 84131693, "checksum": "5d4904324f11e6a17b4d6fe1aec4278ac58ed71ae84fcb86b67dd7339b583bb6", "avgDurationSecs": 4.097, "validDurationSecs": 5660.25, "totalHrs": 3.99, "validHrs": 1.57 }, "kmr": { "duration": 9607752, "reportedSentences": 91, "clips": 1989, "users": 12, "size": 53314848, "checksum": "af6a3a2865cca9d90c2e377c6bd1fe2d80f3865845c3e5a780d588b1c242571d", "avgDurationSecs": 4.048, "validDurationSecs": 2156.72, "totalHrs": 2.67, "validHrs": 0.6 }, "bg": { "duration": 15846516, "reportedSentences": 25, "clips": 2972, "users": 22, "size": 92307281, "checksum": "69f56f3125411a9d9c1c7d907d0cf7b0116316113b3d273805415fc9185ba823", "avgDurationSecs": 5.476, "validDurationSecs": 9411.24, "totalHrs": 4.4, "validHrs": 2.61 }, "kk": { "duration": 107820, "reportedSentences": 3, "clips": 21, "users": 5, "size": 595793, "checksum": "088026aa813183a4d5a61ba8788c079b2f9253df77cef68546d426ee5868e777", "avgDurationSecs": 4.928, "validDurationSecs": 146.6, "totalHrs": 0.03, "validHrs": 0.04 }, "ba": { "duration": 1220256, "reportedSentences": 0, "clips": 283, "users": 13, "size": 2195517, "checksum": "95a8b68a7b66484e7726fea9abbb0b5fc7773a01da36fcc58a75242b197d9cd8", "avgDurationSecs": 4.426, "validDurationSecs": 681.41, "totalHrs": 0.34, "validHrs": 0.19 }, "gl": { "duration": 110461068, "reportedSentences": 119, "clips": 22214, "users": 631, "size": 629443437, "checksum": "433a9dd928cb2f0d9f92c60e0d10e8a7d1aca1005e9cdd092f6a5bdfe9df7e37", "avgDurationSecs": 
4.937, "validDurationSecs": 60306.1, "totalHrs": 30.69, "validHrs": 16.75 }, "ug": { "duration": 10904868, "reportedSentences": 3, "clips": 1761, "users": 47, "size": 62162527, "checksum": "b09aff4bec1cda909cfdf10f021bba6421206d13da6d9037d281b657960b2bdf", "avgDurationSecs": 6.022, "validDurationSecs": 9750.76, "totalHrs": 3.02, "validHrs": 2.71 }, "hy-AM": { "duration": 63684, "reportedSentences": 2, "clips": 11, "users": 1, "size": 319743, "checksum": "109b4522d04e14184142ac022527346abe18c7bb6200d5ea98aedaea240e7f08", "avgDurationSecs": 6.109, "validDurationSecs": 297.58, "totalHrs": 0.02, "validHrs": 0.09 }, "be": { "duration": 229163292, "reportedSentences": 2, "clips": 49177, "users": 149, "size": 1299014459, "checksum": "00a53691d285774e98f5d62a7f9f513b3e237ee580f5665083e009b9430f208f", "avgDurationSecs": 4.762, "validDurationSecs": 358359.23, "totalHrs": 63.65, "validHrs": 99.54 }, "ur": { "duration": 342684, "reportedSentences": 0, "clips": 64, "users": 3, "size": 1811420, "checksum": "22e848a2ad547205dbb78c929ef0e21b02ce82e5c926ddfe805b7e03fcc0cee9", "avgDurationSecs": 3.885, "validDurationSecs": 103.99, "totalHrs": 0.1, "validHrs": 0.03 }, "gn": { "duration": 1244772, "reportedSentences": 2, "clips": 292, "users": 0, "size": 7235899, "checksum": "4a2932b1ea10d2b47fbf26a46d3ddc77e7b1a68cd4fdbbb24319218779fa5fa6", "avgDurationSecs": 4.23, "validDurationSecs": 113.15, "totalHrs": 0.35, "validHrs": 0.03 }, "sr": { "duration": 6256512, "reportedSentences": 1, "clips": 2220, "users": 26, "size": 34009071, "checksum": "768c88391cc827b95c05bba2377ec5564cf19935e9b56e5016d765ab56fab0c3", "avgDurationSecs": 2.818, "validDurationSecs": 2193.04, "totalHrs": 1.73, "validHrs": 0.6 }, "uz": { "duration": 2055276, "reportedSentences": 1, "clips": 380, "users": 17, "size": 9517102, "checksum": "64d120e4adf234f10b244d6f3dcf7a0ac889a6e669458fd16c7b57f4c049e93b", "avgDurationSecs": 4.146, "validDurationSecs": 1114.02, "totalHrs": 0.57, "validHrs": 0.31 }, "mr": { "duration": 
226512, "reportedSentences": 1, "clips": 34, "users": 4, "size": 1070476, "checksum": "a71e722038eb3690641f30d8a8046b65c4c8d33210ad716e0286d2751d1e8498", "avgDurationSecs": 6.193, "validDurationSecs": 184.28, "totalHrs": 0.06, "validHrs": 0.05 }, "da": { "duration": 3875508, "reportedSentences": 53, "clips": 1045, "users": 11, "size": 22087791, "checksum": "6c85261bcf8dffe5c06ad29c82760cda5cd1fdc7d9c1c99b6285a425f11d105e", "avgDurationSecs": 4.164, "validDurationSecs": 623.69, "totalHrs": 1.08, "validHrs": 0.17 }, "myv": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": -75987, "checksum": "741e516b505e2552aa54adf74c10811a3230d2084edb02fd4d6783b2b20f4e72", "avgDurationSecs": 5.752, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "nn-NO": { "duration": 365220, "reportedSentences": 8, "clips": 69, "users": 3, "size": 2130021, "checksum": "f39f03129e67eb69534db439c127e62bdc2d39a02ef92e5e2daddc3358208512", "avgDurationSecs": 4.574, "validDurationSecs": 212.58, "totalHrs": 0.1, "validHrs": 0.06 }, "ha": { "duration": 43056, "reportedSentences": 0, "clips": 11, "users": 3, "size": 186635, "checksum": "dde8d350a4f0a109108adfc8e55a53b2d7d26a08ec537ecc4060836213ded990", "avgDurationSecs": 4.333, "validDurationSecs": 46.18, "totalHrs": 0.01, "validHrs": 0.01 }, "ckb": { "duration": 4930236, "reportedSentences": 22, "clips": 1177, "users": 42, "size": 27501183, "checksum": "6592784e044be63d085d9b288db2dfa61d82a227e70803022c48b9758d68c840", "avgDurationSecs": 3.906, "validDurationSecs": 6488.33, "totalHrs": 1.37, "validHrs": 1.8 }, "ml": { "duration": 4067892, "reportedSentences": 0, "clips": 1035, "users": 2, "size": 23734922, "checksum": "0c53537225d238bcbf6ae11d8e3ecbffbaf8f74922bbd2a4989ea3f5931e3321", "avgDurationSecs": 4.071, "validDurationSecs": 449.57, "totalHrs": 1.13, "validHrs": 0.12 }, "mdf": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": -10764, "checksum": 
"33f38dce47f710bef75e57e5b40e98e61f1877cd136775e6e3e7f1dc45895413", "avgDurationSecs": 5.25, "validDurationSecs": 42, "totalHrs": 0, "validHrs": 0.02 }, "sw": { "duration": 175636836, "reportedSentences": 1, "clips": 39312, "users": 125, "size": 1016751906, "checksum": "a04acaa50b6fe75d5f8ece78660b2827716a91d99e93600d78c09ebecce42e72", "avgDurationSecs": 5.262, "validDurationSecs": 11092.36, "totalHrs": 48.79, "validHrs": 3.08 }, "sat": { "duration": 92520, "reportedSentences": 0, "clips": 21, "users": 2, "size": 523554, "checksum": "c58f2434a2974518e727897fceae8030be7ea5403b2dc18652e4921c046ba2cc", "avgDurationSecs": 4.48, "validDurationSecs": 138.08, "totalHrs": 0.02, "validHrs": 0.04 }, "tig": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": -434, "checksum": "b0a1697ef65f1f5f30476c789aac7e6f54acc7dc99c82112642e2c87cd5665f6", "avgDurationSecs": 4.491, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ig": { "duration": 223524, "reportedSentences": 1, "clips": 47, "users": 4, "size": 1321214, "checksum": "7a580953242f8f1072d1b816859e9b9a00fa2f34311a9af0ebfda6120ae0806a", "avgDurationSecs": 5.464, "validDurationSecs": -0.08, "totalHrs": 0.06, "validHrs": 0 }, "nan-tw": { "duration": 2090628, "reportedSentences": 0, "clips": 792, "users": 3, "size": 9319870, "checksum": "c253dbed21c6e4100dc9e3f3ebbde78e32ca38b7d47128d596732f60fdb5ab37", "avgDurationSecs": 2.668, "validDurationSecs": 1053.85, "totalHrs": 0.58, "validHrs": 0.29 }, "mhr": { "duration": 94244508, "reportedSentences": 12, "clips": 20971, "users": 70, "size": 504226380, "checksum": "fdc30f70750bb13213f0d5c699f31be429445533a4acc29e7aacbd4ba438f332", "avgDurationSecs": 4.642, "validDurationSecs": 14928.37, "totalHrs": 26.18, "validHrs": 4.15 }, "bn": { "duration": 55885392, "reportedSentences": 51, "clips": 9757, "users": 170, "size": 314334236, "checksum": "fa3f3cfc75b68741dac50f0bb271a7e83ca28a3191e920042d881a6f8badf112", "avgDurationSecs": 4.362, "validDurationSecs": 
-63993.86, "totalHrs": 15.52, "validHrs": -17.77 }, "tok": { "duration": 6476796, "reportedSentences": 39, "clips": 1527, "users": 25, "size": 37457103, "checksum": "1de25832d6417466aa16561594ced6d4a2b72da694910e1198f69f1c22e3ff5e", "avgDurationSecs": 3.779, "validDurationSecs": 3801.27, "totalHrs": 1.8, "validHrs": 1.06 }, "yue": { "duration": 42564096, "reportedSentences": 78, "clips": 10681, "users": 39, "size": 245246777, "checksum": "1d51745b0e7b4dd79f9eb25f8129d9f90dcc7346ec2ff3aaa4ab9a69f5aaff66", "avgDurationSecs": 4.081, "validDurationSecs": 1654.92, "totalHrs": 11.82, "validHrs": 0.46 }, "sah": { "duration": 164592, "reportedSentences": 0, "clips": 20, "users": 1, "size": 804488, "checksum": "e6970ac845dd0f3377404eaba252c0407f3c64b1d931aabd75ec7c6f7aed132d", "avgDurationSecs": 6.008, "validDurationSecs": 163.48, "totalHrs": 0.05, "validHrs": 0.05 }, "mk": { "duration": 130896, "reportedSentences": 0, "clips": 25, "users": 1, "size": 762962, "checksum": "c2275e2901850f3a54cef485a8099714fc38c48fab148838f7d89c37296a4c8f", "avgDurationSecs": 5.313, "validDurationSecs": 46.44, "totalHrs": 0.03, "validHrs": 0.01 }, "sc": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": -22669, "checksum": "5848ec3b17ae8c47e3e15ee762d930e7757701a1f91c8f195b0384c5a5db2193", "avgDurationSecs": 4.414, "validDurationSecs": 158.89, "totalHrs": 0, "validHrs": 0.05 }, "skr": { "duration": 1683504, "reportedSentences": 0, "clips": 364, "users": 1, "size": 9137854, "checksum": "b6c3c9ee1e9661de973298e331efe26f523589e300eddaa1e23b74969f2e4f42", "avgDurationSecs": 4.126, "validDurationSecs": 618.33, "totalHrs": 0.47, "validHrs": 0.17 }, "ti": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": -463, "checksum": "317936e0faac748f4a73ab5f3b17d4fb6909d553e6bc67c456666b3d11d415ab", "avgDurationSecs": 5.779, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "mrj": { "duration": 18508320, "reportedSentences": 0, "clips": 4510, "users": 13, 
"size": 91873991, "checksum": "42877608f77aa7a1a548bce9fea0d5aee628a75d2688340923b008648a976273", "avgDurationSecs": 4.069, "validDurationSecs": 1749.22, "totalHrs": 5.15, "validHrs": 0.48 }, "tw": { "duration": 28512, "reportedSentences": 0, "clips": 5, "users": 1, "size": 168842, "checksum": "2c8e68f83c5ef0926855a6ede4a24bcca6f806293fffd952c6adf9a00782ffdc", "avgDurationSecs": 3.983, "validDurationSecs": 80.21, "totalHrs": 0.01, "validHrs": 0.02 }, "ko": { "duration": 6392304, "reportedSentences": 12, "clips": 1071, "users": 22, "size": 37284525, "checksum": "a308f2a8b7991716417c12aedba39dd4a93d2410a5a148531ddd5ecc89434af4", "avgDurationSecs": 6.186, "validDurationSecs": 1671.35, "totalHrs": 1.78, "validHrs": 0.46 }, "yo": { "duration": 24243084, "reportedSentences": 17, "clips": 4104, "users": 77, "size": 141669239, "checksum": "9d10c538a7d0237e5f21d2c516e1c07957e0ae318c25203d33b5d5fedb2ac2c6", "avgDurationSecs": 5.916, "validDurationSecs": 11200.23, "totalHrs": 6.74, "validHrs": 3.12 }, "oc": { "duration": 36495468, "clips": 7496, "users": 128, "size": 204996239, "checksum": "e241c12159ac7b3d880f41d5e91d804775da188a3ac413c775341eef3406001b", "avgDurationSecs": 4.878, "validDurationSecs": 3669.64, "totalHrs": 10.14, "validHrs": 1.02 }, "vot": { "duration": 0, "clips": 0, "users": 0, "size": -1194, "checksum": "5ec41e90ea98ece390981fd44ca5b51728791988a2c116b52b2b3ad3b216c35f", "avgDurationSecs": 2.408, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "az": { "duration": 422316, "clips": 82, "users": 3, "size": 2473867, "checksum": "dab12e4b9250a93cfdb89b97a038a616f5fc95c28b91bc0cfa966e313503bd9a", "avgDurationSecs": 5.339, "validDurationSecs": 25.35, "totalHrs": 0.12, "validHrs": 0 }, "ast": { "duration": 3891996, "clips": 847, "users": 10, "size": 21602352, "checksum": "c40efb1033e7ced0168637485bda22b2e60c53cab4cfe880b09b1ec0d80634bc", "avgDurationSecs": 4.536, "validDurationSecs": 1147.87, "totalHrs": 1.08, "validHrs": 0.31 }, "ne-NP": { "duration": 
146952, "clips": 35, "users": 2, "size": 854814, "checksum": "637db7008c47c43e3d0d10c097d7dd25a3e3a00a1849c9b8f1f06a56e06cde8b", "avgDurationSecs": 3.952, "validDurationSecs": 67.7, "totalHrs": 0.04, "validHrs": 0.02 }, "quy": { "duration": 0, "clips": 0, "users": 0, "size": -124, "checksum": "f7b24f25235e87f58a0b89362e541255dc3bfb55f19f94d6a0caa352ef86b6eb", "avgDurationSecs": 4.32, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 } }, "totalDuration": 3681211853, "totalValidDurationSecs": 2024597, "totalHrs": 1022, "totalValidHrs": 562 } ================================================ FILE: datasets/scripted-speech/cv-corpus-14.0-2023-06-23.json ================================================ { "locales": { "en": { "buckets": { "dev": 16380, "invalidated": 272017, "other": 279585, "reported": 6445, "test": 16380, "train": 1046685, "validated": 1724421 }, "duration": 11802687079, "reportedSentences": 6368, "clips": 2276023, "splits": { "accent": { "": 1 }, "age": { "": 0.37, "twenties": 0.24, "sixties": 0.04, "thirties": 0.14, "teens": 0.06, "seventies": 0.01, "fourties": 0.09, "fifties": 0.05, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male": 0.45, "female": 0.16, "other": 0.02 } }, "users": 88154, "size": 83555475656, "checksum": "6e88c7460090c5a6ca7f02a8525bd669e7bc509a47b2f1974977d5065a054507", "avgDurationSecs": 5.186, "validDurationSecs": 8942265.283, "totalHrs": 3278.52, "validHrs": 2483.96 }, "fa": { "buckets": { "dev": 10484, "invalidated": 14192, "other": 21094, "reported": 2299, "test": 10484, "train": 28340, "validated": 322606 }, "duration": 1427612444, "reportedSentences": 2290, "clips": 357892, "splits": { "accent": { "": 1 }, "age": { "": 0.25, "twenties": 0.31, "thirties": 0.37, "fifties": 0.02, "fourties": 0.02, "teens": 0.03, "sixties": 0 }, "gender": { "": 0.22, "male": 0.71, "female": 0.07, "other": 0 } }, "users": 4339, "size": 10440862898, "checksum": "f52db225ad13fafdbe6a08ba2d9ec66e4234c381272f541f178c97dc2d72074a", 
"avgDurationSecs": 3.989, "validDurationSecs": 1286858.438, "totalHrs": 396.55, "validHrs": 357.46 }, "fr": { "buckets": { "dev": 16126, "invalidated": 59834, "other": 27244, "reported": 6970, "test": 16126, "train": 521199, "validated": 688599 }, "duration": 3884174866, "reportedSentences": 6894, "clips": 775677, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.17, "thirties": 0.16, "": 0.37, "teens": 0.03, "fourties": 0.14, "fifties": 0.1, "sixties": 0.03, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "male": 0.59, "": 0.3, "female": 0.1, "other": 0.01 } }, "users": 17761, "size": 27483550337, "checksum": "6f706a46135b987ecbd06457ea5986a7020821adc3afd90b982e19ff59b8eed9", "avgDurationSecs": 5.007, "validDurationSecs": 3448134.892, "totalHrs": 1078.93, "validHrs": 957.81 }, "es": { "buckets": { "dev": 15753, "invalidated": 72480, "other": 1163604, "reported": 2198, "test": 15753, "train": 295468, "validated": 372263 }, "duration": 7828158150, "reportedSentences": 2178, "clips": 1608347, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.1, "": 0.13, "fifties": 0.04, "twenties": 0.54, "teens": 0.08, "fourties": 0.03, "sixties": 0.09, "eighties": 0, "seventies": 0, "nineties": 0 }, "gender": { "male": 0.55, "": 0.13, "other": 0, "female": 0.32 } }, "users": 25261, "size": 49365724567, "checksum": "c3861608ed5a177118cd4d7c6c9ea06fa96824bb3dc03a673b1275d67a7b7b19", "avgDurationSecs": 4.867, "validDurationSecs": 1811881.166, "totalHrs": 2174.48, "validHrs": 503.3 }, "sl": { "buckets": { "dev": 1221, "invalidated": 263, "other": 2252, "reported": 37, "test": 1213, "train": 1405, "validated": 9973 }, "duration": 47194678, "reportedSentences": 38, "clips": 12488, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.47, "teens": 0.07, "": 0.18, "sixties": 0.07, "fifties": 0.06, "fourties": 0.09, "thirties": 0.05 }, "gender": { "female": 0.15, "male": 0.67, "": 0.18, "other": 0 } }, "users": 144, "size": 330683954, "checksum": 
"6c2430cd32e00992fb10191165eb0e74dfc11add3f783e4285885d88d083c7f1", "avgDurationSecs": 3.779, "validDurationSecs": 37689.984, "totalHrs": 13.1, "validHrs": 10.46 }, "kab": { "buckets": { "dev": 14997, "invalidated": 19541, "other": 113194, "reported": 9244, "test": 14997, "train": 151862, "validated": 609344 }, "duration": 2476327588, "reportedSentences": 9237, "clips": 742079, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.09, "thirties": 0.29, "": 0.28, "fifties": 0.19, "twenties": 0.12, "eighties": 0, "teens": 0, "sixties": 0.03, "seventies": 0 }, "gender": { "male": 0.53, "": 0.26, "female": 0.2, "other": 0 } }, "users": 1529, "size": 18479045880, "checksum": "2049a98155ec7cdf55f1f6f13083c87c98df2ccefe5f90f3081fa5fb1d885389", "avgDurationSecs": 3.337, "validDurationSecs": 2033389.111, "totalHrs": 687.86, "validHrs": 564.83 }, "cy": { "buckets": { "dev": 5324, "invalidated": 4414, "other": 19110, "reported": 169, "test": 5333, "train": 7861, "validated": 89546 }, "duration": 551701125, "reportedSentences": 170, "clips": 113070, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.16, "twenties": 0.13, "sixties": 0.07, "fifties": 0.09, "": 0.43, "thirties": 0.09, "seventies": 0.01, "eighties": 0, "teens": 0.02 }, "gender": { "male": 0.33, "female": 0.25, "": 0.41, "other": 0.01 } }, "users": 1777, "size": 4040351921, "checksum": "26ef48296570550e5c493d0e34fa4199307642c3283d3cf61e412d479ebb4495", "avgDurationSecs": 4.879, "validDurationSecs": 436920.748, "totalHrs": 153.25, "validHrs": 121.36 }, "ca": { "buckets": { "dev": 16401, "invalidated": 99317, "other": 405294, "reported": 6004, "test": 16401, "train": 1141012, "validated": 1598438 }, "duration": 11363595030, "reportedSentences": 5959, "clips": 2103049, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.06, "fifties": 0.17, "fourties": 0.11, "twenties": 0.05, "": 0.32, "sixties": 0.25, "teens": 0.01, "seventies": 0.03, "eighties": 0, "nineties": 0 }, "gender": { "male": 0.46, "": 0.33, 
"female": 0.21, "other": 0 } }, "users": 34628, "size": 69011912090, "checksum": "b6f054ad545b2d42a0a6ef89426ccaffaf29de75d66d62118fa5a5dcb251da4a", "avgDurationSecs": 5.403, "validDurationSecs": 8636984.736, "totalHrs": 3156.55, "validHrs": 2399.16 }, "de": { "buckets": { "dev": 16161, "invalidated": 52668, "other": 6527, "reported": 9608, "test": 16161, "train": 561238, "validated": 889183 }, "duration": 4952370233, "reportedSentences": 9577, "clips": 948378, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.18, "fourties": 0.17, "": 0.32, "thirties": 0.16, "teens": 0.03, "sixties": 0.02, "fifties": 0.11, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "male": 0.59, "": 0.32, "female": 0.08, "other": 0.01 } }, "users": 18187, "size": 34628820478, "checksum": "36947cfb42f3ce87d8badd0d0c1b8eafb164ffd0f3a4d9b55477d6db9e229cea", "avgDurationSecs": 5.222, "validDurationSecs": 4643257.668, "totalHrs": 1375.65, "validHrs": 1289.79 }, "tt": { "buckets": { "dev": 3323, "invalidated": 414, "other": 8, "reported": 5, "test": 5198, "train": 9446, "validated": 28995 }, "duration": 110614454, "reportedSentences": 6, "clips": 29417, "splits": { "accent": { "": 1 }, "age": { "": 0.2, "thirties": 0.72, "twenties": 0.05, "sixties": 0, "fifties": 0.01, "teens": 0, "fourties": 0, "seventies": 0.01 }, "gender": { "": 0.2, "male": 0.78, "female": 0.02 } }, "users": 241, "size": 815818603, "checksum": "cd90cd5db0ef158bc1b0a3f7604bbae34bef2f08537239fd8454248fdf43658f", "avgDurationSecs": 3.76, "validDurationSecs": 109027.64, "totalHrs": 30.72, "validHrs": 30.28 }, "ta": { "buckets": { "dev": 11957, "invalidated": 5631, "other": 89062, "reported": 3353, "test": 12012, "train": 43538, "validated": 132908 }, "duration": 1414617648, "reportedSentences": 3353, "clips": 227601, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.09, "thirties": 0.09, "": 0.71, "fourties": 0.03, "seventies": 0.02, "fifties": 0.03, "teens": 0.03, "sixties": 0, "eighties": 0 }, "gender": { 
"male": 0.16, "": 0.7, "other": 0, "female": 0.13 } }, "users": 858, "size": 8474352380, "checksum": "9d31279a433de353e535c7b4f324e7bfa83faca89797c94a707e403984e857f1", "avgDurationSecs": 6.215, "validDurationSecs": 826068.437, "totalHrs": 392.94, "validHrs": 229.46 }, "ru": { "buckets": { "dev": 10194, "invalidated": 9666, "other": 11569, "reported": 398, "test": 10194, "train": 26316, "validated": 156017 }, "duration": 914120532, "reportedSentences": 392, "clips": 177252, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.34, "teens": 0.09, "": 0.24, "fourties": 0.14, "thirties": 0.16, "fifties": 0.03, "sixties": 0, "seventies": 0 }, "gender": { "male": 0.6, "": 0.25, "other": 0, "female": 0.15 } }, "users": 3001, "size": 6238778000, "checksum": "8cdacdf2f67ae79ffc6650524484ff9741776ec52de35fa74b6ee70d772feadf", "avgDurationSecs": 5.157, "validDurationSecs": 804607.807, "totalHrs": 253.92, "validHrs": 223.5 }, "nl": { "buckets": { "dev": 11068, "invalidated": 5426, "other": 3012, "reported": 336, "test": 11076, "train": 33132, "validated": 88302 }, "duration": 420517297, "reportedSentences": 336, "clips": 96740, "splits": { "accent": { "": 1 }, "age": { "": 0.41, "twenties": 0.21, "fourties": 0.15, "thirties": 0.12, "teens": 0.02, "fifties": 0.08, "sixties": 0.02, "nineties": 0, "eighties": 0, "seventies": 0 }, "gender": { "": 0.41, "male": 0.48, "female": 0.11, "other": 0 } }, "users": 1635, "size": 2862847627, "checksum": "9f465fd2e9a3c10815e04520230fc805b8a833246842386cc7750f2d6f359072", "avgDurationSecs": 4.347, "validDurationSecs": 383838.313, "totalHrs": 116.81, "validHrs": 106.62 }, "it": { "buckets": { "dev": 15115, "invalidated": 18505, "other": 1872, "reported": 5900, "test": 15113, "train": 164842, "validated": 231768 }, "duration": 1355373804, "reportedSentences": 5896, "clips": 252145, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.17, "twenties": 0.21, "": 0.28, "fifties": 0.15, "fourties": 0.13, "seventies": 0, "sixties": 0.04, 
"teens": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "female": 0.11, "male": 0.6, "": 0.28, "other": 0 } }, "users": 6930, "size": 9288307231, "checksum": "bb1ea6b826ec4f133b7b3c1437b0de91070a61069dad6cb6a8f4330fe67d0a8f", "avgDurationSecs": 5.375, "validDurationSecs": 1245839.798, "totalHrs": 376.49, "validHrs": 346.06 }, "eu": { "buckets": { "dev": 6640, "invalidated": 5859, "other": 29398, "reported": 98, "test": 6640, "train": 11050, "validated": 70118 }, "duration": 548048067, "reportedSentences": 98, "clips": 105375, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.13, "thirties": 0.07, "fifties": 0.13, "twenties": 0.34, "": 0.27, "teens": 0.03, "sixties": 0.02, "seventies": 0 }, "gender": { "male": 0.46, "female": 0.25, "": 0.27, "other": 0.02 } }, "users": 1323, "size": 4097264550, "checksum": "be4468215bbc50608429011efd71108709ee6ad454bda5d923e35e09fd0111c6", "avgDurationSecs": 5.201, "validDurationSecs": 364678.855, "totalHrs": 152.23, "validHrs": 101.29 }, "tr": { "buckets": { "dev": 10832, "invalidated": 4211, "other": 156, "reported": 374, "test": 10845, "train": 31155, "validated": 99987 }, "duration": 377906463, "reportedSentences": 375, "clips": 104354, "splits": { "accent": { "": 1 }, "age": { "": 0.28, "thirties": 0.08, "twenties": 0.26, "teens": 0.02, "fourties": 0.03, "fifties": 0.08, "sixties": 0.18, "eighties": 0.04, "seventies": 0.03 }, "gender": { "": 0.28, "male": 0.47, "female": 0.25, "other": 0 } }, "users": 1485, "size": 2295049634, "checksum": "dc65972975dfa8ab5c768884b3b5628d480b7f276e90f5c28e72556141fdfd9b", "avgDurationSecs": 3.621, "validDurationSecs": 362091.856, "totalHrs": 104.97, "validHrs": 100.58 }, "ar": { "buckets": { "dev": 10242, "invalidated": 15044, "other": 40146, "reported": 2117, "test": 10463, "train": 28445, "validated": 77419 }, "duration": 552531078, "reportedSentences": 2109, "clips": 132609, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.11, "": 0.54, "twenties": 0.3, "fourties": 0.01, 
"teens": 0.03, "fifties": 0, "sixties": 0, "nineties": 0 }, "gender": { "female": 0.17, "": 0.54, "male": 0.29, "other": 0 } }, "users": 1466, "size": 3276239205, "checksum": "367f81d44b086e782f3e4f74a442e345be143c3a310bc9e97ae0b09a42f5f94a", "avgDurationSecs": 4.167, "validDurationSecs": 322575.417, "totalHrs": 153.48, "validHrs": 89.6 }, "zh-TW": { "buckets": { "dev": 4899, "invalidated": 4721, "other": 46086, "reported": 142, "test": 4899, "train": 6945, "validated": 80852 }, "duration": 440710834, "reportedSentences": 143, "clips": 131659, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.2, "twenties": 0.3, "teens": 0.06, "": 0.26, "fifties": 0.08, "seventies": 0, "fourties": 0.09, "sixties": 0 }, "gender": { "male": 0.5, "": 0.26, "female": 0.23, "other": 0.02 } }, "users": 2151, "size": 2979244635, "checksum": "4104e2afb207d0cf34b6f44221c07056c2b77c8d3dd19647408d09d4e217c6b2", "avgDurationSecs": 3.347, "validDurationSecs": 270641.22, "totalHrs": 122.41, "validHrs": 75.17 }, "br": { "buckets": { "dev": 2173, "invalidated": 898, "other": 14908, "reported": 279, "test": 2179, "train": 2569, "validated": 12153 }, "duration": 87783451, "reportedSentences": 279, "clips": 27959, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.33, "": 0.31, "fifties": 0.05, "fourties": 0.06, "thirties": 0.07, "sixties": 0.15, "seventies": 0.02, "teens": 0.01 }, "gender": { "male": 0.67, "": 0.31, "female": 0.02, "other": 0 } }, "users": 189, "size": 621719192, "checksum": "28083359eb7f9a0de516121b31f98be9402b8846e0d2207d8d681b2ca7e9b8fd", "avgDurationSecs": 3.14, "validDurationSecs": 38157.026, "totalHrs": 24.38, "validHrs": 10.59 }, "pt": { "buckets": { "dev": 9236, "invalidated": 6825, "other": 27459, "reported": 2566, "test": 9243, "train": 20803, "validated": 142984 }, "duration": 723669235, "reportedSentences": 2559, "clips": 177268, "splits": { "accent": { "": 1 }, "age": { "": 0.24, "twenties": 0.37, "teens": 0.02, "thirties": 0.18, "fourties": 0.14, "sixties": 
0.02, "fifties": 0.02, "seventies": 0 }, "gender": { "": 0.24, "male": 0.68, "female": 0.06, "other": 0.02 } }, "users": 3181, "size": 4573826111, "checksum": "195ee8d04eb501074b2a5d89f5708d4f9ecfb8fd40fd608582f85c1ef63b7447", "avgDurationSecs": 4.082, "validDurationSecs": 583710.099, "totalHrs": 201.01, "validHrs": 162.14 }, "eo": { "buckets": { "dev": 14911, "invalidated": 127355, "other": 147059, "reported": 2219, "test": 14914, "train": 144014, "validated": 849149 }, "duration": 6813724111, "reportedSentences": 2218, "clips": 1123563, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.56, "thirties": 0.12, "": 0.2, "fourties": 0.04, "fifties": 0.02, "seventies": 0, "teens": 0.05, "sixties": 0, "eighties": 0.01 }, "gender": { "male": 0.69, "": 0.2, "female": 0.11, "other": 0 } }, "users": 1655, "size": 40701039616, "checksum": "8e88986f6477e345b886e21d2276887db1b64f13dd927a0ceab5a7099d48c42b", "avgDurationSecs": 6.064, "validDurationSecs": 5149570.621, "totalHrs": 1892.7, "validHrs": 1430.43 }, "zh-CN": { "buckets": { "dev": 10625, "invalidated": 57916, "other": 592410, "reported": 715, "test": 10625, "train": 29393, "validated": 182571 }, "duration": 3788954808, "reportedSentences": 712, "clips": 832897, "splits": { "accent": { "": 1 }, "age": { "": 0.93, "teens": 0.01, "twenties": 0.04, "thirties": 0.01, "fourties": 0, "nineties": 0, "fifties": 0, "sixties": 0 }, "gender": { "": 0.93, "male": 0.05, "female": 0.01, "other": 0 } }, "users": 6764, "size": 22410235297, "checksum": "d51e898f102d51d37c81119e6dba5f930c806763e1270b0a410f3e1fbeb4faf5", "avgDurationSecs": 4.549, "validDurationSecs": 830538.792, "totalHrs": 1052.48, "validHrs": 230.7 }, "id": { "buckets": { "dev": 3308, "invalidated": 2560, "other": 29369, "reported": 344, "test": 3630, "train": 4967, "validated": 25211 }, "duration": 228045396, "reportedSentences": 345, "clips": 57140, "splits": { "accent": { "": 1 }, "age": { "": 0.25, "twenties": 0.41, "thirties": 0.1, "teens": 0.22, "fifties": 
0, "fourties": 0.02 }, "gender": { "": 0.25, "male": 0.45, "female": 0.27, "other": 0.04 } }, "users": 507, "size": 1435325454, "checksum": "e944f8f641003f19a2923f721dd1334771dc47a7e963e66b43a448ad549bc5f8", "avgDurationSecs": 3.991, "validDurationSecs": 100616.949, "totalHrs": 63.34, "validHrs": 27.94 }, "ia": { "buckets": { "dev": 1801, "invalidated": 331, "other": 2608, "reported": 275, "test": 1781, "train": 5012, "validated": 11615 }, "duration": 60888432, "reportedSentences": 271, "clips": 14554, "splits": { "accent": { "": 1 }, "age": { "seventies": 0.22, "fourties": 0.3, "": 0.38, "twenties": 0.05, "thirties": 0.02, "teens": 0, "fifties": 0.03, "sixties": 0 }, "gender": { "male": 0.61, "": 0.38, "female": 0.01 } }, "users": 64, "size": 412405056, "checksum": "471323b3630a5b874574acf7cf977eecd3bdb062adc17b2a5e9829b107717204", "avgDurationSecs": 4.184, "validDurationSecs": 48592.767, "totalHrs": 16.91, "validHrs": 13.49 }, "lv": { "buckets": { "dev": 2596, "invalidated": 2161, "other": 59346, "reported": 179, "test": 2596, "train": 3077, "validated": 62703 }, "duration": 487732661, "reportedSentences": 180, "clips": 124210, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.34, "fourties": 0.1, "": 0.38, "twenties": 0.07, "teens": 0.01, "fifties": 0.09, "sixties": 0.01 }, "gender": { "male": 0.3, "female": 0.32, "": 0.38, "other": 0 } }, "users": 2396, "size": 2678868112, "checksum": "ed3a91981b2c03a6889046f9ddb84c732fef43c7d8353e81f284f21967e40732", "avgDurationSecs": 3.927, "validDurationSecs": 246214.484, "totalHrs": 135.48, "validHrs": 68.39 }, "ja": { "buckets": { "dev": 4971, "invalidated": 6163, "other": 105092, "reported": 200, "test": 4971, "train": 7089, "validated": 58072 }, "duration": 812798601, "reportedSentences": 200, "clips": 169327, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.44, "": 0.17, "teens": 0.04, "fifties": 0.05, "thirties": 0.13, "fourties": 0.16, "sixties": 0, "seventies": 0, "nineties": 0 }, "gender": { "male": 
0.41, "": 0.17, "female": 0.42, "other": 0 } }, "users": 1707, "size": 4804178878, "checksum": "09d32dfcf4586de728a1c41813f5a33cc0f290bdcf1a7367ee36330e014585cc", "avgDurationSecs": 4.8, "validDurationSecs": 278755.546, "totalHrs": 225.77, "validHrs": 77.43 }, "rw": { "buckets": { "dev": 15987, "invalidated": 227798, "other": 47289, "reported": 629, "test": 16213, "train": 1003023, "validated": 1438668 }, "duration": 8581808597, "reportedSentences": 630, "clips": 1713755, "splits": { "accent": { "": 1 }, "age": { "": 0.05, "twenties": 0.61, "thirties": 0.12, "teens": 0.2, "fourties": 0.02, "fifties": 0 }, "gender": { "": 0.1, "male": 0.57, "female": 0.33, "other": 0 } }, "users": 1114, "size": 61017046680, "checksum": "a0e24547c06863d214fdad1c171ef52deebd8daf641e3493a7c281b89441fd66", "avgDurationSecs": 5.008, "validDurationSecs": 7204281.482, "totalHrs": 2383.83, "validHrs": 2001.18 }, "sv-SE": { "buckets": { "dev": 5085, "invalidated": 1388, "other": 6226, "reported": 581, "test": 5141, "train": 7503, "validated": 39774 }, "duration": 188619027, "reportedSentences": 582, "clips": 47388, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.25, "": 0.19, "teens": 0.03, "fifties": 0.03, "twenties": 0.12, "fourties": 0.38, "sixties": 0, "seventies": 0 }, "gender": { "male": 0.48, "": 0.19, "female": 0.32, "other": 0.01 } }, "users": 801, "size": 1193891712, "checksum": "4aada51de5ea008b5ff87f79c6109cea2acd778e8760ab28c4ef286b1412121d", "avgDurationSecs": 3.98, "validDurationSecs": 158312.931, "totalHrs": 52.39, "validHrs": 43.97 }, "cnh": { "buckets": { "dev": 761, "invalidated": 436, "other": 2908, "reported": 8, "test": 763, "train": 817, "validated": 2458 }, "duration": 20675832, "reportedSentences": 9, "clips": 5802, "splits": { "accent": { "": 1 }, "age": { "": 0.51, "twenties": 0.36, "fourties": 0.01, "teens": 0.02, "thirties": 0.08, "fifties": 0.02 }, "gender": { "": 0.51, "male": 0.33, "female": 0.16 } }, "users": 299, "size": 161428394, "checksum": 
"9e7853df7950f63697b97f9ae766568fa10c895aed6930d46813ee05954f5d75", "avgDurationSecs": 3.564, "validDurationSecs": 8759.255, "totalHrs": 5.74, "validHrs": 2.43 }, "et": { "buckets": { "dev": 2638, "invalidated": 6708, "other": 474, "reported": 490, "test": 2638, "train": 3138, "validated": 21842 }, "duration": 196192766, "reportedSentences": 487, "clips": 29024, "splits": { "accent": { "": 1 }, "age": { "": 0.2, "thirties": 0.08, "twenties": 0.68, "fourties": 0.04, "fifties": 0, "seventies": 0, "teens": 0 }, "gender": { "": 0.2, "male": 0.54, "female": 0.26, "other": 0 } }, "users": 812, "size": 1338524800, "checksum": "76a93e1ae4d97d1134aca416623edfbc79f2a99d780d3e74be8d4053841eaa39", "avgDurationSecs": 6.76, "validDurationSecs": 147644.79, "totalHrs": 54.49, "validHrs": 41.01 }, "ky": { "buckets": { "dev": 1611, "invalidated": 5623, "other": 71, "reported": 37, "test": 1611, "train": 1791, "validated": 30164 }, "duration": 162640344, "reportedSentences": 38, "clips": 35858, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.08, "": 0.07, "fourties": 0.01, "twenties": 0.66, "teens": 0.18, "fifties": 0 }, "gender": { "male": 0.53, "": 0.11, "female": 0.35, "other": 0 } }, "users": 264, "size": 1050256673, "checksum": "9437dde385748e74da489819c9965a5564f47a24553bfa20905bdaecc3dfc29a", "avgDurationSecs": 4.536, "validDurationSecs": 136814.193, "totalHrs": 45.17, "validHrs": 38 }, "ro": { "buckets": { "dev": 3805, "invalidated": 929, "other": 20790, "reported": 382, "test": 3880, "train": 5155, "validated": 16383 }, "duration": 152255371, "reportedSentences": 383, "clips": 38102, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.13, "teens": 0.02, "": 0.11, "fourties": 0.06, "sixties": 0, "twenties": 0.67, "fifties": 0.01, "eighties": 0 }, "gender": { "male": 0.74, "": 0.1, "female": 0.15, "other": 0.01 } }, "users": 404, "size": 949779260, "checksum": "b283ac16b9005840c983f85ace7d5a8b59ce1475dfd17596a9ae24d0db463595", "avgDurationSecs": 3.996, 
"validDurationSecs": 65466.373, "totalHrs": 42.29, "validHrs": 18.18 }, "hsb": { "buckets": { "dev": 172, "invalidated": 247, "other": 15, "reported": 86, "test": 444, "train": 808, "validated": 1424 }, "duration": 10318356, "reportedSentences": 87, "clips": 1686, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.54, "": 0.18, "thirties": 0.1, "sixties": 0, "seventies": 0.03, "twenties": 0.11, "fifties": 0.03 }, "gender": { "male": 0.81, "": 0.18, "other": 0 } }, "users": 21, "size": 80438744, "checksum": "5d9b33a0f183663d905b088c54560f424da312df5831d4b87f3c09f1ff39abd5", "avgDurationSecs": 6.12, "validDurationSecs": 8714.91, "totalHrs": 2.86, "validHrs": 2.42 }, "el": { "buckets": { "dev": 1695, "invalidated": 819, "other": 10326, "reported": 69, "test": 1699, "train": 1919, "validated": 15731 }, "duration": 111094666, "reportedSentences": 70, "clips": 26876, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.37, "fourties": 0.16, "": 0.31, "twenties": 0.12, "fifties": 0.03, "teens": 0.01, "sixties": 0 }, "gender": { "male": 0.64, "": 0.31, "other": 0.02, "female": 0.03 } }, "users": 368, "size": 740925928, "checksum": "2e27cc0e3a21b25fd8eb25f602913bfbc8ffb9dd9e4c4f6761fb0b28e601f941", "avgDurationSecs": 4.134, "validDurationSecs": 65025.681, "totalHrs": 30.85, "validHrs": 18.06 }, "cs": { "buckets": { "dev": 8859, "invalidated": 2034, "other": 146938, "reported": 901, "test": 8857, "train": 19341, "validated": 59548 }, "duration": 930382491, "reportedSentences": 895, "clips": 208520, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.07, "": 0.23, "thirties": 0.47, "teens": 0.02, "twenties": 0.19, "fifties": 0.01, "sixties": 0, "seventies": 0 }, "gender": { "male": 0.54, "": 0.23, "female": 0.23, "other": 0 } }, "users": 910, "size": 5700801793, "checksum": "a3f4eaa88e531f27a42043ac20d160a4400defd629432af24071b709b545a8be", "avgDurationSecs": 4.462, "validDurationSecs": 265693.538, "totalHrs": 258.43, "validHrs": 73.8 }, "pl": { "buckets": { 
"dev": 8817, "invalidated": 6374, "other": 1671, "reported": 555, "test": 8817, "train": 18759, "validated": 129464 }, "duration": 619841306, "reportedSentences": 555, "clips": 137509, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.28, "": 0.24, "teens": 0.02, "thirties": 0.33, "fourties": 0.12, "fifties": 0.01, "nineties": 0.01, "sixties": 0 }, "gender": { "male": 0.6, "": 0.25, "female": 0.14, "other": 0.01 } }, "users": 3208, "size": 4428427451, "checksum": "ee7fb61a9aa9da3d81027afe5b74639b463ac369f0809389e024bab4ed7d4d71", "avgDurationSecs": 4.508, "validDurationSecs": 583577.328, "totalHrs": 172.17, "validHrs": 162.1 }, "rm-sursilv": { "buckets": { "dev": 1352, "invalidated": 677, "other": 2249, "reported": 16, "test": 1347, "train": 1554, "validated": 4260 }, "duration": 39264017, "reportedSentences": 17, "clips": 7186, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.03, "twenties": 0.1, "": 0.63, "teens": 0.05, "fourties": 0.19 }, "gender": { "male": 0.18, "female": 0.19, "": 0.63, "other": 0 } }, "users": 90, "size": 295870611, "checksum": "90ecd3bfa39d2b5a070d66203264c250a0d126c7ee108d13f5895cb4593b5e37", "avgDurationSecs": 5.464, "validDurationSecs": 23276.47, "totalHrs": 10.9, "validHrs": 6.46 }, "rm-vallader": { "buckets": { "dev": 500, "invalidated": 394, "other": 707, "reported": 37, "test": 462, "train": 556, "validated": 1525 }, "duration": 15294938, "reportedSentences": 36, "clips": 2626, "splits": { "accent": { "": 1 }, "age": { "": 0.36, "fourties": 0.41, "twenties": 0.14, "thirties": 0.06, "fifties": 0, "sixties": 0.03 }, "gender": { "": 0.36, "male": 0.44, "female": 0.19, "other": 0.01 } }, "users": 52, "size": 116554453, "checksum": "0fa59d0db83a06ee970a076c37d9b022eb448424b747e0da84ffff490b326850", "avgDurationSecs": 5.824, "validDurationSecs": 8882.247, "totalHrs": 4.24, "validHrs": 2.46 }, "mn": { "buckets": { "dev": 1791, "invalidated": 781, "other": 4640, "reported": 31, "test": 1891, "train": 2243, "validated": 8497 }, 
"duration": 75726080, "reportedSentences": 32, "clips": 13918, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.22, "": 0.25, "twenties": 0.44, "fourties": 0.01, "teens": 0.02, "nineties": 0.05, "fifties": 0, "eighties": 0.01 }, "gender": { "male": 0.39, "": 0.26, "female": 0.29, "other": 0.06 } }, "users": 505, "size": 562555749, "checksum": "eb5db8ca620505be8688de1d1e181813bf5c548a6c9e2f21cada91998bc26876", "avgDurationSecs": 5.441, "validDurationSecs": 46231.104, "totalHrs": 21.03, "validHrs": 12.84 }, "zh-HK": { "buckets": { "dev": 5593, "invalidated": 4369, "other": 21833, "reported": 673, "test": 5593, "train": 8426, "validated": 92004 }, "duration": 494280888, "reportedSentences": 662, "clips": 118206, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.13, "thirties": 0.11, "": 0.39, "teens": 0.02, "fifties": 0.02, "seventies": 0, "sixties": 0.01, "twenties": 0.33 }, "gender": { "male": 0.42, "": 0.35, "female": 0.22, "other": 0.01 } }, "users": 2982, "size": 3536434053, "checksum": "920147b28c6709076047753b40c7d66fdc1f1c1ef301c1f496a54d4ff426239d", "avgDurationSecs": 4.182, "validDurationSecs": 384716.671, "totalHrs": 137.3, "validHrs": 106.86 }, "ab": { "buckets": { "dev": 9160, "invalidated": 5277, "other": 11942, "reported": 224, "test": 9116, "train": 21027, "validated": 41976 }, "duration": 303841224, "reportedSentences": 223, "clips": 59195, "splits": { "accent": { "": 1 }, "age": { "seventies": 0.01, "thirties": 0.13, "": 0.18, "teens": 0.28, "twenties": 0.18, "fifties": 0.06, "sixties": 0.05, "fourties": 0.09, "eighties": 0.01 }, "gender": { "male": 0.18, "female": 0.64, "": 0.18 } }, "users": 400, "size": 1736340543, "checksum": "935e46dcc3cb293f2d81dc0a877a9dc25d08c08fb248aacb0d5c0bf9ea49bb31", "avgDurationSecs": 5.133, "validDurationSecs": 215458.049, "totalHrs": 84.4, "validHrs": 59.84 }, "cv": { "buckets": { "dev": 1261, "invalidated": 2135, "other": 123, "reported": 143, "test": 1290, "train": 1435, "validated": 17314 }, 
"duration": 99019848, "reportedSentences": 139, "clips": 19572, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.51, "": 0.22, "fourties": 0.06, "thirties": 0.01, "teens": 0.19, "fifties": 0.01 }, "gender": { "male": 0.54, "": 0.2, "female": 0.27 } }, "users": 112, "size": 672785579, "checksum": "cbc9adc419a32634b05ffa6a844c336f65dc9ab91c8b7dfc20e14cf42aece1dc", "avgDurationSecs": 5.059, "validDurationSecs": 87596.038, "totalHrs": 27.5, "validHrs": 24.33 }, "uk": { "buckets": { "dev": 8871, "invalidated": 2627, "other": 38, "reported": 622, "test": 8875, "train": 19024, "validated": 67031 }, "duration": 330461952, "reportedSentences": 623, "clips": 69696, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.26, "teens": 0.1, "": 0.26, "fourties": 0.12, "thirties": 0.26, "fifties": 0, "sixties": 0 }, "gender": { "male": 0.58, "female": 0.15, "": 0.27, "other": 0 } }, "users": 889, "size": 2220799870, "checksum": "2ab9444796cee58a6405800426c13f8f88170efe6eed746b2622494328498d03", "avgDurationSecs": 4.741, "validDurationSecs": 317825.917, "totalHrs": 91.79, "validHrs": 88.28 }, "mt": { "buckets": { "dev": 1592, "invalidated": 321, "other": 6257, "reported": 9, "test": 1646, "train": 1942, "validated": 6390 }, "duration": 61481160, "reportedSentences": 10, "clips": 12968, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.19, "": 0.26, "fourties": 0.17, "thirties": 0.09, "teens": 0.03, "fifties": 0.25, "sixties": 0.01 }, "gender": { "male": 0.25, "": 0.26, "female": 0.48, "other": 0.01 } }, "users": 212, "size": 457097788, "checksum": "1c4aea8906220887fd0bbe353c294dd48c3337399dfeffe20893e2ce45c0d095", "avgDurationSecs": 4.741, "validDurationSecs": 30294.927, "totalHrs": 17.07, "validHrs": 8.41 }, "as": { "buckets": { "dev": 510, "invalidated": 204, "other": 154, "reported": 9, "test": 551, "train": 652, "validated": 1713 }, "duration": 11784629, "reportedSentences": 10, "clips": 2071, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.36, "": 0.6, 
"thirties": 0.04, "teens": 0 }, "gender": { "male": 0.41, "": 0.59, "female": 0 } }, "users": 45, "size": 73679330, "checksum": "4f7d8378367c1fdafb981c2a4d3af3c7229da5cb5bfca054d7f4454d0a19f141", "avgDurationSecs": 5.69, "validDurationSecs": 9747.499, "totalHrs": 3.27, "validHrs": 2.7 }, "ka": { "buckets": { "dev": 10196, "invalidated": 3177, "other": 25708, "reported": 630, "test": 10444, "train": 28931, "validated": 60191 }, "duration": 458929908, "reportedSentences": 631, "clips": 89076, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.24, "thirties": 0.38, "": 0.22, "fourties": 0.06, "fifties": 0.07, "teens": 0.02, "sixties": 0.02 }, "gender": { "male": 0.28, "female": 0.5, "": 0.22, "other": 0 } }, "users": 1165, "size": 2685141603, "checksum": "4ca1e6d7297f3838187e664504875e22f05a218534d584fa57b617d6977b84c0", "avgDurationSecs": 5.152, "validDurationSecs": 310111.03, "totalHrs": 127.48, "validHrs": 86.14 }, "fy-NL": { "buckets": { "dev": 3171, "invalidated": 3838, "other": 101752, "reported": 536, "test": 3170, "train": 3919, "validated": 50053 }, "duration": 755324307, "reportedSentences": 534, "clips": 155643, "splits": { "accent": { "": 1 }, "age": { "": 0.57, "fifties": 0.1, "thirties": 0.1, "twenties": 0.04, "fourties": 0.06, "sixties": 0.12, "seventies": 0, "teens": 0, "eighties": 0 }, "gender": { "": 0.57, "male": 0.12, "female": 0.31 } }, "users": 2000, "size": 4528474484, "checksum": "4c86bad927f7c0ba31338c6bb136ca46017bd1327bbf3d91fd4835651ee870ea", "avgDurationSecs": 4.853, "validDurationSecs": 242903.616, "totalHrs": 209.81, "validHrs": 67.47 }, "dv": { "buckets": { "dev": 2215, "invalidated": 1688, "other": 16094, "reported": 55, "test": 2250, "train": 2652, "validated": 27395 }, "duration": 228557187, "reportedSentences": 56, "clips": 45177, "splits": { "accent": { "": 1 }, "age": { "": 0.22, "twenties": 0.18, "thirties": 0.36, "fourties": 0.22, "teens": 0.01, "nineties": 0, "fifties": 0.02 }, "gender": { "": 0.21, "male": 0.29, "female": 
0.5 } }, "users": 339, "size": 1447708048, "checksum": "6363db3d7c40dc7195c32b8be239304ccdceca87e0187cbbf9eb0632b6389957", "avgDurationSecs": 5.059, "validDurationSecs": 138595.395, "totalHrs": 63.48, "validHrs": 38.49 }, "pa-IN": { "buckets": { "dev": 286, "invalidated": 80, "other": 1395, "reported": 249, "test": 476, "train": 726, "validated": 1488 }, "duration": 14291270, "reportedSentences": 244, "clips": 2963, "splits": { "accent": { "": 1 }, "age": { "": 0.3, "fourties": 0.04, "fifties": 0.05, "thirties": 0.38, "twenties": 0.23, "sixties": 0, "teens": 0 }, "gender": { "": 0.3, "male": 0.69, "female": 0.01 } }, "users": 65, "size": 100555420, "checksum": "b0be152800bbef88c6d357091c8af98daff40b7fa74ac20c500dae9c65a6f66b", "avgDurationSecs": 4.823, "validDurationSecs": 7176.986, "totalHrs": 3.96, "validHrs": 1.99 }, "vi": { "buckets": { "dev": 432, "invalidated": 354, "other": 11619, "reported": 186, "test": 1275, "train": 2412, "validated": 4853 }, "duration": 66727808, "reportedSentences": 185, "clips": 16826, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.02, "twenties": 0.2, "": 0.25, "teens": 0.21, "seventies": 0, "fourties": 0.02, "sixties": 0.3 }, "gender": { "male": 0.53, "": 0.25, "female": 0.2, "other": 0.02 } }, "users": 264, "size": 387719091, "checksum": "b2820fe318bf10d54b920b3c4438b80dd4e5c9f897f389796572581dc8327dd8", "avgDurationSecs": 3.966, "validDurationSecs": 19245.813, "totalHrs": 18.53, "validHrs": 5.34 }, "or": { "buckets": { "dev": 0, "invalidated": 267, "other": 5352, "reported": 26, "test": 669, "train": 1851, "validated": 2897 }, "duration": 42557460, "reportedSentences": 27, "clips": 8516, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.18, "": 0.09, "thirties": 0.7, "fourties": 0.02, "teens": 0, "fifties": 0.01 }, "gender": { "male": 0.87, "": 0.09, "female": 0.04 } }, "users": 125, "size": 296812285, "checksum": "bab44fa0396c3fdda8ed5921b871f76de264585c0cf33197527c42bfb0ba3c9a", "avgDurationSecs": 4.997, 
"validDurationSecs": 14477.332, "totalHrs": 11.82, "validHrs": 4.02 }, "ga-IE": { "buckets": { "dev": 516, "invalidated": 883, "other": 4219, "reported": 27, "test": 514, "train": 536, "validated": 5051 }, "duration": 36262395, "reportedSentences": 28, "clips": 10153, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.26, "": 0.37, "thirties": 0.25, "fourties": 0.06, "sixties": 0.01, "teens": 0.01, "fifties": 0.05 }, "gender": { "male": 0.5, "": 0.37, "female": 0.13, "other": 0 } }, "users": 178, "size": 247816750, "checksum": "d8b0fc5a984ef4c622ef10347030ca83c1d9f80c494940038c0cfbb589053a09", "avgDurationSecs": 3.572, "validDurationSecs": 18040.122, "totalHrs": 10.07, "validHrs": 5.01 }, "fi": { "buckets": { "dev": 1631, "invalidated": 243, "other": 5829, "reported": 54, "test": 1747, "train": 2185, "validated": 8815 }, "duration": 68542167, "reportedSentences": 55, "clips": 14887, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.2, "": 0.34, "twenties": 0.1, "fourties": 0.32, "teens": 0.01, "fifties": 0.04, "seventies": 0 }, "gender": { "male": 0.33, "": 0.34, "female": 0.33, "other": 0 } }, "users": 212, "size": 407826880, "checksum": "80b30a743d8b567c6d824ed02f4d5e2fed89dfc9b3c5ca855b36972d954a651c", "avgDurationSecs": 4.604, "validDurationSecs": 40585.692, "totalHrs": 19.03, "validHrs": 11.27 }, "hu": { "buckets": { "dev": 10669, "invalidated": 2631, "other": 43638, "reported": 410, "test": 10773, "train": 31075, "validated": 52864 }, "duration": 540841933, "reportedSentences": 411, "clips": 99133, "splits": { "accent": { "": 1 }, "age": { "teens": 0.02, "": 0.31, "thirties": 0.13, "twenties": 0.13, "fifties": 0.25, "fourties": 0.13, "sixties": 0.02, "seventies": 0.02, "eighties": 0 }, "gender": { "male": 0.27, "": 0.32, "female": 0.41, "other": 0.01 } }, "users": 1553, "size": 3154771110, "checksum": "2e7b2aa9aa70917b3ec210c1ab93eb104a038de0ecbe5296cd734dc7366fa19d", "avgDurationSecs": 5.456, "validDurationSecs": 288411.205, "totalHrs": 150.23, 
"validHrs": 80.11 }, "th": { "buckets": { "dev": 11024, "invalidated": 9204, "other": 204938, "reported": 4191, "test": 11024, "train": 32669, "validated": 145329 }, "duration": 1506747396, "reportedSentences": 4191, "clips": 359471, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.24, "": 0.4, "thirties": 0.07, "fourties": 0.04, "teens": 0.04, "fifties": 0.2, "eighties": 0, "sixties": 0 }, "gender": { "male": 0.42, "": 0.4, "female": 0.17, "other": 0.01 } }, "users": 7814, "size": 8767813919, "checksum": "30b1e7378a39b54f6a90f87a4dd4613f078de9ee35208084e5b0fb17819b256a", "avgDurationSecs": 4.192, "validDurationSecs": 609156.489, "totalHrs": 418.54, "validHrs": 169.21 }, "lt": { "buckets": { "dev": 4316, "invalidated": 713, "other": 1196, "reported": 188, "test": 4585, "train": 6607, "validated": 15698 }, "duration": 90348056, "reportedSentences": 188, "clips": 17607, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.36, "": 0.21, "thirties": 0.31, "fifties": 0.05, "sixties": 0.01, "teens": 0.03, "fourties": 0.04 }, "gender": { "male": 0.57, "": 0.21, "female": 0.22 } }, "users": 282, "size": 550931539, "checksum": "779a654d61aa4c6df1dc068a02e45849608934a343d4905aeac2c02931924011", "avgDurationSecs": 5.131, "validDurationSecs": 80552.268, "totalHrs": 25.09, "validHrs": 22.37 }, "lg": { "buckets": { "dev": 13398, "invalidated": 39161, "other": 36929, "reported": 6450, "test": 13431, "train": 70819, "validated": 271673 }, "duration": 2011798467, "reportedSentences": 6445, "clips": 347763, "splits": { "accent": { "": 1 }, "age": { "": 0.26, "thirties": 0.22, "twenties": 0.38, "fourties": 0.07, "fifties": 0.05, "teens": 0.01, "nineties": 0, "sixties": 0.02, "seventies": 0, "eighties": 0 }, "gender": { "": 0.25, "female": 0.39, "male": 0.36 } }, "users": 648, "size": 11786491897, "checksum": "83aca077d149596ee9902353a1556e61a99782934a48e11ef4be012133c7a4a0", "avgDurationSecs": 5.785, "validDurationSecs": 1571620.112, "totalHrs": 558.83, "validHrs": 436.56 
}, "hi": { "buckets": { "dev": 2396, "invalidated": 723, "other": 3400, "reported": 132, "test": 2982, "train": 4575, "validated": 10004 }, "duration": 68657640, "reportedSentences": 133, "clips": 14127, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.33, "fourties": 0.03, "": 0.34, "thirties": 0.26, "teens": 0.01, "fifties": 0.02, "sixties": 0.01 }, "gender": { "male": 0.62, "female": 0.04, "": 0.34, "other": 0 } }, "users": 383, "size": 405434490, "checksum": "5da9d205060550d7f47026fe2762e7d97df4af2a2a518c1697fe45846c48dd30", "avgDurationSecs": 4.86, "validDurationSecs": 48619.737, "totalHrs": 19.07, "validHrs": 13.5 }, "bas": { "buckets": { "dev": 457, "invalidated": 508, "other": 5, "reported": 7, "test": 533, "train": 763, "validated": 1753 }, "duration": 10040580, "reportedSentences": 8, "clips": 2266, "splits": { "accent": { "": 1 }, "age": { "": 0.98, "fourties": 0.01, "teens": 0.01 }, "gender": { "": 0.98, "female": 0.02 } }, "users": 33, "size": 55875175, "checksum": "5e8b9893ecbeea88130d16cd3e2caf61c081358852f3ca738f03cf01a7bf47ee", "avgDurationSecs": 4.431, "validDurationSecs": 7767.492, "totalHrs": 2.78, "validHrs": 2.15 }, "sk": { "buckets": { "dev": 2531, "invalidated": 820, "other": 3012, "reported": 61, "test": 2649, "train": 3285, "validated": 19202 }, "duration": 93862332, "reportedSentences": 61, "clips": 23034, "splits": { "accent": { "": 1 }, "age": { "": 0.44, "thirties": 0.21, "twenties": 0.08, "fourties": 0.18, "teens": 0.08, "fifties": 0.01 }, "gender": { "": 0.43, "male": 0.48, "female": 0.07, "other": 0.01 } }, "users": 213, "size": 529939982, "checksum": "197ea34c5c565470c0ad19f2cedbdacfe232ebdce88b2ae9e5ef0bba21649227", "avgDurationSecs": 4.075, "validDurationSecs": 78247.135, "totalHrs": 26.07, "validHrs": 21.73 }, "kmr": { "buckets": { "dev": 3512, "invalidated": 2396, "other": 31874, "reported": 1033, "test": 3501, "train": 4473, "validated": 52831 }, "duration": 353120796, "reportedSentences": 1028, "clips": 87101, 
"splits": { "accent": { "": 1 }, "age": { "": 0.48, "twenties": 0.35, "thirties": 0.08, "fourties": 0.03, "fifties": 0.04, "teens": 0.01, "sixties": 0 }, "gender": { "": 0.48, "male": 0.42, "female": 0.1, "other": 0 } }, "users": 550, "size": 1863722947, "checksum": "d112d882472a280d3656924d0d5592f3ff245dfe79e4ecece729d7aa494e4d34", "avgDurationSecs": 4.054, "validDurationSecs": 214184.967, "totalHrs": 98.08, "validHrs": 59.49 }, "bg": { "buckets": { "dev": 2467, "invalidated": 689, "other": 3914, "reported": 184, "test": 2607, "train": 3406, "validated": 8496 }, "duration": 71651844, "reportedSentences": 185, "clips": 13099, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.24, "thirties": 0.28, "": 0.37, "twenties": 0.11, "teens": 0, "sixties": 0, "fifties": 0 }, "gender": { "male": 0.59, "female": 0.04, "": 0.37 } }, "users": 106, "size": 418787488, "checksum": "bcdcee16c60a4192de13116e18d4f243ecd93f0042dde55db49dead0bd324f1b", "avgDurationSecs": 5.47, "validDurationSecs": 46473.324, "totalHrs": 19.9, "validHrs": 12.9 }, "kk": { "buckets": { "dev": 369, "invalidated": 197, "other": 14, "reported": 41, "test": 398, "train": 453, "validated": 1225 }, "duration": 7074144, "reportedSentences": 42, "clips": 1436, "splits": { "accent": { "": 1 }, "age": { "": 0.52, "thirties": 0.03, "twenties": 0.31, "teens": 0.05, "fifties": 0.09 }, "gender": { "": 0.53, "male": 0.45, "female": 0.02 } }, "users": 92, "size": 40662137, "checksum": "b3f9897e2eadcaf49df76130b62ace0342ad3ce36d84ad52fbd54584b805a167", "avgDurationSecs": 4.926, "validDurationSecs": 6034.698, "totalHrs": 1.96, "validHrs": 1.67 }, "ba": { "buckets": { "dev": 14563, "invalidated": 7928, "other": 176, "reported": 866, "test": 14514, "train": 119078, "validated": 209142 }, "duration": 961652844, "reportedSentences": 863, "clips": 217246, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.17, "": 0.3, "fourties": 0.06, "fifties": 0.05, "twenties": 0.17, "sixties": 0.2, "seventies": 0, "teens": 0.04 }, 
"gender": { "male": 0.3, "": 0.3, "female": 0.4 } }, "users": 908, "size": 5395823441, "checksum": "4ef60613767d37170b8d226cff695b7d3692c0e60116eba0a215a6fb250943fe", "avgDurationSecs": 4.427, "validDurationSecs": 925779.987, "totalHrs": 267.12, "validHrs": 257.16 }, "gl": { "buckets": { "dev": 6849, "invalidated": 2080, "other": 14988, "reported": 361, "test": 7077, "train": 12623, "validated": 26880 }, "duration": 216533448, "reportedSentences": 361, "clips": 43948, "splits": { "accent": { "": 1 }, "age": { "": 0.31, "thirties": 0.18, "fifties": 0.07, "twenties": 0.21, "fourties": 0.19, "teens": 0.02, "sixties": 0.01, "seventies": 0 }, "gender": { "": 0.35, "male": 0.35, "female": 0.29, "other": 0 } }, "users": 1007, "size": 1243154581, "checksum": "634fbaa19d8a30297317fac63934a883bf4b619e57be1800c3ea3726af9f971e", "avgDurationSecs": 4.927, "validDurationSecs": 132438.77, "totalHrs": 60.14, "validHrs": 36.78 }, "ug": { "buckets": { "dev": 3846, "invalidated": 3996, "other": 5, "reported": 284, "test": 3846, "train": 5005, "validated": 69434 }, "duration": 442916784, "reportedSentences": 285, "clips": 73435, "splits": { "accent": { "": 1 }, "age": { "": 0.6, "fifties": 0.01, "twenties": 0.11, "thirties": 0.15, "fourties": 0.12, "teens": 0.01, "eighties": 0 }, "gender": { "": 0.6, "male": 0.31, "female": 0.08, "other": 0 } }, "users": 926, "size": 2572712328, "checksum": "d59188dec224850bce3e656df2681b923b93801f9c6777072b1ac642f7c64f8d", "avgDurationSecs": 6.031, "validDurationSecs": 418785.102, "totalHrs": 123.03, "validHrs": 116.32 }, "hy-AM": { "buckets": { "dev": 371, "invalidated": 109, "other": 1193, "reported": 44, "test": 473, "train": 636, "validated": 1481 }, "duration": 17004096, "reportedSentences": 45, "clips": 2783, "splits": { "accent": { "": 1 }, "age": { "": 0.37, "thirties": 0.13, "twenties": 0.37, "fifties": 0.04, "teens": 0.09 }, "gender": { "": 0.37, "male": 0.22, "female": 0.41 } }, "users": 68, "size": 99865274, "checksum": 
"ce13e81be33bc2340e03ff9d8267362b7f706245054d49fe6a0ee52317d187bd", "avgDurationSecs": 6.11, "validDurationSecs": 9048.892, "totalHrs": 4.72, "validHrs": 2.51 }, "be": { "buckets": { "dev": 15877, "invalidated": 33282, "other": 114, "reported": 3135, "test": 15877, "train": 347560, "validated": 1141720 }, "duration": 5597641080, "reportedSentences": 3134, "clips": 1175116, "splits": { "accent": { "": 1 }, "age": { "": 0.85, "fourties": 0.05, "thirties": 0.05, "twenties": 0.04, "teens": 0.01, "fifties": 0, "sixties": 0, "seventies": 0 }, "gender": { "": 0.85, "male": 0.07, "female": 0.09, "other": 0 } }, "users": 8162, "size": 31720606141, "checksum": "8f3f4cbcd01b80bb9c26abbc2bba42833f8cc59b80705303ef28b4d114e73b20", "avgDurationSecs": 4.763, "validDurationSecs": 5438559.916, "totalHrs": 1554.9, "validHrs": 1510.71 }, "ur": { "buckets": { "dev": 3302, "invalidated": 3377, "other": 85550, "reported": 56, "test": 3304, "train": 4130, "validated": 41996 }, "duration": 509111748, "reportedSentences": 56, "clips": 130923, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.88, "": 0.1, "fourties": 0.01, "thirties": 0, "teens": 0.01, "fifties": 0, "sixties": 0 }, "gender": { "male": 0.69, "": 0.1, "female": 0.2 } }, "users": 207, "size": 2960368172, "checksum": "d1d136604caace2cd1ee3d4cc6493de0ac81e9e72c7a06f468d4852982029ab9", "avgDurationSecs": 3.889, "validDurationSecs": 163307.112, "totalHrs": 141.41, "validHrs": 45.36 }, "gn": { "buckets": { "dev": 360, "invalidated": 157, "other": 9611, "reported": 35, "test": 843, "train": 1550, "validated": 2865 }, "duration": 53781552, "reportedSentences": 36, "clips": 12633, "splits": { "accent": { "": 1 }, "age": { "": 0.16, "twenties": 0.42, "thirties": 0.3, "sixties": 0, "fourties": 0.1, "teens": 0.02 }, "gender": { "": 0.16, "male": 0.62, "female": 0.22 } }, "users": 115, "size": 306545569, "checksum": "85081c02843e46f01a25a8b478f9bbdb38e919fc7760cf49f2e39723c6eec9a0", "avgDurationSecs": 4.257, "validDurationSecs": 
12196.956, "totalHrs": 14.93, "validHrs": 3.38 }, "sr": { "buckets": { "dev": 1207, "invalidated": 168, "other": 1725, "reported": 29, "test": 1326, "train": 1694, "validated": 4629 }, "duration": 18808200, "reportedSentences": 30, "clips": 6522, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.34, "": 0.26, "fifties": 0.19, "fourties": 0.07, "thirties": 0.15, "teens": 0 }, "gender": { "male": 0.5, "": 0.26, "female": 0.24 } }, "users": 137, "size": 98872390, "checksum": "859301ec5ca2233aebfe68f61dbd5a029ce3f26eb25b86a1f6a38fa77818a2f5", "avgDurationSecs": 2.884, "validDurationSecs": 13349.15, "totalHrs": 5.22, "validHrs": 3.7 }, "uz": { "buckets": { "dev": 12079, "invalidated": 13876, "other": 127911, "reported": 1826, "test": 12352, "train": 48369, "validated": 86188 }, "duration": 945554400, "reportedSentences": 1809, "clips": 227975, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.4, "": 0.41, "thirties": 0.01, "teens": 0.18, "fifties": 0, "fourties": 0.01, "nineties": 0 }, "gender": { "male": 0.44, "": 0.41, "female": 0.15, "other": 0 } }, "users": 2134, "size": 5276247455, "checksum": "40c5f7e5df4f04168f21e7a7d974a4a2fac981ab8c4f0086e372bb0450d5d66a", "avgDurationSecs": 4.148, "validDurationSecs": 357475.349, "totalHrs": 262.65, "validHrs": 99.29 }, "mr": { "buckets": { "dev": 1783, "invalidated": 2259, "other": 2826, "reported": 58, "test": 1737, "train": 2225, "validated": 10861 }, "duration": 98787240, "reportedSentences": 59, "clips": 15946, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.13, "sixties": 0, "twenties": 0.29, "": 0.05, "teens": 0.53 }, "gender": { "male": 0.19, "female": 0.76, "": 0.05 } }, "users": 87, "size": 579131341, "checksum": "82a4a94737b7fb7fa9cccbc6ff85d3376f1bc2aa7511f91c7c6762f8dbb21cdb", "avgDurationSecs": 6.195, "validDurationSecs": 67285.101, "totalHrs": 27.44, "validHrs": 18.69 }, "da": { "buckets": { "dev": 2000, "invalidated": 362, "other": 721, "reported": 401, "test": 2314, "train": 3238, 
"validated": 9494 }, "duration": 43878456, "reportedSentences": 401, "clips": 10577, "splits": { "accent": { "": 1 }, "age": { "": 0.31, "thirties": 0.26, "twenties": 0.18, "sixties": 0.01, "fourties": 0.21, "fifties": 0.03, "teens": 0 }, "gender": { "": 0.31, "female": 0.07, "male": 0.61 } }, "users": 236, "size": 252068290, "checksum": "5adef6e485333aeccd60cd0c1b26df06e20d4a4d1b96103891112097aeae9f3f", "avgDurationSecs": 4.148, "validDurationSecs": 39385.654, "totalHrs": 12.18, "validHrs": 10.94 }, "myv": { "buckets": { "dev": 239, "invalidated": 23, "other": 1, "reported": 30, "test": 481, "train": 1241, "validated": 1971 }, "duration": 11519388, "reportedSentences": 31, "clips": 1995, "splits": { "accent": { "": 1 }, "age": { "sixties": 0.26, "": 0.38, "thirties": 0.24, "twenties": 0.09, "teens": 0.01, "fourties": 0.02 }, "gender": { "male": 0.54, "": 0.38, "female": 0.07 } }, "users": 13, "size": 67201612, "checksum": "3874606a26f5e3b1d9930e7a09c950ebe4b1ed982bb7ca90010343ef145285fa", "avgDurationSecs": 5.774, "validDurationSecs": 11380.809, "totalHrs": 3.19, "validHrs": 3.16 }, "nn-NO": { "buckets": { "dev": 242, "invalidated": 54, "other": 151, "reported": 22, "test": 281, "train": 396, "validated": 924 }, "duration": 5008248, "reportedSentences": 23, "clips": 1129, "splits": { "accent": { "": 1 }, "age": { "": 0.26, "thirties": 0.49, "twenties": 0.17, "fourties": 0.04, "fifties": 0.01, "teens": 0.02 }, "gender": { "": 0.26, "female": 0.16, "male": 0.54, "other": 0.03 } }, "users": 32, "size": 28716209, "checksum": "bef79f16962d5d1e198f79faca68d808324803ad549c2ed6fa6bd5bf83d5b378", "avgDurationSecs": 4.436, "validDurationSecs": 4098.867, "totalHrs": 1.39, "validHrs": 1.13 }, "ha": { "buckets": { "dev": 582, "invalidated": 169, "other": 6657, "reported": 32, "test": 660, "train": 1926, "validated": 3280 }, "duration": 43793136, "reportedSentences": 32, "clips": 10106, "splits": { "accent": { "": 1 }, "age": { "": 0.18, "thirties": 0.68, "twenties": 0.12, 
"fourties": 0, "fifties": 0.02 }, "gender": { "": 0.16, "male": 0.59, "female": 0.25 } }, "users": 39, "size": 256252964, "checksum": "e32413abf57ded4f063e7792e228c907214d776de50a21a5501a81aba19e5d2a", "avgDurationSecs": 4.333, "validDurationSecs": 14213.486, "totalHrs": 12.16, "validHrs": 3.94 }, "ckb": { "buckets": { "dev": 4945, "invalidated": 7393, "other": 7408, "reported": 2384, "test": 4940, "train": 7045, "validated": 102361 }, "duration": 457982856, "reportedSentences": 2384, "clips": 117162, "splits": { "accent": { "": 1 }, "age": { "": 0.34, "thirties": 0.12, "twenties": 0.48, "fourties": 0.03, "teens": 0.02, "fifties": 0.02 }, "gender": { "": 0.32, "male": 0.61, "female": 0.07, "other": 0 } }, "users": 1283, "size": 2482508283, "checksum": "cbf33af7ae77d6b524ebc9c42ae3d7523d9c90233d46f65a4064df8a15335038", "avgDurationSecs": 3.909, "validDurationSecs": 400126.177, "totalHrs": 127.21, "validHrs": 111.14 }, "ml": { "buckets": { "dev": 468, "invalidated": 88, "other": 4561, "reported": 119, "test": 543, "train": 1196, "validated": 2271 }, "duration": 28728936, "reportedSentences": 120, "clips": 6920, "splits": { "accent": { "": 1 }, "age": { "": 0.26, "twenties": 0.55, "thirties": 0.04, "fourties": 0.15, "sixties": 0 }, "gender": { "": 0.26, "male": 0.56, "other": 0.06, "female": 0.12 } }, "users": 121, "size": 162980893, "checksum": "139a64e20d26b0acf8be00914c4cde380a230326b4e6149d7e6cdc8a78aea6f0", "avgDurationSecs": 4.152, "validDurationSecs": 9428.239, "totalHrs": 7.98, "validHrs": 2.61 }, "mdf": { "buckets": { "dev": 54, "invalidated": 9, "other": 0, "reported": 14, "test": 107, "train": 175, "validated": 336 }, "duration": 1811340, "reportedSentences": 15, "clips": 345, "splits": { "accent": { "": 1 }, "age": { "sixties": 0.06, "": 0.58, "fourties": 0.34, "twenties": 0.02 }, "gender": { "male": 0.08, "": 0.58, "female": 0.34 } }, "users": 11, "size": 10645944, "checksum": "3f815a63c4aec0e71a338db3b4432586dac4ca79588eae3bf3b17809725cd22d", 
"avgDurationSecs": 5.25, "validDurationSecs": 1764.088, "totalHrs": 0.5, "validHrs": 0.49 }, "sw": { "buckets": { "dev": 11621, "invalidated": 69128, "other": 318132, "reported": 1867, "test": 11618, "train": 38552, "validated": 237755 }, "duration": 3290122944, "reportedSentences": 1862, "clips": 625015, "splits": { "accent": { "": 1 }, "age": { "": 0.34, "twenties": 0.44, "thirties": 0.12, "teens": 0, "fifties": 0.05, "fourties": 0.04, "sixties": 0.01 }, "gender": { "": 0.32, "male": 0.36, "female": 0.32, "other": 0 } }, "users": 1177, "size": 19198285962, "checksum": "413dc644173112f5710830a63e19e3dbf2f5f0549f4f2375db45e779ee613ae9", "avgDurationSecs": 5.264, "validDurationSecs": 1251559.051, "totalHrs": 913.92, "validHrs": 347.65 }, "sat": { "buckets": { "dev": 0, "invalidated": 9, "other": 354, "reported": 6, "test": 146, "train": 315, "validated": 461 }, "duration": 3676320, "reportedSentences": 7, "clips": 824, "splits": { "accent": { "": 1 }, "age": { "": 0.43, "twenties": 0.43, "fourties": 0.01, "fifties": 0.01, "teens": 0.01, "thirties": 0.1 }, "gender": { "": 0.41, "male": 0.58, "female": 0.01 } }, "users": 13, "size": 20296733, "checksum": "43783cb0e3716cad8aea0cc1b648ad1f67f0da25298b5cf1928abd160a43bea4", "avgDurationSecs": 4.462, "validDurationSecs": 2056.776, "totalHrs": 1.02, "validHrs": 0.57 }, "tig": { "buckets": { "dev": 10, "invalidated": 12, "other": 268, "reported": 0, "test": 11, "train": 20, "validated": 41 }, "duration": 1620288, "reportedSentences": 1, "clips": 321, "splits": { "accent": { "": 1 }, "age": { "": 0.92, "twenties": 0.02, "thirties": 0.06 }, "gender": { "": 0.92, "male": 0.08 } }, "users": 9, "size": 9522671, "checksum": "14f0c46fd4d94ac15603778d3d29af54984625603fcbe168346e12ee8ffac292", "avgDurationSecs": 5.048, "validDurationSecs": 206.953, "totalHrs": 0.45, "validHrs": 0.05 }, "ig": { "buckets": { "dev": 2, "invalidated": 3, "other": 5755, "reported": 15, "test": 4, "train": 8, "validated": 14 }, "duration": 31535604, 
"reportedSentences": 15, "clips": 5772, "splits": { "accent": { "": 1 }, "age": { "": 0.56, "twenties": 0.32, "teens": 0.05, "eighties": 0, "thirties": 0.04, "sixties": 0.02, "fourties": 0 }, "gender": { "": 0.56, "male": 0.14, "female": 0.3 } }, "users": 113, "size": 184186514, "checksum": "420b3c6e1b3dd71892e8a124d5377b2443a3324ac70348437b44ced8bbe92b88", "avgDurationSecs": 5.464, "validDurationSecs": 76.49, "totalHrs": 8.75, "validHrs": 0.02 }, "nan-tw": { "buckets": { "dev": 1085, "invalidated": 325, "other": 11243, "reported": 148, "test": 1588, "train": 2279, "validated": 4964 }, "duration": 44122392, "reportedSentences": 149, "clips": 16532, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.23, "": 0.13, "twenties": 0.35, "fourties": 0.27, "teens": 0.02, "fifties": 0.01, "sixties": 0 }, "gender": { "male": 0.55, "": 0.14, "other": 0.12, "female": 0.19 } }, "users": 137, "size": 247245573, "checksum": "e099c9e8491a3e098901f543ba4974e80a969137494cebbb48923ce606d0d6d9", "avgDurationSecs": 2.669, "validDurationSecs": 13248.461, "totalHrs": 12.25, "validHrs": 3.68 }, "mhr": { "buckets": { "dev": 14221, "invalidated": 5806, "other": 47370, "reported": 89, "test": 14251, "train": 105425, "validated": 135759 }, "duration": 878626980, "reportedSentences": 88, "clips": 188935, "splits": { "accent": { "": 1 }, "age": { "fifties": 0.05, "": 0.14, "sixties": 0.04, "thirties": 0.25, "fourties": 0.15, "twenties": 0.28, "teens": 0.08, "seventies": 0.01 }, "gender": { "male": 0.21, "": 0.14, "female": 0.65 } }, "users": 418, "size": 4912194586, "checksum": "7ccc5dbb87bbd8a97b6daeead741c7222c1f11b09a3bcc65b47068ffa81ec74d", "avgDurationSecs": 4.65, "validDurationSecs": 631336.281, "totalHrs": 244.06, "validHrs": 175.37 }, "bn": { "buckets": { "dev": 9293, "invalidated": 7756, "other": 994420, "reported": 1868, "test": 9293, "train": 21048, "validated": 43861 }, "duration": 4563772560, "reportedSentences": 1862, "clips": 1046037, "splits": { "accent": { "": 1 }, "age": { 
"thirties": 0.03, "twenties": 0.67, "": 0.22, "teens": 0.07, "fourties": 0.01, "fifties": 0 }, "gender": { "male": 0.54, "": 0.22, "female": 0.23, "other": 0 } }, "users": 22855, "size": 26290919632, "checksum": "078bba82edefafd1d49d5b415149e0f2831666d8f11894ad3543114b512c9d18", "avgDurationSecs": 4.363, "validDurationSecs": 191361.9, "totalHrs": 1267.71, "validHrs": 53.15 }, "tok": { "buckets": { "dev": 2042, "invalidated": 264, "other": 2858, "reported": 139, "test": 2050, "train": 2591, "validated": 10057 }, "duration": 50848884, "reportedSentences": 140, "clips": 13179, "splits": { "accent": { "": 1 }, "age": { "": 0.38, "twenties": 0.24, "teens": 0.27, "thirties": 0.08, "fourties": 0, "eighties": 0.03 }, "gender": { "": 0.38, "male": 0.45, "other": 0.11, "female": 0.07 } }, "users": 129, "size": 296066167, "checksum": "06512dd33af375c9cf3fea1a7c6c1e77cd6f4409a212d7b0ca7cf91b46fb149b", "avgDurationSecs": 3.858, "validDurationSecs": 38803.189, "totalHrs": 14.12, "validHrs": 10.77 }, "yue": { "buckets": { "dev": 2537, "invalidated": 1656, "other": 52461, "reported": 1151, "test": 2560, "train": 3049, "validated": 20558 }, "duration": 301607460, "reportedSentences": 1152, "clips": 74675, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.29, "": 0.27, "twenties": 0.39, "fourties": 0.02, "sixties": 0.01, "fifties": 0, "teens": 0.02 }, "gender": { "male": 0.26, "": 0.3, "female": 0.43, "other": 0.01 } }, "users": 855, "size": 1700958956, "checksum": "4718340a0cfdb6300e8c52d6a1753ffd664cf225d390fd9a8ca1cd1bc722fca1", "avgDurationSecs": 4.039, "validDurationSecs": 83032.423, "totalHrs": 83.77, "validHrs": 23.06 }, "sah": { "buckets": { "dev": 1083, "invalidated": 102, "other": 150, "reported": 2, "test": 1280, "train": 1594, "validated": 4025 }, "duration": 25709196, "reportedSentences": 3, "clips": 4277, "splits": { "accent": { "": 1 }, "age": { "": 0.35, "twenties": 0.04, "fourties": 0.1, "thirties": 0.41, "teens": 0.1, "fifties": 0 }, "gender": { "": 0.35, 
"male": 0.51, "female": 0.14 } }, "users": 57, "size": 193421983, "checksum": "62b7b0ea86c70ca5d49db8e0d6a6d0fe6bf75af2de8c5593baea2132aa6e336b", "avgDurationSecs": 6.011, "validDurationSecs": 24194.415, "totalHrs": 7.14, "validHrs": 6.72 }, "mk": { "buckets": { "dev": 0, "invalidated": 11, "other": 59, "reported": 8, "test": 29, "train": 115, "validated": 144 }, "duration": 1132524, "reportedSentences": 9, "clips": 214, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.42, "": 0.35, "twenties": 0.16, "teens": 0.07 }, "gender": { "male": 0.65, "": 0.35 } }, "users": 8, "size": 6666651, "checksum": "a89749010f5eb7c35c6f1e72d81ce648f0b410fb46e3694c0fa4b7a968e10944", "avgDurationSecs": 5.292, "validDurationSecs": 762.072, "totalHrs": 0.31, "validHrs": 0.21 }, "sc": { "buckets": { "dev": 238, "invalidated": 26, "other": 299, "reported": 2, "test": 191, "train": 498, "validated": 1085 }, "duration": 6325632, "reportedSentences": 3, "clips": 1410, "splits": { "accent": { "": 1 }, "age": { "": 0.63, "thirties": 0.29, "twenties": 0.07 }, "gender": { "": 0.63, "female": 0.29, "male": 0.07 } }, "users": 14, "size": 36609129, "checksum": "2a4fa14b3581125721f4ea09fdf6543e104860fadaeb01e1bc874015409d761e", "avgDurationSecs": 4.486, "validDurationSecs": 4867.596, "totalHrs": 1.75, "validHrs": 1.35 }, "skr": { "buckets": { "dev": 1084, "invalidated": 438, "other": 1815, "reported": 16, "test": 899, "train": 1414, "validated": 3404 }, "duration": 23386284, "reportedSentences": 2, "clips": 5657, "splits": { "accent": { "": 1 }, "age": { "": 0.35, "twenties": 0.25, "fifties": 0.38, "teens": 0, "fourties": 0, "thirties": 0.01 }, "gender": { "": 0.35, "male": 0.65 } }, "users": 50, "size": 133855447, "checksum": "fbbbb280e556b030cce60279bcdad8766cf8a96dff986a5d938c4798823c47ff", "avgDurationSecs": 4.134, "validDurationSecs": 14072.284, "totalHrs": 6.49, "validHrs": 3.9 }, "ti": { "buckets": { "dev": 5, "invalidated": 1, "other": 0, "reported": 1, "test": 3, "train": 14, 
"validated": 22 }, "duration": 132912, "reportedSentences": 2, "clips": 23, "splits": { "accent": { "": 1 }, "age": { "": 1 }, "gender": { "": 1 } }, "users": 4, "size": 785563, "checksum": "aeeee172208ea737da50fdf64a4603e80cf8d93a19625b51e1382a04e7ea5c5d", "avgDurationSecs": 5.779, "validDurationSecs": 127.133, "totalHrs": 0.03, "validHrs": 0.03 }, "mrj": { "buckets": { "dev": 4388, "invalidated": 283, "other": 11733, "reported": 49, "test": 3799, "train": 7382, "validated": 15697 }, "duration": 115195644, "reportedSentences": 50, "clips": 27713, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.32, "": 0.11, "thirties": 0.25, "sixties": 0.04, "fourties": 0.14, "fifties": 0.13, "teens": 0.01 }, "gender": { "male": 0.26, "female": 0.63, "": 0.11 } }, "users": 49, "size": 639685226, "checksum": "4b05b82130789291a489ef6b83edf3a9cf8feed62b99927141c6a461dc0d3de4", "avgDurationSecs": 4.157, "validDurationSecs": 65248.296, "totalHrs": 31.99, "validHrs": 18.12 }, "tw": { "buckets": { "dev": 0, "invalidated": 0, "other": 193, "reported": 2, "test": 3, "train": 41, "validated": 44 }, "duration": 943884, "reportedSentences": 3, "clips": 237, "splits": { "accent": { "": 1 }, "age": { "": 0.07, "twenties": 0.02, "thirties": 0.91 }, "gender": { "": 0.07, "male": 0.93 } }, "users": 5, "size": 5530578, "checksum": "83191cb7ed177e37e0aa79e6e856f93f091e0ea25aa510a8528b824e78fa6a65", "avgDurationSecs": 3.983, "validDurationSecs": 175.236, "totalHrs": 0.26, "validHrs": 0.04 }, "ko": { "buckets": { "dev": 189, "invalidated": 164, "other": 1588, "reported": 15, "test": 190, "train": 241, "validated": 621 }, "duration": 14617656, "reportedSentences": 16, "clips": 2373, "splits": { "accent": { "": 1 }, "age": { "": 0.15, "thirties": 0.41, "twenties": 0.34, "teens": 0.05, "fourties": 0.04 }, "gender": { "": 0.15, "male": 0.71, "female": 0.13, "other": 0 } }, "users": 57, "size": 85151623, "checksum": "ab44f5b7a5ab820e9c9c96b332f8d597ad95bdf0c79896e6c8ddd92bdc6ddbb2", 
"avgDurationSecs": 6.16, "validDurationSecs": 3825.354, "totalHrs": 4.06, "validHrs": 1.06 }, "yo": { "buckets": { "dev": 859, "invalidated": 228, "other": 1101, "reported": 17, "test": 988, "train": 1199, "validated": 3048 }, "duration": 26030628, "reportedSentences": 18, "clips": 4377, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.02, "twenties": 0.63, "": 0.3, "teens": 0.05, "sixties": 0 }, "gender": { "male": 0.32, "": 0.3, "female": 0.38 } }, "users": 104, "size": 152023149, "checksum": "8808751ca2b725727bb2ace8a434bf86bfa8e3c5c04a97d4c2bb4d102c276b28", "avgDurationSecs": 5.947, "validDurationSecs": 18126.88, "totalHrs": 7.23, "validHrs": 5.03 }, "oc": { "buckets": { "dev": 209, "invalidated": 144, "other": 7699, "reported": 9, "test": 208, "train": 217, "validated": 1057 }, "duration": 43379748, "reportedSentences": 10, "clips": 8900, "splits": { "accent": { "": 1 }, "age": { "": 0.45, "fifties": 0.17, "twenties": 0.02, "thirties": 0.04, "seventies": 0, "fourties": 0.19, "teens": 0.01, "sixties": 0.12 }, "gender": { "": 0.47, "male": 0.09, "female": 0.44 } }, "users": 141, "size": 245458468, "checksum": "2b4f1a9f16be67704feda83a76acf2c82c594c8c00fc27dce02b14e5d14e9840", "avgDurationSecs": 4.874, "validDurationSecs": 5151.954, "totalHrs": 12.04, "validHrs": 1.43 }, "tk": { "buckets": { "dev": 326, "invalidated": 110, "other": 1494, "reported": 16, "test": 429, "train": 819, "validated": 1653 }, "duration": 17536212, "reportedSentences": 17, "clips": 3257, "splits": { "accent": { "": 1 }, "age": { "": 0.25, "twenties": 0.63, "thirties": 0.12 }, "gender": { "": 0.25, "male": 0.49, "female": 0.25 } }, "users": 68, "size": 99579843, "checksum": "71d172525ef7e523b8cc332482cd44522056f92ce09278d98c688a4230986c2b", "avgDurationSecs": 5.384, "validDurationSecs": 8900.018, "totalHrs": 4.87, "validHrs": 2.47 }, "ps": { "buckets": { "dev": 0, "invalidated": 7, "other": 153, "reported": 0, "test": 58, "train": 875, "validated": 933 }, "duration": 5387544, 
"reportedSentences": 1, "clips": 1093, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.95, "twenties": 0.02, "": 0.03 }, "gender": { "male": 0.97, "": 0.03 } }, "users": 5, "size": 31673292, "checksum": "1be3afd135baac5bd98e4b9dc6d282433b32f934fec89d167845a1c96589c003", "avgDurationSecs": 4.929, "validDurationSecs": 4598.882, "totalHrs": 1.49, "validHrs": 1.27 }, "sq": { "buckets": { "dev": 313, "invalidated": 9, "other": 6, "reported": 9, "test": 285, "train": 349, "validated": 951 }, "duration": 4448232, "reportedSentences": 10, "clips": 966, "splits": { "accent": { "": 1 }, "age": { "": 0.71, "twenties": 0.05, "thirties": 0.24 }, "gender": { "": 0.58, "male": 0.42 } }, "users": 35, "size": 25597119, "checksum": "a5f88175ba6a00200cc33cf69fb9c4992e4a010e27edb71f520c09fd1bb79ec3", "avgDurationSecs": 4.605, "validDurationSecs": 4379.16, "totalHrs": 1.23, "validHrs": 1.21 }, "ne-NP": { "buckets": { "dev": 83, "invalidated": 40, "other": 361, "reported": 2, "test": 129, "train": 192, "validated": 408 }, "duration": 3081852, "reportedSentences": 3, "clips": 809, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.21, "": 0.39, "twenties": 0.4 }, "gender": { "male": 0.62, "": 0.38 } }, "users": 21, "size": 17932620, "checksum": "b0ee0a5d3959a0b360a738d8f151fa384a14984a4cc8a4620dac226470607632", "avgDurationSecs": 3.809, "validDurationSecs": 1554.259, "totalHrs": 0.85, "validHrs": 0.43 }, "vot": { "buckets": { "dev": 0, "invalidated": 324, "other": 0, "test": 6, "train": 96, "validated": 102 }, "duration": 1025976, "clips": 426, "splits": { "accent": { "": 1 }, "age": { "": 0.25, "twenties": 0.73, "teens": 0.01 }, "gender": { "": 0.25, "male": 0.75 } }, "users": 5, "size": 7895744, "checksum": "3170871827252b6c37e629a24fe190a58c6d829dd49623621971364056e20009", "avgDurationSecs": 2.408, "validDurationSecs": 245.656, "totalHrs": 0.28, "validHrs": 0.06 }, "az": { "buckets": { "dev": 25, "invalidated": 36, "other": 162, "reported": 0, "test": 28, "train": 39, 
"validated": 92 }, "duration": 1554876, "clips": 290, "splits": { "accent": { "": 1 }, "age": { "": 0.42, "twenties": 0.35, "fourties": 0.01, "thirties": 0.22 }, "gender": { "": 0.42, "male": 0.58 } }, "users": 24, "size": 8957755, "checksum": "a9208b506fa1067ff79d677c7f7414c75b04a712ac5842535a9820b18f4d7d0c", "avgDurationSecs": 5.362, "validDurationSecs": 493.271, "totalHrs": 0.43, "validHrs": 0.13 }, "ast": { "buckets": { "dev": 94, "invalidated": 12, "other": 657, "test": 126, "train": 233, "validated": 453 }, "duration": 5096556, "clips": 1122, "splits": { "accent": { "": 1 }, "age": { "": 0.24, "fourties": 0.4, "fifties": 0.31, "teens": 0.05 }, "gender": { "": 0.24, "male": 0.75, "female": 0.01 } }, "users": 15, "size": 28682224, "checksum": "390a2062d490db8420f56e0a05b238f0cd5079318c095e310baa9c8c9abfefda", "avgDurationSecs": 4.542, "validDurationSecs": 2057.7, "totalHrs": 1.41, "validHrs": 0.57 }, "quy": { "buckets": { "dev": 0, "invalidated": 0, "other": 5, "test": 0, "train": 1, "validated": 1 }, "duration": 36792, "clips": 6, "splits": { "accent": { "": 1 }, "age": { "": 1 }, "gender": { "": 1 } }, "users": 2, "size": 217871, "checksum": "aeee1fe8d224b18351332e700d25c87ae23d4eea45769b94aa95f68184476c92", "avgDurationSecs": 6.132, "validDurationSecs": 6.132, "totalHrs": 0.01, "validHrs": 0 }, "lo": { "buckets": { "dev": 9, "invalidated": 7, "other": 55, "test": 10, "train": 28, "validated": 47 }, "duration": 724032, "clips": 109, "splits": { "accent": { "": 1 }, "age": { "": 0.1, "twenties": 0.61, "fourties": 0.01, "thirties": 0.28 }, "gender": { "": 0.1, "male": 0.9 } }, "users": 8, "size": 4152621, "checksum": "42f61a8754d0b8b9ff63b679ddd2fb89e4e84f8a43e8641aea8c87a72e85dd25", "avgDurationSecs": 6.642, "validDurationSecs": 312.197, "totalHrs": 0.2, "validHrs": 0.08 }, "dyu": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 1, "validated": 1 }, "duration": 6228, "clips": 1, "splits": { "accent": { "": 1 }, "age": { "": 1 }, 
"gender": { "": 1 } }, "users": 1, "size": 37537, "checksum": "081c25661ec11582fbef6d0b2292d8f0c2b4625f91ebe66ede85c6e699deacc4", "avgDurationSecs": 6.228, "validDurationSecs": 6.228, "totalHrs": 0, "validHrs": 0 }, "is": { "buckets": { "dev": 0, "invalidated": 1, "other": 6, "test": 0, "train": 8, "validated": 8 }, "duration": 110016, "clips": 15, "splits": { "accent": { "": 1 }, "age": { "": 1 }, "gender": { "": 1 } }, "users": 2, "size": 645317, "checksum": "c1dfa11a035c1d36ab48abec4c8d3e99c589c71a6d03a954085397a76b01f2b6", "avgDurationSecs": 7.334, "validDurationSecs": 58.675, "totalHrs": 0.03, "validHrs": 0.01 }, "am": { "buckets": { "dev": 59, "invalidated": 14, "other": 817, "test": 8, "train": 121, "validated": 188 }, "duration": 6059484, "clips": 1019, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.84, "": 0.12, "twenties": 0.04 }, "gender": { "male": 0.88, "": 0.12 } }, "users": 11, "size": 35497692, "checksum": "eca0d3cd5b5365b394baa73b48c11de9ca92568be6478861dda55aa0e5ada80f", "avgDurationSecs": 5.947, "validDurationSecs": 1117.942, "totalHrs": 1.68, "validHrs": 0.31 }, "zgh": { "buckets": { "dev": 0, "invalidated": 6, "other": 2, "test": 3, "train": 57, "validated": 60 }, "duration": 243036, "clips": 68, "splits": { "accent": { "": 1 }, "age": { "": 1 }, "gender": { "": 1 } }, "users": 3, "size": 1420231, "checksum": "c13e4b82ce6f5a2cc62d18b8312c067c617cb8d3d55a9212af6291fcea8b663f", "avgDurationSecs": 3.574, "validDurationSecs": 214.444, "totalHrs": 0.06, "validHrs": 0.05 } }, "totalDuration": 101224789375, "totalValidDurationSecs": 67144231, "totalHrs": 28117, "totalValidHrs": 18651 } ================================================ FILE: datasets/scripted-speech/cv-corpus-14.0-delta-2023-06-23.json ================================================ { "locales": { "en": { "duration": 252536976, "reportedSentences": 1711, "clips": 43378, "users": 1212, "size": 1536033540, "checksum": 
"6e88c7460090c5a6ca7f02a8525bd669e7bc509a47b2f1974977d5065a054507", "avgDurationSecs": 5.186, "validDurationSecs": 201456.93, "totalHrs": 70.15, "validHrs": 55.96 }, "fa": { "duration": 12514068, "reportedSentences": 31, "clips": 3005, "users": 151, "size": 71885724, "checksum": "f52db225ad13fafdbe6a08ba2d9ec66e4234c381272f541f178c97dc2d72074a", "avgDurationSecs": 3.989, "validDurationSecs": 10300.44, "totalHrs": 3.47, "validHrs": 2.87 }, "fr": { "duration": 45118512, "reportedSentences": 48, "clips": 8778, "users": 333, "size": 292069367, "checksum": "6f706a46135b987ecbd06457ea5986a7020821adc3afd90b982e19ff59b8eed9", "avgDurationSecs": 5.007, "validDurationSecs": 61100.61, "totalHrs": 12.53, "validHrs": 16.97 }, "es": { "duration": 81830340, "reportedSentences": 100, "clips": 15725, "users": 165, "size": 505105721, "checksum": "c3861608ed5a177118cd4d7c6c9ea06fa96824bb3dc03a673b1275d67a7b7b19", "avgDurationSecs": 4.867, "validDurationSecs": 76870.7, "totalHrs": 22.73, "validHrs": 21.36 }, "sl": { "duration": 66384, "reportedSentences": 0, "clips": 16, "users": 3, "size": 570822, "checksum": "6c2430cd32e00992fb10191165eb0e74dfc11add3f783e4285885d88d083c7f1", "avgDurationSecs": 3.779, "validDurationSecs": 246.57, "totalHrs": 0.01, "validHrs": 0.06 }, "kab": { "duration": 5537520, "reportedSentences": 230, "clips": 1589, "users": 15, "size": 39080967, "checksum": "2049a98155ec7cdf55f1f6f13083c87c98df2ccefe5f90f3081fa5fb1d885389", "avgDurationSecs": 3.337, "validDurationSecs": 1134.33, "totalHrs": 1.53, "validHrs": 0.32 }, "cy": { "duration": 5113764, "reportedSentences": 8, "clips": 841, "users": 27, "size": 31347387, "checksum": "26ef48296570550e5c493d0e34fa4199307642c3283d3cf61e412d479ebb4495", "avgDurationSecs": 4.879, "validDurationSecs": 2690.89, "totalHrs": 1.43, "validHrs": 0.75 }, "ca": { "duration": 947659356, "reportedSentences": 269, "clips": 198492, "users": 2687, "size": 5279684152, "checksum": 
"b6f054ad545b2d42a0a6ef89426ccaffaf29de75d66d62118fa5a5dcb251da4a", "avgDurationSecs": 5.403, "validDurationSecs": 1388268.3, "totalHrs": 263.24, "validHrs": 385.63 }, "de": { "duration": 131262840, "reportedSentences": 477, "clips": 23028, "users": 320, "size": 800558449, "checksum": "36947cfb42f3ce87d8badd0d0c1b8eafb164ffd0f3a4d9b55477d6db9e229cea", "avgDurationSecs": 5.222, "validDurationSecs": 119570.43, "totalHrs": 36.46, "validHrs": 33.22 }, "tt": { "duration": 597456, "reportedSentences": 1, "clips": 128, "users": 7, "size": 4025067, "checksum": "cd90cd5db0ef158bc1b0a3f7604bbae34bef2f08537239fd8454248fdf43658f", "avgDurationSecs": 3.76, "validDurationSecs": 655.88, "totalHrs": 0.16, "validHrs": 0.18 }, "ta": { "duration": 10363752, "reportedSentences": 10, "clips": 1635, "users": 8, "size": 65750498, "checksum": "9d31279a433de353e535c7b4f324e7bfa83faca89797c94a707e403984e857f1", "avgDurationSecs": 6.215, "validDurationSecs": 1796.52, "totalHrs": 2.87, "validHrs": 0.5 }, "ru": { "duration": 30301992, "reportedSentences": 7, "clips": 5596, "users": 100, "size": 180512978, "checksum": "8cdacdf2f67ae79ffc6650524484ff9741776ec52de35fa74b6ee70d772feadf", "avgDurationSecs": 5.157, "validDurationSecs": 39283.23, "totalHrs": 8.42, "validHrs": 10.91 }, "nl": { "duration": 8835480, "reportedSentences": 2, "clips": 1888, "users": 25, "size": 54150193, "checksum": "9f465fd2e9a3c10815e04520230fc805b8a833246842386cc7750f2d6f359072", "avgDurationSecs": 4.347, "validDurationSecs": 7112.91, "totalHrs": 2.46, "validHrs": 1.98 }, "it": { "duration": 27679212, "reportedSentences": 148, "clips": 4705, "users": 49, "size": 171218971, "checksum": "bb1ea6b826ec4f133b7b3c1437b0de91070a61069dad6cb6a8f4330fe67d0a8f", "avgDurationSecs": 5.375, "validDurationSecs": 14583.35, "totalHrs": 7.69, "validHrs": 4.05 }, "eu": { "duration": 9623772, "reportedSentences": 1, "clips": 1613, "users": 71, "size": 56722207, "checksum": "be4468215bbc50608429011efd71108709ee6ad454bda5d923e35e09fd0111c6", 
"avgDurationSecs": 5.201, "validDurationSecs": 3693.49, "totalHrs": 2.67, "validHrs": 1.02 }, "tr": { "duration": 24045048, "reportedSentences": 18, "clips": 6948, "users": 50, "size": 140314117, "checksum": "dc65972975dfa8ab5c768884b3b5628d480b7f276e90f5c28e72556141fdfd9b", "avgDurationSecs": 3.621, "validDurationSecs": 23655.52, "totalHrs": 6.68, "validHrs": 6.57 }, "ar": { "duration": 19470231, "reportedSentences": 24, "clips": 4746, "users": 59, "size": 115956605, "checksum": "367f81d44b086e782f3e4f74a442e345be143c3a310bc9e97ae0b09a42f5f94a", "avgDurationSecs": 4.167, "validDurationSecs": 2909, "totalHrs": 5.41, "validHrs": 0.81 }, "zh-TW": { "duration": 11600892, "reportedSentences": 0, "clips": 2766, "users": 30, "size": 48943304, "checksum": "4104e2afb207d0cf34b6f44221c07056c2b77c8d3dd19647408d09d4e217c6b2", "avgDurationSecs": 3.347, "validDurationSecs": 5867.01, "totalHrs": 3.22, "validHrs": 1.63 }, "br": { "duration": 1746360, "reportedSentences": 5, "clips": 509, "users": 8, "size": 10453749, "checksum": "28083359eb7f9a0de516121b31f98be9402b8846e0d2207d8d681b2ca7e9b8fd", "avgDurationSecs": 3.14, "validDurationSecs": 2522.94, "totalHrs": 0.49, "validHrs": 0.7 }, "pt": { "duration": 16226676, "reportedSentences": 22, "clips": 3857, "users": 82, "size": 98084629, "checksum": "195ee8d04eb501074b2a5d89f5708d4f9ecfb8fd40fd608582f85c1ef63b7447", "avgDurationSecs": 4.082, "validDurationSecs": 40498.85, "totalHrs": 4.5, "validHrs": 11.25 }, "eo": { "duration": 26174603, "reportedSentences": 9, "clips": 4982, "users": 39, "size": 180495565, "checksum": "8e88986f6477e345b886e21d2276887db1b64f13dd927a0ceab5a7099d48c42b", "avgDurationSecs": 6.064, "validDurationSecs": -1226.91, "totalHrs": 7.27, "validHrs": -0.34 }, "zh-CN": { "duration": 9183060, "reportedSentences": 37, "clips": 2341, "users": 97, "size": 61968173, "checksum": "d51e898f102d51d37c81119e6dba5f930c806763e1270b0a410f3e1fbeb4faf5", "avgDurationSecs": 4.549, "validDurationSecs": 13776.32, "totalHrs": 
2.55, "validHrs": 3.83 }, "id": { "duration": 1223640, "reportedSentences": -1, "clips": 300, "users": 15, "size": 7579543, "checksum": "e944f8f641003f19a2923f721dd1334771dc47a7e963e66b43a448ad549bc5f8", "avgDurationSecs": 3.991, "validDurationSecs": 745.94, "totalHrs": 0.34, "validHrs": 0.2 }, "ia": { "duration": 131832, "reportedSentences": 0, "clips": 25, "users": 2, "size": 1038593, "checksum": "471323b3630a5b874574acf7cf977eecd3bdb062adc17b2a5e9829b107717204", "avgDurationSecs": 4.184, "validDurationSecs": 155.59, "totalHrs": 0.04, "validHrs": 0.04 }, "lv": { "duration": 426228768, "reportedSentences": 142, "clips": 107238, "users": 2075, "size": 2280290800, "checksum": "ed3a91981b2c03a6889046f9ddb84c732fef43c7d8353e81f284f21967e40732", "avgDurationSecs": 3.927, "validDurationSecs": 198100.7, "totalHrs": 118.4, "validHrs": 55.03 }, "ja": { "duration": 191064492, "reportedSentences": 7, "clips": 32774, "users": 77, "size": 1129142820, "checksum": "09d32dfcf4586de728a1c41813f5a33cc0f290bdcf1a7367ee36330e014585cc", "avgDurationSecs": 4.8, "validDurationSecs": 16831.62, "totalHrs": 53.07, "validHrs": 4.68 }, "rw": { "duration": 309996, "reportedSentences": 0, "clips": 96, "users": 11, "size": 44091464, "checksum": "a0e24547c06863d214fdad1c171ef52deebd8daf641e3493a7c281b89441fd66", "avgDurationSecs": 5.008, "validDurationSecs": 107.05, "totalHrs": 0.09, "validHrs": 0.03 }, "sv-SE": { "duration": 861192, "reportedSentences": 1, "clips": 188, "users": 10, "size": 5886345, "checksum": "4aada51de5ea008b5ff87f79c6109cea2acd778e8760ab28c4ef286b1412121d", "avgDurationSecs": 3.98, "validDurationSecs": 1161.21, "totalHrs": 0.24, "validHrs": 0.32 }, "cnh": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 96516, "checksum": "9e7853df7950f63697b97f9ae766568fa10c895aed6930d46813ee05954f5d75", "avgDurationSecs": 3.564, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "et": { "duration": 203436, "reportedSentences": 0, "clips": 25, "users": 3, 
"size": 1943461, "checksum": "76a93e1ae4d97d1134aca416623edfbc79f2a99d780d3e74be8d4053841eaa39", "avgDurationSecs": 6.76, "validDurationSecs": 336.83, "totalHrs": 0.05, "validHrs": 0.1 }, "ky": { "duration": 137808, "reportedSentences": 0, "clips": 47, "users": 7, "size": 1033516, "checksum": "9437dde385748e74da489819c9965a5564f47a24553bfa20905bdaecc3dfc29a", "avgDurationSecs": 4.536, "validDurationSecs": 149.79, "totalHrs": 0.04, "validHrs": 0.04 }, "ro": { "duration": 758340, "reportedSentences": 2, "clips": 163, "users": 13, "size": 5010831, "checksum": "b283ac16b9005840c983f85ace7d5a8b59ce1475dfd17596a9ae24d0db463595", "avgDurationSecs": 3.996, "validDurationSecs": 1735.32, "totalHrs": 0.21, "validHrs": 0.48 }, "hsb": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 71597, "checksum": "5d9b33a0f183663d905b088c54560f424da312df5831d4b87f3c09f1ff39abd5", "avgDurationSecs": 6.12, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "el": { "duration": 692388, "reportedSentences": 0, "clips": 160, "users": 8, "size": 4230252, "checksum": "2e27cc0e3a21b25fd8eb25f602913bfbc8ffb9dd9e4c4f6761fb0b28e601f941", "avgDurationSecs": 4.134, "validDurationSecs": 406.71, "totalHrs": 0.19, "validHrs": 0.12 }, "cs": { "duration": 9712872, "reportedSentences": 18, "clips": 2171, "users": 34, "size": 60377453, "checksum": "a3f4eaa88e531f27a42043ac20d160a4400defd629432af24071b709b545a8be", "avgDurationSecs": 4.462, "validDurationSecs": 4107.88, "totalHrs": 2.69, "validHrs": 1.14 }, "pl": { "duration": 2486088, "reportedSentences": 2, "clips": 422, "users": 25, "size": 17151051, "checksum": "ee7fb61a9aa9da3d81027afe5b74639b463ac369f0809389e024bab4ed7d4d71", "avgDurationSecs": 4.508, "validDurationSecs": 11661.24, "totalHrs": 0.69, "validHrs": 3.24 }, "rm-sursilv": { "duration": 475992, "reportedSentences": 0, "clips": 109, "users": 3, "size": 2977898, "checksum": "90ecd3bfa39d2b5a070d66203264c250a0d126c7ee108d13f5895cb4593b5e37", "avgDurationSecs": 5.464, 
"validDurationSecs": -71.98, "totalHrs": 0.13, "validHrs": -0.02 }, "rm-vallader": { "duration": 179388, "reportedSentences": 1, "clips": 25, "users": 0, "size": 1130130, "checksum": "0fa59d0db83a06ee970a076c37d9b022eb448424b747e0da84ffff490b326850", "avgDurationSecs": 5.824, "validDurationSecs": 176.71, "totalHrs": 0.05, "validHrs": 0.05 }, "mn": { "duration": 6178644, "reportedSentences": 0, "clips": 1233, "users": 17, "size": 35830504, "checksum": "eb5db8ca620505be8688de1d1e181813bf5c548a6c9e2f21cada91998bc26876", "avgDurationSecs": 5.441, "validDurationSecs": 198.76, "totalHrs": 1.72, "validHrs": 0.06 }, "zh-HK": { "duration": 2765736, "reportedSentences": 2, "clips": 758, "users": 11, "size": 17611415, "checksum": "920147b28c6709076047753b40c7d66fdc1f1c1ef301c1f496a54d4ff426239d", "avgDurationSecs": 4.182, "validDurationSecs": 1755.19, "totalHrs": 0.77, "validHrs": 0.49 }, "ab": { "duration": 0, "reportedSentences": 4, "clips": 0, "users": 0, "size": 1932632, "checksum": "935e46dcc3cb293f2d81dc0a877a9dc25d08c08fb248aacb0d5c0bf9ea49bb31", "avgDurationSecs": 5.133, "validDurationSecs": 20.53, "totalHrs": 0, "validHrs": 0 }, "cv": { "duration": 711108, "reportedSentences": 0, "clips": 155, "users": 4, "size": 4469584, "checksum": "cbc9adc419a32634b05ffa6a844c336f65dc9ab91c8b7dfc20e14cf42aece1dc", "avgDurationSecs": 5.059, "validDurationSecs": 2334.71, "totalHrs": 0.2, "validHrs": 0.65 }, "uk": { "duration": 14219424, "reportedSentences": 16, "clips": 3237, "users": 71, "size": 83549850, "checksum": "2ab9444796cee58a6405800426c13f8f88170efe6eed746b2622494328498d03", "avgDurationSecs": 4.741, "validDurationSecs": 13627.05, "totalHrs": 3.95, "validHrs": 3.79 }, "mt": { "duration": 26388, "reportedSentences": 0, "clips": 5, "users": 1, "size": 414443, "checksum": "1c4aea8906220887fd0bbe353c294dd48c3337399dfeffe20893e2ce45c0d095", "avgDurationSecs": 4.741, "validDurationSecs": 34.51, "totalHrs": 0, "validHrs": 0.01 }, "as": { "duration": 28944, "reportedSentences": 0, 
"clips": 5, "users": 1, "size": 248990, "checksum": "4f7d8378367c1fdafb981c2a4d3af3c7229da5cb5bfca054d7f4454d0a19f141", "avgDurationSecs": 5.69, "validDurationSecs": 256.46, "totalHrs": 0.01, "validHrs": 0.07 }, "ka": { "duration": 298143216, "reportedSentences": 388, "clips": 58880, "users": 642, "size": 1725882025, "checksum": "4ca1e6d7297f3838187e664504875e22f05a218534d584fa57b617d6977b84c0", "avgDurationSecs": 5.152, "validDurationSecs": 209142.78, "totalHrs": 82.82, "validHrs": 58.1 }, "fy-NL": { "duration": 3650004, "reportedSentences": 1, "clips": 751, "users": 31, "size": 21576164, "checksum": "4c86bad927f7c0ba31338c6bb136ca46017bd1327bbf3d91fd4835651ee870ea", "avgDurationSecs": 4.853, "validDurationSecs": 4568.34, "totalHrs": 1.02, "validHrs": 1.27 }, "dv": { "duration": 808452, "reportedSentences": 0, "clips": 165, "users": 8, "size": 5352930, "checksum": "6363db3d7c40dc7195c32b8be239304ccdceca87e0187cbbf9eb0632b6389957", "avgDurationSecs": 5.059, "validDurationSecs": 2164.73, "totalHrs": 0.22, "validHrs": 0.6 }, "pa-IN": { "duration": 75348, "reportedSentences": 0, "clips": 16, "users": 3, "size": 503442, "checksum": "b0be152800bbef88c6d357091c8af98daff40b7fa74ac20c500dae9c65a6f66b", "avgDurationSecs": 4.823, "validDurationSecs": 153.44, "totalHrs": 0.02, "validHrs": 0.04 }, "vi": { "duration": 997632, "reportedSentences": 0, "clips": 258, "users": 12, "size": 5936934, "checksum": "b2820fe318bf10d54b920b3c4438b80dd4e5c9f897f389796572581dc8327dd8", "avgDurationSecs": 3.966, "validDurationSecs": 472.56, "totalHrs": 0.28, "validHrs": 0.13 }, "or": { "duration": 3902040, "reportedSentences": 3, "clips": 829, "users": 31, "size": 22165077, "checksum": "bab44fa0396c3fdda8ed5921b871f76de264585c0cf33197527c42bfb0ba3c9a", "avgDurationSecs": 4.997, "validDurationSecs": 8307.15, "totalHrs": 1.09, "validHrs": 2.31 }, "ga-IE": { "duration": 93420, "reportedSentences": 0, "clips": 23, "users": 3, "size": 642081, "checksum": 
"d8b0fc5a984ef4c622ef10347030ca83c1d9f80c494940038c0cfbb589053a09", "avgDurationSecs": 3.572, "validDurationSecs": 109.17, "totalHrs": 0.03, "validHrs": 0.03 }, "fi": { "duration": 1877940, "reportedSentences": 0, "clips": 396, "users": 4, "size": 10747364, "checksum": "80b30a743d8b567c6d824ed02f4d5e2fed89dfc9b3c5ca855b36972d954a651c", "avgDurationSecs": 4.604, "validDurationSecs": 2144.85, "totalHrs": 0.52, "validHrs": 0.6 }, "hu": { "duration": 242901504, "reportedSentences": 72, "clips": 42775, "users": 633, "size": 1384929821, "checksum": "2e7b2aa9aa70917b3ec210c1ab93eb104a038de0ecbe5296cd734dc7366fa19d", "avgDurationSecs": 5.456, "validDurationSecs": 123406.83, "totalHrs": 67.47, "validHrs": 34.28 }, "th": { "duration": 9557892, "reportedSentences": 23, "clips": 2361, "users": 30, "size": 60532869, "checksum": "30b1e7378a39b54f6a90f87a4dd4613f078de9ee35208084e5b0fb17819b256a", "avgDurationSecs": 4.192, "validDurationSecs": 8524.02, "totalHrs": 2.66, "validHrs": 2.37 }, "lt": { "duration": 1523124, "reportedSentences": 3, "clips": 335, "users": 9, "size": 9579072, "checksum": "779a654d61aa4c6df1dc068a02e45849608934a343d4905aeac2c02931924011", "avgDurationSecs": 5.131, "validDurationSecs": 3838.42, "totalHrs": 0.42, "validHrs": 1.07 }, "lg": { "duration": 273348, "reportedSentences": 0, "clips": 42, "users": 2, "size": 6831078, "checksum": "83aca077d149596ee9902353a1556e61a99782934a48e11ef4be012133c7a4a0", "avgDurationSecs": 5.785, "validDurationSecs": 214.64, "totalHrs": 0.08, "validHrs": 0.06 }, "hi": { "duration": 1023300, "reportedSentences": 0, "clips": 183, "users": 18, "size": 6489390, "checksum": "5da9d205060550d7f47026fe2762e7d97df4af2a2a518c1697fe45846c48dd30", "avgDurationSecs": 4.86, "validDurationSecs": 1323.23, "totalHrs": 0.29, "validHrs": 0.37 }, "bas": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 47757, "checksum": "5e8b9893ecbeea88130d16cd3e2caf61c081358852f3ca738f03cf01a7bf47ee", "avgDurationSecs": 4.431, 
"validDurationSecs": 22.15, "totalHrs": 0, "validHrs": 0 }, "sk": { "duration": 6422256, "reportedSentences": 13, "clips": 1690, "users": 28, "size": 37357203, "checksum": "197ea34c5c565470c0ad19f2cedbdacfe232ebdce88b2ae9e5ef0bba21649227", "avgDurationSecs": 4.075, "validDurationSecs": 3596.97, "totalHrs": 1.79, "validHrs": 1 }, "kmr": { "duration": 5367744, "reportedSentences": 3, "clips": 1191, "users": 5, "size": 32042461, "checksum": "d112d882472a280d3656924d0d5592f3ff245dfe79e4ecece729d7aa494e4d34", "avgDurationSecs": 4.054, "validDurationSecs": 2031.72, "totalHrs": 1.49, "validHrs": 0.56 }, "bg": { "duration": 5499324, "reportedSentences": 3, "clips": 1019, "users": 10, "size": 32683396, "checksum": "bcdcee16c60a4192de13116e18d4f243ecd93f0042dde55db49dead0bd324f1b", "avgDurationSecs": 5.47, "validDurationSecs": 1447.99, "totalHrs": 1.53, "validHrs": 0.4 }, "kk": { "duration": 71856, "reportedSentences": 0, "clips": 15, "users": 2, "size": 471090, "checksum": "b3f9897e2eadcaf49df76130b62ace0342ad3ce36d84ad52fbd54584b805a167", "avgDurationSecs": 4.926, "validDurationSecs": 8.1, "totalHrs": 0.02, "validHrs": 0 }, "ba": { "duration": 1080432, "reportedSentences": 0, "clips": 239, "users": 4, "size": 12661228, "checksum": "4ef60613767d37170b8d226cff695b7d3692c0e60116eba0a215a6fb250943fe", "avgDurationSecs": 4.427, "validDurationSecs": 814.01, "totalHrs": 0.3, "validHrs": 0.23 }, "gl": { "duration": 3402432, "reportedSentences": 6, "clips": 777, "users": 10, "size": 20799663, "checksum": "634fbaa19d8a30297317fac63934a883bf4b619e57be1800c3ea3726af9f971e", "avgDurationSecs": 4.927, "validDurationSecs": 13069.4, "totalHrs": 0.94, "validHrs": 3.63 }, "ug": { "duration": 7534332, "reportedSentences": 0, "clips": 1142, "users": 39, "size": 41990543, "checksum": "d59188dec224850bce3e656df2681b923b93801f9c6777072b1ac642f7c64f8d", "avgDurationSecs": 6.031, "validDurationSecs": 7052.89, "totalHrs": 2.1, "validHrs": 1.95 }, "hy-AM": { "duration": 63612, "reportedSentences": 
0, "clips": 10, "users": 2, "size": 455501, "checksum": "ce13e81be33bc2340e03ff9d8267362b7f706245054d49fe6a0ee52317d187bd", "avgDurationSecs": 6.11, "validDurationSecs": 264.03, "totalHrs": 0.02, "validHrs": 0.07 }, "be": { "duration": 263454768, "reportedSentences": 1, "clips": 54929, "users": 110, "size": 1535836728, "checksum": "8f3f4cbcd01b80bb9c26abbc2bba42833f8cc59b80705303ef28b4d114e73b20", "avgDurationSecs": 4.763, "validDurationSecs": 612298.61, "totalHrs": 73.19, "validHrs": 170.09 }, "ur": { "duration": 1932552, "reportedSentences": 6, "clips": 363, "users": 5, "size": 12456667, "checksum": "d1d136604caace2cd1ee3d4cc6493de0ac81e9e72c7a06f468d4852982029ab9", "avgDurationSecs": 3.889, "validDurationSecs": 610.43, "totalHrs": 0.53, "validHrs": 0.17 }, "gn": { "duration": 18996300, "reportedSentences": 0, "clips": 4410, "users": 25, "size": 106647748, "checksum": "85081c02843e46f01a25a8b478f9bbdb38e919fc7760cf49f2e39723c6eec9a0", "avgDurationSecs": 4.257, "validDurationSecs": 965.67, "totalHrs": 5.27, "validHrs": 0.27 }, "sr": { "duration": 1962720, "reportedSentences": 1, "clips": 544, "users": 3, "size": 5961381, "checksum": "859301ec5ca2233aebfe68f61dbd5a029ce3f26eb25b86a1f6a38fa77818a2f5", "avgDurationSecs": 2.884, "validDurationSecs": 1147.59, "totalHrs": 0.55, "validHrs": 0.32 }, "uz": { "duration": 1940688, "reportedSentences": 3, "clips": 403, "users": 25, "size": 15061728, "checksum": "40c5f7e5df4f04168f21e7a7d974a4a2fac981ab8c4f0086e372bb0450d5d66a", "avgDurationSecs": 4.148, "validDurationSecs": 902.21, "totalHrs": 0.54, "validHrs": 0.25 }, "mr": { "duration": 296208, "reportedSentences": 0, "clips": 42, "users": 4, "size": 2123119, "checksum": "82a4a94737b7fb7fa9cccbc6ff85d3376f1bc2aa7511f91c7c6762f8dbb21cdb", "avgDurationSecs": 6.195, "validDurationSecs": 67.94, "totalHrs": 0.09, "validHrs": 0.02 }, "da": { "duration": 1361952, "reportedSentences": 22, "clips": 367, "users": 8, "size": 8210739, "checksum": 
"5adef6e485333aeccd60cd0c1b26df06e20d4a4d1b96103891112097aeae9f3f", "avgDurationSecs": 4.148, "validDurationSecs": 1845.37, "totalHrs": 0.37, "validHrs": 0.52 }, "myv": { "duration": 244692, "reportedSentences": 0, "clips": 35, "users": 1, "size": 1307284, "checksum": "3874606a26f5e3b1d9930e7a09c950ebe4b1ed982bb7ca90010343ef145285fa", "avgDurationSecs": 5.774, "validDurationSecs": 226.91, "totalHrs": 0.06, "validHrs": 0.07 }, "nn-NO": { "duration": 1330416, "reportedSentences": 0, "clips": 325, "users": 3, "size": 7831968, "checksum": "bef79f16962d5d1e198f79faca68d808324803ad549c2ed6fa6bd5bf83d5b378", "avgDurationSecs": 4.436, "validDurationSecs": 686.35, "totalHrs": 0.37, "validHrs": 0.19 }, "ha": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 120496, "checksum": "e32413abf57ded4f063e7792e228c907214d776de50a21a5501a81aba19e5d2a", "avgDurationSecs": 4.333, "validDurationSecs": 13, "totalHrs": 0, "validHrs": 0 }, "ckb": { "duration": 1941912, "reportedSentences": 4, "clips": 421, "users": 14, "size": 12381270, "checksum": "cbf33af7ae77d6b524ebc9c42ae3d7523d9c90233d46f65a4064df8a15335038", "avgDurationSecs": 3.909, "validDurationSecs": 1623.09, "totalHrs": 0.54, "validHrs": 0.45 }, "ml": { "duration": 14249628, "reportedSentences": 4, "clips": 3363, "users": 88, "size": 78967729, "checksum": "139a64e20d26b0acf8be00914c4cde380a230326b4e6149d7e6cdc8a78aea6f0", "avgDurationSecs": 4.152, "validDurationSecs": 6481.09, "totalHrs": 3.96, "validHrs": 1.8 }, "mdf": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 14057, "checksum": "3f815a63c4aec0e71a338db3b4432586dac4ca79588eae3bf3b17809725cd22d", "avgDurationSecs": 5.25, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "sw": { "duration": 25808616, "reportedSentences": 1, "clips": 4632, "users": 74, "size": 153775592, "checksum": "413dc644173112f5710830a63e19e3dbf2f5f0549f4f2375db45e779ee613ae9", "avgDurationSecs": 5.264, "validDurationSecs": 33627.05, "totalHrs": 
7.17, "validHrs": 9.34 }, "sat": { "duration": 51804, "reportedSentences": 0, "clips": 15, "users": 0, "size": 313332, "checksum": "43783cb0e3716cad8aea0cc1b648ad1f67f0da25298b5cf1928abd160a43bea4", "avgDurationSecs": 4.462, "validDurationSecs": 103.39, "totalHrs": 0.02, "validHrs": 0.03 }, "tig": { "duration": 1517004, "reportedSentences": 0, "clips": 298, "users": 4, "size": 8919679, "checksum": "14f0c46fd4d94ac15603778d3d29af54984625603fcbe168346e12ee8ffac292", "avgDurationSecs": 5.048, "validDurationSecs": 157.56, "totalHrs": 0.43, "validHrs": 0.04 }, "ig": { "duration": 4248, "reportedSentences": 0, "clips": 1, "users": 1, "size": 46158, "checksum": "420b3c6e1b3dd71892e8a124d5377b2443a3324ac70348437b44ced8bbe92b88", "avgDurationSecs": 5.464, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "nan-tw": { "duration": 5163840, "reportedSentences": 8, "clips": 1932, "users": 17, "size": 32670028, "checksum": "e099c9e8491a3e098901f543ba4974e80a969137494cebbb48923ce606d0d6d9", "avgDurationSecs": 2.669, "validDurationSecs": 3276.67, "totalHrs": 1.43, "validHrs": 0.92 }, "mhr": { "duration": 132871356, "reportedSentences": 12, "clips": 28274, "users": 30, "size": 749462427, "checksum": "7ccc5dbb87bbd8a97b6daeead741c7222c1f11b09a3bcc65b47068ffa81ec74d", "avgDurationSecs": 4.65, "validDurationSecs": 98235.27, "totalHrs": 36.91, "validHrs": 27.29 }, "bn": { "duration": 4500144, "reportedSentences": 6, "clips": 817, "users": 38, "size": 35758950, "checksum": "078bba82edefafd1d49d5b415149e0f2831666d8f11894ad3543114b512c9d18", "avgDurationSecs": 4.363, "validDurationSecs": 2215.91, "totalHrs": 1.25, "validHrs": 0.61 }, "tok": { "duration": 7288092, "reportedSentences": 10, "clips": 1651, "users": 10, "size": 42873395, "checksum": "06512dd33af375c9cf3fea1a7c6c1e77cd6f4409a212d7b0ca7cf91b46fb149b", "avgDurationSecs": 3.858, "validDurationSecs": 4670.24, "totalHrs": 2.02, "validHrs": 1.29 }, "yue": { "duration": 50258592, "reportedSentences": 288, "clips": 13083, "users": 
28, "size": 294268308, "checksum": "4718340a0cfdb6300e8c52d6a1753ffd664cf225d390fd9a8ca1cd1bc722fca1", "avgDurationSecs": 4.039, "validDurationSecs": 1084.5, "totalHrs": 13.96, "validHrs": 0.3 }, "sah": { "duration": 968040, "reportedSentences": 0, "clips": 159, "users": 2, "size": 5919339, "checksum": "62b7b0ea86c70ca5d49db8e0d6a6d0fe6bf75af2de8c5593baea2132aa6e336b", "avgDurationSecs": 6.011, "validDurationSecs": 72.09, "totalHrs": 0.27, "validHrs": 0.02 }, "mk": { "duration": 22176, "reportedSentences": 0, "clips": 5, "users": 1, "size": 139622, "checksum": "a89749010f5eb7c35c6f1e72d81ce648f0b410fb46e3694c0fa4b7a968e10944", "avgDurationSecs": 5.292, "validDurationSecs": 12.99, "totalHrs": 0.01, "validHrs": 0.01 }, "sc": { "duration": 389376, "reportedSentences": 0, "clips": 65, "users": 2, "size": 2327099, "checksum": "2a4fa14b3581125721f4ea09fdf6543e104860fadaeb01e1bc874015409d761e", "avgDurationSecs": 4.486, "validDurationSecs": 295.13, "totalHrs": 0.11, "validHrs": 0.08 }, "skr": { "duration": 334980, "reportedSentences": 0, "clips": 70, "users": 9, "size": 2001349, "checksum": "fbbbb280e556b030cce60279bcdad8766cf8a96dff986a5d938c4798823c47ff", "avgDurationSecs": 4.134, "validDurationSecs": 902.47, "totalHrs": 0.09, "validHrs": 0.25 }, "ti": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 806, "checksum": "aeeee172208ea737da50fdf64a4603e80cf8d93a19625b51e1382a04e7ea5c5d", "avgDurationSecs": 5.779, "validDurationSecs": 52.01, "totalHrs": 0, "validHrs": 0.01 }, "mrj": { "duration": 11119176, "reportedSentences": 0, "clips": 2136, "users": 1, "size": 64195753, "checksum": "4b05b82130789291a489ef6b83edf3a9cf8feed62b99927141c6a461dc0d3de4", "avgDurationSecs": 4.157, "validDurationSecs": 3702.51, "totalHrs": 3.08, "validHrs": 1.03 }, "tw": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 1216, "checksum": "83191cb7ed177e37e0aa79e6e856f93f091e0ea25aa510a8528b824e78fa6a65", "avgDurationSecs": 3.983, 
"validDurationSecs": 35.84, "totalHrs": 0, "validHrs": 0.01 }, "ko": { "duration": 562032, "reportedSentences": 2, "clips": 101, "users": 15, "size": 3322163, "checksum": "ab44f5b7a5ab820e9c9c96b332f8d597ad95bdf0c79896e6c8ddd92bdc6ddbb2", "avgDurationSecs": 6.16, "validDurationSecs": 1171.36, "totalHrs": 0.16, "validHrs": 0.33 }, "yo": { "duration": 1046160, "reportedSentences": 0, "clips": 154, "users": 11, "size": 6355628, "checksum": "8808751ca2b725727bb2ace8a434bf86bfa8e3c5c04a97d4c2bb4d102c276b28", "avgDurationSecs": 5.947, "validDurationSecs": 6353.48, "totalHrs": 0.29, "validHrs": 1.76 }, "oc": { "duration": 6225408, "reportedSentences": 1, "clips": 1284, "users": 3, "size": 36678757, "checksum": "2b4f1a9f16be67704feda83a76acf2c82c594c8c00fc27dce02b14e5d14e9840", "avgDurationSecs": 4.874, "validDurationSecs": 1054.05, "totalHrs": 1.72, "validHrs": 0.3 }, "tk": { "duration": 11338596, "reportedSentences": 11, "clips": 2038, "users": 54, "size": 63517133, "checksum": "71d172525ef7e523b8cc332482cd44522056f92ce09278d98c688a4230986c2b", "avgDurationSecs": 5.384, "validDurationSecs": 4344.59, "totalHrs": 3.15, "validHrs": 1.21 }, "ne-NP": { "duration": 805572, "clips": 233, "users": 6, "size": 4711862, "checksum": "b0ee0a5d3959a0b360a738d8f151fa384a14984a4cc8a4620dac226470607632", "avgDurationSecs": 3.809, "validDurationSecs": 384.5, "totalHrs": 0.22, "validHrs": 0.11 }, "vot": { "duration": 0, "clips": 0, "users": 0, "size": 4568, "checksum": "3170871827252b6c37e629a24fe190a58c6d829dd49623621971364056e20009", "avgDurationSecs": 2.408, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "az": { "duration": 113256, "clips": 20, "users": 3, "size": 517884, "checksum": "a9208b506fa1067ff79d677c7f7414c75b04a712ac5842535a9820b18f4d7d0c", "avgDurationSecs": 5.362, "validDurationSecs": 28.75, "totalHrs": 0.03, "validHrs": 0.01 }, "ast": { "duration": 283392, "clips": 61, "users": 3, "size": 1680439, "checksum": 
"390a2062d490db8420f56e0a05b238f0cd5079318c095e310baa9c8c9abfefda", "avgDurationSecs": 4.542, "validDurationSecs": 401.9, "totalHrs": 0.08, "validHrs": 0.12 }, "quy": { "duration": 32472, "clips": 5, "users": 1, "size": 192038, "checksum": "aeee1fe8d224b18351332e700d25c87ae23d4eea45769b94aa95f68184476c92", "avgDurationSecs": 6.132, "validDurationSecs": 1.81, "totalHrs": 0.01, "validHrs": 0 }, "lo": { "duration": 429480, "clips": 65, "users": 2, "size": 2416163, "checksum": "42f61a8754d0b8b9ff63b679ddd2fb89e4e84f8a43e8641aea8c87a72e85dd25", "avgDurationSecs": 6.642, "validDurationSecs": 57.81, "totalHrs": 0.12, "validHrs": 0.01 }, "dyu": { "duration": 0, "clips": 0, "users": 0, "size": 157, "checksum": "081c25661ec11582fbef6d0b2292d8f0c2b4625f91ebe66ede85c6e699deacc4", "avgDurationSecs": 6.228, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "is": { "duration": 39600, "clips": 5, "users": 1, "size": 234921, "checksum": "c1dfa11a035c1d36ab48abec4c8d3e99c589c71a6d03a954085397a76b01f2b6", "avgDurationSecs": 7.334, "validDurationSecs": 58.675, "totalHrs": 0.02, "validHrs": 0.01 }, "totalDuration": 3515177522, "totalValidDurationSecs": 3462756, "totalHrs": 976, "totalValidHrs": 962 }, "totalDuration": 3515177522, "totalValidDurationSecs": 3462756, "totalHrs": 976, "totalValidHrs": 962 } ================================================ FILE: datasets/scripted-speech/cv-corpus-15.0-2023-09-08.json ================================================ { "locales": { "en": { "buckets": { "clip_durations": 2317263, "dev": 16386, "invalidated": 278774, "other": 285412, "reported": 6608, "test": 16386, "train": 1070066, "validated": 1752390 }, "duration": 12047305099, "reportedSentences": 6508, "clips": 2316576, "splits": { "accent": { "": 1 }, "age": { "": 0.36, "twenties": 0.25, "sixties": 0.04, "thirties": 0.14, "teens": 0.06, "seventies": 0.01, "fourties": 0.09, "fifties": 0.05, "eighties": 0, "nineties": 0 }, "gender": { "": 0.36, "male": 0.45, "female": 0.17, "other": 
0.02 } }, "users": 88904, "size": 84926487835, "checksum": "794f4ea6c6bab3731d54cf7ce3d67996cf1ba7c0d92cbd338c36636a9716047a", "avgDurationSecs": 5.2, "validDurationSecs": 9113267.591, "totalHrs": 3346.47, "validHrs": 2531.46 }, "fa": { "buckets": { "clip_durations": 364909, "dev": 10540, "invalidated": 14417, "other": 23578, "reported": 2335, "test": 10540, "train": 28756, "validated": 326914 }, "duration": 1456906436, "reportedSentences": 2326, "clips": 364909, "splits": { "accent": { "": 1 }, "age": { "": 0.25, "twenties": 0.32, "thirties": 0.36, "fifties": 0.01, "fourties": 0.02, "teens": 0.03, "sixties": 0 }, "gender": { "": 0.22, "male": 0.71, "female": 0.07, "other": 0 } }, "users": 4359, "size": 10613039705, "checksum": "eacf8280ecb9112024f9493818ac42c06a72855b18d8d5bf90f989fef87c4eee", "avgDurationSecs": 3.993, "validDurationSecs": 1305210.643, "totalHrs": 404.69, "validHrs": 362.55 }, "fr": { "buckets": { "clip_durations": 779271, "dev": 16132, "invalidated": 60291, "other": 23951, "reported": 7026, "test": 16132, "train": 527554, "validated": 695024 }, "duration": 3902596354, "reportedSentences": 6949, "clips": 779266, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.17, "thirties": 0.16, "": 0.37, "teens": 0.03, "fourties": 0.14, "fifties": 0.1, "sixties": 0.03, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "male": 0.59, "": 0.3, "female": 0.1, "other": 0.01 } }, "users": 17911, "size": 27596946700, "checksum": "9ec31dc710142ccf100dcb52c26ebf7350a7777e85e2b53949a9bed765f6b8e2", "avgDurationSecs": 5.008, "validDurationSecs": 3480708.934, "totalHrs": 1084.05, "validHrs": 966.86 }, "es": { "buckets": { "clip_durations": 1616992, "dev": 15796, "invalidated": 77927, "other": 1150345, "reported": 2312, "test": 15796, "train": 311392, "validated": 388715 }, "duration": 7873329654, "reportedSentences": 2290, "clips": 1616987, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.09, "": 0.13, "fifties": 0.04, "twenties": 0.54, "teens": 
0.08, "fourties": 0.03, "sixties": 0.09, "eighties": 0, "seventies": 0, "nineties": 0 }, "gender": { "male": 0.55, "": 0.13, "other": 0, "female": 0.32 } }, "users": 25338, "size": 49641496772, "checksum": "1d29fa2bcc47eaa538777f397dd4f52114a86788850d45b5e196f439e5d1b231", "avgDurationSecs": 4.869, "validDurationSecs": 1892706.21, "totalHrs": 2187.03, "validHrs": 525.75 }, "sl": { "buckets": { "clip_durations": 12513, "dev": 1232, "invalidated": 264, "other": 2222, "reported": 37, "test": 1230, "train": 1378, "validated": 10027 }, "duration": 47304910, "reportedSentences": 38, "clips": 12513, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.47, "teens": 0.07, "": 0.18, "sixties": 0.07, "fifties": 0.06, "fourties": 0.09, "thirties": 0.05 }, "gender": { "female": 0.15, "male": 0.67, "": 0.18, "other": 0 } }, "users": 146, "size": 331423898, "checksum": "92d8965ba2bb1386fbfb9ad614a69464e2f87ec436c678a6d2272ca3a5baedd0", "avgDurationSecs": 3.78, "validDurationSecs": 37906.684, "totalHrs": 13.14, "validHrs": 10.52 }, "kab": { "buckets": { "clip_durations": 742494, "dev": 14998, "invalidated": 19570, "other": 113309, "reported": 9418, "test": 14998, "train": 151934, "validated": 609615 }, "duration": 2477736052, "reportedSentences": 9394, "clips": 742494, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.09, "thirties": 0.29, "": 0.28, "fifties": 0.19, "twenties": 0.12, "eighties": 0, "teens": 0, "sixties": 0.03, "seventies": 0 }, "gender": { "male": 0.53, "": 0.26, "female": 0.2, "other": 0 } }, "users": 1532, "size": 18492385859, "checksum": "4d79a2ac110ef5e2bf4f772a2cce0ebb4591e2c0edb733deb18868d446fd480b", "avgDurationSecs": 3.337, "validDurationSecs": 2034312.82, "totalHrs": 688.26, "validHrs": 565.08 }, "cy": { "buckets": { "clip_durations": 113688, "dev": 5336, "invalidated": 4421, "other": 19594, "reported": 171, "test": 5341, "train": 7871, "validated": 89673 }, "duration": 555754689, "reportedSentences": 172, "clips": 113688, "splits": { "accent": 
{ "": 1 }, "age": { "fourties": 0.16, "twenties": 0.13, "sixties": 0.06, "fifties": 0.09, "": 0.43, "thirties": 0.09, "seventies": 0.01, "eighties": 0, "teens": 0.02 }, "gender": { "male": 0.33, "female": 0.25, "": 0.41, "other": 0.01 } }, "users": 1800, "size": 4063856311, "checksum": "5a26c2501517e36e76b2cb8d6624ad455913518506e072d5f6a55bbf8650ab59", "avgDurationSecs": 4.888, "validDurationSecs": 438359.284, "totalHrs": 154.37, "validHrs": 121.76 }, "ca": { "buckets": { "clip_durations": 2229059, "dev": 16402, "invalidated": 105206, "other": 413187, "reported": 6172, "test": 16402, "train": 1142607, "validated": 1710661 }, "duration": 11978505306, "reportedSentences": 6127, "clips": 2229054, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.05, "fifties": 0.17, "fourties": 0.11, "twenties": 0.05, "": 0.31, "sixties": 0.26, "teens": 0.01, "seventies": 0.03, "eighties": 0, "nineties": 0 }, "gender": { "male": 0.48, "": 0.31, "female": 0.21, "other": 0 } }, "users": 35062, "size": 72458226212, "checksum": "0db1c324ef3c68abc5ae1a69d68b73a3d5d117b97de321288af584ae1d7f84ff", "avgDurationSecs": 5.374, "validDurationSecs": 9192761.533, "totalHrs": 3327.36, "validHrs": 2553.54 }, "de": { "buckets": { "clip_durations": 955629, "dev": 16167, "invalidated": 53034, "other": 6652, "reported": 9994, "test": 16167, "train": 567993, "validated": 895943 }, "duration": 4994673077, "reportedSentences": 9960, "clips": 955629, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.18, "fourties": 0.17, "": 0.32, "thirties": 0.16, "teens": 0.03, "sixties": 0.02, "fifties": 0.11, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "male": 0.59, "": 0.32, "female": 0.08, "other": 0.01 } }, "users": 18352, "size": 34883538578, "checksum": "d665c3a98aefc514bfbe56758f66b9ee015ed9dc338822f08a2702f9bb077a98", "avgDurationSecs": 5.227, "validDurationSecs": 4682719.32, "totalHrs": 1387.4, "validHrs": 1300.75 }, "tt": { "buckets": { "clip_durations": 29439, "dev": 3615, 
"invalidated": 414, "other": 10, "reported": 5, "test": 4926, "train": 9426, "validated": 29015 }, "duration": 110737466, "reportedSentences": 6, "clips": 29439, "splits": { "accent": { "": 1 }, "age": { "": 0.21, "thirties": 0.72, "twenties": 0.05, "sixties": 0, "fifties": 0.01, "teens": 0, "fourties": 0, "seventies": 0.01 }, "gender": { "": 0.21, "male": 0.77, "female": 0.02 } }, "users": 246, "size": 816731017, "checksum": "e47e9a6ebb7b18a67bd2e153bd9e67f9fa11587f498013d60ba68ecb565026d3", "avgDurationSecs": 3.762, "validDurationSecs": 109142.552, "totalHrs": 30.76, "validHrs": 30.31 }, "ta": { "buckets": { "clip_durations": 227749, "dev": 11972, "invalidated": 5637, "other": 88692, "reported": 3353, "test": 11956, "train": 44044, "validated": 133420 }, "duration": 1415794740, "reportedSentences": 3353, "clips": 227749, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.09, "thirties": 0.09, "": 0.71, "fourties": 0.03, "seventies": 0.02, "fifties": 0.03, "teens": 0.03, "sixties": 0, "eighties": 0 }, "gender": { "male": 0.16, "": 0.7, "other": 0, "female": 0.13 } }, "users": 866, "size": 8482907967, "checksum": "583ef2b97804f70c62c3bf1708a4004e42a1d6b228453a28024468fe412e5bcf", "avgDurationSecs": 6.216, "validDurationSecs": 829401.377, "totalHrs": 393.27, "validHrs": 230.38 }, "ru": { "buckets": { "clip_durations": 180797, "dev": 10195, "invalidated": 9795, "other": 12585, "reported": 447, "test": 10196, "train": 26328, "validated": 158417 }, "duration": 932842296, "reportedSentences": 441, "clips": 180797, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.34, "teens": 0.09, "": 0.24, "fourties": 0.14, "thirties": 0.15, "fifties": 0.03, "sixties": 0, "seventies": 0 }, "gender": { "male": 0.6, "": 0.24, "other": 0, "female": 0.16 } }, "users": 3053, "size": 6348668542, "checksum": "8dafb999cbe49137de18568349f8c4b9044fc7417d2fbc3c7c71d98735edefea", "avgDurationSecs": 5.16, "validDurationSecs": 817370.189, "totalHrs": 259.12, "validHrs": 227.04 }, "nl": { 
"buckets": { "clip_durations": 97440, "dev": 11182, "invalidated": 5507, "other": 2451, "reported": 336, "test": 11182, "train": 34088, "validated": 89482 }, "duration": 423672373, "reportedSentences": 336, "clips": 97440, "splits": { "accent": { "": 1 }, "age": { "": 0.41, "twenties": 0.21, "fourties": 0.15, "thirties": 0.12, "teens": 0.02, "fifties": 0.08, "sixties": 0.02, "nineties": 0, "eighties": 0, "seventies": 0 }, "gender": { "": 0.41, "male": 0.48, "female": 0.11, "other": 0 } }, "users": 1645, "size": 2881919924, "checksum": "3c07ebb35f896d5acd76088e7b515e5b46b48a0eb46799e41c6a6d1a2e48b6ef", "avgDurationSecs": 4.348, "validDurationSecs": 389070.723, "totalHrs": 117.68, "validHrs": 108.07 }, "it": { "buckets": { "clip_durations": 256010, "dev": 15127, "invalidated": 18859, "other": 3691, "reported": 5915, "test": 15127, "train": 166503, "validated": 233460 }, "duration": 1378725780, "reportedSentences": 5911, "clips": 256010, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.17, "twenties": 0.21, "": 0.28, "fifties": 0.15, "fourties": 0.13, "seventies": 0, "sixties": 0.04, "teens": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "female": 0.11, "male": 0.61, "": 0.28, "other": 0 } }, "users": 6977, "size": 9426983321, "checksum": "25b668a1166fd668702bf22dbdaea7c4b47b6bb8aea1c8c0d299c3e045515a5e", "avgDurationSecs": 5.385, "validDurationSecs": 1257284.171, "totalHrs": 382.97, "validHrs": 349.24 }, "eu": { "buckets": { "clip_durations": 109124, "dev": 6806, "invalidated": 6034, "other": 31089, "reported": 145, "test": 6806, "train": 11519, "validated": 72001 }, "duration": 570503355, "reportedSentences": 145, "clips": 109124, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.13, "thirties": 0.07, "fifties": 0.13, "twenties": 0.33, "": 0.28, "teens": 0.03, "sixties": 0.02, "seventies": 0 }, "gender": { "male": 0.45, "female": 0.25, "": 0.29, "other": 0.02 } }, "users": 1425, "size": 4223538823, "checksum": 
"6afb9120d834059bad9ad32485d8269214fd5c5e14d0bd537635fd5158a6a508", "avgDurationSecs": 5.228, "validDurationSecs": 376423.262, "totalHrs": 158.47, "validHrs": 104.56 }, "tr": { "buckets": { "clip_durations": 114045, "dev": 10862, "invalidated": 4389, "other": 147, "reported": 391, "test": 10880, "train": 31465, "validated": 109509 }, "duration": 412661835, "reportedSentences": 392, "clips": 114045, "splits": { "accent": { "": 1 }, "age": { "": 0.26, "thirties": 0.08, "twenties": 0.24, "teens": 0.02, "fourties": 0.03, "fifties": 0.08, "sixties": 0.2, "eighties": 0.08, "seventies": 0.02 }, "gender": { "": 0.26, "male": 0.44, "female": 0.3, "other": 0 } }, "users": 1511, "size": 2498099796, "checksum": "c9ef82e60042b3eb7c50762464a59aa4122392459ddb1e00b678866bd8ee40e9", "avgDurationSecs": 3.618, "validDurationSecs": 396248.717, "totalHrs": 114.62, "validHrs": 110.06 }, "ar": { "buckets": { "clip_durations": 132845, "dev": 10362, "invalidated": 15068, "other": 39983, "reported": 2118, "test": 10474, "train": 28406, "validated": 77794 }, "duration": 553629078, "reportedSentences": 2110, "clips": 132845, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.11, "": 0.54, "twenties": 0.3, "fourties": 0.01, "teens": 0.03, "fifties": 0, "sixties": 0, "nineties": 0 }, "gender": { "female": 0.17, "": 0.54, "male": 0.29, "other": 0 } }, "users": 1481, "size": 3283507356, "checksum": "a9a2337f4da2c8961bf7146dc0b04c119dc101b03fd1f5c43ee40cfafe48ceba", "avgDurationSecs": 4.167, "validDurationSecs": 324205.055, "totalHrs": 153.78, "validHrs": 90.05 }, "zh-TW": { "buckets": { "clip_durations": 133632, "dev": 4950, "invalidated": 4745, "other": 46938, "reported": 144, "test": 4950, "train": 7048, "validated": 81819 }, "duration": 449221270, "reportedSentences": 145, "clips": 133502, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.2, "twenties": 0.3, "teens": 0.05, "": 0.26, "fifties": 0.09, "seventies": 0, "fourties": 0.09, "sixties": 0 }, "gender": { "male": 0.5, "": 0.25, 
"female": 0.23, "other": 0.02 } }, "users": 2163, "size": 3014145148, "checksum": "17dfe0e9e9e9c2874f0fb684865f356d14ab26c2834d806a3f821267230b5cb1", "avgDurationSecs": 3.365, "validDurationSecs": 275312.992, "totalHrs": 124.78, "validHrs": 76.47 }, "br": { "buckets": { "clip_durations": 28955, "dev": 2088, "invalidated": 979, "other": 14870, "reported": 284, "test": 2197, "train": 2663, "validated": 13106 }, "duration": 91392091, "reportedSentences": 284, "clips": 28955, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.33, "": 0.3, "fifties": 0.05, "fourties": 0.06, "thirties": 0.07, "sixties": 0.14, "seventies": 0.03, "teens": 0.01 }, "gender": { "male": 0.67, "": 0.31, "female": 0.02, "other": 0 } }, "users": 196, "size": 642924440, "checksum": "bf06ab83c28b9970847a19423bfd9ea0bc6c6577615b5e4374fd40d847b73ab9", "avgDurationSecs": 3.156, "validDurationSecs": 41367.113, "totalHrs": 25.38, "validHrs": 11.49 }, "pt": { "buckets": { "clip_durations": 178853, "dev": 9315, "invalidated": 6983, "other": 24716, "reported": 2579, "test": 9321, "train": 21202, "validated": 147154 }, "duration": 731370355, "reportedSentences": 2572, "clips": 178853, "splits": { "accent": { "": 1 }, "age": { "": 0.24, "twenties": 0.37, "teens": 0.02, "thirties": 0.18, "fourties": 0.13, "sixties": 0.02, "fifties": 0.02, "seventies": 0 }, "gender": { "": 0.24, "male": 0.68, "female": 0.06, "other": 0.02 } }, "users": 3247, "size": 4618646587, "checksum": "b49932e6dcee44813a92178494f520bb524226f2c4e6a0beee0f85e8797ad3a4", "avgDurationSecs": 4.089, "validDurationSecs": 601745.977, "totalHrs": 203.15, "validHrs": 167.15 }, "eo": { "buckets": { "clip_durations": 1125523, "dev": 14900, "invalidated": 127378, "other": 148703, "reported": 2255, "test": 14913, "train": 144070, "validated": 849442 }, "duration": 6827010307, "reportedSentences": 2254, "clips": 1125523, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.56, "thirties": 0.12, "": 0.2, "fourties": 0.04, "fifties": 0.02, 
"seventies": 0, "teens": 0.05, "sixties": 0, "eighties": 0.01 }, "gender": { "male": 0.69, "": 0.2, "female": 0.11, "other": 0 } }, "users": 1682, "size": 40786438590, "checksum": "c9a6b833e7daea9302f5cac54c72012e8f5c1a752d4409a6f688c609e151557d", "avgDurationSecs": 6.066, "validDurationSecs": 5152404.073, "totalHrs": 1896.39, "validHrs": 1431.22 }, "zh-CN": { "buckets": { "clip_durations": 834237, "dev": 10625, "invalidated": 58157, "other": 592680, "reported": 728, "test": 10625, "train": 29396, "validated": 183370 }, "duration": 3794426916, "reportedSentences": 725, "clips": 834207, "splits": { "accent": { "": 1 }, "age": { "": 0.93, "teens": 0.01, "twenties": 0.04, "thirties": 0.01, "fourties": 0, "nineties": 0, "fifties": 0, "sixties": 0 }, "gender": { "": 0.93, "male": 0.06, "female": 0.01, "other": 0 } }, "users": 6823, "size": 22447770171, "checksum": "468cb3d9d561cb60aaf9d72c1e57b7f206ed497880475ed07f5c373286958f35", "avgDurationSecs": 4.549, "validDurationSecs": 834066.441, "totalHrs": 1054, "validHrs": 231.68 }, "id": { "buckets": { "clip_durations": 57564, "dev": 3330, "invalidated": 2588, "other": 29292, "reported": 344, "test": 3638, "train": 4968, "validated": 25684 }, "duration": 229541988, "reportedSentences": 345, "clips": 57564, "splits": { "accent": { "": 1 }, "age": { "": 0.25, "twenties": 0.41, "thirties": 0.1, "teens": 0.22, "fifties": 0, "fourties": 0.02 }, "gender": { "": 0.25, "male": 0.45, "female": 0.27, "other": 0.04 } }, "users": 516, "size": 1444378010, "checksum": "bcf8dbb9fe2c70990d26aad714a25a993bd2d7fdae95411d0faa69a8f33901fc", "avgDurationSecs": 3.988, "validDurationSecs": 102417.421, "totalHrs": 63.76, "validHrs": 28.44 }, "ia": { "buckets": { "clip_durations": 14564, "dev": 1803, "invalidated": 335, "other": 2585, "reported": 275, "test": 1789, "train": 5007, "validated": 11644 }, "duration": 60943440, "reportedSentences": 271, "clips": 14564, "splits": { "accent": { "": 1 }, "age": { "seventies": 0.22, "fourties": 0.3, "": 
0.38, "twenties": 0.05, "thirties": 0.02, "teens": 0, "fifties": 0.03, "sixties": 0 }, "gender": { "male": 0.61, "": 0.38, "female": 0.01 } }, "users": 66, "size": 412823598, "checksum": "11cd9d4c2b32285bde1aba1241910d25b5ea221c2fdace828d8915b8a62adf95", "avgDurationSecs": 4.185, "validDurationSecs": 48724.623, "totalHrs": 16.92, "validHrs": 13.53 }, "lv": { "buckets": { "clip_durations": 149202, "dev": 2653, "invalidated": 2778, "other": 67540, "reported": 2881, "test": 2653, "train": 3157, "validated": 78884 }, "duration": 592648757, "reportedSentences": 2846, "clips": 149202, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.34, "fourties": 0.1, "": 0.38, "twenties": 0.07, "teens": 0.01, "fifties": 0.1, "sixties": 0.01, "eighties": 0, "seventies": 0 }, "gender": { "male": 0.29, "female": 0.33, "": 0.38, "other": 0 } }, "users": 2773, "size": 3247762415, "checksum": "91d38952f7eabeb9580b43b712b0a92c3bf6e4b08f524a99685d843e0c69c66e", "avgDurationSecs": 3.972, "validDurationSecs": 313336.983, "totalHrs": 164.62, "validHrs": 87.03 }, "ja": { "buckets": { "clip_durations": 169592, "dev": 4978, "invalidated": 6209, "other": 104938, "reported": 200, "test": 4978, "train": 7105, "validated": 58241 }, "duration": 813315453, "reportedSentences": 200, "clips": 169388, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.44, "": 0.17, "teens": 0.04, "fifties": 0.05, "thirties": 0.13, "fourties": 0.16, "sixties": 0, "seventies": 0, "nineties": 0 }, "gender": { "male": 0.41, "": 0.17, "female": 0.42, "other": 0 } }, "users": 1713, "size": 4808292442, "checksum": "a9014bf0cc4a47b317dc7797956e838a159955e1ebc936e350f3f698f72e0057", "avgDurationSecs": 4.801, "validDurationSecs": 279643.808, "totalHrs": 225.92, "validHrs": 77.67 }, "rw": { "buckets": { "clip_durations": 1713830, "dev": 15987, "invalidated": 227802, "other": 47307, "reported": 629, "test": 16213, "train": 1003023, "validated": 1438721 }, "duration": 8582142749, "reportedSentences": 630, "clips": 1713830, 
"splits": { "accent": { "": 1 }, "age": { "": 0.05, "twenties": 0.61, "thirties": 0.12, "teens": 0.2, "fourties": 0.02, "fifties": 0 }, "gender": { "": 0.1, "male": 0.57, "female": 0.33, "other": 0 } }, "users": 1120, "size": 61030591477, "checksum": "e6fb1d983284b2ca8e8c0648f1ed8f1cfec3c5bddbc9881c245797a49820e3b0", "avgDurationSecs": 5.008, "validDurationSecs": 7204512.115, "totalHrs": 2383.92, "validHrs": 2001.25 }, "sv-SE": { "buckets": { "clip_durations": 48283, "dev": 5127, "invalidated": 1398, "other": 6810, "reported": 581, "test": 5162, "train": 7584, "validated": 40075 }, "duration": 192832215, "reportedSentences": 582, "clips": 48283, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.25, "": 0.18, "teens": 0.03, "fifties": 0.03, "twenties": 0.12, "fourties": 0.38, "sixties": 0, "seventies": 0 }, "gender": { "male": 0.48, "": 0.19, "female": 0.33, "other": 0.01 } }, "users": 808, "size": 1218976628, "checksum": "2d5d854a7d2fd066faf73573e456a9ea92c924125f77668c6e40925c00a4d34e", "avgDurationSecs": 3.994, "validDurationSecs": 160051.178, "totalHrs": 53.56, "validHrs": 44.45 }, "cnh": { "buckets": { "clip_durations": 5802, "dev": 761, "invalidated": 436, "other": 2908, "reported": 8, "test": 763, "train": 817, "validated": 2458 }, "duration": 20675832, "reportedSentences": 9, "clips": 5802, "splits": { "accent": { "": 1 }, "age": { "": 0.51, "twenties": 0.36, "fourties": 0.01, "teens": 0.02, "thirties": 0.08, "fifties": 0.02 }, "gender": { "": 0.51, "male": 0.33, "female": 0.16 } }, "users": 299, "size": 161463884, "checksum": "c4753100ec422ccaf7d625cc12a9e09924bab32c291b2b4074f5947dff5ec369", "avgDurationSecs": 3.564, "validDurationSecs": 8759.255, "totalHrs": 5.74, "validHrs": 2.43 }, "et": { "buckets": { "clip_durations": 30197, "dev": 2645, "invalidated": 7048, "other": 0, "reported": 547, "test": 2645, "train": 3148, "validated": 23149 }, "duration": 204374918, "reportedSentences": 544, "clips": 30197, "splits": { "accent": { "": 1 }, "age": { "": 
0.19, "thirties": 0.07, "twenties": 0.69, "fourties": 0.04, "fifties": 0, "seventies": 0, "teens": 0 }, "gender": { "": 0.2, "male": 0.54, "female": 0.26, "other": 0 } }, "users": 854, "size": 1387077806, "checksum": "cf197dbc83297e7e1e8380e5c442379ec22f7c2fe98f9bf344dd6d81d562f1f2", "avgDurationSecs": 6.768, "validDurationSecs": 156673.675, "totalHrs": 56.77, "validHrs": 43.52 }, "ky": { "buckets": { "clip_durations": 36365, "dev": 1613, "invalidated": 5633, "other": 537, "reported": 38, "test": 1611, "train": 1790, "validated": 30195 }, "duration": 165185724, "reportedSentences": 39, "clips": 36365, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.08, "": 0.07, "fourties": 0.01, "twenties": 0.66, "teens": 0.18, "fifties": 0 }, "gender": { "male": 0.53, "": 0.11, "female": 0.36, "other": 0 } }, "users": 269, "size": 1065443992, "checksum": "65ce6facdd84593482bfdbaf1b546a8c3fe805a42b988c2ed08999b3d87fe930", "avgDurationSecs": 4.542, "validDurationSecs": 137158.887, "totalHrs": 45.88, "validHrs": 38.09 }, "ro": { "buckets": { "clip_durations": 39009, "dev": 3821, "invalidated": 941, "other": 21431, "reported": 386, "test": 3870, "train": 5172, "validated": 16637 }, "duration": 156254215, "reportedSentences": 387, "clips": 39009, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.13, "teens": 0.02, "": 0.13, "fourties": 0.06, "sixties": 0, "twenties": 0.65, "fifties": 0.01, "eighties": 0 }, "gender": { "male": 0.72, "": 0.12, "female": 0.14, "other": 0.01 } }, "users": 408, "size": 973492949, "checksum": "42ef26acc88244a674ecc551d3a0afa9c06fa64a0084820666fa634095ef474f", "avgDurationSecs": 4.006, "validDurationSecs": 66641.067, "totalHrs": 43.4, "validHrs": 18.51 }, "hsb": { "buckets": { "clip_durations": 1765, "dev": 172, "invalidated": 247, "other": 94, "reported": 86, "test": 444, "train": 808, "validated": 1424 }, "duration": 10872468, "reportedSentences": 87, "clips": 1765, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.52, "": 0.18, 
"thirties": 0.1, "sixties": 0, "seventies": 0.03, "twenties": 0.1, "fifties": 0.03, "teens": 0.04 }, "gender": { "male": 0.82, "": 0.18, "other": 0 } }, "users": 21, "size": 83718119, "checksum": "fdb29410f020c9b03f822aa1d6c8f5708cdd10924556137cea85f964fe64c562", "avgDurationSecs": 6.16, "validDurationSecs": 8771.895, "totalHrs": 3.02, "validHrs": 2.43 }, "el": { "buckets": { "clip_durations": 26987, "dev": 1699, "invalidated": 821, "other": 10401, "reported": 69, "test": 1702, "train": 1913, "validated": 15765 }, "duration": 111572566, "reportedSentences": 70, "clips": 26987, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.37, "fourties": 0.16, "": 0.31, "twenties": 0.12, "fifties": 0.03, "teens": 0.01, "sixties": 0 }, "gender": { "male": 0.64, "": 0.31, "other": 0.02, "female": 0.03 } }, "users": 372, "size": 743735882, "checksum": "5cb339dd5cdd09dd127dd407ba0a675d6a6b94bf7112d928bcc1ce11f0e630bd", "avgDurationSecs": 4.134, "validDurationSecs": 65177.363, "totalHrs": 30.99, "validHrs": 18.1 }, "cs": { "buckets": { "clip_durations": 209332, "dev": 8922, "invalidated": 2074, "other": 147402, "reported": 908, "test": 8907, "train": 19358, "validated": 59856 }, "duration": 933804147, "reportedSentences": 902, "clips": 209332, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.07, "": 0.23, "thirties": 0.47, "teens": 0.02, "twenties": 0.19, "fifties": 0.01, "sixties": 0, "seventies": 0 }, "gender": { "male": 0.54, "": 0.23, "female": 0.23, "other": 0 } }, "users": 931, "size": 5722154154, "checksum": "a40e52275c177f9a4cf26d4e0be0a7ba80bea1a49df9488195d438cbc08c0eb1", "avgDurationSecs": 4.461, "validDurationSecs": 267010.209, "totalHrs": 259.39, "validHrs": 74.16 }, "pl": { "buckets": { "clip_durations": 137759, "dev": 8895, "invalidated": 6438, "other": 1166, "reported": 557, "test": 8896, "train": 19119, "validated": 130155 }, "duration": 621335522, "reportedSentences": 557, "clips": 137759, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.28, "": 
0.24, "teens": 0.02, "thirties": 0.33, "fourties": 0.12, "fifties": 0.01, "nineties": 0.01, "sixties": 0 }, "gender": { "male": 0.6, "": 0.25, "female": 0.14, "other": 0.01 } }, "users": 3230, "size": 4438183373, "checksum": "2120959fc8114f9b59743d0e401b49186c6ec6b4a0075c193bfcb6c89102cde6", "avgDurationSecs": 4.51, "validDurationSecs": 587039.14, "totalHrs": 172.59, "validHrs": 163.06 }, "rm-sursilv": { "buckets": { "clip_durations": 7186, "dev": 1352, "invalidated": 679, "other": 2227, "reported": 16, "test": 1355, "train": 1561, "validated": 4280 }, "duration": 39264017, "reportedSentences": 17, "clips": 7186, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.03, "twenties": 0.1, "": 0.63, "teens": 0.05, "fourties": 0.19 }, "gender": { "male": 0.18, "female": 0.19, "": 0.63, "other": 0 } }, "users": 90, "size": 295917274, "checksum": "dc16aea509065151ebe382c34c6e62179bd5b7cbb2bd36bcea9a5f7dbbe79b52", "avgDurationSecs": 5.464, "validDurationSecs": 23385.749, "totalHrs": 10.9, "validHrs": 6.49 }, "rm-vallader": { "buckets": { "clip_durations": 2631, "dev": 500, "invalidated": 394, "other": 711, "reported": 37, "test": 462, "train": 557, "validated": 1526 }, "duration": 15327554, "reportedSentences": 36, "clips": 2631, "splits": { "accent": { "": 1 }, "age": { "": 0.36, "fourties": 0.41, "twenties": 0.14, "thirties": 0.06, "fifties": 0, "sixties": 0.03 }, "gender": { "": 0.36, "male": 0.44, "female": 0.19, "other": 0.01 } }, "users": 53, "size": 116767042, "checksum": "ad2d8fb378dba1d15c184b23aa183aa6283493bef7ac5c80d013f74a2b525632", "avgDurationSecs": 5.826, "validDurationSecs": 8890.098, "totalHrs": 4.25, "validHrs": 2.46 }, "mn": { "buckets": { "clip_durations": 14001, "dev": 1795, "invalidated": 781, "other": 4707, "reported": 31, "test": 1893, "train": 2239, "validated": 8513 }, "duration": 76212008, "reportedSentences": 32, "clips": 14001, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.22, "": 0.25, "twenties": 0.44, "fourties": 0.01, "teens": 
0.02, "nineties": 0.05, "fifties": 0, "eighties": 0.01 }, "gender": { "male": 0.39, "": 0.26, "female": 0.29, "other": 0.06 } }, "users": 510, "size": 565466098, "checksum": "917d2c114f6b1db77bff07bf891e16340db23115b59d1f1728e6f46db3e21cfd", "avgDurationSecs": 5.443, "validDurationSecs": 46339.035, "totalHrs": 21.17, "validHrs": 12.87 }, "zh-HK": { "buckets": { "clip_durations": 118736, "dev": 5593, "invalidated": 4395, "other": 22052, "reported": 681, "test": 5593, "train": 8426, "validated": 92289 }, "duration": 496228992, "reportedSentences": 670, "clips": 118736, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.13, "thirties": 0.11, "": 0.39, "teens": 0.02, "fifties": 0.02, "seventies": 0, "sixties": 0.01, "twenties": 0.33 }, "gender": { "male": 0.42, "": 0.35, "female": 0.22, "other": 0.01 } }, "users": 3001, "size": 3548638507, "checksum": "1be17fbae0707a4d43c1ad35aeab1469dda071d40b0439e1152e79be54d6accb", "avgDurationSecs": 4.179, "validDurationSecs": 385700.019, "totalHrs": 137.84, "validHrs": 107.13 }, "ab": { "buckets": { "clip_durations": 59195, "dev": 9160, "invalidated": 5277, "other": 11942, "reported": 224, "test": 9116, "train": 21027, "validated": 41976 }, "duration": 303841224, "reportedSentences": 223, "clips": 59195, "splits": { "accent": { "": 1 }, "age": { "seventies": 0.01, "thirties": 0.13, "": 0.18, "teens": 0.28, "twenties": 0.18, "fifties": 0.06, "sixties": 0.05, "fourties": 0.09, "eighties": 0.01 }, "gender": { "male": 0.18, "female": 0.64, "": 0.18 } }, "users": 400, "size": 1736712101, "checksum": "e3b8d228bb6b6656f52786a7b8fe81b837331d1f3ef4d3d9e7ff45718754b195", "avgDurationSecs": 5.133, "validDurationSecs": 215458.049, "totalHrs": 84.4, "validHrs": 59.84 }, "cv": { "buckets": { "clip_durations": 19572, "dev": 1260, "invalidated": 2136, "other": 118, "reported": 143, "test": 1290, "train": 1436, "validated": 17318 }, "duration": 99019848, "reportedSentences": 139, "clips": 19572, "splits": { "accent": { "": 1 }, "age": { 
"twenties": 0.51, "": 0.22, "fourties": 0.06, "thirties": 0.01, "teens": 0.19, "fifties": 0.01 }, "gender": { "male": 0.54, "": 0.2, "female": 0.27 } }, "users": 112, "size": 672914075, "checksum": "dc390296468cbac0688eef73efb9cd0aa837563884ebb6ab7ea8af0772519c6e", "avgDurationSecs": 5.059, "validDurationSecs": 87616.275, "totalHrs": 27.5, "validHrs": 24.33 }, "uk": { "buckets": { "clip_durations": 80395, "dev": 9550, "invalidated": 3036, "other": 5534, "reported": 636, "test": 9549, "train": 22436, "validated": 71825 }, "duration": 376129752, "reportedSentences": 637, "clips": 80395, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.31, "teens": 0.1, "": 0.24, "fourties": 0.1, "thirties": 0.25, "fifties": 0, "sixties": 0 }, "gender": { "male": 0.58, "female": 0.18, "": 0.24, "other": 0 } }, "users": 1024, "size": 2484169387, "checksum": "8f4400cbf931f84bf5b6a1732c34e04da2fd323c26102949365a5843b6ab7896", "avgDurationSecs": 4.679, "validDurationSecs": 336034.821, "totalHrs": 104.48, "validHrs": 93.34 }, "mt": { "buckets": { "clip_durations": 12968, "dev": 1591, "invalidated": 321, "other": 6253, "reported": 9, "test": 1647, "train": 1942, "validated": 6394 }, "duration": 61481160, "reportedSentences": 10, "clips": 12968, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.19, "": 0.26, "fourties": 0.17, "thirties": 0.09, "teens": 0.03, "fifties": 0.25, "sixties": 0.01 }, "gender": { "male": 0.25, "": 0.26, "female": 0.48, "other": 0.01 } }, "users": 212, "size": 457177202, "checksum": "43b23e3625e18b3fd7fca331767d93304df5aa78cd184c9f8d13cd59d7fcb1ec", "avgDurationSecs": 4.741, "validDurationSecs": 30313.891, "totalHrs": 17.07, "validHrs": 8.42 }, "as": { "buckets": { "clip_durations": 2071, "dev": 510, "invalidated": 208, "other": 146, "reported": 9, "test": 551, "train": 656, "validated": 1717 }, "duration": 11784629, "reportedSentences": 10, "clips": 2071, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.36, "": 0.6, "thirties": 0.04, "teens": 0 
}, "gender": { "male": 0.41, "": 0.59, "female": 0 } }, "users": 45, "size": 73692881, "checksum": "6c74b5a299400ca18797b7f2fd01f303e894c78f407630d15c4222ba126682f4", "avgDurationSecs": 5.69, "validDurationSecs": 9770.26, "totalHrs": 3.27, "validHrs": 2.71 }, "ka": { "buckets": { "clip_durations": 107921, "dev": 11609, "invalidated": 3327, "other": 27146, "reported": 670, "test": 11683, "train": 39326, "validated": 77448 }, "duration": 552117024, "reportedSentences": 671, "clips": 107921, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.2, "thirties": 0.46, "": 0.2, "fourties": 0.05, "fifties": 0.06, "teens": 0.02, "sixties": 0.02 }, "gender": { "male": 0.24, "female": 0.56, "": 0.2, "other": 0 } }, "users": 1254, "size": 3231141988, "checksum": "05d356d812259bd57e1dba180767683ea1690fb5845c3cdf64629cc59b964499", "avgDurationSecs": 5.116, "validDurationSecs": 396219.079, "totalHrs": 153.36, "validHrs": 110.06 }, "fy-NL": { "buckets": { "clip_durations": 156186, "dev": 3171, "invalidated": 3937, "other": 101504, "reported": 536, "test": 3170, "train": 3919, "validated": 50745 }, "duration": 757914399, "reportedSentences": 534, "clips": 156186, "splits": { "accent": { "": 1 }, "age": { "": 0.57, "fifties": 0.1, "thirties": 0.1, "twenties": 0.04, "fourties": 0.06, "sixties": 0.12, "seventies": 0, "teens": 0, "eighties": 0 }, "gender": { "": 0.57, "male": 0.13, "female": 0.31 } }, "users": 2006, "size": 4544762765, "checksum": "a73b5822e34901cc06ed3f6d1f59c4701eb4504b508b7176ebf4755af73418ae", "avgDurationSecs": 4.853, "validDurationSecs": 246247.206, "totalHrs": 210.53, "validHrs": 68.4 }, "dv": { "buckets": { "clip_durations": 45521, "dev": 2210, "invalidated": 1693, "other": 16317, "reported": 55, "test": 2207, "train": 2700, "validated": 27511 }, "duration": 230332383, "reportedSentences": 56, "clips": 45521, "splits": { "accent": { "": 1 }, "age": { "": 0.22, "twenties": 0.17, "thirties": 0.36, "fourties": 0.21, "teens": 0.01, "nineties": 0, "fifties": 0.02 
}, "gender": { "": 0.21, "male": 0.29, "female": 0.5 } }, "users": 343, "size": 1458441381, "checksum": "ef62877143369ab32c6307174e201ce06b61d01830903791a1cc3a44b985c181", "avgDurationSecs": 5.06, "validDurationSecs": 139203.317, "totalHrs": 63.98, "validHrs": 38.66 }, "pa-IN": { "buckets": { "clip_durations": 2963, "dev": 286, "invalidated": 80, "other": 1387, "reported": 249, "test": 482, "train": 728, "validated": 1496 }, "duration": 14291270, "reportedSentences": 244, "clips": 2963, "splits": { "accent": { "": 1 }, "age": { "": 0.3, "fourties": 0.04, "fifties": 0.05, "thirties": 0.38, "twenties": 0.23, "sixties": 0, "teens": 0 }, "gender": { "": 0.3, "male": 0.69, "female": 0.01 } }, "users": 65, "size": 100578403, "checksum": "c11f174858e7f80459fe1526be8dd455231e8feff0af0810fe934303d415346e", "avgDurationSecs": 4.823, "validDurationSecs": 7215.572, "totalHrs": 3.96, "validHrs": 2 }, "vi": { "buckets": { "clip_durations": 16851, "dev": 452, "invalidated": 361, "other": 11595, "reported": 186, "test": 1290, "train": 2383, "validated": 4895 }, "duration": 66816548, "reportedSentences": 185, "clips": 16851, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.02, "twenties": 0.2, "": 0.25, "teens": 0.21, "seventies": 0, "fourties": 0.02, "sixties": 0.3 }, "gender": { "male": 0.53, "": 0.25, "female": 0.2, "other": 0.02 } }, "users": 269, "size": 388349088, "checksum": "1a4f089f3129bb0b417a4560cb4638afd03fda56587aeadc55cc1a51b1ac2cd5", "avgDurationSecs": 3.965, "validDurationSecs": 19409.353, "totalHrs": 18.56, "validHrs": 5.39 }, "or": { "buckets": { "clip_durations": 8867, "dev": 0, "invalidated": 273, "other": 5474, "reported": 30, "test": 690, "train": 2023, "validated": 3120 }, "duration": 44369124, "reportedSentences": 31, "clips": 8867, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.18, "": 0.09, "thirties": 0.71, "fourties": 0.02, "teens": 0, "fifties": 0.01 }, "gender": { "male": 0.87, "": 0.09, "female": 0.04 } }, "users": 125, "size": 
307557190, "checksum": "7bf94cb626f81262749258809f09fc3210f162baabb00ef673bb79fe79481375", "avgDurationSecs": 5.004, "validDurationSecs": 15612.007, "totalHrs": 12.32, "validHrs": 4.33 }, "ga-IE": { "buckets": { "clip_durations": 10288, "dev": 516, "invalidated": 914, "other": 4021, "reported": 28, "test": 517, "train": 536, "validated": 5353 }, "duration": 36830151, "reportedSentences": 29, "clips": 10288, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.25, "": 0.37, "thirties": 0.24, "fourties": 0.06, "sixties": 0.01, "teens": 0.01, "fifties": 0.05 }, "gender": { "male": 0.49, "": 0.37, "female": 0.13, "other": 0 } }, "users": 184, "size": 251206620, "checksum": "f0435c5e88cb726046f7ed8f3dba26c01460e5041ebc5da916c3729a66f20a2b", "avgDurationSecs": 3.58, "validDurationSecs": 19163.277, "totalHrs": 10.23, "validHrs": 5.32 }, "fi": { "buckets": { "clip_durations": 15103, "dev": 1667, "invalidated": 246, "other": 5830, "reported": 54, "test": 1755, "train": 2148, "validated": 9027 }, "duration": 69622491, "reportedSentences": 55, "clips": 15103, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.19, "": 0.33, "twenties": 0.11, "fourties": 0.31, "teens": 0.01, "fifties": 0.04, "seventies": 0, "sixties": 0.01 }, "gender": { "male": 0.34, "": 0.33, "female": 0.33, "other": 0 } }, "users": 214, "size": 414282392, "checksum": "2dfb2244e6b8da1bb762b96fe54a04647c713addd780775aec5ca629e2724f5e", "avgDurationSecs": 4.61, "validDurationSecs": 41613.072, "totalHrs": 19.33, "validHrs": 11.55 }, "hu": { "buckets": { "clip_durations": 106764, "dev": 11197, "invalidated": 2908, "other": 46574, "reported": 412, "test": 11181, "train": 34498, "validated": 57282 }, "duration": 588386845, "reportedSentences": 413, "clips": 106764, "splits": { "accent": { "": 1 }, "age": { "teens": 0.02, "": 0.29, "thirties": 0.13, "twenties": 0.12, "fifties": 0.29, "fourties": 0.12, "sixties": 0.02, "seventies": 0.02, "eighties": 0 }, "gender": { "male": 0.25, "": 0.3, "female": 0.44, 
"other": 0.01 } }, "users": 1571, "size": 3434923071, "checksum": "30a10de531f3eb2979b44f209a9002e2aacba5306e58341fa0cc4cc204dda62e", "avgDurationSecs": 5.511, "validDurationSecs": 315686.704, "totalHrs": 163.44, "validHrs": 87.69 }, "th": { "buckets": { "clip_durations": 360763, "dev": 11032, "invalidated": 9234, "other": 205483, "reported": 4212, "test": 11032, "train": 32737, "validated": 146046 }, "duration": 1511991264, "reportedSentences": 4212, "clips": 360763, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.24, "": 0.4, "thirties": 0.07, "fourties": 0.04, "teens": 0.04, "fifties": 0.2, "eighties": 0, "sixties": 0 }, "gender": { "male": 0.42, "": 0.4, "female": 0.17, "other": 0.01 } }, "users": 7828, "size": 8800512797, "checksum": "27f7ac98bb30d18373e249519a3225974e60304dead643209edf45763836d426", "avgDurationSecs": 4.191, "validDurationSecs": 612092.36, "totalHrs": 419.99, "validHrs": 170.02 }, "lt": { "buckets": { "clip_durations": 17617, "dev": 4319, "invalidated": 719, "other": 1062, "reported": 188, "test": 4609, "train": 6715, "validated": 15836 }, "duration": 90407420, "reportedSentences": 188, "clips": 17617, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.36, "": 0.21, "thirties": 0.31, "fifties": 0.05, "sixties": 0.01, "teens": 0.03, "fourties": 0.04 }, "gender": { "male": 0.57, "": 0.21, "female": 0.22 } }, "users": 283, "size": 551399688, "checksum": "786d1b7d9b12c4378fcb30cb03f007cd3bdd4328637505bba1c97bfc8a9134e2", "avgDurationSecs": 5.132, "validDurationSecs": 81267.634, "totalHrs": 25.11, "validHrs": 22.57 }, "lg": { "buckets": { "clip_durations": 347782, "dev": 13435, "invalidated": 39168, "other": 36823, "reported": 6450, "test": 13471, "train": 70844, "validated": 271791 }, "duration": 2011923063, "reportedSentences": 6445, "clips": 347782, "splits": { "accent": { "": 1 }, "age": { "": 0.26, "thirties": 0.22, "twenties": 0.38, "fourties": 0.07, "fifties": 0.05, "teens": 0.01, "nineties": 0, "sixties": 0.02, "seventies": 0, 
"eighties": 0 }, "gender": { "": 0.25, "female": 0.39, "male": 0.36 } }, "users": 651, "size": 11789493227, "checksum": "1088eee5f34fa7d2b30a619843784c77b328d2303cdd03dd20fa264be35f1ca2", "avgDurationSecs": 5.785, "validDurationSecs": 1572314.212, "totalHrs": 558.86, "validHrs": 436.75 }, "hi": { "buckets": { "clip_durations": 14697, "dev": 2416, "invalidated": 757, "other": 3767, "reported": 135, "test": 3072, "train": 4630, "validated": 10173 }, "duration": 71911680, "reportedSentences": 136, "clips": 14697, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.34, "fourties": 0.03, "": 0.34, "thirties": 0.25, "teens": 0.01, "fifties": 0.02, "sixties": 0.01 }, "gender": { "male": 0.62, "female": 0.04, "": 0.34, "other": 0 } }, "users": 396, "size": 424657197, "checksum": "1a9519393771c40dfb1481559cc0d12ae9b56871d72636d1419b64eb18d1cd20", "avgDurationSecs": 4.893, "validDurationSecs": 49775.976, "totalHrs": 19.97, "validHrs": 13.82 }, "bas": { "buckets": { "clip_durations": 2266, "dev": 457, "invalidated": 508, "other": 3, "reported": 7, "test": 535, "train": 763, "validated": 1755 }, "duration": 10040580, "reportedSentences": 8, "clips": 2266, "splits": { "accent": { "": 1 }, "age": { "": 0.98, "fourties": 0.01, "teens": 0.01 }, "gender": { "": 0.98, "female": 0.02 } }, "users": 33, "size": 55888674, "checksum": "8b8238f2e466fcf7dd254d712064915a045308d2ef4b13fa09a01841e46d4a38", "avgDurationSecs": 4.431, "validDurationSecs": 7776.354, "totalHrs": 2.78, "validHrs": 2.16 }, "sk": { "buckets": { "clip_durations": 23153, "dev": 2620, "invalidated": 823, "other": 3096, "reported": 62, "test": 2575, "train": 3276, "validated": 19234 }, "duration": 94376628, "reportedSentences": 62, "clips": 23153, "splits": { "accent": { "": 1 }, "age": { "": 0.44, "thirties": 0.21, "twenties": 0.09, "fourties": 0.18, "teens": 0.08, "fifties": 0.01 }, "gender": { "": 0.43, "male": 0.49, "female": 0.07, "other": 0.01 } }, "users": 216, "size": 533096664, "checksum": 
"e5140c4a6d842ceb53c541e7e38c0eb3b0718483a1530e4672b273742e246802", "avgDurationSecs": 4.076, "validDurationSecs": 78401.938, "totalHrs": 26.21, "validHrs": 21.77 }, "kmr": { "buckets": { "clip_durations": 87229, "dev": 3514, "invalidated": 2398, "other": 31941, "reported": 1033, "test": 3511, "train": 4472, "validated": 52890 }, "duration": 353778876, "reportedSentences": 1028, "clips": 87229, "splits": { "accent": { "": 1 }, "age": { "": 0.48, "twenties": 0.35, "thirties": 0.08, "fourties": 0.03, "fifties": 0.04, "teens": 0.01, "sixties": 0 }, "gender": { "": 0.48, "male": 0.42, "female": 0.1, "other": 0 } }, "users": 551, "size": 1868154773, "checksum": "e5aa3c86707ae2856ba7d9b7fbbd6c12f56ad4fe692d3d8273a13194d3d2f08e", "avgDurationSecs": 4.056, "validDurationSecs": 214508.532, "totalHrs": 98.27, "validHrs": 59.58 }, "bg": { "buckets": { "clip_durations": 13109, "dev": 2560, "invalidated": 699, "other": 3720, "reported": 184, "test": 2701, "train": 3413, "validated": 8690 }, "duration": 71712288, "reportedSentences": 185, "clips": 13109, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.24, "thirties": 0.28, "": 0.37, "twenties": 0.11, "teens": 0, "sixties": 0, "fifties": 0 }, "gender": { "male": 0.59, "female": 0.04, "": 0.37 } }, "users": 106, "size": 419239088, "checksum": "62ef08f145f1c0813502fd348d772b98693ec017932af38b14c214bf7a3c864e", "avgDurationSecs": 5.47, "validDurationSecs": 47538.316, "totalHrs": 19.92, "validHrs": 13.2 }, "kk": { "buckets": { "clip_durations": 1436, "dev": 382, "invalidated": 197, "other": 1, "reported": 41, "test": 398, "train": 453, "validated": 1238 }, "duration": 7074144, "reportedSentences": 42, "clips": 1436, "splits": { "accent": { "": 1 }, "age": { "": 0.52, "thirties": 0.03, "twenties": 0.31, "teens": 0.05, "fifties": 0.09 }, "gender": { "": 0.53, "male": 0.45, "female": 0.02 } }, "users": 92, "size": 40673476, "checksum": "394769ebdfa7d07db57f4f749028a8e39515683b23a8d5bf73e5dcf89b85e09f", "avgDurationSecs": 4.926, 
"validDurationSecs": 6098.74, "totalHrs": 1.96, "validHrs": 1.69 }, "ba": { "buckets": { "clip_durations": 217537, "dev": 14588, "invalidated": 7928, "other": 166, "reported": 866, "test": 14597, "train": 118993, "validated": 209443 }, "duration": 962839332, "reportedSentences": 863, "clips": 217537, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.17, "": 0.3, "fourties": 0.06, "fifties": 0.05, "twenties": 0.17, "sixties": 0.19, "seventies": 0, "teens": 0.04 }, "gender": { "male": 0.3, "": 0.3, "female": 0.4 } }, "users": 912, "size": 5404128012, "checksum": "32e0c10eba60ac729686990e25313e0c3f74fe56c42ac55667e5c1c6cf389a5e", "avgDurationSecs": 4.426, "validDurationSecs": 927014.523, "totalHrs": 267.45, "validHrs": 257.5 }, "gl": { "buckets": { "clip_durations": 44109, "dev": 6961, "invalidated": 2100, "other": 14914, "reported": 361, "test": 7113, "train": 12688, "validated": 27095 }, "duration": 217062576, "reportedSentences": 361, "clips": 44109, "splits": { "accent": { "": 1 }, "age": { "": 0.31, "thirties": 0.18, "fifties": 0.07, "twenties": 0.21, "fourties": 0.2, "teens": 0.02, "sixties": 0.01, "seventies": 0 }, "gender": { "": 0.35, "male": 0.35, "female": 0.29, "other": 0 } }, "users": 1008, "size": 1246211222, "checksum": "086d970127eb786d4a6b64ba5fdbf7246e0dc14b493d0d8453a32e089d213544", "avgDurationSecs": 4.921, "validDurationSecs": 133335.838, "totalHrs": 60.29, "validHrs": 37.03 }, "ug": { "buckets": { "clip_durations": 81361, "dev": 4687, "invalidated": 4419, "other": 892, "reported": 301, "test": 4687, "train": 6528, "validated": 76050 }, "duration": 491440068, "reportedSentences": 302, "clips": 81361, "splits": { "accent": { "": 1 }, "age": { "": 0.6, "fifties": 0.01, "twenties": 0.1, "thirties": 0.16, "fourties": 0.11, "teens": 0.01, "eighties": 0.01 }, "gender": { "": 0.6, "male": 0.31, "female": 0.08, "other": 0 } }, "users": 1085, "size": 2847501148, "checksum": "e98dcc27e62e672fce1a6c837ea8cca2f2a0ac70ddc07736cb426619c7a035b2", 
"avgDurationSecs": 6.04, "validDurationSecs": 459360.347, "totalHrs": 136.51, "validHrs": 127.6 }, "hy-AM": { "buckets": { "clip_durations": 2803, "dev": 376, "invalidated": 111, "other": 1184, "reported": 44, "test": 480, "train": 651, "validated": 1508 }, "duration": 17129772, "reportedSentences": 45, "clips": 2803, "splits": { "accent": { "": 1 }, "age": { "": 0.37, "thirties": 0.13, "twenties": 0.37, "fifties": 0.04, "teens": 0.09 }, "gender": { "": 0.37, "male": 0.22, "female": 0.41 } }, "users": 72, "size": 100629878, "checksum": "36d814191cf5aacc39a0a9afa7150e1b4650c42806868256f06e9afcd652ffcb", "avgDurationSecs": 6.111, "validDurationSecs": 9215.732, "totalHrs": 4.75, "validHrs": 2.55 }, "be": { "buckets": { "clip_durations": 1233293, "dev": 15875, "invalidated": 34704, "other": 108, "reported": 3135, "test": 15879, "train": 347580, "validated": 1198481 }, "duration": 5873191668, "reportedSentences": 3134, "clips": 1233293, "splits": { "accent": { "": 1 }, "age": { "": 0.86, "fourties": 0.04, "thirties": 0.05, "twenties": 0.04, "teens": 0.01, "fifties": 0, "sixties": 0, "seventies": 0 }, "gender": { "": 0.85, "male": 0.06, "female": 0.08, "other": 0 } }, "users": 8205, "size": 33332980780, "checksum": "048582051cd2b452fa540cf1de49398411209700da0b6602d7d66cc6d8eed850", "avgDurationSecs": 4.762, "validDurationSecs": 5707409.856, "totalHrs": 1631.44, "validHrs": 1585.39 }, "ur": { "buckets": { "clip_durations": 188015, "dev": 4031, "invalidated": 6751, "other": 127661, "reported": 236, "test": 4031, "train": 5324, "validated": 53603 }, "duration": 793240344, "reportedSentences": 236, "clips": 188015, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.91, "": 0.08, "fourties": 0.01, "thirties": 0, "teens": 0, "fifties": 0, "sixties": 0 }, "gender": { "male": 0.67, "": 0.08, "female": 0.25 } }, "users": 316, "size": 4613838606, "checksum": "d6d3cb6ab6051475d6bb5a550e63b0e07d50ab742adae0fa2c4e09f95b59c337", "avgDurationSecs": 4.219, "validDurationSecs": 
226152.499, "totalHrs": 220.34, "validHrs": 62.82 }, "gn": { "buckets": { "clip_durations": 19791, "dev": 359, "invalidated": 158, "other": 16757, "reported": 35, "test": 846, "train": 1556, "validated": 2876 }, "duration": 89250372, "reportedSentences": 36, "clips": 19791, "splits": { "accent": { "": 1 }, "age": { "": 0.11, "twenties": 0.61, "thirties": 0.19, "sixties": 0, "fourties": 0.06, "teens": 0.02 }, "gender": { "": 0.11, "male": 0.39, "female": 0.49 } }, "users": 140, "size": 512419630, "checksum": "6a373d7db2a5ccd8105134c4ba569dfe09f335d4b279d38984913b7437df038c", "avgDurationSecs": 4.51, "validDurationSecs": 12969.737, "totalHrs": 24.79, "validHrs": 3.6 }, "sr": { "buckets": { "clip_durations": 6570, "dev": 1375, "invalidated": 175, "other": 1677, "reported": 29, "test": 1381, "train": 1521, "validated": 4718 }, "duration": 18992628, "reportedSentences": 30, "clips": 6570, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.34, "": 0.26, "fifties": 0.19, "fourties": 0.07, "thirties": 0.15, "teens": 0 }, "gender": { "male": 0.5, "": 0.26, "female": 0.24 } }, "users": 144, "size": 99883619, "checksum": "83dfd72a51efe7a305d615ab23d92a07b0f5819875effbd8fb8931abed0515f5", "avgDurationSecs": 2.891, "validDurationSecs": 13638.846, "totalHrs": 5.27, "validHrs": 3.78 }, "uz": { "buckets": { "clip_durations": 228012, "dev": 12080, "invalidated": 13883, "other": 127892, "reported": 1826, "test": 12358, "train": 48389, "validated": 86237 }, "duration": 945782388, "reportedSentences": 1809, "clips": 228012, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.4, "": 0.41, "thirties": 0.01, "teens": 0.18, "fifties": 0, "fourties": 0.01, "nineties": 0 }, "gender": { "male": 0.44, "": 0.41, "female": 0.15, "other": 0 } }, "users": 2141, "size": 5278910968, "checksum": "c4c662820a1909c3af9e5513f2f88da7072ff750c2b075883201c1ec02960553", "avgDurationSecs": 4.148, "validDurationSecs": 357706.769, "totalHrs": 262.71, "validHrs": 99.36 }, "mr": { "buckets": { 
"clip_durations": 15951, "dev": 1783, "invalidated": 2260, "other": 2830, "reported": 58, "test": 1737, "train": 2225, "validated": 10861 }, "duration": 98812764, "reportedSentences": 59, "clips": 15951, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.13, "sixties": 0, "twenties": 0.29, "": 0.05, "teens": 0.53 }, "gender": { "male": 0.19, "female": 0.76, "": 0.05 } }, "users": 87, "size": 579381426, "checksum": "6c32d868bd8fee5b5f9573a4f918e3a9ac315a6063eacadd9dbea966b33a4d1a", "avgDurationSecs": 6.195, "validDurationSecs": 67281.389, "totalHrs": 27.44, "validHrs": 18.68 }, "da": { "buckets": { "clip_durations": 10779, "dev": 2105, "invalidated": 387, "other": 472, "reported": 420, "test": 2444, "train": 3381, "validated": 9920 }, "duration": 44548632, "reportedSentences": 419, "clips": 10779, "splits": { "accent": { "": 1 }, "age": { "": 0.31, "thirties": 0.25, "twenties": 0.19, "sixties": 0.01, "fourties": 0.21, "fifties": 0.03, "teens": 0.01 }, "gender": { "": 0.31, "female": 0.08, "male": 0.61 } }, "users": 243, "size": 256044354, "checksum": "cd350c8b105eed54181c31ec68bc457774635c65384f986e838041fa0e6c242b", "avgDurationSecs": 4.133, "validDurationSecs": 40998.463, "totalHrs": 12.37, "validHrs": 11.38 }, "myv": { "buckets": { "clip_durations": 1995, "dev": 239, "invalidated": 23, "other": 1, "reported": 30, "test": 481, "train": 1241, "validated": 1971 }, "duration": 11519388, "reportedSentences": 31, "clips": 1995, "splits": { "accent": { "": 1 }, "age": { "sixties": 0.26, "": 0.38, "thirties": 0.24, "twenties": 0.09, "teens": 0.01, "fourties": 0.02 }, "gender": { "male": 0.54, "": 0.38, "female": 0.07 } }, "users": 13, "size": 67214215, "checksum": "07d69b582270528e74aad6c7284553317999b66b45876e0700bfa123628e5d91", "avgDurationSecs": 5.774, "validDurationSecs": 11380.809, "totalHrs": 3.19, "validHrs": 3.16 }, "nn-NO": { "buckets": { "clip_durations": 1129, "dev": 242, "invalidated": 59, "other": 136, "reported": 22, "test": 280, "train": 407, 
"validated": 934 }, "duration": 5008248, "reportedSentences": 23, "clips": 1129, "splits": { "accent": { "": 1 }, "age": { "": 0.26, "thirties": 0.49, "twenties": 0.17, "fourties": 0.04, "fifties": 0.01, "teens": 0.02 }, "gender": { "": 0.26, "female": 0.16, "male": 0.54, "other": 0.03 } }, "users": 32, "size": 28723877, "checksum": "53ab1c41a02b0529743c020d6718adf6ee055315b716518fdb685a537ad9198d", "avgDurationSecs": 4.436, "validDurationSecs": 4143.227, "totalHrs": 1.39, "validHrs": 1.15 }, "ha": { "buckets": { "clip_durations": 10106, "dev": 582, "invalidated": 169, "other": 6657, "reported": 32, "test": 660, "train": 1926, "validated": 3280 }, "duration": 43793136, "reportedSentences": 32, "clips": 10106, "splits": { "accent": { "": 1 }, "age": { "": 0.18, "thirties": 0.68, "twenties": 0.12, "fourties": 0, "fifties": 0.02 }, "gender": { "": 0.16, "male": 0.59, "female": 0.25 } }, "users": 39, "size": 256311331, "checksum": "222f8ab1bcf5b3dd5094ecdaf1e0745289c88bfd399da5e63e143e518703f09b", "avgDurationSecs": 4.333, "validDurationSecs": 14213.486, "totalHrs": 12.16, "validHrs": 3.94 }, "ckb": { "buckets": { "clip_durations": 120197, "dev": 4948, "invalidated": 7483, "other": 9244, "reported": 2414, "test": 4952, "train": 7070, "validated": 103470 }, "duration": 469151964, "reportedSentences": 2414, "clips": 120197, "splits": { "accent": { "": 1 }, "age": { "": 0.34, "thirties": 0.11, "twenties": 0.49, "fourties": 0.02, "teens": 0.02, "fifties": 0.02 }, "gender": { "": 0.32, "male": 0.62, "female": 0.06, "other": 0 } }, "users": 1294, "size": 2548252022, "checksum": "7308effe05102ea476924a3d93568b368daa9e21515e51675769b648f83be915", "avgDurationSecs": 3.903, "validDurationSecs": 403863.272, "totalHrs": 130.31, "validHrs": 112.18 }, "ml": { "buckets": { "clip_durations": 8006, "dev": 489, "invalidated": 98, "other": 5395, "reported": 123, "test": 667, "train": 1249, "validated": 2513 }, "duration": 33298164, "reportedSentences": 123, "clips": 8006, "splits": { 
"accent": { "": 1 }, "age": { "": 0.23, "twenties": 0.61, "thirties": 0.03, "fourties": 0.13, "sixties": 0 }, "gender": { "": 0.23, "male": 0.5, "other": 0.05, "female": 0.22 } }, "users": 128, "size": 189110552, "checksum": "39e027ae3413fa85ea4b831c9a6d4bd9cbc6ddf786bbc3cfeedcfa6903ef887a", "avgDurationSecs": 4.159, "validDurationSecs": 10451.947, "totalHrs": 9.24, "validHrs": 2.9 }, "mdf": { "buckets": { "clip_durations": 345, "dev": 54, "invalidated": 9, "other": 0, "reported": 14, "test": 107, "train": 175, "validated": 336 }, "duration": 1811340, "reportedSentences": 15, "clips": 345, "splits": { "accent": { "": 1 }, "age": { "sixties": 0.06, "": 0.58, "fourties": 0.34, "twenties": 0.02 }, "gender": { "male": 0.08, "": 0.58, "female": 0.34 } }, "users": 11, "size": 10649081, "checksum": "a1eb5f617f66a1b9388e5bbc46e2d7abb63e10654a68022491c0006aa75e9f24", "avgDurationSecs": 5.25, "validDurationSecs": 1764.088, "totalHrs": 0.5, "validHrs": 0.49 }, "sw": { "buckets": { "clip_durations": 636142, "dev": 12078, "invalidated": 76757, "other": 307706, "reported": 1879, "test": 12078, "train": 44075, "validated": 251679 }, "duration": 3355892964, "reportedSentences": 1874, "clips": 636142, "splits": { "accent": { "": 1 }, "age": { "": 0.34, "twenties": 0.44, "thirties": 0.12, "teens": 0, "fifties": 0.05, "fourties": 0.04, "sixties": 0.01 }, "gender": { "": 0.32, "male": 0.36, "female": 0.33, "other": 0 } }, "users": 1235, "size": 19583319152, "checksum": "4100530284269f3dff0a48abcdc46c818b28f1b593ac855e188c466efcc226ab", "avgDurationSecs": 5.275, "validDurationSecs": 1327703.226, "totalHrs": 932.19, "validHrs": 368.8 }, "sat": { "buckets": { "clip_durations": 824, "dev": 0, "invalidated": 9, "other": 354, "reported": 6, "test": 146, "train": 315, "validated": 461 }, "duration": 3676320, "reportedSentences": 7, "clips": 824, "splits": { "accent": { "": 1 }, "age": { "": 0.43, "twenties": 0.43, "fourties": 0.01, "fifties": 0.01, "teens": 0.01, "thirties": 0.1 }, "gender": 
{ "": 0.41, "male": 0.58, "female": 0.01 } }, "users": 13, "size": 20302465, "checksum": "1568d5ba65b8e03964571cfae19938f974650d9a13f357ec2abb569f35d6fe9d", "avgDurationSecs": 4.462, "validDurationSecs": 2056.776, "totalHrs": 1.02, "validHrs": 0.57 }, "tig": { "buckets": { "clip_durations": 321, "dev": 10, "invalidated": 12, "other": 268, "reported": 0, "test": 11, "train": 20, "validated": 41 }, "duration": 1620288, "reportedSentences": 1, "clips": 321, "splits": { "accent": { "": 1 }, "age": { "": 0.92, "twenties": 0.02, "thirties": 0.06 }, "gender": { "": 0.92, "male": 0.08 } }, "users": 9, "size": 9524370, "checksum": "6349a2ad02b2e101862ada4b860dc4892d217b75fc99c1ea1c80f84d293eef26", "avgDurationSecs": 5.048, "validDurationSecs": 206.953, "totalHrs": 0.45, "validHrs": 0.05 }, "ig": { "buckets": { "clip_durations": 5772, "dev": 2, "invalidated": 3, "other": 5755, "reported": 15, "test": 4, "train": 8, "validated": 14 }, "duration": 31535604, "reportedSentences": 15, "clips": 5772, "splits": { "accent": { "": 1 }, "age": { "": 0.56, "twenties": 0.32, "teens": 0.05, "eighties": 0, "thirties": 0.04, "sixties": 0.02, "fourties": 0 }, "gender": { "": 0.56, "male": 0.14, "female": 0.3 } }, "users": 113, "size": 184224726, "checksum": "5c217960dc04600da0464269f5ebcdfd67a29d39162e311fae6e5c9eb873e93d", "avgDurationSecs": 5.464, "validDurationSecs": 76.49, "totalHrs": 8.75, "validHrs": 0.02 }, "nan-tw": { "buckets": { "clip_durations": 27829, "dev": 2122, "invalidated": 496, "other": 20038, "reported": 212, "test": 2155, "train": 2821, "validated": 7176 }, "duration": 72262188, "reportedSentences": 210, "clips": 27710, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.15, "": 0.12, "twenties": 0.54, "fourties": 0.17, "teens": 0.01, "fifties": 0.01, "sixties": 0 }, "gender": { "male": 0.64, "": 0.12, "other": 0.13, "female": 0.12 } }, "users": 207, "size": 408245438, "checksum": "830f9e8db88d199a487e59e94000a6a41a3318d0c6690d8f2c3677d8437dc2fa", "avgDurationSecs": 
2.608, "validDurationSecs": 18713.586, "totalHrs": 20.07, "validHrs": 5.19 }, "mhr": { "buckets": { "clip_durations": 192337, "dev": 14668, "invalidated": 6963, "other": 28944, "reported": 93, "test": 14674, "train": 125138, "validated": 156430 }, "duration": 893471256, "reportedSentences": 92, "clips": 192337, "splits": { "accent": { "": 1 }, "age": { "fifties": 0.05, "": 0.14, "sixties": 0.04, "thirties": 0.25, "fourties": 0.14, "twenties": 0.29, "teens": 0.09, "seventies": 0.01 }, "gender": { "male": 0.21, "": 0.14, "female": 0.65 } }, "users": 425, "size": 4998858020, "checksum": "f43afead24ee0c328646e836ddfb63adc50c9d19c7f74163ae68c25a5438aa81", "avgDurationSecs": 4.645, "validDurationSecs": 726670.94, "totalHrs": 248.18, "validHrs": 201.85 }, "bn": { "buckets": { "clip_durations": 1047788, "dev": 9302, "invalidated": 7774, "other": 996081, "reported": 1883, "test": 9302, "train": 21097, "validated": 43933 }, "duration": 4573146852, "reportedSentences": 1877, "clips": 1047788, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.03, "twenties": 0.67, "": 0.22, "teens": 0.07, "fourties": 0.01, "fifties": 0 }, "gender": { "male": 0.54, "": 0.22, "female": 0.23, "other": 0 } }, "users": 22879, "size": 26352317274, "checksum": "fcc35d8592e221196bb9849f9199dc91f4743b320a76ef524f175b0e9d0bbd9a", "avgDurationSecs": 4.365, "validDurationSecs": 191748.77, "totalHrs": 1270.31, "validHrs": 53.26 }, "tok": { "buckets": { "clip_durations": 14086, "dev": 2072, "invalidated": 286, "other": 3107, "reported": 141, "test": 2149, "train": 2553, "validated": 10693 }, "duration": 55807596, "reportedSentences": 142, "clips": 14086, "splits": { "accent": { "": 1 }, "age": { "": 0.35, "twenties": 0.22, "teens": 0.27, "thirties": 0.08, "fourties": 0, "eighties": 0.08 }, "gender": { "": 0.35, "male": 0.47, "other": 0.11, "female": 0.07 } }, "users": 135, "size": 325281775, "checksum": "e77f5142809671a461db17abc0ff9545658ca5c2f899507002c9f529cb6b8915", "avgDurationSecs": 3.962, 
"validDurationSecs": 42364.804, "totalHrs": 15.5, "validHrs": 11.76 }, "yue": { "buckets": { "clip_durations": 74949, "dev": 2562, "invalidated": 1675, "other": 52421, "reported": 1158, "test": 2565, "train": 3074, "validated": 20851 }, "duration": 302625720, "reportedSentences": 1159, "clips": 74947, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.29, "": 0.27, "twenties": 0.39, "fourties": 0.02, "sixties": 0.01, "fifties": 0, "teens": 0.02 }, "gender": { "male": 0.26, "": 0.3, "female": 0.43, "other": 0.01 } }, "users": 865, "size": 1707302555, "checksum": "34cd9cac6e6100421cbaadfcda8355c8bb61c7a703aafb98e66a31eb52ea4714", "avgDurationSecs": 4.038, "validDurationSecs": 84193.482, "totalHrs": 84.06, "validHrs": 23.38 }, "sah": { "buckets": { "clip_durations": 4277, "dev": 1276, "invalidated": 103, "other": 143, "reported": 2, "test": 1093, "train": 1594, "validated": 4031 }, "duration": 25709196, "reportedSentences": 3, "clips": 4277, "splits": { "accent": { "": 1 }, "age": { "": 0.35, "twenties": 0.04, "fourties": 0.1, "thirties": 0.41, "teens": 0.1, "fifties": 0 }, "gender": { "": 0.35, "male": 0.51, "female": 0.14 } }, "users": 57, "size": 193448329, "checksum": "edcfd3fe431ee1fcc587124066aaa6a478d2753ac67e356a34cdf446e28bd825", "avgDurationSecs": 6.011, "validDurationSecs": 24230.481, "totalHrs": 7.14, "validHrs": 6.73 }, "mk": { "buckets": { "clip_durations": 214, "dev": 51, "invalidated": 14, "other": 40, "reported": 8, "test": 42, "train": 67, "validated": 160 }, "duration": 1132524, "reportedSentences": 9, "clips": 214, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.42, "": 0.35, "twenties": 0.16, "teens": 0.07 }, "gender": { "male": 0.65, "": 0.35 } }, "users": 8, "size": 6668625, "checksum": "0089f5f33f9216327002ac94edc6d65282dd68075f88e9ca61175e737f494cd8", "avgDurationSecs": 5.292, "validDurationSecs": 846.747, "totalHrs": 0.31, "validHrs": 0.23 }, "sc": { "buckets": { "clip_durations": 1410, "dev": 240, "invalidated": 26, "other": 
287, "reported": 2, "test": 196, "train": 502, "validated": 1097 }, "duration": 6325632, "reportedSentences": 3, "clips": 1410, "splits": { "accent": { "": 1 }, "age": { "": 0.63, "thirties": 0.29, "twenties": 0.07 }, "gender": { "": 0.63, "female": 0.29, "male": 0.07 } }, "users": 14, "size": 36620433, "checksum": "ccf4ae936fa2f82f85ada0af1f76be68ec5511cb567a8e422efb19c49189b157", "avgDurationSecs": 4.486, "validDurationSecs": 4921.431, "totalHrs": 1.75, "validHrs": 1.36 }, "skr": { "buckets": { "clip_durations": 5678, "dev": 1107, "invalidated": 453, "other": 1693, "reported": 16, "test": 954, "train": 1464, "validated": 3532 }, "duration": 23488920, "reportedSentences": 2, "clips": 5678, "splits": { "accent": { "": 1 }, "age": { "": 0.35, "twenties": 0.25, "fifties": 0.38, "teens": 0, "fourties": 0, "thirties": 0.01 }, "gender": { "": 0.35, "male": 0.65 } }, "users": 53, "size": 134485881, "checksum": "623f70331830bfa20fdf9d753604a50f37e04e91cb461fd81d489e56a81e09a2", "avgDurationSecs": 4.137, "validDurationSecs": 14611.283, "totalHrs": 6.52, "validHrs": 4.05 }, "ti": { "buckets": { "clip_durations": 23, "dev": 5, "invalidated": 1, "other": 0, "reported": 1, "test": 3, "train": 14, "validated": 22 }, "duration": 132912, "reportedSentences": 2, "clips": 23, "splits": { "accent": { "": 1 }, "age": { "": 1 }, "gender": { "": 1 } }, "users": 4, "size": 785836, "checksum": "80a08edc10bd2abfe37cc631492bbba66368ddbd9050dac7ce04045c74edbf4a", "avgDurationSecs": 5.779, "validDurationSecs": 127.133, "totalHrs": 0.03, "validHrs": 0.03 }, "mrj": { "buckets": { "clip_durations": 27735, "dev": 4477, "invalidated": 385, "other": 8797, "reported": 49, "test": 4287, "train": 9656, "validated": 18553 }, "duration": 115289100, "reportedSentences": 50, "clips": 27735, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.32, "": 0.11, "thirties": 0.25, "sixties": 0.04, "fourties": 0.14, "fifties": 0.13, "teens": 0.01 }, "gender": { "male": 0.26, "female": 0.63, "": 0.11 } }, 
"users": 51, "size": 640470242, "checksum": "f15d3732bb89a59954502dc3b25d5f306d2e52b843a1baaa05a11530f0120dab", "avgDurationSecs": 4.157, "validDurationSecs": 77121.279, "totalHrs": 32.02, "validHrs": 21.42 }, "tw": { "buckets": { "clip_durations": 237, "dev": 0, "invalidated": 0, "other": 189, "reported": 2, "test": 3, "train": 45, "validated": 48 }, "duration": 943884, "reportedSentences": 3, "clips": 237, "splits": { "accent": { "": 1 }, "age": { "": 0.07, "twenties": 0.02, "thirties": 0.91 }, "gender": { "": 0.07, "male": 0.93 } }, "users": 5, "size": 5531968, "checksum": "7ccad7a31ab09fbe9e8620f7112c4eec411f6eb07c270b7b15c96433dc5e990c", "avgDurationSecs": 3.983, "validDurationSecs": 191.166, "totalHrs": 0.26, "validHrs": 0.05 }, "ko": { "buckets": { "clip_durations": 2540, "dev": 213, "invalidated": 190, "other": 1629, "reported": 15, "test": 228, "train": 279, "validated": 721 }, "duration": 15534360, "reportedSentences": 16, "clips": 2540, "splits": { "accent": { "": 1 }, "age": { "": 0.15, "thirties": 0.39, "twenties": 0.33, "teens": 0.04, "fourties": 0.09 }, "gender": { "": 0.15, "male": 0.73, "female": 0.13, "other": 0 } }, "users": 61, "size": 90493234, "checksum": "f338e35b1110a24fc342ca6c6aad09d2ffb22e4366345584569d30d4588e6a94", "avgDurationSecs": 6.116, "validDurationSecs": 4409.557, "totalHrs": 4.31, "validHrs": 1.22 }, "yo": { "buckets": { "clip_durations": 4402, "dev": 862, "invalidated": 229, "other": 1106, "reported": 17, "test": 994, "train": 1209, "validated": 3067 }, "duration": 26180928, "reportedSentences": 18, "clips": 4402, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.02, "twenties": 0.63, "": 0.3, "teens": 0.05, "sixties": 0 }, "gender": { "male": 0.32, "": 0.3, "female": 0.38 } }, "users": 106, "size": 152933051, "checksum": "d0db7c9a12fff5dd43877972220c6e086946ef825f5bc9915385987ac0d2d456", "avgDurationSecs": 5.948, "validDurationSecs": 18241.005, "totalHrs": 7.27, "validHrs": 5.06 }, "oc": { "buckets": { "clip_durations": 
9039, "dev": 202, "invalidated": 149, "other": 7706, "reported": 9, "test": 220, "train": 246, "validated": 1184 }, "duration": 44068428, "reportedSentences": 10, "clips": 9039, "splits": { "accent": { "": 1 }, "age": { "": 0.46, "fifties": 0.17, "twenties": 0.02, "thirties": 0.04, "seventies": 0, "fourties": 0.19, "teens": 0.01, "sixties": 0.12 }, "gender": { "": 0.48, "male": 0.09, "female": 0.44 } }, "users": 141, "size": 249564283, "checksum": "60df2b394e906559d9fe34069e81e52f4aab6b64f64b645b5e635e7e3d991960", "avgDurationSecs": 4.875, "validDurationSecs": 5772.433, "totalHrs": 12.24, "validHrs": 1.6 }, "tk": { "buckets": { "clip_durations": 3423, "dev": 333, "invalidated": 111, "other": 1611, "reported": 17, "test": 465, "train": 807, "validated": 1701 }, "duration": 18427788, "reportedSentences": 18, "clips": 3423, "splits": { "accent": { "": 1 }, "age": { "": 0.26, "twenties": 0.62, "thirties": 0.11, "teens": 0 }, "gender": { "": 0.26, "male": 0.5, "female": 0.24 } }, "users": 79, "size": 104693940, "checksum": "305cd2f6ed4d6193566adbd5ca5b3051032fb664e0ffcea59506c6e13c059e52", "avgDurationSecs": 5.384, "validDurationSecs": 9157.367, "totalHrs": 5.11, "validHrs": 2.54 }, "ps": { "buckets": { "clip_durations": 1234, "dev": 0, "invalidated": 12, "other": 0, "reported": 0, "test": 195, "train": 1027, "validated": 1222 }, "duration": 5977404, "reportedSentences": 1, "clips": 1234, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.84, "twenties": 0.13, "": 0.03 }, "gender": { "male": 0.97, "": 0.03 } }, "users": 7, "size": 35145495, "checksum": "53e87e3ad75e8eccf49926d728fb41d44b2415cf04d2d063e97728238c467391", "avgDurationSecs": 4.844, "validDurationSecs": 5919.277, "totalHrs": 1.66, "validHrs": 1.64 }, "sq": { "buckets": { "clip_durations": 996, "dev": 183, "invalidated": 9, "other": 10, "reported": 9, "test": 296, "train": 494, "validated": 977 }, "duration": 4582296, "reportedSentences": 10, "clips": 996, "splits": { "accent": { "": 1 }, "age": { "": 
0.71, "twenties": 0.05, "thirties": 0.24 }, "gender": { "": 0.57, "male": 0.43 } }, "users": 38, "size": 26256548, "checksum": "84df66e78f546c6c64b022ed4d74e9f2185ae56165fee8be40f3889fdb51d604", "avgDurationSecs": 4.601, "validDurationSecs": 4494.883, "totalHrs": 1.27, "validHrs": 1.24 }, "ne-NP": { "buckets": { "clip_durations": 943, "dev": 106, "invalidated": 42, "other": 456, "reported": 2, "test": 139, "train": 194, "validated": 445 }, "duration": 3630492, "reportedSentences": 3, "clips": 943, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.18, "": 0.34, "twenties": 0.48 }, "gender": { "male": 0.67, "": 0.33 } }, "users": 23, "size": 21151802, "checksum": "cfadd5c287274655118d97fd32b9b503013aa1232d579277addf6e7dc09585f9", "avgDurationSecs": 3.85, "validDurationSecs": 1713.223, "totalHrs": 1, "validHrs": 0.47 }, "he": { "buckets": { "clip_durations": 605, "dev": 0, "invalidated": 4, "other": 553, "reported": 13, "test": 15, "train": 33, "validated": 48 }, "duration": 3083220, "reportedSentences": 14, "clips": 605, "splits": { "accent": { "": 1 }, "age": { "eighties": 0.23, "": 0.03, "thirties": 0.71, "fourties": 0.02 }, "gender": { "male": 0.95, "": 0.02, "female": 0.02 } }, "users": 6, "size": 17894890, "checksum": "7ab0c49575b97089131cd4adb787e85328d3ec23f66fc5a527e0315f0d5f724b", "avgDurationSecs": 5.096, "validDurationSecs": 244.619, "totalHrs": 0.85, "validHrs": 0.06 }, "az": { "buckets": { "clip_durations": 295, "dev": 26, "invalidated": 37, "other": 164, "reported": 4, "test": 29, "train": 39, "validated": 94 }, "duration": 1587492, "reportedSentences": 5, "clips": 295, "splits": { "accent": { "": 1 }, "age": { "": 0.43, "twenties": 0.34, "fourties": 0.01, "thirties": 0.22 }, "gender": { "": 0.43, "male": 0.57 } }, "users": 25, "size": 9153069, "checksum": "76c5a611263f086ed025615651e78d2c80c2db8109a96838687bd9db2949509f", "avgDurationSecs": 5.381, "validDurationSecs": 505.845, "totalHrs": 0.44, "validHrs": 0.14 }, "am": { "buckets": { 
"clip_durations": 1051, "dev": 66, "invalidated": 14, "other": 835, "reported": 6, "test": 11, "train": 125, "validated": 202 }, "duration": 6259608, "reportedSentences": 1, "clips": 1051, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.81, "": 0.13, "twenties": 0.06 }, "gender": { "male": 0.87, "": 0.13 } }, "users": 16, "size": 36687903, "checksum": "6908c8e32862cd2b547777cc6142dc3dfdad41d6be954eceb59d549e9a6308eb", "avgDurationSecs": 5.956, "validDurationSecs": 1203.084, "totalHrs": 1.73, "validHrs": 0.33 }, "vot": { "buckets": { "clip_durations": 426, "dev": 0, "invalidated": 324, "other": 0, "test": 6, "train": 96, "validated": 102 }, "duration": 1025976, "clips": 426, "splits": { "accent": { "": 1 }, "age": { "": 0.25, "twenties": 0.73, "teens": 0.01 }, "gender": { "": 0.25, "male": 0.75 } }, "users": 5, "size": 7898046, "checksum": "93876ec5db2db02cf7bd6762d40cd07634c830aa7b1c654fd99739d70ce5efc8", "avgDurationSecs": 2.408, "validDurationSecs": 245.656, "totalHrs": 0.28, "validHrs": 0.06 }, "ast": { "buckets": { "clip_durations": 1122, "dev": 94, "invalidated": 12, "other": 656, "test": 126, "train": 234, "validated": 454 }, "duration": 5096556, "clips": 1122, "splits": { "accent": { "": 1 }, "age": { "": 0.24, "fourties": 0.4, "fifties": 0.31, "teens": 0.05 }, "gender": { "": 0.24, "male": 0.75, "female": 0.01 } }, "users": 15, "size": 28689486, "checksum": "362476fbd65a5ae7072c69a000d90d666421eb385dcc67950dfb9ea1f86f451c", "avgDurationSecs": 4.542, "validDurationSecs": 2062.243, "totalHrs": 1.41, "validHrs": 0.57 }, "quy": { "buckets": { "clip_durations": 6, "dev": 0, "invalidated": 0, "other": 5, "test": 0, "train": 1, "validated": 1 }, "duration": 36792, "clips": 6, "splits": { "accent": { "": 1 }, "age": { "": 1 }, "gender": { "": 1 } }, "users": 2, "size": 218047, "checksum": "a4229627fa6852b10828185034909c00d536a2620a9ca8d2b83a069dbb68278e", "avgDurationSecs": 6.132, "validDurationSecs": 6.132, "totalHrs": 0.01, "validHrs": 0 }, "lo": { 
"buckets": { "clip_durations": 189, "dev": 28, "invalidated": 6, "other": 79, "test": 21, "train": 55, "validated": 104 }, "duration": 1224792, "clips": 189, "splits": { "accent": { "": 1 }, "age": { "": 0.06, "twenties": 0.36, "fourties": 0.01, "thirties": 0.58 }, "gender": { "": 0.06, "male": 0.94 } }, "users": 9, "size": 7095144, "checksum": "15bc7cab70f9bf24a09d86722b7ff657fe2ac14f94cedb551c84c015e282dff9", "avgDurationSecs": 6.48, "validDurationSecs": 673.96, "totalHrs": 0.34, "validHrs": 0.18 }, "dyu": { "buckets": { "clip_durations": 35, "dev": 5, "invalidated": 2, "other": 0, "test": 6, "train": 22, "validated": 33 }, "duration": 232992, "clips": 35, "splits": { "accent": { "": 1 }, "age": { "": 0.97, "fourties": 0.03 }, "gender": { "": 0.97, "male": 0.03 } }, "users": 6, "size": 1370134, "checksum": "0582cec5fc25f2f32c8f063ff583de357ee3d6d8fc7d5ba55ef592ba0ca5a97b", "avgDurationSecs": 6.657, "validDurationSecs": 219.678, "totalHrs": 0.06, "validHrs": 0.06 }, "is": { "buckets": { "clip_durations": 15, "dev": 0, "invalidated": 1, "other": 6, "test": 0, "train": 8, "validated": 8 }, "duration": 110016, "clips": 15, "splits": { "accent": { "": 1 }, "age": { "": 1 }, "gender": { "": 1 } }, "users": 2, "size": 645519, "checksum": "d96a8f64d7b6f190fd5a034582a98297535f808722c2eb6c82aa0b39e4ebce68", "avgDurationSecs": 7.334, "validDurationSecs": 58.675, "totalHrs": 0.03, "validHrs": 0.01 }, "zgh": { "buckets": { "clip_durations": 274, "dev": 47, "invalidated": 8, "other": 18, "test": 43, "train": 158, "validated": 248 }, "duration": 905832, "clips": 274, "splits": { "accent": { "": 1 }, "age": { "": 0.04, "thirties": 0.64, "twenties": 0.32 }, "gender": { "": 0.04, "male": 0.96 } }, "users": 5, "size": 5038796, "checksum": "69b3d5adbe69ba2500f152869f0a2165032d0f8b6b942c197c1c3bff26d50578", "avgDurationSecs": 3.306, "validDurationSecs": 819.877, "totalHrs": 0.25, "validHrs": 0.22 }, "af": { "buckets": { "clip_durations": 94, "dev": 12, "invalidated": 4, "other": 50, 
"test": 13, "train": 15, "validated": 40 }, "duration": 651384, "clips": 94, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.32, "": 0.68 }, "gender": { "male": 0.32, "": 0.68 } }, "users": 10, "size": 3770366, "checksum": "cf8483cb0e6a69cafb32155c28ec7727be4ba63463b772b7348a7caa1b11ae6a", "avgDurationSecs": 6.93, "validDurationSecs": 277.185, "totalHrs": 0.18, "validHrs": 0.07 } }, "totalDuration": 103503334891, "totalValidDurationSecs": 68974275, "totalHrs": 28750, "totalValidHrs": 19159 } ================================================ FILE: datasets/scripted-speech/cv-corpus-15.0-delta-2023-09-08.json ================================================ { "locales": { "en": { "duration": 244618020, "reportedSentences": 140, "clips": 40553, "users": 750, "size": 1371012179, "checksum": "794f4ea6c6bab3731d54cf7ce3d67996cf1ba7c0d92cbd338c36636a9716047a", "avgDurationSecs": 5.2, "validDurationSecs": 171002.31, "totalHrs": 67.95, "validHrs": 47.5 }, "fa": { "duration": 29293992, "reportedSentences": 36, "clips": 7017, "users": 20, "size": 172176807, "checksum": "eacf8280ecb9112024f9493818ac42c06a72855b18d8d5bf90f989fef87c4eee", "avgDurationSecs": 3.993, "validDurationSecs": 18352.2, "totalHrs": 8.14, "validHrs": 5.09 }, "fr": { "duration": 18421488, "reportedSentences": 55, "clips": 3589, "users": 150, "size": 113396363, "checksum": "9ec31dc710142ccf100dcb52c26ebf7350a7777e85e2b53949a9bed765f6b8e2", "avgDurationSecs": 5.008, "validDurationSecs": 32574.04, "totalHrs": 5.12, "validHrs": 9.05 }, "es": { "duration": 45171504, "reportedSentences": 112, "clips": 8640, "users": 77, "size": 275772205, "checksum": "1d29fa2bcc47eaa538777f397dd4f52114a86788850d45b5e196f439e5d1b231", "avgDurationSecs": 4.869, "validDurationSecs": 80825.04, "totalHrs": 12.55, "validHrs": 22.45 }, "sl": { "duration": 110232, "reportedSentences": 0, "clips": 25, "users": 2, "size": 739944, "checksum": "92d8965ba2bb1386fbfb9ad614a69464e2f87ec436c678a6d2272ca3a5baedd0", "avgDurationSecs": 3.78, 
"validDurationSecs": 216.7, "totalHrs": 0.04, "validHrs": 0.06 }, "kab": { "duration": 1408464, "reportedSentences": 157, "clips": 415, "users": 3, "size": 13339979, "checksum": "4d79a2ac110ef5e2bf4f772a2cce0ebb4591e2c0edb733deb18868d446fd480b", "avgDurationSecs": 3.337, "validDurationSecs": 923.71, "totalHrs": 0.4, "validHrs": 0.25 }, "cy": { "duration": 4053564, "reportedSentences": 2, "clips": 618, "users": 23, "size": 23504390, "checksum": "5a26c2501517e36e76b2cb8d6624ad455913518506e072d5f6a55bbf8650ab59", "avgDurationSecs": 4.888, "validDurationSecs": 1438.54, "totalHrs": 1.12, "validHrs": 0.4 }, "ca": { "duration": 614910276, "reportedSentences": 168, "clips": 126005, "users": 434, "size": 3446314122, "checksum": "0db1c324ef3c68abc5ae1a69d68b73a3d5d117b97de321288af584ae1d7f84ff", "avgDurationSecs": 5.374, "validDurationSecs": 555776.8, "totalHrs": 170.81, "validHrs": 154.38 }, "de": { "duration": 42302844, "reportedSentences": 383, "clips": 7251, "users": 165, "size": 254718100, "checksum": "d665c3a98aefc514bfbe56758f66b9ee015ed9dc338822f08a2702f9bb077a98", "avgDurationSecs": 5.227, "validDurationSecs": 39461.65, "totalHrs": 11.75, "validHrs": 10.96 }, "tt": { "duration": 123012, "reportedSentences": 0, "clips": 22, "users": 5, "size": 912414, "checksum": "e47e9a6ebb7b18a67bd2e153bd9e67f9fa11587f498013d60ba68ecb565026d3", "avgDurationSecs": 3.762, "validDurationSecs": 114.91, "totalHrs": 0.04, "validHrs": 0.03 }, "ta": { "duration": 1177092, "reportedSentences": 0, "clips": 148, "users": 8, "size": 8555587, "checksum": "583ef2b97804f70c62c3bf1708a4004e42a1d6b228453a28024468fe412e5bcf", "avgDurationSecs": 6.216, "validDurationSecs": 3332.94, "totalHrs": 0.33, "validHrs": 0.92 }, "ru": { "duration": 18721764, "reportedSentences": 49, "clips": 3545, "users": 52, "size": 109890542, "checksum": "8dafb999cbe49137de18568349f8c4b9044fc7417d2fbc3c7c71d98735edefea", "avgDurationSecs": 5.16, "validDurationSecs": 12762.38, "totalHrs": 5.2, "validHrs": 3.54 }, "nl": { 
"duration": 3155076, "reportedSentences": 0, "clips": 700, "users": 10, "size": 19072297, "checksum": "3c07ebb35f896d5acd76088e7b515e5b46b48a0eb46799e41c6a6d1a2e48b6ef", "avgDurationSecs": 4.348, "validDurationSecs": 5232.41, "totalHrs": 0.87, "validHrs": 1.45 }, "it": { "duration": 23351976, "reportedSentences": 15, "clips": 3865, "users": 47, "size": 138676090, "checksum": "25b668a1166fd668702bf22dbdaea7c4b47b6bb8aea1c8c0d299c3e045515a5e", "avgDurationSecs": 5.385, "validDurationSecs": 11444.37, "totalHrs": 6.48, "validHrs": 3.18 }, "eu": { "duration": 22455288, "reportedSentences": 47, "clips": 3749, "users": 102, "size": 126274273, "checksum": "6afb9120d834059bad9ad32485d8269214fd5c5e14d0bd537635fd5158a6a508", "avgDurationSecs": 5.228, "validDurationSecs": 11744.41, "totalHrs": 6.24, "validHrs": 3.27 }, "tr": { "duration": 34755372, "reportedSentences": 17, "clips": 9691, "users": 26, "size": 203050162, "checksum": "c9ef82e60042b3eb7c50762464a59aa4122392459ddb1e00b678866bd8ee40e9", "avgDurationSecs": 3.618, "validDurationSecs": 34156.86, "totalHrs": 9.65, "validHrs": 9.48 }, "ar": { "duration": 1098000, "reportedSentences": 1, "clips": 236, "users": 15, "size": 7268151, "checksum": "a9a2337f4da2c8961bf7146dc0b04c119dc101b03fd1f5c43ee40cfafe48ceba", "avgDurationSecs": 4.167, "validDurationSecs": 1629.64, "totalHrs": 0.3, "validHrs": 0.45 }, "zh-TW": { "duration": 8510436, "reportedSentences": 2, "clips": 1843, "users": 12, "size": 34900513, "checksum": "17dfe0e9e9e9c2874f0fb684865f356d14ab26c2834d806a3f821267230b5cb1", "avgDurationSecs": 3.365, "validDurationSecs": 4671.77, "totalHrs": 2.37, "validHrs": 1.3 }, "br": { "duration": 3608640, "reportedSentences": 5, "clips": 996, "users": 7, "size": 21205248, "checksum": "bf06ab83c28b9970847a19423bfd9ea0bc6c6577615b5e4374fd40d847b73ab9", "avgDurationSecs": 3.156, "validDurationSecs": 3210.09, "totalHrs": 1, "validHrs": 0.9 }, "pt": { "duration": 7701120, "reportedSentences": 13, "clips": 1585, "users": 66, "size": 
44820476, "checksum": "b49932e6dcee44813a92178494f520bb524226f2c4e6a0beee0f85e8797ad3a4", "avgDurationSecs": 4.089, "validDurationSecs": 18035.88, "totalHrs": 2.14, "validHrs": 5.01 }, "eo": { "duration": 13286196, "reportedSentences": 36, "clips": 1960, "users": 27, "size": 85398974, "checksum": "c9a6b833e7daea9302f5cac54c72012e8f5c1a752d4409a6f688c609e151557d", "avgDurationSecs": 6.066, "validDurationSecs": 2833.45, "totalHrs": 3.69, "validHrs": 0.79 }, "zh-CN": { "duration": 5472108, "reportedSentences": 13, "clips": 1310, "users": 59, "size": 37534874, "checksum": "468cb3d9d561cb60aaf9d72c1e57b7f206ed497880475ed07f5c373286958f35", "avgDurationSecs": 4.549, "validDurationSecs": 3527.65, "totalHrs": 1.52, "validHrs": 0.98 }, "id": { "duration": 1496592, "reportedSentences": 0, "clips": 424, "users": 9, "size": 9052556, "checksum": "bcf8dbb9fe2c70990d26aad714a25a993bd2d7fdae95411d0faa69a8f33901fc", "avgDurationSecs": 3.988, "validDurationSecs": 1800.47, "totalHrs": 0.42, "validHrs": 0.5 }, "ia": { "duration": 55008, "reportedSentences": 0, "clips": 10, "users": 2, "size": 418542, "checksum": "11cd9d4c2b32285bde1aba1241910d25b5ea221c2fdace828d8915b8a62adf95", "avgDurationSecs": 4.185, "validDurationSecs": 131.86, "totalHrs": 0.01, "validHrs": 0.04 }, "lv": { "duration": 104916096, "reportedSentences": 2666, "clips": 24992, "users": 377, "size": 568894303, "checksum": "91d38952f7eabeb9580b43b712b0a92c3bf6e4b08f524a99685d843e0c69c66e", "avgDurationSecs": 3.972, "validDurationSecs": 67122.5, "totalHrs": 29.14, "validHrs": 18.64 }, "ja": { "duration": 516852, "reportedSentences": 0, "clips": 61, "users": 6, "size": 4113564, "checksum": "a9014bf0cc4a47b317dc7797956e838a159955e1ebc936e350f3f698f72e0057", "avgDurationSecs": 4.801, "validDurationSecs": 888.26, "totalHrs": 0.15, "validHrs": 0.24 }, "rw": { "duration": 334152, "reportedSentences": 0, "clips": 75, "users": 6, "size": 13544797, "checksum": "e6fb1d983284b2ca8e8c0648f1ed8f1cfec3c5bddbc9881c245797a49820e3b0", 
"avgDurationSecs": 5.008, "validDurationSecs": 230.63, "totalHrs": 0.09, "validHrs": 0.07 }, "sv-SE": { "duration": 4213188, "reportedSentences": 0, "clips": 895, "users": 7, "size": 25084916, "checksum": "2d5d854a7d2fd066faf73573e456a9ea92c924125f77668c6e40925c00a4d34e", "avgDurationSecs": 3.994, "validDurationSecs": 1738.25, "totalHrs": 1.17, "validHrs": 0.48 }, "cnh": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 35490, "checksum": "c4753100ec422ccaf7d625cc12a9e09924bab32c291b2b4074f5947dff5ec369", "avgDurationSecs": 3.564, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "et": { "duration": 8182152, "reportedSentences": 57, "clips": 1173, "users": 42, "size": 48553006, "checksum": "cf197dbc83297e7e1e8380e5c442379ec22f7c2fe98f9bf344dd6d81d562f1f2", "avgDurationSecs": 6.768, "validDurationSecs": 9028.88, "totalHrs": 2.28, "validHrs": 2.51 }, "ky": { "duration": 2545380, "reportedSentences": 1, "clips": 507, "users": 5, "size": 15187319, "checksum": "65ce6facdd84593482bfdbaf1b546a8c3fe805a42b988c2ed08999b3d87fe930", "avgDurationSecs": 4.542, "validDurationSecs": 344.69, "totalHrs": 0.71, "validHrs": 0.09 }, "ro": { "duration": 3998844, "reportedSentences": 4, "clips": 907, "users": 4, "size": 23713689, "checksum": "42ef26acc88244a674ecc551d3a0afa9c06fa64a0084820666fa634095ef474f", "avgDurationSecs": 4.006, "validDurationSecs": 1174.69, "totalHrs": 1.11, "validHrs": 0.33 }, "hsb": { "duration": 554112, "reportedSentences": 0, "clips": 79, "users": 0, "size": 3279375, "checksum": "fdb29410f020c9b03f822aa1d6c8f5708cdd10924556137cea85f964fe64c562", "avgDurationSecs": 6.16, "validDurationSecs": 56.99, "totalHrs": 0.16, "validHrs": 0.01 }, "el": { "duration": 477900, "reportedSentences": 0, "clips": 111, "users": 4, "size": 2809954, "checksum": "5cb339dd5cdd09dd127dd407ba0a675d6a6b94bf7112d928bcc1ce11f0e630bd", "avgDurationSecs": 4.134, "validDurationSecs": 151.68, "totalHrs": 0.14, "validHrs": 0.04 }, "cs": { "duration": 3421656, 
"reportedSentences": 7, "clips": 812, "users": 21, "size": 21352361, "checksum": "a40e52275c177f9a4cf26d4e0be0a7ba80bea1a49df9488195d438cbc08c0eb1", "avgDurationSecs": 4.461, "validDurationSecs": 1316.67, "totalHrs": 0.96, "validHrs": 0.36 }, "pl": { "duration": 1494216, "reportedSentences": 2, "clips": 250, "users": 22, "size": 9755922, "checksum": "2120959fc8114f9b59743d0e401b49186c6ec6b4a0075c193bfcb6c89102cde6", "avgDurationSecs": 4.51, "validDurationSecs": 3461.81, "totalHrs": 0.42, "validHrs": 0.96 }, "rm-sursilv": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 46663, "checksum": "dc16aea509065151ebe382c34c6e62179bd5b7cbb2bd36bcea9a5f7dbbe79b52", "avgDurationSecs": 5.464, "validDurationSecs": 109.28, "totalHrs": 0, "validHrs": 0.03 }, "rm-vallader": { "duration": 32616, "reportedSentences": 0, "clips": 5, "users": 1, "size": 212589, "checksum": "ad2d8fb378dba1d15c184b23aa183aa6283493bef7ac5c80d013f74a2b525632", "avgDurationSecs": 5.826, "validDurationSecs": 7.85, "totalHrs": 0.01, "validHrs": 0 }, "mn": { "duration": 485928, "reportedSentences": 0, "clips": 83, "users": 5, "size": 2910349, "checksum": "917d2c114f6b1db77bff07bf891e16340db23115b59d1f1728e6f46db3e21cfd", "avgDurationSecs": 5.443, "validDurationSecs": 107.93, "totalHrs": 0.14, "validHrs": 0.03 }, "zh-HK": { "duration": 1948104, "reportedSentences": 8, "clips": 530, "users": 19, "size": 12204454, "checksum": "1be17fbae0707a4d43c1ad35aeab1469dda071d40b0439e1152e79be54d6accb", "avgDurationSecs": 4.179, "validDurationSecs": 983.35, "totalHrs": 0.54, "validHrs": 0.27 }, "ab": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 371558, "checksum": "e3b8d228bb6b6656f52786a7b8fe81b837331d1f3ef4d3d9e7ff45718754b195", "avgDurationSecs": 5.133, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "cv": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 128496, "checksum": 
"dc390296468cbac0688eef73efb9cd0aa837563884ebb6ab7ea8af0772519c6e", "avgDurationSecs": 5.059, "validDurationSecs": 20.24, "totalHrs": 0, "validHrs": 0 }, "uk": { "duration": 45667800, "reportedSentences": 14, "clips": 10699, "users": 135, "size": 263369517, "checksum": "8f4400cbf931f84bf5b6a1732c34e04da2fd323c26102949365a5843b6ab7896", "avgDurationSecs": 4.679, "validDurationSecs": 18208.9, "totalHrs": 12.69, "validHrs": 5.06 }, "mt": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 79414, "checksum": "43b23e3625e18b3fd7fca331767d93304df5aa78cd184c9f8d13cd59d7fcb1ec", "avgDurationSecs": 4.741, "validDurationSecs": 18.96, "totalHrs": 0, "validHrs": 0.01 }, "as": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 13551, "checksum": "6c74b5a299400ca18797b7f2fd01f303e894c78f407630d15c4222ba126682f4", "avgDurationSecs": 5.69, "validDurationSecs": 22.76, "totalHrs": 0, "validHrs": 0.01 }, "ka": { "duration": 93187116, "reportedSentences": 40, "clips": 18845, "users": 89, "size": 546000385, "checksum": "05d356d812259bd57e1dba180767683ea1690fb5845c3cdf64629cc59b964499", "avgDurationSecs": 5.116, "validDurationSecs": 86108.05, "totalHrs": 25.88, "validHrs": 23.92 }, "fy-NL": { "duration": 2590092, "reportedSentences": 0, "clips": 543, "users": 6, "size": 16288281, "checksum": "a73b5822e34901cc06ed3f6d1f59c4701eb4504b508b7176ebf4755af73418ae", "avgDurationSecs": 4.853, "validDurationSecs": 3343.59, "totalHrs": 0.72, "validHrs": 0.93 }, "dv": { "duration": 1775196, "reportedSentences": 0, "clips": 344, "users": 4, "size": 10733333, "checksum": "ef62877143369ab32c6307174e201ce06b61d01830903791a1cc3a44b985c181", "avgDurationSecs": 5.06, "validDurationSecs": 607.92, "totalHrs": 0.5, "validHrs": 0.17 }, "pa-IN": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 22983, "checksum": "c11f174858e7f80459fe1526be8dd455231e8feff0af0810fe934303d415346e", "avgDurationSecs": 4.823, "validDurationSecs": 38.59, "totalHrs": 
0, "validHrs": 0.01 }, "vi": { "duration": 88740, "reportedSentences": 0, "clips": 25, "users": 5, "size": 629997, "checksum": "1a4f089f3129bb0b417a4560cb4638afd03fda56587aeadc55cc1a51b1ac2cd5", "avgDurationSecs": 3.965, "validDurationSecs": 163.54, "totalHrs": 0.03, "validHrs": 0.05 }, "or": { "duration": 1811664, "reportedSentences": 4, "clips": 351, "users": 0, "size": 10744905, "checksum": "7bf94cb626f81262749258809f09fc3210f162baabb00ef673bb79fe79481375", "avgDurationSecs": 5.004, "validDurationSecs": 1134.67, "totalHrs": 0.5, "validHrs": 0.31 }, "ga-IE": { "duration": 567756, "reportedSentences": 1, "clips": 135, "users": 6, "size": 3389870, "checksum": "f0435c5e88cb726046f7ed8f3dba26c01460e5041ebc5da916c3729a66f20a2b", "avgDurationSecs": 3.58, "validDurationSecs": 1123.15, "totalHrs": 0.16, "validHrs": 0.31 }, "fi": { "duration": 1080324, "reportedSentences": 0, "clips": 216, "users": 2, "size": 6455512, "checksum": "2dfb2244e6b8da1bb762b96fe54a04647c713addd780775aec5ca629e2724f5e", "avgDurationSecs": 4.61, "validDurationSecs": 1027.38, "totalHrs": 0.3, "validHrs": 0.28 }, "hu": { "duration": 47544912, "reportedSentences": 2, "clips": 7631, "users": 18, "size": 280151961, "checksum": "30a10de531f3eb2979b44f209a9002e2aacba5306e58341fa0cc4cc204dda62e", "avgDurationSecs": 5.511, "validDurationSecs": 27275.5, "totalHrs": 13.21, "validHrs": 7.58 }, "th": { "duration": 5243868, "reportedSentences": 21, "clips": 1292, "users": 14, "size": 32698878, "checksum": "27f7ac98bb30d18373e249519a3225974e60304dead643209edf45763836d426", "avgDurationSecs": 4.191, "validDurationSecs": 2935.87, "totalHrs": 1.45, "validHrs": 0.81 }, "lt": { "duration": 59364, "reportedSentences": 0, "clips": 10, "users": 1, "size": 468149, "checksum": "786d1b7d9b12c4378fcb30cb03f007cd3bdd4328637505bba1c97bfc8a9134e2", "avgDurationSecs": 5.132, "validDurationSecs": 715.37, "totalHrs": 0.02, "validHrs": 0.2 }, "lg": { "duration": 124596, "reportedSentences": 0, "clips": 19, "users": 3, "size": 
3001330, "checksum": "1088eee5f34fa7d2b30a619843784c77b328d2303cdd03dd20fa264be35f1ca2", "avgDurationSecs": 5.785, "validDurationSecs": 694.1, "totalHrs": 0.03, "validHrs": 0.19 }, "hi": { "duration": 3254040, "reportedSentences": 3, "clips": 570, "users": 13, "size": 19222707, "checksum": "1a9519393771c40dfb1481559cc0d12ae9b56871d72636d1419b64eb18d1cd20", "avgDurationSecs": 4.893, "validDurationSecs": 1156.24, "totalHrs": 0.9, "validHrs": 0.32 }, "bas": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 13499, "checksum": "8b8238f2e466fcf7dd254d712064915a045308d2ef4b13fa09a01841e46d4a38", "avgDurationSecs": 4.431, "validDurationSecs": 8.86, "totalHrs": 0, "validHrs": 0.01 }, "sk": { "duration": 514296, "reportedSentences": 1, "clips": 119, "users": 3, "size": 3156682, "checksum": "e5140c4a6d842ceb53c541e7e38c0eb3b0718483a1530e4672b273742e246802", "avgDurationSecs": 4.076, "validDurationSecs": 154.8, "totalHrs": 0.14, "validHrs": 0.04 }, "kmr": { "duration": 658080, "reportedSentences": 0, "clips": 128, "users": 1, "size": 4431826, "checksum": "e5aa3c86707ae2856ba7d9b7fbbd6c12f56ad4fe692d3d8273a13194d3d2f08e", "avgDurationSecs": 4.056, "validDurationSecs": 323.57, "totalHrs": 0.19, "validHrs": 0.09 }, "bg": { "duration": 60444, "reportedSentences": 0, "clips": 10, "users": 0, "size": 451600, "checksum": "62ef08f145f1c0813502fd348d772b98693ec017932af38b14c214bf7a3c864e", "avgDurationSecs": 5.47, "validDurationSecs": 1064.99, "totalHrs": 0.02, "validHrs": 0.3 }, "kk": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 11339, "checksum": "394769ebdfa7d07db57f4f749028a8e39515683b23a8d5bf73e5dcf89b85e09f", "avgDurationSecs": 4.926, "validDurationSecs": 64.04, "totalHrs": 0, "validHrs": 0.02 }, "ba": { "duration": 1186488, "reportedSentences": 0, "clips": 291, "users": 4, "size": 8304571, "checksum": "32e0c10eba60ac729686990e25313e0c3f74fe56c42ac55667e5c1c6cf389a5e", "avgDurationSecs": 4.426, "validDurationSecs": 1234.54, 
"totalHrs": 0.33, "validHrs": 0.34 }, "gl": { "duration": 529128, "reportedSentences": 0, "clips": 161, "users": 1, "size": 3056641, "checksum": "086d970127eb786d4a6b64ba5fdbf7246e0dc14b493d0d8453a32e089d213544", "avgDurationSecs": 4.921, "validDurationSecs": 897.07, "totalHrs": 0.15, "validHrs": 0.25 }, "ug": { "duration": 48523284, "reportedSentences": 17, "clips": 7926, "users": 159, "size": 274788820, "checksum": "e98dcc27e62e672fce1a6c837ea8cca2f2a0ac70ddc07736cb426619c7a035b2", "avgDurationSecs": 6.04, "validDurationSecs": 40575.24, "totalHrs": 13.48, "validHrs": 11.28 }, "hy-AM": { "duration": 125676, "reportedSentences": 0, "clips": 20, "users": 4, "size": 764604, "checksum": "36d814191cf5aacc39a0a9afa7150e1b4650c42806868256f06e9afcd652ffcb", "avgDurationSecs": 6.111, "validDurationSecs": 166.84, "totalHrs": 0.03, "validHrs": 0.04 }, "be": { "duration": 275550588, "reportedSentences": 0, "clips": 58177, "users": 43, "size": 1612374639, "checksum": "048582051cd2b452fa540cf1de49398411209700da0b6602d7d66cc6d8eed850", "avgDurationSecs": 4.762, "validDurationSecs": 268849.94, "totalHrs": 76.54, "validHrs": 74.68 }, "ur": { "duration": 284128596, "reportedSentences": 180, "clips": 57092, "users": 109, "size": 1653470434, "checksum": "d6d3cb6ab6051475d6bb5a550e63b0e07d50ab742adae0fa2c4e09f95b59c337", "avgDurationSecs": 4.219, "validDurationSecs": 62845.39, "totalHrs": 78.93, "validHrs": 17.46 }, "gn": { "duration": 35468820, "reportedSentences": 0, "clips": 7158, "users": 25, "size": 205874061, "checksum": "6a373d7db2a5ccd8105134c4ba569dfe09f335d4b279d38984913b7437df038c", "avgDurationSecs": 4.51, "validDurationSecs": 772.78, "totalHrs": 9.86, "validHrs": 0.22 }, "sr": { "duration": 184428, "reportedSentences": 0, "clips": 48, "users": 7, "size": 1011229, "checksum": "83dfd72a51efe7a305d615ab23d92a07b0f5819875effbd8fb8931abed0515f5", "avgDurationSecs": 2.891, "validDurationSecs": 289.7, "totalHrs": 0.05, "validHrs": 0.08 }, "uz": { "duration": 227988, 
"reportedSentences": 0, "clips": 37, "users": 7, "size": 2663513, "checksum": "c4c662820a1909c3af9e5513f2f88da7072ff750c2b075883201c1ec02960553", "avgDurationSecs": 4.148, "validDurationSecs": 231.42, "totalHrs": 0.06, "validHrs": 0.07 }, "mr": { "duration": 25524, "reportedSentences": 0, "clips": 5, "users": 0, "size": 250085, "checksum": "6c32d868bd8fee5b5f9573a4f918e3a9ac315a6063eacadd9dbea966b33a4d1a", "avgDurationSecs": 6.195, "validDurationSecs": -3.71, "totalHrs": 0, "validHrs": -0.01 }, "da": { "duration": 670176, "reportedSentences": 18, "clips": 202, "users": 7, "size": 3976064, "checksum": "cd350c8b105eed54181c31ec68bc457774635c65384f986e838041fa0e6c242b", "avgDurationSecs": 4.133, "validDurationSecs": 1612.81, "totalHrs": 0.19, "validHrs": 0.44 }, "myv": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 12603, "checksum": "07d69b582270528e74aad6c7284553317999b66b45876e0700bfa123628e5d91", "avgDurationSecs": 5.774, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "nn-NO": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 7668, "checksum": "53ab1c41a02b0529743c020d6718adf6ee055315b716518fdb685a537ad9198d", "avgDurationSecs": 4.436, "validDurationSecs": 44.36, "totalHrs": 0, "validHrs": 0.02 }, "ha": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 58367, "checksum": "222f8ab1bcf5b3dd5094ecdaf1e0745289c88bfd399da5e63e143e518703f09b", "avgDurationSecs": 4.333, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ckb": { "duration": 11169108, "reportedSentences": 30, "clips": 3035, "users": 11, "size": 65743739, "checksum": "7308effe05102ea476924a3d93568b368daa9e21515e51675769b648f83be915", "avgDurationSecs": 3.903, "validDurationSecs": 3737.09, "totalHrs": 3.1, "validHrs": 1.04 }, "ml": { "duration": 4569228, "reportedSentences": 3, "clips": 1086, "users": 7, "size": 26129659, "checksum": "39e027ae3413fa85ea4b831c9a6d4bd9cbc6ddf786bbc3cfeedcfa6903ef887a", "avgDurationSecs": 
4.159, "validDurationSecs": 1023.71, "totalHrs": 1.26, "validHrs": 0.29 }, "mdf": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 3137, "checksum": "a1eb5f617f66a1b9388e5bbc46e2d7abb63e10654a68022491c0006aa75e9f24", "avgDurationSecs": 5.25, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "sw": { "duration": 65770020, "reportedSentences": 12, "clips": 11127, "users": 58, "size": 385033190, "checksum": "4100530284269f3dff0a48abcdc46c818b28f1b593ac855e188c466efcc226ab", "avgDurationSecs": 5.275, "validDurationSecs": 76144.18, "totalHrs": 18.27, "validHrs": 21.15 }, "sat": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 5732, "checksum": "1568d5ba65b8e03964571cfae19938f974650d9a13f357ec2abb569f35d6fe9d", "avgDurationSecs": 4.462, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "tig": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 1699, "checksum": "6349a2ad02b2e101862ada4b860dc4892d217b75fc99c1ea1c80f84d293eef26", "avgDurationSecs": 5.048, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ig": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 38212, "checksum": "5c217960dc04600da0464269f5ebcdfd67a29d39162e311fae6e5c9eb873e93d", "avgDurationSecs": 5.464, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "nan-tw": { "duration": 28139796, "reportedSentences": 61, "clips": 11178, "users": 70, "size": 160999865, "checksum": "830f9e8db88d199a487e59e94000a6a41a3318d0c6690d8f2c3677d8437dc2fa", "avgDurationSecs": 2.608, "validDurationSecs": 5465.13, "totalHrs": 7.82, "validHrs": 1.51 }, "mhr": { "duration": 14844276, "reportedSentences": 4, "clips": 3402, "users": 7, "size": 86663434, "checksum": "f43afead24ee0c328646e836ddfb63adc50c9d19c7f74163ae68c25a5438aa81", "avgDurationSecs": 4.645, "validDurationSecs": 95334.66, "totalHrs": 4.12, "validHrs": 26.48 }, "bn": { "duration": 9374292, "reportedSentences": 15, "clips": 1751, "users": 24, "size": 
61397642, "checksum": "fcc35d8592e221196bb9849f9199dc91f4743b320a76ef524f175b0e9d0bbd9a", "avgDurationSecs": 4.365, "validDurationSecs": 386.87, "totalHrs": 2.6, "validHrs": 0.11 }, "tok": { "duration": 4958712, "reportedSentences": 2, "clips": 907, "users": 6, "size": 29215608, "checksum": "e77f5142809671a461db17abc0ff9545658ca5c2f899507002c9f529cb6b8915", "avgDurationSecs": 3.962, "validDurationSecs": 3561.61, "totalHrs": 1.38, "validHrs": 0.99 }, "yue": { "duration": 1018260, "reportedSentences": 7, "clips": 272, "users": 10, "size": 6343599, "checksum": "34cd9cac6e6100421cbaadfcda8355c8bb61c7a703aafb98e66a31eb52ea4714", "avgDurationSecs": 4.038, "validDurationSecs": 1161.06, "totalHrs": 0.29, "validHrs": 0.32 }, "sah": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 26346, "checksum": "edcfd3fe431ee1fcc587124066aaa6a478d2753ac67e356a34cdf446e28bd825", "avgDurationSecs": 6.011, "validDurationSecs": 36.07, "totalHrs": 0, "validHrs": 0.01 }, "mk": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 1974, "checksum": "0089f5f33f9216327002ac94edc6d65282dd68075f88e9ca61175e737f494cd8", "avgDurationSecs": 5.292, "validDurationSecs": 84.67, "totalHrs": 0, "validHrs": 0.02 }, "sc": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 11304, "checksum": "ccf4ae936fa2f82f85ada0af1f76be68ec5511cb567a8e422efb19c49189b157", "avgDurationSecs": 4.486, "validDurationSecs": 53.84, "totalHrs": 0, "validHrs": 0.01 }, "skr": { "duration": 102636, "reportedSentences": 0, "clips": 21, "users": 3, "size": 630434, "checksum": "623f70331830bfa20fdf9d753604a50f37e04e91cb461fd81d489e56a81e09a2", "avgDurationSecs": 4.137, "validDurationSecs": 539, "totalHrs": 0.03, "validHrs": 0.15 }, "ti": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 273, "checksum": "80a08edc10bd2abfe37cc631492bbba66368ddbd9050dac7ce04045c74edbf4a", "avgDurationSecs": 5.779, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 
}, "mrj": { "duration": 93456, "reportedSentences": 0, "clips": 22, "users": 2, "size": 785016, "checksum": "f15d3732bb89a59954502dc3b25d5f306d2e52b843a1baaa05a11530f0120dab", "avgDurationSecs": 4.157, "validDurationSecs": 11872.98, "totalHrs": 0.03, "validHrs": 3.3 }, "tw": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 1390, "checksum": "7ccad7a31ab09fbe9e8620f7112c4eec411f6eb07c270b7b15c96433dc5e990c", "avgDurationSecs": 3.983, "validDurationSecs": 15.93, "totalHrs": 0, "validHrs": 0.01 }, "ko": { "duration": 916704, "reportedSentences": 0, "clips": 167, "users": 4, "size": 5341611, "checksum": "f338e35b1110a24fc342ca6c6aad09d2ffb22e4366345584569d30d4588e6a94", "avgDurationSecs": 6.116, "validDurationSecs": 584.2, "totalHrs": 0.25, "validHrs": 0.16 }, "yo": { "duration": 150300, "reportedSentences": 0, "clips": 25, "users": 2, "size": 909902, "checksum": "d0db7c9a12fff5dd43877972220c6e086946ef825f5bc9915385987ac0d2d456", "avgDurationSecs": 5.948, "validDurationSecs": 114.13, "totalHrs": 0.04, "validHrs": 0.03 }, "oc": { "duration": 688680, "reportedSentences": 0, "clips": 139, "users": 0, "size": 4105815, "checksum": "60df2b394e906559d9fe34069e81e52f4aab6b64f64b645b5e635e7e3d991960", "avgDurationSecs": 4.875, "validDurationSecs": 620.48, "totalHrs": 0.2, "validHrs": 0.17 }, "tk": { "duration": 891576, "reportedSentences": 1, "clips": 166, "users": 11, "size": 5114097, "checksum": "305cd2f6ed4d6193566adbd5ca5b3051032fb664e0ffcea59506c6e13c059e52", "avgDurationSecs": 5.384, "validDurationSecs": 257.35, "totalHrs": 0.24, "validHrs": 0.07 }, "ps": { "duration": 589860, "reportedSentences": 0, "clips": 141, "users": 2, "size": 3472203, "checksum": "53e87e3ad75e8eccf49926d728fb41d44b2415cf04d2d063e97728238c467391", "avgDurationSecs": 4.844, "validDurationSecs": 1320.4, "totalHrs": 0.17, "validHrs": 0.37 }, "sq": { "duration": 134064, "reportedSentences": 0, "clips": 30, "users": 3, "size": 659429, "checksum": 
"84df66e78f546c6c64b022ed4d74e9f2185ae56165fee8be40f3889fdb51d604", "avgDurationSecs": 4.601, "validDurationSecs": 115.72, "totalHrs": 0.04, "validHrs": 0.03 }, "ne-NP": { "duration": 548640, "reportedSentences": 0, "clips": 134, "users": 2, "size": 3219182, "checksum": "cfadd5c287274655118d97fd32b9b503013aa1232d579277addf6e7dc09585f9", "avgDurationSecs": 3.85, "validDurationSecs": 158.96, "totalHrs": 0.15, "validHrs": 0.04 }, "az": { "duration": 32616, "clips": 5, "users": 1, "size": 195314, "checksum": "76c5a611263f086ed025615651e78d2c80c2db8109a96838687bd9db2949509f", "avgDurationSecs": 5.381, "validDurationSecs": 12.57, "totalHrs": 0.01, "validHrs": 0.01 }, "am": { "duration": 200124, "clips": 32, "users": 5, "size": 1190211, "checksum": "6908c8e32862cd2b547777cc6142dc3dfdad41d6be954eceb59d549e9a6308eb", "avgDurationSecs": 5.956, "validDurationSecs": 85.14, "totalHrs": 0.05, "validHrs": 0.02 }, "vot": { "duration": 0, "clips": 0, "users": 0, "size": 2302, "checksum": "93876ec5db2db02cf7bd6762d40cd07634c830aa7b1c654fd99739d70ce5efc8", "avgDurationSecs": 2.408, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ast": { "duration": 0, "clips": 0, "users": 0, "size": 7262, "checksum": "362476fbd65a5ae7072c69a000d90d666421eb385dcc67950dfb9ea1f86f451c", "avgDurationSecs": 4.542, "validDurationSecs": 4.54, "totalHrs": 0, "validHrs": 0 }, "quy": { "duration": 0, "clips": 0, "users": 0, "size": 176, "checksum": "a4229627fa6852b10828185034909c00d536a2620a9ca8d2b83a069dbb68278e", "avgDurationSecs": 6.132, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "lo": { "duration": 500760, "clips": 80, "users": 1, "size": 2942523, "checksum": "15bc7cab70f9bf24a09d86722b7ff657fe2ac14f94cedb551c84c015e282dff9", "avgDurationSecs": 6.48, "validDurationSecs": 361.76, "totalHrs": 0.14, "validHrs": 0.1 }, "dyu": { "duration": 226764, "clips": 34, "users": 5, "size": 1332597, "checksum": "0582cec5fc25f2f32c8f063ff583de357ee3d6d8fc7d5ba55ef592ba0ca5a97b", "avgDurationSecs": 
6.657, "validDurationSecs": 213.45, "totalHrs": 0.06, "validHrs": 0.06 }, "is": { "duration": 0, "clips": 0, "users": 0, "size": 202, "checksum": "d96a8f64d7b6f190fd5a034582a98297535f808722c2eb6c82aa0b39e4ebce68", "avgDurationSecs": 7.334, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "zgh": { "duration": 662796, "clips": 206, "users": 2, "size": 3618565, "checksum": "69b3d5adbe69ba2500f152869f0a2165032d0f8b6b942c197c1c3bff26d50578", "avgDurationSecs": 3.306, "validDurationSecs": 605.43, "totalHrs": 0.19, "validHrs": 0.17 } }, "totalDuration": 2278545516, "totalValidDurationSecs": 1830044, "totalHrs": 633, "totalValidHrs": 508 } ================================================ FILE: datasets/scripted-speech/cv-corpus-16.0-2023-12-06.json ================================================ { "locales": { "ab": { "buckets": { "validated": 41982, "invalidated": 5278, "dev": 9160, "test": 9116, "train": 21027, "other": 12964 }, "duration": 303854076, "reportedSentences": 225, "clips": 60224, "splits": { "accent": {}, "age": { "": 0.18, "twenties": 0.19, "thirties": 0.13, "teens": 0.28, "fourties": 0.09, "fifties": 0.06, "sixties": 0.05, "seventies": 0.01, "eighties": 0.01, "nineties": 0 }, "gender": { "": 0.18, "male": 0.18, "female": 0.64, "other": 0 } }, "users": 401, "size": 1735604914, "checksum": "6060e49d24556545ef2df2e1d0296a01b1da86d2e4a5e76cc75779fc1417df90", "avgDurationSecs": 5.045, "validDurationSecs": 211815.918, "totalHrs": 84.4, "validHrs": 58.83 }, "af": { "buckets": { "validated": 86, "invalidated": 138, "dev": 5, "test": 28, "train": 53, "other": 50 }, "duration": 1231344, "reportedSentences": 0, "clips": 274, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0.64, "thirties": 0, "teens": 0.02, "fourties": 0.11, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.51, "male": 0.49, "female": 0, "other": 0 } }, "users": 14, "size": 7153864, "checksum": 
"8b5dfea788a4fd772efa5033013be022b1e9fb82840233e441508d44102a2993", "avgDurationSecs": 4.494, "validDurationSecs": 386.48, "totalHrs": 0.34, "validHrs": 0.1 }, "am": { "buckets": { "validated": 753, "invalidated": 22, "dev": 247, "test": 158, "train": 348, "other": 448 }, "duration": 7353648, "reportedSentences": 11, "clips": 1223, "splits": { "accent": {}, "age": { "": 0.12, "twenties": 0.12, "thirties": 0.75, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.12, "male": 0.88, "female": 0, "other": 0 } }, "users": 20, "size": 43146099, "checksum": "ff46ab9f61a4b7a0a80dbd5be936dda1d747149c3cbf7870f8ba88e815e50edf", "avgDurationSecs": 6.013, "validDurationSecs": 4527.634, "totalHrs": 2.04, "validHrs": 1.25 }, "ar": { "buckets": { "validated": 77938, "invalidated": 15078, "dev": 10421, "test": 10466, "train": 28381, "other": 41546 }, "duration": 559110834, "reportedSentences": 2119, "clips": 134562, "splits": { "accent": {}, "age": { "": 0.55, "twenties": 0.29, "thirties": 0.11, "teens": 0.03, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.55, "male": 0.28, "female": 0.17, "other": 0 } }, "users": 1549, "size": 3313141663, "checksum": "1d9cd14f2bd095ba5051528c96f9eb22cb7c1e86234109824a2d25cae51118e4", "avgDurationSecs": 4.155, "validDurationSecs": 323835.705, "totalHrs": 155.3, "validHrs": 89.95 }, "as": { "buckets": { "validated": 1717, "invalidated": 208, "dev": 510, "test": 551, "train": 656, "other": 146 }, "duration": 11784629, "reportedSentences": 10, "clips": 2071, "splits": { "accent": {}, "age": { "": 0.6, "twenties": 0.36, "thirties": 0.04, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.59, "male": 0.41, "female": 0, "other": 0 } }, "users": 45, "size": 73641687, "checksum": "1fa0c81538ffcf1ce3e64de1367df2e5c854fdcd5519593f3ab3f875c80dcf35", 
"avgDurationSecs": 5.69, "validDurationSecs": 9770.26, "totalHrs": 3.27, "validHrs": 2.71 }, "ast": { "buckets": { "validated": 643, "invalidated": 16, "dev": 112, "test": 157, "train": 372, "other": 845 }, "duration": 6680016, "reportedSentences": 0, "clips": 1504, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.02, "thirties": 0.13, "teens": 0.04, "fourties": 0.31, "fifties": 0.23, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male": 0.58, "female": 0.16, "other": 0 } }, "users": 27, "size": 37776319, "checksum": "a4418ed23d5d88a1d4e0b5da5059e281a4e1e0ea5ebe0c8ef9e190b28ad15dc4", "avgDurationSecs": 4.442, "validDurationSecs": 2855.885, "totalHrs": 1.85, "validHrs": 0.79 }, "az": { "buckets": { "validated": 102, "invalidated": 37, "dev": 31, "test": 32, "train": 39, "other": 162 }, "duration": 1614492, "reportedSentences": 5, "clips": 301, "splits": { "accent": {}, "age": { "": 0.44, "twenties": 0.34, "thirties": 0.21, "teens": 0, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.44, "male": 0.56, "female": 0, "other": 0 } }, "users": 26, "size": 9303562, "checksum": "2663480efab0faa1399566f6cc733efcc74acb059a5fd070eb443293388b54a0", "avgDurationSecs": 5.364, "validDurationSecs": 547.104, "totalHrs": 0.44, "validHrs": 0.15 }, "ba": { "buckets": { "validated": 209498, "invalidated": 7931, "dev": 14524, "test": 14551, "train": 119107, "other": 7786 }, "duration": 963020988, "reportedSentences": 867, "clips": 225215, "splits": { "accent": {}, "age": { "": 0.3, "twenties": 0.17, "thirties": 0.17, "teens": 0.04, "fourties": 0.06, "fifties": 0.06, "sixties": 0.19, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male": 0.3, "female": 0.4, "other": 0 } }, "users": 925, "size": 5401610765, "checksum": "c397114fd0918e839a2f599537b3498b967bc24ad8a064f2665cb8e2545e0023", "avgDurationSecs": 4.276, "validDurationSecs": 895814.981, "totalHrs": 
267.5, "validHrs": 248.83 }, "bas": { "buckets": { "validated": 1757, "invalidated": 508, "dev": 457, "test": 537, "train": 763, "other": 26 }, "duration": 10139220, "reportedSentences": 8, "clips": 2291, "splits": { "accent": {}, "age": { "": 0.98, "twenties": 0.01, "thirties": 0, "teens": 0.01, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.98, "male": 0.01, "female": 0.02, "other": 0 } }, "users": 36, "size": 56346285, "checksum": "c29d8afa5763dd4c8254577333d8b597c0f8fb571619149da7200c5a1dfb445f", "avgDurationSecs": 4.426, "validDurationSecs": 7775.91, "totalHrs": 2.81, "validHrs": 2.15 }, "be": { "buckets": { "validated": 1245806, "invalidated": 35017, "dev": 15880, "test": 15878, "train": 347614, "other": 5600 }, "duration": 6096786120, "reportedSentences": 3136, "clips": 1286423, "splits": { "accent": {}, "age": { "": 0.86, "twenties": 0.04, "thirties": 0.05, "teens": 0.01, "fourties": 0.04, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.86, "male": 0.06, "female": 0.08, "other": 0 } }, "users": 8249, "size": 34609062895, "checksum": "6c8e226a26e7fd7acbd4f4cd490b614b283dc9908ba660912bad68b4ad037247", "avgDurationSecs": 4.739, "validDurationSecs": 5904288.659, "totalHrs": 1693.55, "validHrs": 1640.08 }, "bg": { "buckets": { "validated": 9154, "invalidated": 723, "dev": 2221, "test": 2647, "train": 4270, "other": 3594 }, "duration": 73053648, "reportedSentences": 187, "clips": 13471, "splits": { "accent": {}, "age": { "": 0.37, "twenties": 0.11, "thirties": 0.28, "teens": 0, "fourties": 0.23, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male": 0.59, "female": 0.04, "other": 0 } }, "users": 117, "size": 426455575, "checksum": "603819d87834b14b2019c72baa465ec54a862457128bc5813f6b5519dd6d7c9c", "avgDurationSecs": 5.423, "validDurationSecs": 49642.424, "totalHrs": 20.29, "validHrs": 13.78 }, "bn": { 
"buckets": { "validated": 44021, "invalidated": 7792, "dev": 9314, "test": 9314, "train": 21158, "other": 997739 }, "duration": 4578886548, "reportedSentences": 1887, "clips": 1049552, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0.67, "thirties": 0.03, "teens": 0.07, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.22, "male": 0.54, "female": 0.23, "other": 0 } }, "users": 22901, "size": 26364886622, "checksum": "8095091077507d63ce4a9f9462993b8cd60884256875679d93035118ccd4421f", "avgDurationSecs": 4.363, "validDurationSecs": 192050.67, "totalHrs": 1271.91, "validHrs": 53.34 }, "br": { "buckets": { "validated": 14564, "invalidated": 1092, "dev": 2179, "test": 2207, "train": 2591, "other": 14286 }, "duration": 95012791, "reportedSentences": 285, "clips": 29942, "splits": { "accent": {}, "age": { "": 0.3, "twenties": 0.33, "thirties": 0.09, "teens": 0.01, "fourties": 0.06, "fifties": 0.05, "sixties": 0.14, "seventies": 0.04, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male": 0.67, "female": 0.02, "other": 0 } }, "users": 203, "size": 663584779, "checksum": "028e9a239baa8f7c0d480f8b124c1b25fcd68503a5bfa0f1e15e6cee2a4674d1", "avgDurationSecs": 3.173, "validDurationSecs": 46214.892, "totalHrs": 26.39, "validHrs": 12.83 }, "ca": { "buckets": { "validated": 1792699, "invalidated": 108722, "dev": 16402, "test": 16402, "train": 1144932, "other": 466996 }, "duration": 12597142734, "reportedSentences": 6340, "clips": 2368417, "splits": { "accent": {}, "age": { "": 0.3, "twenties": 0.04, "thirties": 0.05, "teens": 0.01, "fourties": 0.12, "fifties": 0.17, "sixties": 0.28, "seventies": 0.03, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male": 0.5, "female": 0.2, "other": 0 } }, "users": 35619, "size": 75858638392, "checksum": "fc34f4479b1b87817ab2fca4be85012ac8c9ec1b1a9d318ecfffa35e1f3035fe", "avgDurationSecs": 5.319, "validDurationSecs": 9535012.281, "totalHrs": 3499.2, "validHrs": 
2648.61 }, "ckb": { "buckets": { "validated": 103648, "invalidated": 7512, "dev": 4957, "test": 4951, "train": 7067, "other": 10303 }, "duration": 471696048, "reportedSentences": 2419, "clips": 121463, "splits": { "accent": {}, "age": { "": 0.33, "twenties": 0.49, "thirties": 0.11, "teens": 0.02, "fourties": 0.02, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.32, "male": 0.62, "female": 0.06, "other": 0 } }, "users": 1310, "size": 2560816666, "checksum": "b1177e50c0fe9aa96f9b70e61018faf20f1e8488706f3cc99db4099180a91593", "avgDurationSecs": 3.883, "validDurationSecs": 402512.304, "totalHrs": 131.02, "validHrs": 111.8 }, "cnh": { "buckets": { "validated": 2458, "invalidated": 437, "dev": 761, "test": 763, "train": 817, "other": 3292 }, "duration": 21751656, "reportedSentences": 9, "clips": 6187, "splits": { "accent": {}, "age": { "": 0.48, "twenties": 0.4, "thirties": 0.07, "teens": 0.01, "fourties": 0.01, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.48, "male": 0.3, "female": 0.21, "other": 0 } }, "users": 300, "size": 167579883, "checksum": "a0ec0205ffc140d60fa8588fd299499ed6e816d81899f38328f62a453d33b2cc", "avgDurationSecs": 3.516, "validDurationSecs": 8641.599, "totalHrs": 6.04, "validHrs": 2.4 }, "cs": { "buckets": { "validated": 60944, "invalidated": 2173, "dev": 9053, "test": 9017, "train": 19963, "other": 154884 }, "duration": 939359343, "reportedSentences": 929, "clips": 218001, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.18, "thirties": 0.47, "teens": 0.02, "fourties": 0.07, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.24, "male": 0.53, "female": 0.23, "other": 0 } }, "users": 960, "size": 5749464342, "checksum": "0eb70e60960358b2c0209124abd33f231fe636ce420f9468421aaa0513f304d3", "avgDurationSecs": 4.309, "validDurationSecs": 262605.749, "totalHrs": 260.93, "validHrs": 72.94 }, "cv": { 
"buckets": { "validated": 17318, "invalidated": 2138, "dev": 1260, "test": 1290, "train": 1436, "other": 154 }, "duration": 99019848, "reportedSentences": 144, "clips": 19610, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0.51, "thirties": 0.01, "teens": 0.19, "fourties": 0.06, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2, "male": 0.54, "female": 0.27, "other": 0 } }, "users": 112, "size": 672449103, "checksum": "9a2a865cc7d4ff8329a602cb188ff12e3002daaf8698ed9c2b91bd65e0a67612", "avgDurationSecs": 5.049, "validDurationSecs": 87446.493, "totalHrs": 27.5, "validHrs": 24.29 }, "cy": { "buckets": { "validated": 89897, "invalidated": 4439, "dev": 5345, "test": 5354, "train": 7903, "other": 19930 }, "duration": 558444537, "reportedSentences": 172, "clips": 114266, "splits": { "accent": {}, "age": { "": 0.43, "twenties": 0.13, "thirties": 0.09, "teens": 0.02, "fourties": 0.16, "fifties": 0.09, "sixties": 0.06, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.41, "male": 0.33, "female": 0.25, "other": 0.01 } }, "users": 1833, "size": 4076506864, "checksum": "51a1f7f868c6b77415bc6e837cb0faaf113af556be2967180e309ce6788c0d5e", "avgDurationSecs": 4.887, "validDurationSecs": 439347.562, "totalHrs": 155.12, "validHrs": 122.04 }, "da": { "buckets": { "validated": 10137, "invalidated": 400, "dev": 2105, "test": 2511, "train": 3462, "other": 417 }, "duration": 45040500, "reportedSentences": 422, "clips": 10954, "splits": { "accent": {}, "age": { "": 0.32, "twenties": 0.19, "thirties": 0.25, "teens": 0.01, "fourties": 0.2, "fifties": 0.03, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.32, "male": 0.6, "female": 0.08, "other": 0 } }, "users": 254, "size": 258709384, "checksum": "8abe54e66c87bc4f0d3bb04cb5445cf4dc94632907c8229b9efdf76914fcfbaf", "avgDurationSecs": 4.112, "validDurationSecs": 41681.171, "totalHrs": 12.51, "validHrs": 11.57 }, "de": { "buckets": { 
"validated": 903475, "invalidated": 54024, "dev": 16173, "test": 16173, "train": 575350, "other": 14332 }, "duration": 5049626489, "reportedSentences": 10243, "clips": 971831, "splits": { "accent": {}, "age": { "": 0.32, "twenties": 0.18, "thirties": 0.16, "teens": 0.03, "fourties": 0.17, "fifties": 0.11, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.32, "male": 0.59, "female": 0.08, "other": 0.01 } }, "users": 18687, "size": 35182190027, "checksum": "ccc03666e9aca2fb1fb7451eeee654edb845997e5d6c21aed29f24879fb6a8d5", "avgDurationSecs": 5.196, "validDurationSecs": 4694449.233, "totalHrs": 1402.67, "validHrs": 1304.01 }, "dv": { "buckets": { "validated": 27562, "invalidated": 1701, "dev": 2210, "test": 2210, "train": 2697, "other": 16384 }, "duration": 230942619, "reportedSentences": 56, "clips": 45647, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0.17, "thirties": 0.36, "teens": 0.01, "fourties": 0.21, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.21, "male": 0.29, "female": 0.5, "other": 0 } }, "users": 350, "size": 1461063993, "checksum": "8847e8c3aac0efa3231526db579e05e78b32f0c35f28ff87714fd9fa75242eec", "avgDurationSecs": 5.059, "validDurationSecs": 139444.881, "totalHrs": 64.15, "validHrs": 38.73 }, "dyu": { "buckets": { "validated": 128, "invalidated": 6, "dev": 37, "test": 36, "train": 51, "other": 101 }, "duration": 1443960, "reportedSentences": 0, "clips": 235, "splits": { "accent": {}, "age": { "": 0.15, "twenties": 0.83, "thirties": 0, "teens": 0.02, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.15, "male": 0.47, "female": 0.38, "other": 0 } }, "users": 20, "size": 8473205, "checksum": "8ef2539b7b39633b87a71ace1dbfd9461546412877a5d697d9a0c5220c5b7f1e", "avgDurationSecs": 6.145, "validDurationSecs": 786.497, "totalHrs": 0.4, "validHrs": 0.21 }, "el": { "buckets": { "validated": 15938, 
"invalidated": 826, "dev": 1697, "test": 1701, "train": 1916, "other": 11293 }, "duration": 112289218, "reportedSentences": 70, "clips": 28057, "splits": { "accent": {}, "age": { "": 0.33, "twenties": 0.12, "thirties": 0.35, "teens": 0.01, "fourties": 0.16, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male": 0.62, "female": 0.03, "other": 0.02 } }, "users": 387, "size": 747477250, "checksum": "197a6ba7cddcaac2044646118fc0faf5d932f0b691e81b17bdf11e9747c2eb40", "avgDurationSecs": 4.002, "validDurationSecs": 63786.775, "totalHrs": 31.19, "validHrs": 17.71 }, "en": { "buckets": { "validated": 1779770, "invalidated": 288305, "dev": 16390, "test": 16390, "train": 1088096, "other": 313197 }, "duration": 12374969047, "reportedSentences": 7205, "clips": 2381272, "splits": { "accent": {}, "age": { "": 0.36, "twenties": 0.25, "thirties": 0.14, "teens": 0.06, "fourties": 0.09, "fifties": 0.05, "sixties": 0.04, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.36, "male": 0.45, "female": 0.17, "other": 0.02 } }, "users": 90671, "size": 86756175841, "checksum": "5e68caa8382d5771a3a17ea22ae46473ab7287cf89504f9d9a894900f4d06d43", "avgDurationSecs": 5.197, "validDurationSecs": 9249089.84, "totalHrs": 3437.49, "validHrs": 2569.19 }, "eo": { "buckets": { "validated": 849940, "invalidated": 127401, "dev": 14917, "test": 14915, "train": 144111, "other": 167310 }, "duration": 6841056391, "reportedSentences": 2263, "clips": 1144651, "splits": { "accent": {}, "age": { "": 0.19, "twenties": 0.57, "thirties": 0.12, "teens": 0.05, "fourties": 0.04, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0.01, "nineties": 0 }, "gender": { "": 0.2, "male": 0.69, "female": 0.11, "other": 0 } }, "users": 1725, "size": 40828700452, "checksum": "b33f6bab336da25ef0b774f57cb1c23517df11e6191ef1c3eda2da8a12e1ccfc", "avgDurationSecs": 5.977, "validDurationSecs": 5079703.306, "totalHrs": 1900.29, "validHrs": 1411.02 }, "es": 
{ "buckets": { "validated": 403557, "invalidated": 82556, "dev": 15832, "test": 15832, "train": 325691, "other": 1159275 }, "duration": 7935345234, "reportedSentences": 2511, "clips": 1645388, "splits": { "accent": {}, "age": { "": 0.13, "twenties": 0.54, "thirties": 0.09, "teens": 0.08, "fourties": 0.03, "fifties": 0.04, "sixties": 0.09, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.13, "male": 0.55, "female": 0.32, "other": 0 } }, "users": 25728, "size": 49971259587, "checksum": "4b35d41a1d29f9e794055b4304ec874d4e2d713c3d56a84aaf2d3832a95d2320", "avgDurationSecs": 4.823, "validDurationSecs": 1946266.848, "totalHrs": 2204.26, "validHrs": 540.62 }, "et": { "buckets": { "validated": 24358, "invalidated": 7473, "dev": 2653, "test": 2653, "train": 3157, "other": 212 }, "duration": 215805926, "reportedSentences": 548, "clips": 32043, "splits": { "accent": {}, "age": { "": 0.19, "twenties": 0.69, "thirties": 0.08, "teens": 0, "fourties": 0.03, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.19, "male": 0.55, "female": 0.26, "other": 0 } }, "users": 898, "size": 1453438300, "checksum": "99c3e9c31c74d759c899676bdc7abb556fc4e8529efd9973194ee2c0d51f8a28", "avgDurationSecs": 6.735, "validDurationSecs": 164048.333, "totalHrs": 59.94, "validHrs": 45.56 }, "eu": { "buckets": { "validated": 138311, "invalidated": 18388, "dev": 12515, "test": 12515, "train": 50492, "other": 229096 }, "duration": 2098456311, "reportedSentences": 966, "clips": 385795, "splits": { "accent": {}, "age": { "": 0.44, "twenties": 0.2, "thirties": 0.06, "teens": 0.01, "fourties": 0.18, "fifties": 0.08, "sixties": 0.03, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.44, "male": 0.26, "female": 0.29, "other": 0.01 } }, "users": 9090, "size": 12945171553, "checksum": "a64e6480267e7c82d7e29a0e73c85d872b3ec196f6b348568bcdc08a98325f4a", "avgDurationSecs": 5.439, "validDurationSecs": 752315.584, "totalHrs": 582.9, "validHrs": 
208.97 }, "fa": { "buckets": { "validated": 327796, "invalidated": 14527, "dev": 10547, "test": 10547, "train": 28810, "other": 33390 }, "duration": 1473154388, "reportedSentences": 2405, "clips": 375713, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.32, "thirties": 0.37, "teens": 0.03, "fourties": 0.02, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.22, "male": 0.71, "female": 0.06, "other": 0 } }, "users": 4410, "size": 10698619442, "checksum": "56cdee3e96ed77e48567936e82843fa4d2b31dca0f292d652b6c5677536cfd1b", "avgDurationSecs": 3.921, "validDurationSecs": 1285273.908, "totalHrs": 409.2, "validHrs": 357.02 }, "fi": { "buckets": { "validated": 9283, "invalidated": 257, "dev": 1674, "test": 1763, "train": 2139, "other": 5834 }, "duration": 70929363, "reportedSentences": 66, "clips": 15374, "splits": { "accent": {}, "age": { "": 0.33, "twenties": 0.11, "thirties": 0.2, "teens": 0.01, "fourties": 0.32, "fifties": 0.04, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male": 0.35, "female": 0.32, "other": 0 } }, "users": 218, "size": 421659436, "checksum": "bb67c394c5386fb77df753a38befe988744b1d275d5d79f57dc30a7d0040f811", "avgDurationSecs": 4.614, "validDurationSecs": 42827.974, "totalHrs": 19.7, "validHrs": 11.89 }, "fr": { "buckets": { "validated": 705183, "invalidated": 61846, "dev": 16141, "test": 16141, "train": 537615, "other": 32790 }, "duration": 4005163306, "reportedSentences": 7107, "clips": 799819, "splits": { "accent": {}, "age": { "": 0.37, "twenties": 0.17, "thirties": 0.15, "teens": 0.03, "fourties": 0.14, "fifties": 0.09, "sixties": 0.03, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male": 0.58, "female": 0.11, "other": 0.01 } }, "users": 18511, "size": 28175141203, "checksum": "51366fc068f7a5d1e4d79b5fc3d5990935b564b349de558e43b50a7075e821ac", "avgDurationSecs": 5.008, "validDurationSecs": 3531265.293, "totalHrs": 
1112.54, "validHrs": 980.9 }, "fy-NL": { "buckets": { "validated": 50805, "invalidated": 3951, "dev": 3169, "test": 3170, "train": 3923, "other": 108032 }, "duration": 760765323, "reportedSentences": 537, "clips": 162788, "splits": { "accent": {}, "age": { "": 0.58, "twenties": 0.04, "thirties": 0.1, "teens": 0, "fourties": 0.06, "fifties": 0.1, "sixties": 0.12, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.58, "male": 0.12, "female": 0.3, "other": 0 } }, "users": 2041, "size": 4556119800, "checksum": "284393fad5d49dca5fb074a46644e0f8a38a73400a8f7d73e7ec65c66a83ec7f", "avgDurationSecs": 4.673, "validDurationSecs": 237429.554, "totalHrs": 211.32, "validHrs": 65.95 }, "ga-IE": { "buckets": { "validated": 5470, "invalidated": 936, "dev": 514, "test": 517, "train": 538, "other": 5253 }, "duration": 37015731, "reportedSentences": 29, "clips": 11659, "splits": { "accent": {}, "age": { "": 0.34, "twenties": 0.22, "thirties": 0.31, "teens": 0.01, "fourties": 0.07, "fifties": 0.04, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.34, "male": 0.54, "female": 0.12, "other": 0 } }, "users": 188, "size": 252181781, "checksum": "b7ca4d8dbce7297945cdb81c569956c8511d555f72ed4895bb0d852c33c90055", "avgDurationSecs": 3.175, "validDurationSecs": 17366.502, "totalHrs": 10.28, "validHrs": 4.82 }, "gl": { "buckets": { "validated": 28488, "invalidated": 2142, "dev": 7372, "test": 7357, "train": 13406, "other": 17404 }, "duration": 228079692, "reportedSentences": 366, "clips": 48034, "splits": { "accent": {}, "age": { "": 0.33, "twenties": 0.19, "thirties": 0.17, "teens": 0.02, "fourties": 0.19, "fifties": 0.09, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male": 0.36, "female": 0.27, "other": 0 } }, "users": 1035, "size": 1309986090, "checksum": "66340d5d53acdd4c6de2026dbf25a40fb5eabc7c6bb1fa1c851376f27ae788e6", "avgDurationSecs": 4.748, "validDurationSecs": 135269.481, "totalHrs": 63.35, 
"validHrs": 37.57 }, "gn": { "buckets": { "validated": 2907, "invalidated": 162, "dev": 360, "test": 855, "train": 1570, "other": 18808 }, "duration": 99216036, "reportedSentences": 36, "clips": 21877, "splits": { "accent": {}, "age": { "": 0.13, "twenties": 0.58, "thirties": 0.19, "teens": 0.02, "fourties": 0.08, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.13, "male": 0.38, "female": 0.49, "other": 0 } }, "users": 164, "size": 567524456, "checksum": "0f91cc76b30c394a363ad93fae1a5ecf3fd11ad17c16cce95adff2dd0093f5e0", "avgDurationSecs": 4.535, "validDurationSecs": 13183.755, "totalHrs": 27.56, "validHrs": 3.66 }, "ha": { "buckets": { "validated": 3280, "invalidated": 169, "dev": 582, "test": 660, "train": 1926, "other": 6668 }, "duration": 43814016, "reportedSentences": 33, "clips": 10117, "splits": { "accent": {}, "age": { "": 0.18, "twenties": 0.12, "thirties": 0.68, "teens": 0, "fourties": 0, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.16, "male": 0.59, "female": 0.25, "other": 0 } }, "users": 40, "size": 256226137, "checksum": "20e11292df1866d00997084dfac14b6fcc6758c3de04ccf60cec2747cfc28d5a", "avgDurationSecs": 4.331, "validDurationSecs": 14204.801, "totalHrs": 12.17, "validHrs": 3.94 }, "he": { "buckets": { "validated": 925, "invalidated": 92, "dev": 226, "test": 245, "train": 284, "other": 1780 }, "duration": 12543840, "reportedSentences": 23, "clips": 2797, "splits": { "accent": {}, "age": { "": 0.01, "twenties": 0.77, "thirties": 0.16, "teens": 0, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0.05, "nineties": 0 }, "gender": { "": 0.01, "male": 0.98, "female": 0.01, "other": 0 } }, "users": 14, "size": 72317009, "checksum": "1329717c6a625b8bd4fe61428f56e8c23de2e3e3b871872cd3c92f3b0d825041", "avgDurationSecs": 4.485, "validDurationSecs": 4148.392, "totalHrs": 3.48, "validHrs": 1.15 }, "hi": { "buckets": { "validated": 10248, 
"invalidated": 765, "dev": 2423, "test": 3107, "train": 4661, "other": 3738 }, "duration": 72197520, "reportedSentences": 138, "clips": 14751, "splits": { "accent": {}, "age": { "": 0.34, "twenties": 0.34, "thirties": 0.25, "teens": 0.01, "fourties": 0.03, "fifties": 0.02, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.34, "male": 0.62, "female": 0.04, "other": 0 } }, "users": 403, "size": 425984403, "checksum": "1a3a333a23c8aea39aabfcb6c3c6fb30d6c873ea67631058f6539e9bb28c757b", "avgDurationSecs": 4.894, "validDurationSecs": 50157.968, "totalHrs": 20.05, "validHrs": 13.93 }, "hsb": { "buckets": { "validated": 1424, "invalidated": 247, "dev": 172, "test": 444, "train": 808, "other": 94 }, "duration": 10872468, "reportedSentences": 87, "clips": 1765, "splits": { "accent": {}, "age": { "": 0.18, "twenties": 0.1, "thirties": 0.1, "teens": 0.04, "fourties": 0.52, "fifties": 0.03, "sixties": 0, "seventies": 0.03, "eighties": 0, "nineties": 0 }, "gender": { "": 0.18, "male": 0.82, "female": 0, "other": 0 } }, "users": 21, "size": 83668562, "checksum": "69b3e623ff6a617c5daa1fd8545cf94a34837c8f7a9720810541b4254dce29af", "avgDurationSecs": 6.16, "validDurationSecs": 8771.895, "totalHrs": 3.02, "validHrs": 2.43 }, "hu": { "buckets": { "validated": 59654, "invalidated": 3051, "dev": 11312, "test": 11418, "train": 36496, "other": 49874 }, "duration": 612475057, "reportedSentences": 443, "clips": 112579, "splits": { "accent": {}, "age": { "": 0.29, "twenties": 0.12, "thirties": 0.13, "teens": 0.02, "fourties": 0.12, "fifties": 0.3, "sixties": 0.02, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male": 0.25, "female": 0.45, "other": 0.01 } }, "users": 1611, "size": 3573583247, "checksum": "5632933d75c458fc10cdbed8a78ed7e09b1f4ac22a14aa9d42d1aa9f22497353", "avgDurationSecs": 5.44, "validDurationSecs": 324541.762, "totalHrs": 170.13, "validHrs": 90.15 }, "hy-AM": { "buckets": { "validated": 8738, "invalidated": 694, 
"dev": 2502, "test": 2692, "train": 3491, "other": 6215 }, "duration": 84972096, "reportedSentences": 88, "clips": 15647, "splits": { "accent": {}, "age": { "": 0.18, "twenties": 0.48, "thirties": 0.07, "teens": 0.22, "fourties": 0.04, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.23, "male": 0.29, "female": 0.48, "other": 0 } }, "users": 224, "size": 491791591, "checksum": "5e2928823ada67a790da9ef88f128889f335204b83bdf9010a0516ea85503834", "avgDurationSecs": 5.431, "validDurationSecs": 47452.302, "totalHrs": 23.6, "validHrs": 13.18 }, "ia": { "buckets": { "validated": 11655, "invalidated": 336, "dev": 1803, "test": 1790, "train": 5007, "other": 2592 }, "duration": 61037436, "reportedSentences": 276, "clips": 14583, "splits": { "accent": {}, "age": { "": 0.38, "twenties": 0.05, "thirties": 0.02, "teens": 0, "fourties": 0.3, "fifties": 0.03, "sixties": 0, "seventies": 0.22, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male": 0.61, "female": 0.01, "other": 0 } }, "users": 66, "size": 413092822, "checksum": "2c6a12783d1c342def95a7ecc268f71515ebb08c9d49f8c4b3657bd92a0e2c5e", "avgDurationSecs": 4.186, "validDurationSecs": 48782.234, "totalHrs": 16.95, "validHrs": 13.55 }, "id": { "buckets": { "validated": 25962, "invalidated": 2596, "dev": 3340, "test": 3642, "train": 4969, "other": 29216 }, "duration": 229771812, "reportedSentences": 345, "clips": 57774, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.41, "thirties": 0.1, "teens": 0.22, "fourties": 0.02, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.25, "male": 0.45, "female": 0.27, "other": 0.04 } }, "users": 528, "size": 1444635876, "checksum": "2d0603d8509c779e4e6b9d7278c4c7b59671b22452bacb55c311fd2dbb270d83", "avgDurationSecs": 3.977, "validDurationSecs": 103252.947, "totalHrs": 63.82, "validHrs": 28.68 }, "ig": { "buckets": { "validated": 14, "invalidated": 3, "dev": 2, "test": 4, "train": 8, 
"other": 5770 }, "duration": 31554684, "reportedSentences": 16, "clips": 5787, "splits": { "accent": {}, "age": { "": 0.56, "twenties": 0.32, "thirties": 0.04, "teens": 0.05, "fourties": 0, "fifties": 0, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.56, "male": 0.14, "female": 0.3, "other": 0 } }, "users": 114, "size": 184217832, "checksum": "66d99e155043e0bd8040a767f7d3634ad4866e7ffe14f310cfdcfd650d702a77", "avgDurationSecs": 5.453, "validDurationSecs": 76.338, "totalHrs": 8.76, "validHrs": 0.02 }, "is": { "buckets": { "validated": 13, "invalidated": 1, "dev": 0, "test": 4, "train": 9, "other": 1 }, "duration": 110016, "reportedSentences": 0, "clips": 15, "splits": { "accent": {}, "age": { "": 0.67, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0.33, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.67, "male": 0.33, "female": 0, "other": 0 } }, "users": 2, "size": 645061, "checksum": "4550c766b4988550a1308a1491e9a8244075948be6a387b21bbfe0da9a4d71cb", "avgDurationSecs": 7.334, "validDurationSecs": 95.347, "totalHrs": 0.03, "validHrs": 0.02 }, "it": { "buckets": { "validated": 235208, "invalidated": 19512, "dev": 15141, "test": 15141, "train": 168218, "other": 24027 }, "duration": 1397747840, "reportedSentences": 5928, "clips": 278747, "splits": { "accent": {}, "age": { "": 0.27, "twenties": 0.21, "thirties": 0.16, "teens": 0.01, "fourties": 0.16, "fifties": 0.15, "sixties": 0.04, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.27, "male": 0.59, "female": 0.13, "other": 0 } }, "users": 7065, "size": 9533535151, "checksum": "4f4b804424c85b4bf73c51159ebc3f5cba72b16d7d5029f358534f222cc0825d", "avgDurationSecs": 5.014, "validDurationSecs": 1179426.053, "totalHrs": 388.26, "validHrs": 327.61 }, "ja": { "buckets": { "validated": 88020, "invalidated": 12416, "dev": 6021, "test": 6021, "train": 9435, "other": 244042 }, "duration": 1631522697, "reportedSentences": 344, 
"clips": 344478, "splits": { "accent": {}, "age": { "": 0.16, "twenties": 0.54, "thirties": 0.08, "teens": 0.07, "fourties": 0.1, "fifties": 0.04, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.17, "male": 0.43, "female": 0.39, "other": 0.01 } }, "users": 3773, "size": 9610753960, "checksum": "9c01e239beb903f56a098d45c04206486cd8de008bb187025baa495407636dbf", "avgDurationSecs": 4.736, "validDurationSecs": 416881.855, "totalHrs": 453.2, "validHrs": 115.8 }, "ka": { "buckets": { "validated": 83970, "invalidated": 3442, "dev": 11857, "test": 12017, "train": 43420, "other": 27678 }, "duration": 581233248, "reportedSentences": 700, "clips": 115090, "splits": { "accent": {}, "age": { "": 0.2, "twenties": 0.19, "thirties": 0.47, "teens": 0.02, "fourties": 0.04, "fifties": 0.05, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2, "male": 0.24, "female": 0.56, "other": 0 } }, "users": 1308, "size": 3399117971, "checksum": "a83818f08a14f272f67880a316c59d22c93a1ff428de3ef24cf17891ae2a1f36", "avgDurationSecs": 5.05, "validDurationSecs": 424069.475, "totalHrs": 161.45, "validHrs": 117.79 }, "kab": { "buckets": { "validated": 610054, "invalidated": 19704, "dev": 14998, "test": 14998, "train": 151949, "other": 124904 }, "duration": 2479725556, "reportedSentences": 9460, "clips": 754662, "splits": { "accent": {}, "age": { "": 0.28, "twenties": 0.12, "thirties": 0.29, "teens": 0, "fourties": 0.09, "fifties": 0.18, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male": 0.53, "female": 0.21, "other": 0 } }, "users": 1546, "size": 18490533578, "checksum": "205dbe17661a7337bf0ca240f9dd2b5fbfda2d58d9ee1502732f78519bf73d6c", "avgDurationSecs": 3.286, "validDurationSecs": 2004561.637, "totalHrs": 688.81, "validHrs": 556.82 }, "kk": { "buckets": { "validated": 1541, "invalidated": 225, "dev": 493, "test": 497, "train": 545, "other": 690 }, "duration": 12029184, "reportedSentences": 
42, "clips": 2456, "splits": { "accent": {}, "age": { "": 0.53, "twenties": 0.25, "thirties": 0.03, "teens": 0.06, "fourties": 0.08, "fifties": 0.05, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.54, "male": 0.35, "female": 0.12, "other": 0 } }, "users": 156, "size": 68338842, "checksum": "732841bfcce8c24cdb4c8d123bbda8507611659c30bf240c8c0a7499966b152e", "avgDurationSecs": 4.898, "validDurationSecs": 7547.627, "totalHrs": 3.34, "validHrs": 2.09 }, "kmr": { "buckets": { "validated": 52928, "invalidated": 2400, "dev": 3516, "test": 3520, "train": 4470, "other": 32549 }, "duration": 353791656, "reportedSentences": 1034, "clips": 87877, "splits": { "accent": {}, "age": { "": 0.49, "twenties": 0.35, "thirties": 0.08, "teens": 0.01, "fourties": 0.03, "fifties": 0.04, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.48, "male": 0.42, "female": 0.1, "other": 0 } }, "users": 553, "size": 1866781902, "checksum": "69e82368ca97e280bd3818edb86fc4599947b62844f917f4a8708b1d08449c32", "avgDurationSecs": 4.026, "validDurationSecs": 213087.438, "totalHrs": 98.27, "validHrs": 59.19 }, "ko": { "buckets": { "validated": 891, "invalidated": 232, "dev": 227, "test": 271, "train": 392, "other": 1865 }, "duration": 17813484, "reportedSentences": 18, "clips": 2988, "splits": { "accent": {}, "age": { "": 0.14, "twenties": 0.41, "thirties": 0.34, "teens": 0.04, "fourties": 0.08, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.14, "male": 0.65, "female": 0.21, "other": 0 } }, "users": 74, "size": 103465677, "checksum": "01d219e9251fbe4e635767a7f52acb0820b1a8e6f3b86b71dc242321697f2b94", "avgDurationSecs": 5.962, "validDurationSecs": 5311.852, "totalHrs": 4.94, "validHrs": 1.47 }, "ky": { "buckets": { "validated": 30200, "invalidated": 5633, "dev": 1613, "test": 1611, "train": 1790, "other": 1405 }, "duration": 165195912, "reportedSentences": 39, "clips": 37238, "splits": { "accent": 
{}, "age": { "": 0.07, "twenties": 0.67, "thirties": 0.08, "teens": 0.18, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.11, "male": 0.54, "female": 0.35, "other": 0 } }, "users": 273, "size": 1064804708, "checksum": "a4b8555ccd802d0363ded7142b3ce0005a3bdacaaf3be2f28bb7247180d47800", "avgDurationSecs": 4.436, "validDurationSecs": 133973.805, "totalHrs": 45.88, "validHrs": 37.21 }, "lg": { "buckets": { "validated": 271803, "invalidated": 39183, "dev": 13439, "test": 13473, "train": 70846, "other": 50102 }, "duration": 2012482071, "reportedSentences": 6451, "clips": 361088, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.4, "thirties": 0.21, "teens": 0.01, "fourties": 0.07, "fifties": 0.04, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.25, "male": 0.38, "female": 0.37, "other": 0 } }, "users": 659, "size": 11786069390, "checksum": "5041e3db16b88c9740a6f11ed8ba8cdd2199858602680b2aa04267485e069aec", "avgDurationSecs": 5.573, "validDurationSecs": 1514862.483, "totalHrs": 559.02, "validHrs": 420.79 }, "lij": { "buckets": { "validated": 2211, "invalidated": 309, "dev": 261, "test": 625, "train": 1323, "other": 172 }, "duration": 9635688, "reportedSentences": 27, "clips": 2692, "splits": { "accent": {}, "age": { "": 0.09, "twenties": 0.13, "thirties": 0.68, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0.09, "eighties": 0, "nineties": 0 }, "gender": { "": 0.09, "male": 0.68, "female": 0.23, "other": 0 } }, "users": 12, "size": 52590025, "checksum": "e571e1b293e021fb152388ad3ca797f98d94b2bd41e93cd673423b25b723fffc", "avgDurationSecs": 3.579, "validDurationSecs": 7914.007, "totalHrs": 2.67, "validHrs": 2.19 }, "lo": { "buckets": { "validated": 104, "invalidated": 6, "dev": 28, "test": 21, "train": 55, "other": 96 }, "duration": 1234116, "reportedSentences": 0, "clips": 206, "splits": { "accent": {}, "age": { "": 0.06, "twenties": 0.33, 
"thirties": 0.6, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.06, "male": 0.94, "female": 0, "other": 0 } }, "users": 10, "size": 7145225, "checksum": "802adaee6beccca4d80b1690e1264823db8ea0905ca128ebcb301f00e78e8b25", "avgDurationSecs": 5.991, "validDurationSecs": 623.049, "totalHrs": 0.34, "validHrs": 0.17 }, "lt": { "buckets": { "validated": 16420, "invalidated": 818, "dev": 4336, "test": 4760, "train": 7124, "other": 511 }, "duration": 90629648, "reportedSentences": 189, "clips": 17749, "splits": { "accent": {}, "age": { "": 0.21, "twenties": 0.36, "thirties": 0.31, "teens": 0.03, "fourties": 0.04, "fifties": 0.05, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.21, "male": 0.57, "female": 0.22, "other": 0 } }, "users": 289, "size": 552283074, "checksum": "d36297b821cdede07ab04f2fa572af4126c9034a5b0f5001910b72032e818ce4", "avgDurationSecs": 5.106, "validDurationSecs": 83843.53, "totalHrs": 25.17, "validHrs": 23.28 }, "ltg": { "buckets": { "validated": 7471, "invalidated": 416, "dev": 1629, "test": 1660, "train": 2017, "other": 5642 }, "duration": 52708572, "reportedSentences": 19, "clips": 13529, "splits": { "accent": {}, "age": { "": 0.32, "twenties": 0.11, "thirties": 0.19, "teens": 0, "fourties": 0.37, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.32, "male": 0.05, "female": 0.63, "other": 0 } }, "users": 169, "size": 300006549, "checksum": "afb0f301c89ab9368bebef1a17510e41370ed6d1c2aca8de8e747b0d1baf29a7", "avgDurationSecs": 3.896, "validDurationSecs": 29106.788, "totalHrs": 14.64, "validHrs": 8.08 }, "lv": { "buckets": { "validated": 132000, "invalidated": 6813, "dev": 4955, "test": 4955, "train": 7057, "other": 58783 }, "duration": 883642481, "reportedSentences": 3056, "clips": 197596, "splits": { "accent": {}, "age": { "": 0.44, "twenties": 0.06, "thirties": 0.29, "teens": 0.01, "fourties": 0.09, 
"fifties": 0.09, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.44, "male": 0.26, "female": 0.31, "other": 0 } }, "users": 5218, "size": 4901498162, "checksum": "f61a385159ff5c48abe181fbc7f4a6ecb1c6ced43a18169152a6bddd108e5739", "avgDurationSecs": 4.472, "validDurationSecs": 590299.437, "totalHrs": 245.45, "validHrs": 163.97 }, "mdf": { "buckets": { "validated": 336, "invalidated": 9, "dev": 54, "test": 107, "train": 175, "other": 1 }, "duration": 1811340, "reportedSentences": 15, "clips": 346, "splits": { "accent": {}, "age": { "": 0.58, "twenties": 0.02, "thirties": 0, "teens": 0, "fourties": 0.34, "fifties": 0, "sixties": 0.06, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.58, "male": 0.08, "female": 0.34, "other": 0 } }, "users": 11, "size": 10640352, "checksum": "d0f5564500df035dbcfddaca1bd378ae3b2c58b78595b6cf19fcb579f9c7ee01", "avgDurationSecs": 5.235, "validDurationSecs": 1758.989, "totalHrs": 0.5, "validHrs": 0.48 }, "mhr": { "buckets": { "validated": 205571, "invalidated": 11581, "dev": 14702, "test": 15067, "train": 173491, "other": 1490 }, "duration": 1008639540, "reportedSentences": 100, "clips": 218642, "splits": { "accent": {}, "age": { "": 0.13, "twenties": 0.32, "thirties": 0.22, "teens": 0.12, "fourties": 0.13, "fifties": 0.04, "sixties": 0.03, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.12, "male": 0.2, "female": 0.67, "other": 0 } }, "users": 474, "size": 5647099642, "checksum": "6c81af4632f7bc50201eed6ddeee0ff3dea050378e0503bd1f2644208e75956a", "avgDurationSecs": 4.613, "validDurationSecs": 948340.387, "totalHrs": 280.17, "validHrs": 263.42 }, "mk": { "buckets": { "validated": 198, "invalidated": 16, "dev": 27, "test": 50, "train": 121, "other": 0 }, "duration": 1132524, "reportedSentences": 9, "clips": 214, "splits": { "accent": {}, "age": { "": 0.35, "twenties": 0.16, "thirties": 0.42, "teens": 0.07, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, 
"eighties": 0, "nineties": 0 }, "gender": { "": 0.35, "male": 0.65, "female": 0, "other": 0 } }, "users": 8, "size": 6664827, "checksum": "d093d60c634b6afbd9793ff15dc21f1397328e72cff851d75697045e60a513a8", "avgDurationSecs": 5.292, "validDurationSecs": 1047.849, "totalHrs": 0.31, "validHrs": 0.29 }, "ml": { "buckets": { "validated": 2832, "invalidated": 105, "dev": 712, "test": 663, "train": 1256, "other": 5571 }, "duration": 35543052, "reportedSentences": 192, "clips": 8508, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0.57, "thirties": 0.09, "teens": 0, "fourties": 0.12, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.22, "male": 0.52, "female": 0.21, "other": 0.05 } }, "users": 131, "size": 199432768, "checksum": "dd888c487089b950c56531d3381b9a614759b372651baad0864aace7fdd8763a", "avgDurationSecs": 4.178, "validDurationSecs": 11830.974, "totalHrs": 9.87, "validHrs": 3.28 }, "mn": { "buckets": { "validated": 8616, "invalidated": 806, "dev": 1831, "test": 1893, "train": 2210, "other": 4758 }, "duration": 76987844, "reportedSentences": 34, "clips": 14180, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.44, "thirties": 0.22, "teens": 0.02, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0.01, "nineties": 0.05 }, "gender": { "": 0.27, "male": 0.39, "female": 0.29, "other": 0.06 } }, "users": 522, "size": 569078810, "checksum": "1498c94142ddd270f4ce2bdae512dfe31bb1d2a8a56cfd90cfdc1ee97dc335d7", "avgDurationSecs": 5.429, "validDurationSecs": 46779.074, "totalHrs": 21.38, "validHrs": 12.99 }, "mr": { "buckets": { "validated": 10880, "invalidated": 2262, "dev": 1783, "test": 1745, "train": 2217, "other": 2824 }, "duration": 98891496, "reportedSentences": 59, "clips": 15966, "splits": { "accent": {}, "age": { "": 0.05, "twenties": 0.29, "thirties": 0.13, "teens": 0.53, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 
0.05, "male": 0.19, "female": 0.76, "other": 0 } }, "users": 89, "size": 579438614, "checksum": "dc84fae317848aeb0fba1de5ce98cc1fa0e2ffca3979a9da443ab29c131a19b9", "avgDurationSecs": 6.194, "validDurationSecs": 67389.42, "totalHrs": 27.46, "validHrs": 18.71 }, "mrj": { "buckets": { "validated": 28613, "invalidated": 994, "dev": 7314, "test": 6804, "train": 14325, "other": 814 }, "duration": 126320148, "reportedSentences": 50, "clips": 30421, "splits": { "accent": {}, "age": { "": 0.1, "twenties": 0.34, "thirties": 0.23, "teens": 0.05, "fourties": 0.13, "fifties": 0.12, "sixties": 0.04, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.1, "male": 0.25, "female": 0.65, "other": 0 } }, "users": 58, "size": 702829853, "checksum": "e17a5a9cc476e78fca6b9146f5e2b51a435dc7e3192861aa9adb398a852dc635", "avgDurationSecs": 4.152, "validDurationSecs": 118812.61, "totalHrs": 35.08, "validHrs": 33 }, "mt": { "buckets": { "validated": 6434, "invalidated": 324, "dev": 1586, "test": 1659, "train": 1936, "other": 6320 }, "duration": 61963812, "reportedSentences": 10, "clips": 13078, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.19, "thirties": 0.09, "teens": 0.03, "fourties": 0.17, "fifties": 0.25, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male": 0.25, "female": 0.47, "other": 0.01 } }, "users": 218, "size": 459746480, "checksum": "9b41d39dbd48f9a6bd9a7d109367c62ed51468ec81246c37e6f2c1f5c6c1ae61", "avgDurationSecs": 4.738, "validDurationSecs": 30484.414, "totalHrs": 17.21, "validHrs": 8.46 }, "myv": { "buckets": { "validated": 1971, "invalidated": 23, "dev": 239, "test": 481, "train": 1241, "other": 4 }, "duration": 11519388, "reportedSentences": 31, "clips": 1998, "splits": { "accent": {}, "age": { "": 0.38, "twenties": 0.09, "thirties": 0.24, "teens": 0.01, "fourties": 0.02, "fifties": 0, "sixties": 0.26, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male": 0.54, "female": 0.07, 
"other": 0 } }, "users": 13, "size": 67161204, "checksum": "8edc0e9a5ee2d02591dab6c63c9501ae5b3c98de38f9e14a3013c1cefde3abfb", "avgDurationSecs": 5.765, "validDurationSecs": 11363.721, "totalHrs": 3.19, "validHrs": 3.15 }, "nan-tw": { "buckets": { "validated": 7623, "invalidated": 505, "dev": 2345, "test": 2249, "train": 2897, "other": 19689 }, "duration": 71922744, "reportedSentences": 214, "clips": 27817, "splits": { "accent": {}, "age": { "": 0.12, "twenties": 0.54, "thirties": 0.16, "teens": 0.01, "fourties": 0.17, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.12, "male": 0.64, "female": 0.12, "other": 0.13 } }, "users": 209, "size": 405949448, "checksum": "e959cdab92843b162761c0216e7a4ff318de268e32fd45b327a0e5d37acff31a", "avgDurationSecs": 2.586, "validDurationSecs": 19709.785, "totalHrs": 19.97, "validHrs": 5.47 }, "ne-NP": { "buckets": { "validated": 714, "invalidated": 60, "dev": 205, "test": 193, "train": 283, "other": 478 }, "duration": 4879440, "reportedSentences": 5, "clips": 1252, "splits": { "accent": {}, "age": { "": 0.27, "twenties": 0.6, "thirties": 0.13, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male": 0.74, "female": 0, "other": 0 } }, "users": 26, "size": 28431931, "checksum": "e97f5d3e09741117d3897bb44e402d1a922e285fd02b76f9c3f570fb9fe3aa36", "avgDurationSecs": 3.897, "validDurationSecs": 2782.684, "totalHrs": 1.35, "validHrs": 0.77 }, "nhi": { "buckets": { "validated": 14, "invalidated": 0, "dev": 0, "test": 0, "train": 14, "other": 11 }, "duration": 117972, "reportedSentences": 0, "clips": 25, "splits": { "accent": {}, "age": { "": 0.2, "twenties": 0, "thirties": 0.8, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2, "male": 0.8, "female": 0, "other": 0 } }, "users": 2, "size": 694291, "checksum": 
"bba2b79e9be9ef5723f9ecfbc8ecd2dae0d80dadb3ddcf807355d11a9a20c2d4", "avgDurationSecs": 4.719, "validDurationSecs": 66.064, "totalHrs": 0.03, "validHrs": 0.01 }, "nl": { "buckets": { "validated": 90004, "invalidated": 5603, "dev": 11215, "test": 11227, "train": 34531, "other": 2514 }, "duration": 426106909, "reportedSentences": 339, "clips": 98121, "splits": { "accent": {}, "age": { "": 0.41, "twenties": 0.21, "thirties": 0.12, "teens": 0.02, "fourties": 0.15, "fifties": 0.08, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.41, "male": 0.48, "female": 0.11, "other": 0 } }, "users": 1672, "size": 2894246867, "checksum": "4a7bce5c16c2610c24df45e79a0fa3dbb4b66a465931a2f3012b803e1bfee189", "avgDurationSecs": 4.343, "validDurationSecs": 390857.474, "totalHrs": 118.36, "validHrs": 108.57 }, "nn-NO": { "buckets": { "validated": 1033, "invalidated": 63, "dev": 245, "test": 322, "train": 461, "other": 231 }, "duration": 5786892, "reportedSentences": 23, "clips": 1327, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0.29, "thirties": 0.42, "teens": 0.02, "fourties": 0.03, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.23, "male": 0.6, "female": 0.14, "other": 0.03 } }, "users": 35, "size": 33244917, "checksum": "25b3dcb40d78b4ae68eac3efd0127fcecb9ce3dab2686ccaa3f27cfbb8b9ef7c", "avgDurationSecs": 4.361, "validDurationSecs": 4504.792, "totalHrs": 1.6, "validHrs": 1.25 }, "oc": { "buckets": { "validated": 1269, "invalidated": 156, "dev": 215, "test": 226, "train": 249, "other": 8266 }, "duration": 45233424, "reportedSentences": 10, "clips": 9691, "splits": { "accent": {}, "age": { "": 0.49, "twenties": 0.02, "thirties": 0.04, "teens": 0.01, "fourties": 0.17, "fifties": 0.16, "sixties": 0.12, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.5, "male": 0.09, "female": 0.41, "other": 0 } }, "users": 146, "size": 256246107, "checksum": 
"d0e482f786b008c806bf2299891c689965c546e50e418386fb7f9fda9bf2e5ac", "avgDurationSecs": 4.668, "validDurationSecs": 5923.147, "totalHrs": 12.56, "validHrs": 1.64 }, "or": { "buckets": { "validated": 3158, "invalidated": 276, "dev": 0, "test": 696, "train": 2048, "other": 5612 }, "duration": 44718252, "reportedSentences": 31, "clips": 9046, "splits": { "accent": {}, "age": { "": 0.1, "twenties": 0.17, "thirties": 0.7, "teens": 0, "fourties": 0.03, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.09, "male": 0.87, "female": 0.04, "other": 0 } }, "users": 126, "size": 309408322, "checksum": "85abd55f5c0dda07e868f987fcb914136699fcc18dd1230f3be85103e219fb18", "avgDurationSecs": 4.943, "validDurationSecs": 15611.346, "totalHrs": 12.42, "validHrs": 4.33 }, "os": { "buckets": { "validated": 134, "invalidated": 27, "dev": 40, "test": 33, "train": 61, "other": 29 }, "duration": 1057932, "reportedSentences": 1, "clips": 190, "splits": { "accent": {}, "age": { "": 0.39, "twenties": 0, "thirties": 0, "teens": 0.11, "fourties": 0.5, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.39, "male": 0.61, "female": 0, "other": 0 } }, "users": 7, "size": 5894562, "checksum": "0f7d679d688ac8a33746dc0554c6bd9a302d6b9db97bf8c9ba5335ca65ce89a0", "avgDurationSecs": 5.568, "validDurationSecs": 746.12, "totalHrs": 0.29, "validHrs": 0.2 }, "pa-IN": { "buckets": { "validated": 1504, "invalidated": 81, "dev": 286, "test": 486, "train": 732, "other": 1390 }, "duration": 14357366, "reportedSentences": 251, "clips": 2975, "splits": { "accent": {}, "age": { "": 0.3, "twenties": 0.23, "thirties": 0.38, "teens": 0, "fourties": 0.04, "fifties": 0.05, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male": 0.69, "female": 0.01, "other": 0 } }, "users": 68, "size": 100836119, "checksum": "172e9a84d62f91700fe046318c3c44da7ba0d2e11ec8e5b3c8a083bbfe8c584a", "avgDurationSecs": 4.826, 
"validDurationSecs": 7258.312, "totalHrs": 3.98, "validHrs": 2.01 }, "pl": { "buckets": { "validated": 131313, "invalidated": 6519, "dev": 9043, "test": 9043, "train": 19811, "other": 3812 }, "duration": 627625406, "reportedSentences": 558, "clips": 141644, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.29, "thirties": 0.32, "teens": 0.02, "fourties": 0.11, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0.01 }, "gender": { "": 0.25, "male": 0.61, "female": 0.14, "other": 0.01 } }, "users": 3267, "size": 4472538018, "checksum": "5c9327a3acf133c2a89791863735701a43d0cf390574682f2b42043731553aee", "avgDurationSecs": 4.431, "validDurationSecs": 581848.684, "totalHrs": 174.34, "validHrs": 161.62 }, "ps": { "buckets": { "validated": 1222, "invalidated": 12, "dev": 0, "test": 195, "train": 1027, "other": 35 }, "duration": 6190344, "reportedSentences": 1, "clips": 1269, "splits": { "accent": {}, "age": { "": 0.03, "twenties": 0.13, "thirties": 0.03, "teens": 0, "fourties": 0.81, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.03, "male": 0.97, "female": 0, "other": 0 } }, "users": 8, "size": 36373034, "checksum": "21251166a16e5e2a341fb94c428786501da3435b655fc50ea950ab6a1188bade", "avgDurationSecs": 4.878, "validDurationSecs": 5961.072, "totalHrs": 1.71, "validHrs": 1.65 }, "pt": { "buckets": { "validated": 150332, "invalidated": 7171, "dev": 9390, "test": 9399, "train": 21608, "other": 25162 }, "duration": 746365939, "reportedSentences": 2729, "clips": 182665, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.37, "thirties": 0.18, "teens": 0.02, "fourties": 0.13, "fifties": 0.02, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.24, "male": 0.68, "female": 0.06, "other": 0.01 } }, "users": 3349, "size": 4703875493, "checksum": "92f836083c884c6c5d82f99b966a796280ea7427b5d8ffc471e6daef4d160496", "avgDurationSecs": 4.086, "validDurationSecs": 
614253.876, "totalHrs": 207.32, "validHrs": 170.62 }, "quy": { "buckets": { "validated": 1, "invalidated": 0, "dev": 0, "test": 0, "train": 1, "other": 5 }, "duration": 36792, "reportedSentences": 0, "clips": 6, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male": 0, "female": 0, "other": 0 } }, "users": 2, "size": 217801, "checksum": "40e0c940d911bf407aecf6dabd565a42c6ea4528ffdd0dba5311114672cda16b", "avgDurationSecs": 6.132, "validDurationSecs": 6.132, "totalHrs": 0.01, "validHrs": 0 }, "rm-sursilv": { "buckets": { "validated": 4299, "invalidated": 679, "dev": 1358, "test": 1360, "train": 1568, "other": 2266 }, "duration": 39264017, "reportedSentences": 17, "clips": 7244, "splits": { "accent": {}, "age": { "": 0.63, "twenties": 0.1, "thirties": 0.03, "teens": 0.05, "fourties": 0.19, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.63, "male": 0.18, "female": 0.18, "other": 0 } }, "users": 90, "size": 295769990, "checksum": "f966e74d43e7effe5d83f17bc60f41c0988b8e02ae56c9fad1866aee8ee5c377", "avgDurationSecs": 5.42, "validDurationSecs": 23301.492, "totalHrs": 10.9, "validHrs": 6.47 }, "rm-vallader": { "buckets": { "validated": 1526, "invalidated": 394, "dev": 500, "test": 462, "train": 557, "other": 711 }, "duration": 15327554, "reportedSentences": 38, "clips": 2631, "splits": { "accent": {}, "age": { "": 0.36, "twenties": 0.14, "thirties": 0.06, "teens": 0, "fourties": 0.41, "fifties": 0, "sixties": 0.03, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.36, "male": 0.44, "female": 0.19, "other": 0.01 } }, "users": 53, "size": 116709008, "checksum": "699b6bc058b5ac981a1eda0c57f75383ec6e8a9bacacf5f9338938bbc61b650a", "avgDurationSecs": 5.826, "validDurationSecs": 8890.098, "totalHrs": 4.25, "validHrs": 2.46 }, "ro": { "buckets": { "validated": 17390, 
"invalidated": 966, "dev": 3882, "test": 3881, "train": 5149, "other": 22900 }, "duration": 165299827, "reportedSentences": 402, "clips": 41256, "splits": { "accent": {}, "age": { "": 0.15, "twenties": 0.62, "thirties": 0.13, "teens": 0.01, "fourties": 0.06, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.14, "male": 0.71, "female": 0.13, "other": 0.01 } }, "users": 416, "size": 1025766593, "checksum": "99dbcb56bc6f71d6829c7e5ed3bc63a48b8712a275ccb9d2a622d74e0d9fa028", "avgDurationSecs": 4.007, "validDurationSecs": 69676.265, "totalHrs": 45.91, "validHrs": 19.35 }, "ru": { "buckets": { "validated": 160747, "invalidated": 9899, "dev": 10197, "test": 10197, "train": 26341, "other": 13601 }, "duration": 950615172, "reportedSentences": 465, "clips": 184247, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.34, "thirties": 0.15, "teens": 0.1, "fourties": 0.14, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.24, "male": 0.6, "female": 0.16, "other": 0 } }, "users": 3133, "size": 6450439511, "checksum": "5cd3cd80d03ffdde58a9b9c34ed5204aa282061bdaf4ada695fe4d4c0032256b", "avgDurationSecs": 5.159, "validDurationSecs": 829367.843, "totalHrs": 264.05, "validHrs": 230.37 }, "rw": { "buckets": { "validated": 1438767, "invalidated": 228126, "dev": 15987, "test": 16213, "train": 1003029, "other": 49129 }, "duration": 8582349497, "reportedSentences": 630, "clips": 1716022, "splits": { "accent": {}, "age": { "": 0.05, "twenties": 0.61, "thirties": 0.12, "teens": 0.2, "fourties": 0.02, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.1, "male": 0.57, "female": 0.33, "other": 0 } }, "users": 1128, "size": 60995087487, "checksum": "9223c670a31d20f88bbb03ad48d1ea5cfcaf45cabbe2e745b46eb91b07005f2f", "avgDurationSecs": 5.001, "validDurationSecs": 7195712.665, "totalHrs": 2383.98, "validHrs": 1998.8 }, "sah": { "buckets": { "validated": 
4751, "invalidated": 149, "dev": 1279, "test": 1467, "train": 1900, "other": 1778 }, "duration": 40858536, "reportedSentences": 5, "clips": 6678, "splits": { "accent": {}, "age": { "": 0.38, "twenties": 0.2, "thirties": 0.27, "teens": 0.06, "fourties": 0.07, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male": 0.39, "female": 0.23, "other": 0 } }, "users": 99, "size": 281946449, "checksum": "92895445170d6f84aa6e280a150a56222caaa5110b3bcf0be21861107c80396d", "avgDurationSecs": 6.118, "validDurationSecs": 29068.419, "totalHrs": 11.34, "validHrs": 8.07 }, "sat": { "buckets": { "validated": 461, "invalidated": 9, "dev": 0, "test": 146, "train": 315, "other": 354 }, "duration": 3676320, "reportedSentences": 7, "clips": 824, "splits": { "accent": {}, "age": { "": 0.43, "twenties": 0.43, "thirties": 0.1, "teens": 0.01, "fourties": 0.01, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.41, "male": 0.58, "female": 0.01, "other": 0 } }, "users": 13, "size": 20285925, "checksum": "181cef4d140dfd77d72ccdc74c21fb60f5deef2bd307af31053002d305df48e9", "avgDurationSecs": 4.462, "validDurationSecs": 2056.776, "totalHrs": 1.02, "validHrs": 0.57 }, "sc": { "buckets": { "validated": 1169, "invalidated": 26, "dev": 255, "test": 218, "train": 530, "other": 335 }, "duration": 6899760, "reportedSentences": 3, "clips": 1530, "splits": { "accent": {}, "age": { "": 0.59, "twenties": 0.1, "thirties": 0.31, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.59, "male": 0.11, "female": 0.31, "other": 0 } }, "users": 14, "size": 39927565, "checksum": "8524dcd3e541886cbce984c651530d8250ba436ef8f97c70bd3f60b993bfcd7e", "avgDurationSecs": 4.51, "validDurationSecs": 5271.777, "totalHrs": 1.91, "validHrs": 1.46 }, "sk": { "buckets": { "validated": 19316, "invalidated": 825, "dev": 2633, "test": 2597, "train": 3248, "other": 6440 
}, "duration": 94837680, "reportedSentences": 66, "clips": 26581, "splits": { "accent": {}, "age": { "": 0.46, "twenties": 0.08, "thirties": 0.23, "teens": 0.07, "fourties": 0.16, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.46, "male": 0.47, "female": 0.06, "other": 0.01 } }, "users": 223, "size": 535545515, "checksum": "ead14cbb4006ea41612c7f1a1357d70bd57bd2865421ff6270f74c5628b605b7", "avgDurationSecs": 3.568, "validDurationSecs": 68917.07, "totalHrs": 26.34, "validHrs": 19.14 }, "skr": { "buckets": { "validated": 3632, "invalidated": 460, "dev": 1120, "test": 999, "train": 1501, "other": 1630 }, "duration": 23697324, "reportedSentences": 17, "clips": 5722, "splits": { "accent": {}, "age": { "": 0.36, "twenties": 0.25, "thirties": 0.01, "teens": 0, "fourties": 0, "fifties": 0.38, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.36, "male": 0.64, "female": 0, "other": 0 } }, "users": 56, "size": 135623718, "checksum": "c03e2ad9c9186ddda5592fec4ff4476905a19b2f5a7a05b12572f9e4a4426b65", "avgDurationSecs": 4.141, "validDurationSecs": 15041.713, "totalHrs": 6.58, "validHrs": 4.17 }, "sl": { "buckets": { "validated": 10715, "invalidated": 278, "dev": 1211, "test": 1238, "train": 1404, "other": 3228 }, "duration": 53100802, "reportedSentences": 48, "clips": 14221, "splits": { "accent": {}, "age": { "": 0.18, "twenties": 0.42, "thirties": 0.04, "teens": 0.06, "fourties": 0.18, "fifties": 0.06, "sixties": 0.06, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.18, "male": 0.69, "female": 0.13, "other": 0 } }, "users": 151, "size": 365015017, "checksum": "90f3302145af431285728416214c4f93d116526de6d7feabdf518ecea74bdf62", "avgDurationSecs": 3.734, "validDurationSecs": 40009.5, "totalHrs": 14.75, "validHrs": 11.11 }, "sq": { "buckets": { "validated": 1252, "invalidated": 15, "dev": 322, "test": 384, "train": 542, "other": 0 }, "duration": 5729292, "reportedSentences": 12, 
"clips": 1267, "splits": { "accent": {}, "age": { "": 0.65, "twenties": 0.07, "thirties": 0.19, "teens": 0, "fourties": 0.09, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.5, "male": 0.41, "female": 0.09, "other": 0 } }, "users": 45, "size": 32925473, "checksum": "0ede8f11bc7d9c391712bc2b2502d275e8d5de3f35c0444d35b6f32f8e620cbc", "avgDurationSecs": 4.522, "validDurationSecs": 5661.463, "totalHrs": 1.59, "validHrs": 1.57 }, "sr": { "buckets": { "validated": 5712, "invalidated": 266, "dev": 1429, "test": 1541, "train": 1879, "other": 3185 }, "duration": 23130792, "reportedSentences": 53, "clips": 9163, "splits": { "accent": {}, "age": { "": 0.2, "twenties": 0.47, "thirties": 0.15, "teens": 0, "fourties": 0.05, "fifties": 0.14, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2, "male": 0.6, "female": 0.21, "other": 0 } }, "users": 152, "size": 123860881, "checksum": "ed5b6a13e37ed376a6009a6fe08679dd6e352a03c1d5b3107613a044b320d873", "avgDurationSecs": 2.524, "validDurationSecs": 14419.195, "totalHrs": 6.42, "validHrs": 4 }, "sv-SE": { "buckets": { "validated": 40488, "invalidated": 1416, "dev": 5180, "test": 5222, "train": 7662, "other": 6871 }, "duration": 194228439, "reportedSentences": 589, "clips": 48775, "splits": { "accent": {}, "age": { "": 0.18, "twenties": 0.12, "thirties": 0.25, "teens": 0.03, "fourties": 0.38, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.19, "male": 0.48, "female": 0.33, "other": 0.01 } }, "users": 827, "size": 1226276613, "checksum": "54e05050118bc577fce8317ddabb0732fc0eb7bf33e0effe0e4c749344f01449", "avgDurationSecs": 3.982, "validDurationSecs": 161228.519, "totalHrs": 53.95, "validHrs": 44.78 }, "sw": { "buckets": { "validated": 265354, "invalidated": 79914, "dev": 12234, "test": 12231, "train": 46206, "other": 378246 }, "duration": 3889601568, "reportedSentences": 1901, "clips": 723514, "splits": { 
"accent": {}, "age": { "": 0.31, "twenties": 0.5, "thirties": 0.11, "teens": 0.01, "fourties": 0.03, "fifties": 0.04, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.29, "male": 0.35, "female": 0.37, "other": 0 } }, "users": 1436, "size": 22663878846, "checksum": "c915e358d5cce8f546feaea7cf0a4cbec7fd44d24614490fbf9fb11f620cb582", "avgDurationSecs": 5.376, "validDurationSecs": 1426539.548, "totalHrs": 1080.44, "validHrs": 396.26 }, "ta": { "buckets": { "validated": 134087, "invalidated": 5667, "dev": 11982, "test": 12086, "train": 44512, "other": 91580 }, "duration": 1418463816, "reportedSentences": 3354, "clips": 231334, "splits": { "accent": {}, "age": { "": 0.7, "twenties": 0.09, "thirties": 0.09, "teens": 0.03, "fourties": 0.03, "fifties": 0.03, "sixties": 0, "seventies": 0.02, "eighties": 0, "nineties": 0 }, "gender": { "": 0.7, "male": 0.16, "female": 0.14, "other": 0 } }, "users": 897, "size": 8491935514, "checksum": "5c9173f81474e6e487697fb95c8b1d637daeeb8090949e44108f93519f33d324", "avgDurationSecs": 6.132, "validDurationSecs": 822177.275, "totalHrs": 394.01, "validHrs": 228.38 }, "te": { "buckets": { "validated": 103, "invalidated": 0, "dev": 25, "test": 27, "train": 39, "other": 243 }, "duration": 1647972, "reportedSentences": 3, "clips": 346, "splits": { "accent": {}, "age": { "": 0.17, "twenties": 0.17, "thirties": 0.07, "teens": 0, "fourties": 0.06, "fifties": 0, "sixties": 0.52, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.17, "male": 0.83, "female": 0, "other": 0 } }, "users": 11, "size": 9609699, "checksum": "97848253c09b79badfc5d5a76e5dc737c1920fc1633c2d6b9fa17e3959938062", "avgDurationSecs": 4.763, "validDurationSecs": 490.581, "totalHrs": 0.45, "validHrs": 0.13 }, "th": { "buckets": { "validated": 146686, "invalidated": 9276, "dev": 11038, "test": 11038, "train": 32785, "other": 206732 }, "duration": 1515310068, "reportedSentences": 4234, "clips": 362694, "splits": { "accent": {}, "age": { 
"": 0.4, "twenties": 0.24, "thirties": 0.07, "teens": 0.04, "fourties": 0.04, "fifties": 0.2, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.4, "male": 0.42, "female": 0.17, "other": 0.01 } }, "users": 7924, "size": 8813115954, "checksum": "3cfc9410ad46e7ee599887da9694bcbaeea1802748c40394f5136ac0154977d8", "avgDurationSecs": 4.178, "validDurationSecs": 612843.809, "totalHrs": 420.91, "validHrs": 170.23 }, "ti": { "buckets": { "validated": 22, "invalidated": 1, "dev": 5, "test": 3, "train": 14, "other": 46 }, "duration": 378540, "reportedSentences": 3, "clips": 69, "splits": { "accent": {}, "age": { "": 0.78, "twenties": 0, "thirties": 0.14, "teens": 0, "fourties": 0.07, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.78, "male": 0.22, "female": 0, "other": 0 } }, "users": 6, "size": 2176791, "checksum": "49688db338fe2901ef2caea79c6737bdf232b91c9a0f639ea93519e4edd4e2a1", "avgDurationSecs": 5.486, "validDurationSecs": 120.694, "totalHrs": 0.1, "validHrs": 0.03 }, "tig": { "buckets": { "validated": 51, "invalidated": 12, "dev": 10, "test": 11, "train": 30, "other": 334 }, "duration": 1620288, "reportedSentences": 1, "clips": 397, "splits": { "accent": {}, "age": { "": 0.94, "twenties": 0.01, "thirties": 0.05, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.94, "male": 0.06, "female": 0, "other": 0 } }, "users": 9, "size": 9519438, "checksum": "757ce85483b3ac517347741cdea4bca9c8d6a4cf60f51668f6c0f7aaffc08974", "avgDurationSecs": 4.081, "validDurationSecs": 208.148, "totalHrs": 0.45, "validHrs": 0.05 }, "tk": { "buckets": { "validated": 1717, "invalidated": 113, "dev": 335, "test": 477, "train": 804, "other": 1648 }, "duration": 18731232, "reportedSentences": 18, "clips": 3478, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.63, "thirties": 0.11, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, 
"seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male": 0.5, "female": 0.24, "other": 0 } }, "users": 82, "size": 106231435, "checksum": "38bd83d8a5aea5bc6e1ee2490401ce4d9c38d6cf879fe99cd03cd52b9f002fa0", "avgDurationSecs": 5.386, "validDurationSecs": 9247.132, "totalHrs": 5.2, "validHrs": 2.56 }, "tok": { "buckets": { "validated": 11233, "invalidated": 317, "dev": 2109, "test": 2147, "train": 2588, "other": 3481 }, "duration": 60932340, "reportedSentences": 142, "clips": 15031, "splits": { "accent": {}, "age": { "": 0.33, "twenties": 0.21, "thirties": 0.07, "teens": 0.26, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0.12, "nineties": 0 }, "gender": { "": 0.33, "male": 0.49, "female": 0.06, "other": 0.12 } }, "users": 138, "size": 355140938, "checksum": "b6244c1af27c9802e464be8b53377222942d3f2f9a18e1e11085e8f7ad953172", "avgDurationSecs": 4.054, "validDurationSecs": 45536.09, "totalHrs": 16.92, "validHrs": 12.64 }, "tr": { "buckets": { "validated": 111192, "invalidated": 4412, "dev": 10996, "test": 11035, "train": 32802, "other": 322 }, "duration": 421520247, "reportedSentences": 396, "clips": 115926, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.24, "thirties": 0.08, "teens": 0.02, "fourties": 0.03, "fifties": 0.08, "sixties": 0.2, "seventies": 0.02, "eighties": 0.08, "nineties": 0 }, "gender": { "": 0.26, "male": 0.44, "female": 0.3, "other": 0 } }, "users": 1551, "size": 2548300519, "checksum": "b2ee0968129b2d88a63ea615abedc2de58dbcdde10b97310729b78f20fc6935d", "avgDurationSecs": 3.636, "validDurationSecs": 404306.879, "totalHrs": 117.08, "validHrs": 112.3 }, "tt": { "buckets": { "validated": 29053, "invalidated": 419, "dev": 3616, "test": 4951, "train": 9400, "other": 154 }, "duration": 111451166, "reportedSentences": 6, "clips": 29626, "splits": { "accent": {}, "age": { "": 0.21, "twenties": 0.05, "thirties": 0.72, "teens": 0, "fourties": 0, "fifties": 0.01, "sixties": 0, "seventies": 0.01, 
"eighties": 0, "nineties": 0 }, "gender": { "": 0.21, "male": 0.77, "female": 0.02, "other": 0 } }, "users": 252, "size": 820100023, "checksum": "3a5dfd30914fe5d123a7c390a3f919fd75ba02aa16fb9ce3bbb5ce256b0274ae", "avgDurationSecs": 3.762, "validDurationSecs": 109295.576, "totalHrs": 30.95, "validHrs": 30.35 }, "tw": { "buckets": { "validated": 52, "invalidated": 0, "dev": 0, "test": 3, "train": 49, "other": 185 }, "duration": 943884, "reportedSentences": 3, "clips": 237, "splits": { "accent": {}, "age": { "": 0.07, "twenties": 0.02, "thirties": 0.91, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.07, "male": 0.93, "female": 0, "other": 0 } }, "users": 5, "size": 5527532, "checksum": "596948b15566752927de29f21e8dc8035a5435dc8ec7d458e9dcf8d5392ca1cc", "avgDurationSecs": 3.983, "validDurationSecs": 207.097, "totalHrs": 0.26, "validHrs": 0.05 }, "ug": { "buckets": { "validated": 86415, "invalidated": 4933, "dev": 5953, "test": 5953, "train": 9271, "other": 18272 }, "duration": 647441316, "reportedSentences": 577, "clips": 109620, "splits": { "accent": {}, "age": { "": 0.49, "twenties": 0.11, "thirties": 0.26, "teens": 0.03, "fourties": 0.1, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.49, "male": 0.36, "female": 0.15, "other": 0 } }, "users": 1187, "size": 3715653241, "checksum": "24a393d4c3641b30a93bfc96131a7773ef13d7a89473264aa9eb58c2ad78de8c", "avgDurationSecs": 5.906, "validDurationSecs": 510387.168, "totalHrs": 179.84, "validHrs": 141.77 }, "uk": { "buckets": { "validated": 73689, "invalidated": 3130, "dev": 9786, "test": 9791, "train": 23799, "other": 5988 }, "duration": 386177244, "reportedSentences": 638, "clips": 82807, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.32, "thirties": 0.25, "teens": 0.09, "fourties": 0.1, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.24, "male": 
0.58, "female": 0.18, "other": 0 } }, "users": 1058, "size": 2539347911, "checksum": "47a3b4de88e0466e36f38e573ace4853a5374f90a80ab93dac2ef7f54cf09177", "avgDurationSecs": 4.664, "validDurationSecs": 343654.702, "totalHrs": 107.27, "validHrs": 95.45 }, "ur": { "buckets": { "validated": 53840, "invalidated": 6812, "dev": 4054, "test": 4056, "train": 5368, "other": 137003 }, "duration": 834105816, "reportedSentences": 237, "clips": 197655, "splits": { "accent": {}, "age": { "": 0.08, "twenties": 0.9, "thirties": 0, "teens": 0, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.07, "male": 0.67, "female": 0.25, "other": 0 } }, "users": 342, "size": 4849616069, "checksum": "838e861e9b54f73387ea0413db79910a56682c2668b1ee0801080be56b894d47", "avgDurationSecs": 4.22, "validDurationSecs": 227205.267, "totalHrs": 231.69, "validHrs": 63.11 }, "uz": { "buckets": { "validated": 86351, "invalidated": 13912, "dev": 12130, "test": 12338, "train": 48439, "other": 128049 }, "duration": 946998252, "reportedSentences": 1831, "clips": 228312, "splits": { "accent": {}, "age": { "": 0.41, "twenties": 0.4, "thirties": 0.01, "teens": 0.18, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.41, "male": 0.44, "female": 0.15, "other": 0 } }, "users": 2151, "size": 5281385025, "checksum": "e5765c64bddf7ebea0d44fcf40a9570e0fa9dfb3fce8e3e054bd1a43d9c87a7c", "avgDurationSecs": 4.148, "validDurationSecs": 358168.848, "totalHrs": 263.05, "validHrs": 99.49 }, "vi": { "buckets": { "validated": 4975, "invalidated": 365, "dev": 476, "test": 1326, "train": 2351, "other": 12001 }, "duration": 67351544, "reportedSentences": 187, "clips": 17341, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.2, "thirties": 0.02, "teens": 0.21, "fourties": 0.02, "fifties": 0, "sixties": 0.29, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male": 0.53, "female": 0.19, 
"other": 0.02 } }, "users": 280, "size": 390733942, "checksum": "8a092bb88926918abb37e22cad29a502152df5a63e7387109d65672a493f2106", "avgDurationSecs": 3.884, "validDurationSecs": 19322.642, "totalHrs": 18.7, "validHrs": 5.36 }, "vot": { "buckets": { "validated": 102, "invalidated": 324, "dev": 0, "test": 6, "train": 96, "other": 1 }, "duration": 1029972, "reportedSentences": 0, "clips": 427, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.74, "thirties": 0, "teens": 0.01, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.25, "male": 0.75, "female": 0, "other": 0 } }, "users": 6, "size": 7916040, "checksum": "85b9b89167e72d83f0dcf335ce59f8423aca87cd78587349e029321f0d58e200", "avgDurationSecs": 2.412, "validDurationSecs": 246.035, "totalHrs": 0.28, "validHrs": 0.06 }, "yi": { "buckets": { "validated": 43, "invalidated": 1, "dev": 0, "test": 2, "train": 41, "other": 0 }, "duration": 166356, "reportedSentences": 1, "clips": 44, "splits": { "accent": {}, "age": { "": 0.27, "twenties": 0.73, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male": 0, "female": 0, "other": 0 } }, "users": 2, "size": 966120, "checksum": "a5bfd312cc184bca1c2d25e77ac31cc71901b7548f7f9a67520c30cb742f9724", "avgDurationSecs": 3.781, "validDurationSecs": 162.575, "totalHrs": 0.04, "validHrs": 0.04 }, "yo": { "buckets": { "validated": 3075, "invalidated": 229, "dev": 863, "test": 998, "train": 1212, "other": 1169 }, "duration": 26232120, "reportedSentences": 19, "clips": 4473, "splits": { "accent": {}, "age": { "": 0.31, "twenties": 0.63, "thirties": 0.02, "teens": 0.04, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.31, "male": 0.32, "female": 0.38, "other": 0 } }, "users": 108, "size": 153133859, "checksum": "76884ad1e9e8da1e19adc013dd9be8726c9b6592923de1ac038e49b29b8ca21e", 
"avgDurationSecs": 5.865, "validDurationSecs": 18033.483, "totalHrs": 7.28, "validHrs": 5 }, "yue": { "buckets": { "validated": 21023, "invalidated": 1684, "dev": 2540, "test": 2581, "train": 3104, "other": 140241 }, "duration": 629086572, "reportedSentences": 1669, "clips": 162948, "splits": { "accent": {}, "age": { "": 0.17, "twenties": 0.32, "thirties": 0.45, "teens": 0.01, "fourties": 0.04, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.18, "male": 0.25, "female": 0.56, "other": 0.01 } }, "users": 897, "size": 3607975409, "checksum": "48ca933c5cb7ba7628c3db8a6399991313fe36f6519e7a46e1a07056dcb0d0cf", "avgDurationSecs": 3.861, "validDurationSecs": 81162.622, "totalHrs": 174.74, "validHrs": 22.54 }, "zgh": { "buckets": { "validated": 262, "invalidated": 8, "dev": 48, "test": 43, "train": 171, "other": 4 }, "duration": 905832, "reportedSentences": 0, "clips": 274, "splits": { "accent": {}, "age": { "": 0.04, "twenties": 0.32, "thirties": 0.64, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.04, "male": 0.96, "female": 0, "other": 0 } }, "users": 5, "size": 5036025, "checksum": "98d2087eac2b54ee140c76ba62f63c74c6bf1b01876f36b8108589cf7da5abf9", "avgDurationSecs": 3.306, "validDurationSecs": 866.161, "totalHrs": 0.25, "validHrs": 0.24 }, "zh-CN": { "buckets": { "validated": 184407, "invalidated": 58386, "dev": 10626, "test": 10626, "train": 29406, "other": 610981 }, "duration": 3814708416, "reportedSentences": 791, "clips": 853774, "splits": { "accent": {}, "age": { "": 0.93, "twenties": 0.04, "thirties": 0.01, "teens": 0.01, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.93, "male": 0.06, "female": 0.01, "other": 0 } }, "users": 6959, "size": 22551088947, "checksum": "7f06e6d149b191cc6a1a244faf1653cadbf30d19e5d74fdb7f2dca501096e567", "avgDurationSecs": 4.468, "validDurationSecs": 
823940.451, "totalHrs": 1059.64, "validHrs": 228.87 }, "zh-HK": { "buckets": { "validated": 92436, "invalidated": 4449, "dev": 5594, "test": 5594, "train": 8429, "other": 22185 }, "duration": 496819716, "reportedSentences": 683, "clips": 119070, "splits": { "accent": {}, "age": { "": 0.39, "twenties": 0.33, "thirties": 0.11, "teens": 0.02, "fourties": 0.13, "fifties": 0.02, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.35, "male": 0.42, "female": 0.22, "other": 0.01 } }, "users": 3016, "size": 3549532072, "checksum": "bfdc745e77c356fd8cb0ad905758c624a2d7c1f0870351d0e817aa6ba0adcbb3", "avgDurationSecs": 4.173, "validDurationSecs": 385689.319, "totalHrs": 138, "validHrs": 107.13 }, "zh-TW": { "buckets": { "validated": 82159, "invalidated": 4760, "dev": 4967, "test": 4967, "train": 7084, "other": 47504 }, "duration": 451916626, "reportedSentences": 151, "clips": 134423, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.3, "thirties": 0.19, "teens": 0.05, "fourties": 0.09, "fifties": 0.09, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male": 0.5, "female": 0.23, "other": 0.02 } }, "users": 2177, "size": 3025524565, "checksum": "9b8d69d8e01bcf605c8c9c4ba0bc63cd959f7eb5099b0cb1ac1ccd13a6d2e155", "avgDurationSecs": 3.362, "validDurationSecs": 276210.307, "totalHrs": 125.53, "validHrs": 76.72 } }, "totalDuration": 109183747623, "totalValidDurationSecs": 70824257, "totalHrs": 30328, "totalValidHrs": 19673 } ================================================ FILE: datasets/scripted-speech/cv-corpus-16.0-delta-2023-12-06.json ================================================ { "locales": { "ab": { "duration": 12852, "reportedSentences": 0, "clips": 5, "users": 1, "size": 40428, "checksum": "8f05fa81eb5722601728988c3f5f3d17f4a4199c93d4ab2cff12395683421f3e", "avgDurationSecs": 2.57, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "af": { "duration": 579960, "reportedSentences": 0, "clips": 
180, "users": 4, "size": 3387047, "checksum": "84f1b10db30093ee6ee509e55743d3d206a20d702c843dc535af4357bcdf1520", "avgDurationSecs": 3.222, "validDurationSecs": 0, "totalHrs": 0.16, "validHrs": 0 }, "am": { "duration": 1094040, "reportedSentences": 4, "clips": 172, "users": 6, "size": 6459672, "checksum": "4e5584b860ebcd86f44be3fd761fe74f71cd50f0ff5f0b3b4b5891d20c709036", "avgDurationSecs": 6.361, "validDurationSecs": 298.953, "totalHrs": 0.3, "validHrs": 0.08 }, "ar": { "duration": 5485140, "reportedSentences": 0, "clips": 1584, "users": 58, "size": 31963389, "checksum": "11d57d6505741e56923b2249fb8aaa3bf514ffbf62c9ccad7e719e76713d9171", "avgDurationSecs": 3.463, "validDurationSecs": 0, "totalHrs": 1.52, "validHrs": 0 }, "ast": { "duration": 1583460, "reportedSentences": 0, "clips": 382, "users": 12, "size": 9102966, "checksum": "744ea7c18c479ed7fd9853e20742ed024364ce6cf68d8ebca23c2aa8f52b9a12", "avgDurationSecs": 4.145, "validDurationSecs": 16.581, "totalHrs": 0.43, "validHrs": 0 }, "az": { "duration": 27000, "reportedSentences": 0, "clips": 5, "users": 1, "size": 159742, "checksum": "fb2179337ab2889ad696815a7b9fb3ce9f3a0f6084baeedd2505e8bbaf10f058", "avgDurationSecs": 5.4, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ba": { "duration": 181656, "reportedSentences": 0, "clips": 29, "users": 3, "size": 1061223, "checksum": "eb8d4a1235bf77152c0de2498528657a6df3e9b7d0c4deeb5ca905ac4422a521", "avgDurationSecs": 6.264, "validDurationSecs": 50.112, "totalHrs": 0.05, "validHrs": 0.01 }, "bas": { "duration": 98640, "reportedSentences": 0, "clips": 25, "users": 3, "size": 499430, "checksum": "cd06112bc6a5181695f214393696c17e1a61fafc46401f83642a19f2e6df1b9c", "avgDurationSecs": 3.946, "validDurationSecs": 0, "totalHrs": 0.02, "validHrs": 0 }, "be": { "duration": 223594452, "reportedSentences": 0, "clips": 47664, "users": 69, "size": 1301244316, "checksum": "53603521a45b9c17ff2c1b3c71d346b4535b68d3831d851db29869d9c67be29e", "avgDurationSecs": 4.691, 
"validDurationSecs": 222004.184, "totalHrs": 62.1, "validHrs": 61.66 }, "bg": { "duration": 1341360, "reportedSentences": 2, "clips": 234, "users": 12, "size": 7474254, "checksum": "67ef5fc1c8e146a0f7ad0a6614d3fc21c326fba8ded61582014fa626d216eaee", "avgDurationSecs": 5.732, "validDurationSecs": 5.732, "totalHrs": 0.37, "validHrs": 0 }, "bn": { "duration": 5739696, "reportedSentences": 4, "clips": 1043, "users": 22, "size": 33569824, "checksum": "b95f73a836375a3d83fa33acf25b708b6cd9667dd6c7a3e9fe1304b5fa5c87e5", "avgDurationSecs": 5.503, "validDurationSecs": 0, "totalHrs": 1.59, "validHrs": 0 }, "br": { "duration": 3620700, "reportedSentences": 0, "clips": 949, "users": 10, "size": 21052417, "checksum": "3f5e71029c4ce33bbf94b8811d21ac503d9e5c7dfcff4ab9a9ef15330d21072f", "avgDurationSecs": 3.815, "validDurationSecs": 26.707, "totalHrs": 1, "validHrs": 0 }, "ca": { "duration": 618667776, "reportedSentences": 173, "clips": 129073, "users": 562, "size": 3450517056, "checksum": "66210f7d2ae511edd117698cf0dd0d54d92ee5651a6692fb73b140369c443ded", "avgDurationSecs": 4.793, "validDurationSecs": 28207.757, "totalHrs": 171.85, "validHrs": 7.83 }, "ckb": { "duration": 2544084, "reportedSentences": 4, "clips": 575, "users": 19, "size": 14686794, "checksum": "622425eb5fc912f5937003d69abfd616f57073376468b5f434f601d03f982b5f", "avgDurationSecs": 4.424, "validDurationSecs": 0, "totalHrs": 0.7, "validHrs": 0 }, "cnh": { "duration": 1075824, "reportedSentences": 0, "clips": 385, "users": 1, "size": 6232462, "checksum": "cfed4d897f416d6619b5790681f09631a31db527663e30e3f72c836c99eeecc3", "avgDurationSecs": 2.794, "validDurationSecs": 0, "totalHrs": 0.29, "validHrs": 0 }, "cs": { "duration": 5555196, "reportedSentences": 20, "clips": 1165, "users": 31, "size": 30773311, "checksum": "4104db0e247d94ed7751e14add1f78c7a6ab8b9d511792286008edfb12f2b90b", "avgDurationSecs": 4.768, "validDurationSecs": 581.746, "totalHrs": 1.54, "validHrs": 0.16 }, "cy": { "duration": 2692620, 
"reportedSentences": 0, "clips": 451, "users": 32, "size": 15121071, "checksum": "9b5fa48e121470c5d21bdc44ca17acb2798e9f98424f4710289e246a0b4eae3a", "avgDurationSecs": 5.97, "validDurationSecs": 5.97, "totalHrs": 0.74, "validHrs": 0 }, "da": { "duration": 491868, "reportedSentences": 1, "clips": 149, "users": 12, "size": 2860518, "checksum": "e9770f70292c500487da84dd3c031cad3f7ae924beae0a8cb0eee14ad47307aa", "avgDurationSecs": 3.301, "validDurationSecs": 178.261, "totalHrs": 0.13, "validHrs": 0.04 }, "de": { "duration": 55311120, "reportedSentences": 265, "clips": 9713, "users": 332, "size": 321594121, "checksum": "5370f8c34561a6012379ff6410a824a21be4b816c3b1045aea48ec5bf8e7fe22", "avgDurationSecs": 5.695, "validDurationSecs": 35904.109, "totalHrs": 15.36, "validHrs": 9.97 }, "dv": { "duration": 610704, "reportedSentences": 0, "clips": 99, "users": 6, "size": 3601093, "checksum": "548d827b06d8db15da93eedc26e30fd86f5ea8bfbe1e243fab8e4045498d205e", "avgDurationSecs": 6.169, "validDurationSecs": 0, "totalHrs": 0.16, "validHrs": 0 }, "dyu": { "duration": 1210968, "reportedSentences": 0, "clips": 200, "users": 14, "size": 7104521, "checksum": "e060751c27c3c6750a6fb33ecef29d7db013297756218b20d87bb167de817976", "avgDurationSecs": 6.055, "validDurationSecs": 575.21, "totalHrs": 0.33, "validHrs": 0.15 }, "el": { "duration": 716652, "reportedSentences": 0, "clips": 140, "users": 12, "size": 4201025, "checksum": "099a465b84dc4356a9f3cd1f179492b58087d5c28527e2407656e34e04dd4d2c", "avgDurationSecs": 5.119, "validDurationSecs": 0, "totalHrs": 0.19, "validHrs": 0 }, "en": { "duration": 330493248, "reportedSentences": 612, "clips": 53725, "users": 1659, "size": 1899392951, "checksum": "2830362ac1358594213c34851d60eff88d15692b7308d5a35f899132ae8a0fc3", "avgDurationSecs": 6.152, "validDurationSecs": 14923.716, "totalHrs": 91.8, "validHrs": 4.14 }, "eo": { "duration": 14046084, "reportedSentences": 8, "clips": 2122, "users": 44, "size": 81313317, "checksum": 
"6d9121e8f3db682470763a6511e51075005cc9726fd7bfb4b6b1c776aa5af06f", "avgDurationSecs": 6.619, "validDurationSecs": 26.477, "totalHrs": 3.9, "validHrs": 0 }, "es": { "duration": 62050824, "reportedSentences": 198, "clips": 11823, "users": 380, "size": 361081958, "checksum": "8232558ac11021e0291d0cf7cb424b49c0918b0b63037c098931946bfa0f582e", "avgDurationSecs": 5.248, "validDurationSecs": 104.966, "totalHrs": 17.23, "validHrs": 0.02 }, "et": { "duration": 11431008, "reportedSentences": 18, "clips": 1678, "users": 50, "size": 67232683, "checksum": "8da88dd23a0e1c8e1d49a96b93d72a9d452dea550d8b56bd11fd9d163997b6c9", "avgDurationSecs": 6.812, "validDurationSecs": 8195.174, "totalHrs": 3.17, "validHrs": 2.27 }, "eu": { "duration": 1527985188, "reportedSentences": 820, "clips": 276658, "users": 7761, "size": 8722515283, "checksum": "0047a4bf9fd0d17453b8cf6763519d895da34532a01686913ceff0593b264c4a", "avgDurationSecs": 5.523, "validDurationSecs": 243675.247, "totalHrs": 424.44, "validHrs": 67.68 }, "fa": { "duration": 16247952, "reportedSentences": 69, "clips": 4549, "users": 61, "size": 94286467, "checksum": "bd6a300702c318d099762a9d60b0288c66ebd399b0e92314935f7738ddc69598", "avgDurationSecs": 3.572, "validDurationSecs": 25.002, "totalHrs": 4.51, "validHrs": 0 }, "fi": { "duration": 1306872, "reportedSentences": 11, "clips": 238, "users": 4, "size": 7697196, "checksum": "adcbbafe4390637acedb9dd463adde8d869b066fb75c03146061bfac3113560d", "avgDurationSecs": 5.491, "validDurationSecs": 0, "totalHrs": 0.36, "validHrs": 0 }, "fr": { "duration": 102597372, "reportedSentences": 80, "clips": 19907, "users": 615, "size": 598073983, "checksum": "48a20803bb9f200111d3d009615b0dc2ff3e7f31c62362d933720fd025a2c960", "avgDurationSecs": 5.154, "validDurationSecs": 9225.363, "totalHrs": 28.49, "validHrs": 2.56 }, "fy-NL": { "duration": 2852604, "reportedSentences": 0, "clips": 526, "users": 17, "size": 14091775, "checksum": "84610262e3302a53aa8d911e635eaeb9e22beffb49204b0ca76bc77994f1a9c7", 
"avgDurationSecs": 5.423, "validDurationSecs": 0, "totalHrs": 0.79, "validHrs": 0 }, "ga-IE": { "duration": 185580, "reportedSentences": 0, "clips": 36, "users": 3, "size": 1090116, "checksum": "c904e30ed6efb78840e396f4b31c139c4fa8de40514ee5867cdbfc5cba840938", "avgDurationSecs": 5.155, "validDurationSecs": 0, "totalHrs": 0.05, "validHrs": 0 }, "gl": { "duration": 11017116, "reportedSentences": 4, "clips": 2055, "users": 28, "size": 64504690, "checksum": "1f69fab5d7708e47ff1fe390b776ccedd4249236b94bb6b470a2c0f07baaae30", "avgDurationSecs": 5.361, "validDurationSecs": 37.528, "totalHrs": 3.06, "validHrs": 0.01 }, "gn": { "duration": 9965664, "reportedSentences": 0, "clips": 2058, "users": 26, "size": 55399202, "checksum": "08823c08c8b32b0788a38da2c884f8879855eae77045b67d82adbc3ce06e8575", "avgDurationSecs": 4.842, "validDurationSecs": 0, "totalHrs": 2.76, "validHrs": 0 }, "ha": { "duration": 20880, "reportedSentences": 0, "clips": 5, "users": 1, "size": 123016, "checksum": "32eb61c3f9b60dd55b47e19b4a09db1e77017c94b0d67dbb26fca079a7706d88", "avgDurationSecs": 4.176, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "he": { "duration": 9460620, "reportedSentences": 9, "clips": 2192, "users": 9, "size": 54404360, "checksum": "9407ee60cdaf20ce1bed6aa6c80c8b6ac6b361de99bf55fe8696abdd416cc991", "avgDurationSecs": 4.316, "validDurationSecs": 2442.843, "totalHrs": 2.62, "validHrs": 0.67 }, "hi": { "duration": 285840, "reportedSentences": 2, "clips": 46, "users": 7, "size": 1618749, "checksum": "bab1847171f2fd7bbdad60beae2a63a7b2feb3153d805c4f2f7853d1d71c9ef6", "avgDurationSecs": 6.214, "validDurationSecs": 0, "totalHrs": 0.07, "validHrs": 0 }, "hu": { "duration": 24088212, "reportedSentences": 30, "clips": 4128, "users": 29, "size": 140967683, "checksum": "9ab33a49266ba6ad91a2c472f96d955b3f37c5bef3b438ea3024ee4d3f163ae7", "avgDurationSecs": 5.835, "validDurationSecs": 23.341, "totalHrs": 6.69, "validHrs": 0 }, "hy-AM": { "duration": 67842324, "reportedSentences": 43, 
"clips": 12835, "users": 152, "size": 390957136, "checksum": "18ee0777de60d3c2c52ff509ce7d76cd3791a70de1cfc01acba79c6cdaeae6d9", "avgDurationSecs": 5.286, "validDurationSecs": 28442.505, "totalHrs": 18.84, "validHrs": 7.9 }, "ia": { "duration": 93492, "reportedSentences": 0, "clips": 18, "users": 2, "size": 548921, "checksum": "ba73bb417c4c1cc4ae07dce555b88144a090a2b37c5181c37ba67de4af142773", "avgDurationSecs": 5.194, "validDurationSecs": 0, "totalHrs": 0.02, "validHrs": 0 }, "id": { "duration": 229824, "reportedSentences": 0, "clips": 50, "users": 8, "size": 1331302, "checksum": "39e5a2410b1a17657afc4437fa136ec73ffddf12a520c8046318ab54f9176153", "avgDurationSecs": 4.596, "validDurationSecs": 0, "totalHrs": 0.06, "validHrs": 0 }, "ig": { "duration": 19080, "reportedSentences": 0, "clips": 5, "users": 1, "size": 112084, "checksum": "1c272c919031cd183d8c83eb09e7bf79b05698446ea3f057572e9a904a49a336", "avgDurationSecs": 3.816, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "it": { "duration": 19022112, "reportedSentences": 12, "clips": 3296, "users": 72, "size": 110586706, "checksum": "e28bb457e194821e006060e5a8bc3f3bd1a241442dbe3a9aa2a6388995210e26", "avgDurationSecs": 5.771, "validDurationSecs": 3630.13, "totalHrs": 5.28, "validHrs": 1 }, "ja": { "duration": 819216936, "reportedSentences": 143, "clips": 167509, "users": 2064, "size": 4810720773, "checksum": "90153ebf02650085c796aca30a4dd71d0c5db5e35b15df7d7b5e100b20bde343", "avgDurationSecs": 4.891, "validDurationSecs": 39921.842, "totalHrs": 227.56, "validHrs": 11.08 }, "ka": { "duration": 29116224, "reportedSentences": 29, "clips": 5762, "users": 56, "size": 170123057, "checksum": "23f01d4431b9b55fe6fed42f286a7be369792db3831de1eb7e91799c61e38b08", "avgDurationSecs": 5.053, "validDurationSecs": 1803.973, "totalHrs": 8.08, "validHrs": 0.5 }, "kab": { "duration": 1989504, "reportedSentences": 41, "clips": 473, "users": 8, "size": 10648823, "checksum": 
"14f8f8c28fa946d488dcf712fa1ac6c3930f7bac42251a3c494966bc36d852d5", "avgDurationSecs": 4.206, "validDurationSecs": 4.206, "totalHrs": 0.55, "validHrs": 0 }, "kk": { "duration": 4955040, "reportedSentences": 0, "clips": 1019, "users": 64, "size": 27686815, "checksum": "7e06623f518499177f7ceada388d5daabb7a5d5058d9a65623820fd1fb018118", "avgDurationSecs": 4.863, "validDurationSecs": 1468.52, "totalHrs": 1.37, "validHrs": 0.4 }, "kmr": { "duration": 12780, "reportedSentences": 0, "clips": 2, "users": 1, "size": 76184, "checksum": "0c4c96a2f570f817adbf94a96c0a27b174d393ee4cbae29d4c12bd0bbcc34dfb", "avgDurationSecs": 6.39, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ko": { "duration": 2279124, "reportedSentences": 2, "clips": 448, "users": 14, "size": 13033748, "checksum": "d7f6de9d926272d3ff8aa4f24064652e91f0a656b772a493deab4155c70f7add", "avgDurationSecs": 5.087, "validDurationSecs": 25.437, "totalHrs": 0.63, "validHrs": 0 }, "ky": { "duration": 10188, "reportedSentences": 0, "clips": 3, "users": 3, "size": 50025, "checksum": "795137f59455ad00e8a88336318e5da344577fd1e9150be7d6ec020171941ec3", "avgDurationSecs": 3.396, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "lg": { "duration": 559008, "reportedSentences": 0, "clips": 187, "users": 5, "size": 3266224, "checksum": "e095947b28027a8d6799066395c5e92aced891d98f7a60cbd30607658b50911c", "avgDurationSecs": 2.989, "validDurationSecs": 0, "totalHrs": 0.15, "validHrs": 0 }, "lij": { "duration": 9635688, "reportedSentences": 27, "clips": 2692, "users": 12, "size": 52542008, "checksum": "1c4f8189a431f108a7af09cc307c6c4f874e373902c62fbc2abc0b64c78cf0b2", "avgDurationSecs": 3.579, "validDurationSecs": 7892.53, "totalHrs": 2.67, "validHrs": 2.19 }, "lo": { "duration": 9324, "reportedSentences": 0, "clips": 2, "users": 1, "size": 56045, "checksum": "33c8cfd9a0fee0aba97760a689f68cf11715507c36d376f74d1545c9be754b77", "avgDurationSecs": 4.662, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "lt": { 
"duration": 222228, "reportedSentences": 0, "clips": 49, "users": 2, "size": 1241217, "checksum": "1ce8fc978d1e458d458fdda3eeb79b0fffefa79209a1989b43f9ffadf087cd9b", "avgDurationSecs": 4.535, "validDurationSecs": 0, "totalHrs": 0.06, "validHrs": 0 }, "ltg": { "duration": 52708572, "reportedSentences": 19, "clips": 13529, "users": 169, "size": 299878065, "checksum": "83477717791aaa1aa6bca19ba896ee9e2fcbf867f36f2688733980dda7453e37", "avgDurationSecs": 3.896, "validDurationSecs": 29091.205, "totalHrs": 14.64, "validHrs": 8.08 }, "lv": { "duration": 290993724, "reportedSentences": 174, "clips": 46981, "users": 2464, "size": 1655566673, "checksum": "afac8f675192e1a9d61e45c5277ef74ae78552d438691d2163deeab1adaca029", "avgDurationSecs": 6.194, "validDurationSecs": 70994.02, "totalHrs": 80.83, "validHrs": 19.72 }, "mhr": { "duration": 115168284, "reportedSentences": 6, "clips": 25042, "users": 59, "size": 650382073, "checksum": "df9ec83de893c54f94bf9934a8ad7b6e66abbe66d918bf934b9bd203f61e2c2b", "avgDurationSecs": 4.599, "validDurationSecs": 102750.97, "totalHrs": 31.99, "validHrs": 28.54 }, "ml": { "duration": 2244888, "reportedSentences": 68, "clips": 502, "users": 5, "size": 10438491, "checksum": "ae2527c85eb4cfe86d672a286d09f0435b7df5d4bedd3b911f40700126ee85aa", "avgDurationSecs": 4.472, "validDurationSecs": 0, "totalHrs": 0.62, "validHrs": 0 }, "mn": { "duration": 775836, "reportedSentences": 2, "clips": 131, "users": 10, "size": 3907369, "checksum": "e2f0baa1ea5f62ff89ce6b32f97f61afc503a78f73847e9ca2f65bde9cdb74d5", "avgDurationSecs": 5.922, "validDurationSecs": 0, "totalHrs": 0.21, "validHrs": 0 }, "mr": { "duration": 78732, "reportedSentences": 0, "clips": 15, "users": 2, "size": 432260, "checksum": "3cfafb9f8b14569a23fed39464acff45c7fedbd8bf1257b869ad55cc9070d053", "avgDurationSecs": 5.249, "validDurationSecs": 0, "totalHrs": 0.02, "validHrs": 0 }, "mrj": { "duration": 11031048, "reportedSentences": 0, "clips": 2533, "users": 7, "size": 62600824, "checksum": 
"b967fbb65158fe9de0572c1ce6608c64c566d62fa1938659eaab2aba53bacba3", "avgDurationSecs": 4.355, "validDurationSecs": 8004.369, "totalHrs": 3.06, "validHrs": 2.22 }, "mt": { "duration": 482652, "reportedSentences": 0, "clips": 105, "users": 8, "size": 2831593, "checksum": "3ef8433b517145ce64055bc41b0845f94b6adc540196f84f9bef8d2c865d9930", "avgDurationSecs": 4.597, "validDurationSecs": 0, "totalHrs": 0.13, "validHrs": 0 }, "nan-tw": { "duration": 111960, "reportedSentences": 1, "clips": 35, "users": 3, "size": 603935, "checksum": "31537851a533998905899a8b61fb60139cdf2ba6df21259479b6de98fb9aaa20", "avgDurationSecs": 3.199, "validDurationSecs": 0, "totalHrs": 0.03, "validHrs": 0 }, "ne-NP": { "duration": 1248948, "reportedSentences": 2, "clips": 309, "users": 3, "size": 7291015, "checksum": "ec320545aa0d365d37c24b05aa9b09abd33cf095e63143ae220f969402d0a933", "avgDurationSecs": 4.042, "validDurationSecs": 8.084, "totalHrs": 0.34, "validHrs": 0 }, "nhi": { "duration": 117972, "reportedSentences": 0, "clips": 25, "users": 2, "size": 694221, "checksum": "f89bc4a9f8785127271dadf01f85d59a038b395e8c971e99da1cf2d1828b6bc2", "avgDurationSecs": 4.719, "validDurationSecs": 66.064, "totalHrs": 0.03, "validHrs": 0.01 }, "nl": { "duration": 2434536, "reportedSentences": 2, "clips": 493, "users": 22, "size": 14279713, "checksum": "0778e5cff11af5bb8ce09033ffccd382345bfbe297ac64edd02977f7dc8b493e", "avgDurationSecs": 4.938, "validDurationSecs": 83.95, "totalHrs": 0.67, "validHrs": 0.02 }, "nn-NO": { "duration": 778644, "reportedSentences": 0, "clips": 197, "users": 3, "size": 4540548, "checksum": "4d7ec1ac8a193aa06f926fb0cdaef4b2f1569cd79c0759eacc2da83013625f58", "avgDurationSecs": 3.953, "validDurationSecs": 150.195, "totalHrs": 0.21, "validHrs": 0.04 }, "oc": { "duration": 1164996, "reportedSentences": 0, "clips": 242, "users": 4, "size": 6842601, "checksum": "3db92dc0ee259debdee76dc79d55273a7b349b91b0aa508059d587ed0446dc32", "avgDurationSecs": 4.814, "validDurationSecs": 0, "totalHrs": 
0.32, "validHrs": 0 }, "or": { "duration": 349128, "reportedSentences": 0, "clips": 59, "users": 1, "size": 2057662, "checksum": "a32a8d58d8b2b5429ded72f9ebb379456c03ea82a8cca6281c066aeec9043246", "avgDurationSecs": 5.917, "validDurationSecs": 0, "totalHrs": 0.09, "validHrs": 0 }, "os": { "duration": 1057932, "reportedSentences": 1, "clips": 190, "users": 7, "size": 5891417, "checksum": "d74210edbce9db7240a6a80983d6ea5b2e2ddc7075e6912dcce883614590430c", "avgDurationSecs": 5.568, "validDurationSecs": 723.848, "totalHrs": 0.29, "validHrs": 0.2 }, "pa-IN": { "duration": 66096, "reportedSentences": 1, "clips": 11, "users": 3, "size": 317605, "checksum": "81d7558c0313beadf0064402d361e3e9954d89f9f2012847a9eb6ec7b4d7395d", "avgDurationSecs": 6.009, "validDurationSecs": 0, "totalHrs": 0.01, "validHrs": 0 }, "pl": { "duration": 6289884, "reportedSentences": 1, "clips": 1168, "users": 39, "size": 36778027, "checksum": "e36c3da9ddb07bb497488fa934412ffeb9df18d723ec844169ba9590498fef68", "avgDurationSecs": 5.385, "validDurationSecs": 1707.1, "totalHrs": 1.74, "validHrs": 0.47 }, "ps": { "duration": 212940, "reportedSentences": 0, "clips": 35, "users": 1, "size": 1256493, "checksum": "7305eae05fcbf5cd55455e4d5ee284c7109a127a37694197d1c5dfce10500ea8", "avgDurationSecs": 6.084, "validDurationSecs": 0, "totalHrs": 0.05, "validHrs": 0 }, "pt": { "duration": 14995584, "reportedSentences": 151, "clips": 3424, "users": 100, "size": 87620695, "checksum": "934024d6938cfab3cf90f9ba4076103ee0f54a691038f371ba69ca0b079eef9d", "avgDurationSecs": 4.38, "validDurationSecs": 967.881, "totalHrs": 4.16, "validHrs": 0.26 }, "ro": { "duration": 9045612, "reportedSentences": 15, "clips": 1988, "users": 11, "size": 52974353, "checksum": "b43ff8229173396866f35d04f26079b166f7828660be6209333f32ea25820946", "avgDurationSecs": 4.55, "validDurationSecs": 268.456, "totalHrs": 2.51, "validHrs": 0.07 }, "ru": { "duration": 17772876, "reportedSentences": 19, "clips": 3333, "users": 81, "size": 104285967, 
"checksum": "02cf51b37f3276d98c11f6027dbe90a2b5dc3160abadf90236314b107aeb1377", "avgDurationSecs": 5.332, "validDurationSecs": 511.91, "totalHrs": 4.93, "validHrs": 0.14 }, "rw": { "duration": 206748, "reportedSentences": 0, "clips": 43, "users": 9, "size": 1126876, "checksum": "c2a68f3ffbfb03c064a24916c134697b63f2eb954ec9101d29ff765a1ce67a56", "avgDurationSecs": 4.808, "validDurationSecs": 0, "totalHrs": 0.05, "validHrs": 0 }, "sah": { "duration": 15149340, "reportedSentences": 2, "clips": 2401, "users": 44, "size": 88595624, "checksum": "547df8554074fa75bded7ddfa849de21785fa26598fc6b1dd7ed3a9666335370", "avgDurationSecs": 6.31, "validDurationSecs": 3091.702, "totalHrs": 4.2, "validHrs": 0.85 }, "sc": { "duration": 574128, "reportedSentences": 0, "clips": 120, "users": 3, "size": 3338405, "checksum": "fcb497e31467d9da4a8187ddd252fad2fdc6b03758d3c1a453083177d26304f1", "avgDurationSecs": 4.784, "validDurationSecs": 9.569, "totalHrs": 0.15, "validHrs": 0 }, "sk": { "duration": 461052, "reportedSentences": 3, "clips": 110, "users": 7, "size": 2701738, "checksum": "9e8cea0f311dbcd7c493caff496507d760d593ac129c882fd13720ad6f9a91b4", "avgDurationSecs": 4.191, "validDurationSecs": 0, "totalHrs": 0.12, "validHrs": 0 }, "skr": { "duration": 208404, "reportedSentences": 0, "clips": 44, "users": 4, "size": 1227235, "checksum": "c7dfd30d8f0ce76b917b854bc245fe98eef9d01f03b9f7e0b3a8a2f804f6ad16", "avgDurationSecs": 4.736, "validDurationSecs": 0, "totalHrs": 0.05, "validHrs": 0 }, "sl": { "duration": 5795892, "reportedSentences": 10, "clips": 1516, "users": 3, "size": 33818864, "checksum": "06160d1ca44e20c5b7e9e3259d1ef5a4af26cb3125b713465ca652bdd1a26632", "avgDurationSecs": 3.823, "validDurationSecs": 187.334, "totalHrs": 1.6, "validHrs": 0.05 }, "sq": { "duration": 1146996, "reportedSentences": 2, "clips": 271, "users": 9, "size": 6682721, "checksum": "24d784eab27b3084e4b8519ae32a3f308ae07009fc876dfe51e088c99c05740e", "avgDurationSecs": 4.232, "validDurationSecs": 791.47, 
"totalHrs": 0.31, "validHrs": 0.21 }, "sr": { "duration": 4138164, "reportedSentences": 23, "clips": 1201, "users": 6, "size": 23997593, "checksum": "11e91c61cd69ebd52481fb0fc209b5aca19341d19a036deef3ca5b68cbd86223", "avgDurationSecs": 3.446, "validDurationSecs": 499.612, "totalHrs": 1.14, "validHrs": 0.13 }, "sv-SE": { "duration": 1396224, "reportedSentences": 7, "clips": 274, "users": 15, "size": 8201249, "checksum": "819bd446b2e1b7e4d01bb3f8122458ebaa25fde5e0e7bc1923129fb0a49bee8c", "avgDurationSecs": 5.096, "validDurationSecs": 0, "totalHrs": 0.38, "validHrs": 0 }, "sw": { "duration": 533708604, "reportedSentences": 21, "clips": 85945, "users": 218, "size": 3093781525, "checksum": "16aac1d09915b02d82884bedefb8d69c1ec72eec980172c7b21aed39b2f1d1b4", "avgDurationSecs": 6.21, "validDurationSecs": 1446.903, "totalHrs": 148.25, "validHrs": 0.4 }, "ta": { "duration": 2669076, "reportedSentences": 0, "clips": 523, "users": 26, "size": 15643665, "checksum": "e4096d823312d3aadce987f4b056d968481bd7602e1cc141106d780daef6a581", "avgDurationSecs": 5.103, "validDurationSecs": 0, "totalHrs": 0.74, "validHrs": 0 }, "te": { "duration": 1647972, "reportedSentences": 3, "clips": 346, "users": 11, "size": 9608785, "checksum": "0c75fce65f1c0e780eacef695abeaee0e8d038c8f2aaa5507c9e89bcb61623b0", "avgDurationSecs": 4.763, "validDurationSecs": 481.055, "totalHrs": 0.45, "validHrs": 0.13 }, "th": { "duration": 3318804, "reportedSentences": 21, "clips": 755, "users": 27, "size": 19463741, "checksum": "309521efcf8b7578c0a5689a70f117f0fd5f146640186b0215dff9707a5efe6c", "avgDurationSecs": 4.396, "validDurationSecs": 0, "totalHrs": 0.92, "validHrs": 0 }, "ti": { "duration": 245628, "reportedSentences": 1, "clips": 46, "users": 2, "size": 1392640, "checksum": "31a9136c8d7d419c547c9afdca236dddcc6b1f042286a030c94353d57ea03e46", "avgDurationSecs": 5.34, "validDurationSecs": 0, "totalHrs": 0.06, "validHrs": 0 }, "tk": { "duration": 303444, "reportedSentences": 0, "clips": 55, "users": 5, "size": 
1613795, "checksum": "0d48b886365b19a143ea497ea6bc04445fecbe8edd64794e0538fd6187259194", "avgDurationSecs": 5.517, "validDurationSecs": 0, "totalHrs": 0.08, "validHrs": 0 }, "tok": { "duration": 5124744, "reportedSentences": 0, "clips": 939, "users": 6, "size": 30130469, "checksum": "802098b6fb89ae8636b44e062398a2da3fb4b575a682ab727f42f1c01f5fed15", "avgDurationSecs": 5.458, "validDurationSecs": 27.288, "totalHrs": 1.42, "validHrs": 0 }, "tr": { "duration": 8858412, "reportedSentences": 4, "clips": 1675, "users": 37, "size": 52094614, "checksum": "d090c83caf02b171782746d9dbb79f09dc66d2f0653bcbb6b799e721ffd4851e", "avgDurationSecs": 5.289, "validDurationSecs": 8567.539, "totalHrs": 2.46, "validHrs": 2.37 }, "tt": { "duration": 713700, "reportedSentences": 0, "clips": 163, "users": 6, "size": 4118878, "checksum": "0851bfbf4eceee42b272e9660284b3d275326c10957bcaafd4fa41836a1849fb", "avgDurationSecs": 4.379, "validDurationSecs": 122.599, "totalHrs": 0.19, "validHrs": 0.03 }, "ug": { "duration": 156001248, "reportedSentences": 278, "clips": 26648, "users": 117, "size": 869790305, "checksum": "97df8783aa3afe3da5d0edcfbbef3c2c3a48afe9c3cd19128a9cc258ebfbfb33", "avgDurationSecs": 5.854, "validDurationSecs": 50696.893, "totalHrs": 43.33, "validHrs": 14.08 }, "uk": { "duration": 10047492, "reportedSentences": 1, "clips": 2297, "users": 42, "size": 56721081, "checksum": "3c44f72b7b784312d8ba06cd1dc726d01ab153112cdd36dd41ac389231f9caa0", "avgDurationSecs": 4.374, "validDurationSecs": 734.862, "totalHrs": 2.79, "validHrs": 0.2 }, "ur": { "duration": 40865472, "reportedSentences": 129, "clips": 8455, "users": 37, "size": 238601255, "checksum": "1f241f168cd19d53211b9b831853f89ab0cf665800f52bf5ec5dcf4da86237fe", "avgDurationSecs": 4.833, "validDurationSecs": 343.164, "totalHrs": 11.35, "validHrs": 0.09 }, "uz": { "duration": 1215864, "reportedSentences": 4, "clips": 210, "users": 10, "size": 7078650, "checksum": "2f63a56a093bae95ac3156036e3e7c921b590ece0f39862377ddcb3152f9f572", 
"avgDurationSecs": 5.79, "validDurationSecs": 0, "totalHrs": 0.33, "validHrs": 0 }, "vi": { "duration": 534996, "reportedSentences": 0, "clips": 147, "users": 7, "size": 2665900, "checksum": "0d0cfd4386458b7a773b10c44196a32377491aaf882683c22ec49d39e2fc66dd", "avgDurationSecs": 3.639, "validDurationSecs": 0, "totalHrs": 0.14, "validHrs": 0 }, "vot": { "duration": 3996, "reportedSentences": 0, "clips": 1, "users": 1, "size": 24140, "checksum": "e89d30eb3a5f0976507638282868ac07f6b6c995cf79dd7303308f648e87b8be", "avgDurationSecs": 3.996, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "yi": { "duration": 166356, "reportedSentences": 1, "clips": 44, "users": 2, "size": 965890, "checksum": "4034e8de10eceb23348f68e95895dd286219e7aaf2c74850c4765b7a8e69e20f", "avgDurationSecs": 3.781, "validDurationSecs": 162.575, "totalHrs": 0.04, "validHrs": 0.04 }, "yo": { "duration": 51192, "reportedSentences": 1, "clips": 17, "users": 2, "size": 299276, "checksum": "44c0bcbbbff4b53965c23b48797f5de55053ccd12212f570c2b93ae51b2441bd", "avgDurationSecs": 3.011, "validDurationSecs": 0, "totalHrs": 0.01, "validHrs": 0 }, "yue": { "duration": 326469564, "reportedSentences": 510, "clips": 87569, "users": 45, "size": 1902073616, "checksum": "3a9fea3af38e4c829e7e1b0ea44618eed0f4ef8fe48d5b29ea2f637f9ffc7e3a", "avgDurationSecs": 3.728, "validDurationSecs": 193.863, "totalHrs": 90.68, "validHrs": 0.05 }, "zh-CN": { "duration": 20409876, "reportedSentences": 62, "clips": 4481, "users": 118, "size": 118497528, "checksum": "8aca2d5d77495957d32ece4801f0a1290a9ae89faf21aa36ff5107865592ab9c", "avgDurationSecs": 4.555, "validDurationSecs": 0, "totalHrs": 5.66, "validHrs": 0 }, "zh-HK": { "duration": 590724, "reportedSentences": 1, "clips": 218, "users": 13, "size": 3435143, "checksum": "6aa2a70b177ab8ff09a4145866ab30fecb89400d6f51f29b85f2c1a06ddcc93b", "avgDurationSecs": 2.71, "validDurationSecs": 130.068, "totalHrs": 0.16, "validHrs": 0.03 }, "zh-TW": { "duration": 3160260, "reportedSentences": 
6, "clips": 861, "users": 17, "size": 16384756, "checksum": "7cf1d62c0a78d15eaf01fb1764af5772d04e5abc9593c39cbb59578c70ad7393", "avgDurationSecs": 3.67, "validDurationSecs": 3.67, "totalHrs": 0.87, "validHrs": 0 } }, "totalDuration": 5685798924, "totalValidDurationSecs": 932542, "totalHrs": 1579, "totalValidHrs": 259 } ================================================ FILE: datasets/scripted-speech/cv-corpus-16.1-2023-12-06.json ================================================ { "locales": { "ab": { "buckets": { "validated": 41982, "invalidated": 5278, "dev": 9160, "test": 9116, "train": 21027, "other": 11940 }, "duration": 303854076, "reportedSentences": 225, "clips": 59200, "splits": { "accent": {}, "age": { "": 0.18, "twenties": 0.18, "thirties": 0.13, "teens": 0.28, "fourties": 0.09, "fifties": 0.06, "sixties": 0.05, "seventies": 0.01, "eighties": 0.01, "nineties": 0 }, "gender": { "": 0.18, "male": 0.18, "female": 0.64, "other": 0 } }, "users": 401, "size": 1735457708, "checksum": "75b295a55e2d1a581034d669f87504863ab36f64eac5dd4c7b954dca3ba356c7", "avgDurationSecs": 5.133, "validDurationSecs": 215479.76, "totalHrs": 84.4, "validHrs": 59.85 }, "af": { "buckets": { "validated": 88, "invalidated": 186, "dev": 5, "test": 29, "train": 54, "other": 0 }, "duration": 1231344, "reportedSentences": 0, "clips": 274, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0.64, "thirties": 0, "teens": 0.02, "fourties": 0.11, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.51, "male": 0.49, "female": 0, "other": 0 } }, "users": 14, "size": 7153846, "checksum": "dbd9f394e2ed136fff6defba793f208993123e458949e72d581f453a27c5f702", "avgDurationSecs": 4.494, "validDurationSecs": 395.468, "totalHrs": 0.34, "validHrs": 0.1 }, "am": { "buckets": { "validated": 800, "invalidated": 22, "dev": 248, "test": 162, "train": 390, "other": 401 }, "duration": 7353648, "reportedSentences": 11, "clips": 1223, "splits": { "accent": {}, "age": { "": 
0.12, "twenties": 0.12, "thirties": 0.75, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.12, "male": 0.88, "female": 0, "other": 0 } }, "users": 20, "size": 43146198, "checksum": "ad9d1a049ac9ead170b23ead481379bb685568982cbcbb176438327c1e95e4a6", "avgDurationSecs": 6.013, "validDurationSecs": 4810.236, "totalHrs": 2.04, "validHrs": 1.33 }, "ar": { "buckets": { "validated": 78015, "invalidated": 15082, "dev": 10430, "test": 10478, "train": 28375, "other": 41332 }, "duration": 559114218, "reportedSentences": 2119, "clips": 134429, "splits": { "accent": {}, "age": { "": 0.55, "twenties": 0.3, "thirties": 0.11, "teens": 0.03, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.55, "male": 0.28, "female": 0.17, "other": 0 } }, "users": 1537, "size": 3312919539, "checksum": "0b86d4202269cd49a10289148cdc1cc435c8a11577754bd61424159f96d86a49", "avgDurationSecs": 4.159, "validDurationSecs": 324478.317, "totalHrs": 155.3, "validHrs": 90.13 }, "as": { "buckets": { "validated": 1718, "invalidated": 208, "dev": 510, "test": 551, "train": 657, "other": 145 }, "duration": 11784629, "reportedSentences": 10, "clips": 2071, "splits": { "accent": {}, "age": { "": 0.6, "twenties": 0.36, "thirties": 0.04, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.59, "male": 0.41, "female": 0, "other": 0 } }, "users": 45, "size": 73637705, "checksum": "e2ce69226877a00bac054299ae89086bbc3d1d22319885b35ef7b23f0baf8384", "avgDurationSecs": 5.69, "validDurationSecs": 9775.95, "totalHrs": 3.27, "validHrs": 2.71 }, "ast": { "buckets": { "validated": 659, "invalidated": 16, "dev": 112, "test": 160, "train": 385, "other": 829 }, "duration": 6680016, "reportedSentences": 0, "clips": 1504, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.02, "thirties": 0.13, "teens": 0.04, "fourties": 0.31, 
"fifties": 0.23, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male": 0.58, "female": 0.16, "other": 0 } }, "users": 27, "size": 37776102, "checksum": "1eea6ea231bd7e605f968d94259f6764aba50c30448798535e357dc4a147b763", "avgDurationSecs": 4.442, "validDurationSecs": 2926.949, "totalHrs": 1.85, "validHrs": 0.81 }, "az": { "buckets": { "validated": 107, "invalidated": 39, "dev": 21, "test": 30, "train": 56, "other": 154 }, "duration": 1614492, "reportedSentences": 5, "clips": 300, "splits": { "accent": {}, "age": { "": 0.44, "twenties": 0.34, "thirties": 0.21, "teens": 0, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.44, "male": 0.56, "female": 0, "other": 0 } }, "users": 26, "size": 9302682, "checksum": "1a6653a0db625626e57be79f334359721a1ca93bcb054604dfcce2e38aa802a0", "avgDurationSecs": 5.382, "validDurationSecs": 575.835, "totalHrs": 0.44, "validHrs": 0.15 }, "ba": { "buckets": { "validated": 209601, "invalidated": 7942, "dev": 14566, "test": 14580, "train": 119038, "other": 23 }, "duration": 963020988, "reportedSentences": 867, "clips": 217566, "splits": { "accent": {}, "age": { "": 0.3, "twenties": 0.17, "thirties": 0.17, "teens": 0.04, "fourties": 0.06, "fifties": 0.05, "sixties": 0.19, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male": 0.3, "female": 0.4, "other": 0 } }, "users": 915, "size": 5400696124, "checksum": "4053dcdab1180c7cd76b4cb0bdcba6e5980f72bb23e2e9ea824bc36a78a86ea8", "avgDurationSecs": 4.426, "validDurationSecs": 927765.194, "totalHrs": 267.5, "validHrs": 257.71 }, "bas": { "buckets": { "validated": 1761, "invalidated": 512, "dev": 457, "test": 541, "train": 763, "other": 18 }, "duration": 10139220, "reportedSentences": 8, "clips": 2291, "splits": { "accent": {}, "age": { "": 0.98, "twenties": 0.01, "thirties": 0, "teens": 0.01, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 
0 }, "gender": { "": 0.98, "male": 0.01, "female": 0.02, "other": 0 } }, "users": 36, "size": 56344113, "checksum": "c9a63cae26034c6a7b88ca3a6af00b93761dcc4b0b830bc2b6ef079498ba434d", "avgDurationSecs": 4.426, "validDurationSecs": 7793.613, "totalHrs": 2.81, "validHrs": 2.16 }, "be": { "buckets": { "validated": 1245806, "invalidated": 35017, "dev": 15880, "test": 15880, "train": 347612, "other": 108 }, "duration": 6096786120, "reportedSentences": 3136, "clips": 1280931, "splits": { "accent": {}, "age": { "": 0.86, "twenties": 0.04, "thirties": 0.05, "teens": 0.01, "fourties": 0.04, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.86, "male": 0.06, "female": 0.08, "other": 0 } }, "users": 8231, "size": 34608170251, "checksum": "bb3400374ced50d31e80fc60b519626b7318945429a3ce2f37c9de8c1c396d06", "avgDurationSecs": 4.76, "validDurationSecs": 5929603.335, "totalHrs": 1693.55, "validHrs": 1647.11 }, "bg": { "buckets": { "validated": 9211, "invalidated": 736, "dev": 2224, "test": 2678, "train": 4293, "other": 3396 }, "duration": 73053648, "reportedSentences": 187, "clips": 13343, "splits": { "accent": {}, "age": { "": 0.37, "twenties": 0.11, "thirties": 0.28, "teens": 0, "fourties": 0.23, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.36, "male": 0.59, "female": 0.04, "other": 0 } }, "users": 117, "size": 426436227, "checksum": "87eedd9d81da709daaf29849857f776acb419b3f432a4b2959eb9fab87e3f18f", "avgDurationSecs": 5.475, "validDurationSecs": 50430.724, "totalHrs": 20.29, "validHrs": 14 }, "bn": { "buckets": { "validated": 44023, "invalidated": 7792, "dev": 9314, "test": 9314, "train": 21160, "other": 997016 }, "duration": 4578886548, "reportedSentences": 1887, "clips": 1048831, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0.67, "thirties": 0.03, "teens": 0.07, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 
0.22, "male": 0.54, "female": 0.23, "other": 0 } }, "users": 22897, "size": 26364794462, "checksum": "d69cbbb6b2adc1321f62915866fec29aee558d75f89227b683aa8952c9f39fda", "avgDurationSecs": 4.366, "validDurationSecs": 192191.423, "totalHrs": 1271.91, "validHrs": 53.38 }, "br": { "buckets": { "validated": 14592, "invalidated": 1096, "dev": 2179, "test": 2212, "train": 2586, "other": 14216 }, "duration": 95012791, "reportedSentences": 285, "clips": 29904, "splits": { "accent": {}, "age": { "": 0.3, "twenties": 0.33, "thirties": 0.09, "teens": 0.01, "fourties": 0.06, "fifties": 0.05, "sixties": 0.14, "seventies": 0.04, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male": 0.67, "female": 0.02, "other": 0 } }, "users": 201, "size": 663544603, "checksum": "9dbebb34bd42711222ca5f656631f80623c74fea4414a967505a720900cf4b16", "avgDurationSecs": 3.177, "validDurationSecs": 46362.582, "totalHrs": 26.39, "validHrs": 12.87 }, "ca": { "buckets": { "validated": 1798644, "invalidated": 108913, "dev": 16402, "test": 16402, "train": 1145141, "other": 450511 }, "duration": 12597142734, "reportedSentences": 6340, "clips": 2358068, "splits": { "accent": {}, "age": { "": 0.29, "twenties": 0.04, "thirties": 0.05, "teens": 0.01, "fourties": 0.12, "fifties": 0.17, "sixties": 0.28, "seventies": 0.03, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male": 0.5, "female": 0.2, "other": 0 } }, "users": 35528, "size": 75857088231, "checksum": "398f00bc64d5639ca240cf16f0332c6837c514e7d943efe6963cb23b9fd5bd42", "avgDurationSecs": 5.342, "validDurationSecs": 9608618.24, "totalHrs": 3499.2, "validHrs": 2669.06 }, "ckb": { "buckets": { "validated": 103739, "invalidated": 7506, "dev": 4949, "test": 4957, "train": 7074, "other": 9527 }, "duration": 471696048, "reportedSentences": 2419, "clips": 120772, "splits": { "accent": {}, "age": { "": 0.34, "twenties": 0.49, "thirties": 0.11, "teens": 0.02, "fourties": 0.02, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, 
"gender": { "": 0.32, "male": 0.62, "female": 0.06, "other": 0 } }, "users": 1309, "size": 2560627839, "checksum": "afca800223d23986703906c06a4d97ca47ced6631f8433be351e5f29c6d08589", "avgDurationSecs": 3.906, "validDurationSecs": 405170.704, "totalHrs": 131.02, "validHrs": 112.54 }, "cnh": { "buckets": { "validated": 2458, "invalidated": 437, "dev": 761, "test": 763, "train": 817, "other": 3292 }, "duration": 21751656, "reportedSentences": 9, "clips": 6187, "splits": { "accent": {}, "age": { "": 0.48, "twenties": 0.4, "thirties": 0.07, "teens": 0.01, "fourties": 0.01, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.48, "male": 0.3, "female": 0.21, "other": 0 } }, "users": 300, "size": 167586160, "checksum": "2250cc743acdebc2c36662b15654ede4a8c6f49cc1a9c29e25b46daea2ec39f8", "avgDurationSecs": 3.516, "validDurationSecs": 8641.599, "totalHrs": 6.04, "validHrs": 2.4 }, "cs": { "buckets": { "validated": 61023, "invalidated": 2169, "dev": 9019, "test": 9028, "train": 20023, "other": 147305 }, "duration": 939359343, "reportedSentences": 929, "clips": 210497, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0.19, "thirties": 0.47, "teens": 0.02, "fourties": 0.07, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.23, "male": 0.54, "female": 0.22, "other": 0 } }, "users": 957, "size": 5748797721, "checksum": "ed322dbe00e72978ab50036c633c8412cdb6b0e452be2c3ab20c0d02bd3f124e", "avgDurationSecs": 4.463, "validDurationSecs": 272319.915, "totalHrs": 260.93, "validHrs": 75.64 }, "cv": { "buckets": { "validated": 17322, "invalidated": 2137, "dev": 1259, "test": 1290, "train": 1437, "other": 113 }, "duration": 99019848, "reportedSentences": 144, "clips": 19572, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0.51, "thirties": 0.01, "teens": 0.19, "fourties": 0.06, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2, "male": 
0.54, "female": 0.27, "other": 0 } }, "users": 112, "size": 672401693, "checksum": "8fd80ff1433d8596a9107a00fe115207933d9aa08bb27faea974a40a3a459716", "avgDurationSecs": 5.059, "validDurationSecs": 87636.512, "totalHrs": 27.5, "validHrs": 24.34 }, "cy": { "buckets": { "validated": 89930, "invalidated": 4432, "dev": 5352, "test": 5355, "train": 7902, "other": 19777 }, "duration": 558447309, "reportedSentences": 172, "clips": 114139, "splits": { "accent": {}, "age": { "": 0.43, "twenties": 0.13, "thirties": 0.09, "teens": 0.02, "fourties": 0.16, "fifties": 0.09, "sixties": 0.06, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.41, "male": 0.33, "female": 0.25, "other": 0.01 } }, "users": 1832, "size": 4076370134, "checksum": "aac2d405054b9a7f819823027e2f5fb841247c0f51c583d08b29be1044ec8139", "avgDurationSecs": 4.893, "validDurationSecs": 440000.057, "totalHrs": 155.12, "validHrs": 122.22 }, "da": { "buckets": { "validated": 10146, "invalidated": 401, "dev": 2105, "test": 2512, "train": 3464, "other": 381 }, "duration": 45040500, "reportedSentences": 422, "clips": 10928, "splits": { "accent": {}, "age": { "": 0.32, "twenties": 0.19, "thirties": 0.25, "teens": 0.01, "fourties": 0.2, "fifties": 0.03, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.32, "male": 0.6, "female": 0.08, "other": 0 } }, "users": 254, "size": 258682767, "checksum": "067b5c6f6f2a57b5f77e331d3d5c480f341bc26184b22aa425cb5d1f7b34ba86", "avgDurationSecs": 4.122, "validDurationSecs": 41817.433, "totalHrs": 12.51, "validHrs": 11.61 }, "de": { "buckets": { "validated": 904411, "invalidated": 53837, "dev": 16173, "test": 16173, "train": 576207, "other": 6998 }, "duration": 5049626489, "reportedSentences": 10243, "clips": 965246, "splits": { "accent": {}, "age": { "": 0.33, "twenties": 0.18, "thirties": 0.16, "teens": 0.03, "fourties": 0.17, "fifties": 0.11, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.32, "male": 
0.59, "female": 0.08, "other": 0.01 } }, "users": 18647, "size": 35181209876, "checksum": "116913120c964ed66047734da598f65314255905541a6c760ffdf73e8a867683", "avgDurationSecs": 5.231, "validDurationSecs": 4731371.839, "totalHrs": 1402.67, "validHrs": 1314.26 }, "dv": { "buckets": { "validated": 27579, "invalidated": 1702, "dev": 2210, "test": 2211, "train": 2696, "other": 16339 }, "duration": 230943087, "reportedSentences": 56, "clips": 45620, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0.17, "thirties": 0.36, "teens": 0.01, "fourties": 0.21, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.21, "male": 0.29, "female": 0.5, "other": 0 } }, "users": 349, "size": 1461023790, "checksum": "cac3aed5224d5a1aae7f4ee45b96cdf10f3f6c94e0007bc0b1bbf747bd2b1b96", "avgDurationSecs": 5.062, "validDurationSecs": 139613.753, "totalHrs": 64.15, "validHrs": 38.78 }, "dyu": { "buckets": { "validated": 182, "invalidated": 8, "dev": 49, "test": 53, "train": 73, "other": 45 }, "duration": 1443960, "reportedSentences": 0, "clips": 235, "splits": { "accent": {}, "age": { "": 0.15, "twenties": 0.83, "thirties": 0, "teens": 0.02, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.15, "male": 0.47, "female": 0.38, "other": 0 } }, "users": 20, "size": 8475490, "checksum": "c71861ec35fa5e14186c55079cc0cbe5d7ea77ac7c07351aac832c993225ac55", "avgDurationSecs": 6.145, "validDurationSecs": 1118.301, "totalHrs": 0.4, "validHrs": 0.31 }, "el": { "buckets": { "validated": 16071, "invalidated": 829, "dev": 1673, "test": 1701, "train": 1942, "other": 10227 }, "duration": 112289218, "reportedSentences": 70, "clips": 27127, "splits": { "accent": {}, "age": { "": 0.31, "twenties": 0.12, "thirties": 0.37, "teens": 0.01, "fourties": 0.16, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.31, "male": 0.64, "female": 0.03, "other": 0.02 } }, 
"users": 384, "size": 747401627, "checksum": "eca2965788611993d5fd24a1d5cacb973c9a3912b328d8514c928a5111efdd6e", "avgDurationSecs": 4.139, "validDurationSecs": 66524.128, "totalHrs": 31.19, "validHrs": 18.47 }, "en": { "buckets": { "validated": 1782818, "invalidated": 288077, "dev": 16390, "test": 16390, "train": 1090061, "other": 299335 }, "duration": 12374969047, "reportedSentences": 7205, "clips": 2370230, "splits": { "accent": {}, "age": { "": 0.36, "twenties": 0.25, "thirties": 0.14, "teens": 0.06, "fourties": 0.09, "fifties": 0.05, "sixties": 0.04, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.36, "male": 0.45, "female": 0.17, "other": 0.02 } }, "users": 90474, "size": 86755746934, "checksum": "9e05009435ce300b903a52c60f9368f376f17c30bf6f31e2405013a0feae4002", "avgDurationSecs": 5.221, "validDurationSecs": 9308091.437, "totalHrs": 3437.49, "validHrs": 2585.58 }, "eo": { "buckets": { "validated": 850060, "invalidated": 127409, "dev": 14889, "test": 14908, "train": 144158, "other": 150176 }, "duration": 6841056391, "reportedSentences": 2263, "clips": 1127645, "splits": { "accent": {}, "age": { "": 0.2, "twenties": 0.56, "thirties": 0.12, "teens": 0.05, "fourties": 0.04, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0.01, "nineties": 0 }, "gender": { "": 0.2, "male": 0.69, "female": 0.11, "other": 0 } }, "users": 1714, "size": 40827340340, "checksum": "1241e1f0503fc713e3746bbf0207017007ef3d0deb73b0dfc640c07a9a59534c", "avgDurationSecs": 6.067, "validDurationSecs": 5157038.249, "totalHrs": 1900.29, "validHrs": 1432.51 }, "es": { "buckets": { "validated": 405662, "invalidated": 83144, "dev": 15836, "test": 15836, "train": 327717, "other": 1140004 }, "duration": 7935345234, "reportedSentences": 2511, "clips": 1628810, "splits": { "accent": {}, "age": { "": 0.13, "twenties": 0.54, "thirties": 0.09, "teens": 0.08, "fourties": 0.03, "fifties": 0.04, "sixties": 0.09, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 
0.13, "male": 0.55, "female": 0.32, "other": 0 } }, "users": 25695, "size": 49971078497, "checksum": "891ea511c25f0ad98212ff9fbcdd855aaf6fc826d645d226f8481f957edababb", "avgDurationSecs": 4.872, "validDurationSecs": 1976331.198, "totalHrs": 2204.26, "validHrs": 548.98 }, "et": { "buckets": { "validated": 24358, "invalidated": 7447, "dev": 2653, "test": 2653, "train": 3157, "other": 70 }, "duration": 215805926, "reportedSentences": 548, "clips": 31875, "splits": { "accent": {}, "age": { "": 0.18, "twenties": 0.7, "thirties": 0.08, "teens": 0, "fourties": 0.03, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.19, "male": 0.55, "female": 0.26, "other": 0 } }, "users": 898, "size": 1453400161, "checksum": "04ae74a77fee8189066eaf6e42dd73ae3912e18a2cd7d81fffb003f4ceba448d", "avgDurationSecs": 6.77, "validDurationSecs": 164912.965, "totalHrs": 59.94, "validHrs": 45.8 }, "eu": { "buckets": { "validated": 144577, "invalidated": 20186, "dev": 12742, "test": 12742, "train": 54390, "other": 219105 }, "duration": 2098456311, "reportedSentences": 966, "clips": 383868, "splits": { "accent": {}, "age": { "": 0.44, "twenties": 0.2, "thirties": 0.06, "teens": 0.01, "fourties": 0.17, "fifties": 0.08, "sixties": 0.03, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.44, "male": 0.26, "female": 0.29, "other": 0.01 } }, "users": 9089, "size": 12945214971, "checksum": "3f4410c994572fb4b14db1e789fa9940af8389721a9e454f5ee959abb7f2fedd", "avgDurationSecs": 5.467, "validDurationSecs": 790345.947, "totalHrs": 582.9, "validHrs": 219.54 }, "fa": { "buckets": { "validated": 328292, "invalidated": 14522, "dev": 10554, "test": 10554, "train": 28860, "other": 26644 }, "duration": 1473154388, "reportedSentences": 2405, "clips": 369458, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.32, "thirties": 0.36, "teens": 0.03, "fourties": 0.03, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { 
"": 0.22, "male": 0.71, "female": 0.07, "other": 0 } }, "users": 4409, "size": 10697697659, "checksum": "e6097496f734be8e1b23b86946616a56271cfea91f24c6ce13cf02dfa1181364", "avgDurationSecs": 3.987, "validDurationSecs": 1309011.58, "totalHrs": 409.2, "validHrs": 363.61 }, "fi": { "buckets": { "validated": 9304, "invalidated": 252, "dev": 1676, "test": 1769, "train": 2131, "other": 5785 }, "duration": 70929363, "reportedSentences": 66, "clips": 15341, "splits": { "accent": {}, "age": { "": 0.33, "twenties": 0.11, "thirties": 0.19, "teens": 0.01, "fourties": 0.32, "fifties": 0.04, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male": 0.34, "female": 0.32, "other": 0 } }, "users": 217, "size": 421640372, "checksum": "e26b27458089f5e913d78a8065520122b87531e203bf1b24afca9cd21629d58d", "avgDurationSecs": 4.624, "validDurationSecs": 43017.195, "totalHrs": 19.7, "validHrs": 11.94 }, "fr": { "buckets": { "validated": 709439, "invalidated": 62511, "dev": 16145, "test": 16145, "train": 541822, "other": 26918 }, "duration": 4005163090, "reportedSentences": 7107, "clips": 798868, "splits": { "accent": {}, "age": { "": 0.37, "twenties": 0.17, "thirties": 0.15, "teens": 0.03, "fourties": 0.14, "fifties": 0.09, "sixties": 0.03, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male": 0.58, "female": 0.11, "other": 0.01 } }, "users": 18487, "size": 28174266785, "checksum": "f569408c5ba4edd7b3742bed38eb47c33185466839538a35bb499592d2cb2206", "avgDurationSecs": 5.014, "validDurationSecs": 3556806.503, "totalHrs": 1112.54, "validHrs": 988 }, "fy-NL": { "buckets": { "validated": 50873, "invalidated": 3951, "dev": 3171, "test": 3171, "train": 3920, "other": 101888 }, "duration": 760767003, "reportedSentences": 537, "clips": 156712, "splits": { "accent": {}, "age": { "": 0.56, "twenties": 0.04, "thirties": 0.1, "teens": 0, "fourties": 0.06, "fifties": 0.1, "sixties": 0.13, "seventies": 0, "eighties": 0, "nineties": 0 }, 
"gender": { "": 0.57, "male": 0.13, "female": 0.31, "other": 0 } }, "users": 2021, "size": 4555659205, "checksum": "c01c3db19fde2f524ffe766f1cdcea0e3cc7dde9110b10ca94ed6bbaa31389e5", "avgDurationSecs": 4.855, "validDurationSecs": 246965.77, "totalHrs": 211.32, "validHrs": 68.6 }, "ga-IE": { "buckets": { "validated": 5479, "invalidated": 928, "dev": 515, "test": 516, "train": 538, "other": 3917 }, "duration": 37015731, "reportedSentences": 29, "clips": 10324, "splits": { "accent": {}, "age": { "": 0.37, "twenties": 0.25, "thirties": 0.25, "teens": 0.01, "fourties": 0.06, "fifties": 0.05, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male": 0.49, "female": 0.14, "other": 0 } }, "users": 185, "size": 252095177, "checksum": "847f74a8a6145833f268b78191baf9b94c707b37a9b9c92c6ea0a90c57736c6a", "avgDurationSecs": 3.585, "validDurationSecs": 19644.439, "totalHrs": 10.28, "validHrs": 5.45 }, "gl": { "buckets": { "validated": 34399, "invalidated": 2903, "dev": 8273, "test": 8394, "train": 17245, "other": 8862 }, "duration": 228079692, "reportedSentences": 366, "clips": 46164, "splits": { "accent": {}, "age": { "": 0.34, "twenties": 0.2, "thirties": 0.17, "teens": 0.02, "fourties": 0.19, "fifties": 0.07, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male": 0.34, "female": 0.28, "other": 0 } }, "users": 1031, "size": 1310051180, "checksum": "53c662f9c8ea5745883d42cbfc932a8f68dddc45ed2d20da8656083ae89c0340", "avgDurationSecs": 4.941, "validDurationSecs": 169953.066, "totalHrs": 63.35, "validHrs": 47.2 }, "gn": { "buckets": { "validated": 2908, "invalidated": 162, "dev": 360, "test": 855, "train": 1571, "other": 18779 }, "duration": 99216036, "reportedSentences": 36, "clips": 21849, "splits": { "accent": {}, "age": { "": 0.13, "twenties": 0.58, "thirties": 0.19, "teens": 0.02, "fourties": 0.08, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.13, 
"male": 0.38, "female": 0.49, "other": 0 } }, "users": 164, "size": 567485675, "checksum": "99b716f4f717be49a093bb6a6b94e464025504a12e786c053e5dbbf355a0eb91", "avgDurationSecs": 4.541, "validDurationSecs": 13205.192, "totalHrs": 27.56, "validHrs": 3.66 }, "ha": { "buckets": { "validated": 3280, "invalidated": 169, "dev": 582, "test": 660, "train": 1926, "other": 6662 }, "duration": 43814016, "reportedSentences": 33, "clips": 10111, "splits": { "accent": {}, "age": { "": 0.18, "twenties": 0.12, "thirties": 0.68, "teens": 0, "fourties": 0, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.16, "male": 0.59, "female": 0.25, "other": 0 } }, "users": 40, "size": 256205155, "checksum": "85a467b9aadc885ae477bdcdce981830f820059dffb3b501bba75f87e254c60d", "avgDurationSecs": 4.333, "validDurationSecs": 14213.23, "totalHrs": 12.17, "validHrs": 3.94 }, "he": { "buckets": { "validated": 1099, "invalidated": 173, "dev": 254, "test": 241, "train": 391, "other": 1525 }, "duration": 12543840, "reportedSentences": 23, "clips": 2797, "splits": { "accent": {}, "age": { "": 0.01, "twenties": 0.77, "thirties": 0.16, "teens": 0, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0.05, "nineties": 0 }, "gender": { "": 0.01, "male": 0.98, "female": 0.01, "other": 0 } }, "users": 14, "size": 72327972, "checksum": "77c240c1691ca7817999a68af7c25d40e223aef30507c2b245e3e5f673d8a00e", "avgDurationSecs": 4.485, "validDurationSecs": 4928.738, "totalHrs": 3.48, "validHrs": 1.36 }, "hi": { "buckets": { "validated": 10279, "invalidated": 770, "dev": 2425, "test": 3123, "train": 4674, "other": 3694 }, "duration": 72197520, "reportedSentences": 138, "clips": 14743, "splits": { "accent": {}, "age": { "": 0.34, "twenties": 0.34, "thirties": 0.25, "teens": 0.01, "fourties": 0.03, "fifties": 0.02, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.34, "male": 0.62, "female": 0.04, "other": 0 } }, "users": 
403, "size": 425972425, "checksum": "e398cd5ec446cd37cf80becffdf27f4a5ba21a4c1f0a4e3fa7e93ebac3edc5b0", "avgDurationSecs": 4.897, "validDurationSecs": 50336.994, "totalHrs": 20.05, "validHrs": 13.98 }, "hsb": { "buckets": { "validated": 1424, "invalidated": 247, "dev": 172, "test": 444, "train": 808, "other": 94 }, "duration": 10872468, "reportedSentences": 87, "clips": 1765, "splits": { "accent": {}, "age": { "": 0.18, "twenties": 0.1, "thirties": 0.1, "teens": 0.04, "fourties": 0.52, "fifties": 0.03, "sixties": 0, "seventies": 0.03, "eighties": 0, "nineties": 0 }, "gender": { "": 0.18, "male": 0.82, "female": 0, "other": 0 } }, "users": 21, "size": 83667265, "checksum": "deea4f8459d671a4fa2cd9e274a48e6096eb8d79c30b9dbb635bd6993d91e768", "avgDurationSecs": 6.16, "validDurationSecs": 8771.895, "totalHrs": 3.02, "validHrs": 2.43 }, "hu": { "buckets": { "validated": 59822, "invalidated": 3054, "dev": 11381, "test": 11417, "train": 36596, "other": 48016 }, "duration": 612475057, "reportedSentences": 443, "clips": 110892, "splits": { "accent": {}, "age": { "": 0.28, "twenties": 0.12, "thirties": 0.13, "teens": 0.02, "fourties": 0.12, "fifties": 0.31, "sixties": 0.02, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.29, "male": 0.25, "female": 0.45, "other": 0.01 } }, "users": 1595, "size": 3573438643, "checksum": "3829fea5c2ca1f23c2e508e1805a960689b4b8b3aee8c30f575ff186a2fa13df", "avgDurationSecs": 5.523, "validDurationSecs": 330406.908, "totalHrs": 170.13, "validHrs": 91.77 }, "hy-AM": { "buckets": { "validated": 9370, "invalidated": 744, "dev": 2656, "test": 2853, "train": 3794, "other": 5513 }, "duration": 84972096, "reportedSentences": 88, "clips": 15627, "splits": { "accent": {}, "age": { "": 0.18, "twenties": 0.48, "thirties": 0.07, "teens": 0.22, "fourties": 0.04, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.23, "male": 0.29, "female": 0.48, "other": 0 } }, "users": 224, "size": 491813367, 
"checksum": "2486b1cbf9348585d3ebd0eb2645fd107db0b8961ff5b4467f40b7394e201d35", "avgDurationSecs": 5.438, "validDurationSecs": 50949.545, "totalHrs": 23.6, "validHrs": 14.15 }, "ia": { "buckets": { "validated": 11658, "invalidated": 336, "dev": 1803, "test": 1792, "train": 5005, "other": 2588 }, "duration": 61036932, "reportedSentences": 276, "clips": 14582, "splits": { "accent": {}, "age": { "": 0.38, "twenties": 0.05, "thirties": 0.02, "teens": 0, "fourties": 0.3, "fifties": 0.03, "sixties": 0, "seventies": 0.22, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male": 0.61, "female": 0.01, "other": 0 } }, "users": 66, "size": 413083015, "checksum": "5df4475c33692db10164d35964a934777ae87e689f0cbed8abfe52f5e1ad68f7", "avgDurationSecs": 4.186, "validDurationSecs": 48797.734, "totalHrs": 16.95, "validHrs": 13.55 }, "id": { "buckets": { "validated": 25986, "invalidated": 2597, "dev": 3341, "test": 3643, "train": 4969, "other": 29031 }, "duration": 229771812, "reportedSentences": 345, "clips": 57614, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.41, "thirties": 0.1, "teens": 0.22, "fourties": 0.02, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.25, "male": 0.45, "female": 0.27, "other": 0.04 } }, "users": 524, "size": 1444568699, "checksum": "307dd20a49b03a2f702d9c43f74f1cf0d1f812039938472eebdaf3e5a285a1a2", "avgDurationSecs": 3.988, "validDurationSecs": 103635.406, "totalHrs": 63.82, "validHrs": 28.78 }, "ig": { "buckets": { "validated": 14, "invalidated": 3, "dev": 2, "test": 4, "train": 8, "other": 5760 }, "duration": 31554684, "reportedSentences": 16, "clips": 5777, "splits": { "accent": {}, "age": { "": 0.56, "twenties": 0.33, "thirties": 0.04, "teens": 0.05, "fourties": 0, "fifties": 0, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.56, "male": 0.14, "female": 0.3, "other": 0 } }, "users": 114, "size": 184210449, "checksum": 
"5f38b3cd766bf3c03e913c1426e96c8b0418151d12071ccc433dc5eb89cf0dbf", "avgDurationSecs": 5.462, "validDurationSecs": 76.47, "totalHrs": 8.76, "validHrs": 0.02 }, "is": { "buckets": { "validated": 13, "invalidated": 1, "dev": 0, "test": 4, "train": 9, "other": 1 }, "duration": 110016, "reportedSentences": 0, "clips": 15, "splits": { "accent": {}, "age": { "": 0.67, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0.33, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.67, "male": 0.33, "female": 0, "other": 0 } }, "users": 2, "size": 645045, "checksum": "47a8c96c9e25b9a517a32521b8a7942cbc71c0105917743c6dad3ede0ee2a4f6", "avgDurationSecs": 7.334, "validDurationSecs": 95.347, "totalHrs": 0.03, "validHrs": 0.02 }, "it": { "buckets": { "validated": 235478, "invalidated": 19258, "dev": 15136, "test": 15144, "train": 168490, "other": 4552 }, "duration": 1397747892, "reportedSentences": 5928, "clips": 259288, "splits": { "accent": {}, "age": { "": 0.28, "twenties": 0.21, "thirties": 0.17, "teens": 0.01, "fourties": 0.13, "fifties": 0.15, "sixties": 0.05, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.28, "male": 0.61, "female": 0.11, "other": 0 } }, "users": 7039, "size": 9531917157, "checksum": "8d308ac010e04d1a4a2117ad71b6124ac0bcc63fa0aa7fa270ed326ec4af613f", "avgDurationSecs": 5.391, "validDurationSecs": 1269394.951, "totalHrs": 388.26, "validHrs": 352.6 }, "ja": { "buckets": { "validated": 89747, "invalidated": 12757, "dev": 6094, "test": 6094, "train": 9616, "other": 232237 }, "duration": 1631522697, "reportedSentences": 344, "clips": 334741, "splits": { "accent": {}, "age": { "": 0.16, "twenties": 0.54, "thirties": 0.08, "teens": 0.07, "fourties": 0.1, "fifties": 0.04, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.17, "male": 0.44, "female": 0.38, "other": 0.01 } }, "users": 3767, "size": 9609850839, "checksum": 
"b9766d29f815cf10f4cf52b89f00a128ff78281539f5d79ea9d7468bdce46c6c", "avgDurationSecs": 4.874, "validDurationSecs": 437425.554, "totalHrs": 453.2, "validHrs": 121.5 }, "ka": { "buckets": { "validated": 84957, "invalidated": 3465, "dev": 11958, "test": 12004, "train": 44144, "other": 25261 }, "duration": 581233248, "reportedSentences": 700, "clips": 113683, "splits": { "accent": {}, "age": { "": 0.19, "twenties": 0.19, "thirties": 0.48, "teens": 0.02, "fourties": 0.05, "fifties": 0.05, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.19, "male": 0.24, "female": 0.56, "other": 0 } }, "users": 1293, "size": 3398943922, "checksum": "3c6bed106ca1d48baa16cfe716bb3d4d9389305f7f3a27ad2653ae628e3db745", "avgDurationSecs": 5.113, "validDurationSecs": 434364.268, "totalHrs": 161.45, "validHrs": 120.65 }, "kab": { "buckets": { "validated": 610621, "invalidated": 19655, "dev": 14998, "test": 14998, "train": 151972, "other": 112691 }, "duration": 2479725556, "reportedSentences": 9460, "clips": 742967, "splits": { "accent": {}, "age": { "": 0.28, "twenties": 0.12, "thirties": 0.29, "teens": 0, "fourties": 0.09, "fifties": 0.19, "sixties": 0.03, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male": 0.53, "female": 0.2, "other": 0 } }, "users": 1535, "size": 18489990245, "checksum": "db6070a227b821a53f7ba415451633aec88bddd5d10d597bb75f81c329b67d99", "avgDurationSecs": 3.338, "validDurationSecs": 2038007.743, "totalHrs": 688.81, "validHrs": 566.11 }, "kk": { "buckets": { "validated": 1547, "invalidated": 225, "dev": 494, "test": 502, "train": 545, "other": 682 }, "duration": 12029184, "reportedSentences": 42, "clips": 2454, "splits": { "accent": {}, "age": { "": 0.53, "twenties": 0.25, "thirties": 0.03, "teens": 0.06, "fourties": 0.08, "fifties": 0.05, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.54, "male": 0.35, "female": 0.12, "other": 0 } }, "users": 156, "size": 68338397, "checksum": 
"a1ec1882a2e62bd0c3d633e8680a6997cabf38ae09c4a73f178bcccb4746d33e", "avgDurationSecs": 4.902, "validDurationSecs": 7583.19, "totalHrs": 3.34, "validHrs": 2.1 }, "kmr": { "buckets": { "validated": 52943, "invalidated": 2400, "dev": 3517, "test": 3520, "train": 4469, "other": 31888 }, "duration": 353791656, "reportedSentences": 1034, "clips": 87231, "splits": { "accent": {}, "age": { "": 0.48, "twenties": 0.35, "thirties": 0.08, "teens": 0.01, "fourties": 0.03, "fifties": 0.04, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.48, "male": 0.42, "female": 0.1, "other": 0 } }, "users": 552, "size": 1866641513, "checksum": "5491877fb356b2f86baf6db3b8b0a533a91dc7a64149bfd54d747d505ea08739", "avgDurationSecs": 4.056, "validDurationSecs": 214726.32, "totalHrs": 98.27, "validHrs": 59.64 }, "ko": { "buckets": { "validated": 919, "invalidated": 237, "dev": 235, "test": 282, "train": 401, "other": 1832 }, "duration": 17813484, "reportedSentences": 18, "clips": 2988, "splits": { "accent": {}, "age": { "": 0.14, "twenties": 0.41, "thirties": 0.34, "teens": 0.04, "fourties": 0.08, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.14, "male": 0.65, "female": 0.21, "other": 0 } }, "users": 74, "size": 103463451, "checksum": "c4e46b3fe38a1d571034d802c6eca3401e4184170e203d76d6dfe879c88b140a", "avgDurationSecs": 5.962, "validDurationSecs": 5478.779, "totalHrs": 4.94, "validHrs": 1.52 }, "ky": { "buckets": { "validated": 30221, "invalidated": 5636, "dev": 1613, "test": 1613, "train": 1788, "other": 511 }, "duration": 165195912, "reportedSentences": 39, "clips": 36368, "splits": { "accent": {}, "age": { "": 0.07, "twenties": 0.66, "thirties": 0.08, "teens": 0.18, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.11, "male": 0.53, "female": 0.36, "other": 0 } }, "users": 272, "size": 1064716266, "checksum": 
"d6b85f45214211de43d93137d1320298a217c4cfff3633b5e0df7fbedbcaf420", "avgDurationSecs": 4.542, "validDurationSecs": 137274.133, "totalHrs": 45.88, "validHrs": 38.13 }, "lg": { "buckets": { "validated": 271801, "invalidated": 39170, "dev": 13331, "test": 13358, "train": 71069, "other": 36998 }, "duration": 2012482071, "reportedSentences": 6451, "clips": 347969, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.38, "thirties": 0.22, "teens": 0.01, "fourties": 0.07, "fifties": 0.05, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.25, "male": 0.36, "female": 0.39, "other": 0 } }, "users": 656, "size": 11784766047, "checksum": "5ca1d645bc91eda1280d374d3ef2d076c92c8b2fed21d32c08171767ac8ec6a4", "avgDurationSecs": 5.784, "validDurationSecs": 1571963.708, "totalHrs": 559.02, "validHrs": 436.65 }, "lij": { "buckets": { "validated": 2239, "invalidated": 453, "dev": 261, "test": 639, "train": 1337, "other": 0 }, "duration": 9635688, "reportedSentences": 27, "clips": 2692, "splits": { "accent": {}, "age": { "": 0.09, "twenties": 0.13, "thirties": 0.68, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0.09, "eighties": 0, "nineties": 0 }, "gender": { "": 0.09, "male": 0.68, "female": 0.23, "other": 0 } }, "users": 12, "size": 52589861, "checksum": "c41b6562992579f1c1642f715c863c68084b293da9bec77f473b20098d8ccf65", "avgDurationSecs": 3.579, "validDurationSecs": 8014.229, "totalHrs": 2.67, "validHrs": 2.22 }, "lo": { "buckets": { "validated": 108, "invalidated": 6, "dev": 28, "test": 25, "train": 55, "other": 77 }, "duration": 1234116, "reportedSentences": 0, "clips": 191, "splits": { "accent": {}, "age": { "": 0.07, "twenties": 0.36, "thirties": 0.57, "teens": 0, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.07, "male": 0.93, "female": 0, "other": 0 } }, "users": 10, "size": 7143700, "checksum": 
"34eb512ed8595ec496d521f4babe2b3a11bdc59aeddc91e4358e48b52b097092", "avgDurationSecs": 6.461, "validDurationSecs": 697.825, "totalHrs": 0.34, "validHrs": 0.19 }, "lt": { "buckets": { "validated": 16478, "invalidated": 792, "dev": 4772, "test": 4778, "train": 6728, "other": 396 }, "duration": 90629648, "reportedSentences": 189, "clips": 17666, "splits": { "accent": {}, "age": { "": 0.21, "twenties": 0.36, "thirties": 0.31, "teens": 0.03, "fourties": 0.04, "fifties": 0.05, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.21, "male": 0.57, "female": 0.22, "other": 0 } }, "users": 285, "size": 552247942, "checksum": "3c7402aa79fa7a2b154d86c5dcd2551ccb4aef0ac7475a56a04b79c7f0327dbb", "avgDurationSecs": 5.13, "validDurationSecs": 84535.002, "totalHrs": 25.17, "validHrs": 23.48 }, "ltg": { "buckets": { "validated": 8282, "invalidated": 512, "dev": 1716, "test": 1766, "train": 2190, "other": 4731 }, "duration": 52708572, "reportedSentences": 19, "clips": 13525, "splits": { "accent": {}, "age": { "": 0.32, "twenties": 0.11, "thirties": 0.19, "teens": 0, "fourties": 0.37, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.32, "male": 0.05, "female": 0.63, "other": 0 } }, "users": 169, "size": 299995552, "checksum": "15b81a60a062f0127c92f140dab401f3bb31da40a9f78def821745b6245996d7", "avgDurationSecs": 3.897, "validDurationSecs": 32275.963, "totalHrs": 14.64, "validHrs": 8.96 }, "lv": { "buckets": { "validated": 137691, "invalidated": 7279, "dev": 5217, "test": 5217, "train": 7603, "other": 51096 }, "duration": 883642481, "reportedSentences": 3056, "clips": 196066, "splits": { "accent": {}, "age": { "": 0.43, "twenties": 0.06, "thirties": 0.29, "teens": 0.01, "fourties": 0.1, "fifties": 0.09, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.43, "male": 0.26, "female": 0.31, "other": 0 } }, "users": 5174, "size": 4901247529, "checksum": 
"4512ec1e3b4081345c179a776ac83621fdb856a56b46d3fc477d4e587a2e4c0b", "avgDurationSecs": 4.507, "validDurationSecs": 620554.389, "totalHrs": 245.45, "validHrs": 172.37 }, "mdf": { "buckets": { "validated": 336, "invalidated": 9, "dev": 54, "test": 107, "train": 175, "other": 0 }, "duration": 1811340, "reportedSentences": 15, "clips": 345, "splits": { "accent": {}, "age": { "": 0.58, "twenties": 0.02, "thirties": 0, "teens": 0, "fourties": 0.34, "fifties": 0, "sixties": 0.06, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.58, "male": 0.08, "female": 0.34, "other": 0 } }, "users": 11, "size": 10639590, "checksum": "9f396b9638e47fbdb73204b96ea5f40efaec63a46357ad278bc4d2991db30d2a", "avgDurationSecs": 5.25, "validDurationSecs": 1764.088, "totalHrs": 0.5, "validHrs": 0.49 }, "mhr": { "buckets": { "validated": 205678, "invalidated": 11609, "dev": 14702, "test": 15069, "train": 173596, "other": 1 }, "duration": 1008639540, "reportedSentences": 100, "clips": 217288, "splits": { "accent": {}, "age": { "": 0.13, "twenties": 0.32, "thirties": 0.23, "teens": 0.12, "fourties": 0.13, "fifties": 0.04, "sixties": 0.03, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.12, "male": 0.2, "female": 0.68, "other": 0 } }, "users": 474, "size": 5646638061, "checksum": "17c44ac5d50d5a24966f953ca8ed890e6bfbf594c22aeb59a2317003fd9ae33b", "avgDurationSecs": 4.642, "validDurationSecs": 954746.527, "totalHrs": 280.17, "validHrs": 265.2 }, "mk": { "buckets": { "validated": 198, "invalidated": 16, "dev": 27, "test": 50, "train": 121, "other": 0 }, "duration": 1132524, "reportedSentences": 9, "clips": 214, "splits": { "accent": {}, "age": { "": 0.35, "twenties": 0.16, "thirties": 0.42, "teens": 0.07, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.35, "male": 0.65, "female": 0, "other": 0 } }, "users": 8, "size": 6664258, "checksum": "33d54a41d83028c5079d7d7710f6f4df00559149af16cf5a94aa87ea16ba5830", 
"avgDurationSecs": 5.292, "validDurationSecs": 1047.849, "totalHrs": 0.31, "validHrs": 0.29 }, "ml": { "buckets": { "validated": 2836, "invalidated": 105, "dev": 714, "test": 664, "train": 1256, "other": 5567 }, "duration": 35543052, "reportedSentences": 192, "clips": 8508, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0.57, "thirties": 0.09, "teens": 0, "fourties": 0.12, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.22, "male": 0.52, "female": 0.21, "other": 0.05 } }, "users": 131, "size": 199418168, "checksum": "2f1116c0b7bfe64fa5b642471ca604f2a5e83a23d72f62cc417110ae813938a1", "avgDurationSecs": 4.178, "validDurationSecs": 11847.684, "totalHrs": 9.87, "validHrs": 3.29 }, "mn": { "buckets": { "validated": 8638, "invalidated": 795, "dev": 1851, "test": 1876, "train": 2208, "other": 4699 }, "duration": 76987844, "reportedSentences": 34, "clips": 14132, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.44, "thirties": 0.22, "teens": 0.02, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0.01, "nineties": 0.05 }, "gender": { "": 0.26, "male": 0.39, "female": 0.29, "other": 0.06 } }, "users": 519, "size": 569045206, "checksum": "d56927a5d8f12f4b2bbf09f0fffa34f850ea745f8165e45182df95804332408b", "avgDurationSecs": 5.448, "validDurationSecs": 47057.812, "totalHrs": 21.38, "validHrs": 13.07 }, "mr": { "buckets": { "validated": 10887, "invalidated": 2262, "dev": 1783, "test": 1747, "train": 2215, "other": 2817 }, "duration": 98891496, "reportedSentences": 59, "clips": 15966, "splits": { "accent": {}, "age": { "": 0.05, "twenties": 0.29, "thirties": 0.13, "teens": 0.53, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.05, "male": 0.19, "female": 0.76, "other": 0 } }, "users": 89, "size": 579388627, "checksum": "d8873646a9b9244df1d02f1a92416dafed2a1702cb5ef80bb6607a4f293c57c9", "avgDurationSecs": 6.194, "validDurationSecs": 
67432.777, "totalHrs": 27.46, "validHrs": 18.73 }, "mrj": { "buckets": { "validated": 28624, "invalidated": 994, "dev": 7314, "test": 6815, "train": 14325, "other": 627 }, "duration": 126320148, "reportedSentences": 50, "clips": 30245, "splits": { "accent": {}, "age": { "": 0.1, "twenties": 0.34, "thirties": 0.23, "teens": 0.05, "fourties": 0.13, "fifties": 0.12, "sixties": 0.04, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.1, "male": 0.25, "female": 0.65, "other": 0 } }, "users": 58, "size": 702781178, "checksum": "e4640ec0e451f3c7082b90bf3b32a3174bf3df292affcc7993676acce1631cf0", "avgDurationSecs": 4.177, "validDurationSecs": 119549.939, "totalHrs": 35.08, "validHrs": 33.2 }, "mt": { "buckets": { "validated": 6434, "invalidated": 324, "dev": 1586, "test": 1659, "train": 1936, "other": 6315 }, "duration": 61963812, "reportedSentences": 10, "clips": 13073, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.19, "thirties": 0.09, "teens": 0.03, "fourties": 0.17, "fifties": 0.25, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male": 0.25, "female": 0.47, "other": 0.01 } }, "users": 218, "size": 459720019, "checksum": "d960236c415f8b52afbc3cd63bbf1c7a8946e891027382c2a6dc8b7e405e9ce2", "avgDurationSecs": 4.74, "validDurationSecs": 30496.073, "totalHrs": 17.21, "validHrs": 8.47 }, "myv": { "buckets": { "validated": 1969, "invalidated": 25, "dev": 239, "test": 479, "train": 1241, "other": 1 }, "duration": 11519388, "reportedSentences": 31, "clips": 1995, "splits": { "accent": {}, "age": { "": 0.38, "twenties": 0.09, "thirties": 0.24, "teens": 0.01, "fourties": 0.02, "fifties": 0, "sixties": 0.26, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male": 0.54, "female": 0.07, "other": 0 } }, "users": 13, "size": 67157080, "checksum": "4e384078f54027ce68de05b8a3857cf837c3a53226547a5a50931ec2b307ca72", "avgDurationSecs": 5.774, "validDurationSecs": 11369.261, "totalHrs": 3.19, 
"validHrs": 3.15 }, "nan-tw": { "buckets": { "validated": 7773, "invalidated": 521, "dev": 1679, "test": 2281, "train": 3665, "other": 19451 }, "duration": 71922744, "reportedSentences": 214, "clips": 27745, "splits": { "accent": {}, "age": { "": 0.12, "twenties": 0.54, "thirties": 0.15, "teens": 0.01, "fourties": 0.17, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.12, "male": 0.64, "female": 0.12, "other": 0.13 } }, "users": 209, "size": 405920716, "checksum": "b6f26b01b6aab7b15a5074c810919d6a1064463a5f64731b3bf80ba10c947d10", "avgDurationSecs": 2.592, "validDurationSecs": 20149.774, "totalHrs": 19.97, "validHrs": 5.59 }, "ne-NP": { "buckets": { "validated": 718, "invalidated": 60, "dev": 205, "test": 197, "train": 283, "other": 474 }, "duration": 4879440, "reportedSentences": 5, "clips": 1252, "splits": { "accent": {}, "age": { "": 0.27, "twenties": 0.6, "thirties": 0.13, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male": 0.74, "female": 0, "other": 0 } }, "users": 26, "size": 28430805, "checksum": "00efc082beabd830e08435fe307198fc7d0491e7baa4d9908815296bc06a28fc", "avgDurationSecs": 3.897, "validDurationSecs": 2798.273, "totalHrs": 1.35, "validHrs": 0.77 }, "nhi": { "buckets": { "validated": 19, "invalidated": 0, "dev": 0, "test": 5, "train": 14, "other": 6 }, "duration": 117972, "reportedSentences": 0, "clips": 25, "splits": { "accent": {}, "age": { "": 0.2, "twenties": 0, "thirties": 0.8, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2, "male": 0.8, "female": 0, "other": 0 } }, "users": 2, "size": 694316, "checksum": "d5e26ecbf455516a7fcbe1c7d9c43f4590d1d9018cd3419de83a02955a598712", "avgDurationSecs": 4.719, "validDurationSecs": 89.659, "totalHrs": 0.03, "validHrs": 0.02 }, "nl": { "buckets": { "validated": 90095, "invalidated": 5543, "dev": 11223, 
"test": 11235, "train": 34605, "other": 2294 }, "duration": 426106909, "reportedSentences": 339, "clips": 97932, "splits": { "accent": {}, "age": { "": 0.41, "twenties": 0.21, "thirties": 0.12, "teens": 0.02, "fourties": 0.15, "fifties": 0.08, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.41, "male": 0.48, "female": 0.11, "other": 0 } }, "users": 1663, "size": 2894099358, "checksum": "f3decbcdbf9174e4753d57194b4f4bb5c4ddcee4648e2c28ae8734badcdf03c3", "avgDurationSecs": 4.351, "validDurationSecs": 392007.74, "totalHrs": 118.36, "validHrs": 108.89 }, "nn-NO": { "buckets": { "validated": 1038, "invalidated": 63, "dev": 249, "test": 323, "train": 461, "other": 225 }, "duration": 5786892, "reportedSentences": 23, "clips": 1326, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0.29, "thirties": 0.42, "teens": 0.02, "fourties": 0.03, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.23, "male": 0.6, "female": 0.14, "other": 0.03 } }, "users": 35, "size": 33243732, "checksum": "78e5db87bb721b7083a2c6a8160dcf9504d0cf9e6b838ea2606647119bf0b48b", "avgDurationSecs": 4.364, "validDurationSecs": 4530.01, "totalHrs": 1.6, "validHrs": 1.25 }, "oc": { "buckets": { "validated": 1281, "invalidated": 156, "dev": 215, "test": 227, "train": 254, "other": 7844 }, "duration": 45233424, "reportedSentences": 10, "clips": 9281, "splits": { "accent": {}, "age": { "": 0.47, "twenties": 0.02, "thirties": 0.04, "teens": 0.01, "fourties": 0.18, "fifties": 0.17, "sixties": 0.12, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.49, "male": 0.09, "female": 0.42, "other": 0 } }, "users": 145, "size": 256219201, "checksum": "18342b17757903bef93be866f55486f9d09224cdc519b5a2ee1cf95352781ce9", "avgDurationSecs": 4.874, "validDurationSecs": 6243.294, "totalHrs": 12.56, "validHrs": 1.73 }, "or": { "buckets": { "validated": 3158, "invalidated": 274, "dev": 0, "test": 696, "train": 2048, "other": 5494 
}, "duration": 44718252, "reportedSentences": 31, "clips": 8926, "splits": { "accent": {}, "age": { "": 0.09, "twenties": 0.17, "thirties": 0.71, "teens": 0, "fourties": 0.02, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.09, "male": 0.88, "female": 0.04, "other": 0 } }, "users": 125, "size": 309388933, "checksum": "6bb8a1b8910cb76718d192edb70b0a19b711d4abf296b29d9d576d779a6cdc40", "avgDurationSecs": 5.01, "validDurationSecs": 15821.223, "totalHrs": 12.42, "validHrs": 4.39 }, "os": { "buckets": { "validated": 140, "invalidated": 27, "dev": 40, "test": 36, "train": 64, "other": 23 }, "duration": 1057932, "reportedSentences": 1, "clips": 190, "splits": { "accent": {}, "age": { "": 0.39, "twenties": 0, "thirties": 0, "teens": 0.11, "fourties": 0.5, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.39, "male": 0.61, "female": 0, "other": 0 } }, "users": 7, "size": 5894605, "checksum": "0db8edb4f29546b84507b24fcdf2127cd184409dc2d2a690698dff0f09c9895b", "avgDurationSecs": 5.568, "validDurationSecs": 779.529, "totalHrs": 0.29, "validHrs": 0.21 }, "pa-IN": { "buckets": { "validated": 1505, "invalidated": 81, "dev": 286, "test": 487, "train": 732, "other": 1388 }, "duration": 14357366, "reportedSentences": 251, "clips": 2974, "splits": { "accent": {}, "age": { "": 0.3, "twenties": 0.23, "thirties": 0.38, "teens": 0, "fourties": 0.04, "fifties": 0.05, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male": 0.69, "female": 0.01, "other": 0 } }, "users": 68, "size": 100825841, "checksum": "523346527290a3cb533758ee87195d4917efc50e5f98b2fe1f74a9a4a1e4ee61", "avgDurationSecs": 4.828, "validDurationSecs": 7265.58, "totalHrs": 3.98, "validHrs": 2.01 }, "pl": { "buckets": { "validated": 131435, "invalidated": 6538, "dev": 9060, "test": 9060, "train": 19894, "other": 954 }, "duration": 627625406, "reportedSentences": 558, "clips": 138927, "splits": { 
"accent": {}, "age": { "": 0.25, "twenties": 0.28, "thirties": 0.33, "teens": 0.02, "fourties": 0.11, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0.01 }, "gender": { "": 0.25, "male": 0.6, "female": 0.14, "other": 0.01 } }, "users": 3264, "size": 4472067181, "checksum": "dab35f907225411d0fa76348f0195a81ee182a5acc596302790d6dcb139a35e8", "avgDurationSecs": 4.518, "validDurationSecs": 593779.073, "totalHrs": 174.34, "validHrs": 164.93 }, "ps": { "buckets": { "validated": 1224, "invalidated": 12, "dev": 0, "test": 197, "train": 1027, "other": 33 }, "duration": 6190344, "reportedSentences": 1, "clips": 1269, "splits": { "accent": {}, "age": { "": 0.03, "twenties": 0.13, "thirties": 0.03, "teens": 0, "fourties": 0.81, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.03, "male": 0.97, "female": 0, "other": 0 } }, "users": 8, "size": 36372523, "checksum": "bcf66e5afd49f13a0a8fdc950857ed3938544ffc28f8ac4c86eabba89cf15b55", "avgDurationSecs": 4.878, "validDurationSecs": 5970.828, "totalHrs": 1.71, "validHrs": 1.65 }, "pt": { "buckets": { "validated": 150848, "invalidated": 7182, "dev": 9412, "test": 9414, "train": 21685, "other": 24247 }, "duration": 746365939, "reportedSentences": 2729, "clips": 182277, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.37, "thirties": 0.18, "teens": 0.02, "fourties": 0.13, "fifties": 0.02, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.24, "male": 0.68, "female": 0.06, "other": 0.01 } }, "users": 3337, "size": 4702929102, "checksum": "36145ed88edaab9872f447bca5d7821fddcb2e33669a3f77fba1080c5f083d60", "avgDurationSecs": 4.095, "validDurationSecs": 617674.249, "totalHrs": 207.32, "validHrs": 171.57 }, "quy": { "buckets": { "validated": 1, "invalidated": 1, "dev": 0, "test": 0, "train": 1, "other": 4 }, "duration": 36792, "reportedSentences": 0, "clips": 6, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 
0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male": 0, "female": 0, "other": 0 } }, "users": 2, "size": 217793, "checksum": "e77e3c7560ef638724fb3989c182eee1cd17ab54266131a5d97cdd2f6a39285f", "avgDurationSecs": 6.132, "validDurationSecs": 6.132, "totalHrs": 0.01, "validHrs": 0 }, "rm-sursilv": { "buckets": { "validated": 4305, "invalidated": 680, "dev": 1358, "test": 1363, "train": 1571, "other": 2201 }, "duration": 39264017, "reportedSentences": 17, "clips": 7186, "splits": { "accent": {}, "age": { "": 0.63, "twenties": 0.1, "thirties": 0.03, "teens": 0.05, "fourties": 0.19, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.63, "male": 0.18, "female": 0.19, "other": 0 } }, "users": 90, "size": 295754556, "checksum": "f126f17c134b15575104bbe4c4e69637c47a476337798221da13e4df2d47bd61", "avgDurationSecs": 5.464, "validDurationSecs": 23522.348, "totalHrs": 10.9, "validHrs": 6.53 }, "rm-vallader": { "buckets": { "validated": 1526, "invalidated": 394, "dev": 500, "test": 462, "train": 557, "other": 711 }, "duration": 15327554, "reportedSentences": 38, "clips": 2631, "splits": { "accent": {}, "age": { "": 0.36, "twenties": 0.14, "thirties": 0.06, "teens": 0, "fourties": 0.41, "fifties": 0, "sixties": 0.03, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.36, "male": 0.44, "female": 0.19, "other": 0.01 } }, "users": 53, "size": 116706596, "checksum": "50401449141060d92d5b59bb0ea6fbe56bc11b948a2dfe510c211450d2d9b027", "avgDurationSecs": 5.826, "validDurationSecs": 8890.098, "totalHrs": 4.25, "validHrs": 2.46 }, "ro": { "buckets": { "validated": 17522, "invalidated": 969, "dev": 3904, "test": 3902, "train": 5113, "other": 22506 }, "duration": 165299827, "reportedSentences": 402, "clips": 40997, "splits": { "accent": {}, "age": { "": 0.14, "twenties": 0.62, "thirties": 0.13, "teens": 0.01, "fourties": 0.06, "fifties": 0.02, "sixties": 
0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.14, "male": 0.71, "female": 0.14, "other": 0.01 } }, "users": 416, "size": 1025705898, "checksum": "256570f386506ed8fd48c730460f32f6354ed6a9f59695259d1d336183e1db5c", "avgDurationSecs": 4.032, "validDurationSecs": 70648.671, "totalHrs": 45.91, "validHrs": 19.62 }, "ru": { "buckets": { "validated": 162034, "invalidated": 9906, "dev": 10198, "test": 10199, "train": 26359, "other": 12190 }, "duration": 950615172, "reportedSentences": 465, "clips": 184130, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.34, "thirties": 0.15, "teens": 0.1, "fourties": 0.14, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.24, "male": 0.6, "female": 0.16, "other": 0 } }, "users": 3126, "size": 6448944122, "checksum": "ce8fd4aa18900b701b09304722fddedd9aa98152e0ffe11f99ca2eb4986e39ee", "avgDurationSecs": 5.163, "validDurationSecs": 836539.286, "totalHrs": 264.05, "validHrs": 232.37 }, "rw": { "buckets": { "validated": 1438768, "invalidated": 227806, "dev": 15987, "test": 16213, "train": 1003024, "other": 47299 }, "duration": 8582349497, "reportedSentences": 630, "clips": 1713873, "splits": { "accent": {}, "age": { "": 0.05, "twenties": 0.61, "thirties": 0.12, "teens": 0.2, "fourties": 0.02, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.1, "male": 0.57, "female": 0.33, "other": 0 } }, "users": 1128, "size": 60996395511, "checksum": "2908446346f1bf745a0f658ba7ee43ffb85628e058877c66959c8ff607ead7de", "avgDurationSecs": 5.008, "validDurationSecs": 7204740.27, "totalHrs": 2383.98, "validHrs": 2001.31 }, "sah": { "buckets": { "validated": 4842, "invalidated": 151, "dev": 1391, "test": 1432, "train": 1892, "other": 1683 }, "duration": 40858536, "reportedSentences": 5, "clips": 6676, "splits": { "accent": {}, "age": { "": 0.38, "twenties": 0.2, "thirties": 0.27, "teens": 0.06, "fourties": 0.07, "fifties": 0.02, "sixties": 0, 
"seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male": 0.39, "female": 0.23, "other": 0 } }, "users": 99, "size": 281947928, "checksum": "5bdf8918d3a9a8f13ca809b918ef032e15e9129483644a81970b3451216e5435", "avgDurationSecs": 6.12, "validDurationSecs": 29634.067, "totalHrs": 11.34, "validHrs": 8.23 }, "sat": { "buckets": { "validated": 462, "invalidated": 9, "dev": 0, "test": 147, "train": 315, "other": 353 }, "duration": 3676320, "reportedSentences": 7, "clips": 824, "splits": { "accent": {}, "age": { "": 0.43, "twenties": 0.43, "thirties": 0.1, "teens": 0.01, "fourties": 0.01, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.41, "male": 0.58, "female": 0.01, "other": 0 } }, "users": 13, "size": 20283797, "checksum": "058c7f26cf7a3a9f477c20ca2857a4ac6ce036a41c93b2692ce2e80b5d20e73f", "avgDurationSecs": 4.462, "validDurationSecs": 2061.238, "totalHrs": 1.02, "validHrs": 0.57 }, "sc": { "buckets": { "validated": 1183, "invalidated": 26, "dev": 262, "test": 224, "train": 531, "other": 321 }, "duration": 6899760, "reportedSentences": 3, "clips": 1530, "splits": { "accent": {}, "age": { "": 0.59, "twenties": 0.1, "thirties": 0.31, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.59, "male": 0.11, "female": 0.31, "other": 0 } }, "users": 14, "size": 39926578, "checksum": "294641d815d9ba96bf7e5bc81fbbd33191c51c5cc1e2a45f55da9bceff40d3b6", "avgDurationSecs": 4.51, "validDurationSecs": 5334.912, "totalHrs": 1.91, "validHrs": 1.48 }, "sk": { "buckets": { "validated": 19328, "invalidated": 826, "dev": 2636, "test": 2599, "train": 3244, "other": 3109 }, "duration": 94837680, "reportedSentences": 66, "clips": 23263, "splits": { "accent": {}, "age": { "": 0.43, "twenties": 0.09, "thirties": 0.21, "teens": 0.08, "fourties": 0.18, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.43, "male": 
0.48, "female": 0.07, "other": 0.01 } }, "users": 223, "size": 535384059, "checksum": "345581c840dc4c85d04edde224121c20c53ea01640f5038e0a74df2d7e5a69e9", "avgDurationSecs": 4.077, "validDurationSecs": 78795.627, "totalHrs": 26.34, "validHrs": 21.88 }, "skr": { "buckets": { "validated": 3642, "invalidated": 462, "dev": 1122, "test": 1004, "train": 1504, "other": 1618 }, "duration": 23697324, "reportedSentences": 17, "clips": 5722, "splits": { "accent": {}, "age": { "": 0.36, "twenties": 0.25, "thirties": 0.01, "teens": 0, "fourties": 0, "fifties": 0.38, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.36, "male": 0.64, "female": 0, "other": 0 } }, "users": 56, "size": 135621315, "checksum": "5aa6803cb99ab58cfdf5b88e8552310176e921cc904f30ada3ca18794f54970f", "avgDurationSecs": 4.141, "validDurationSecs": 15083.127, "totalHrs": 6.58, "validHrs": 4.18 }, "sl": { "buckets": { "validated": 10728, "invalidated": 278, "dev": 1204, "test": 1250, "train": 1401, "other": 3023 }, "duration": 53100802, "reportedSentences": 48, "clips": 14029, "splits": { "accent": {}, "age": { "": 0.16, "twenties": 0.42, "thirties": 0.04, "teens": 0.07, "fourties": 0.18, "fifties": 0.06, "sixties": 0.06, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.16, "male": 0.7, "female": 0.13, "other": 0 } }, "users": 149, "size": 365001932, "checksum": "4e633c3bbcb352824c8a831ce228237a5c4388f39bd707eec52279fc8c12029d", "avgDurationSecs": 3.785, "validDurationSecs": 40606.273, "totalHrs": 14.75, "validHrs": 11.27 }, "sq": { "buckets": { "validated": 1252, "invalidated": 15, "dev": 322, "test": 384, "train": 542, "other": 0 }, "duration": 5729292, "reportedSentences": 12, "clips": 1267, "splits": { "accent": {}, "age": { "": 0.65, "twenties": 0.07, "thirties": 0.19, "teens": 0, "fourties": 0.09, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.5, "male": 0.41, "female": 0.09, "other": 0 } }, "users": 45, "size": 
32924611, "checksum": "e832d32644124ee70c4be1f57b7a2c61ec9e3219702252a535f5b4bd54fb4032", "avgDurationSecs": 4.522, "validDurationSecs": 5661.463, "totalHrs": 1.59, "validHrs": 1.57 }, "sr": { "buckets": { "validated": 5733, "invalidated": 268, "dev": 1442, "test": 1543, "train": 1877, "other": 1770 }, "duration": 23130792, "reportedSentences": 53, "clips": 7771, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0.38, "thirties": 0.17, "teens": 0, "fourties": 0.06, "fifties": 0.16, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.23, "male": 0.53, "female": 0.24, "other": 0 } }, "users": 149, "size": 123806923, "checksum": "38a33b4e344beaf6c4b79ac6a7d5b0d2ea4b1bffa949b33a0c8d50d9d71b8748", "avgDurationSecs": 2.977, "validDurationSecs": 17064.577, "totalHrs": 6.42, "validHrs": 4.74 }, "sv-SE": { "buckets": { "validated": 40530, "invalidated": 1418, "dev": 5222, "test": 5206, "train": 7657, "other": 6609 }, "duration": 194228439, "reportedSentences": 589, "clips": 48557, "splits": { "accent": {}, "age": { "": 0.18, "twenties": 0.12, "thirties": 0.25, "teens": 0.03, "fourties": 0.38, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.19, "male": 0.48, "female": 0.33, "other": 0.01 } }, "users": 823, "size": 1226193538, "checksum": "26fdd5e3a7a006230f2445da5fc780f9f1088f9ce9e7a37d3ef4c332ff8bd879", "avgDurationSecs": 4, "validDurationSecs": 162120.366, "totalHrs": 53.95, "validHrs": 45.03 }, "sw": { "buckets": { "validated": 265365, "invalidated": 79915, "dev": 12231, "test": 12234, "train": 46209, "other": 376717 }, "duration": 3889601568, "reportedSentences": 1901, "clips": 721997, "splits": { "accent": {}, "age": { "": 0.31, "twenties": 0.5, "thirties": 0.11, "teens": 0.01, "fourties": 0.03, "fifties": 0.04, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.29, "male": 0.35, "female": 0.37, "other": 0 } }, "users": 1433, "size": 22663824721, 
"checksum": "baa58dc2a736c184e9ae9fb9f9bae4b416a4e9869999e8ec737b935fef6a59a0", "avgDurationSecs": 5.387, "validDurationSecs": 1429596.134, "totalHrs": 1080.44, "validHrs": 397.11 }, "ta": { "buckets": { "validated": 134550, "invalidated": 5673, "dev": 12049, "test": 12114, "train": 44839, "other": 88049 }, "duration": 1418463816, "reportedSentences": 3354, "clips": 228272, "splits": { "accent": {}, "age": { "": 0.71, "twenties": 0.09, "thirties": 0.09, "teens": 0.03, "fourties": 0.03, "fifties": 0.03, "sixties": 0, "seventies": 0.02, "eighties": 0, "nineties": 0 }, "gender": { "": 0.7, "male": 0.16, "female": 0.13, "other": 0 } }, "users": 886, "size": 8491571742, "checksum": "73022103c73d2727cb09db863c8eaa36aa99a78ac8b6f2c79b7f6f57468c0ab5", "avgDurationSecs": 6.214, "validDurationSecs": 836082.859, "totalHrs": 394.01, "validHrs": 232.24 }, "te": { "buckets": { "validated": 106, "invalidated": 1, "dev": 25, "test": 30, "train": 38, "other": 239 }, "duration": 1647972, "reportedSentences": 3, "clips": 346, "splits": { "accent": {}, "age": { "": 0.17, "twenties": 0.17, "thirties": 0.07, "teens": 0, "fourties": 0.06, "fifties": 0, "sixties": 0.52, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.17, "male": 0.83, "female": 0, "other": 0 } }, "users": 11, "size": 9609717, "checksum": "23f849894bee028dc2e4d29166a153d90d3eb7e1b7ae437a3e15f7c7a9663a7c", "avgDurationSecs": 4.763, "validDurationSecs": 504.87, "totalHrs": 0.45, "validHrs": 0.14 }, "th": { "buckets": { "validated": 146735, "invalidated": 9250, "dev": 11038, "test": 11038, "train": 32789, "other": 205532 }, "duration": 1515310068, "reportedSentences": 4234, "clips": 361517, "splits": { "accent": {}, "age": { "": 0.4, "twenties": 0.24, "thirties": 0.07, "teens": 0.04, "fourties": 0.04, "fifties": 0.2, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.4, "male": 0.42, "female": 0.17, "other": 0.01 } }, "users": 7849, "size": 8812954106, "checksum": 
"584bbbfd6ba5a4aaba5c9b002341cb02015f4b5f05849951a7271769337c8bda", "avgDurationSecs": 4.192, "validDurationSecs": 615044.446, "totalHrs": 420.91, "validHrs": 170.84 }, "ti": { "buckets": { "validated": 23, "invalidated": 1, "dev": 5, "test": 4, "train": 14, "other": 45 }, "duration": 378540, "reportedSentences": 3, "clips": 69, "splits": { "accent": {}, "age": { "": 0.78, "twenties": 0, "thirties": 0.14, "teens": 0, "fourties": 0.07, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.78, "male": 0.22, "female": 0, "other": 0 } }, "users": 6, "size": 2176784, "checksum": "7a1b466a2bf47fde2474c94ce89b5f0d8deff099445ac32cf3a626f0b0b8048b", "avgDurationSecs": 5.486, "validDurationSecs": 126.18, "totalHrs": 0.1, "validHrs": 0.03 }, "tig": { "buckets": { "validated": 294, "invalidated": 13, "dev": 0, "test": 45, "train": 249, "other": 14 }, "duration": 1620288, "reportedSentences": 1, "clips": 321, "splits": { "accent": {}, "age": { "": 0.92, "twenties": 0.02, "thirties": 0.06, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.92, "male": 0.08, "female": 0, "other": 0 } }, "users": 9, "size": 9523609, "checksum": "26a4c1183631c7ec7a9fa3a1370c8ea72f3b0d3e5e5bd7c0b8ebeda33f755492", "avgDurationSecs": 5.048, "validDurationSecs": 1484.002, "totalHrs": 0.45, "validHrs": 0.41 }, "tk": { "buckets": { "validated": 1727, "invalidated": 115, "dev": 334, "test": 486, "train": 804, "other": 1636 }, "duration": 18731232, "reportedSentences": 18, "clips": 3478, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.63, "thirties": 0.11, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male": 0.5, "female": 0.24, "other": 0 } }, "users": 82, "size": 106232349, "checksum": "39e85dbb3a2598ff9d54a27099bd21e669e34433767a3c7ab15e750883829c49", "avgDurationSecs": 5.386, "validDurationSecs": 
9300.988, "totalHrs": 5.2, "validHrs": 2.58 }, "tok": { "buckets": { "validated": 11397, "invalidated": 323, "dev": 2133, "test": 2125, "train": 2602, "other": 3305 }, "duration": 60932340, "reportedSentences": 142, "clips": 15025, "splits": { "accent": {}, "age": { "": 0.33, "twenties": 0.21, "thirties": 0.07, "teens": 0.26, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0.12, "nineties": 0 }, "gender": { "": 0.33, "male": 0.49, "female": 0.06, "other": 0.12 } }, "users": 138, "size": 355139704, "checksum": "486b1e6ef24db3bc3805ebb1e3e110ece5e87cfd6d234cd4057f55495ddc069b", "avgDurationSecs": 4.055, "validDurationSecs": 46219.36, "totalHrs": 16.92, "validHrs": 12.83 }, "tr": { "buckets": { "validated": 111192, "invalidated": 4412, "dev": 10996, "test": 11035, "train": 32802, "other": 116 }, "duration": 421520247, "reportedSentences": 396, "clips": 115720, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.24, "thirties": 0.08, "teens": 0.02, "fourties": 0.03, "fifties": 0.08, "sixties": 0.2, "seventies": 0.02, "eighties": 0.08, "nineties": 0 }, "gender": { "": 0.26, "male": 0.44, "female": 0.3, "other": 0 } }, "users": 1542, "size": 2548282361, "checksum": "3e8eb99bfa0d046fe2170bf4540721aa6666f7372a1855b5b1ff8dc7f19987b7", "avgDurationSecs": 3.643, "validDurationSecs": 405026.61, "totalHrs": 117.08, "validHrs": 112.5 }, "tt": { "buckets": { "validated": 29059, "invalidated": 419, "dev": 3620, "test": 4951, "train": 9396, "other": 124 }, "duration": 111451166, "reportedSentences": 6, "clips": 29602, "splits": { "accent": {}, "age": { "": 0.2, "twenties": 0.05, "thirties": 0.72, "teens": 0, "fourties": 0, "fifties": 0.01, "sixties": 0, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2, "male": 0.78, "female": 0.02, "other": 0 } }, "users": 251, "size": 820112480, "checksum": "7671166bb8e60ca4146598abaf0e2b17652eec288299623831cb29d97ea21e67", "avgDurationSecs": 3.765, "validDurationSecs": 109406.778, "totalHrs": 
30.95, "validHrs": 30.39 }, "tw": { "buckets": { "validated": 56, "invalidated": 0, "dev": 0, "test": 3, "train": 53, "other": 181 }, "duration": 943884, "reportedSentences": 3, "clips": 237, "splits": { "accent": {}, "age": { "": 0.07, "twenties": 0.02, "thirties": 0.91, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.07, "male": 0.93, "female": 0, "other": 0 } }, "users": 5, "size": 5527555, "checksum": "7fbf52c157d949ece8fe1fbc079a53afb173aeeef6f8090cbc6065694a2eafeb", "avgDurationSecs": 3.983, "validDurationSecs": 223.027, "totalHrs": 0.26, "validHrs": 0.06 }, "ug": { "buckets": { "validated": 87865, "invalidated": 5004, "dev": 5983, "test": 5983, "train": 9342, "other": 15002 }, "duration": 647441316, "reportedSentences": 577, "clips": 107871, "splits": { "accent": {}, "age": { "": 0.48, "twenties": 0.11, "thirties": 0.26, "teens": 0.03, "fourties": 0.1, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.48, "male": 0.37, "female": 0.15, "other": 0 } }, "users": 1183, "size": 3715539505, "checksum": "ccfd40ab2d82e78cb742d846e90e3c2d8562df3be590dbe1653027e956f915fe", "avgDurationSecs": 6.002, "validDurationSecs": 527365.383, "totalHrs": 179.84, "validHrs": 146.49 }, "uk": { "buckets": { "validated": 74065, "invalidated": 3124, "dev": 9832, "test": 9838, "train": 24079, "other": 5503 }, "duration": 386177244, "reportedSentences": 638, "clips": 82692, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.32, "thirties": 0.25, "teens": 0.09, "fourties": 0.1, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.24, "male": 0.58, "female": 0.18, "other": 0 } }, "users": 1057, "size": 2539371486, "checksum": "c825357be6042d21a9d6862218c84af5b05b2dbca45a196703d8334647ab6dd6", "avgDurationSecs": 4.67, "validDurationSecs": 345888.569, "totalHrs": 107.27, "validHrs": 96.08 }, "ur": { "buckets": { "validated": 
53852, "invalidated": 6814, "dev": 4056, "test": 4056, "train": 5368, "other": 135803 }, "duration": 834105816, "reportedSentences": 237, "clips": 196469, "splits": { "accent": {}, "age": { "": 0.08, "twenties": 0.9, "thirties": 0, "teens": 0, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.08, "male": 0.67, "female": 0.25, "other": 0 } }, "users": 342, "size": 4849602502, "checksum": "9c03c3b2d2e419cab8a5287be7fa6c49abb663cd7ef79d574c38a64160c4bad0", "avgDurationSecs": 4.245, "validDurationSecs": 228627.755, "totalHrs": 231.69, "validHrs": 63.5 }, "uz": { "buckets": { "validated": 86372, "invalidated": 13920, "dev": 12131, "test": 12341, "train": 48452, "other": 127930 }, "duration": 946998252, "reportedSentences": 1831, "clips": 228222, "splits": { "accent": {}, "age": { "": 0.41, "twenties": 0.4, "thirties": 0.01, "teens": 0.18, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.41, "male": 0.44, "female": 0.15, "other": 0 } }, "users": 2151, "size": 5281382551, "checksum": "09162fb6ad3a0090c3a8f27f8bc7d8623e68adc97a0f57bc28e6c28df953c420", "avgDurationSecs": 4.149, "validDurationSecs": 358397.232, "totalHrs": 263.05, "validHrs": 99.55 }, "vi": { "buckets": { "validated": 4976, "invalidated": 365, "dev": 477, "test": 1326, "train": 2350, "other": 11657 }, "duration": 67351544, "reportedSentences": 187, "clips": 16998, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.2, "thirties": 0.02, "teens": 0.21, "fourties": 0.02, "fifties": 0, "sixties": 0.3, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.25, "male": 0.54, "female": 0.19, "other": 0.02 } }, "users": 276, "size": 390726806, "checksum": "5deaf435f01299fddd361c9a423109a008f083258a20f9553c470bc7bb6338e3", "avgDurationSecs": 3.962, "validDurationSecs": 19716.513, "totalHrs": 18.7, "validHrs": 5.47 }, "vot": { "buckets": { "validated": 102, "invalidated": 324, 
"dev": 0, "test": 6, "train": 96, "other": 1 }, "duration": 1029972, "reportedSentences": 0, "clips": 427, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.74, "thirties": 0, "teens": 0.01, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.25, "male": 0.75, "female": 0, "other": 0 } }, "users": 6, "size": 7915922, "checksum": "0ad63ec0575865d5919fb9a2c89c7b37f3ef8979a1f533a6fdbdc3d55fc059ee", "avgDurationSecs": 2.412, "validDurationSecs": 246.035, "totalHrs": 0.28, "validHrs": 0.06 }, "yi": { "buckets": { "validated": 43, "invalidated": 1, "dev": 0, "test": 2, "train": 41, "other": 0 }, "duration": 166356, "reportedSentences": 1, "clips": 44, "splits": { "accent": {}, "age": { "": 0.27, "twenties": 0.73, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male": 0, "female": 0, "other": 0 } }, "users": 2, "size": 966132, "checksum": "9a5545e43149860d684e3f66020436fc647992a3237bb4f9216e1d27bfb1aa16", "avgDurationSecs": 3.781, "validDurationSecs": 162.575, "totalHrs": 0.04, "validHrs": 0.04 }, "yo": { "buckets": { "validated": 3075, "invalidated": 229, "dev": 863, "test": 998, "train": 1212, "other": 1115 }, "duration": 26232120, "reportedSentences": 19, "clips": 4419, "splits": { "accent": {}, "age": { "": 0.3, "twenties": 0.63, "thirties": 0.02, "teens": 0.05, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male": 0.32, "female": 0.38, "other": 0 } }, "users": 108, "size": 153132119, "checksum": "826acdc2c50a2b9287108dba81d3d5b899fc66c3d81a20af278e03fe584da32e", "avgDurationSecs": 5.936, "validDurationSecs": 18253.851, "totalHrs": 7.28, "validHrs": 5.07 }, "yue": { "buckets": { "validated": 21148, "invalidated": 1690, "dev": 2591, "test": 2585, "train": 3076, "other": 139436 }, "duration": 629086572, "reportedSentences": 1669, "clips": 162274, 
"splits": { "accent": {}, "age": { "": 0.17, "twenties": 0.32, "thirties": 0.45, "teens": 0.01, "fourties": 0.04, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.18, "male": 0.25, "female": 0.56, "other": 0.01 } }, "users": 897, "size": 3607938980, "checksum": "d4860d2f805cd30623f63fa93f58b39c2337676aabad48c9a30c638ee32460b4", "avgDurationSecs": 3.877, "validDurationSecs": 81984.316, "totalHrs": 174.74, "validHrs": 22.77 }, "zgh": { "buckets": { "validated": 266, "invalidated": 8, "dev": 48, "test": 43, "train": 175, "other": 0 }, "duration": 905832, "reportedSentences": 0, "clips": 274, "splits": { "accent": {}, "age": { "": 0.04, "twenties": 0.32, "thirties": 0.64, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.04, "male": 0.96, "female": 0, "other": 0 } }, "users": 5, "size": 5036073, "checksum": "35578b18ea2dfc8555707bd9d6f36dd6f35c2ca4e6148f61922daf456bf55bf5", "avgDurationSecs": 3.306, "validDurationSecs": 879.384, "totalHrs": 0.25, "validHrs": 0.24 }, "zh-CN": { "buckets": { "validated": 184605, "invalidated": 58420, "dev": 10626, "test": 10626, "train": 29406, "other": 595659 }, "duration": 3814708416, "reportedSentences": 791, "clips": 838684, "splits": { "accent": {}, "age": { "": 0.93, "twenties": 0.04, "thirties": 0.01, "teens": 0.01, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.93, "male": 0.06, "female": 0.01, "other": 0 } }, "users": 6938, "size": 22549894675, "checksum": "0083247487c89fa1ba7841c674bbe145c5bb673a8474c33acae41a7244b0db59", "avgDurationSecs": 4.548, "validDurationSecs": 839665.771, "totalHrs": 1059.64, "validHrs": 233.24 }, "zh-HK": { "buckets": { "validated": 92439, "invalidated": 4439, "dev": 5594, "test": 5594, "train": 8429, "other": 22076 }, "duration": 496819716, "reportedSentences": 683, "clips": 118954, "splits": { "accent": {}, "age": { "": 0.39, 
"twenties": 0.33, "thirties": 0.11, "teens": 0.02, "fourties": 0.13, "fifties": 0.02, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.35, "male": 0.42, "female": 0.22, "other": 0.01 } }, "users": 3014, "size": 3549548457, "checksum": "ea5415ea0e947c3f6c867893d5f9bc2a8f39f0a7fb84f848f4df54b005886e24", "avgDurationSecs": 4.177, "validDurationSecs": 386077.961, "totalHrs": 138, "validHrs": 107.24 }, "zh-TW": { "buckets": { "validated": 82297, "invalidated": 4769, "dev": 4976, "test": 4976, "train": 7101, "other": 47297 }, "duration": 451916626, "reportedSentences": 151, "clips": 134363, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.3, "thirties": 0.19, "teens": 0.05, "fourties": 0.09, "fifties": 0.09, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male": 0.5, "female": 0.23, "other": 0.02 } }, "users": 2176, "size": 3025493363, "checksum": "f09cdb68d303edc68dde1f984ecbe28c997ee92e9ae2f470204d8745f6846d7c", "avgDurationSecs": 3.363, "validDurationSecs": 276797.798, "totalHrs": 125.53, "validHrs": 76.88 } }, "totalDuration": 109183755259, "totalValidDurationSecs": 71695462, "totalHrs": 30328, "totalValidHrs": 19915 } ================================================ FILE: datasets/scripted-speech/cv-corpus-16.1-delta-2023-12-06.json ================================================ { "locales": { "ab": { "duration": 12852, "reportedSentences": 2, "clips": 5, "users": 1, "size": -1254393, "checksum": "75b295a55e2d1a581034d669f87504863ab36f64eac5dd4c7b954dca3ba356c7", "avgDurationSecs": 5.133, "validDurationSecs": 21.71, "totalHrs": 0, "validHrs": 0.01 }, "af": { "duration": 579960, "clips": 180, "users": 4, "size": 3383480, "checksum": "dbd9f394e2ed136fff6defba793f208993123e458949e72d581f453a27c5f702", "avgDurationSecs": 4.494, "validDurationSecs": 118.28, "totalHrs": 0.16, "validHrs": 0.03 }, "am": { "duration": 1094040, "reportedSentences": 10, "clips": 172, "users": 4, "size": 6458295, 
"checksum": "ad9d1a049ac9ead170b23ead481379bb685568982cbcbb176438327c1e95e4a6", "avgDurationSecs": 6.013, "validDurationSecs": 3607.15, "totalHrs": 0.31, "validHrs": 1 }, "ar": { "duration": 5485140, "reportedSentences": 9, "clips": 1584, "users": 56, "size": 29412183, "checksum": "0b86d4202269cd49a10289148cdc1cc435c8a11577754bd61424159f96d86a49", "avgDurationSecs": 4.159, "validDurationSecs": 273.26, "totalHrs": 1.52, "validHrs": 0.08 }, "as": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": -55176, "checksum": "e2ce69226877a00bac054299ae89086bbc3d1d22319885b35ef7b23f0baf8384", "avgDurationSecs": 5.69, "validDurationSecs": 5.69, "totalHrs": 0, "validHrs": 0 }, "ast": { "duration": 1583460, "clips": 382, "users": 12, "size": 9086616, "checksum": "1eea6ea231bd7e605f968d94259f6764aba50c30448798535e357dc4a147b763", "avgDurationSecs": 4.442, "validDurationSecs": 864.71, "totalHrs": 0.44, "validHrs": 0.24 }, "az": { "duration": 27000, "reportedSentences": 0, "clips": 5, "users": 1, "size": 149613, "checksum": "1a6653a0db625626e57be79f334359721a1ca93bcb054604dfcce2e38aa802a0", "avgDurationSecs": 5.382, "validDurationSecs": 69.99, "totalHrs": 0, "validHrs": 0.01 }, "ba": { "duration": 181656, "reportedSentences": 4, "clips": 29, "users": 3, "size": -3431888, "checksum": "4053dcdab1180c7cd76b4cb0bdcba6e5980f72bb23e2e9ea824bc36a78a86ea8", "avgDurationSecs": 4.426, "validDurationSecs": 750.67, "totalHrs": 0.05, "validHrs": 0.21 }, "bas": { "duration": 98640, "reportedSentences": 0, "clips": 25, "users": 3, "size": 455439, "checksum": "c9a63cae26034c6a7b88ca3a6af00b93761dcc4b0b830bc2b6ef079498ba434d", "avgDurationSecs": 4.426, "validDurationSecs": 17.26, "totalHrs": 0.03, "validHrs": 0 }, "be": { "duration": 223594452, "reportedSentences": 2, "clips": 47638, "users": 26, "size": 1275189471, "checksum": "bb3400374ced50d31e80fc60b519626b7318945429a3ce2f37c9de8c1c396d06", "avgDurationSecs": 4.76, "validDurationSecs": 222193.48, "totalHrs": 62.11, 
"validHrs": 61.72 }, "bg": { "duration": 1341360, "reportedSentences": 2, "clips": 234, "users": 11, "size": 7197139, "checksum": "87eedd9d81da709daaf29849857f776acb419b3f432a4b2959eb9fab87e3f18f", "avgDurationSecs": 5.475, "validDurationSecs": 2892.41, "totalHrs": 0.37, "validHrs": 0.8 }, "bn": { "duration": 5739696, "reportedSentences": 10, "clips": 1043, "users": 18, "size": 12477188, "checksum": "d69cbbb6b2adc1321f62915866fec29aee558d75f89227b683aa8952c9f39fda", "avgDurationSecs": 4.366, "validDurationSecs": 442.65, "totalHrs": 1.6, "validHrs": 0.12 }, "br": { "duration": 3620700, "reportedSentences": 1, "clips": 949, "users": 5, "size": 20620163, "checksum": "9dbebb34bd42711222ca5f656631f80623c74fea4414a967505a720900cf4b16", "avgDurationSecs": 3.177, "validDurationSecs": 4995.47, "totalHrs": 1.01, "validHrs": 1.38 }, "ca": { "duration": 618637428, "reportedSentences": 213, "clips": 129014, "users": 466, "size": 3398862019, "checksum": "398f00bc64d5639ca240cf16f0332c6837c514e7d943efe6963cb23b9fd5bd42", "avgDurationSecs": 5.342, "validDurationSecs": 415856.71, "totalHrs": 171.84, "validHrs": 115.52 }, "ckb": { "duration": 2544084, "reportedSentences": 5, "clips": 575, "users": 15, "size": 12375817, "checksum": "afca800223d23986703906c06a4d97ca47ced6631f8433be351e5f29c6d08589", "avgDurationSecs": 3.906, "validDurationSecs": 1307.43, "totalHrs": 0.71, "validHrs": 0.36 }, "cnh": { "duration": 1075824, "reportedSentences": 0, "clips": 385, "users": 1, "size": 6122276, "checksum": "2250cc743acdebc2c36662b15654ede4a8c6f49cc1a9c29e25b46daea2ec39f8", "avgDurationSecs": 3.516, "validDurationSecs": -117.66, "totalHrs": 0.3, "validHrs": -0.03 }, "cs": { "duration": 5555196, "reportedSentences": 27, "clips": 1165, "users": 26, "size": 26643567, "checksum": "ed322dbe00e72978ab50036c633c8412cdb6b0e452be2c3ab20c0d02bd3f124e", "avgDurationSecs": 4.463, "validDurationSecs": 5309.71, "totalHrs": 1.54, "validHrs": 1.48 }, "cv": { "duration": 0, "reportedSentences": 5, "clips": 0, 
"users": 0, "size": -512382, "checksum": "8fd80ff1433d8596a9107a00fe115207933d9aa08bb27faea974a40a3a459716", "avgDurationSecs": 5.059, "validDurationSecs": 20.24, "totalHrs": 0, "validHrs": 0.01 }, "cy": { "duration": 2692620, "reportedSentences": 0, "clips": 451, "users": 32, "size": 12513823, "checksum": "aac2d405054b9a7f819823027e2f5fb841247c0f51c583d08b29be1044ec8139", "avgDurationSecs": 4.893, "validDurationSecs": 1640.77, "totalHrs": 0.75, "validHrs": 0.46 }, "da": { "duration": 491868, "reportedSentences": 3, "clips": 149, "users": 11, "size": 2638413, "checksum": "067b5c6f6f2a57b5f77e331d3d5c480f341bc26184b22aa425cb5d1f7b34ba86", "avgDurationSecs": 4.122, "validDurationSecs": 818.97, "totalHrs": 0.14, "validHrs": 0.23 }, "de": { "duration": 54953412, "reportedSentences": 283, "clips": 9617, "users": 295, "size": 297671298, "checksum": "116913120c964ed66047734da598f65314255905541a6c760ffdf73e8a867683", "avgDurationSecs": 5.231, "validDurationSecs": 48652.52, "totalHrs": 15.27, "validHrs": 13.51 }, "dv": { "duration": 610704, "reportedSentences": 0, "clips": 99, "users": 6, "size": 2582409, "checksum": "cac3aed5224d5a1aae7f4ee45b96cdf10f3f6c94e0007bc0b1bbf747bd2b1b96", "avgDurationSecs": 5.062, "validDurationSecs": 410.44, "totalHrs": 0.17, "validHrs": 0.12 }, "dyu": { "duration": 1210968, "clips": 200, "users": 14, "size": 7105356, "checksum": "c71861ec35fa5e14186c55079cc0cbe5d7ea77ac7c07351aac832c993225ac55", "avgDurationSecs": 6.145, "validDurationSecs": 898.62, "totalHrs": 0.34, "validHrs": 0.25 }, "el": { "duration": 716652, "reportedSentences": 0, "clips": 140, "users": 12, "size": 3665745, "checksum": "eca2965788611993d5fd24a1d5cacb973c9a3912b328d8514c928a5111efdd6e", "avgDurationSecs": 4.139, "validDurationSecs": 1346.76, "totalHrs": 0.2, "validHrs": 0.37 }, "en": { "duration": 327663948, "reportedSentences": 697, "clips": 53654, "users": 1570, "size": 1829259099, "checksum": "9e05009435ce300b903a52c60f9368f376f17c30bf6f31e2405013a0feae4002", 
"avgDurationSecs": 5.221, "validDurationSecs": 194823.85, "totalHrs": 91.02, "validHrs": 54.12 }, "eo": { "duration": 14046084, "reportedSentences": 9, "clips": 2122, "users": 32, "size": 40901750, "checksum": "1241e1f0503fc713e3746bbf0207017007ef3d0deb73b0dfc640c07a9a59534c", "avgDurationSecs": 6.067, "validDurationSecs": 4634.18, "totalHrs": 3.9, "validHrs": 1.29 }, "es": { "duration": 62015580, "reportedSentences": 221, "clips": 11823, "users": 357, "size": 329581725, "checksum": "891ea511c25f0ad98212ff9fbcdd855aaf6fc826d645d226f8481f957edababb", "avgDurationSecs": 4.872, "validDurationSecs": 83624.99, "totalHrs": 17.23, "validHrs": 23.23 }, "et": { "duration": 11431008, "reportedSentences": 4, "clips": 1678, "users": 44, "size": 66322355, "checksum": "04ae74a77fee8189066eaf6e42dd73ae3912e18a2cd7d81fffb003f4ceba448d", "avgDurationSecs": 6.77, "validDurationSecs": 8239.29, "totalHrs": 3.17, "validHrs": 2.28 }, "eu": { "duration": 1527952956, "reportedSentences": 821, "clips": 274744, "users": 7664, "size": 8721676148, "checksum": "3f4410c994572fb4b14db1e789fa9940af8389721a9e454f5ee959abb7f2fedd", "avgDurationSecs": 5.467, "validDurationSecs": 413922.69, "totalHrs": 424.43, "validHrs": 114.98 }, "fa": { "duration": 16247952, "reportedSentences": 79, "clips": 4549, "users": 50, "size": 84657954, "checksum": "e6097496f734be8e1b23b86946616a56271cfea91f24c6ce13cf02dfa1181364", "avgDurationSecs": 3.987, "validDurationSecs": 3800.94, "totalHrs": 4.51, "validHrs": 1.06 }, "fi": { "duration": 1306872, "reportedSentences": 11, "clips": 238, "users": 3, "size": 7357980, "checksum": "e26b27458089f5e913d78a8065520122b87531e203bf1b24afca9cd21629d58d", "avgDurationSecs": 4.624, "validDurationSecs": 1404.12, "totalHrs": 0.37, "validHrs": 0.39 }, "fr": { "duration": 102566736, "reportedSentences": 158, "clips": 19602, "users": 576, "size": 577320085, "checksum": "f569408c5ba4edd7b3742bed38eb47c33185466839538a35bb499592d2cb2206", "avgDurationSecs": 5.014, "validDurationSecs": 
76097.57, "totalHrs": 28.49, "validHrs": 21.14 }, "fy-NL": { "duration": 2852604, "reportedSentences": 3, "clips": 526, "users": 15, "size": 10896440, "checksum": "c01c3db19fde2f524ffe766f1cdcea0e3cc7dde9110b10ca94ed6bbaa31389e5", "avgDurationSecs": 4.855, "validDurationSecs": 718.56, "totalHrs": 0.79, "validHrs": 0.2 }, "ga-IE": { "duration": 185580, "reportedSentences": 0, "clips": 36, "users": 1, "size": 888557, "checksum": "847f74a8a6145833f268b78191baf9b94c707b37a9b9c92c6ea0a90c57736c6a", "avgDurationSecs": 3.585, "validDurationSecs": 481.16, "totalHrs": 0.05, "validHrs": 0.13 }, "gl": { "duration": 11017116, "reportedSentences": 5, "clips": 2055, "users": 23, "size": 63839958, "checksum": "53c662f9c8ea5745883d42cbfc932a8f68dddc45ed2d20da8656083ae89c0340", "avgDurationSecs": 4.941, "validDurationSecs": 36617.23, "totalHrs": 3.06, "validHrs": 10.17 }, "gn": { "duration": 9965664, "reportedSentences": 0, "clips": 2058, "users": 24, "size": 55066045, "checksum": "99b716f4f717be49a093bb6a6b94e464025504a12e786c053e5dbbf355a0eb91", "avgDurationSecs": 4.541, "validDurationSecs": 235.45, "totalHrs": 2.77, "validHrs": 0.06 }, "ha": { "duration": 20880, "reportedSentences": 1, "clips": 5, "users": 1, "size": -106176, "checksum": "85a467b9aadc885ae477bdcdce981830f820059dffb3b501bba75f87e254c60d", "avgDurationSecs": 4.333, "validDurationSecs": -0.26, "totalHrs": 0.01, "validHrs": 0 }, "he": { "duration": 9460620, "reportedSentences": 9, "clips": 2192, "users": 8, "size": 54433082, "checksum": "77c240c1691ca7817999a68af7c25d40e223aef30507c2b245e3e5f673d8a00e", "avgDurationSecs": 4.485, "validDurationSecs": 4684.12, "totalHrs": 2.63, "validHrs": 1.3 }, "hi": { "duration": 285840, "reportedSentences": 2, "clips": 46, "users": 7, "size": 1315228, "checksum": "e398cd5ec446cd37cf80becffdf27f4a5ba21a4c1f0a4e3fa7e93ebac3edc5b0", "avgDurationSecs": 4.897, "validDurationSecs": 561.02, "totalHrs": 0.08, "validHrs": 0.16 }, "hsb": { "duration": 0, "reportedSentences": 0, "clips": 0, 
"users": 0, "size": -50854, "checksum": "deea4f8459d671a4fa2cd9e274a48e6096eb8d79c30b9dbb635bd6993d91e768", "avgDurationSecs": 6.16, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "hu": { "duration": 24088212, "reportedSentences": 30, "clips": 4128, "users": 24, "size": 138515572, "checksum": "3829fea5c2ca1f23c2e508e1805a960689b4b8b3aee8c30f575ff186a2fa13df", "avgDurationSecs": 5.523, "validDurationSecs": 14720.2, "totalHrs": 6.69, "validHrs": 4.08 }, "hy-AM": { "duration": 67842324, "reportedSentences": 43, "clips": 12824, "users": 152, "size": 391183489, "checksum": "2486b1cbf9348585d3ebd0eb2645fd107db0b8961ff5b4467f40b7394e201d35", "avgDurationSecs": 5.438, "validDurationSecs": 41733.81, "totalHrs": 18.85, "validHrs": 11.6 }, "ia": { "duration": 93492, "reportedSentences": 5, "clips": 18, "users": 0, "size": 259417, "checksum": "5df4475c33692db10164d35964a934777ae87e689f0cbed8abfe52f5e1ad68f7", "avgDurationSecs": 4.186, "validDurationSecs": 73.11, "totalHrs": 0.03, "validHrs": 0.02 }, "id": { "duration": 229824, "reportedSentences": 0, "clips": 50, "users": 8, "size": 190689, "checksum": "307dd20a49b03a2f702d9c43f74f1cf0d1f812039938472eebdaf3e5a285a1a2", "avgDurationSecs": 3.988, "validDurationSecs": 1217.99, "totalHrs": 0.06, "validHrs": 0.34 }, "ig": { "duration": 19080, "reportedSentences": 1, "clips": 5, "users": 1, "size": -14277, "checksum": "5f38b3cd766bf3c03e913c1426e96c8b0418151d12071ccc433dc5eb89cf0dbf", "avgDurationSecs": 5.462, "validDurationSecs": -0.02, "totalHrs": 0.01, "validHrs": 0 }, "is": { "duration": 0, "clips": 0, "users": 0, "size": -474, "checksum": "47a8c96c9e25b9a517a32521b8a7942cbc71c0105917743c6dad3ede0ee2a4f6", "avgDurationSecs": 7.334, "validDurationSecs": 36.67, "totalHrs": 0, "validHrs": 0.01 }, "it": { "duration": 19022112, "reportedSentences": 17, "clips": 3278, "users": 62, "size": 104933836, "checksum": "8d308ac010e04d1a4a2117ad71b6124ac0bcc63fa0aa7fa270ed326ec4af613f", "avgDurationSecs": 5.391, "validDurationSecs": 
12110.78, "totalHrs": 5.29, "validHrs": 3.36 }, "ja": { "duration": 818207244, "reportedSentences": 144, "clips": 165353, "users": 2054, "size": 4801558397, "checksum": "b9766d29f815cf10f4cf52b89f00a128ff78281539f5d79ea9d7468bdce46c6c", "avgDurationSecs": 4.874, "validDurationSecs": 157781.75, "totalHrs": 227.28, "validHrs": 43.83 }, "ka": { "duration": 29116224, "reportedSentences": 29, "clips": 5762, "users": 39, "size": 167801934, "checksum": "3c6bed106ca1d48baa16cfe716bb3d4d9389305f7f3a27ad2653ae628e3db745", "avgDurationSecs": 5.113, "validDurationSecs": 38145.19, "totalHrs": 8.09, "validHrs": 10.59 }, "kab": { "duration": 1989504, "reportedSentences": 66, "clips": 473, "users": 3, "size": -2395614, "checksum": "db6070a227b821a53f7ba415451633aec88bddd5d10d597bb75f81c329b67d99", "avgDurationSecs": 3.338, "validDurationSecs": 3694.92, "totalHrs": 0.55, "validHrs": 1.03 }, "kk": { "duration": 4955040, "reportedSentences": 0, "clips": 1018, "users": 64, "size": 27664921, "checksum": "a1ec1882a2e62bd0c3d633e8680a6997cabf38ae09c4a73f178bcccb4746d33e", "avgDurationSecs": 4.902, "validDurationSecs": 1484.45, "totalHrs": 1.38, "validHrs": 0.41 }, "kmr": { "duration": 12780, "reportedSentences": 6, "clips": 2, "users": 1, "size": -1513260, "checksum": "5491877fb356b2f86baf6db3b8b0a533a91dc7a64149bfd54d747d505ea08739", "avgDurationSecs": 4.056, "validDurationSecs": 217.79, "totalHrs": 0, "validHrs": 0.06 }, "ko": { "duration": 2279124, "reportedSentences": 2, "clips": 448, "users": 13, "size": 12970217, "checksum": "c4e46b3fe38a1d571034d802c6eca3401e4184170e203d76d6dfe879c88b140a", "avgDurationSecs": 5.962, "validDurationSecs": 1069.22, "totalHrs": 0.63, "validHrs": 0.3 }, "ky": { "duration": 10188, "reportedSentences": 0, "clips": 3, "users": 3, "size": -727726, "checksum": "d6b85f45214211de43d93137d1320298a217c4cfff3633b5e0df7fbedbcaf420", "avgDurationSecs": 4.542, "validDurationSecs": 115.25, "totalHrs": 0, "validHrs": 0.04 }, "lg": { "duration": 559008, 
"reportedSentences": 6, "clips": 187, "users": 5, "size": -4727180, "checksum": "5ca1d645bc91eda1280d374d3ef2d076c92c8b2fed21d32c08171767ac8ec6a4", "avgDurationSecs": 5.784, "validDurationSecs": -350.5, "totalHrs": 0.16, "validHrs": -0.1 }, "lo": { "duration": 9324, "clips": 2, "users": 1, "size": 48556, "checksum": "34eb512ed8595ec496d521f4babe2b3a11bdc59aeddc91e4358e48b52b097092", "avgDurationSecs": 6.461, "validDurationSecs": 23.87, "totalHrs": 0, "validHrs": 0.01 }, "lt": { "duration": 222228, "reportedSentences": 1, "clips": 49, "users": 2, "size": 848254, "checksum": "3c7402aa79fa7a2b154d86c5dcd2551ccb4aef0ac7475a56a04b79c7f0327dbb", "avgDurationSecs": 5.13, "validDurationSecs": 3267.37, "totalHrs": 0.06, "validHrs": 0.91 }, "lv": { "duration": 290993724, "reportedSentences": 210, "clips": 46864, "users": 2401, "size": 1653485114, "checksum": "4512ec1e3b4081345c179a776ac83621fdb856a56b46d3fc477d4e587a2e4c0b", "avgDurationSecs": 4.507, "validDurationSecs": 307217.41, "totalHrs": 80.83, "validHrs": 85.34 }, "mdf": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": -9491, "checksum": "9f396b9638e47fbdb73204b96ea5f40efaec63a46357ad278bc4d2991db30d2a", "avgDurationSecs": 5.25, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "mhr": { "duration": 115168284, "reportedSentences": 8, "clips": 24951, "users": 49, "size": 647780041, "checksum": "17c44ac5d50d5a24966f953ca8ed890e6bfbf594c22aeb59a2317003fd9ae33b", "avgDurationSecs": 4.642, "validDurationSecs": 228075.59, "totalHrs": 31.99, "validHrs": 63.35 }, "mk": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": -4367, "checksum": "33d54a41d83028c5079d7d7710f6f4df00559149af16cf5a94aa87ea16ba5830", "avgDurationSecs": 5.292, "validDurationSecs": 201.1, "totalHrs": 0, "validHrs": 0.06 }, "ml": { "duration": 2244888, "reportedSentences": 69, "clips": 502, "users": 3, "size": 10307616, "checksum": "2f1116c0b7bfe64fa5b642471ca604f2a5e83a23d72f62cc417110ae813938a1", 
"avgDurationSecs": 4.178, "validDurationSecs": 1395.74, "totalHrs": 0.63, "validHrs": 0.39 }, "mn": { "duration": 775836, "reportedSentences": 2, "clips": 131, "users": 9, "size": 3579108, "checksum": "d56927a5d8f12f4b2bbf09f0fffa34f850ea745f8165e45182df95804332408b", "avgDurationSecs": 5.448, "validDurationSecs": 718.78, "totalHrs": 0.21, "validHrs": 0.2 }, "mr": { "duration": 78732, "reportedSentences": 0, "clips": 15, "users": 2, "size": 7201, "checksum": "d8873646a9b9244df1d02f1a92416dafed2a1702cb5ef80bb6607a4f293c57c9", "avgDurationSecs": 6.194, "validDurationSecs": 151.39, "totalHrs": 0.02, "validHrs": 0.05 }, "mrj": { "duration": 11031048, "reportedSentences": 0, "clips": 2510, "users": 7, "size": 62310936, "checksum": "e4640ec0e451f3c7082b90bf3b32a3174bf3df292affcc7993676acce1631cf0", "avgDurationSecs": 4.177, "validDurationSecs": 42428.66, "totalHrs": 3.06, "validHrs": 11.78 }, "mt": { "duration": 482652, "reportedSentences": 0, "clips": 105, "users": 6, "size": 2542817, "checksum": "d960236c415f8b52afbc3cd63bbf1c7a8946e891027382c2a6dc8b7e405e9ce2", "avgDurationSecs": 4.74, "validDurationSecs": 182.18, "totalHrs": 0.14, "validHrs": 0.05 }, "myv": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": -57135, "checksum": "4e384078f54027ce68de05b8a3857cf837c3a53226547a5a50931ec2b307ca72", "avgDurationSecs": 5.774, "validDurationSecs": -11.55, "totalHrs": 0, "validHrs": -0.01 }, "nan-tw": { "duration": -339444, "reportedSentences": 4, "clips": 35, "users": 2, "size": -2324722, "checksum": "b6f26b01b6aab7b15a5074c810919d6a1064463a5f64731b3bf80ba10c947d10", "avgDurationSecs": 2.592, "validDurationSecs": 1436.19, "totalHrs": -0.1, "validHrs": 0.4 }, "ne-NP": { "duration": 1248948, "reportedSentences": 2, "clips": 309, "users": 3, "size": 7279003, "checksum": "00efc082beabd830e08435fe307198fc7d0491e7baa4d9908815296bc06a28fc", "avgDurationSecs": 3.897, "validDurationSecs": 1085.05, "totalHrs": 0.35, "validHrs": 0.3 }, "nl": { "duration": 2434536, 
"reportedSentences": 3, "clips": 492, "users": 18, "size": 12179434, "checksum": "f3decbcdbf9174e4753d57194b4f4bb5c4ddcee4648e2c28ae8734badcdf03c3", "avgDurationSecs": 4.351, "validDurationSecs": 2937.02, "totalHrs": 0.68, "validHrs": 0.82 }, "nn-NO": { "duration": 778644, "reportedSentences": 0, "clips": 197, "users": 3, "size": 4519855, "checksum": "78e5db87bb721b7083a2c6a8160dcf9504d0cf9e6b838ea2606647119bf0b48b", "avgDurationSecs": 4.364, "validDurationSecs": 386.78, "totalHrs": 0.21, "validHrs": 0.1 }, "oc": { "duration": 1164996, "reportedSentences": 0, "clips": 242, "users": 4, "size": 6654918, "checksum": "18342b17757903bef93be866f55486f9d09224cdc519b5a2ee1cf95352781ce9", "avgDurationSecs": 4.874, "validDurationSecs": 470.86, "totalHrs": 0.32, "validHrs": 0.13 }, "or": { "duration": 349128, "reportedSentences": 0, "clips": 59, "users": 0, "size": 1831743, "checksum": "6bb8a1b8910cb76718d192edb70b0a19b711d4abf296b29d9d576d779a6cdc40", "avgDurationSecs": 5.01, "validDurationSecs": 209.22, "totalHrs": 0.1, "validHrs": 0.06 }, "pa-IN": { "duration": 66096, "reportedSentences": 7, "clips": 11, "users": 3, "size": 247438, "checksum": "523346527290a3cb533758ee87195d4917efc50e5f98b2fe1f74a9a4a1e4ee61", "avgDurationSecs": 4.828, "validDurationSecs": 50.01, "totalHrs": 0.02, "validHrs": 0.01 }, "pl": { "duration": 6289884, "reportedSentences": 1, "clips": 1168, "users": 34, "size": 33883808, "checksum": "dab35f907225411d0fa76348f0195a81ee182a5acc596302790d6dcb139a35e8", "avgDurationSecs": 4.518, "validDurationSecs": 6739.93, "totalHrs": 1.75, "validHrs": 1.87 }, "ps": { "duration": 212940, "reportedSentences": 0, "clips": 35, "users": 1, "size": 1227028, "checksum": "bcf66e5afd49f13a0a8fdc950857ed3938544ffc28f8ac4c86eabba89cf15b55", "avgDurationSecs": 4.878, "validDurationSecs": 51.55, "totalHrs": 0.05, "validHrs": 0.01 }, "pt": { "duration": 14995584, "reportedSentences": 157, "clips": 3424, "users": 90, "size": 84282515, "checksum": 
"36145ed88edaab9872f447bca5d7821fddcb2e33669a3f77fba1080c5f083d60", "avgDurationSecs": 4.095, "validDurationSecs": 15928.27, "totalHrs": 4.17, "validHrs": 4.42 }, "quy": { "duration": 0, "clips": 0, "users": 0, "size": -254, "checksum": "e77e3c7560ef638724fb3989c182eee1cd17ab54266131a5d97cdd2f6a39285f", "avgDurationSecs": 6.132, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "rm-sursilv": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": -162718, "checksum": "f126f17c134b15575104bbe4c4e69637c47a476337798221da13e4df2d47bd61", "avgDurationSecs": 5.464, "validDurationSecs": 136.6, "totalHrs": 0, "validHrs": 0.04 }, "rm-vallader": { "duration": 0, "reportedSentences": 2, "clips": 0, "users": 0, "size": -60446, "checksum": "50401449141060d92d5b59bb0ea6fbe56bc11b948a2dfe510c211450d2d9b027", "avgDurationSecs": 5.826, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ro": { "duration": 9045612, "reportedSentences": 15, "clips": 1988, "users": 8, "size": 52212949, "checksum": "256570f386506ed8fd48c730460f32f6354ed6a9f59695259d1d336183e1db5c", "avgDurationSecs": 4.032, "validDurationSecs": 4007.6, "totalHrs": 2.51, "validHrs": 1.11 }, "ru": { "duration": 17772876, "reportedSentences": 24, "clips": 3333, "users": 73, "size": 100275580, "checksum": "ce8fd4aa18900b701b09304722fddedd9aa98152e0ffe11f99ca2eb4986e39ee", "avgDurationSecs": 5.163, "validDurationSecs": 19169.1, "totalHrs": 4.93, "validHrs": 5.33 }, "rw": { "duration": 206748, "reportedSentences": 0, "clips": 43, "users": 8, "size": -34195966, "checksum": "2908446346f1bf745a0f658ba7ee43ffb85628e058877c66959c8ff607ead7de", "avgDurationSecs": 5.008, "validDurationSecs": 228.15, "totalHrs": 0.06, "validHrs": 0.06 }, "sah": { "duration": 15149340, "reportedSentences": 2, "clips": 2399, "users": 42, "size": 88499599, "checksum": "5bdf8918d3a9a8f13ca809b918ef032e15e9129483644a81970b3451216e5435", "avgDurationSecs": 6.12, "validDurationSecs": 5403.59, "totalHrs": 4.2, "validHrs": 1.5 }, 
"sat": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": -18668, "checksum": "058c7f26cf7a3a9f477c20ca2857a4ac6ce036a41c93b2692ce2e80b5d20e73f", "avgDurationSecs": 4.462, "validDurationSecs": 4.46, "totalHrs": 0, "validHrs": 0 }, "sc": { "duration": 574128, "reportedSentences": 0, "clips": 120, "users": 0, "size": 3306145, "checksum": "294641d815d9ba96bf7e5bc81fbbd33191c51c5cc1e2a45f55da9bceff40d3b6", "avgDurationSecs": 4.51, "validDurationSecs": 413.48, "totalHrs": 0.16, "validHrs": 0.12 }, "sk": { "duration": 461052, "reportedSentences": 4, "clips": 110, "users": 7, "size": 2287395, "checksum": "345581c840dc4c85d04edde224121c20c53ea01640f5038e0a74df2d7e5a69e9", "avgDurationSecs": 4.077, "validDurationSecs": 393.69, "totalHrs": 0.13, "validHrs": 0.11 }, "skr": { "duration": 208404, "reportedSentences": 15, "clips": 44, "users": 3, "size": 1135434, "checksum": "5aa6803cb99ab58cfdf5b88e8552310176e921cc904f30ada3ca18794f54970f", "avgDurationSecs": 4.141, "validDurationSecs": 471.84, "totalHrs": 0.06, "validHrs": 0.13 }, "sl": { "duration": 5795892, "reportedSentences": 10, "clips": 1516, "users": 3, "size": 33578034, "checksum": "4e633c3bbcb352824c8a831ce228237a5c4388f39bd707eec52279fc8c12029d", "avgDurationSecs": 3.785, "validDurationSecs": 2699.59, "totalHrs": 1.61, "validHrs": 0.75 }, "sq": { "duration": 1146996, "reportedSentences": 2, "clips": 271, "users": 7, "size": 6668063, "checksum": "e832d32644124ee70c4be1f57b7a2c61ec9e3219702252a535f5b4bd54fb4032", "avgDurationSecs": 4.522, "validDurationSecs": 1166.58, "totalHrs": 0.32, "validHrs": 0.33 }, "sr": { "duration": 4138164, "reportedSentences": 23, "clips": 1201, "users": 5, "size": 23923304, "checksum": "38a33b4e344beaf6c4b79ac6a7d5b0d2ea4b1bffa949b33a0c8d50d9d71b8748", "avgDurationSecs": 2.977, "validDurationSecs": 3425.73, "totalHrs": 1.15, "validHrs": 0.96 }, "sv-SE": { "duration": 1396224, "reportedSentences": 7, "clips": 274, "users": 15, "size": 7216910, "checksum": 
"26fdd5e3a7a006230f2445da5fc780f9f1088f9ce9e7a37d3ef4c332ff8bd879", "avgDurationSecs": 4, "validDurationSecs": 2069.19, "totalHrs": 0.39, "validHrs": 0.58 }, "sw": { "duration": 533708604, "reportedSentences": 27, "clips": 85855, "users": 198, "size": 3080505569, "checksum": "baa58dc2a736c184e9ae9fb9f9bae4b416a4e9869999e8ec737b935fef6a59a0", "avgDurationSecs": 5.387, "validDurationSecs": 101892.91, "totalHrs": 148.25, "validHrs": 28.31 }, "ta": { "duration": 2669076, "reportedSentences": 1, "clips": 523, "users": 20, "size": 8663775, "checksum": "73022103c73d2727cb09db863c8eaa36aa99a78ac8b6f2c79b7f6f57468c0ab5", "avgDurationSecs": 6.214, "validDurationSecs": 6681.48, "totalHrs": 0.74, "validHrs": 1.86 }, "th": { "duration": 3318804, "reportedSentences": 22, "clips": 754, "users": 21, "size": 12441309, "checksum": "584bbbfd6ba5a4aaba5c9b002341cb02015f4b5f05849951a7271769337c8bda", "avgDurationSecs": 4.192, "validDurationSecs": 2952.09, "totalHrs": 0.92, "validHrs": 0.82 }, "ti": { "duration": 245628, "reportedSentences": 1, "clips": 46, "users": 2, "size": 1390948, "checksum": "7a1b466a2bf47fde2474c94ce89b5f0d8deff099445ac32cf3a626f0b0b8048b", "avgDurationSecs": 5.486, "validDurationSecs": -0.95, "totalHrs": 0.07, "validHrs": 0 }, "tig": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": -761, "checksum": "26a4c1183631c7ec7a9fa3a1370c8ea72f3b0d3e5e5bd7c0b8ebeda33f755492", "avgDurationSecs": 5.048, "validDurationSecs": 1277.05, "totalHrs": 0, "validHrs": 0.36 }, "tk": { "duration": 303444, "reportedSentences": 0, "clips": 55, "users": 3, "size": 1538409, "checksum": "39e85dbb3a2598ff9d54a27099bd21e669e34433767a3c7ab15e750883829c49", "avgDurationSecs": 5.386, "validDurationSecs": 143.62, "totalHrs": 0.09, "validHrs": 0.04 }, "tok": { "duration": 5124744, "reportedSentences": 0, "clips": 939, "users": 3, "size": 29857929, "checksum": "486b1e6ef24db3bc3805ebb1e3e110ece5e87cfd6d234cd4057f55495ddc069b", "avgDurationSecs": 4.055, "validDurationSecs": 
3854.56, "totalHrs": 1.42, "validHrs": 1.07 }, "tr": { "duration": 8858412, "reportedSentences": 4, "clips": 1675, "users": 31, "size": 50182565, "checksum": "3e8eb99bfa0d046fe2170bf4540721aa6666f7372a1855b5b1ff8dc7f19987b7", "avgDurationSecs": 3.643, "validDurationSecs": 8777.89, "totalHrs": 2.46, "validHrs": 2.44 }, "tt": { "duration": 713700, "reportedSentences": 0, "clips": 163, "users": 5, "size": 3381463, "checksum": "7671166bb8e60ca4146598abaf0e2b17652eec288299623831cb29d97ea21e67", "avgDurationSecs": 3.765, "validDurationSecs": 264.23, "totalHrs": 0.19, "validHrs": 0.08 }, "tw": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": -4413, "checksum": "7fbf52c157d949ece8fe1fbc079a53afb173aeeef6f8090cbc6065694a2eafeb", "avgDurationSecs": 3.983, "validDurationSecs": 31.86, "totalHrs": 0, "validHrs": 0.01 }, "ug": { "duration": 156001248, "reportedSentences": 275, "clips": 26510, "users": 98, "size": 868038357, "checksum": "ccfd40ab2d82e78cb742d846e90e3c2d8562df3be590dbe1653027e956f915fe", "avgDurationSecs": 6.002, "validDurationSecs": 68005.04, "totalHrs": 43.33, "validHrs": 18.89 }, "uk": { "duration": 10047492, "reportedSentences": 1, "clips": 2297, "users": 33, "size": 55202099, "checksum": "c825357be6042d21a9d6862218c84af5b05b2dbca45a196703d8334647ab6dd6", "avgDurationSecs": 4.67, "validDurationSecs": 9853.75, "totalHrs": 2.79, "validHrs": 2.74 }, "ur": { "duration": 40865472, "reportedSentences": 1, "clips": 8454, "users": 26, "size": 235763896, "checksum": "9c03c3b2d2e419cab8a5287be7fa6c49abb663cd7ef79d574c38a64160c4bad0", "avgDurationSecs": 4.245, "validDurationSecs": 2475.26, "totalHrs": 11.35, "validHrs": 0.68 }, "uz": { "duration": 1215864, "reportedSentences": 22, "clips": 210, "users": 10, "size": 2471583, "checksum": "09162fb6ad3a0090c3a8f27f8bc7d8623e68adc97a0f57bc28e6c28df953c420", "avgDurationSecs": 4.149, "validDurationSecs": 690.46, "totalHrs": 0.34, "validHrs": 0.19 }, "vi": { "duration": 534996, "reportedSentences": 2, 
"clips": 147, "users": 7, "size": 2377718, "checksum": "5deaf435f01299fddd361c9a423109a008f083258a20f9553c470bc7bb6338e3", "avgDurationSecs": 3.962, "validDurationSecs": 307.16, "totalHrs": 0.14, "validHrs": 0.08 }, "vot": { "duration": 3996, "clips": 1, "users": 1, "size": 17876, "checksum": "0ad63ec0575865d5919fb9a2c89c7b37f3ef8979a1f533a6fdbdc3d55fc059ee", "avgDurationSecs": 2.412, "validDurationSecs": 0.38, "totalHrs": 0, "validHrs": 0 }, "yo": { "duration": 51192, "reportedSentences": 1, "clips": 17, "users": 2, "size": 199068, "checksum": "826acdc2c50a2b9287108dba81d3d5b899fc66c3d81a20af278e03fe584da32e", "avgDurationSecs": 5.936, "validDurationSecs": 12.85, "totalHrs": 0.01, "validHrs": 0.01 }, "yue": { "duration": 326460852, "reportedSentences": 510, "clips": 87327, "users": 32, "size": 1900636425, "checksum": "d4860d2f805cd30623f63fa93f58b39c2337676aabad48c9a30c638ee32460b4", "avgDurationSecs": 3.877, "validDurationSecs": -2209.17, "totalHrs": 90.68, "validHrs": -0.61 }, "zgh": { "duration": 0, "clips": 0, "users": 0, "size": -2723, "checksum": "35578b18ea2dfc8555707bd9d6f36dd6f35c2ca4e6148f61922daf456bf55bf5", "avgDurationSecs": 3.306, "validDurationSecs": 59.51, "totalHrs": 0, "validHrs": 0.02 }, "zh-CN": { "duration": 20281500, "reportedSentences": 66, "clips": 4477, "users": 115, "size": 102124504, "checksum": "0083247487c89fa1ba7841c674bbe145c5bb673a8474c33acae41a7244b0db59", "avgDurationSecs": 4.548, "validDurationSecs": 5599.33, "totalHrs": 5.64, "validHrs": 1.56 }, "zh-HK": { "duration": 590724, "reportedSentences": 13, "clips": 218, "users": 13, "size": 909950, "checksum": "ea5415ea0e947c3f6c867893d5f9bc2a8f39f0a7fb84f848f4df54b005886e24", "avgDurationSecs": 4.177, "validDurationSecs": 377.94, "totalHrs": 0.16, "validHrs": 0.11 }, "zh-TW": { "duration": 2695356, "reportedSentences": 6, "clips": 861, "users": 13, "size": 11348215, "checksum": "f09cdb68d303edc68dde1f984ecbe28c997ee92e9ae2f470204d8745f6846d7c", "avgDurationSecs": 3.363, 
"validDurationSecs": 1484.81, "totalHrs": 0.75, "validHrs": 0.41 } }, "totalDuration": 5680420368, "totalValidDurationSecs": 2721187, "totalHrs": 1578, "totalValidHrs": 756 } ================================================ FILE: datasets/scripted-speech/cv-corpus-17.0-2024-03-15.json ================================================ { "locales": { "ab": { "buckets": { "validated": 41992, "invalidated": 5279, "dev": 9160, "test": 9117, "train": 21027, "other": 11939 }, "duration": 303891084, "reportedSentences": 225, "validatedSentences": 1046776, "unvalidatedSentences": 5, "clips": 59210, "splits": { "accent": {}, "age": { "": 0.18, "twenties": 0.18, "thirties": 0.13, "teens": 0.28, "fourties": 0.09, "fifties": 0.06, "sixties": 0.05, "seventies": 0.01, "eighties": 0.01, "nineties": 0 }, "gender": { "": 0.18, "male_masculine": 0.18, "female_feminine": 0.64, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 59210, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 403, "size": 1814691581, "checksum": "5f2a3edff62a2cb20fbde11605b86124a8da9263e435cfe85506147efc05d8a2", "avgDurationSecs": 5.132, "validDurationSecs": 215520.932, "totalHrs": 84.41, "validHrs": 59.86 }, "af": { "buckets": { "validated": 198, "invalidated": 192, "dev": 54, "test": 62, "train": 82, "other": 15 }, "duration": 2027340, "reportedSentences": 0, "validatedSentences": 4729, "unvalidatedSentences": 16529, "clips": 405, "splits": { "accent": {}, "age": { "": 0.27, "twenties": 0.54, "thirties": 0, "teens": 0.01, "fourties": 0.17, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.46, "male_masculine": 0.33, "female_feminine": 0.21, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, 
"sentence_domain": { "": 405, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 23, "size": 13416190, "checksum": "ff5238c18b85e7cb60eae6de24917509f0cc61b164299141f80740df4c574264", "avgDurationSecs": 5.006, "validDurationSecs": 991.144, "totalHrs": 0.56, "validHrs": 0.27 }, "am": { "buckets": { "validated": 903, "invalidated": 25, "dev": 248, "test": 205, "train": 450, "other": 600 }, "duration": 9472104, "reportedSentences": 11, "validatedSentences": 2198, "unvalidatedSentences": 33, "clips": 1528, "splits": { "accent": {}, "age": { "": 0.21, "twenties": 0.15, "thirties": 0.64, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.21, "male_masculine": 0.79, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1528, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 30, "size": 55918089, "checksum": "745ac58bd4dbe63a3d5f0c48f5d7a04af8cb227c1e197c28c7351134d3ed9118", "avgDurationSecs": 6.199, "validDurationSecs": 5597.716, "totalHrs": 2.63, "validHrs": 1.55 }, "ar": { "buckets": { "validated": 78156, "invalidated": 15108, "dev": 10470, "test": 10480, "train": 28369, "other": 41628 }, "duration": 560925018, "reportedSentences": 2124, "validatedSentences": 59398, "unvalidatedSentences": 1742086, "clips": 134892, "splits": { "accent": {}, "age": { "": 0.55, "twenties": 0.3, "thirties": 0.11, "teens": 0.03, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 
}, "gender": { "": 0.55, "male_masculine": 0.28, "female_feminine": 0.17, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 134892, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1561, "size": 3454615354, "checksum": "1c40e8c30dd832aa62876f222ed53be556d29c62501e2e56b6559c7790de06c7", "avgDurationSecs": 4.158, "validDurationSecs": 324998.189, "totalHrs": 155.81, "validHrs": 90.27 }, "as": { "buckets": { "validated": 1726, "invalidated": 210, "dev": 511, "test": 551, "train": 664, "other": 136 }, "duration": 11790785, "reportedSentences": 10, "validatedSentences": 5184, "unvalidatedSentences": 3926, "clips": 2072, "splits": { "accent": {}, "age": { "": 0.6, "twenties": 0.36, "thirties": 0.04, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.59, "male_masculine": 0.41, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2072, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 46, "size": 74511557, "checksum": "a8d2e2951ae5ddee65d0af581c37e7671932edae31f9309e04942e5bbe01ec4c", "avgDurationSecs": 5.691, "validDurationSecs": 9821.86, "totalHrs": 3.27, "validHrs": 2.72 }, "ast": { "buckets": { "validated": 663, "invalidated": 16, "dev": 112, "test": 162, "train": 387, "other": 865 }, "duration": 6869952, "reportedSentences": 0, "validatedSentences": 1642, "unvalidatedSentences": 290, "clips": 1544, "splits": { "accent": {}, "age": { "": 0.28, "twenties": 
0.02, "thirties": 0.13, "teens": 0.04, "fourties": 0.3, "fifties": 0.22, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.28, "male_masculine": 0.56, "female_feminine": 0.16, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1544, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 29, "size": 39093221, "checksum": "00a3a29f183e688a7b7205d1e4d9a748cb0e3216f0f989bffd5b04e8c225bc43", "avgDurationSecs": 4.449, "validDurationSecs": 2949.986, "totalHrs": 1.9, "validHrs": 0.81 }, "az": { "buckets": { "validated": 130, "invalidated": 43, "dev": 32, "test": 33, "train": 65, "other": 127 }, "duration": 1614492, "reportedSentences": 5, "validatedSentences": 93154, "unvalidatedSentences": 1855, "clips": 300, "splits": { "accent": {}, "age": { "": 0.44, "twenties": 0.34, "thirties": 0.21, "teens": 0, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.44, "male_masculine": 0.56, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 300, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 26, "size": 15633400, "checksum": "81211d60ab76bd4af9aa494bc9d2c2bb6c4d7f988d282f89b53ad144ec30247b", "avgDurationSecs": 5.382, "validDurationSecs": 699.613, "totalHrs": 0.44, "validHrs": 0.19 }, "ba": { "buckets": { "validated": 209653, "invalidated": 7960, "dev": 14495, "test": 14513, "train": 119180, "other": 493 }, "duration": 965409948, "reportedSentences": 867, 
"validatedSentences": 153957, "unvalidatedSentences": 16, "clips": 218106, "splits": { "accent": {}, "age": { "": 0.3, "twenties": 0.17, "thirties": 0.17, "teens": 0.04, "fourties": 0.06, "fifties": 0.05, "sixties": 0.19, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male_masculine": 0.3, "female_feminine": 0.39, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 218106, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 917, "size": 5439825234, "checksum": "8371fc0b4b73c73a509a05d285ef96533112d9770850ad5fbaf9c026e04318b0", "avgDurationSecs": 4.426, "validDurationSecs": 927994.149, "totalHrs": 268.16, "validHrs": 257.77 }, "bas": { "buckets": { "validated": 1761, "invalidated": 515, "dev": 457, "test": 541, "train": 763, "other": 15 }, "duration": 10139220, "reportedSentences": 8, "validatedSentences": 5015, "unvalidatedSentences": 245, "clips": 2291, "splits": { "accent": {}, "age": { "": 0.98, "twenties": 0.01, "thirties": 0, "teens": 0.01, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.98, "male_masculine": 0.01, "female_feminine": 0.02, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2291, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 36, "size": 56770508, "checksum": "d6ce0866a109310c58db0c99f5172e0e7abf4a9bedfb144f5e6a12a508667e16", "avgDurationSecs": 4.426, "validDurationSecs": 7793.613, "totalHrs": 2.81, "validHrs": 2.16 }, "be": { "buckets": 
{ "validated": 1298295, "invalidated": 35634, "dev": 15880, "test": 15878, "train": 347637, "other": 736 }, "duration": 6354908712, "reportedSentences": 3143, "validatedSentences": 379506, "unvalidatedSentences": 9, "clips": 1334665, "splits": { "accent": {}, "age": { "": 0.87, "twenties": 0.03, "thirties": 0.05, "teens": 0.01, "fourties": 0.04, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.86, "male_masculine": 0.06, "female_feminine": 0.08, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1334665, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 8291, "size": 36212531173, "checksum": "21ea3e18b9bfd5f7cf70b615867f06a84e77768819690f0db6ae0aa86cdc70d1", "avgDurationSecs": 4.761, "validDurationSecs": 6181735.646, "totalHrs": 1765.25, "validHrs": 1717.14 }, "bg": { "buckets": { "validated": 10832, "invalidated": 746, "dev": 2766, "test": 3201, "train": 4849, "other": 2087 }, "duration": 74771424, "reportedSentences": 187, "validatedSentences": 39570, "unvalidatedSentences": 3, "clips": 13665, "splits": { "accent": {}, "age": { "": 0.37, "twenties": 0.11, "thirties": 0.28, "teens": 0, "fourties": 0.23, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male_masculine": 0.59, "female_feminine": 0.04, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 13665, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 134, "size": 440592978, "checksum": 
"36bfe12cfbe90f30b301acaa1bc19b12cc212095e6edecff6d1f64f26351fc35", "avgDurationSecs": 5.472, "validDurationSecs": 59269.964, "totalHrs": 20.76, "validHrs": 16.46 }, "bn": { "buckets": { "validated": 44121, "invalidated": 7811, "dev": 9327, "test": 9327, "train": 21228, "other": 997561 }, "duration": 4582608948, "reportedSentences": 1888, "validatedSentences": 562724, "unvalidatedSentences": 94216, "clips": 1049493, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0.67, "thirties": 0.03, "teens": 0.07, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.22, "male_masculine": 0.54, "female_feminine": 0.23, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1049493, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 22913, "size": 26481033802, "checksum": "add341d2ad3392a56c71a429bace4c6ea658e4d77a2e82664d5795ad5d54814b", "avgDurationSecs": 4.366, "validDurationSecs": 192654.252, "totalHrs": 1272.94, "validHrs": 53.51 }, "br": { "buckets": { "validated": 21007, "invalidated": 1364, "dev": 2253, "test": 2212, "train": 2663, "other": 8037 }, "duration": 96715555, "reportedSentences": 285, "validatedSentences": 7722, "unvalidatedSentences": 78, "clips": 30408, "splits": { "accent": {}, "age": { "": 0.29, "twenties": 0.32, "thirties": 0.09, "teens": 0.01, "fourties": 0.06, "fifties": 0.06, "sixties": 0.14, "seventies": 0.04, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male_masculine": 0.67, "female_feminine": 0.03, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 30408, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, 
"history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 207, "size": 675135478, "checksum": "8aeb892de3b66841a700ee9744d40c5931c1a8812add8c93c52b69f620561949", "avgDurationSecs": 3.181, "validDurationSecs": 66814.775, "totalHrs": 26.86, "validHrs": 18.55 }, "ca": { "buckets": { "validated": 1824534, "invalidated": 110738, "dev": 16402, "test": 16402, "train": 1146209, "other": 490342 }, "duration": 12911554530, "reportedSentences": 6489, "validatedSentences": 1213896, "unvalidatedSentences": 332, "clips": 2425614, "splits": { "accent": {}, "age": { "": 0.29, "twenties": 0.04, "thirties": 0.05, "teens": 0.01, "fourties": 0.12, "fifties": 0.17, "sixties": 0.28, "seventies": 0.03, "eighties": 0, "nineties": 0 }, "gender": { "": 0.29, "male_masculine": 0.51, "female_feminine": 0.2, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2425393, "agriculture": 4, "automotive": 2, "finance": 0, "food_service_retail": 26, "general": 86, "healthcare": 6, "history_law_government": 15, "language_fundamentals": 6, "media_entertainment": 14, "nature_environment": 58, "news_current_affairs": 4, "technology_robotics": 0 } }, "users": 35751, "size": 77866036725, "checksum": "2807fcd5a34287f4cb4370d13308fe6940c104a9f47513eec79d7702b3a1ddfb", "avgDurationSecs": 5.323, "validDurationSecs": 9712002.913, "totalHrs": 3586.54, "validHrs": 2697.77 }, "ckb": { "buckets": { "validated": 116429, "invalidated": 9038, "dev": 5263, "test": 5263, "train": 7699, "other": 28471 }, "duration": 619783488, "reportedSentences": 2452, "validatedSentences": 18499, "unvalidatedSentences": 15789, "clips": 153938, "splits": { "accent": {}, "age": { "": 0.3, "twenties": 0.44, "thirties": 0.1, "teens": 0.02, "fourties": 0.02, "fifties": 0.01, "sixties": 0.11, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.29, "male_masculine": 0.66, 
"female_feminine": 0.06, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 153854, "agriculture": 0, "automotive": 9, "finance": 0, "food_service_retail": 0, "general": 66, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 9, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1727, "size": 3409668697, "checksum": "e0ff2ba2aebf0b862978e11cb4450ed4c92112f9bcae2785ddfa69efe8eb8309", "avgDurationSecs": 4.026, "validDurationSecs": 468765.163, "totalHrs": 172.16, "validHrs": 130.21 }, "cnh": { "buckets": { "validated": 2458, "invalidated": 437, "dev": 761, "test": 763, "train": 817, "other": 3292 }, "duration": 21751656, "reportedSentences": 9, "validatedSentences": 5218, "unvalidatedSentences": -1, "clips": 6187, "splits": { "accent": {}, "age": { "": 0.48, "twenties": 0.4, "thirties": 0.07, "teens": 0.01, "fourties": 0.01, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.48, "male_masculine": 0.3, "female_feminine": 0.21, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6187, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 300, "size": 168190983, "checksum": "55bbb3df292c725f8f1eaeb0e885954babdfa88d4f96bac9b0f7bce790b81321", "avgDurationSecs": 3.516, "validDurationSecs": 8641.599, "totalHrs": 6.04, "validHrs": 2.4 }, "cs": { "buckets": { "validated": 61391, "invalidated": 2213, "dev": 9009, "test": 9067, "train": 20144, "other": 148316 }, "duration": 945609159, "reportedSentences": 933, "validatedSentences": 451042, "unvalidatedSentences": 57, "clips": 211920, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0.19, "thirties": 
0.47, "teens": 0.02, "fourties": 0.07, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.23, "male_masculine": 0.54, "female_feminine": 0.22, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 211920, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 983, "size": 5825442364, "checksum": "7c854be87bd0948a7ef0e000676e2cf715c470af89265b7f75c4f4671501824b", "avgDurationSecs": 4.462, "validDurationSecs": 273933.05, "totalHrs": 262.66, "validHrs": 76.09 }, "cv": { "buckets": { "validated": 17339, "invalidated": 2139, "dev": 1253, "test": 1290, "train": 1443, "other": 99 }, "duration": 99044076, "reportedSentences": 144, "validatedSentences": 4378, "unvalidatedSentences": 4, "clips": 19577, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0.51, "thirties": 0.01, "teens": 0.19, "fourties": 0.06, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2, "male_masculine": 0.54, "female_feminine": 0.27, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 19577, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 112, "size": 673869390, "checksum": "264dfca45a352803f31c540010c26d3cb33bb3576da92f7d22be864a43e1c7a9", "avgDurationSecs": 5.059, "validDurationSecs": 87721.573, "totalHrs": 27.51, "validHrs": 24.36 }, "cy": { "buckets": { "validated": 90369, "invalidated": 4449, "dev": 5371, "test": 5379, "train": 7960, "other": 20145 }, "duration": 563690421, 
"reportedSentences": 172, "validatedSentences": 118032, "unvalidatedSentences": 24, "clips": 114963, "splits": { "accent": {}, "age": { "": 0.43, "twenties": 0.13, "thirties": 0.09, "teens": 0.02, "fourties": 0.16, "fifties": 0.09, "sixties": 0.07, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.43, "male_masculine": 0.33, "female_feminine": 0.25, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 114963, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1884, "size": 4120164501, "checksum": "c17b54ba654ee667e6dfa8fba4c7b00b600c8e6f80a0bff94897ddf11ce70433", "avgDurationSecs": 4.903, "validDurationSecs": 443100.299, "totalHrs": 156.58, "validHrs": 123.08 }, "da": { "buckets": { "validated": 10225, "invalidated": 404, "dev": 2105, "test": 2530, "train": 3484, "other": 396 }, "duration": 45373392, "reportedSentences": 422, "validatedSentences": 20437, "unvalidatedSentences": 13, "clips": 11025, "splits": { "accent": {}, "age": { "": 0.32, "twenties": 0.18, "thirties": 0.25, "teens": 0.01, "fourties": 0.2, "fifties": 0.03, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.32, "male_masculine": 0.6, "female_feminine": 0.08, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 11025, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 259, "size": 262637042, "checksum": "5bb5816e18046fdd1cb7ab2d4c8091f1aeb7e7e8f63211396e1775f025bb9473", "avgDurationSecs": 4.116, "validDurationSecs": 42080.992, 
"totalHrs": 12.6, "validHrs": 11.68 }, "de": { "buckets": { "validated": 917350, "invalidated": 54751, "dev": 16183, "test": 16183, "train": 589100, "other": 7116 }, "duration": 5126033069, "reportedSentences": 10702, "validatedSentences": 2056515, "unvalidatedSentences": 168, "clips": 979217, "splits": { "accent": {}, "age": { "": 0.33, "twenties": 0.18, "thirties": 0.16, "teens": 0.03, "fourties": 0.17, "fifties": 0.11, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0.59, "female_feminine": 0.08, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 979177, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 5, "general": 26, "healthcare": 3, "history_law_government": 2, "language_fundamentals": 1, "media_entertainment": 0, "nature_environment": 2, "news_current_affairs": 1, "technology_robotics": 0 } }, "users": 19042, "size": 35836600875, "checksum": "aa9375ad31bdde6bc4729968f94f82e2ca01fc1a74be67dc3e48b1476514fb14", "avgDurationSecs": 5.235, "validDurationSecs": 4802169.934, "totalHrs": 1423.89, "validHrs": 1333.93 }, "dv": { "buckets": { "validated": 27640, "invalidated": 1707, "dev": 2213, "test": 2215, "train": 2689, "other": 16312 }, "duration": 231166251, "reportedSentences": 56, "validatedSentences": 7139, "unvalidatedSentences": 32, "clips": 45659, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0.17, "thirties": 0.36, "teens": 0.01, "fourties": 0.21, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.21, "male_masculine": 0.29, "female_feminine": 0.5, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 45659, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, 
"technology_robotics": 0 } }, "users": 357, "size": 1465001071, "checksum": "e02abcbd77ecce584dec3fb827860eeb0a2ac1015274821d601d4b3d99c4bed4", "avgDurationSecs": 5.063, "validDurationSecs": 139938.132, "totalHrs": 64.21, "validHrs": 38.87 }, "dyu": { "buckets": { "validated": 192, "invalidated": 11, "dev": 50, "test": 59, "train": 76, "other": 80 }, "duration": 1748772, "reportedSentences": 0, "validatedSentences": 5028, "unvalidatedSentences": 49, "clips": 283, "splits": { "accent": {}, "age": { "": 0.29, "twenties": 0.69, "thirties": 0, "teens": 0.02, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.29, "male_masculine": 0.39, "female_feminine": 0.31, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 283, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 33, "size": 10126355, "checksum": "3e8b46b42c2ddc0874fb49a74bec9ec3d6e1a3d512ea1a41799e9acf53f9771b", "avgDurationSecs": 6.179, "validDurationSecs": 1186.446, "totalHrs": 0.48, "validHrs": 0.32 }, "el": { "buckets": { "validated": 16199, "invalidated": 837, "dev": 1700, "test": 1701, "train": 1920, "other": 10330 }, "duration": 113390638, "reportedSentences": 70, "validatedSentences": 6558, "unvalidatedSentences": 4742, "clips": 27366, "splits": { "accent": {}, "age": { "": 0.31, "twenties": 0.12, "thirties": 0.37, "teens": 0.01, "fourties": 0.17, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0.64, "female_feminine": 0.03, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 27366, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, 
"history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 396, "size": 755766887, "checksum": "8fca51270b8d5010d08aeae556c6d1d408a3c5b1fa72b416a53df420b66428e4", "avgDurationSecs": 4.143, "validDurationSecs": 67120.33, "totalHrs": 31.49, "validHrs": 18.64 }, "en": { "buckets": { "validated": 1799288, "invalidated": 292773, "dev": 16393, "test": 16393, "train": 1101170, "other": 321347 }, "duration": 12625984447, "reportedSentences": 7928, "validatedSentences": 1676433, "unvalidatedSentences": 2294, "clips": 2413408, "splits": { "accent": {}, "age": { "": 0.36, "twenties": 0.25, "thirties": 0.14, "teens": 0.06, "fourties": 0.09, "fifties": 0.05, "sixties": 0.04, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male_masculine": 0.45, "female_feminine": 0.17, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2413304, "agriculture": 1, "automotive": 0, "finance": 0, "food_service_retail": 6, "general": 60, "healthcare": 1, "history_law_government": 1, "language_fundamentals": 0, "media_entertainment": 8, "nature_environment": 9, "news_current_affairs": 2, "technology_robotics": 16 } }, "users": 92325, "size": 88478352967, "checksum": "e55889fb825803d8eea9deaddd7cae1421470464d892d75dcda670477ce2cb56", "avgDurationSecs": 5.232, "validDurationSecs": 9413154.47, "totalHrs": 3507.21, "validHrs": 2614.76 }, "eo": { "buckets": { "validated": 850369, "invalidated": 127433, "dev": 14913, "test": 14916, "train": 144164, "other": 152561 }, "duration": 6857896075, "reportedSentences": 2268, "validatedSentences": 180209, "unvalidatedSentences": 22566, "clips": 1130363, "splits": { "accent": {}, "age": { "": 0.2, "twenties": 0.56, "thirties": 0.12, "teens": 0.05, "fourties": 0.04, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0.01, "nineties": 0 }, "gender": { "": 0.2, "male_masculine": 0.69, 
"female_feminine": 0.11, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1130363, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1739, "size": 40991793547, "checksum": "8960050ea5d0c41d7f74cef4bc1d2fafef82be124bc7dcda2f2ca32de0b12814", "avgDurationSecs": 6.067, "validDurationSecs": 5159176.501, "totalHrs": 1904.97, "validHrs": 1433.1 }, "es": { "buckets": { "validated": 415157, "invalidated": 86227, "dev": 15857, "test": 15857, "train": 336846, "other": 1138631 }, "duration": 7989786594, "reportedSentences": 2557, "validatedSentences": 1080636, "unvalidatedSentences": 5897, "clips": 1640015, "splits": { "accent": {}, "age": { "": 0.13, "twenties": 0.54, "thirties": 0.09, "teens": 0.08, "fourties": 0.03, "fifties": 0.04, "sixties": 0.09, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.13, "male_masculine": 0.55, "female_feminine": 0.32, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1640015, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 26042, "size": 50438686609, "checksum": "b08770c585ea59fb7dcc7ddc87f5dfca986e85c4310a20591fe93f4ad3592f6a", "avgDurationSecs": 4.872, "validDurationSecs": 2022552.131, "totalHrs": 2219.38, "validHrs": 561.82 }, "et": { "buckets": { "validated": 24381, "invalidated": 7449, "dev": 2653, "test": 2653, "train": 3157, "other": 60 }, "duration": 215902046, "reportedSentences": 549, "validatedSentences": 10267, "unvalidatedSentences": 3, "clips": 31890, "splits": { "accent": {}, "age": { 
"": 0.18, "twenties": 0.7, "thirties": 0.08, "teens": 0, "fourties": 0.03, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.19, "male_masculine": 0.55, "female_feminine": 0.26, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 31890, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 901, "size": 1456525833, "checksum": "34cbc147766055cd03b25e89a6b4157b36df161ce75805224a349f6f6cab0ce1", "avgDurationSecs": 6.77, "validDurationSecs": 165064.528, "totalHrs": 59.97, "validHrs": 45.85 }, "eu": { "buckets": { "validated": 179132, "invalidated": 28652, "dev": 13630, "test": 13630, "train": 75336, "other": 232648 }, "duration": 2420687919, "reportedSentences": 1223, "validatedSentences": 181151, "unvalidatedSentences": 2753, "clips": 440432, "splits": { "accent": {}, "age": { "": 0.45, "twenties": 0.18, "thirties": 0.06, "teens": 0.01, "fourties": 0.16, "fifties": 0.1, "sixties": 0.04, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.46, "male_masculine": 0.26, "female_feminine": 0.28, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0, "intersex": 0 }, "sentence_domain": { "": 438792, "agriculture": 0, "automotive": 4, "finance": 2, "food_service_retail": 4, "general": 1598, "healthcare": 19, "history_law_government": 0, "language_fundamentals": 2, "media_entertainment": 4, "nature_environment": 2, "news_current_affairs": 0, "technology_robotics": 5 } }, "users": 10522, "size": 14830129695, "checksum": "119700995a84de9de3525c642b61b69e7644975400805a0151dc0a1108bbbc4b", "avgDurationSecs": 5.496, "validDurationSecs": 984539.426, "totalHrs": 672.41, "validHrs": 273.48 }, "fa": { "buckets": { "validated": 328720, "invalidated": 14558, "dev": 
10559, "test": 10559, "train": 28893, "other": 32421 }, "duration": 1495310300, "reportedSentences": 2479, "validatedSentences": 55595, "unvalidatedSentences": 283211, "clips": 375699, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.32, "thirties": 0.36, "teens": 0.03, "fourties": 0.03, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.23, "male_masculine": 0.71, "female_feminine": 0.07, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 375699, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 4438, "size": 10859855096, "checksum": "55f1c80567dda595f6cb86e8a00457460bebc259d946d49f91277c7beb3b63c1", "avgDurationSecs": 3.98, "validDurationSecs": 1308330.344, "totalHrs": 415.36, "validHrs": 363.42 }, "fi": { "buckets": { "validated": 10447, "invalidated": 293, "dev": 1770, "test": 1763, "train": 2076, "other": 6202 }, "duration": 77950947, "reportedSentences": 77, "validatedSentences": 5758, "unvalidatedSentences": 48958, "clips": 16942, "splits": { "accent": {}, "age": { "": 0.31, "twenties": 0.16, "thirties": 0.18, "teens": 0.01, "fourties": 0.31, "fifties": 0.03, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.32, "male_masculine": 0.37, "female_feminine": 0.31, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 16942, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 222, "size": 466315843, "checksum": 
"7db7efaa6762bd5b5ba6bde727d9c1f1d0adc89c1cf54db5fc6c5886ab051ea5", "avgDurationSecs": 4.601, "validDurationSecs": 48067.143, "totalHrs": 21.65, "validHrs": 13.35 }, "fr": { "buckets": { "validated": 726005, "invalidated": 63835, "dev": 16159, "test": 16159, "train": 558054, "other": 32469 }, "duration": 4130672122, "reportedSentences": 7326, "validatedSentences": 1645651, "unvalidatedSentences": 44415, "clips": 822309, "splits": { "accent": {}, "age": { "": 0.38, "twenties": 0.17, "thirties": 0.15, "teens": 0.03, "fourties": 0.14, "fifties": 0.09, "sixties": 0.03, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.31, "male_masculine": 0.58, "female_feminine": 0.11, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 822308, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 1 } }, "users": 19459, "size": 29067489590, "checksum": "258cbf940ab0f0cfa5982dc7e57afcfb5138e16d27484b88a51f1375665d48b3", "avgDurationSecs": 5.023, "validDurationSecs": 3646912.066, "totalHrs": 1147.4, "validHrs": 1013.03 }, "fy-NL": { "buckets": { "validated": 50980, "invalidated": 3961, "dev": 3170, "test": 3171, "train": 3921, "other": 102355 }, "duration": 763509267, "reportedSentences": 543, "validatedSentences": 10351, "unvalidatedSentences": 1, "clips": 157296, "splits": { "accent": {}, "age": { "": 0.56, "twenties": 0.04, "thirties": 0.1, "teens": 0, "fourties": 0.06, "fifties": 0.1, "sixties": 0.12, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.57, "male_masculine": 0.13, "female_feminine": 0.31, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 157296, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, 
"history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 2032, "size": 4579023754, "checksum": "287fe058384fdd7a933a6c2618eeb28bb139c4e6165985670aeda47fddb79e4b", "avgDurationSecs": 4.854, "validDurationSecs": 247455.132, "totalHrs": 212.08, "validHrs": 68.73 }, "ga-IE": { "buckets": { "validated": 5503, "invalidated": 935, "dev": 517, "test": 517, "train": 535, "other": 4058 }, "duration": 37694547, "reportedSentences": 29, "validatedSentences": 1617, "unvalidatedSentences": 516, "clips": 10496, "splits": { "accent": {}, "age": { "": 0.38, "twenties": 0.25, "thirties": 0.24, "teens": 0.01, "fourties": 0.06, "fifties": 0.05, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male_masculine": 0.49, "female_feminine": 0.14, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 10496, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 192, "size": 255538679, "checksum": "20c1cfc6ecf9adf3a652550152c9c6eff8fd53e0d50ee35191aae25073798c55", "avgDurationSecs": 3.591, "validDurationSecs": 19763.061, "totalHrs": 10.47, "validHrs": 5.48 }, "gl": { "buckets": { "validated": 45780, "invalidated": 4688, "dev": 9982, "test": 9990, "train": 25159, "other": 41313 }, "duration": 473274288, "reportedSentences": 543, "validatedSentences": 691612, "unvalidatedSentences": 8106, "clips": 91781, "splits": { "accent": {}, "age": { "": 0.42, "twenties": 0.13, "thirties": 0.12, "teens": 0.01, "fourties": 0.15, "fifties": 0.16, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.44, "male_masculine": 0.31, "female_feminine": 0.25, "transgender": 0, 
"non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 91780, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 1, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 3125, "size": 2752628542, "checksum": "500f010587ecee66e73391c3c29f6d3bb38b40eec8d1de79f7ddbefcb796d40c", "avgDurationSecs": 5.157, "validDurationSecs": 236067.344, "totalHrs": 131.46, "validHrs": 65.57 }, "gn": { "buckets": { "validated": 2909, "invalidated": 162, "dev": 360, "test": 855, "train": 1571, "other": 18778 }, "duration": 99216036, "reportedSentences": 36, "validatedSentences": 5892, "unvalidatedSentences": 298, "clips": 21849, "splits": { "accent": {}, "age": { "": 0.13, "twenties": 0.58, "thirties": 0.19, "teens": 0.02, "fourties": 0.08, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.13, "male_masculine": 0.38, "female_feminine": 0.49, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 21849, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 164, "size": 568826918, "checksum": "0d26c489f69afc44705820f90d6d31f42cef3d1149d43cd7bfb0a4461df67263", "avgDurationSecs": 4.541, "validDurationSecs": 13209.733, "totalHrs": 27.56, "validHrs": 3.66 }, "ha": { "buckets": { "validated": 3281, "invalidated": 170, "dev": 582, "test": 661, "train": 1925, "other": 6660 }, "duration": 43814016, "reportedSentences": 33, "validatedSentences": 5365, "unvalidatedSentences": 323, "clips": 10111, "splits": { "accent": {}, "age": { "": 0.18, "twenties": 0.12, "thirties": 0.68, "teens": 0, "fourties": 0, "fifties": 0.02, 
"sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.16, "male_masculine": 0.59, "female_feminine": 0.25, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 10111, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 40, "size": 257073415, "checksum": "b01a4b640c9f3bbc77e41f181bece28f8c676432e193120f10ff3493383d49d6", "avgDurationSecs": 4.333, "validDurationSecs": 14217.564, "totalHrs": 12.17, "validHrs": 3.94 }, "he": { "buckets": { "validated": 1799, "invalidated": 223, "dev": 270, "test": 260, "train": 951, "other": 2871 }, "duration": 21817800, "reportedSentences": 30, "validatedSentences": 3829, "unvalidatedSentences": 7180, "clips": 4893, "splits": { "accent": {}, "age": { "": 0.01, "twenties": 0.78, "thirties": 0.17, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0.03, "nineties": 0 }, "gender": { "": 0.01, "male_masculine": 0.98, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 4893, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 21, "size": 126995444, "checksum": "c91dd4b45ee2b402b223a87d5cc21e482a2735a4ac5c47e85ba90f2367b3609c", "avgDurationSecs": 4.459, "validDurationSecs": 8021.709, "totalHrs": 6.06, "validHrs": 2.22 }, "hi": { "buckets": { "validated": 10329, "invalidated": 775, "dev": 2428, "test": 3154, "train": 4689, "other": 4044 }, "duration": 74503212, "reportedSentences": 138, "validatedSentences": 32120, "unvalidatedSentences": 9986, 
"clips": 15148, "splits": { "accent": {}, "age": { "": 0.34, "twenties": 0.33, "thirties": 0.26, "teens": 0.01, "fourties": 0.04, "fifties": 0.02, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.34, "male_masculine": 0.63, "female_feminine": 0.04, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 15148, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 414, "size": 442606699, "checksum": "e02967c30a19db302523ece3a7cb9ad1f0bd9c0a89ded5b8c674f21e1dffbbd3", "avgDurationSecs": 4.918, "validDurationSecs": 50801.669, "totalHrs": 20.69, "validHrs": 14.11 }, "hsb": { "buckets": { "validated": 1424, "invalidated": 247, "dev": 172, "test": 444, "train": 808, "other": 94 }, "duration": 10872468, "reportedSentences": 87, "validatedSentences": 7172, "unvalidatedSentences": 35, "clips": 1765, "splits": { "accent": {}, "age": { "": 0.18, "twenties": 0.1, "thirties": 0.1, "teens": 0.04, "fourties": 0.52, "fifties": 0.03, "sixties": 0, "seventies": 0.03, "eighties": 0, "nineties": 0 }, "gender": { "": 0.18, "male_masculine": 0.82, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1765, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 21, "size": 84323116, "checksum": "d909e55268e58d0357e262e38e996554abfe1288c3965abbfb1c8ad072789dfb", "avgDurationSecs": 6.16, "validDurationSecs": 8771.895, "totalHrs": 3.02, "validHrs": 2.43 }, "ht": { "buckets": { "validated": 0, "invalidated": 0, "dev": 0, "test": 0, 
"train": 0, "other": 5 }, "duration": 20664, "reportedSentences": 0, "validatedSentences": 72, "unvalidatedSentences": 1633, "clips": 5, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1, "size": 226806, "checksum": "9cd3481281d910557326fe3e6b6bec6d0db195fb104c6ad68f1cc89b1e61803f", "avgDurationSecs": 4.133, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "hu": { "buckets": { "validated": 60358, "invalidated": 3091, "dev": 11350, "test": 11435, "train": 37140, "other": 49019 }, "duration": 621458389, "reportedSentences": 472, "validatedSentences": 326040, "unvalidatedSentences": 177, "clips": 112468, "splits": { "accent": {}, "age": { "": 0.28, "twenties": 0.12, "thirties": 0.13, "teens": 0.02, "fourties": 0.12, "fifties": 0.31, "sixties": 0.02, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male_masculine": 0.24, "female_feminine": 0.45, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 112468, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1614, "size": 3655132549, "checksum": "2a30eed0e8db5f3cd8e8c7de41813887e6e78766dc3f8d7af05260cfd5941695", "avgDurationSecs": 5.526, "validDurationSecs": 333516.96, "totalHrs": 
172.62, "validHrs": 92.64 }, "hy-AM": { "buckets": { "validated": 14813, "invalidated": 1017, "dev": 4214, "test": 4281, "train": 6180, "other": 15672 }, "duration": 170506476, "reportedSentences": 169, "validatedSentences": 237435, "unvalidatedSentences": 1465, "clips": 31502, "splits": { "accent": {}, "age": { "": 0.15, "twenties": 0.61, "thirties": 0.08, "teens": 0.13, "fourties": 0.02, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.18, "male_masculine": 0.28, "female_feminine": 0.54, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 31500, "agriculture": 0, "automotive": 0, "finance": 1, "food_service_retail": 0, "general": 1, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 390, "size": 1006054142, "checksum": "6773f9a6037ef1f08898c37e20ba29f07b1ce8a3000087734d4ad940450d7297", "avgDurationSecs": 5.413, "validDurationSecs": 80176.256, "totalHrs": 47.36, "validHrs": 22.27 }, "ia": { "buckets": { "validated": 11796, "invalidated": 342, "dev": 1820, "test": 1817, "train": 4972, "other": 2501 }, "duration": 61347396, "reportedSentences": 277, "validatedSentences": 9069, "unvalidatedSentences": 31, "clips": 14639, "splits": { "accent": {}, "age": { "": 0.38, "twenties": 0.05, "thirties": 0.02, "teens": 0, "fourties": 0.3, "fifties": 0.03, "sixties": 0, "seventies": 0.22, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male_masculine": 0.61, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 14639, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 
67, "size": 416339197, "checksum": "bd055acff157438a9e53b5f108bd1b1d069bab87a6f257347860418dac0e04df", "avgDurationSecs": 4.191, "validDurationSecs": 49433.287, "totalHrs": 17.04, "validHrs": 13.73 }, "id": { "buckets": { "validated": 26108, "invalidated": 2605, "dev": 3349, "test": 3641, "train": 4970, "other": 29508 }, "duration": 232300056, "reportedSentences": 350, "validatedSentences": 13001, "unvalidatedSentences": 12079, "clips": 58221, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.41, "thirties": 0.1, "teens": 0.22, "fourties": 0.02, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.29, "male_masculine": 0.45, "female_feminine": 0.26, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 58221, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 551, "size": 1463664210, "checksum": "f207cc8a5e310d6c2bac457844c7a9291b2134a1334813490b47dc0ac96f3428", "avgDurationSecs": 3.99, "validDurationSecs": 104170.142, "totalHrs": 64.52, "validHrs": 28.93 }, "ig": { "buckets": { "validated": 15, "invalidated": 3, "dev": 2, "test": 4, "train": 9, "other": 5759 }, "duration": 31554684, "reportedSentences": 16, "validatedSentences": 14454, "unvalidatedSentences": 3857, "clips": 5777, "splits": { "accent": {}, "age": { "": 0.56, "twenties": 0.33, "thirties": 0.04, "teens": 0.05, "fourties": 0, "fifties": 0, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.56, "male_masculine": 0.14, "female_feminine": 0.3, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5777, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, 
"language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 114, "size": 185529992, "checksum": "48bc023632c406c94099dada32d9d53e36d94d7141c8e2cd10e043e729a48c7d", "avgDurationSecs": 5.462, "validDurationSecs": 81.932, "totalHrs": 8.76, "validHrs": 0.02 }, "is": { "buckets": { "validated": 14, "invalidated": 1, "dev": 0, "test": 0, "train": 14, "other": 25 }, "duration": 284040, "reportedSentences": 0, "validatedSentences": 4993, "unvalidatedSentences": 202, "clips": 40, "splits": { "accent": {}, "age": { "": 0.88, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0.13, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.88, "male_masculine": 0.13, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 40, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 4, "size": 2051225, "checksum": "0d51a20909da4fe9abaf831026aca230ba869a1fe53613d5cf34af04d7479156", "avgDurationSecs": 7.101, "validDurationSecs": 99.414, "totalHrs": 0.07, "validHrs": 0.02 }, "it": { "buckets": { "validated": 236777, "invalidated": 19705, "dev": 15149, "test": 15155, "train": 169771, "other": 7385 }, "duration": 1424069328, "reportedSentences": 5943, "validatedSentences": 924058, "unvalidatedSentences": 1800, "clips": 263867, "splits": { "accent": {}, "age": { "": 0.28, "twenties": 0.21, "thirties": 0.17, "teens": 0.01, "fourties": 0.13, "fifties": 0.15, "sixties": 0.05, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.28, "male_masculine": 0.61, "female_feminine": 0.11, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 263867, 
"agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 7122, "size": 9767905551, "checksum": "2c437c7dc197f5e7240318e366a4bf5c602ed0686f7f7452515504bba172cce0", "avgDurationSecs": 5.397, "validDurationSecs": 1277866.741, "totalHrs": 395.57, "validHrs": 354.96 }, "ja": { "buckets": { "validated": 93022, "invalidated": 13547, "dev": 6261, "test": 6261, "train": 10039, "other": 250352 }, "duration": 1717144161, "reportedSentences": 363, "validatedSentences": 30214, "unvalidatedSentences": 206, "clips": 356921, "splits": { "accent": {}, "age": { "": 0.16, "twenties": 0.53, "thirties": 0.08, "teens": 0.07, "fourties": 0.1, "fifties": 0.04, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.18, "male_masculine": 0.44, "female_feminine": 0.38, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 356921, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 4027, "size": 10125050963, "checksum": "c3f0f36bd61863bded8b53462934777bc63c8264ecf400d93c72c7c1eee6562d", "avgDurationSecs": 4.811, "validDurationSecs": 447528.12, "totalHrs": 476.98, "validHrs": 124.31 }, "ka": { "buckets": { "validated": 97230, "invalidated": 4252, "dev": 12545, "test": 12618, "train": 52321, "other": 48563 }, "duration": 771140196, "reportedSentences": 807, "validatedSentences": 131221, "unvalidatedSentences": 1232, "clips": 150045, "splits": { "accent": {}, "age": { "": 0.21, "twenties": 0.16, "thirties": 0.45, "teens": 0.05, "fourties": 0.07, "fifties": 0.05, "sixties": 0.01, "seventies": 
0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.21, "male_masculine": 0.22, "female_feminine": 0.57, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 150011, "agriculture": 3, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 15, "healthcare": 1, "history_law_government": 2, "language_fundamentals": 2, "media_entertainment": 8, "nature_environment": 2, "news_current_affairs": 1, "technology_robotics": 0 } }, "users": 1679, "size": 4526892116, "checksum": "7f8a17abce169528f9da00025318287f1423520f2d71b1c9f8499313d3fec9b0", "avgDurationSecs": 5.139, "validDurationSecs": 499703.164, "totalHrs": 214.2, "validHrs": 138.8 }, "kab": { "buckets": { "validated": 610839, "invalidated": 19664, "dev": 14998, "test": 14998, "train": 151986, "other": 113274 }, "duration": 2483172736, "reportedSentences": 9466, "validatedSentences": 182732, "unvalidatedSentences": 205344, "clips": 743777, "splits": { "accent": {}, "age": { "": 0.28, "twenties": 0.12, "thirties": 0.29, "teens": 0, "fourties": 0.09, "fifties": 0.19, "sixties": 0.03, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male_masculine": 0.53, "female_feminine": 0.2, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 743777, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1547, "size": 18569666791, "checksum": "728a88c25196665616e33a1989936952fb20026abffac6d2a86ce37a7fa3d0e8", "avgDurationSecs": 3.339, "validDurationSecs": 2039346.136, "totalHrs": 689.77, "validHrs": 566.48 }, "kk": { "buckets": { "validated": 1566, "invalidated": 226, "dev": 498, "test": 514, "train": 548, "other": 737 }, "duration": 12439692, "reportedSentences": 42, "validatedSentences": 6165, 
"unvalidatedSentences": 303, "clips": 2529, "splits": { "accent": {}, "age": { "": 0.54, "twenties": 0.24, "thirties": 0.03, "teens": 0.06, "fourties": 0.08, "fifties": 0.05, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.55, "male_masculine": 0.34, "female_feminine": 0.11, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2529, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 166, "size": 71238025, "checksum": "814840fa69d73063c8243613e77a020879e4bf825d7fba2f856fec602381fb95", "avgDurationSecs": 4.919, "validDurationSecs": 7702.87, "totalHrs": 3.45, "validHrs": 2.13 }, "kmr": { "buckets": { "validated": 59791, "invalidated": 2558, "dev": 3891, "test": 3904, "train": 5148, "other": 25952 }, "duration": 359257896, "reportedSentences": 1053, "validatedSentences": 16782, "unvalidatedSentences": 25, "clips": 88301, "splits": { "accent": {}, "age": { "": 0.48, "twenties": 0.35, "thirties": 0.09, "teens": 0.01, "fourties": 0.03, "fifties": 0.04, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.48, "male_masculine": 0.43, "female_feminine": 0.09, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 88300, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 1, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 561, "size": 1902769434, "checksum": "edb750e4af5cfb4babe311d0400a4d136dcd97e1db8e305b4398091fc3139eb3", "avgDurationSecs": 4.069, "validDurationSecs": 243263.257, "totalHrs": 99.79, "validHrs": 67.57 }, "ko": { "buckets": { "validated": 
1046, "invalidated": 270, "dev": 330, "test": 339, "train": 376, "other": 2057 }, "duration": 20050308, "reportedSentences": 19, "validatedSentences": 6581, "unvalidatedSentences": 3236, "clips": 3373, "splits": { "accent": {}, "age": { "": 0.14, "twenties": 0.42, "thirties": 0.34, "teens": 0.04, "fourties": 0.07, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.17, "male_masculine": 0.63, "female_feminine": 0.21, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 3373, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 90, "size": 117386168, "checksum": "ea1ccc118693b2327e68cc7beedf84220251dbb17e6725dcab0a698f0012260a", "avgDurationSecs": 5.944, "validDurationSecs": 6217.795, "totalHrs": 5.56, "validHrs": 1.72 }, "ky": { "buckets": { "validated": 30375, "invalidated": 5642, "dev": 1613, "test": 1614, "train": 1789, "other": 1612 }, "duration": 171353280, "reportedSentences": 39, "validatedSentences": 5027, "unvalidatedSentences": 4, "clips": 37629, "splits": { "accent": {}, "age": { "": 0.07, "twenties": 0.65, "thirties": 0.09, "teens": 0.17, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.12, "male_masculine": 0.54, "female_feminine": 0.35, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 37629, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 283, "size": 1102666456, "checksum": 
"d87252d289131c291636c636f28fe7ce1eebc200822d29131a436eff29fc0671", "avgDurationSecs": 4.554, "validDurationSecs": 138320.335, "totalHrs": 47.59, "validHrs": 38.42 }, "lg": { "buckets": { "validated": 271812, "invalidated": 39172, "dev": 13333, "test": 13364, "train": 71070, "other": 37088 }, "duration": 2013202179, "reportedSentences": 6451, "validatedSentences": 191407, "unvalidatedSentences": 357, "clips": 348072, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.38, "thirties": 0.22, "teens": 0.01, "fourties": 0.07, "fifties": 0.05, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.25, "male_masculine": 0.36, "female_feminine": 0.39, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 348072, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 657, "size": 11819568695, "checksum": "7b8fdeb24561acf5a23597959f88105973ad023d05478235885fcc2935c36837", "avgDurationSecs": 5.784, "validDurationSecs": 1572124.476, "totalHrs": 559.22, "validHrs": 436.7 }, "lij": { "buckets": { "validated": 2785, "invalidated": 469, "dev": 487, "test": 694, "train": 1602, "other": 9 }, "duration": 11844360, "reportedSentences": 33, "validatedSentences": 5300, "unvalidatedSentences": 22, "clips": 3263, "splits": { "accent": {}, "age": { "": 0.15, "twenties": 0.11, "thirties": 0.67, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0.08, "eighties": 0, "nineties": 0 }, "gender": { "": 0.15, "male_masculine": 0.67, "female_feminine": 0.19, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 3263, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, 
"language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 19, "size": 65932552, "checksum": "58fcc3bb8fa69672f45ea065ea133dbd7f4957ea55aacb47b4fc522d9e1d905a", "avgDurationSecs": 3.63, "validDurationSecs": 10109.268, "totalHrs": 3.29, "validHrs": 2.8 }, "lo": { "buckets": { "validated": 116, "invalidated": 6, "dev": 28, "test": 33, "train": 55, "other": 86 }, "duration": 1342152, "reportedSentences": 0, "validatedSentences": 5145, "unvalidatedSentences": 5925, "clips": 208, "splits": { "accent": {}, "age": { "": 0.07, "twenties": 0.33, "thirties": 0.6, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.07, "male_masculine": 0.93, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 208, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 13, "size": 8666462, "checksum": "0b05554b4d9d3c48a71ff91983e1640f21db1427caf96f5a6fef343781b73745", "avgDurationSecs": 6.453, "validDurationSecs": 748.508, "totalHrs": 0.37, "validHrs": 0.2 }, "lt": { "buckets": { "validated": 16643, "invalidated": 817, "dev": 4436, "test": 4753, "train": 7253, "other": 228 }, "duration": 90751112, "reportedSentences": 191, "validatedSentences": 129214, "unvalidatedSentences": 4814, "clips": 17688, "splits": { "accent": {}, "age": { "": 0.21, "twenties": 0.36, "thirties": 0.31, "teens": 0.03, "fourties": 0.04, "fifties": 0.05, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.21, "male_masculine": 0.57, "female_feminine": 0.22, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 17688, 
"agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 289, "size": 563113237, "checksum": "7a5c42b55ce00ac9e712741b4a8af64d57a502933a89192e93c9491caa7202b4", "avgDurationSecs": 5.131, "validDurationSecs": 85389.572, "totalHrs": 25.2, "validHrs": 23.71 }, "ltg": { "buckets": { "validated": 17185, "invalidated": 1560, "dev": 2772, "test": 2907, "train": 3681, "other": 627 }, "duration": 83896848, "reportedSentences": 36, "validatedSentences": 9583, "unvalidatedSentences": 160, "clips": 19372, "splits": { "accent": {}, "age": { "": 0.3, "twenties": 0.11, "thirties": 0.22, "teens": 0, "fourties": 0.37, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male_masculine": 0.06, "female_feminine": 0.64, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 18847, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 525, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 250, "size": 483270255, "checksum": "09dc5b437e248956f3e76000bee1138b7ca4c537354a2818b49504a8097ee981", "avgDurationSecs": 4.331, "validDurationSecs": 74425.322, "totalHrs": 23.3, "validHrs": 20.67 }, "lv": { "buckets": { "validated": 171652, "invalidated": 9533, "dev": 6752, "test": 6752, "train": 11364, "other": 32248 }, "duration": 995779601, "reportedSentences": 3108, "validatedSentences": 28110, "unvalidatedSentences": 70001, "clips": 213433, "splits": { "accent": {}, "age": { "": 0.43, "twenties": 0.06, "thirties": 0.29, "teens": 0.01, "fourties": 0.09, "fifties": 0.09, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 
0 }, "gender": { "": 0.43, "male_masculine": 0.26, "female_feminine": 0.3, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 213406, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 12, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 3, "nature_environment": 6, "news_current_affairs": 3, "technology_robotics": 3 } }, "users": 5712, "size": 5566590802, "checksum": "bea5a56df5917c9cc060425adb87c1fa4fce509dda2174ed1aa1a39e7e22119c", "avgDurationSecs": 4.666, "validDurationSecs": 800848.791, "totalHrs": 276.6, "validHrs": 222.45 }, "mdf": { "buckets": { "validated": 336, "invalidated": 9, "dev": 54, "test": 107, "train": 175, "other": 0 }, "duration": 1811340, "reportedSentences": 15, "validatedSentences": 6039, "unvalidatedSentences": 4, "clips": 345, "splits": { "accent": {}, "age": { "": 0.58, "twenties": 0.02, "thirties": 0, "teens": 0, "fourties": 0.34, "fifties": 0, "sixties": 0.06, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.58, "male_masculine": 0.08, "female_feminine": 0.34, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 345, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 11, "size": 11057061, "checksum": "dd15d00a557a0f84a98c9bff621a8c950df7a317167d65e00b63c16e7c01ebd0", "avgDurationSecs": 5.25, "validDurationSecs": 1764.088, "totalHrs": 0.5, "validHrs": 0.49 }, "mhr": { "buckets": { "validated": 218410, "invalidated": 16021, "dev": 15015, "test": 15125, "train": 185906, "other": 170 }, "duration": 1084486284, "reportedSentences": 102, "validatedSentences": 328269, "unvalidatedSentences": 3, "clips": 234601, "splits": { "accent": {}, "age": { "": 
0.12, "twenties": 0.33, "thirties": 0.21, "teens": 0.14, "fourties": 0.12, "fifties": 0.04, "sixties": 0.03, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.12, "male_masculine": 0.19, "female_feminine": 0.69, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 234601, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 496, "size": 6119118464, "checksum": "43cb2e7eba1cb0966bc2aaf494e3606d87a890c377c6a41abd5ec41ac31bb8b0", "avgDurationSecs": 4.623, "validDurationSecs": 1009640.408, "totalHrs": 301.24, "validHrs": 280.45 }, "mk": { "buckets": { "validated": 6512, "invalidated": 243, "dev": 1289, "test": 1097, "train": 1686, "other": 12289 }, "duration": 82396512, "reportedSentences": 48, "validatedSentences": 5210, "unvalidatedSentences": 1377, "clips": 19044, "splits": { "accent": {}, "age": { "": 0.01, "twenties": 0.47, "thirties": 0.48, "teens": 0, "fourties": 0.04, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.01, "male_masculine": 0.59, "female_feminine": 0.41, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 19044, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 19, "size": 446262505, "checksum": "79c3ffa5f2bed57032c86fe7f585842a63b75a93b5a3dc19617cedc16815587a", "avgDurationSecs": 4.327, "validDurationSecs": 28175.073, "totalHrs": 22.88, "validHrs": 7.82 }, "ml": { "buckets": { "validated": 2984, "invalidated": 106, "dev": 764, "test": 710, "train": 1259, "other": 5621 }, 
"duration": 36391608, "reportedSentences": 198, "validatedSentences": 5497, "unvalidatedSentences": 4748, "clips": 8711, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0.57, "thirties": 0.09, "teens": 0, "fourties": 0.12, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.27, "male_masculine": 0.52, "female_feminine": 0.21, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8711, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 134, "size": 205676100, "checksum": "54270b0e8d0dded3909ee3015b0ea132d7b2e1e95639eedeed63011230f83544", "avgDurationSecs": 4.178, "validDurationSecs": 12466.141, "totalHrs": 10.1, "validHrs": 3.46 }, "mn": { "buckets": { "validated": 8757, "invalidated": 809, "dev": 1870, "test": 1896, "train": 2175, "other": 5773 }, "duration": 83077424, "reportedSentences": 34, "validatedSentences": 6105, "unvalidatedSentences": 11, "clips": 15339, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.48, "thirties": 0.21, "teens": 0.02, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0.05 }, "gender": { "": 0.3, "male_masculine": 0.37, "female_feminine": 0.33, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 15339, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 532, "size": 606233637, "checksum": "0b5f23b70caa3f6baa14f6b2083f5a26a7416902c6469dc237f5a16a6cf093e7", "avgDurationSecs": 5.416, "validDurationSecs": 47428.711, 
"totalHrs": 23.07, "validHrs": 13.17 }, "mr": { "buckets": { "validated": 10901, "invalidated": 2265, "dev": 1780, "test": 1751, "train": 2215, "other": 2805 }, "duration": 98926344, "reportedSentences": 59, "validatedSentences": 6691, "unvalidatedSentences": 12129, "clips": 15971, "splits": { "accent": {}, "age": { "": 0.05, "twenties": 0.29, "thirties": 0.13, "teens": 0.53, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.05, "male_masculine": 0.19, "female_feminine": 0.76, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 15971, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 90, "size": 582097757, "checksum": "dd614a235d279de2d7c8b0b4826322482718f2a84d3390f249ca5359a56f76da", "avgDurationSecs": 6.194, "validDurationSecs": 67522.139, "totalHrs": 27.47, "validHrs": 18.75 }, "mrj": { "buckets": { "validated": 28917, "invalidated": 1248, "dev": 7314, "test": 7108, "train": 14325, "other": 1449 }, "duration": 132372540, "reportedSentences": 50, "validatedSentences": 54143, "unvalidatedSentences": 4, "clips": 31614, "splits": { "accent": {}, "age": { "": 0.1, "twenties": 0.35, "thirties": 0.22, "teens": 0.06, "fourties": 0.12, "fifties": 0.11, "sixties": 0.04, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.13, "male_masculine": 0.25, "female_feminine": 0.62, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 31614, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 
60, "size": 744266813, "checksum": "0f6e6e873f5c1e2f518692a6dab4ee4a294d40f7c24d09b8b4fc81f8ac63b734", "avgDurationSecs": 4.187, "validDurationSecs": 121079.798, "totalHrs": 36.77, "validHrs": 33.63 }, "mt": { "buckets": { "validated": 6442, "invalidated": 324, "dev": 1585, "test": 1662, "train": 1937, "other": 6307 }, "duration": 61963812, "reportedSentences": 10, "validatedSentences": 5273, "unvalidatedSentences": 11741, "clips": 13073, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.19, "thirties": 0.09, "teens": 0.03, "fourties": 0.17, "fifties": 0.25, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.27, "male_masculine": 0.25, "female_feminine": 0.47, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 13073, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 218, "size": 461611976, "checksum": "2d446e705d0dd4974f9c0e262bf61f1e632cb5a459bde97eec48152018e66344", "avgDurationSecs": 4.74, "validDurationSecs": 30533.992, "totalHrs": 17.21, "validHrs": 8.48 }, "myv": { "buckets": { "validated": 1969, "invalidated": 25, "dev": 239, "test": 479, "train": 1241, "other": 1 }, "duration": 11519388, "reportedSentences": 31, "validatedSentences": 5066, "unvalidatedSentences": 7, "clips": 1995, "splits": { "accent": {}, "age": { "": 0.38, "twenties": 0.09, "thirties": 0.24, "teens": 0.01, "fourties": 0.02, "fifties": 0, "sixties": 0.26, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male_masculine": 0.54, "female_feminine": 0.07, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1995, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, 
"history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 13, "size": 67700769, "checksum": "c2ffa5b18443c31805c6cd0fc90d04962d7130d0b625ba07e1a5fe9166eb3afd", "avgDurationSecs": 5.774, "validDurationSecs": 11369.261, "totalHrs": 3.19, "validHrs": 3.15 }, "nan-tw": { "buckets": { "validated": 7943, "invalidated": 529, "dev": 1702, "test": 2318, "train": 3763, "other": 19530 }, "duration": 72766728, "reportedSentences": 214, "validatedSentences": 24504, "unvalidatedSentences": 347, "clips": 28002, "splits": { "accent": {}, "age": { "": 0.12, "twenties": 0.54, "thirties": 0.15, "teens": 0.01, "fourties": 0.17, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.25, "male_masculine": 0.64, "female_feminine": 0.12, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 28002, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 213, "size": 413782362, "checksum": "d25102e84eb15a3554735146d82c48df63e72a8c699b76251cf88584dda8fe29", "avgDurationSecs": 2.599, "validDurationSecs": 20640.887, "totalHrs": 20.21, "validHrs": 5.73 }, "ne-NP": { "buckets": { "validated": 742, "invalidated": 64, "dev": 204, "test": 217, "train": 283, "other": 595 }, "duration": 5550300, "reportedSentences": 5, "validatedSentences": 1214, "unvalidatedSentences": 129, "clips": 1401, "splits": { "accent": {}, "age": { "": 0.27, "twenties": 0.61, "thirties": 0.12, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.27, "male_masculine": 0.73, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 
0 }, "sentence_domain": { "": 1401, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 32, "size": 32541569, "checksum": "bcc4230ea3c5777b83f8e0d598ab8848b108c31d417a499abbfe335947f5c94f", "avgDurationSecs": 3.962, "validDurationSecs": 2939.559, "totalHrs": 1.54, "validHrs": 0.81 }, "nhi": { "buckets": { "validated": 19, "invalidated": 0, "dev": 0, "test": 5, "train": 14, "other": 6 }, "duration": 117972, "reportedSentences": 0, "validatedSentences": 780, "unvalidatedSentences": 10, "clips": 25, "splits": { "accent": {}, "age": { "": 0.2, "twenties": 0, "thirties": 0.8, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2, "male_masculine": 0.8, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 25, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 2, "size": 740104, "checksum": "189b6497fb4d2d1f9a9c520bf9a5dd2c8568cb12fdd79298b42ac3850c205717", "avgDurationSecs": 4.719, "validDurationSecs": 89.659, "totalHrs": 0.03, "validHrs": 0.02 }, "nl": { "buckets": { "validated": 90449, "invalidated": 5580, "dev": 11252, "test": 11266, "train": 34898, "other": 2771 }, "duration": 430549021, "reportedSentences": 340, "validatedSentences": 258898, "unvalidatedSentences": 27, "clips": 98800, "splits": { "accent": {}, "age": { "": 0.4, "twenties": 0.21, "thirties": 0.12, "teens": 0.02, "fourties": 0.15, "fifties": 0.08, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 
0.41, "male_masculine": 0.48, "female_feminine": 0.11, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 98800, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1709, "size": 2942412622, "checksum": "79bc529eaf3fd2695ca7b8ff447b709bf3c2d6811b49b535caa1a01224230604", "avgDurationSecs": 4.358, "validDurationSecs": 394157.17, "totalHrs": 119.59, "validHrs": 109.48 }, "nn-NO": { "buckets": { "validated": 1173, "invalidated": 66, "dev": 283, "test": 370, "train": 515, "other": 130 }, "duration": 5995944, "reportedSentences": 23, "validatedSentences": 5197, "unvalidatedSentences": 127, "clips": 1369, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.28, "thirties": 0.41, "teens": 0.02, "fourties": 0.03, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.28, "male_masculine": 0.59, "female_feminine": 0.13, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1369, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 38, "size": 34852331, "checksum": "955a49e3001a03c781ba40335cf5548b16501d553b3f28c02ddda8c446506ec0", "avgDurationSecs": 4.38, "validDurationSecs": 5137.504, "totalHrs": 1.66, "validHrs": 1.42 }, "nso": { "buckets": { "validated": 0, "invalidated": 0, "dev": 0, "test": 0, "train": 0, "other": 20 }, "duration": 92592, "reportedSentences": 0, "validatedSentences": 2244, "unvalidatedSentences": 14227, "clips": 20, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, 
"teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 20, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1, "size": 1552612, "checksum": "0b02b755d0e497a01d9bf5c891d19f8317258dcf476356fe353e9a8ed282a5f3", "avgDurationSecs": 4.63, "validDurationSecs": 0, "totalHrs": 0.02, "validHrs": 0 }, "oc": { "buckets": { "validated": 1668, "invalidated": 182, "dev": 260, "test": 254, "train": 271, "other": 7632 }, "duration": 46195056, "reportedSentences": 10, "validatedSentences": 4929, "unvalidatedSentences": 5, "clips": 9482, "splits": { "accent": {}, "age": { "": 0.48, "twenties": 0.02, "thirties": 0.04, "teens": 0.01, "fourties": 0.18, "fifties": 0.16, "sixties": 0.12, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.5, "male_masculine": 0.09, "female_feminine": 0.41, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 9482, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 145, "size": 262575540, "checksum": "7e33c9deecd5d18ed17e0369b34f22d0d318d41e87e6be2d806b4f9c1c85818c", "avgDurationSecs": 4.872, "validDurationSecs": 8126.276, "totalHrs": 12.83, "validHrs": 2.25 }, "or": { "buckets": { "validated": 3161, "invalidated": 275, "dev": 0, "test": 696, "train": 2048, "other": 5550 }, "duration": 45048732, "reportedSentences": 31, "validatedSentences": 6091, 
"unvalidatedSentences": 119, "clips": 8986, "splits": { "accent": {}, "age": { "": 0.09, "twenties": 0.17, "thirties": 0.71, "teens": 0, "fourties": 0.02, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.09, "male_masculine": 0.88, "female_feminine": 0.04, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8986, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 125, "size": 312275948, "checksum": "f0944a2538aefe29a298fdd1deb3bcf512f884ba9e95bc8a4da8d94511c5a835", "avgDurationSecs": 5.013, "validDurationSecs": 15846.766, "totalHrs": 12.51, "validHrs": 4.4 }, "os": { "buckets": { "validated": 184, "invalidated": 18, "dev": 43, "test": 50, "train": 91, "other": 0 }, "duration": 1131624, "reportedSentences": 1, "validatedSentences": 961, "unvalidatedSentences": 920, "clips": 202, "splits": { "accent": {}, "age": { "": 0.4, "twenties": 0, "thirties": 0, "teens": 0.1, "fourties": 0.5, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.4, "male_masculine": 0.6, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 202, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 8, "size": 6488459, "checksum": "5bcf89d5a4747d2711130500a4473e31e794dcbb1148141d8d93bedd3e3f6349", "avgDurationSecs": 5.602, "validDurationSecs": 1030.786, "totalHrs": 0.31, "validHrs": 0.28 }, "pa-IN": { "buckets": { "validated": 1505, "invalidated": 81, "dev": 286, "test": 
487, "train": 732, "other": 1388 }, "duration": 14357366, "reportedSentences": 251, "validatedSentences": 14816, "unvalidatedSentences": 27, "clips": 2974, "splits": { "accent": {}, "age": { "": 0.3, "twenties": 0.23, "thirties": 0.38, "teens": 0, "fourties": 0.04, "fifties": 0.05, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male_masculine": 0.69, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2974, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 68, "size": 101962623, "checksum": "6f7c8358d6eaf0bae2ec8c2b6dc37b5ad870798f0860b59117fa3d0edaae592b", "avgDurationSecs": 4.828, "validDurationSecs": 7265.58, "totalHrs": 3.98, "validHrs": 2.01 }, "pl": { "buckets": { "validated": 132661, "invalidated": 6605, "dev": 9230, "test": 9230, "train": 20729, "other": 662 }, "duration": 633047942, "reportedSentences": 558, "validatedSentences": 228676, "unvalidatedSentences": 11796, "clips": 139928, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.28, "thirties": 0.33, "teens": 0.02, "fourties": 0.11, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0.01 }, "gender": { "": 0.26, "male_masculine": 0.6, "female_feminine": 0.14, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 139928, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 3309, "size": 4528294687, "checksum": "aa17b3c5e549540b0916e85be1ccf8e2d27292062aaf4baf2a509b918c0db4ed", "avgDurationSecs": 
4.524, "validDurationSecs": 600171.324, "totalHrs": 175.84, "validHrs": 166.71 }, "ps": { "buckets": { "validated": 1226, "invalidated": 12, "dev": 0, "test": 199, "train": 1027, "other": 291 }, "duration": 7610400, "reportedSentences": 1, "validatedSentences": 2033, "unvalidatedSentences": 247, "clips": 1529, "splits": { "accent": {}, "age": { "": 0.19, "twenties": 0.11, "thirties": 0.02, "teens": 0, "fourties": 0.68, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.19, "male_masculine": 0.81, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1529, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 9, "size": 44988618, "checksum": "150715041110fb250ca7565276d21cc8431e311a3b6978799f9846f38fd185f0", "avgDurationSecs": 4.977, "validDurationSecs": 6102.257, "totalHrs": 2.11, "validHrs": 1.69 }, "pt": { "buckets": { "validated": 152647, "invalidated": 7307, "dev": 9464, "test": 9467, "train": 21968, "other": 24946 }, "duration": 759497047, "reportedSentences": 2772, "validatedSentences": 43136, "unvalidatedSentences": 70, "clips": 184900, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.37, "thirties": 0.18, "teens": 0.02, "fourties": 0.13, "fifties": 0.03, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male_masculine": 0.68, "female_feminine": 0.06, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 184893, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 5, "healthcare": 0, "history_law_government": 1, "language_fundamentals": 0, "media_entertainment": 1, "nature_environment": 0, "news_current_affairs": 0, 
"technology_robotics": 0 } }, "users": 3453, "size": 4788293242, "checksum": "3ae8ff0c3734d3615005cb2e20d4d39bef62ff92493bbee1725a268287d44340", "avgDurationSecs": 4.108, "validDurationSecs": 627014.309, "totalHrs": 210.97, "validHrs": 174.17 }, "quy": { "buckets": { "validated": 1, "invalidated": 1, "dev": 0, "test": 0, "train": 1, "other": 4 }, "duration": 36792, "reportedSentences": 0, "validatedSentences": 751, "unvalidatedSentences": 290, "clips": 6, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 2, "size": 281321, "checksum": "fad84f8b35973a76bb1ebc31430bb09a4581c9f4cd37400007c34ea4acf1e7dc", "avgDurationSecs": 6.132, "validDurationSecs": 6.132, "totalHrs": 0.01, "validHrs": 0 }, "rm-sursilv": { "buckets": { "validated": 4305, "invalidated": 680, "dev": 1358, "test": 1363, "train": 1571, "other": 2201 }, "duration": 39264017, "reportedSentences": 17, "validatedSentences": 8705, "unvalidatedSentences": 7531, "clips": 7186, "splits": { "accent": {}, "age": { "": 0.63, "twenties": 0.1, "thirties": 0.03, "teens": 0.05, "fourties": 0.19, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.64, "male_masculine": 0.18, "female_feminine": 0.19, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7186, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, 
"language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 90, "size": 297203822, "checksum": "4fb32e3c094a8e6401419d762b4cbfffdf2835246fbe9e68a87f63c487b814bc", "avgDurationSecs": 5.464, "validDurationSecs": 23522.348, "totalHrs": 10.9, "validHrs": 6.53 }, "rm-vallader": { "buckets": { "validated": 1526, "invalidated": 394, "dev": 500, "test": 462, "train": 557, "other": 711 }, "duration": 15327554, "reportedSentences": 38, "validatedSentences": 5913, "unvalidatedSentences": 1172, "clips": 2631, "splits": { "accent": {}, "age": { "": 0.36, "twenties": 0.14, "thirties": 0.06, "teens": 0, "fourties": 0.41, "fifties": 0, "sixties": 0.03, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male_masculine": 0.44, "female_feminine": 0.19, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2631, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 53, "size": 117352317, "checksum": "d9daac60e82d508e60ebb2bbf817a157403931f2db8a25d6d12ecacd2c21fa51", "avgDurationSecs": 5.826, "validDurationSecs": 8890.098, "totalHrs": 4.25, "validHrs": 2.46 }, "ro": { "buckets": { "validated": 17737, "invalidated": 977, "dev": 3881, "test": 3896, "train": 5141, "other": 23087 }, "duration": 168478807, "reportedSentences": 414, "validatedSentences": 13741, "unvalidatedSentences": 89122, "clips": 41801, "splits": { "accent": {}, "age": { "": 0.15, "twenties": 0.61, "thirties": 0.13, "teens": 0.01, "fourties": 0.07, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.15, "male_masculine": 0.72, "female_feminine": 0.13, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, 
"sentence_domain": { "": 41801, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 427, "size": 1052105258, "checksum": "4ca54b652b4697426927b3df04cd06479b67c8de01c20f464a202da112435cd4", "avgDurationSecs": 4.03, "validDurationSecs": 71488.926, "totalHrs": 46.79, "validHrs": 19.85 }, "ru": { "buckets": { "validated": 163387, "invalidated": 10018, "dev": 10203, "test": 10203, "train": 26377, "other": 17456 }, "duration": 986019516, "reportedSentences": 485, "validatedSentences": 46909, "unvalidatedSentences": 69, "clips": 190861, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.33, "thirties": 0.17, "teens": 0.09, "fourties": 0.14, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.24, "male_masculine": 0.61, "female_feminine": 0.15, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 190861, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 3206, "size": 6670474844, "checksum": "3ebfecaac4d8961210c863f8422143717867623b5d8037319b69d39f102e31dc", "avgDurationSecs": 5.166, "validDurationSecs": 844084.285, "totalHrs": 273.89, "validHrs": 234.46 }, "rw": { "buckets": { "validated": 1438799, "invalidated": 227807, "dev": 15987, "test": 16213, "train": 1003028, "other": 47288 }, "duration": 8582410481, "reportedSentences": 630, "validatedSentences": 1404853, "unvalidatedSentences": 5, "clips": 1713894, "splits": { "accent": {}, "age": { "": 0.05, "twenties": 0.61, "thirties": 0.12, "teens": 0.2, "fourties": 0.02, "fifties": 0, 
"sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.1, "male_masculine": 0.57, "female_feminine": 0.33, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1713894, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1131, "size": 61193318886, "checksum": "fe916bcdc0e5fd4eaea828a8c691fe5916293d85cdd09fbab97e724d4ac03566", "avgDurationSecs": 5.008, "validDurationSecs": 7204858.42, "totalHrs": 2384, "validHrs": 2001.34 }, "sah": { "buckets": { "validated": 4882, "invalidated": 152, "dev": 1394, "test": 1455, "train": 1883, "other": 2393 }, "duration": 45556824, "reportedSentences": 11, "validatedSentences": 6049, "unvalidatedSentences": 647, "clips": 7427, "splits": { "accent": {}, "age": { "": 0.35, "twenties": 0.25, "thirties": 0.24, "teens": 0.06, "fourties": 0.06, "fifties": 0.04, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.35, "male_masculine": 0.38, "female_feminine": 0.26, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7427, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 111, "size": 310346391, "checksum": "2cd79209324d913f1e5eb194ec31fafd33f897be0e58c8a0207c38a027b8dbd9", "avgDurationSecs": 6.134, "validDurationSecs": 29945.929, "totalHrs": 12.65, "validHrs": 8.31 }, "sat": { "buckets": { "validated": 462, "invalidated": 9, "dev": 0, "test": 147, "train": 315, "other": 353 }, "duration": 3676320, "reportedSentences": 7, "validatedSentences": 5407, "unvalidatedSentences": 
547, "clips": 824, "splits": { "accent": {}, "age": { "": 0.43, "twenties": 0.43, "thirties": 0.1, "teens": 0.01, "fourties": 0.01, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.41, "male_masculine": 0.58, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 824, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 13, "size": 20610966, "checksum": "bab1d9df6d0f3677c16072037a0018ddb44fffb829b5b7d299df45071f91d8b0", "avgDurationSecs": 4.462, "validDurationSecs": 2061.238, "totalHrs": 1.02, "validHrs": 0.57 }, "sc": { "buckets": { "validated": 1200, "invalidated": 29, "dev": 270, "test": 232, "train": 533, "other": 327 }, "duration": 7012800, "reportedSentences": 3, "validatedSentences": 5231, "unvalidatedSentences": 229, "clips": 1556, "splits": { "accent": {}, "age": { "": 0.58, "twenties": 0.12, "thirties": 0.3, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.58, "male_masculine": 0.11, "female_feminine": 0.32, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1556, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 14, "size": 41036393, "checksum": "61cb3789fb37f2691bb63445fc6644c3997061c2f48591bec7fe9770e226cb92", "avgDurationSecs": 4.507, "validDurationSecs": 5408.329, "totalHrs": 1.94, "validHrs": 1.5 }, "sk": { "buckets": { "validated": 19513, "invalidated": 833, "dev": 2588, "test": 2647, 
"train": 3258, "other": 3392 }, "duration": 96807024, "reportedSentences": 66, "validatedSentences": 8673, "unvalidatedSentences": 518, "clips": 23738, "splits": { "accent": {}, "age": { "": 0.43, "twenties": 0.08, "thirties": 0.21, "teens": 0.09, "fourties": 0.18, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.44, "male_masculine": 0.49, "female_feminine": 0.07, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 23738, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 231, "size": 547180725, "checksum": "fbeb5775eadce28f77c76e48a8d49ee72cd5d7bf06081f678ece65ac9a625134", "avgDurationSecs": 4.078, "validDurationSecs": 79576.858, "totalHrs": 26.89, "validHrs": 22.1 }, "skr": { "buckets": { "validated": 3654, "invalidated": 462, "dev": 1124, "test": 1010, "train": 1508, "other": 1611 }, "duration": 23728140, "reportedSentences": 17, "validatedSentences": 5994, "unvalidatedSentences": 106, "clips": 5727, "splits": { "accent": {}, "age": { "": 0.36, "twenties": 0.25, "thirties": 0.01, "teens": 0, "fourties": 0, "fifties": 0.38, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.36, "male_masculine": 0.64, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5727, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 57, "size": 136553829, "checksum": "7c7089f2216a20ceb6d37a1559c34bb1b65794bcdd26f7e9879dfa774983859e", "avgDurationSecs": 4.143, 
"validDurationSecs": 15139.274, "totalHrs": 6.59, "validHrs": 4.2 }, "sl": { "buckets": { "validated": 10819, "invalidated": 281, "dev": 1232, "test": 1242, "train": 1388, "other": 3145 }, "duration": 53946766, "reportedSentences": 49, "validatedSentences": 4067, "unvalidatedSentences": 128, "clips": 14245, "splits": { "accent": {}, "age": { "": 0.16, "twenties": 0.42, "thirties": 0.04, "teens": 0.06, "fourties": 0.19, "fifties": 0.06, "sixties": 0.06, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.17, "male_masculine": 0.7, "female_feminine": 0.13, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 14245, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 154, "size": 370901115, "checksum": "24b258a9808a8c6cba01fbd996925daff3add221d8a82bc3a5512a77f274bf31", "avgDurationSecs": 3.787, "validDurationSecs": 40972.275, "totalHrs": 14.98, "validHrs": 11.38 }, "sq": { "buckets": { "validated": 1524, "invalidated": 20, "dev": 385, "test": 475, "train": 660, "other": 0 }, "duration": 7079940, "reportedSentences": 12, "validatedSentences": 6372, "unvalidatedSentences": 1420, "clips": 1544, "splits": { "accent": {}, "age": { "": 0.62, "twenties": 0.07, "thirties": 0.19, "teens": 0, "fourties": 0.11, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.5, "male_masculine": 0.4, "female_feminine": 0.11, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1528, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 16, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } 
}, "users": 55, "size": 41443116, "checksum": "92fb6fcc6f45283195a9dcfb9fc81335efcd98cfcf65cc63b15dbf2a6e390b99", "avgDurationSecs": 4.585, "validDurationSecs": 6988.231, "totalHrs": 1.96, "validHrs": 1.94 }, "sr": { "buckets": { "validated": 5970, "invalidated": 290, "dev": 1583, "test": 1539, "train": 1879, "other": 1781 }, "duration": 24325632, "reportedSentences": 60, "validatedSentences": 6374, "unvalidatedSentences": 1861, "clips": 8041, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0.4, "thirties": 0.17, "teens": 0, "fourties": 0.06, "fifties": 0.15, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.24, "male_masculine": 0.51, "female_feminine": 0.25, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8041, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 153, "size": 131712731, "checksum": "b3c312892664d2509dcccaa8804cd3d88cb56ec23b80d713268120c9a06cb60c", "avgDurationSecs": 3.025, "validDurationSecs": 18060.443, "totalHrs": 6.75, "validHrs": 5.01 }, "sv-SE": { "buckets": { "validated": 40770, "invalidated": 1428, "dev": 5210, "test": 5259, "train": 7744, "other": 6759 }, "duration": 196198323, "reportedSentences": 592, "validatedSentences": 25836, "unvalidatedSentences": 7373, "clips": 48957, "splits": { "accent": {}, "age": { "": 0.19, "twenties": 0.12, "thirties": 0.25, "teens": 0.03, "fourties": 0.38, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.19, "male_masculine": 0.48, "female_feminine": 0.33, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 48957, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, 
"history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 836, "size": 1242093611, "checksum": "aeb753e62cca50d3ec178840b87b096f10756563e66f936d19c55d758a89af44", "avgDurationSecs": 4.008, "validDurationSecs": 163388.394, "totalHrs": 54.49, "validHrs": 45.38 }, "sw": { "buckets": { "validated": 267001, "invalidated": 80612, "dev": 12251, "test": 12253, "train": 46494, "other": 377365 }, "duration": 3904990164, "reportedSentences": 1901, "validatedSentences": 135986, "unvalidatedSentences": 5845, "clips": 724978, "splits": { "accent": {}, "age": { "": 0.31, "twenties": 0.5, "thirties": 0.11, "teens": 0.01, "fourties": 0.03, "fifties": 0.04, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.29, "male_masculine": 0.34, "female_feminine": 0.37, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 724978, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1452, "size": 22792311032, "checksum": "273c7e417cb8a1538e62a03211da1160fffb8e8f9db155c7a40e4694a20af06c", "avgDurationSecs": 5.386, "validDurationSecs": 1438162.646, "totalHrs": 1084.71, "validHrs": 399.48 }, "ta": { "buckets": { "validated": 135391, "invalidated": 5693, "dev": 12095, "test": 12074, "train": 45587, "other": 93989 }, "duration": 1453813872, "reportedSentences": 3358, "validatedSentences": 118169, "unvalidatedSentences": 653, "clips": 235073, "splits": { "accent": {}, "age": { "": 0.69, "twenties": 0.09, "thirties": 0.11, "teens": 0.03, "fourties": 0.03, "fifties": 0.03, "sixties": 0, "seventies": 0.02, "eighties": 0, "nineties": 0 }, "gender": { "": 0.69, "male_masculine": 0.19, "female_feminine": 
0.13, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 235073, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 906, "size": 8722013939, "checksum": "3d48a3ac0610b2a3c26f551a835f84d3da6caba7226e81bdcd5262cdbbaef260", "avgDurationSecs": 6.185, "validDurationSecs": 837328.464, "totalHrs": 403.83, "validHrs": 232.59 }, "te": { "buckets": { "validated": 224, "invalidated": 18, "dev": 48, "test": 49, "train": 62, "other": 1732 }, "duration": 8268660, "reportedSentences": 3, "validatedSentences": 270, "unvalidatedSentences": 19175, "clips": 1974, "splits": { "accent": {}, "age": { "": 0.13, "twenties": 0.67, "thirties": 0.01, "teens": 0.09, "fourties": 0.01, "fifties": 0, "sixties": 0.09, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.13, "male_masculine": 0.67, "female_feminine": 0.2, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1974, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 39, "size": 49698600, "checksum": "6fa35a1e280f77fa05c028780149148d5618aaf983af01a86a7ba5e79466534d", "avgDurationSecs": 4.189, "validDurationSecs": 938.288, "totalHrs": 2.29, "validHrs": 0.26 }, "th": { "buckets": { "validated": 147160, "invalidated": 9267, "dev": 11042, "test": 11042, "train": 32823, "other": 206935 }, "duration": 1522653420, "reportedSentences": 4250, "validatedSentences": 64142, "unvalidatedSentences": 122013, "clips": 363362, "splits": { "accent": {}, "age": { "": 0.4, "twenties": 0.24, "thirties": 0.07, "teens": 0.04, 
"fourties": 0.04, "fifties": 0.2, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.41, "male_masculine": 0.42, "female_feminine": 0.17, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 363362, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 7880, "size": 8884377696, "checksum": "a46355d7dd52a7149f7147add27c3759e0a4881f995f6d52fcad7d8d5d93fed3", "avgDurationSecs": 4.19, "validDurationSecs": 616667.888, "totalHrs": 422.95, "validHrs": 171.29 }, "ti": { "buckets": { "validated": 23, "invalidated": 1, "dev": 5, "test": 4, "train": 14, "other": 45 }, "duration": 378540, "reportedSentences": 3, "validatedSentences": 4531, "unvalidatedSentences": 141, "clips": 69, "splits": { "accent": {}, "age": { "": 0.78, "twenties": 0, "thirties": 0.14, "teens": 0, "fourties": 0.07, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.78, "male_masculine": 0.22, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 69, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 6, "size": 2505445, "checksum": "9474b4da73b8081802d766b04358162e1deeb2cae31ae71cbc4029f84431e616", "avgDurationSecs": 5.486, "validDurationSecs": 126.18, "totalHrs": 0.1, "validHrs": 0.03 }, "tig": { "buckets": { "validated": 635, "invalidated": 16, "dev": 99, "test": 171, "train": 365, "other": 922 }, "duration": 9666972, "reportedSentences": 2, "validatedSentences": 5382, "unvalidatedSentences": 20, 
"clips": 1573, "splits": { "accent": {}, "age": { "": 0.97, "twenties": 0.01, "thirties": 0.01, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.97, "male_masculine": 0.03, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1573, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 24, "size": 50315847, "checksum": "d93c7932eb3d94055d3a4efe774571780e42b0f07d44222aa2235e39602d4525", "avgDurationSecs": 6.146, "validDurationSecs": 3902.433, "totalHrs": 2.68, "validHrs": 1.08 }, "tk": { "buckets": { "validated": 1822, "invalidated": 120, "dev": 354, "test": 546, "train": 776, "other": 2396 }, "duration": 23469732, "reportedSentences": 27, "validatedSentences": 2456, "unvalidatedSentences": 242, "clips": 4338, "splits": { "accent": {}, "age": { "": 0.34, "twenties": 0.57, "thirties": 0.09, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.34, "male_masculine": 0.47, "female_feminine": 0.19, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 4338, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 112, "size": 131993510, "checksum": "f8f281f92ba8d98962d274f68a105c0052e8d29adf4728da5b54f61651972f07", "avgDurationSecs": 5.41, "validDurationSecs": 9857.504, "totalHrs": 6.51, "validHrs": 2.73 }, "tok": { "buckets": { "validated": 11734, "invalidated": 337, "dev": 2103, "test": 2179, "train": 
2616, "other": 4205 }, "duration": 67910328, "reportedSentences": 143, "validatedSentences": 7196, "unvalidatedSentences": 2512, "clips": 16276, "splits": { "accent": {}, "age": { "": 0.32, "twenties": 0.2, "thirties": 0.07, "teens": 0.24, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0.17, "nineties": 0 }, "gender": { "": 0.43, "male_masculine": 0.52, "female_feminine": 0.06, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 16276, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 149, "size": 397601735, "checksum": "6594bd66e121f95beafd87afb307de9fc55b41b2ebd493cf9c261fccc73964c1", "avgDurationSecs": 4.172, "validDurationSecs": 48959.191, "totalHrs": 18.86, "validHrs": 13.59 }, "tr": { "buckets": { "validated": 114056, "invalidated": 4530, "dev": 11258, "test": 11290, "train": 35147, "other": 117 }, "duration": 439430499, "reportedSentences": 404, "validatedSentences": 409934, "unvalidatedSentences": 3432, "clips": 118703, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.23, "thirties": 0.09, "teens": 0.02, "fourties": 0.03, "fifties": 0.07, "sixties": 0.2, "seventies": 0.03, "eighties": 0.08, "nineties": 0 }, "gender": { "": 0.26, "male_masculine": 0.43, "female_feminine": 0.31, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 118702, "agriculture": 0, "automotive": 1, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1588, "size": 2689729822, "checksum": "8bd6c15db1beb24e17fbfa1b6b45f3a4cb31314d0ac709b6ac8ba3561355ba18", 
"avgDurationSecs": 3.702, "validDurationSecs": 422227.618, "totalHrs": 122.06, "validHrs": 117.28 }, "tt": { "buckets": { "validated": 29162, "invalidated": 425, "dev": 3691, "test": 4964, "train": 9315, "other": 195 }, "duration": 112475294, "reportedSentences": 6, "validatedSentences": 18031, "unvalidatedSentences": 49, "clips": 29782, "splits": { "accent": {}, "age": { "": 0.2, "twenties": 0.05, "thirties": 0.72, "teens": 0, "fourties": 0, "fifties": 0.01, "sixties": 0, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2, "male_masculine": 0.78, "female_feminine": 0.02, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 29782, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 258, "size": 828607842, "checksum": "a6dff40a51c72145c0ad80548e5519781a60707e3126354cb5c9f51fbbbe4d18", "avgDurationSecs": 3.777, "validDurationSecs": 110133.79, "totalHrs": 31.24, "validHrs": 30.59 }, "tw": { "buckets": { "validated": 145, "invalidated": 2, "dev": 0, "test": 9, "train": 136, "other": 95 }, "duration": 975492, "reportedSentences": 3, "validatedSentences": 49159, "unvalidatedSentences": 2, "clips": 242, "splits": { "accent": {}, "age": { "": 0.09, "twenties": 0.02, "thirties": 0.89, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.09, "male_masculine": 0.91, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 242, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, 
"technology_robotics": 0 } }, "users": 6, "size": 8482327, "checksum": "0e563467bf3abe711fc3e4a3f15d456d1f10788cd90383335797b916f3f611c3", "avgDurationSecs": 4.031, "validDurationSecs": 584.489, "totalHrs": 0.27, "validHrs": 0.16 }, "ug": { "buckets": { "validated": 121377, "invalidated": 6883, "dev": 6108, "test": 6108, "train": 9651, "other": 15737 }, "duration": 851884272, "reportedSentences": 704, "validatedSentences": 22778, "unvalidatedSentences": 3744, "clips": 143997, "splits": { "accent": {}, "age": { "": 0.37, "twenties": 0.18, "thirties": 0.3, "teens": 0.05, "fourties": 0.08, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male_masculine": 0.45, "female_feminine": 0.17, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 143975, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 19, "healthcare": 3, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1258, "size": 4879842131, "checksum": "92fdb8439a55bec314d843a6e523e57512df2ae718aef4e2bca60b48e677a525", "avgDurationSecs": 5.916, "validDurationSecs": 718064.663, "totalHrs": 236.63, "validHrs": 199.46 }, "uk": { "buckets": { "validated": 75489, "invalidated": 3204, "dev": 10007, "test": 10011, "train": 25137, "other": 7851 }, "duration": 402162108, "reportedSentences": 647, "validatedSentences": 212627, "unvalidatedSentences": 9, "clips": 86544, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0.33, "thirties": 0.25, "teens": 0.09, "fourties": 0.1, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.23, "male_masculine": 0.59, "female_feminine": 0.17, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 86544, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 
0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1075, "size": 2651464987, "checksum": "8a4b7c53d6853035cf7c2ca371c4faea2268dad6a8de117863b2c7f76e790b4d", "avgDurationSecs": 4.647, "validDurationSecs": 350790.527, "totalHrs": 111.71, "validHrs": 97.44 }, "ur": { "buckets": { "validated": 53858, "invalidated": 6818, "dev": 4057, "test": 4056, "train": 5368, "other": 135861 }, "duration": 834477336, "reportedSentences": 237, "validatedSentences": 18209, "unvalidatedSentences": 50977, "clips": 196537, "splits": { "accent": {}, "age": { "": 0.08, "twenties": 0.9, "thirties": 0.01, "teens": 0, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.08, "male_masculine": 0.67, "female_feminine": 0.25, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 196537, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 349, "size": 4864902210, "checksum": "08b9f08f6356bed138fb16e14d13ac93df414b7f38eceb37ab8fe94b626e66aa", "avgDurationSecs": 4.246, "validDurationSecs": 228675.925, "totalHrs": 231.79, "validHrs": 63.52 }, "uz": { "buckets": { "validated": 86430, "invalidated": 13936, "dev": 12134, "test": 12348, "train": 48475, "other": 127978 }, "duration": 947648124, "reportedSentences": 1832, "validatedSentences": 148657, "unvalidatedSentences": 137931, "clips": 228344, "splits": { "accent": {}, "age": { "": 0.41, "twenties": 0.4, "thirties": 0.01, "teens": 0.18, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.41, "male_masculine": 0.44, 
"female_feminine": 0.15, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 228344, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 2170, "size": 5316491135, "checksum": "af34005bdc1e4ed6f58a3f1101cfae161fc25e3b8fefeb35a42f3af2d7f10921", "avgDurationSecs": 4.15, "validDurationSecs": 358692.268, "totalHrs": 263.23, "validHrs": 99.63 }, "vi": { "buckets": { "validated": 5135, "invalidated": 377, "dev": 641, "test": 1274, "train": 2298, "other": 11533 }, "duration": 67546520, "reportedSentences": 187, "validatedSentences": 5738, "unvalidatedSentences": 5348, "clips": 17045, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.2, "thirties": 0.02, "teens": 0.21, "fourties": 0.02, "fifties": 0, "sixties": 0.3, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.27, "male_masculine": 0.53, "female_feminine": 0.19, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 17045, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 284, "size": 393361257, "checksum": "da04c83c35beed10770aa4f445fb16dd969e22490eb4255789018f36867fa3a4", "avgDurationSecs": 3.963, "validDurationSecs": 20349.157, "totalHrs": 18.76, "validHrs": 5.65 }, "vot": { "buckets": { "validated": 102, "invalidated": 324, "dev": 0, "test": 6, "train": 96, "other": 1 }, "duration": 1029972, "reportedSentences": 0, "validatedSentences": 5437, "unvalidatedSentences": 1, "clips": 427, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.74, "thirties": 0, "teens": 0.01, 
"fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.25, "male_masculine": 0.75, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 427, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 6, "size": 8186606, "checksum": "655cc3be46df0baba6a1f9a60f686a17f1039fed22e0bf264259a2dba8f745bf", "avgDurationSecs": 2.412, "validDurationSecs": 246.035, "totalHrs": 0.28, "validHrs": 0.06 }, "yi": { "buckets": { "validated": 47, "invalidated": 1, "dev": 0, "test": 6, "train": 41, "other": 1 }, "duration": 186912, "reportedSentences": 1, "validatedSentences": 821, "unvalidatedSentences": 677, "clips": 49, "splits": { "accent": {}, "age": { "": 0.35, "twenties": 0.65, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 49, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 3, "size": 1186524, "checksum": "40b639d24296e0f9aa09a58e44abe61b49372534a271481100eebe7a90607a3b", "avgDurationSecs": 3.815, "validDurationSecs": 179.283, "totalHrs": 0.05, "validHrs": 0.04 }, "yo": { "buckets": { "validated": 3077, "invalidated": 229, "dev": 863, "test": 999, "train": 1213, "other": 1113 }, "duration": 26232120, "reportedSentences": 19, "validatedSentences": 5413, "unvalidatedSentences": 16, "clips": 4419, "splits": { 
"accent": {}, "age": { "": 0.3, "twenties": 0.63, "thirties": 0.02, "teens": 0.05, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male_masculine": 0.32, "female_feminine": 0.38, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 4419, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 108, "size": 153828417, "checksum": "94af710507bc62b169adee9e7d60f76533525c47d0e8e396d27afa44c0095876", "avgDurationSecs": 5.936, "validDurationSecs": 18265.724, "totalHrs": 7.28, "validHrs": 5.07 }, "yue": { "buckets": { "validated": 21758, "invalidated": 1760, "dev": 2602, "test": 2626, "train": 3150, "other": 141697 }, "duration": 639920484, "reportedSentences": 1696, "validatedSentences": 10434, "unvalidatedSentences": 8471, "clips": 165215, "splits": { "accent": {}, "age": { "": 0.17, "twenties": 0.32, "thirties": 0.45, "teens": 0.01, "fourties": 0.04, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.19, "male_masculine": 0.25, "female_feminine": 0.56, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 165215, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 913, "size": 3679118374, "checksum": "f5c51d5279a03a4cc756def51168beb96cfe85d08ad811e4cfa90c2b37123a76", "avgDurationSecs": 3.873, "validDurationSecs": 84274.369, "totalHrs": 177.75, "validHrs": 23.4 }, "zgh": { "buckets": { "validated": 557, "invalidated": 41, "dev": 0, "test": 159, "train": 
398, "other": 990 }, "duration": 5280732, "reportedSentences": 3, "validatedSentences": 2668, "unvalidatedSentences": 400, "clips": 1588, "splits": { "accent": {}, "age": { "": 0.08, "twenties": 0.27, "thirties": 0.65, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.08, "male_masculine": 0.89, "female_feminine": 0.03, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1568, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 17, "healthcare": 1, "history_law_government": 0, "language_fundamentals": 2, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 17, "size": 28926601, "checksum": "f8043616bf12c9d4e821748be684fd75d9a0d6e909602fc75c19feaaced8121b", "avgDurationSecs": 3.325, "validDurationSecs": 1852.247, "totalHrs": 1.46, "validHrs": 0.51 }, "zh-CN": { "buckets": { "validated": 185072, "invalidated": 58503, "dev": 10626, "test": 10626, "train": 29406, "other": 596652 }, "duration": 3820765380, "reportedSentences": 862, "validatedSentences": 58781, "unvalidatedSentences": 79, "clips": 840227, "splits": { "accent": {}, "age": { "": 0.93, "twenties": 0.04, "thirties": 0.01, "teens": 0.01, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.93, "male_masculine": 0.06, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 840224, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 2, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 1 } }, "users": 7005, "size": 22625001945, "checksum": "b30f2ab0e3afd88ff93974ba7e24091955f150c49486293e1dcda52439ce587a", "avgDurationSecs": 4.547, 
"validDurationSecs": 841578.157, "totalHrs": 1061.32, "validHrs": 233.77 }, "zh-HK": { "buckets": { "validated": 92619, "invalidated": 4449, "dev": 5595, "test": 5595, "train": 8429, "other": 22107 }, "duration": 497622624, "reportedSentences": 683, "validatedSentences": 19732, "unvalidatedSentences": 26, "clips": 119175, "splits": { "accent": {}, "age": { "": 0.39, "twenties": 0.33, "thirties": 0.11, "teens": 0.02, "fourties": 0.12, "fifties": 0.02, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.36, "male_masculine": 0.42, "female_feminine": 0.22, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 119175, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 3025, "size": 3560699784, "checksum": "4bc64b2db61c3bf8451d9e45082fcb43d88a184c8f82a9ea93f14abfd6b7cc70", "avgDurationSecs": 4.176, "validDurationSecs": 386736.394, "totalHrs": 138.22, "validHrs": 107.42 }, "zh-TW": { "buckets": { "validated": 82483, "invalidated": 4788, "dev": 4982, "test": 4982, "train": 7113, "other": 47568 }, "duration": 453749062, "reportedSentences": 151, "validatedSentences": 19454, "unvalidatedSentences": 31, "clips": 134839, "splits": { "accent": {}, "age": { "": 0.27, "twenties": 0.3, "thirties": 0.19, "teens": 0.05, "fourties": 0.09, "fifties": 0.09, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.28, "male_masculine": 0.5, "female_feminine": 0.22, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 134839, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, 
"news_current_affairs": 0, "technology_robotics": 0 } }, "users": 2194, "size": 3043465319, "checksum": "1d8550f01215fb1e0ed8a1adac93b9a1551e212e1e74cf421f97708476fa8a37", "avgDurationSecs": 3.365, "validDurationSecs": 277564.977, "totalHrs": 126.04, "validHrs": 77.1 }, "zu": { "buckets": { "validated": 0, "invalidated": 0, "dev": 0, "test": 0, "train": 0, "other": 35 }, "duration": 178344, "reportedSentences": 0, "validatedSentences": 1320, "unvalidatedSentences": 19380, "clips": 35, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 35, "agriculture": 0, "automotive": 0, "finance": 0, "food_service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1, "size": 2480794, "checksum": "c138fa8fc8c4bf07c49fe2aa313406c2438bb64d0c9dc784b9585d9274c9a414", "avgDurationSecs": 5.096, "validDurationSecs": 0, "totalHrs": 0.04, "validHrs": 0 }, "zza": { "buckets": { "validated": 312, "invalidated": 45, "dev": 0, "test": 54, "train": 258, "other": 55 }, "duration": 1606968, "reportedSentences": 891, "validatedSentences": 5112, "unvalidatedSentences": 943, "clips": 412, "splits": { "accent": {}, "age": { "": 0, "twenties": 0, "thirties": 0.36, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0.64, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0, "male_masculine": 0.98, "female_feminine": 0.02, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 159, "agriculture": 1, "automotive": 1, "finance": 1, "food_service_retail": 4, "general": 222, "healthcare": 10, "history_law_government": 0, 
"language_fundamentals": 12, "media_entertainment": 0, "nature_environment": 2, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 4, "size": 9780731, "checksum": "9e9c166b73677681c37a58084348f6bd368dbd4414d4576e5265a94eaaeb18ff", "avgDurationSecs": 3.9, "validDurationSecs": 1216.927, "totalHrs": 0.44, "validHrs": 0.33 } }, "totalDuration": 112231196515, "totalValidDurationSecs": 73470947, "totalHrs": 31175, "totalValidHrs": 20408 }

================================================
FILE: datasets/scripted-speech/cv-corpus-17.0-delta-2024-03-15.json
================================================
{ "locales": { "ab": { "duration": 37008, "reportedSentences": 0, "clips": 10, "users": 2, "size": 79233873, "checksum": "5f2a3edff62a2cb20fbde11605b86124a8da9263e435cfe85506147efc05d8a2", "avgDurationSecs": 5.132, "validDurationSecs": 41.17, "totalHrs": 0.01, "validHrs": 0.01 }, "af": { "duration": 795996, "reportedSentences": 0, "clips": 131, "users": 9, "size": 6262344, "checksum": "ff5238c18b85e7cb60eae6de24917509f0cc61b164299141f80740df4c574264", "avgDurationSecs": 5.006, "validDurationSecs": 595.68, "totalHrs": 0.22, "validHrs": 0.17 }, "am": { "duration": 2118456, "reportedSentences": 0, "clips": 305, "users": 10, "size": 12771891, "checksum": "745ac58bd4dbe63a3d5f0c48f5d7a04af8cb227c1e197c28c7351134d3ed9118", "avgDurationSecs": 6.199, "validDurationSecs": 787.48, "totalHrs": 0.59, "validHrs": 0.22 }, "ar": { "duration": 1810800, "reportedSentences": 5, "clips": 463, "users": 24, "size": 141695815, "checksum": "1c40e8c30dd832aa62876f222ed53be556d29c62501e2e56b6559c7790de06c7", "avgDurationSecs": 4.158, "validDurationSecs": 519.87, "totalHrs": 0.51, "validHrs": 0.14 }, "as": { "duration": 6156, "reportedSentences": 0, "clips": 1, "users": 1, "size": 873852, "checksum": "a8d2e2951ae5ddee65d0af581c37e7671932edae31f9309e04942e5bbe01ec4c", "avgDurationSecs": 5.691, "validDurationSecs": 45.91, "totalHrs": 0, "validHrs": 0.01 }, "ast": { "duration": 189936,
"reportedSentences": 0, "clips": 40, "users": 2, "size": 1317119, "checksum": "00a3a29f183e688a7b7205d1e4d9a748cb0e3216f0f989bffd5b04e8c225bc43", "avgDurationSecs": 4.449, "validDurationSecs": 23.04, "totalHrs": 0.05, "validHrs": 0 }, "az": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 6330718, "checksum": "81211d60ab76bd4af9aa494bc9d2c2bb6c4d7f988d282f89b53ad144ec30247b", "avgDurationSecs": 5.382, "validDurationSecs": 123.78, "totalHrs": 0, "validHrs": 0.04 }, "ba": { "duration": 2388960, "reportedSentences": 0, "clips": 540, "users": 2, "size": 39129110, "checksum": "8371fc0b4b73c73a509a05d285ef96533112d9770850ad5fbaf9c026e04318b0", "avgDurationSecs": 4.426, "validDurationSecs": 228.95, "totalHrs": 0.66, "validHrs": 0.06 }, "bas": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 426395, "checksum": "d6ce0866a109310c58db0c99f5172e0e7abf4a9bedfb144f5e6a12a508667e16", "avgDurationSecs": 4.426, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "be": { "duration": 258122592, "reportedSentences": 7, "clips": 53734, "users": 60, "size": 1604360922, "checksum": "21ea3e18b9bfd5f7cf70b615867f06a84e77768819690f0db6ae0aa86cdc70d1", "avgDurationSecs": 4.761, "validDurationSecs": 252132.31, "totalHrs": 71.7, "validHrs": 70.03 }, "bg": { "duration": 1717776, "reportedSentences": 0, "clips": 322, "users": 17, "size": 14156751, "checksum": "36bfe12cfbe90f30b301acaa1bc19b12cc212095e6edecff6d1f64f26351fc35", "avgDurationSecs": 5.472, "validDurationSecs": 8839.24, "totalHrs": 0.47, "validHrs": 2.46 }, "bn": { "duration": 3722400, "reportedSentences": 1, "clips": 662, "users": 16, "size": 116239340, "checksum": "add341d2ad3392a56c71a429bace4c6ea658e4d77a2e82664d5795ad5d54814b", "avgDurationSecs": 4.366, "validDurationSecs": 462.83, "totalHrs": 1.03, "validHrs": 0.13 }, "br": { "duration": 1702764, "reportedSentences": 0, "clips": 504, "users": 6, "size": 11590875, "checksum": 
"8aeb892de3b66841a700ee9744d40c5931c1a8812add8c93c52b69f620561949", "avgDurationSecs": 3.181, "validDurationSecs": 20452.19, "totalHrs": 0.47, "validHrs": 5.68 }, "ca": { "duration": 314411796, "reportedSentences": 149, "clips": 67546, "users": 223, "size": 2008948494, "checksum": "2807fcd5a34287f4cb4370d13308fe6940c104a9f47513eec79d7702b3a1ddfb", "avgDurationSecs": 5.323, "validDurationSecs": 103384.67, "totalHrs": 87.34, "validHrs": 28.71 }, "ckb": { "duration": 148087440, "reportedSentences": 33, "clips": 33166, "users": 418, "size": 849040858, "checksum": "e0ff2ba2aebf0b862978e11cb4450ed4c92112f9bcae2785ddfa69efe8eb8309", "avgDurationSecs": 4.026, "validDurationSecs": 63594.46, "totalHrs": 41.14, "validHrs": 17.67 }, "cnh": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 604823, "checksum": "55bbb3df292c725f8f1eaeb0e885954babdfa88d4f96bac9b0f7bce790b81321", "avgDurationSecs": 3.516, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "cs": { "duration": 6249816, "reportedSentences": 4, "clips": 1423, "users": 26, "size": 76644643, "checksum": "7c854be87bd0948a7ef0e000676e2cf715c470af89265b7f75c4f4671501824b", "avgDurationSecs": 4.462, "validDurationSecs": 1613.14, "totalHrs": 1.73, "validHrs": 0.45 }, "cv": { "duration": 24228, "reportedSentences": 0, "clips": 5, "users": 0, "size": 1467697, "checksum": "264dfca45a352803f31c540010c26d3cb33bb3576da92f7d22be864a43e1c7a9", "avgDurationSecs": 5.059, "validDurationSecs": 85.06, "totalHrs": 0.01, "validHrs": 0.02 }, "cy": { "duration": 5243112, "reportedSentences": 0, "clips": 824, "users": 52, "size": 43794367, "checksum": "c17b54ba654ee667e6dfa8fba4c7b00b600c8e6f80a0bff94897ddf11ce70433", "avgDurationSecs": 4.903, "validDurationSecs": 3100.24, "totalHrs": 1.46, "validHrs": 0.86 }, "da": { "duration": 332892, "reportedSentences": 0, "clips": 97, "users": 5, "size": 3954275, "checksum": "5bb5816e18046fdd1cb7ab2d4c8091f1aeb7e7e8f63211396e1775f025bb9473", "avgDurationSecs": 4.116, 
"validDurationSecs": 263.56, "totalHrs": 0.09, "validHrs": 0.07 }, "de": { "duration": 76406580, "reportedSentences": 459, "clips": 13971, "users": 395, "size": 655390999, "checksum": "aa9375ad31bdde6bc4729968f94f82e2ca01fc1a74be67dc3e48b1476514fb14", "avgDurationSecs": 5.235, "validDurationSecs": 70798.1, "totalHrs": 21.22, "validHrs": 19.67 }, "dv": { "duration": 223164, "reportedSentences": 0, "clips": 39, "users": 8, "size": 3977281, "checksum": "e02abcbd77ecce584dec3fb827860eeb0a2ac1015274821d601d4b3d99c4bed4", "avgDurationSecs": 5.063, "validDurationSecs": 324.38, "totalHrs": 0.06, "validHrs": 0.09 }, "dyu": { "duration": 304812, "reportedSentences": 0, "clips": 48, "users": 13, "size": 1650865, "checksum": "3e8b46b42c2ddc0874fb49a74bec9ec3d6e1a3d512ea1a41799e9acf53f9771b", "avgDurationSecs": 6.179, "validDurationSecs": 68.14, "totalHrs": 0.08, "validHrs": 0.01 }, "el": { "duration": 1101420, "reportedSentences": 0, "clips": 239, "users": 12, "size": 8365260, "checksum": "8fca51270b8d5010d08aeae556c6d1d408a3c5b1fa72b416a53df420b66428e4", "avgDurationSecs": 4.143, "validDurationSecs": 596.2, "totalHrs": 0.3, "validHrs": 0.17 }, "en": { "duration": 251015400, "reportedSentences": 723, "clips": 43178, "users": 1851, "size": 1722606033, "checksum": "e55889fb825803d8eea9deaddd7cae1421470464d892d75dcda670477ce2cb56", "avgDurationSecs": 5.232, "validDurationSecs": 105063.03, "totalHrs": 69.72, "validHrs": 29.18 }, "eo": { "duration": 16839684, "reportedSentences": 5, "clips": 2718, "users": 25, "size": 164453207, "checksum": "8960050ea5d0c41d7f74cef4bc1d2fafef82be124bc7dcda2f2ca32de0b12814", "avgDurationSecs": 6.067, "validDurationSecs": 2138.25, "totalHrs": 4.68, "validHrs": 0.59 }, "es": { "duration": 54441360, "reportedSentences": 46, "clips": 11205, "users": 347, "size": 467608112, "checksum": "b08770c585ea59fb7dcc7ddc87f5dfca986e85c4310a20591fe93f4ad3592f6a", "avgDurationSecs": 4.872, "validDurationSecs": 46220.93, "totalHrs": 15.12, "validHrs": 12.84 }, "et": 
{ "duration": 96120, "reportedSentences": 1, "clips": 15, "users": 3, "size": 3125672, "checksum": "34cbc147766055cd03b25e89a6b4157b36df161ce75805224a349f6f6cab0ce1", "avgDurationSecs": 6.77, "validDurationSecs": 151.56, "totalHrs": 0.03, "validHrs": 0.05 }, "eu": { "duration": 322231608, "reportedSentences": 257, "clips": 56564, "users": 1433, "size": 1884914724, "checksum": "119700995a84de9de3525c642b61b69e7644975400805a0151dc0a1108bbbc4b", "avgDurationSecs": 5.496, "validDurationSecs": 194193.48, "totalHrs": 89.51, "validHrs": 53.94 }, "fa": { "duration": 22155912, "reportedSentences": 74, "clips": 6241, "users": 29, "size": 162157437, "checksum": "55f1c80567dda595f6cb86e8a00457460bebc259d946d49f91277c7beb3b63c1", "avgDurationSecs": 3.98, "validDurationSecs": -681.24, "totalHrs": 6.16, "validHrs": -0.19 }, "fi": { "duration": 7021584, "reportedSentences": 11, "clips": 1601, "users": 5, "size": 44675471, "checksum": "7db7efaa6762bd5b5ba6bde727d9c1f1d0adc89c1cf54db5fc6c5886ab051ea5", "avgDurationSecs": 4.601, "validDurationSecs": 5049.95, "totalHrs": 1.95, "validHrs": 1.41 }, "fr": { "duration": 125509032, "reportedSentences": 219, "clips": 23441, "users": 972, "size": 893222805, "checksum": "258cbf940ab0f0cfa5982dc7e57afcfb5138e16d27484b88a51f1375665d48b3", "avgDurationSecs": 5.023, "validDurationSecs": 90105.56, "totalHrs": 34.86, "validHrs": 25.03 }, "fy-NL": { "duration": 2742264, "reportedSentences": 6, "clips": 584, "users": 11, "size": 23364549, "checksum": "287fe058384fdd7a933a6c2618eeb28bb139c4e6165985670aeda47fddb79e4b", "avgDurationSecs": 4.854, "validDurationSecs": 489.36, "totalHrs": 0.76, "validHrs": 0.13 }, "ga-IE": { "duration": 678816, "reportedSentences": 0, "clips": 172, "users": 7, "size": 3443502, "checksum": "20c1cfc6ecf9adf3a652550152c9c6eff8fd53e0d50ee35191aae25073798c55", "avgDurationSecs": 3.591, "validDurationSecs": 118.62, "totalHrs": 0.19, "validHrs": 0.03 }, "gl": { "duration": 245194596, "reportedSentences": 177, "clips": 45617, 
"users": 2094, "size": 1442577362, "checksum": "500f010587ecee66e73391c3c29f6d3bb38b40eec8d1de79f7ddbefcb796d40c", "avgDurationSecs": 5.157, "validDurationSecs": 66114.28, "totalHrs": 68.11, "validHrs": 18.37 }, "gn": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 1341243, "checksum": "0d26c489f69afc44705820f90d6d31f42cef3d1149d43cd7bfb0a4461df67263", "avgDurationSecs": 4.541, "validDurationSecs": 4.54, "totalHrs": 0, "validHrs": 0 }, "ha": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 868260, "checksum": "b01a4b640c9f3bbc77e41f181bece28f8c676432e193120f10ff3493383d49d6", "avgDurationSecs": 4.333, "validDurationSecs": 4.33, "totalHrs": 0, "validHrs": 0 }, "he": { "duration": 9273960, "reportedSentences": 7, "clips": 2096, "users": 7, "size": 54667472, "checksum": "c91dd4b45ee2b402b223a87d5cc21e482a2735a4ac5c47e85ba90f2367b3609c", "avgDurationSecs": 4.459, "validDurationSecs": 3092.97, "totalHrs": 2.58, "validHrs": 0.86 }, "hi": { "duration": 2305692, "reportedSentences": 0, "clips": 405, "users": 11, "size": 16634274, "checksum": "e02967c30a19db302523ece3a7cb9ad1f0bd9c0a89ded5b8c674f21e1dffbbd3", "avgDurationSecs": 4.918, "validDurationSecs": 464.68, "totalHrs": 0.64, "validHrs": 0.13 }, "hsb": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 655851, "checksum": "d909e55268e58d0357e262e38e996554abfe1288c3965abbfb1c8ad072789dfb", "avgDurationSecs": 6.16, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "hu": { "duration": 8983332, "reportedSentences": 29, "clips": 1576, "users": 19, "size": 81693906, "checksum": "2a30eed0e8db5f3cd8e8c7de41813887e6e78766dc3f8d7af05260cfd5941695", "avgDurationSecs": 5.526, "validDurationSecs": 3110.05, "totalHrs": 2.49, "validHrs": 0.87 }, "hy-AM": { "duration": 85534380, "reportedSentences": 81, "clips": 15875, "users": 166, "size": 514240775, "checksum": "6773f9a6037ef1f08898c37e20ba29f07b1ce8a3000087734d4ad940450d7297", "avgDurationSecs": 5.413, 
"validDurationSecs": 29226.71, "totalHrs": 23.76, "validHrs": 8.12 }, "ia": { "duration": 310464, "reportedSentences": 1, "clips": 57, "users": 1, "size": 3256182, "checksum": "bd055acff157438a9e53b5f108bd1b1d069bab87a6f257347860418dac0e04df", "avgDurationSecs": 4.191, "validDurationSecs": 635.55, "totalHrs": 0.09, "validHrs": 0.18 }, "id": { "duration": 2528244, "reportedSentences": 5, "clips": 607, "users": 27, "size": 19095511, "checksum": "f207cc8a5e310d6c2bac457844c7a9291b2134a1334813490b47dc0ac96f3428", "avgDurationSecs": 3.99, "validDurationSecs": 534.74, "totalHrs": 0.7, "validHrs": 0.15 }, "ig": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 1319543, "checksum": "48bc023632c406c94099dada32d9d53e36d94d7141c8e2cd10e043e729a48c7d", "avgDurationSecs": 5.462, "validDurationSecs": 5.46, "totalHrs": 0, "validHrs": 0 }, "is": { "duration": 174024, "reportedSentences": 0, "clips": 25, "users": 2, "size": 1406180, "checksum": "0d51a20909da4fe9abaf831026aca230ba869a1fe53613d5cf34af04d7479156", "avgDurationSecs": 7.101, "validDurationSecs": 4.07, "totalHrs": 0.04, "validHrs": 0 }, "it": { "duration": 26321436, "reportedSentences": 15, "clips": 4579, "users": 83, "size": 235988394, "checksum": "2c437c7dc197f5e7240318e366a4bf5c602ed0686f7f7452515504bba172cce0", "avgDurationSecs": 5.397, "validDurationSecs": 8471.79, "totalHrs": 7.31, "validHrs": 2.36 }, "ja": { "duration": 85621464, "reportedSentences": 19, "clips": 22180, "users": 260, "size": 515200124, "checksum": "c3f0f36bd61863bded8b53462934777bc63c8264ecf400d93c72c7c1eee6562d", "avgDurationSecs": 4.811, "validDurationSecs": 10102.57, "totalHrs": 23.78, "validHrs": 2.81 }, "ka": { "duration": 189906948, "reportedSentences": 107, "clips": 36362, "users": 386, "size": 1127948194, "checksum": "7f8a17abce169528f9da00025318287f1423520f2d71b1c9f8499313d3fec9b0", "avgDurationSecs": 5.139, "validDurationSecs": 65338.9, "totalHrs": 52.75, "validHrs": 18.15 }, "kab": { "duration": 3447180, 
"reportedSentences": 6, "clips": 810, "users": 12, "size": 79676546, "checksum": "728a88c25196665616e33a1989936952fb20026abffac6d2a86ce37a7fa3d0e8", "avgDurationSecs": 3.339, "validDurationSecs": 1338.39, "totalHrs": 0.96, "validHrs": 0.37 }, "kk": { "duration": 410508, "reportedSentences": 0, "clips": 75, "users": 10, "size": 2899628, "checksum": "814840fa69d73063c8243613e77a020879e4bf825d7fba2f856fec602381fb95", "avgDurationSecs": 4.919, "validDurationSecs": 119.68, "totalHrs": 0.11, "validHrs": 0.03 }, "kmr": { "duration": 5466240, "reportedSentences": 19, "clips": 1070, "users": 9, "size": 36127921, "checksum": "edb750e4af5cfb4babe311d0400a4d136dcd97e1db8e305b4398091fc3139eb3", "avgDurationSecs": 4.069, "validDurationSecs": 28536.94, "totalHrs": 1.52, "validHrs": 7.93 }, "ko": { "duration": 2236824, "reportedSentences": 1, "clips": 385, "users": 16, "size": 13922717, "checksum": "ea1ccc118693b2327e68cc7beedf84220251dbb17e6725dcab0a698f0012260a", "avgDurationSecs": 5.944, "validDurationSecs": 739.02, "totalHrs": 0.62, "validHrs": 0.2 }, "ky": { "duration": 6157368, "reportedSentences": 0, "clips": 1261, "users": 11, "size": 37950190, "checksum": "d87252d289131c291636c636f28fe7ce1eebc200822d29131a436eff29fc0671", "avgDurationSecs": 4.554, "validDurationSecs": 1046.2, "totalHrs": 1.71, "validHrs": 0.29 }, "lg": { "duration": 720108, "reportedSentences": 0, "clips": 103, "users": 1, "size": 34802648, "checksum": "7b8fdeb24561acf5a23597959f88105973ad023d05478235885fcc2935c36837", "avgDurationSecs": 5.784, "validDurationSecs": 160.77, "totalHrs": 0.2, "validHrs": 0.05 }, "lij": { "duration": 2208672, "reportedSentences": 6, "clips": 571, "users": 7, "size": 13342691, "checksum": "58fcc3bb8fa69672f45ea065ea133dbd7f4957ea55aacb47b4fc522d9e1d905a", "avgDurationSecs": 3.63, "validDurationSecs": 2095.04, "totalHrs": 0.62, "validHrs": 0.58 }, "lo": { "duration": 108036, "reportedSentences": 0, "clips": 17, "users": 3, "size": 1522762, "checksum": 
"0b05554b4d9d3c48a71ff91983e1640f21db1427caf96f5a6fef343781b73745", "avgDurationSecs": 6.453, "validDurationSecs": 50.68, "totalHrs": 0.03, "validHrs": 0.01 }, "lt": { "duration": 121464, "reportedSentences": 2, "clips": 22, "users": 4, "size": 10865295, "checksum": "7a5c42b55ce00ac9e712741b4a8af64d57a502933a89192e93c9491caa7202b4", "avgDurationSecs": 5.131, "validDurationSecs": 854.57, "totalHrs": 0.03, "validHrs": 0.23 }, "ltg": { "duration": 31188276, "reportedSentences": 17, "clips": 5847, "users": 81, "size": 183274703, "checksum": "09dc5b437e248956f3e76000bee1138b7ca4c537354a2818b49504a8097ee981", "avgDurationSecs": 4.331, "validDurationSecs": 42149.36, "totalHrs": 8.66, "validHrs": 11.71 }, "lv": { "duration": 112137120, "reportedSentences": 52, "clips": 17367, "users": 538, "size": 665343273, "checksum": "bea5a56df5917c9cc060425adb87c1fa4fce509dda2174ed1aa1a39e7e22119c", "avgDurationSecs": 4.666, "validDurationSecs": 180294.4, "totalHrs": 31.15, "validHrs": 50.08 }, "mdf": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 417471, "checksum": "dd15d00a557a0f84a98c9bff621a8c950df7a317167d65e00b63c16e7c01ebd0", "avgDurationSecs": 5.25, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "mhr": { "duration": 75846744, "reportedSentences": 2, "clips": 17313, "users": 22, "size": 472480403, "checksum": "43cb2e7eba1cb0966bc2aaf494e3606d87a890c377c6a41abd5ec41ac31bb8b0", "avgDurationSecs": 4.623, "validDurationSecs": 54893.88, "totalHrs": 21.07, "validHrs": 15.25 }, "mk": { "duration": 81263988, "reportedSentences": 39, "clips": 18830, "users": 11, "size": 439598247, "checksum": "79c3ffa5f2bed57032c86fe7f585842a63b75a93b5a3dc19617cedc16815587a", "avgDurationSecs": 4.327, "validDurationSecs": 27127.22, "totalHrs": 22.57, "validHrs": 7.53 }, "ml": { "duration": 848556, "reportedSentences": 6, "clips": 203, "users": 3, "size": 6257932, "checksum": "54270b0e8d0dded3909ee3015b0ea132d7b2e1e95639eedeed63011230f83544", "avgDurationSecs": 4.178, 
"validDurationSecs": 618.46, "totalHrs": 0.23, "validHrs": 0.17 }, "mn": { "duration": 6089580, "reportedSentences": 0, "clips": 1207, "users": 13, "size": 37188431, "checksum": "0b5f23b70caa3f6baa14f6b2083f5a26a7416902c6469dc237f5a16a6cf093e7", "avgDurationSecs": 5.416, "validDurationSecs": 370.9, "totalHrs": 1.69, "validHrs": 0.1 }, "mr": { "duration": 34848, "reportedSentences": 0, "clips": 5, "users": 1, "size": 2709130, "checksum": "dd614a235d279de2d7c8b0b4826322482718f2a84d3390f249ca5359a56f76da", "avgDurationSecs": 6.194, "validDurationSecs": 89.36, "totalHrs": 0.01, "validHrs": 0.02 }, "mrj": { "duration": 6052392, "reportedSentences": 0, "clips": 1369, "users": 2, "size": 41485635, "checksum": "0f6e6e873f5c1e2f518692a6dab4ee4a294d40f7c24d09b8b4fc81f8ac63b734", "avgDurationSecs": 4.187, "validDurationSecs": 1529.86, "totalHrs": 1.69, "validHrs": 0.43 }, "mt": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 1891957, "checksum": "2d446e705d0dd4974f9c0e262bf61f1e632cb5a459bde97eec48152018e66344", "avgDurationSecs": 4.74, "validDurationSecs": 37.92, "totalHrs": 0, "validHrs": 0.01 }, "myv": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 543689, "checksum": "c2ffa5b18443c31805c6cd0fc90d04962d7130d0b625ba07e1a5fe9166eb3afd", "avgDurationSecs": 5.774, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "nan-tw": { "duration": 843984, "reportedSentences": 0, "clips": 257, "users": 4, "size": 7861646, "checksum": "d25102e84eb15a3554735146d82c48df63e72a8c699b76251cf88584dda8fe29", "avgDurationSecs": 2.599, "validDurationSecs": 491.11, "totalHrs": 0.24, "validHrs": 0.14 }, "ne-NP": { "duration": 670860, "reportedSentences": 0, "clips": 149, "users": 6, "size": 4110764, "checksum": "bcc4230ea3c5777b83f8e0d598ab8848b108c31d417a499abbfe335947f5c94f", "avgDurationSecs": 3.962, "validDurationSecs": 141.29, "totalHrs": 0.19, "validHrs": 0.04 }, "nhi": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, 
"size": 45788, "checksum": "189b6497fb4d2d1f9a9c520bf9a5dd2c8568cb12fdd79298b42ac3850c205717", "avgDurationSecs": 4.719, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "nl": { "duration": 4442112, "reportedSentences": 1, "clips": 868, "users": 46, "size": 48313264, "checksum": "79bc529eaf3fd2695ca7b8ff447b709bf3c2d6811b49b535caa1a01224230604", "avgDurationSecs": 4.358, "validDurationSecs": 2149.43, "totalHrs": 1.23, "validHrs": 0.59 }, "nn-NO": { "duration": 209052, "reportedSentences": 0, "clips": 43, "users": 3, "size": 1608599, "checksum": "955a49e3001a03c781ba40335cf5548b16501d553b3f28c02ddda8c446506ec0", "avgDurationSecs": 4.38, "validDurationSecs": 607.49, "totalHrs": 0.06, "validHrs": 0.17 }, "oc": { "duration": 961632, "reportedSentences": 0, "clips": 201, "users": 0, "size": 6356339, "checksum": "7e33c9deecd5d18ed17e0369b34f22d0d318d41e87e6be2d806b4f9c1c85818c", "avgDurationSecs": 4.872, "validDurationSecs": 1882.98, "totalHrs": 0.27, "validHrs": 0.52 }, "or": { "duration": 330480, "reportedSentences": 0, "clips": 60, "users": 0, "size": 2887015, "checksum": "f0944a2538aefe29a298fdd1deb3bcf512f884ba9e95bc8a4da8d94511c5a835", "avgDurationSecs": 5.013, "validDurationSecs": 25.54, "totalHrs": 0.09, "validHrs": 0.01 }, "os": { "duration": 73692, "reportedSentences": 0, "clips": 12, "users": 1, "size": 593854, "checksum": "5bcf89d5a4747d2711130500a4473e31e794dcbb1148141d8d93bedd3e3f6349", "avgDurationSecs": 5.602, "validDurationSecs": 251.26, "totalHrs": 0.02, "validHrs": 0.07 }, "pa-IN": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 1136782, "checksum": "6f7c8358d6eaf0bae2ec8c2b6dc37b5ad870798f0860b59117fa3d0edaae592b", "avgDurationSecs": 4.828, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "pl": { "duration": 5422536, "reportedSentences": 0, "clips": 1001, "users": 45, "size": 56227506, "checksum": "aa17b3c5e549540b0916e85be1ccf8e2d27292062aaf4baf2a509b918c0db4ed", "avgDurationSecs": 4.524, "validDurationSecs": 
6392.25, "totalHrs": 1.5, "validHrs": 1.78 }, "ps": { "duration": 1420056, "reportedSentences": 0, "clips": 260, "users": 1, "size": 8616095, "checksum": "150715041110fb250ca7565276d21cc8431e311a3b6978799f9846f38fd185f0", "avgDurationSecs": 4.977, "validDurationSecs": 131.43, "totalHrs": 0.4, "validHrs": 0.04 }, "pt": { "duration": 13131108, "reportedSentences": 43, "clips": 2623, "users": 116, "size": 85364140, "checksum": "3ae8ff0c3734d3615005cb2e20d4d39bef62ff92493bbee1725a268287d44340", "avgDurationSecs": 4.108, "validDurationSecs": 9340.06, "totalHrs": 3.65, "validHrs": 2.6 }, "quy": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 63528, "checksum": "fad84f8b35973a76bb1ebc31430bb09a4581c9f4cd37400007c34ea4acf1e7dc", "avgDurationSecs": 6.132, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "rm-sursilv": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 1449266, "checksum": "4fb32e3c094a8e6401419d762b4cbfffdf2835246fbe9e68a87f63c487b814bc", "avgDurationSecs": 5.464, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "rm-vallader": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 645721, "checksum": "d9daac60e82d508e60ebb2bbf817a157403931f2db8a25d6d12ecacd2c21fa51", "avgDurationSecs": 5.826, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ro": { "duration": 3178980, "reportedSentences": 12, "clips": 804, "users": 11, "size": 26399360, "checksum": "4ca54b652b4697426927b3df04cd06479b67c8de01c20f464a202da112435cd4", "avgDurationSecs": 4.03, "validDurationSecs": 840.26, "totalHrs": 0.88, "validHrs": 0.23 }, "ru": { "duration": 35404344, "reportedSentences": 20, "clips": 6731, "users": 80, "size": 221530722, "checksum": "3ebfecaac4d8961210c863f8422143717867623b5d8037319b69d39f102e31dc", "avgDurationSecs": 5.166, "validDurationSecs": 7545, "totalHrs": 9.84, "validHrs": 2.09 }, "rw": { "duration": 60984, "reportedSentences": 0, "clips": 21, "users": 3, "size": 196923375, 
"checksum": "fe916bcdc0e5fd4eaea828a8c691fe5916293d85cdd09fbab97e724d4ac03566", "avgDurationSecs": 5.008, "validDurationSecs": 118.15, "totalHrs": 0.02, "validHrs": 0.03 }, "sah": { "duration": 4698288, "reportedSentences": 6, "clips": 751, "users": 12, "size": 28398463, "checksum": "2cd79209324d913f1e5eb194ec31fafd33f897be0e58c8a0207c38a027b8dbd9", "avgDurationSecs": 6.134, "validDurationSecs": 311.86, "totalHrs": 1.31, "validHrs": 0.08 }, "sat": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 327169, "checksum": "bab1d9df6d0f3677c16072037a0018ddb44fffb829b5b7d299df45071f91d8b0", "avgDurationSecs": 4.462, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "sc": { "duration": 113040, "reportedSentences": 0, "clips": 26, "users": 0, "size": 1109815, "checksum": "61cb3789fb37f2691bb63445fc6644c3997061c2f48591bec7fe9770e226cb92", "avgDurationSecs": 4.507, "validDurationSecs": 73.42, "totalHrs": 0.03, "validHrs": 0.02 }, "sk": { "duration": 1969344, "reportedSentences": 0, "clips": 475, "users": 8, "size": 11796666, "checksum": "fbeb5775eadce28f77c76e48a8d49ee72cd5d7bf06081f678ece65ac9a625134", "avgDurationSecs": 4.078, "validDurationSecs": 781.23, "totalHrs": 0.55, "validHrs": 0.22 }, "skr": { "duration": 30816, "reportedSentences": 0, "clips": 5, "users": 1, "size": 932514, "checksum": "7c7089f2216a20ceb6d37a1559c34bb1b65794bcdd26f7e9879dfa774983859e", "avgDurationSecs": 4.143, "validDurationSecs": 56.15, "totalHrs": 0.01, "validHrs": 0.02 }, "sl": { "duration": 845964, "reportedSentences": 1, "clips": 216, "users": 5, "size": 5899183, "checksum": "24b258a9808a8c6cba01fbd996925daff3add221d8a82bc3a5512a77f274bf31", "avgDurationSecs": 3.787, "validDurationSecs": 366, "totalHrs": 0.23, "validHrs": 0.11 }, "sq": { "duration": 1350648, "reportedSentences": 0, "clips": 277, "users": 10, "size": 8518505, "checksum": "92fb6fcc6f45283195a9dcfb9fc81335efcd98cfcf65cc63b15dbf2a6e390b99", "avgDurationSecs": 4.585, "validDurationSecs": 1326.77, 
"totalHrs": 0.37, "validHrs": 0.37 }, "sr": { "duration": 1194840, "reportedSentences": 7, "clips": 270, "users": 4, "size": 7905808, "checksum": "b3c312892664d2509dcccaa8804cd3d88cb56ec23b80d713268120c9a06cb60c", "avgDurationSecs": 3.025, "validDurationSecs": 995.87, "totalHrs": 0.33, "validHrs": 0.27 }, "sv-SE": { "duration": 1969884, "reportedSentences": 3, "clips": 400, "users": 13, "size": 15900073, "checksum": "aeb753e62cca50d3ec178840b87b096f10756563e66f936d19c55d758a89af44", "avgDurationSecs": 4.008, "validDurationSecs": 1268.03, "totalHrs": 0.54, "validHrs": 0.35 }, "sw": { "duration": 15388596, "reportedSentences": 0, "clips": 2981, "users": 19, "size": 128486311, "checksum": "273c7e417cb8a1538e62a03211da1160fffb8e8f9db155c7a40e4694a20af06c", "avgDurationSecs": 5.386, "validDurationSecs": 8566.51, "totalHrs": 4.27, "validHrs": 2.37 }, "ta": { "duration": 35350056, "reportedSentences": 4, "clips": 6801, "users": 20, "size": 230442197, "checksum": "3d48a3ac0610b2a3c26f551a835f84d3da6caba7226e81bdcd5262cdbbaef260", "avgDurationSecs": 6.185, "validDurationSecs": 1245.6, "totalHrs": 9.82, "validHrs": 0.35 }, "te": { "duration": 6620688, "reportedSentences": 0, "clips": 1628, "users": 28, "size": 40088883, "checksum": "6fa35a1e280f77fa05c028780149148d5618aaf983af01a86a7ba5e79466534d", "avgDurationSecs": 4.189, "validDurationSecs": 433.42, "totalHrs": 1.84, "validHrs": 0.12 }, "th": { "duration": 7343352, "reportedSentences": 16, "clips": 1845, "users": 31, "size": 71423590, "checksum": "a46355d7dd52a7149f7147add27c3759e0a4881f995f6d52fcad7d8d5d93fed3", "avgDurationSecs": 4.19, "validDurationSecs": 1623.44, "totalHrs": 2.04, "validHrs": 0.45 }, "ti": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 328661, "checksum": "9474b4da73b8081802d766b04358162e1deeb2cae31ae71cbc4029f84431e616", "avgDurationSecs": 5.486, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "tig": { "duration": 8046684, "reportedSentences": 1, "clips": 1252, 
"users": 15, "size": 40792238, "checksum": "d93c7932eb3d94055d3a4efe774571780e42b0f07d44222aa2235e39602d4525", "avgDurationSecs": 6.146, "validDurationSecs": 2418.43, "totalHrs": 2.23, "validHrs": 0.67 }, "tk": { "duration": 4738500, "reportedSentences": 9, "clips": 860, "users": 30, "size": 25761161, "checksum": "f8f281f92ba8d98962d274f68a105c0052e8d29adf4728da5b54f61651972f07", "avgDurationSecs": 5.41, "validDurationSecs": 556.52, "totalHrs": 1.31, "validHrs": 0.15 }, "tok": { "duration": 6977988, "reportedSentences": 1, "clips": 1251, "users": 11, "size": 42462031, "checksum": "6594bd66e121f95beafd87afb307de9fc55b41b2ebd493cf9c261fccc73964c1", "avgDurationSecs": 4.172, "validDurationSecs": 2739.83, "totalHrs": 1.94, "validHrs": 0.76 }, "tr": { "duration": 17910252, "reportedSentences": 8, "clips": 2983, "users": 46, "size": 141447461, "checksum": "8bd6c15db1beb24e17fbfa1b6b45f3a4cb31314d0ac709b6ac8ba3561355ba18", "avgDurationSecs": 3.702, "validDurationSecs": 17201.01, "totalHrs": 4.98, "validHrs": 4.78 }, "tt": { "duration": 1024128, "reportedSentences": 0, "clips": 180, "users": 7, "size": 8495362, "checksum": "a6dff40a51c72145c0ad80548e5519781a60707e3126354cb5c9f51fbbbe4d18", "avgDurationSecs": 3.777, "validDurationSecs": 727.01, "totalHrs": 0.29, "validHrs": 0.2 }, "tw": { "duration": 31608, "reportedSentences": 0, "clips": 5, "users": 1, "size": 2954772, "checksum": "0e563467bf3abe711fc3e4a3f15d456d1f10788cd90383335797b916f3f611c3", "avgDurationSecs": 4.031, "validDurationSecs": 361.46, "totalHrs": 0.01, "validHrs": 0.1 }, "ug": { "duration": 204442956, "reportedSentences": 127, "clips": 36126, "users": 75, "size": 1164302626, "checksum": "92fdb8439a55bec314d843a6e523e57512df2ae718aef4e2bca60b48e677a525", "avgDurationSecs": 5.916, "validDurationSecs": 190699.28, "totalHrs": 56.79, "validHrs": 52.97 }, "uk": { "duration": 15984864, "reportedSentences": 9, "clips": 3852, "users": 18, "size": 112093501, "checksum": 
"8a4b7c53d6853035cf7c2ca371c4faea2268dad6a8de117863b2c7f76e790b4d", "avgDurationSecs": 4.647, "validDurationSecs": 4901.96, "totalHrs": 4.44, "validHrs": 1.36 }, "ur": { "duration": 371520, "reportedSentences": 0, "clips": 68, "users": 7, "size": 15299708, "checksum": "08b9f08f6356bed138fb16e14d13ac93df414b7f38eceb37ab8fe94b626e66aa", "avgDurationSecs": 4.246, "validDurationSecs": 48.17, "totalHrs": 0.1, "validHrs": 0.02 }, "uz": { "duration": 649872, "reportedSentences": 1, "clips": 122, "users": 19, "size": 35108584, "checksum": "af34005bdc1e4ed6f58a3f1101cfae161fc25e3b8fefeb35a42f3af2d7f10921", "avgDurationSecs": 4.15, "validDurationSecs": 295.04, "totalHrs": 0.18, "validHrs": 0.08 }, "vi": { "duration": 194976, "reportedSentences": 0, "clips": 47, "users": 8, "size": 2634451, "checksum": "da04c83c35beed10770aa4f445fb16dd969e22490eb4255789018f36867fa3a4", "avgDurationSecs": 3.963, "validDurationSecs": 632.64, "totalHrs": 0.06, "validHrs": 0.18 }, "vot": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 270684, "checksum": "655cc3be46df0baba6a1f9a60f686a17f1039fed22e0bf264259a2dba8f745bf", "avgDurationSecs": 2.412, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "yi": { "duration": 20556, "reportedSentences": 0, "clips": 5, "users": 1, "size": 220392, "checksum": "40b639d24296e0f9aa09a58e44abe61b49372534a271481100eebe7a90607a3b", "avgDurationSecs": 3.815, "validDurationSecs": 16.71, "totalHrs": 0.01, "validHrs": 0 }, "yo": { "duration": 0, "reportedSentences": 0, "clips": 0, "users": 0, "size": 696298, "checksum": "94af710507bc62b169adee9e7d60f76533525c47d0e8e396d27afa44c0095876", "avgDurationSecs": 5.936, "validDurationSecs": 11.87, "totalHrs": 0, "validHrs": 0 }, "yue": { "duration": 10833912, "reportedSentences": 27, "clips": 2941, "users": 16, "size": 71179394, "checksum": "f5c51d5279a03a4cc756def51168beb96cfe85d08ad811e4cfa90c2b37123a76", "avgDurationSecs": 3.873, "validDurationSecs": 2290.05, "totalHrs": 3.01, "validHrs": 0.63 
}, "zgh": { "duration": 4374900, "reportedSentences": 3, "clips": 1314, "users": 12, "size": 23890528, "checksum": "f8043616bf12c9d4e821748be684fd75d9a0d6e909602fc75c19feaaced8121b", "avgDurationSecs": 3.325, "validDurationSecs": 972.86, "totalHrs": 1.21, "validHrs": 0.27 }, "zh-CN": { "duration": 6056964, "reportedSentences": 71, "clips": 1543, "users": 67, "size": 75107270, "checksum": "b30f2ab0e3afd88ff93974ba7e24091955f150c49486293e1dcda52439ce587a", "avgDurationSecs": 4.547, "validDurationSecs": 1912.39, "totalHrs": 1.68, "validHrs": 0.53 }, "zh-HK": { "duration": 802908, "reportedSentences": 0, "clips": 221, "users": 11, "size": 11151327, "checksum": "4bc64b2db61c3bf8451d9e45082fcb43d88a184c8f82a9ea93f14abfd6b7cc70", "avgDurationSecs": 4.176, "validDurationSecs": 658.43, "totalHrs": 0.22, "validHrs": 0.18 }, "zh-TW": { "duration": 1832436, "reportedSentences": 0, "clips": 476, "users": 18, "size": 17971956, "checksum": "1d8550f01215fb1e0ed8a1adac93b9a1551e212e1e74cf421f97708476fa8a37", "avgDurationSecs": 3.365, "validDurationSecs": 767.18, "totalHrs": 0.51, "validHrs": 0.22 } }, "totalDuration": 3047441256, "totalValidDurationSecs": 1775485, "totalHrs": 847, "totalValidHrs": 493 } ================================================ FILE: datasets/scripted-speech/cv-corpus-18.0-2024-06-14.json ================================================ { "locales": { "ab": { "buckets": { "dev": 9160, "test": 9117, "train": 21027, "validated": 42011, "invalidated": 5280, "other": 11939 }, "duration": 303992352, "reportedSentences": 225, "validatedSentences": 1046778, "unvalidatedSentences": 5, "clips": 59230, "splits": { "accent": {}, "age": { "": 0.18, "twenties": 0.18, "thirties": 0.13, "teens": 0.28, "fourties": 0.09, "fifties": 0.06, "sixties": 0.05, "seventies": 0.01, "eighties": 0.01, "nineties": 0 }, "gender": { "": 0.18, "male_masculine": 0.18, "female_feminine": 0.64, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 59230, 
"agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 404, "size": 1815372743, "checksum": "366a8953e6deb728f1ad9d955e5e2bf55341101f5af1843ac32503393b5e9e8c", "avgDurationSecs": 5.132, "validDurationSecs": 215617.469, "totalHrs": 84.44, "validHrs": 59.89 }, "af": { "buckets": { "dev": 64, "test": 67, "train": 109, "validated": 240, "invalidated": 193, "other": 220 }, "duration": 3667140, "reportedSentences": 0, "validatedSentences": 4743, "unvalidatedSentences": 16519, "clips": 653, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0.44, "thirties": 0, "teens": 0.01, "fourties": 0.11, "fifties": 0.22, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.35, "male_masculine": 0.47, "female_feminine": 0.18, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 653, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 34, "size": 23111102, "checksum": "04fb3c2718801cae018915dee307f46efc9611c5749099a8e7a76b2a49747b7d", "avgDurationSecs": 5.616, "validDurationSecs": 1347.8, "totalHrs": 1.01, "validHrs": 0.37 }, "am": { "buckets": { "dev": 248, "test": 217, "train": 465, "validated": 930, "invalidated": 25, "other": 588 }, "duration": 9583020, "reportedSentences": 11, "validatedSentences": 2198, "unvalidatedSentences": 34, "clips": 1543, "splits": { "accent": {}, "age": { "": 0.21, "twenties": 0.15, "thirties": 0.64, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.21, 
"male_masculine": 0.79, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1543, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 31, "size": 56578372, "checksum": "65a24ac53d584e90743fcf0fd0e63441ccb71162daf196f33716a238740c4ee5", "avgDurationSecs": 6.211, "validDurationSecs": 5775.897, "totalHrs": 2.66, "validHrs": 1.6 }, "ar": { "buckets": { "dev": 10471, "test": 10471, "train": 28410, "validated": 78302, "invalidated": 15120, "other": 41586 }, "duration": 561417858, "reportedSentences": 2126, "validatedSentences": 59408, "unvalidatedSentences": 1742082, "clips": 135008, "splits": { "accent": {}, "age": { "": 0.55, "twenties": 0.3, "thirties": 0.11, "teens": 0.03, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.55, "male_masculine": 0.28, "female_feminine": 0.17, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 135008, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1578, "size": 3457567716, "checksum": "6b0648d0944bdffd9ba911e14d054a04b4cad16e1a17cc9979b267e44ec6f907", "avgDurationSecs": 4.158, "validDurationSecs": 325611.379, "totalHrs": 155.94, "validHrs": 90.44 }, "as": { "buckets": { "dev": 512, "test": 551, "train": 683, "validated": 1746, "invalidated": 215, "other": 111 }, "duration": 11790785, "reportedSentences": 10, "validatedSentences": 5210, "unvalidatedSentences": 3900, "clips": 2072, "splits": { "accent": {}, "age": 
{ "": 0.6, "twenties": 0.36, "thirties": 0.04, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.59, "male_masculine": 0.41, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2072, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 46, "size": 74518135, "checksum": "ddd59e9ea1a88f9dc04300219bc471c7e2ec0b2d8c7bffb401b391502296f019", "avgDurationSecs": 5.691, "validDurationSecs": 9935.671, "totalHrs": 3.27, "validHrs": 2.75 }, "ast": { "buckets": { "dev": 113, "test": 171, "train": 399, "validated": 685, "invalidated": 16, "other": 978 }, "duration": 7444944, "reportedSentences": 0, "validatedSentences": 1642, "unvalidatedSentences": 290, "clips": 1679, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.1, "thirties": 0.12, "teens": 0.04, "fourties": 0.28, "fifties": 0.21, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male_masculine": 0.6, "female_feminine": 0.15, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1679, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 30, "size": 42477547, "checksum": "1d65421230d6576fde81ab098a0d0403c6541012595c97919cbdb16b2a42f4e3", "avgDurationSecs": 4.434, "validDurationSecs": 3037.395, "totalHrs": 2.06, "validHrs": 0.84 }, "az": { "buckets": { "dev": 54, "test": 67, "train": 100, "validated": 221, "invalidated": 49, "other": 527 }, "duration": 
4329756, "reportedSentences": 5, "validatedSentences": 93159, "unvalidatedSentences": 1850, "clips": 797, "splits": { "accent": {}, "age": { "": 0.2, "twenties": 0.13, "thirties": 0.67, "teens": 0.01, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.79, "male_masculine": 0.21, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 797, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 31, "size": 31636623, "checksum": "bf7596f1b497787dcf10f8cc90f2d08d110deaf832cc08f6541b7ac95203cfa0", "avgDurationSecs": 5.433, "validDurationSecs": 1200.597, "totalHrs": 1.2, "validHrs": 0.33 }, "ba": { "buckets": { "dev": 14492, "test": 14581, "train": 119127, "validated": 209866, "invalidated": 7964, "other": 327 }, "duration": 965667456, "reportedSentences": 867, "validatedSentences": 153957, "unvalidatedSentences": 16, "clips": 218157, "splits": { "accent": {}, "age": { "": 0.3, "twenties": 0.17, "thirties": 0.17, "teens": 0.04, "fourties": 0.06, "fifties": 0.05, "sixties": 0.19, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male_masculine": 0.3, "female_feminine": 0.39, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 218157, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 921, "size": 5441509003, "checksum": "e9eed2645525f9f3a8a4ee23e02506596e73ab78a1f97b4d41c0db44d47b9b9b", "avgDurationSecs": 4.426, "validDurationSecs": 928967.516, 
"totalHrs": 268.24, "validHrs": 258.04 }, "bas": { "buckets": { "dev": 457, "test": 548, "train": 763, "validated": 1768, "invalidated": 523, "other": 10 }, "duration": 10188720, "reportedSentences": 8, "validatedSentences": 5015, "unvalidatedSentences": 245, "clips": 2301, "splits": { "accent": {}, "age": { "": 0.98, "twenties": 0.01, "thirties": 0, "teens": 0.01, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.98, "male_masculine": 0.01, "female_feminine": 0.02, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2301, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 38, "size": 57065119, "checksum": "aa42f11f6c4a1f43e6bdbaa9b415fb4e9e9c34c0469a7ebcc2172ed06df3dc33", "avgDurationSecs": 4.428, "validDurationSecs": 7828.621, "totalHrs": 2.83, "validHrs": 2.17 }, "be": { "buckets": { "dev": 15875, "test": 15880, "train": 347664, "validated": 1334493, "invalidated": 36341, "other": 108 }, "duration": 6532330284, "reportedSentences": 3146, "validatedSentences": 379522, "unvalidatedSentences": 10, "clips": 1370942, "splits": { "accent": {}, "age": { "": 0.87, "twenties": 0.03, "thirties": 0.05, "teens": 0.01, "fourties": 0.04, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.87, "male_masculine": 0.06, "female_feminine": 0.07, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1370904, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 3, "general": 24, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 4, "news_current_affairs": 7, "technology_robotics": 
0 } }, "users": 8400, "size": 37237031653, "checksum": "4e9b63c2b16c17f7003ba856fc54658410573b4b0ebfd8162033f8ddc414b0bc", "avgDurationSecs": 4.765, "validDurationSecs": 6358656.338, "totalHrs": 1814.53, "validHrs": 1766.29 }, "bg": { "buckets": { "dev": 2824, "test": 3259, "train": 4891, "validated": 10990, "invalidated": 756, "other": 1993 }, "duration": 75169440, "reportedSentences": 188, "validatedSentences": 39570, "unvalidatedSentences": 3, "clips": 13739, "splits": { "accent": {}, "age": { "": 0.37, "twenties": 0.11, "thirties": 0.28, "teens": 0, "fourties": 0.23, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male_masculine": 0.59, "female_feminine": 0.04, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 13739, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 136, "size": 442957687, "checksum": "0babc9036643a743f242d1bd1244db783b12076616dc772048af639787afb330", "avgDurationSecs": 5.471, "validDurationSecs": 60128.987, "totalHrs": 20.88, "validHrs": 16.7 }, "bn": { "buckets": { "dev": 9339, "test": 9339, "train": 21284, "validated": 44210, "invalidated": 7833, "other": 997593 }, "duration": 4583427840, "reportedSentences": 1888, "validatedSentences": 562732, "unvalidatedSentences": 94211, "clips": 1049636, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0.67, "thirties": 0.03, "teens": 0.07, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.22, "male_masculine": 0.54, "female_feminine": 0.23, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1049636, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, 
"service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 22926, "size": 26486384068, "checksum": "d35df335e50b942e73a55e92e6b2df30694e65ae7965b817e410f19bc0c746da", "avgDurationSecs": 4.367, "validDurationSecs": 193051.062, "totalHrs": 1273.17, "validHrs": 53.62 }, "br": { "buckets": { "dev": 2343, "test": 2345, "train": 2747, "validated": 27806, "invalidated": 1670, "other": 1078 }, "duration": 97097047, "reportedSentences": 286, "validatedSentences": 7731, "unvalidatedSentences": 96, "clips": 30554, "splits": { "accent": {}, "age": { "": 0.29, "twenties": 0.32, "thirties": 0.09, "teens": 0.01, "fourties": 0.06, "fifties": 0.06, "sixties": 0.14, "seventies": 0.04, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male_masculine": 0.67, "female_feminine": 0.03, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 30551, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 2, "healthcare": 0, "history_law_government": 1, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 209, "size": 677393128, "checksum": "83fb17e1b2a78bc3428e6ce25edf59cf551a3def1c5020b86b2f48e571148ce3", "avgDurationSecs": 3.178, "validDurationSecs": 88364.224, "totalHrs": 26.97, "validHrs": 24.54 }, "ca": { "buckets": { "dev": 16403, "test": 16403, "train": 1149761, "validated": 1892545, "invalidated": 115414, "other": 479380 }, "duration": 13185843678, "reportedSentences": 6692, "validatedSentences": 1288656, "unvalidatedSentences": 335, "clips": 2487339, "splits": { "accent": {}, "age": { "": 0.28, "twenties": 0.04, "thirties": 0.05, "teens": 0.01, "fourties": 0.12, "fifties": 0.18, "sixties": 0.28, "seventies": 0.03, "eighties": 0, "nineties": 0 }, 
"gender": { "": 0.29, "male_masculine": 0.51, "female_feminine": 0.2, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2486813, "agriculture_food": 11, "automotive_transport": 2, "finance": 0, "service_retail": 57, "general": 294, "healthcare": 17, "history_law_government": 25, "language_fundamentals": 12, "media_entertainment": 24, "nature_environment": 64, "news_current_affairs": 10, "technology_robotics": 14 } }, "users": 36041, "size": 79446798277, "checksum": "d9f6a2f56852a264c34904a38d1c8e9137b49ca11896d5a03ce927707e04582d", "avgDurationSecs": 5.301, "validDurationSecs": 10032730.771, "totalHrs": 3662.73, "validHrs": 2786.86 }, "ckb": { "buckets": { "dev": 5306, "test": 5307, "train": 7794, "validated": 117812, "invalidated": 9154, "other": 38246 }, "duration": 672668892, "reportedSentences": 2501, "validatedSentences": 18830, "unvalidatedSentences": 15653, "clips": 165212, "splits": { "accent": {}, "age": { "": 0.32, "twenties": 0.43, "thirties": 0.1, "teens": 0.02, "fourties": 0.02, "fifties": 0.01, "sixties": 0.1, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.32, "male_masculine": 0.63, "female_feminine": 0.05, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 165020, "agriculture_food": 0, "automotive_transport": 10, "finance": 0, "service_retail": 0, "general": 101, "healthcare": 0, "history_law_government": 8, "language_fundamentals": 10, "media_entertainment": 63, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1802, "size": 3714393823, "checksum": "41288edc8073cc35dfc71492134c66c8bbfda68f2e2a325784118f12c96d2c3b", "avgDurationSecs": 4.072, "validDurationSecs": 479677.43, "totalHrs": 186.85, "validHrs": 133.24 }, "cnh": { "buckets": { "dev": 761, "test": 764, "train": 817, "validated": 2459, "invalidated": 437, "other": 3291 }, "duration": 21751656, "reportedSentences": 9, "validatedSentences": 5218, 
"unvalidatedSentences": -1, "clips": 6187, "splits": { "accent": {}, "age": { "": 0.48, "twenties": 0.4, "thirties": 0.07, "teens": 0.01, "fourties": 0.01, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.48, "male_masculine": 0.3, "female_feminine": 0.21, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6187, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 300, "size": 168197749, "checksum": "de5207e339c7c2f851b26ffad9c6d6d0e3b838aa096b738b182da162d053adbc", "avgDurationSecs": 3.516, "validDurationSecs": 8645.114, "totalHrs": 6.04, "validHrs": 2.4 }, "cs": { "buckets": { "dev": 9109, "test": 9152, "train": 20444, "validated": 62007, "invalidated": 2282, "other": 149367 }, "duration": 953831559, "reportedSentences": 953, "validatedSentences": 451091, "unvalidatedSentences": 16, "clips": 213656, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.18, "thirties": 0.47, "teens": 0.02, "fourties": 0.07, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.24, "male_masculine": 0.54, "female_feminine": 0.22, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 213648, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 4, "healthcare": 0, "history_law_government": 1, "language_fundamentals": 0, "media_entertainment": 2, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 3 } }, "users": 1011, "size": 5873161922, "checksum": "5f2d12019ab2adf15492b1090342f62f47f0843b6a607b7c2bb1589879a1e1db", "avgDurationSecs": 4.464, "validDurationSecs": 276819.904, "totalHrs": 264.95, "validHrs": 76.89 }, "cv": { 
"buckets": { "dev": 1253, "test": 1290, "train": 1443, "validated": 17340, "invalidated": 2139, "other": 98 }, "duration": 99044076, "reportedSentences": 144, "validatedSentences": 4378, "unvalidatedSentences": 4, "clips": 19577, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0.51, "thirties": 0.01, "teens": 0.19, "fourties": 0.06, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2, "male_masculine": 0.54, "female_feminine": 0.27, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 19577, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 112, "size": 673893782, "checksum": "12c80beecfe9bd7c73ce71df7cfbace2d9e4b548df0eeff2f128f6c9bbaec275", "avgDurationSecs": 5.059, "validDurationSecs": 87726.632, "totalHrs": 27.51, "validHrs": 24.36 }, "cy": { "buckets": { "dev": 5381, "test": 5387, "train": 7972, "validated": 90495, "invalidated": 4455, "other": 20475 }, "duration": 566718777, "reportedSentences": 172, "validatedSentences": 118032, "unvalidatedSentences": 25, "clips": 115425, "splits": { "accent": {}, "age": { "": 0.43, "twenties": 0.13, "thirties": 0.09, "teens": 0.02, "fourties": 0.16, "fifties": 0.09, "sixties": 0.07, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.43, "male_masculine": 0.33, "female_feminine": 0.25, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 115425, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1893, "size": 
4138122478, "checksum": "7196f4a5b77928ff274d65932ddc2b3858e34e46d49be3ded90e1cf3eace0bd8", "avgDurationSecs": 4.91, "validDurationSecs": 444316.359, "totalHrs": 157.42, "validHrs": 123.42 }, "da": { "buckets": { "dev": 2105, "test": 2548, "train": 3505, "validated": 10285, "invalidated": 407, "other": 389 }, "duration": 45526104, "reportedSentences": 426, "validatedSentences": 20444, "unvalidatedSentences": 12, "clips": 11081, "splits": { "accent": {}, "age": { "": 0.32, "twenties": 0.18, "thirties": 0.25, "teens": 0.01, "fourties": 0.2, "fifties": 0.03, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.32, "male_masculine": 0.6, "female_feminine": 0.08, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 11081, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 266, "size": 263485817, "checksum": "fe5126bb8de26ec0a97829941fa75e069f1af535fcab50c7c35094a3eb076c25", "avgDurationSecs": 4.108, "validDurationSecs": 42255.751, "totalHrs": 12.64, "validHrs": 11.73 }, "dav": { "buckets": { "dev": 292, "test": 294, "train": 489, "validated": 1235, "invalidated": 7, "other": 24847 }, "duration": 106990956, "reportedSentences": 11, "validatedSentences": 5340, "unvalidatedSentences": 23014, "clips": 26089, "splits": { "accent": {}, "age": { "": 0.05, "twenties": 0.48, "thirties": 0.16, "teens": 0, "fourties": 0.01, "fifties": 0.02, "sixties": 0.28, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.66, "male_masculine": 0, "female_feminine": 0.34, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 485, "agriculture_food": 336, "automotive_transport": 59, "finance": 201, "service_retail": 926, "general": 23241, "healthcare": 
650, "history_law_government": 236, "language_fundamentals": 98, "media_entertainment": 50, "nature_environment": 599, "news_current_affairs": 59, "technology_robotics": 37 } }, "users": 19, "size": 584663515, "checksum": "5154801c01582b5cb1d058e8b23d8e78c9bfa69fb22a051cf8073b275ffa7ae7", "avgDurationSecs": 4.101, "validDurationSecs": 5064.733, "totalHrs": 29.71, "validHrs": 1.4 }, "de": { "buckets": { "dev": 16186, "test": 16186, "train": 593009, "validated": 921275, "invalidated": 54972, "other": 6996 }, "duration": 5148390113, "reportedSentences": 10841, "validatedSentences": 2056583, "unvalidatedSentences": 247, "clips": 983243, "splits": { "accent": {}, "age": { "": 0.32, "twenties": 0.18, "thirties": 0.16, "teens": 0.03, "fourties": 0.17, "fifties": 0.11, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0.59, "female_feminine": 0.08, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 983186, "agriculture_food": 0, "automotive_transport": 1, "finance": 0, "service_retail": 8, "general": 33, "healthcare": 4, "history_law_government": 7, "language_fundamentals": 1, "media_entertainment": 0, "nature_environment": 2, "news_current_affairs": 1, "technology_robotics": 0 } }, "users": 19146, "size": 35967213698, "checksum": "f502e72090d9a7c8b876b06087113c2b9b1cb524ff458d03ad39f55ed270cf84", "avgDurationSecs": 5.236, "validDurationSecs": 4823917.487, "totalHrs": 1430.1, "validHrs": 1339.97 }, "dv": { "buckets": { "dev": 2213, "test": 2228, "train": 2677, "validated": 27716, "invalidated": 1715, "other": 16758 }, "duration": 233766963, "reportedSentences": 59, "validatedSentences": 7159, "unvalidatedSentences": 15, "clips": 46189, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0.17, "thirties": 0.36, "teens": 0.01, "fourties": 0.21, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.22, "male_masculine": 0.29, 
"female_feminine": 0.49, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 46189, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 366, "size": 1480268931, "checksum": "a2927e8a9e4fce2d319e013e7b3a0df4172072fc8c56d55d75daaeb701207b66", "avgDurationSecs": 5.061, "validDurationSecs": 140273.337, "totalHrs": 64.93, "validHrs": 38.96 }, "dyu": { "buckets": { "dev": 50, "test": 59, "train": 76, "validated": 193, "invalidated": 11, "other": 84 }, "duration": 1786860, "reportedSentences": 0, "validatedSentences": 5028, "unvalidatedSentences": 49, "clips": 288, "splits": { "accent": {}, "age": { "": 0.31, "twenties": 0.67, "thirties": 0, "teens": 0.02, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.31, "male_masculine": 0.39, "female_feminine": 0.31, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 288, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 34, "size": 10352115, "checksum": "438f73de95261c445a7631584c9a4f166e12a969e52d686aabcb33563f093533", "avgDurationSecs": 6.204, "validDurationSecs": 1197.444, "totalHrs": 0.49, "validHrs": 0.33 }, "el": { "buckets": { "dev": 1667, "test": 1707, "train": 1948, "validated": 16260, "invalidated": 839, "other": 10283 }, "duration": 113463286, "reportedSentences": 70, "validatedSentences": 6619, "unvalidatedSentences": 4715, "clips": 27382, "splits": { "accent": {}, "age": { "": 0.31, "twenties": 0.12, "thirties": 0.36, 
"teens": 0.01, "fourties": 0.17, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0.64, "female_feminine": 0.03, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 27382, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 400, "size": 756160970, "checksum": "ca6d9c76e86d51ef11a35a6c50b1dfda2f98e394685286b55600c98cba0cf638", "avgDurationSecs": 4.144, "validDurationSecs": 67376.855, "totalHrs": 31.51, "validHrs": 18.71 }, "en": { "buckets": { "dev": 16395, "test": 16395, "train": 1110033, "validated": 1812713, "invalidated": 296488, "other": 331395 }, "duration": 12792633811, "reportedSentences": 8917, "validatedSentences": 1677188, "unvalidatedSentences": 2484, "clips": 2440596, "splits": { "accent": {}, "age": { "": 0.36, "twenties": 0.25, "thirties": 0.14, "teens": 0.06, "fourties": 0.09, "fifties": 0.05, "sixties": 0.04, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male_masculine": 0.45, "female_feminine": 0.17, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2440105, "agriculture_food": 9, "automotive_transport": 3, "finance": 38, "service_retail": 24, "general": 321, "healthcare": 12, "history_law_government": 19, "language_fundamentals": 6, "media_entertainment": 34, "nature_environment": 28, "news_current_affairs": 7, "technology_robotics": 35 } }, "users": 93279, "size": 89454492229, "checksum": "8a71c2f5daa5443ee7513e5abbb1451098a3ebc6620aedeba26e0ae9c877ea4a", "avgDurationSecs": 5.242, "validDurationSecs": 9501520.782, "totalHrs": 3553.5, "validHrs": 2639.31 }, "eo": { "buckets": { "dev": 14917, "test": 14904, "train": 144213, "validated": 850793, 
"invalidated": 127449, "other": 154592 }, "duration": 6873965827, "reportedSentences": 2314, "validatedSentences": 180689, "unvalidatedSentences": 22221, "clips": 1132834, "splits": { "accent": {}, "age": { "": 0.2, "twenties": 0.56, "thirties": 0.12, "teens": 0.05, "fourties": 0.04, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0.01, "nineties": 0 }, "gender": { "": 0.2, "male_masculine": 0.69, "female_feminine": 0.11, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1132832, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 2, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1768, "size": 41087143566, "checksum": "b1dd7d3a23fab4d1e295e70ff86c9efea80e6e533e691ec575c4b50482974a4d", "avgDurationSecs": 6.068, "validDurationSecs": 5162558.687, "totalHrs": 1909.43, "validHrs": 1434.04 }, "es": { "buckets": { "dev": 15872, "test": 15872, "train": 343872, "validated": 422489, "invalidated": 91866, "other": 1132898 }, "duration": 8024877666, "reportedSentences": 2584, "validatedSentences": 1080760, "unvalidatedSentences": 5723, "clips": 1647253, "splits": { "accent": {}, "age": { "": 0.13, "twenties": 0.53, "thirties": 0.09, "teens": 0.08, "fourties": 0.03, "fifties": 0.04, "sixties": 0.1, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.13, "male_masculine": 0.55, "female_feminine": 0.32, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1647216, "agriculture_food": 0, "automotive_transport": 2, "finance": 4, "service_retail": 2, "general": 8, "healthcare": 4, "history_law_government": 0, "language_fundamentals": 6, "media_entertainment": 2, "nature_environment": 3, "news_current_affairs": 6, "technology_robotics": 8 } }, "users": 26107, "size": 50643915546, "checksum": 
"c17f95e1a8c2b694c5df41411044188798f2bef8efd0254071af20f995ca5b03", "avgDurationSecs": 4.872, "validDurationSecs": 2058228.178, "totalHrs": 2229.13, "validHrs": 571.73 }, "et": { "buckets": { "dev": 2653, "test": 2653, "train": 3157, "validated": 24415, "invalidated": 7450, "other": 30 }, "duration": 215946866, "reportedSentences": 549, "validatedSentences": 10267, "unvalidatedSentences": 3, "clips": 31895, "splits": { "accent": {}, "age": { "": 0.18, "twenties": 0.7, "thirties": 0.08, "teens": 0, "fourties": 0.03, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.19, "male_masculine": 0.55, "female_feminine": 0.26, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 31895, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 901, "size": 1456815800, "checksum": "7a0cab555bd0ec106fbe86d6db7c7bb1b9669fb5c9f7b9e87e2e3bded0bbbaf9", "avgDurationSecs": 6.771, "validDurationSecs": 165303.111, "totalHrs": 59.98, "validHrs": 45.91 }, "eu": { "buckets": { "dev": 14312, "test": 14312, "train": 102309, "validated": 232875, "invalidated": 41030, "other": 176719 }, "duration": 2475966135, "reportedSentences": 1240, "validatedSentences": 1019270, "unvalidatedSentences": 2751, "clips": 450624, "splits": { "accent": {}, "age": { "": 0.45, "twenties": 0.17, "thirties": 0.06, "teens": 0.01, "fourties": 0.16, "fifties": 0.1, "sixties": 0.04, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.46, "male_masculine": 0.25, "female_feminine": 0.28, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0, "intersex": 0 }, "sentence_domain": { "": 443685, "agriculture_food": 0, "automotive_transport": 5, "finance": 4, "service_retail": 7, "general": 6870, 
"healthcare": 22, "history_law_government": 0, "language_fundamentals": 2, "media_entertainment": 6, "nature_environment": 2, "news_current_affairs": 0, "technology_robotics": 23 } }, "users": 10707, "size": 15204582322, "checksum": "8d91e5c97cf8d9fd0f3d22ed5bb544a6bc3c1337b74e4984b9f9ecdd2242fc1d", "avgDurationSecs": 5.495, "validDurationSecs": 1279538.182, "totalHrs": 687.76, "validHrs": 355.42 }, "fa": { "buckets": { "dev": 10586, "test": 10586, "train": 29096, "validated": 331071, "invalidated": 14689, "other": 32572 }, "duration": 1504209068, "reportedSentences": 2517, "validatedSentences": 55921, "unvalidatedSentences": 283031, "clips": 378332, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.33, "thirties": 0.36, "teens": 0.03, "fourties": 0.03, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.23, "male_masculine": 0.71, "female_feminine": 0.07, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 378328, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 4, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 4461, "size": 10911723149, "checksum": "f61c857b2a791823a4079746ffe286ffc289d80644f5d92111c9fc4e79a2ba5b", "avgDurationSecs": 3.976, "validDurationSecs": 1316304.199, "totalHrs": 417.83, "validHrs": 365.64 }, "fi": { "buckets": { "dev": 1773, "test": 1776, "train": 2070, "validated": 10659, "invalidated": 306, "other": 6289 }, "duration": 79634199, "reportedSentences": 77, "validatedSentences": 5843, "unvalidatedSentences": 48873, "clips": 17254, "splits": { "accent": {}, "age": { "": 0.31, "twenties": 0.16, "thirties": 0.17, "teens": 0.01, "fourties": 0.31, "fifties": 0.03, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 
0.36, "female_feminine": 0.31, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 17254, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 226, "size": 476129496, "checksum": "8d39c7a6c8c1f630ea1053a1de3edbc40f1421fe491d42926cb90e94c76a2940", "avgDurationSecs": 4.615, "validDurationSecs": 49195.603, "totalHrs": 22.12, "validHrs": 13.66 }, "fr": { "buckets": { "dev": 16165, "test": 16165, "train": 565638, "validated": 733834, "invalidated": 64591, "other": 29097 }, "duration": 4157229034, "reportedSentences": 7347, "validatedSentences": 1646769, "unvalidatedSentences": 43672, "clips": 827522, "splits": { "accent": {}, "age": { "": 0.38, "twenties": 0.17, "thirties": 0.15, "teens": 0.03, "fourties": 0.13, "fifties": 0.09, "sixties": 0.03, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.32, "male_masculine": 0.58, "female_feminine": 0.11, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 827499, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 14, "healthcare": 0, "history_law_government": 8, "language_fundamentals": 0, "media_entertainment": 1, "nature_environment": 5, "news_current_affairs": 1, "technology_robotics": 2 } }, "users": 19621, "size": 29222878001, "checksum": "2e44cb8ac73130aa98f54da1453e42a92a1e788f5f8bd1c0c23c8ac5cf60637d", "avgDurationSecs": 5.024, "validDurationSecs": 3686567.863, "totalHrs": 1154.78, "validHrs": 1024.04 }, "fy-NL": { "buckets": { "dev": 3171, "test": 3172, "train": 3922, "validated": 51359, "invalidated": 4001, "other": 102486 }, "duration": 766106595, "reportedSentences": 543, "validatedSentences": 10351, "unvalidatedSentences": 1, "clips": 157846, 
"splits": { "accent": {}, "age": { "": 0.56, "twenties": 0.04, "thirties": 0.1, "teens": 0, "fourties": 0.06, "fifties": 0.1, "sixties": 0.12, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.57, "male_masculine": 0.13, "female_feminine": 0.31, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 157846, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 2054, "size": 4593804301, "checksum": "4c2182435609eb33cc0fa17d63a6cb56dedd5028fddd0dc69e7eb8eaaf243ac3", "avgDurationSecs": 4.854, "validDurationSecs": 249271.243, "totalHrs": 212.8, "validHrs": 69.24 }, "ga-IE": { "buckets": { "dev": 517, "test": 516, "train": 537, "validated": 5726, "invalidated": 949, "other": 3841 }, "duration": 37782783, "reportedSentences": 29, "validatedSentences": 1618, "unvalidatedSentences": 597, "clips": 10516, "splits": { "accent": {}, "age": { "": 0.38, "twenties": 0.25, "thirties": 0.24, "teens": 0.01, "fourties": 0.06, "fifties": 0.05, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male_masculine": 0.49, "female_feminine": 0.14, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 10516, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 195, "size": 256052938, "checksum": "8055eca68d486f4eec40c5f492827b8439d76a54289c6d5be094e5f5f02388b3", "avgDurationSecs": 3.593, "validDurationSecs": 20572.862, "totalHrs": 10.49, "validHrs": 5.71 }, "gl": { "buckets": { "dev": 10631, "test": 10700, 
"train": 30155, "validated": 52239, "invalidated": 4931, "other": 39998 }, "duration": 506240136, "reportedSentences": 576, "validatedSentences": 692145, "unvalidatedSentences": 7650, "clips": 97168, "splits": { "accent": {}, "age": { "": 0.4, "twenties": 0.13, "thirties": 0.11, "teens": 0.01, "fourties": 0.14, "fifties": 0.19, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.42, "male_masculine": 0.34, "female_feminine": 0.24, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 97160, "agriculture_food": 0, "automotive_transport": 0, "finance": 3, "service_retail": 0, "general": 4, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 1, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 3175, "size": 2947208082, "checksum": "077f59cb2d4444b1aa11f77b0641c06f9f021957699325a2cfe4eefb09905e7d", "avgDurationSecs": 5.21, "validDurationSecs": 272162.425, "totalHrs": 140.62, "validHrs": 75.6 }, "gn": { "buckets": { "dev": 360, "test": 857, "train": 1571, "validated": 2913, "invalidated": 162, "other": 18774 }, "duration": 99216036, "reportedSentences": 36, "validatedSentences": 5892, "unvalidatedSentences": 298, "clips": 21849, "splits": { "accent": {}, "age": { "": 0.13, "twenties": 0.58, "thirties": 0.19, "teens": 0.02, "fourties": 0.08, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.13, "male_masculine": 0.38, "female_feminine": 0.49, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 21849, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 164, "size": 568829712, "checksum": 
"cb7cc8a00e050c33a8ef33ed56cb52af2994571f5897326bed50f1f268da4c39", "avgDurationSecs": 4.541, "validDurationSecs": 13227.897, "totalHrs": 27.56, "validHrs": 3.67 }, "ha": { "buckets": { "dev": 582, "test": 663, "train": 1928, "validated": 3287, "invalidated": 170, "other": 6670 }, "duration": 43920720, "reportedSentences": 33, "validatedSentences": 5376, "unvalidatedSentences": 312, "clips": 10127, "splits": { "accent": {}, "age": { "": 0.18, "twenties": 0.12, "thirties": 0.67, "teens": 0, "fourties": 0, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.16, "male_masculine": 0.59, "female_feminine": 0.25, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 10127, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 44, "size": 257655354, "checksum": "ddc6a88a2fd6a31eaec09eca5f0c3f78bef626d3b388b4b34fac82da09545990", "avgDurationSecs": 4.337, "validDurationSecs": 14255.693, "totalHrs": 12.2, "validHrs": 3.95 }, "he": { "buckets": { "dev": 277, "test": 269, "train": 1041, "validated": 1933, "invalidated": 227, "other": 2760 }, "duration": 21980304, "reportedSentences": 32, "validatedSentences": 4053, "unvalidatedSentences": 6958, "clips": 4920, "splits": { "accent": {}, "age": { "": 0.02, "twenties": 0.78, "thirties": 0.17, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0.03, "nineties": 0 }, "gender": { "": 0.02, "male_masculine": 0.98, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 4920, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 
0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 25, "size": 127894592, "checksum": "a255f29b8ab646eb93b18e06e0f6c3bef788d3521add36aef1e98bcc770ab3cb", "avgDurationSecs": 4.468, "validDurationSecs": 8635.758, "totalHrs": 6.1, "validHrs": 2.39 }, "hi": { "buckets": { "dev": 2656, "test": 3214, "train": 4808, "validated": 10749, "invalidated": 839, "other": 3795 }, "duration": 75826752, "reportedSentences": 145, "validatedSentences": 32182, "unvalidatedSentences": 9982, "clips": 15383, "splits": { "accent": {}, "age": { "": 0.33, "twenties": 0.34, "thirties": 0.26, "teens": 0.01, "fourties": 0.04, "fifties": 0.02, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.35, "male_masculine": 0.62, "female_feminine": 0.04, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 15383, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 428, "size": 450360132, "checksum": "648965874a0f5089afd2f01acab8e267a6848b4d0ae620691c858afcd195d9c6", "avgDurationSecs": 4.929, "validDurationSecs": 52984.578, "totalHrs": 21.06, "validHrs": 14.71 }, "hsb": { "buckets": { "dev": 172, "test": 456, "train": 808, "validated": 1436, "invalidated": 247, "other": 190 }, "duration": 11584872, "reportedSentences": 130, "validatedSentences": 7177, "unvalidatedSentences": 31, "clips": 1873, "splits": { "accent": {}, "age": { "": 0.17, "twenties": 0.11, "thirties": 0.09, "teens": 0.07, "fourties": 0.49, "fifties": 0.03, "sixties": 0, "seventies": 0.03, "eighties": 0, "nineties": 0 }, "gender": { "": 0.21, "male_masculine": 0.77, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { 
"": 1873, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 23, "size": 88543386, "checksum": "eefae8b0d6f562d94ce93128ba8692b584a3ddd953548f76e7878c441564d1d5", "avgDurationSecs": 6.185, "validDurationSecs": 8881.941, "totalHrs": 3.21, "validHrs": 2.46 }, "ht": { "buckets": { "dev": 0, "test": 0, "train": 4, "validated": 4, "invalidated": 0, "other": 1 }, "duration": 20664, "reportedSentences": 0, "validatedSentences": 72, "unvalidatedSentences": 1633, "clips": 5, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1, "size": 226991, "checksum": "f6f86372f3671626bb5cd3678123dbbe536fc8d46570071fcbd8a2c40f14a1ee", "avgDurationSecs": 4.133, "validDurationSecs": 16.531, "totalHrs": 0, "validHrs": 0 }, "hu": { "buckets": { "dev": 11469, "test": 11502, "train": 37449, "validated": 60858, "invalidated": 3140, "other": 49751 }, "duration": 629010469, "reportedSentences": 474, "validatedSentences": 326041, "unvalidatedSentences": 177, "clips": 113749, "splits": { "accent": {}, "age": { "": 0.28, "twenties": 0.11, "thirties": 0.12, "teens": 0.02, "fourties": 0.11, "fifties": 0.32, "sixties": 0.01, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, 
"male_masculine": 0.24, "female_feminine": 0.46, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 113749, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1619, "size": 3699568569, "checksum": "0c2c5133c2c875238efd34be1b22dacbe62489d6fc88bd4a4d713d5d36419a9c", "avgDurationSecs": 5.53, "validDurationSecs": 336533.237, "totalHrs": 174.72, "validHrs": 93.48 }, "hy-AM": { "buckets": { "dev": 4517, "test": 4486, "train": 6292, "validated": 15441, "invalidated": 1044, "other": 15294 }, "duration": 172087884, "reportedSentences": 169, "validatedSentences": 237379, "unvalidatedSentences": 1407, "clips": 31779, "splits": { "accent": {}, "age": { "": 0.15, "twenties": 0.6, "thirties": 0.08, "teens": 0.13, "fourties": 0.02, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.18, "male_masculine": 0.29, "female_feminine": 0.54, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 31777, "agriculture_food": 0, "automotive_transport": 0, "finance": 1, "service_retail": 0, "general": 1, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 398, "size": 1015128374, "checksum": "510a36a932fec32829a192ef6512c7759f9fe67faf4814b5a05bf7780cb20e7a", "avgDurationSecs": 5.415, "validDurationSecs": 83615.25, "totalHrs": 47.8, "validHrs": 23.22 }, "ia": { "buckets": { "dev": 1825, "test": 1823, "train": 4965, "validated": 11819, "invalidated": 342, "other": 2483 }, "duration": 61387104, "reportedSentences": 277, "validatedSentences": 9069, "unvalidatedSentences": 35, "clips": 14644, "splits": { "accent": 
{}, "age": { "": 0.38, "twenties": 0.05, "thirties": 0.02, "teens": 0, "fourties": 0.3, "fifties": 0.03, "sixties": 0, "seventies": 0.22, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male_masculine": 0.61, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 14644, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 68, "size": 416477676, "checksum": "606fc8a7a07468aff48aee436eeb3f6d1862d9733a5742ac09440475fef076a0", "avgDurationSecs": 4.192, "validDurationSecs": 49544.809, "totalHrs": 17.05, "validHrs": 13.76 }, "id": { "buckets": { "dev": 3424, "test": 3668, "train": 4968, "validated": 28016, "invalidated": 2702, "other": 27766 }, "duration": 233562252, "reportedSentences": 354, "validatedSentences": 13131, "unvalidatedSentences": 11968, "clips": 58484, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.41, "thirties": 0.1, "teens": 0.22, "fourties": 0.02, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.29, "male_masculine": 0.45, "female_feminine": 0.26, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 58484, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 570, "size": 1470789744, "checksum": "220891c9228c1b9ce9d46b37b1bf142e7a0b82cb0d054c44ca7a1a189205b38c", "avgDurationSecs": 3.994, "validDurationSecs": 111884.961, "totalHrs": 64.87, "validHrs": 31.07 }, "ig": { "buckets": { "dev": 2, "test": 5, "train": 9, "validated": 16, 
"invalidated": 3, "other": 5764 }, "duration": 31583232, "reportedSentences": 16, "validatedSentences": 14454, "unvalidatedSentences": 3857, "clips": 5783, "splits": { "accent": {}, "age": { "": 0.56, "twenties": 0.32, "thirties": 0.04, "teens": 0.05, "fourties": 0, "fifties": 0, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.56, "male_masculine": 0.14, "female_feminine": 0.3, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5783, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 116, "size": 185706228, "checksum": "cf8eae99a5ffd294398f708ab2fb74811457ac747c2f38173abf88eebf665d30", "avgDurationSecs": 5.461, "validDurationSecs": 87.382, "totalHrs": 8.77, "validHrs": 0.02 }, "is": { "buckets": { "dev": 0, "test": 1, "train": 14, "validated": 15, "invalidated": 1, "other": 24 }, "duration": 284040, "reportedSentences": 0, "validatedSentences": 4993, "unvalidatedSentences": 202, "clips": 40, "splits": { "accent": {}, "age": { "": 0.88, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0.13, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.88, "male_masculine": 0.13, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 40, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 4, "size": 2051329, "checksum": "c0c9723d50d635eedb284eecae1a3a9c2f3929d398a3078cb78c262b744920a3", "avgDurationSecs": 7.101, "validDurationSecs": 
106.515, "totalHrs": 0.07, "validHrs": 0.02 }, "it": { "buckets": { "dev": 15157, "test": 15162, "train": 170729, "validated": 237672, "invalidated": 19955, "other": 10033 }, "duration": 1446605016, "reportedSentences": 5947, "validatedSentences": 924376, "unvalidatedSentences": 1522, "clips": 267660, "splits": { "accent": {}, "age": { "": 0.28, "twenties": 0.21, "thirties": 0.16, "teens": 0.01, "fourties": 0.13, "fifties": 0.14, "sixties": 0.06, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.28, "male_masculine": 0.61, "female_feminine": 0.11, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 267656, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 1, "general": 4, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 1, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 7166, "size": 9900284728, "checksum": "4bcec1c73da03e73d1ffec3717d568ce7d78703e1138c4b7a9b8714b383d39a9", "avgDurationSecs": 5.405, "validDurationSecs": 1284530.775, "totalHrs": 401.83, "validHrs": 356.81 }, "ja": { "buckets": { "dev": 6270, "test": 6270, "train": 10062, "validated": 93233, "invalidated": 13590, "other": 250391 }, "duration": 1718294685, "reportedSentences": 377, "validatedSentences": 30216, "unvalidatedSentences": 204, "clips": 357214, "splits": { "accent": {}, "age": { "": 0.16, "twenties": 0.53, "thirties": 0.08, "teens": 0.07, "fourties": 0.1, "fifties": 0.04, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.18, "male_masculine": 0.44, "female_feminine": 0.38, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 357142, "agriculture_food": 0, "automotive_transport": 0, "finance": 6, "service_retail": 0, "general": 7, "healthcare": 0, "history_law_government": 59, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, 
"news_current_affairs": 0, "technology_robotics": 0 } }, "users": 4048, "size": 10131552007, "checksum": "fb0c2ff889aad458e986ace5466cb7a73072741a38bb4c4bdc994e61746e68f4", "avgDurationSecs": 4.81, "validDurationSecs": 448475.615, "totalHrs": 477.3, "validHrs": 124.57 }, "ka": { "buckets": { "dev": 12788, "test": 12881, "train": 58160, "validated": 107191, "invalidated": 5340, "other": 81994 }, "duration": 1009362672, "reportedSentences": 944, "validatedSentences": 132101, "unvalidatedSentences": 1209, "clips": 194525, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0.16, "thirties": 0.37, "teens": 0.11, "fourties": 0.07, "fifties": 0.04, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male_masculine": 0.19, "female_feminine": 0.55, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 194158, "agriculture_food": 7, "automotive_transport": 3, "finance": 2, "service_retail": 6, "general": 154, "healthcare": 7, "history_law_government": 150, "language_fundamentals": 20, "media_entertainment": 14, "nature_environment": 7, "news_current_affairs": 7, "technology_robotics": 2 } }, "users": 2352, "size": 5904522303, "checksum": "47635132f7713b2d86bfe755115c1634b160864e845671e6abb22adfd8b9ac43", "avgDurationSecs": 5.189, "validDurationSecs": 556198.916, "totalHrs": 280.37, "validHrs": 154.49 }, "kab": { "buckets": { "dev": 14999, "test": 14999, "train": 152059, "validated": 611700, "invalidated": 19784, "other": 113008 }, "duration": 2485760848, "reportedSentences": 9546, "validatedSentences": 182732, "unvalidatedSentences": 205349, "clips": 744492, "splits": { "accent": {}, "age": { "": 0.28, "twenties": 0.12, "thirties": 0.29, "teens": 0, "fourties": 0.09, "fifties": 0.19, "sixties": 0.03, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male_masculine": 0.53, "female_feminine": 0.2, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 
744492, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1553, "size": 18583581375, "checksum": "1c168895b4e70296d98220ed16be876d7a5ad623422d8f715f21ba92f686e559", "avgDurationSecs": 3.339, "validDurationSecs": 2042385.829, "totalHrs": 690.48, "validHrs": 567.32 }, "kk": { "buckets": { "dev": 509, "test": 510, "train": 548, "validated": 1573, "invalidated": 226, "other": 732 }, "duration": 12454092, "reportedSentences": 42, "validatedSentences": 6168, "unvalidatedSentences": 300, "clips": 2531, "splits": { "accent": {}, "age": { "": 0.54, "twenties": 0.24, "thirties": 0.03, "teens": 0.06, "fourties": 0.08, "fifties": 0.05, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.55, "male_masculine": 0.34, "female_feminine": 0.11, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2531, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 167, "size": 71323596, "checksum": "6b27afb80b6a96b6c4e5d7c2777ede87745b2f6af700c10a953e40746f910212", "avgDurationSecs": 4.921, "validDurationSecs": 7740.137, "totalHrs": 3.45, "validHrs": 2.15 }, "kln": { "buckets": { "dev": 4074, "test": 4651, "train": 7270, "validated": 18378, "invalidated": 337, "other": 37863 }, "duration": 260667396, "reportedSentences": 0, "validatedSentences": 33793, "unvalidatedSentences": -1, "clips": 56578, "splits": { "accent": {}, "age": { "": 0.02, "twenties": 0.64, "thirties": 0.06, "teens": 0.01, "fourties": 0.27, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 
0, "nineties": 0 }, "gender": { "": 0.29, "male_masculine": 0, "female_feminine": 0.71, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 0, "agriculture_food": 4, "automotive_transport": 14, "finance": 26, "service_retail": 501, "general": 55581, "healthcare": 44, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 2, "nature_environment": 20, "news_current_affairs": 0, "technology_robotics": 386 } }, "users": 35, "size": 1402458026, "checksum": "84a0a278b4d47da32742f28c58467cd5b0054da71cca802c357b92280550646d", "avgDurationSecs": 4.607, "validDurationSecs": 84671.523, "totalHrs": 72.4, "validHrs": 23.51 }, "kmr": { "buckets": { "dev": 3901, "test": 3918, "train": 5145, "validated": 59953, "invalidated": 2562, "other": 26003 }, "duration": 360548964, "reportedSentences": 1058, "validatedSentences": 16785, "unvalidatedSentences": 24, "clips": 88518, "splits": { "accent": {}, "age": { "": 0.48, "twenties": 0.34, "thirties": 0.09, "teens": 0.01, "fourties": 0.03, "fifties": 0.04, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.48, "male_masculine": 0.43, "female_feminine": 0.09, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 88517, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 1, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 567, "size": 1910215330, "checksum": "4f86959d7540278b323f0f9ef6d3fc27a3e5665bf13fcdf5784cb20d7243426c", "avgDurationSecs": 4.073, "validDurationSecs": 244198.83, "totalHrs": 100.15, "validHrs": 67.83 }, "ko": { "buckets": { "dev": 310, "test": 332, "train": 509, "validated": 1152, "invalidated": 290, "other": 2307 }, "duration": 21848184, "reportedSentences": 22, "validatedSentences": 6656, "unvalidatedSentences": 3161, 
"clips": 3749, "splits": { "accent": {}, "age": { "": 0.15, "twenties": 0.43, "thirties": 0.32, "teens": 0.03, "fourties": 0.06, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.22, "male_masculine": 0.58, "female_feminine": 0.2, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 3749, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 104, "size": 127874298, "checksum": "459126ad7b23a86a9dcfd42f5d5f9e6ae132d2107021028db72a679b585a7c5a", "avgDurationSecs": 5.828, "validDurationSecs": 6713.552, "totalHrs": 6.06, "validHrs": 1.86 }, "ky": { "buckets": { "dev": 1613, "test": 1614, "train": 1789, "validated": 30385, "invalidated": 5646, "other": 1673 }, "duration": 171639696, "reportedSentences": 39, "validatedSentences": 5027, "unvalidatedSentences": 4, "clips": 37704, "splits": { "accent": {}, "age": { "": 0.07, "twenties": 0.65, "thirties": 0.09, "teens": 0.17, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.12, "male_masculine": 0.54, "female_feminine": 0.34, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 37704, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 287, "size": 1104335773, "checksum": "5f271683e8df0ee0d2bc52f733d9c48b076b712c75ad6e455d161f6be8309113", "avgDurationSecs": 4.552, "validDurationSecs": 138321.456, "totalHrs": 47.67, "validHrs": 38.42 }, "lg": { "buckets": { "dev": 13339, "test": 13359, 
"train": 71071, "validated": 271814, "invalidated": 39173, "other": 37090 }, "duration": 2013227919, "reportedSentences": 6451, "validatedSentences": 191407, "unvalidatedSentences": 357, "clips": 348077, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.38, "thirties": 0.22, "teens": 0.01, "fourties": 0.07, "fifties": 0.05, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.25, "male_masculine": 0.36, "female_feminine": 0.39, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 348077, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 658, "size": 11819707123, "checksum": "abdf91598f56b50f11a312489b605df925e81bcbd30d151f711f024cd0a026bc", "avgDurationSecs": 5.784, "validDurationSecs": 1572133.561, "totalHrs": 559.22, "validHrs": 436.7 }, "lij": { "buckets": { "dev": 487, "test": 788, "train": 1685, "validated": 3027, "invalidated": 473, "other": 0 }, "duration": 12692592, "reportedSentences": 368, "validatedSentences": 5385, "unvalidatedSentences": 12, "clips": 3500, "splits": { "accent": {}, "age": { "": 0.16, "twenties": 0.1, "thirties": 0.66, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0.07, "eighties": 0, "nineties": 0 }, "gender": { "": 0.16, "male_masculine": 0.66, "female_feminine": 0.18, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 3500, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 25, "size": 70900990, "checksum": 
"e7bdfdac6b1d81554863ed829ca03fbd7d805bc71c4129c1ba3575c6ebef1ceb", "avgDurationSecs": 3.626, "validDurationSecs": 10977.279, "totalHrs": 3.52, "validHrs": 3.04 }, "lo": { "buckets": { "dev": 28, "test": 34, "train": 55, "validated": 117, "invalidated": 6, "other": 85 }, "duration": 1342152, "reportedSentences": 0, "validatedSentences": 5145, "unvalidatedSentences": 5925, "clips": 208, "splits": { "accent": {}, "age": { "": 0.07, "twenties": 0.33, "thirties": 0.6, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.07, "male_masculine": 0.93, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 208, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 13, "size": 8666190, "checksum": "13c9d8fb87cedacb334d8f67995e5f54fbc308c8cb9019614b8fbbe6982beac7", "avgDurationSecs": 6.453, "validDurationSecs": 754.961, "totalHrs": 0.37, "validHrs": 0.2 }, "lt": { "buckets": { "dev": 4566, "test": 4916, "train": 7337, "validated": 17021, "invalidated": 839, "other": 1906 }, "duration": 101440016, "reportedSentences": 191, "validatedSentences": 129241, "unvalidatedSentences": 4787, "clips": 19766, "splits": { "accent": {}, "age": { "": 0.19, "twenties": 0.32, "thirties": 0.29, "teens": 0.02, "fourties": 0.13, "fifties": 0.04, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.28, "male_masculine": 0.52, "female_feminine": 0.2, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 19766, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, 
"media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 304, "size": 625425977, "checksum": "fea7723995eadd7a1cdb89d2753f072f15a69b670620ec3917917b374f0177c4", "avgDurationSecs": 5.132, "validDurationSecs": 87352.55, "totalHrs": 28.17, "validHrs": 24.26 }, "ltg": { "buckets": { "dev": 3292, "test": 3307, "train": 4150, "validated": 19512, "invalidated": 1759, "other": 23 }, "duration": 95944968, "reportedSentences": 45, "validatedSentences": 11521, "unvalidatedSentences": 4270, "clips": 21294, "splits": { "accent": {}, "age": { "": 0.33, "twenties": 0.1, "thirties": 0.2, "teens": 0, "fourties": 0.36, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0.06, "female_feminine": 0.61, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 19129, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 2165, "healthcare": 0, "history_law_government": 149, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 321, "size": 551519306, "checksum": "204b3db5723997b2d8ac5fbcad343a85082c33f8b69c64f9e1fb7b38d8bd4c0f", "avgDurationSecs": 4.506, "validDurationSecs": 87915.761, "totalHrs": 26.65, "validHrs": 24.42 }, "luo": { "buckets": { "dev": 2723, "test": 2885, "train": 4632, "validated": 19404, "invalidated": 617, "other": 58837 }, "duration": 401357340, "reportedSentences": 8, "validatedSentences": 17819, "unvalidatedSentences": 16831, "clips": 78858, "splits": { "accent": {}, "age": { "": 0.37, "twenties": 0.34, "thirties": 0.27, "teens": 0, "fourties": 0, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.62, "male_masculine": 0, "female_feminine": 0.38, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 
11, "agriculture_food": 1076, "automotive_transport": 321, "finance": 919, "service_retail": 2624, "general": 66223, "healthcare": 2150, "history_law_government": 827, "language_fundamentals": 526, "media_entertainment": 776, "nature_environment": 609, "news_current_affairs": 2567, "technology_robotics": 229 } }, "users": 41, "size": 2219933243, "checksum": "f0452573e1230e342936f5a89ce6286db3b6b7beb44c28f6945d31f462cff51d", "avgDurationSecs": 5.09, "validDurationSecs": 98759.008, "totalHrs": 111.48, "validHrs": 27.43 }, "lv": { "buckets": { "dev": 7203, "test": 7203, "train": 12702, "validated": 184676, "invalidated": 10523, "other": 26601 }, "duration": 1052172701, "reportedSentences": 3112, "validatedSentences": 31303, "unvalidatedSentences": 67790, "clips": 221800, "splits": { "accent": {}, "age": { "": 0.44, "twenties": 0.06, "thirties": 0.29, "teens": 0.01, "fourties": 0.09, "fifties": 0.08, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.44, "male_masculine": 0.25, "female_feminine": 0.3, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 221330, "agriculture_food": 5, "automotive_transport": 3, "finance": 2, "service_retail": 10, "general": 394, "healthcare": 18, "history_law_government": 3, "language_fundamentals": 8, "media_entertainment": 7, "nature_environment": 17, "news_current_affairs": 3, "technology_robotics": 8 } }, "users": 6086, "size": 5888196506, "checksum": "1717cc3459e8831c5c1766cb3da38eacbbd01029d675b44a54cfdb60965dc557", "avgDurationSecs": 4.744, "validDurationSecs": 876064.228, "totalHrs": 292.27, "validHrs": 243.35 }, "mdf": { "buckets": { "dev": 54, "test": 107, "train": 175, "validated": 336, "invalidated": 9, "other": 0 }, "duration": 1811340, "reportedSentences": 15, "validatedSentences": 6039, "unvalidatedSentences": 4, "clips": 345, "splits": { "accent": {}, "age": { "": 0.58, "twenties": 0.02, "thirties": 0, "teens": 0, "fourties": 0.34, "fifties": 0, "sixties": 
0.06, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.58, "male_masculine": 0.08, "female_feminine": 0.34, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 345, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 11, "size": 11057468, "checksum": "41226dad3ac926f78dd7a5ed983ae5b506bb8f7d449df5da409b24cb5292fb28", "avgDurationSecs": 5.25, "validDurationSecs": 1764.088, "totalHrs": 0.5, "validHrs": 0.49 }, "mhr": { "buckets": { "dev": 15246, "test": 15119, "train": 185916, "validated": 218645, "invalidated": 16021, "other": 70 }, "duration": 1085203404, "reportedSentences": 102, "validatedSentences": 328269, "unvalidatedSentences": 3, "clips": 234736, "splits": { "accent": {}, "age": { "": 0.12, "twenties": 0.33, "thirties": 0.21, "teens": 0.14, "fourties": 0.12, "fifties": 0.04, "sixties": 0.03, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.12, "male_masculine": 0.19, "female_feminine": 0.69, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 234736, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 496, "size": 6123092863, "checksum": "3e5464e78e17037bc524981c58f53f38c73ae48235e88490385a6f73fa1c15e0", "avgDurationSecs": 4.623, "validDurationSecs": 1010813.417, "totalHrs": 301.44, "validHrs": 280.78 }, "mk": { "buckets": { "dev": 1285, "test": 1168, "train": 1773, "validated": 7031, "invalidated": 271, "other": 11772 }, "duration": 82556640, "reportedSentences": 48, "validatedSentences": 
5210, "unvalidatedSentences": 1381, "clips": 19074, "splits": { "accent": {}, "age": { "": 0.01, "twenties": 0.47, "thirties": 0.48, "teens": 0, "fourties": 0.04, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.01, "male_masculine": 0.58, "female_feminine": 0.41, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 19074, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 19, "size": 447198664, "checksum": "f44bd86f316e511f7f16ba905349465c05cd4aa8212181264c75bd64039bcffc", "avgDurationSecs": 4.328, "validDurationSecs": 30431.778, "totalHrs": 22.93, "validHrs": 8.45 }, "ml": { "buckets": { "dev": 785, "test": 728, "train": 1256, "validated": 3034, "invalidated": 112, "other": 5565 }, "duration": 36391608, "reportedSentences": 198, "validatedSentences": 5497, "unvalidatedSentences": 4749, "clips": 8711, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0.57, "thirties": 0.09, "teens": 0, "fourties": 0.12, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.27, "male_masculine": 0.52, "female_feminine": 0.21, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8711, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 134, "size": 205671066, "checksum": "ef4a05e53f4446d54aeb7d9af17dadaf2db40d1f570efc42dd961571ef146ac5", "avgDurationSecs": 4.178, "validDurationSecs": 12675.025, "totalHrs": 10.1, "validHrs": 3.52 }, "mn": { "buckets": { 
"dev": 1931, "test": 1932, "train": 2195, "validated": 33143, "invalidated": 3169, "other": 59933 }, "duration": 505560500, "reportedSentences": 45, "validatedSentences": 6090, "unvalidatedSentences": 16, "clips": 96245, "splits": { "accent": {}, "age": { "": 0.04, "twenties": 0.92, "thirties": 0.03, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0.01 }, "gender": { "": 0.46, "male_masculine": 0.14, "female_feminine": 0.4, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 96230, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 15, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 575, "size": 3075543347, "checksum": "f06a9c3abb2c63410d36edb819743dea7a4daac7860218b0ce97d4b08d428213", "avgDurationSecs": 5.253, "validDurationSecs": 174095.191, "totalHrs": 140.43, "validHrs": 48.35 }, "mr": { "buckets": { "dev": 1779, "test": 1757, "train": 2210, "validated": 10911, "invalidated": 2271, "other": 2840 }, "duration": 99227268, "reportedSentences": 59, "validatedSentences": 6691, "unvalidatedSentences": 12129, "clips": 16022, "splits": { "accent": {}, "age": { "": 0.05, "twenties": 0.29, "thirties": 0.13, "teens": 0.53, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.05, "male_masculine": 0.19, "female_feminine": 0.76, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 16022, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 93, "size": 583536282, "checksum": 
"3e66c479e904d75d86519b5baeb050d98f101608b3e3ab7e9fc667daa2f38731", "avgDurationSecs": 6.193, "validDurationSecs": 67573.881, "totalHrs": 27.56, "validHrs": 18.77 }, "mrj": { "buckets": { "dev": 7314, "test": 7111, "train": 14325, "validated": 28920, "invalidated": 1249, "other": 1475 }, "duration": 132560172, "reportedSentences": 50, "validatedSentences": 54143, "unvalidatedSentences": 4, "clips": 31644, "splits": { "accent": {}, "age": { "": 0.1, "twenties": 0.35, "thirties": 0.22, "teens": 0.06, "fourties": 0.12, "fifties": 0.11, "sixties": 0.04, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.13, "male_masculine": 0.25, "female_feminine": 0.62, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 31644, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 60, "size": 745349255, "checksum": "6bbf414603d4451b053f799940fb83e44334e46d8ea1b96dc8218aedde735b4b", "avgDurationSecs": 4.189, "validDurationSecs": 121149.038, "totalHrs": 36.82, "validHrs": 33.65 }, "mt": { "buckets": { "dev": 1623, "test": 1648, "train": 1920, "validated": 6547, "invalidated": 325, "other": 6330 }, "duration": 62759880, "reportedSentences": 10, "validatedSentences": 5273, "unvalidatedSentences": 11741, "clips": 13202, "splits": { "accent": {}, "age": { "": 0.27, "twenties": 0.19, "thirties": 0.09, "teens": 0.03, "fourties": 0.17, "fifties": 0.25, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.28, "male_masculine": 0.25, "female_feminine": 0.47, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 13202, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, 
"history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 221, "size": 466326010, "checksum": "aeb4e78c6e5a073bccf36c908ce34b229478a4d13f8f8888a2a52efabcb7bb63", "avgDurationSecs": 4.754, "validDurationSecs": 31123.234, "totalHrs": 17.43, "validHrs": 8.64 }, "myv": { "buckets": { "dev": 239, "test": 479, "train": 1241, "validated": 1969, "invalidated": 25, "other": 1 }, "duration": 11519388, "reportedSentences": 31, "validatedSentences": 5066, "unvalidatedSentences": 7, "clips": 1995, "splits": { "accent": {}, "age": { "": 0.38, "twenties": 0.09, "thirties": 0.24, "teens": 0.01, "fourties": 0.02, "fifties": 0, "sixties": 0.26, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male_masculine": 0.54, "female_feminine": 0.07, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1995, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 13, "size": 67698468, "checksum": "49a232fd0729da121c1c9f3eb48777670f4744b325d65c09164a1790c7a4913a", "avgDurationSecs": 5.774, "validDurationSecs": 11369.261, "totalHrs": 3.19, "validHrs": 3.15 }, "nan-tw": { "buckets": { "dev": 2727, "test": 2589, "train": 3474, "validated": 9067, "invalidated": 751, "other": 19477 }, "duration": 76744368, "reportedSentences": 216, "validatedSentences": 24512, "unvalidatedSentences": 342, "clips": 29295, "splits": { "accent": {}, "age": { "": 0.12, "twenties": 0.53, "thirties": 0.15, "teens": 0.01, "fourties": 0.16, "fifties": 0.01, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.24, "male_masculine": 0.65, "female_feminine": 0.11, "transgender": 0, "non-binary": 0, 
"do_not_wish_to_say": 0 }, "sentence_domain": { "": 29295, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 222, "size": 436647062, "checksum": "130452ea1604ddbc048b2872c8a1b8d4bf5043e40df50f818e5607bf637741a4", "avgDurationSecs": 2.62, "validDurationSecs": 23752.899, "totalHrs": 21.31, "validHrs": 6.59 }, "ne-NP": { "buckets": { "dev": 205, "test": 241, "train": 281, "validated": 772, "invalidated": 64, "other": 643 }, "duration": 5912244, "reportedSentences": 5, "validatedSentences": 1251, "unvalidatedSentences": 93, "clips": 1479, "splits": { "accent": {}, "age": { "": 0.28, "twenties": 0.61, "thirties": 0.11, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male_masculine": 0.69, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1479, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 42, "size": 34670685, "checksum": "f143a3e117f4b536cb95930d3c0b01c930dfd54a21eaea1a26505211dbdccb18", "avgDurationSecs": 3.997, "validDurationSecs": 3086.039, "totalHrs": 1.64, "validHrs": 0.85 }, "nhi": { "buckets": { "dev": 0, "test": 5, "train": 14, "validated": 19, "invalidated": 0, "other": 16 }, "duration": 163872, "reportedSentences": 0, "validatedSentences": 780, "unvalidatedSentences": 10, "clips": 35, "splits": { "accent": {}, "age": { "": 0.43, "twenties": 0, "thirties": 0.57, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, 
"nineties": 0 }, "gender": { "": 0.43, "male_masculine": 0.57, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 35, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 3, "size": 1009323, "checksum": "e1d2fc2afe1ef116b68450171e27ec5b38ed53e49602b19aaa86e53e50b9af4f", "avgDurationSecs": 4.682, "validDurationSecs": 88.959, "totalHrs": 0.04, "validHrs": 0.02 }, "nl": { "buckets": { "dev": 11300, "test": 11309, "train": 35309, "validated": 90956, "invalidated": 5601, "other": 2858 }, "duration": 433412713, "reportedSentences": 340, "validatedSentences": 258936, "unvalidatedSentences": 12, "clips": 99415, "splits": { "accent": {}, "age": { "": 0.4, "twenties": 0.21, "thirties": 0.13, "teens": 0.02, "fourties": 0.15, "fifties": 0.08, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.41, "male_masculine": 0.48, "female_feminine": 0.11, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 99413, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 2, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1719, "size": 2959304642, "checksum": "207e6cd7b199eb63758ac51b7da2fc7a9f4674014cd80aab24dc583684800232", "avgDurationSecs": 4.36, "validDurationSecs": 396534.595, "totalHrs": 120.39, "validHrs": 110.14 }, "nn-NO": { "buckets": { "dev": 283, "test": 372, "train": 521, "validated": 1181, "invalidated": 66, "other": 122 }, "duration": 5995944, "reportedSentences": 23, "validatedSentences": 5201, "unvalidatedSentences": 123, "clips": 1369, 
"splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.28, "thirties": 0.41, "teens": 0.02, "fourties": 0.03, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.28, "male_masculine": 0.59, "female_feminine": 0.13, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1369, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 38, "size": 34852162, "checksum": "fd3b2fcab8b4063df89c43d62a0cdcd12c94a1d833917cb6ddb35933fb923047", "avgDurationSecs": 4.38, "validDurationSecs": 5172.542, "totalHrs": 1.66, "validHrs": 1.43 }, "nso": { "buckets": { "dev": 0, "test": 0, "train": 2, "validated": 2, "invalidated": 0, "other": 18 }, "duration": 92592, "reportedSentences": 0, "validatedSentences": 2244, "unvalidatedSentences": 14227, "clips": 20, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 20, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1, "size": 1552848, "checksum": "deb504c75264141b937fe8eec2e4edc820bbd7538e804694a31cd86573f44e2e", "avgDurationSecs": 4.63, "validDurationSecs": 9.259, "totalHrs": 0.02, "validHrs": 0 }, "oc": { "buckets": { "dev": 259, "test": 265, "train": 284, "validated": 1805, "invalidated": 190, "other": 7562 }, "duration": 
46559268, "reportedSentences": 10, "validatedSentences": 4929, "unvalidatedSentences": 5, "clips": 9557, "splits": { "accent": {}, "age": { "": 0.49, "twenties": 0.02, "thirties": 0.04, "teens": 0.01, "fourties": 0.18, "fifties": 0.16, "sixties": 0.12, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.5, "male_masculine": 0.09, "female_feminine": 0.41, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 9557, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 146, "size": 264719161, "checksum": "f87001cc29b5c7491f712bb41cfe2f612e73993ae9d982727e76fc5520c6a9ae", "avgDurationSecs": 4.872, "validDurationSecs": 8793.5, "totalHrs": 12.93, "validHrs": 2.44 }, "or": { "buckets": { "dev": 0, "test": 696, "train": 2048, "validated": 3169, "invalidated": 275, "other": 5562 }, "duration": 45160800, "reportedSentences": 31, "validatedSentences": 6091, "unvalidatedSentences": 120, "clips": 9006, "splits": { "accent": {}, "age": { "": 0.09, "twenties": 0.17, "thirties": 0.71, "teens": 0, "fourties": 0.02, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.09, "male_masculine": 0.87, "female_feminine": 0.04, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 9006, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 126, "size": 312932195, "checksum": "af92de040ec30c2d5c8b82c83cc4f25745bb9a247acc9a0330ddfebfa45f8b00", "avgDurationSecs": 5.015, "validDurationSecs": 15891.025, 
"totalHrs": 12.54, "validHrs": 4.41 }, "os": { "buckets": { "dev": 43, "test": 57, "train": 91, "validated": 191, "invalidated": 18, "other": 97 }, "duration": 1601100, "reportedSentences": 1, "validatedSentences": 961, "unvalidatedSentences": 922, "clips": 306, "splits": { "accent": {}, "age": { "": 0.29, "twenties": 0.27, "thirties": 0, "teens": 0.07, "fourties": 0.37, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.57, "male_masculine": 0.43, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 306, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 11, "size": 9196384, "checksum": "f5837cb66e61edf95def1ae14c10d715f75775f1804c6e199adba989898dd4d0", "avgDurationSecs": 5.232, "validDurationSecs": 999.379, "totalHrs": 0.44, "validHrs": 0.27 }, "pa-IN": { "buckets": { "dev": 403, "test": 523, "train": 788, "validated": 1714, "invalidated": 90, "other": 1267 }, "duration": 14820614, "reportedSentences": 251, "validatedSentences": 14822, "unvalidatedSentences": 22, "clips": 3071, "splits": { "accent": {}, "age": { "": 0.29, "twenties": 0.23, "thirties": 0.39, "teens": 0, "fourties": 0.04, "fifties": 0.05, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.32, "male_masculine": 0.67, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 3071, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 70, "size": 104689746, 
"checksum": "b4e25102c68280c05e78db807e0ae23c46ed15dbd7a137414f602851ceceec59", "avgDurationSecs": 4.826, "validDurationSecs": 8271.746, "totalHrs": 4.11, "validHrs": 2.29 }, "pl": { "buckets": { "dev": 9295, "test": 9295, "train": 21057, "validated": 133125, "invalidated": 6671, "other": 1113 }, "duration": 638308370, "reportedSentences": 561, "validatedSentences": 228754, "unvalidatedSentences": 11722, "clips": 140909, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.28, "thirties": 0.33, "teens": 0.02, "fourties": 0.11, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0.01 }, "gender": { "": 0.26, "male_masculine": 0.6, "female_feminine": 0.14, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 140909, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 3331, "size": 4559258818, "checksum": "e9dc79f31d68e5a7586c02dda1e7b3ad6f40781ac58de8ac109dc62bb490e0b9", "avgDurationSecs": 4.53, "validDurationSecs": 603047.369, "totalHrs": 177.3, "validHrs": 167.51 }, "ps": { "buckets": { "dev": 1463, "test": 1463, "train": 1609, "validated": 13263, "invalidated": 1443, "other": 40289 }, "duration": 273385872, "reportedSentences": 827, "validatedSentences": 6592, "unvalidatedSentences": 892, "clips": 54995, "splits": { "accent": {}, "age": { "": 0.3, "twenties": 0.46, "thirties": 0.1, "teens": 0.05, "fourties": 0.06, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.97, "male_masculine": 0.03, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 36743, "agriculture_food": 558, "automotive_transport": 228, "finance": 89, "service_retail": 386, "general": 13943, 
"healthcare": 510, "history_law_government": 1693, "language_fundamentals": 1497, "media_entertainment": 654, "nature_environment": 542, "news_current_affairs": 496, "technology_robotics": 232 } }, "users": 971, "size": 1508548992, "checksum": "d48a5aa53661bcc337ee463357bc9e874b9b527674ba42e00c4bdf77b2436449", "avgDurationSecs": 4.971, "validDurationSecs": 65931.754, "totalHrs": 75.94, "validHrs": 18.31 }, "pt": { "buckets": { "dev": 9489, "test": 9494, "train": 22116, "validated": 153669, "invalidated": 7448, "other": 25777 }, "duration": 769871887, "reportedSentences": 2791, "validatedSentences": 43181, "unvalidatedSentences": 83, "clips": 186894, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.37, "thirties": 0.18, "teens": 0.02, "fourties": 0.13, "fifties": 0.03, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male_masculine": 0.68, "female_feminine": 0.06, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 186763, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 3, "general": 110, "healthcare": 2, "history_law_government": 5, "language_fundamentals": 3, "media_entertainment": 14, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 1 } }, "users": 3516, "size": 4849160820, "checksum": "8682e4e996e13943c27dc4d0356e195a58dd41c1b1cc241ec9f8e5019178c27e", "avgDurationSecs": 4.119, "validDurationSecs": 633008.245, "totalHrs": 213.85, "validHrs": 175.83 }, "quy": { "buckets": { "dev": 0, "test": 0, "train": 1, "validated": 1, "invalidated": 1, "other": 4 }, "duration": 36792, "reportedSentences": 0, "validatedSentences": 751, "unvalidatedSentences": 290, "clips": 6, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, 
"non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 2, "size": 281340, "checksum": "19f3e331093943469d4191d7718f2a9fd8d650ed32f8014cdd56ee261688e865", "avgDurationSecs": 6.132, "validDurationSecs": 6.132, "totalHrs": 0.01, "validHrs": 0 }, "rm-sursilv": { "buckets": { "dev": 1358, "test": 1364, "train": 1572, "validated": 4307, "invalidated": 680, "other": 2199 }, "duration": 39264017, "reportedSentences": 17, "validatedSentences": 8705, "unvalidatedSentences": 7531, "clips": 7186, "splits": { "accent": {}, "age": { "": 0.63, "twenties": 0.1, "thirties": 0.03, "teens": 0.05, "fourties": 0.19, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.64, "male_masculine": 0.18, "female_feminine": 0.19, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7186, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 90, "size": 297207375, "checksum": "d3d3c0e600296f7e4a9d4f32dc98e7ecccb3a6b499c7b84937bd757de2795347", "avgDurationSecs": 5.464, "validDurationSecs": 23533.276, "totalHrs": 10.9, "validHrs": 6.53 }, "rm-vallader": { "buckets": { "dev": 500, "test": 462, "train": 557, "validated": 1526, "invalidated": 394, "other": 711 }, "duration": 15327554, "reportedSentences": 38, "validatedSentences": 5913, "unvalidatedSentences": 1172, "clips": 2631, "splits": { "accent": {}, "age": { "": 0.36, "twenties": 0.14, "thirties": 0.06, "teens": 0, "fourties": 0.41, 
"fifties": 0, "sixties": 0.03, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male_masculine": 0.44, "female_feminine": 0.19, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2631, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 53, "size": 117352658, "checksum": "faedc40f8353a39a6a9bd7ff5048bf0c8f173a62827e49d954d32a020538a4f2", "avgDurationSecs": 5.826, "validDurationSecs": 8890.098, "totalHrs": 4.25, "validHrs": 2.46 }, "ro": { "buckets": { "dev": 3896, "test": 3906, "train": 5123, "validated": 17808, "invalidated": 981, "other": 23042 }, "duration": 168623455, "reportedSentences": 414, "validatedSentences": 13741, "unvalidatedSentences": 89123, "clips": 41831, "splits": { "accent": {}, "age": { "": 0.15, "twenties": 0.61, "thirties": 0.13, "teens": 0.01, "fourties": 0.07, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.15, "male_masculine": 0.72, "female_feminine": 0.13, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 41831, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 431, "size": 1052952494, "checksum": "edc7289111c4fbbec32e87dc4fac186e2d7ece67a8c24e5732f5972813d31800", "avgDurationSecs": 4.031, "validDurationSecs": 71785.195, "totalHrs": 46.83, "validHrs": 19.94 }, "ru": { "buckets": { "dev": 10206, "test": 10207, "train": 26408, "validated": 165192, "invalidated": 10092, "other": 17177 }, "duration": 994570128, "reportedSentences": 
509, "validatedSentences": 47091, "unvalidatedSentences": 851, "clips": 192461, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.33, "thirties": 0.16, "teens": 0.09, "fourties": 0.14, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.24, "male_masculine": 0.61, "female_feminine": 0.15, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 192460, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 1, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 3241, "size": 6720780435, "checksum": "2fb52e1f3f8b74af2041ffcfd99803d4f4d6bbe012deff6587818f3b8a34deb5", "avgDurationSecs": 5.168, "validDurationSecs": 853653.616, "totalHrs": 276.26, "validHrs": 237.12 }, "rw": { "buckets": { "dev": 15987, "test": 16213, "train": 1003028, "validated": 1438822, "invalidated": 227811, "other": 47284 }, "duration": 8582532053, "reportedSentences": 630, "validatedSentences": 1404853, "unvalidatedSentences": 5, "clips": 1713917, "splits": { "accent": {}, "age": { "": 0.05, "twenties": 0.61, "thirties": 0.12, "teens": 0.2, "fourties": 0.02, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.1, "male_masculine": 0.57, "female_feminine": 0.33, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1713917, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1136, "size": 61194919279, "checksum": "c32fa590ff0281dc9e4c39ec847c228ffadfe91829fa694f59eaa213f84929c7", "avgDurationSecs": 5.008, "validDurationSecs": 
7204978.965, "totalHrs": 2384.03, "validHrs": 2001.38 }, "sah": { "buckets": { "dev": 1396, "test": 1470, "train": 1876, "validated": 4899, "invalidated": 152, "other": 2411 }, "duration": 45790860, "reportedSentences": 11, "validatedSentences": 6011, "unvalidatedSentences": 642, "clips": 7462, "splits": { "accent": {}, "age": { "": 0.35, "twenties": 0.25, "thirties": 0.24, "teens": 0.06, "fourties": 0.06, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.35, "male_masculine": 0.38, "female_feminine": 0.26, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7462, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 114, "size": 311564297, "checksum": "031fc812b9b286620a13ec99fca2f84d3f4e6c47f79cd99fdc9307b3c4747beb", "avgDurationSecs": 6.137, "validDurationSecs": 30062.908, "totalHrs": 12.71, "validHrs": 8.35 }, "sat": { "buckets": { "dev": 0, "test": 147, "train": 315, "validated": 462, "invalidated": 9, "other": 353 }, "duration": 3676320, "reportedSentences": 7, "validatedSentences": 5407, "unvalidatedSentences": 547, "clips": 824, "splits": { "accent": {}, "age": { "": 0.43, "twenties": 0.43, "thirties": 0.1, "teens": 0.01, "fourties": 0.01, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.41, "male_masculine": 0.58, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 824, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 
0 } }, "users": 13, "size": 20610286, "checksum": "3665773b018c827a52af8cd62abbef27e37ddbac89f2ab55d346d9cfd72de8bc", "avgDurationSecs": 4.462, "validDurationSecs": 2061.238, "totalHrs": 1.02, "validHrs": 0.57 }, "sc": { "buckets": { "dev": 283, "test": 290, "train": 542, "validated": 1284, "invalidated": 29, "other": 283 }, "duration": 7277328, "reportedSentences": 3, "validatedSentences": 5231, "unvalidatedSentences": 229, "clips": 1596, "splits": { "accent": {}, "age": { "": 0.57, "twenties": 0.11, "thirties": 0.31, "teens": 0, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.58, "male_masculine": 0.11, "female_feminine": 0.31, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1596, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 17, "size": 42557601, "checksum": "be0824b225751fead4c56c467fa79c25896769318b4a9f7d55bded09e6e197f5", "avgDurationSecs": 4.56, "validDurationSecs": 5854.692, "totalHrs": 2.02, "validHrs": 1.62 }, "sk": { "buckets": { "dev": 2484, "test": 2634, "train": 3386, "validated": 19622, "invalidated": 847, "other": 3437 }, "duration": 97441308, "reportedSentences": 194, "validatedSentences": 8681, "unvalidatedSentences": 510, "clips": 23906, "splits": { "accent": {}, "age": { "": 0.43, "twenties": 0.08, "thirties": 0.21, "teens": 0.09, "fourties": 0.18, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.43, "male_masculine": 0.5, "female_feminine": 0.07, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 23906, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, 
"history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 238, "size": 550902171, "checksum": "ccf1054a78b87fd55e174b9c4d25e82b9ffd17a0e2273303851e81db34d746ee", "avgDurationSecs": 4.076, "validDurationSecs": 79979.643, "totalHrs": 27.06, "validHrs": 22.21 }, "skr": { "buckets": { "dev": 1136, "test": 1020, "train": 1534, "validated": 3702, "invalidated": 463, "other": 1562 }, "duration": 23728140, "reportedSentences": 17, "validatedSentences": 5994, "unvalidatedSentences": 106, "clips": 5727, "splits": { "accent": {}, "age": { "": 0.36, "twenties": 0.25, "thirties": 0.01, "teens": 0, "fourties": 0, "fifties": 0.38, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.36, "male_masculine": 0.64, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5727, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 57, "size": 136549348, "checksum": "9c104eb726e8cfe53eeb13db8d63b8bf1e6cdea3e82c509dba8700c2c75dbd69", "avgDurationSecs": 4.143, "validDurationSecs": 15338.148, "totalHrs": 6.59, "validHrs": 4.26 }, "sl": { "buckets": { "dev": 1227, "test": 1261, "train": 1401, "validated": 11193, "invalidated": 285, "other": 3081 }, "duration": 55248742, "reportedSentences": 49, "validatedSentences": 4149, "unvalidatedSentences": 48, "clips": 14559, "splits": { "accent": {}, "age": { "": 0.17, "twenties": 0.41, "thirties": 0.04, "teens": 0.06, "fourties": 0.19, "fifties": 0.06, "sixties": 0.06, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.18, "male_masculine": 0.69, "female_feminine": 0.13, "transgender": 0, "non-binary": 0, 
"do_not_wish_to_say": 0 }, "sentence_domain": { "": 14559, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 177, "size": 378525860, "checksum": "5c5e547959157e9c539bcef23199786e38c3f391639aaf6cfe82a0744251a8f4", "avgDurationSecs": 3.795, "validDurationSecs": 42475.388, "totalHrs": 15.34, "validHrs": 11.79 }, "sq": { "buckets": { "dev": 578, "test": 691, "train": 911, "validated": 2187, "invalidated": 35, "other": 0 }, "duration": 10989036, "reportedSentences": 12, "validatedSentences": 55277, "unvalidatedSentences": 2626, "clips": 2222, "splits": { "accent": {}, "age": { "": 0.53, "twenties": 0.06, "thirties": 0.15, "teens": 0, "fourties": 0.24, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.48, "male_masculine": 0.43, "female_feminine": 0.09, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1538, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 467, "healthcare": 0, "history_law_government": 217, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 84, "size": 66807155, "checksum": "e531383d26ed0e6df337f12ecc04c4c12120fe913cb564aa8a95b27ed61e18cc", "avgDurationSecs": 4.946, "validDurationSecs": 10815.941, "totalHrs": 3.05, "validHrs": 3 }, "sr": { "buckets": { "dev": 1689, "test": 1777, "train": 2183, "validated": 7068, "invalidated": 387, "other": 1101 }, "duration": 26450892, "reportedSentences": 62, "validatedSentences": 6654, "unvalidatedSentences": 1594, "clips": 8556, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0.39, "thirties": 0.17, "teens": 0, "fourties": 0.05, "fifties": 0.17, 
"sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male_masculine": 0.5, "female_feminine": 0.24, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8556, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 159, "size": 144213552, "checksum": "85b572e256dfc767ae864dad7be93187773706a95657441ff101e56e6543e9aa", "avgDurationSecs": 3.092, "validDurationSecs": 21850.737, "totalHrs": 7.34, "validHrs": 6.06 }, "sv-SE": { "buckets": { "dev": 5289, "test": 5310, "train": 7844, "validated": 41229, "invalidated": 1438, "other": 6449 }, "duration": 196975779, "reportedSentences": 592, "validatedSentences": 25876, "unvalidatedSentences": 7329, "clips": 49116, "splits": { "accent": {}, "age": { "": 0.19, "twenties": 0.12, "thirties": 0.25, "teens": 0.03, "fourties": 0.38, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2, "male_masculine": 0.48, "female_feminine": 0.33, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 49116, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 851, "size": 1246648181, "checksum": "bed7ed0c83ddf17219c28e74a6c26b993873d85fbdf0b0164351cc0d7cdfe70d", "avgDurationSecs": 4.01, "validDurationSecs": 165345.598, "totalHrs": 54.71, "validHrs": 45.92 }, "sw": { "buckets": { "dev": 12254, "test": 12255, "train": 46498, "validated": 267035, "invalidated": 80620, "other": 377381 }, "duration": 3905357796, "reportedSentences": 1901, 
"validatedSentences": 134673, "unvalidatedSentences": 5828, "clips": 725036, "splits": { "accent": {}, "age": { "": 0.31, "twenties": 0.5, "thirties": 0.11, "teens": 0.01, "fourties": 0.03, "fifties": 0.04, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.29, "male_masculine": 0.34, "female_feminine": 0.37, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 725036, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1456, "size": 22794585413, "checksum": "2950a7d13020bdfa7d5c487453bd8bbd4b04806d93bedf33980f65f82803023e", "avgDurationSecs": 5.386, "validDurationSecs": 1438366.121, "totalHrs": 1084.82, "validHrs": 399.54 }, "ta": { "buckets": { "dev": 12108, "test": 12171, "train": 45982, "validated": 135961, "invalidated": 5704, "other": 100667 }, "duration": 1497737868, "reportedSentences": 3431, "validatedSentences": 118205, "unvalidatedSentences": 620, "clips": 242332, "splits": { "accent": {}, "age": { "": 0.67, "twenties": 0.1, "thirties": 0.12, "teens": 0.03, "fourties": 0.03, "fifties": 0.02, "sixties": 0, "seventies": 0.02, "eighties": 0, "nineties": 0 }, "gender": { "": 0.67, "male_masculine": 0.19, "female_feminine": 0.14, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 242332, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 928, "size": 8981404173, "checksum": "1aff6191ee3dc782536c749481e10570f26c57f46b67b9332d0423dbed7b1467", "avgDurationSecs": 6.181, "validDurationSecs": 
840309.733, "totalHrs": 416.03, "validHrs": 233.41 }, "te": { "buckets": { "dev": 49, "test": 50, "train": 61, "validated": 228, "invalidated": 21, "other": 1751 }, "duration": 8399952, "reportedSentences": 3, "validatedSentences": 270, "unvalidatedSentences": 19175, "clips": 2000, "splits": { "accent": {}, "age": { "": 0.14, "twenties": 0.67, "thirties": 0.01, "teens": 0.08, "fourties": 0.01, "fifties": 0, "sixties": 0.09, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.14, "male_masculine": 0.66, "female_feminine": 0.2, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2000, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 45, "size": 50471093, "checksum": "c127b52f863afc6e6acd45b67d8949c3c9e64e0ac7b608d8d8bb17151ea1caa9", "avgDurationSecs": 4.2, "validDurationSecs": 957.595, "totalHrs": 2.33, "validHrs": 0.26 }, "th": { "buckets": { "dev": 11044, "test": 11044, "train": 32838, "validated": 147301, "invalidated": 9271, "other": 207403 }, "duration": 1525322280, "reportedSentences": 4277, "validatedSentences": 64208, "unvalidatedSentences": 121950, "clips": 363975, "splits": { "accent": {}, "age": { "": 0.4, "twenties": 0.24, "thirties": 0.07, "teens": 0.04, "fourties": 0.04, "fifties": 0.2, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.41, "male_masculine": 0.42, "female_feminine": 0.17, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 363975, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, 
"technology_robotics": 0 } }, "users": 7894, "size": 8899675233, "checksum": "061f0d5e21303fed93644aee8ec3020700331ada15790a9b7a0ad0970964e51d", "avgDurationSecs": 4.191, "validDurationSecs": 617299.257, "totalHrs": 423.7, "validHrs": 171.47 }, "ti": { "buckets": { "dev": 5, "test": 6, "train": 14, "validated": 25, "invalidated": 1, "other": 268 }, "duration": 1472724, "reportedSentences": 3, "validatedSentences": 4532, "unvalidatedSentences": 140, "clips": 294, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0, "thirties": 0.03, "teens": 0, "fourties": 0.74, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.95, "male_masculine": 0.05, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 294, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 9, "size": 8932196, "checksum": "6d328ab9007ce37bc5c3e5cafcb30880e6e57ceaca896da6f34edd37b034164a", "avgDurationSecs": 5.009, "validDurationSecs": 125.232, "totalHrs": 0.4, "validHrs": 0.03 }, "tig": { "buckets": { "dev": 103, "test": 183, "train": 369, "validated": 655, "invalidated": 16, "other": 959 }, "duration": 10043568, "reportedSentences": 2, "validatedSentences": 5382, "unvalidatedSentences": 20, "clips": 1630, "splits": { "accent": {}, "age": { "": 0.98, "twenties": 0.01, "thirties": 0.01, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.98, "male_masculine": 0.02, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1630, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, 
"history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 29, "size": 51991862, "checksum": "4442d7baeedacf29637059577892858bba543043a3638a72f5c241ccb441a097", "avgDurationSecs": 6.162, "validDurationSecs": 4035.912, "totalHrs": 2.78, "validHrs": 1.12 }, "tk": { "buckets": { "dev": 425, "test": 483, "train": 769, "validated": 1832, "invalidated": 124, "other": 2382 }, "duration": 23469732, "reportedSentences": 27, "validatedSentences": 2456, "unvalidatedSentences": 251, "clips": 4338, "splits": { "accent": {}, "age": { "": 0.34, "twenties": 0.57, "thirties": 0.09, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.34, "male_masculine": 0.47, "female_feminine": 0.19, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 4338, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 112, "size": 131990126, "checksum": "1d90c64655782c1fd76bf34db648bf706ff49a851330a888f7e58c15561c76d6", "avgDurationSecs": 5.41, "validDurationSecs": 9911.607, "totalHrs": 6.51, "validHrs": 2.75 }, "tn": { "buckets": { "dev": 0, "test": 0, "train": 0, "validated": 0, "invalidated": 0, "other": 5 }, "duration": 16164, "reportedSentences": 0, "validatedSentences": 1324, "unvalidatedSentences": 15769, "clips": 5, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5, 
"agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1, "size": 1136258, "checksum": "7f3f138c3b1f574b3db4f95d434e9f67dfc93f2d76a4175b7140e661f993f3b3", "avgDurationSecs": 3.233, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "tok": { "buckets": { "dev": 2178, "test": 2157, "train": 2584, "validated": 11893, "invalidated": 341, "other": 4453 }, "duration": 69583968, "reportedSentences": 145, "validatedSentences": 7248, "unvalidatedSentences": 2468, "clips": 16687, "splits": { "accent": {}, "age": { "": 0.32, "twenties": 0.2, "thirties": 0.07, "teens": 0.25, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0.17, "nineties": 0 }, "gender": { "": 0.44, "male_masculine": 0.51, "female_feminine": 0.06, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 16687, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 154, "size": 407417859, "checksum": "6e083175a5f27e9b4cc0742e41d4dd70e1d19a2b0ee18985c97f76c7b084da37", "avgDurationSecs": 4.17, "validDurationSecs": 49593.224, "totalHrs": 19.32, "validHrs": 13.77 }, "tr": { "buckets": { "dev": 11413, "test": 11427, "train": 36500, "validated": 115710, "invalidated": 4615, "other": 116 }, "duration": 449878419, "reportedSentences": 413, "validatedSentences": 409971, "unvalidatedSentences": 3389, "clips": 120441, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.23, "thirties": 0.09, "teens": 0.02, "fourties": 0.03, "fifties": 0.07, "sixties": 0.2, "seventies": 0.03, "eighties": 0.09, 
"nineties": 0 }, "gender": { "": 0.26, "male_masculine": 0.43, "female_feminine": 0.31, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 120439, "agriculture_food": 0, "automotive_transport": 1, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 1, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1624, "size": 2751498185, "checksum": "10ff35a4793d15f7543a01129a5fe5be5dd3a506c9a03d153bcc70304536da43", "avgDurationSecs": 3.735, "validDurationSecs": 432206.905, "totalHrs": 124.96, "validHrs": 120.05 }, "tt": { "buckets": { "dev": 3693, "test": 4967, "train": 9310, "validated": 29169, "invalidated": 425, "other": 337 }, "duration": 113269274, "reportedSentences": 6, "validatedSentences": 18031, "unvalidatedSentences": 50, "clips": 29931, "splits": { "accent": {}, "age": { "": 0.2, "twenties": 0.05, "thirties": 0.72, "teens": 0, "fourties": 0, "fifties": 0.01, "sixties": 0, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2, "male_masculine": 0.78, "female_feminine": 0.02, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 29931, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 265, "size": 833135698, "checksum": "447d884ba85e352a234a47ba7578588a5abb9f9b1149c70233e02af6d5cf85f9", "avgDurationSecs": 3.784, "validDurationSecs": 110385.602, "totalHrs": 31.46, "validHrs": 30.66 }, "tw": { "buckets": { "dev": 0, "test": 11, "train": 165, "validated": 176, "invalidated": 2, "other": 79 }, "duration": 1067076, "reportedSentences": 3, "validatedSentences": 49159, "unvalidatedSentences": 20, "clips": 257, "splits": 
{ "accent": {}, "age": { "": 0.14, "twenties": 0.02, "thirties": 0.84, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.14, "male_masculine": 0.86, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 257, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 8, "size": 9024530, "checksum": "2631eeb8099af0c63146888aa987728811d6c422087acc3769c86e1f0e44967c", "avgDurationSecs": 4.152, "validDurationSecs": 730.76, "totalHrs": 0.29, "validHrs": 0.2 }, "ug": { "buckets": { "dev": 8451, "test": 8451, "train": 17173, "validated": 147781, "invalidated": 7801, "other": 83243 }, "duration": 1353678372, "reportedSentences": 1293, "validatedSentences": 172505, "unvalidatedSentences": 964, "clips": 238825, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.3, "thirties": 0.3, "teens": 0.05, "fourties": 0.11, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.48, "male_masculine": 0.32, "female_feminine": 0.2, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.01 }, "sentence_domain": { "": 236755, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 2064, "healthcare": 22, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 6, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1386, "size": 7704841095, "checksum": "1585f30bba2e01165cee458a1d0f2cd789ad2521f5093b5f10efbb0a79a9a388", "avgDurationSecs": 5.668, "validDurationSecs": 837634.014, "totalHrs": 376.02, "validHrs": 232.67 }, "uk": { "buckets": { "dev": 10083, "test": 10077, "train": 25600, 
"validated": 76097, "invalidated": 3239, "other": 8196 }, "duration": 406502304, "reportedSentences": 650, "validatedSentences": 212628, "unvalidatedSentences": 9, "clips": 87532, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0.33, "thirties": 0.25, "teens": 0.09, "fourties": 0.1, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.24, "male_masculine": 0.59, "female_feminine": 0.17, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 87532, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1091, "size": 2676854237, "checksum": "c6773d52e058f45795bb5c4d450f9808262ef6faa043785375bc0280d86284b5", "avgDurationSecs": 4.644, "validDurationSecs": 353397.681, "totalHrs": 112.91, "validHrs": 98.16 }, "ur": { "buckets": { "dev": 4071, "test": 4071, "train": 5390, "validated": 53999, "invalidated": 6839, "other": 135720 }, "duration": 834606036, "reportedSentences": 237, "validatedSentences": 18235, "unvalidatedSentences": 50951, "clips": 196558, "splits": { "accent": {}, "age": { "": 0.08, "twenties": 0.9, "thirties": 0.01, "teens": 0, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.08, "male_masculine": 0.67, "female_feminine": 0.25, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 196558, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 354, "size": 4865669948, "checksum": 
"42a3177e953c347334170192e9634a426df9f4d809917fba5be8158b2f8ce2e3", "avgDurationSecs": 4.246, "validDurationSecs": 229285.459, "totalHrs": 231.83, "validHrs": 63.69 }, "uz": { "buckets": { "dev": 12136, "test": 12364, "train": 48523, "validated": 86543, "invalidated": 13969, "other": 128391 }, "duration": 950400216, "reportedSentences": 1832, "validatedSentences": 148681, "unvalidatedSentences": 137910, "clips": 228903, "splits": { "accent": {}, "age": { "": 0.41, "twenties": 0.4, "thirties": 0.01, "teens": 0.18, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.41, "male_masculine": 0.44, "female_feminine": 0.15, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 228903, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 2189, "size": 5332758006, "checksum": "3e6d39e86b2067d2f59df815790a793b2657b38ae65cd017cb5e8ed9051199a8", "avgDurationSecs": 4.152, "validDurationSecs": 359324.63, "totalHrs": 264, "validHrs": 99.81 }, "vi": { "buckets": { "dev": 672, "test": 1312, "train": 2260, "validated": 5247, "invalidated": 385, "other": 11741 }, "duration": 68681888, "reportedSentences": 189, "validatedSentences": 5754, "unvalidatedSentences": 5336, "clips": 17373, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.2, "thirties": 0.02, "teens": 0.22, "fourties": 0.02, "fifties": 0, "sixties": 0.29, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.27, "male_masculine": 0.54, "female_feminine": 0.19, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 17373, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, 
"history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 290, "size": 399991461, "checksum": "98e3acf4551035b9785f192f83db046bbae333d2ffa1fe01addff820c6900028", "avgDurationSecs": 3.953, "validDurationSecs": 20743.33, "totalHrs": 19.07, "validHrs": 5.76 }, "vot": { "buckets": { "dev": 0, "test": 6, "train": 96, "validated": 102, "invalidated": 324, "other": 1 }, "duration": 1029972, "reportedSentences": 0, "validatedSentences": 5437, "unvalidatedSentences": 1, "clips": 427, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.74, "thirties": 0, "teens": 0.01, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.25, "male_masculine": 0.75, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 427, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 6, "size": 8186595, "checksum": "97f07f07365e619d4494245acd859cd3a933ac3338b0a2aee816f5887dc1d83f", "avgDurationSecs": 2.412, "validDurationSecs": 246.035, "totalHrs": 0.28, "validHrs": 0.06 }, "xh": { "buckets": { "dev": 0, "test": 0, "train": 0, "validated": 0, "invalidated": 0, "other": 10 }, "duration": 60408, "reportedSentences": 0, "validatedSentences": 1476, "unvalidatedSentences": 14922, "clips": 10, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 10, "agriculture_food": 0, 
"automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 2, "size": 1492312, "checksum": "43353e9ae84cb0b59676fc10e1a22b409a8202af389e7f7aee83d2e782cf4ef1", "avgDurationSecs": 6.041, "validDurationSecs": 0, "totalHrs": 0.01, "validHrs": 0 }, "yi": { "buckets": { "dev": 97, "test": 109, "train": 151, "validated": 370, "invalidated": 20, "other": 10 }, "duration": 1680084, "reportedSentences": 1, "validatedSentences": 846, "unvalidatedSentences": 655, "clips": 400, "splits": { "accent": {}, "age": { "": 0.85, "twenties": 0.15, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 400, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 35, "size": 9311705, "checksum": "fbcc20909e126bb13804d7067abbe58fee7c8729232583a5856f360f2ee981d1", "avgDurationSecs": 4.2, "validDurationSecs": 1554.078, "totalHrs": 0.46, "validHrs": 0.43 }, "yo": { "buckets": { "dev": 868, "test": 1004, "train": 1225, "validated": 3099, "invalidated": 230, "other": 1125 }, "duration": 26494596, "reportedSentences": 21, "validatedSentences": 5413, "unvalidatedSentences": 16, "clips": 4454, "splits": { "accent": {}, "age": { "": 0.3, "twenties": 0.63, "thirties": 0.02, "teens": 0.04, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.31, "male_masculine": 0.32, "female_feminine": 0.38, 
"transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 4454, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 113, "size": 155282570, "checksum": "c5a4138b115c67bf72f5338f7fcc7ae486959ac1211d0f4f19920eb4e80c6c50", "avgDurationSecs": 5.948, "validDurationSecs": 18434.385, "totalHrs": 7.35, "validHrs": 5.12 }, "yue": { "buckets": { "dev": 3676, "test": 3677, "train": 4721, "validated": 49572, "invalidated": 2773, "other": 126985 }, "duration": 698217228, "reportedSentences": 1806, "validatedSentences": 17492, "unvalidatedSentences": 8243, "clips": 179330, "splits": { "accent": {}, "age": { "": 0.17, "twenties": 0.32, "thirties": 0.44, "teens": 0.01, "fourties": 0.05, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.19, "male_masculine": 0.24, "female_feminine": 0.56, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 179020, "agriculture_food": 2, "automotive_transport": 6, "finance": 12, "service_retail": 71, "general": 182, "healthcare": 41, "history_law_government": 8, "language_fundamentals": 0, "media_entertainment": 23, "nature_environment": 20, "news_current_affairs": 12, "technology_robotics": 12 } }, "users": 1079, "size": 4019294829, "checksum": "388a13379401d59d9364c929d8f51414f23a055d5d9f0af888bdb26b9d3ce0d6", "avgDurationSecs": 3.893, "validDurationSecs": 193007.441, "totalHrs": 193.94, "validHrs": 53.61 }, "zgh": { "buckets": { "dev": 286, "test": 193, "train": 564, "validated": 1043, "invalidated": 74, "other": 642 }, "duration": 5920884, "reportedSentences": 4, "validatedSentences": 2716, "unvalidatedSentences": 366, "clips": 1759, "splits": { "accent": {}, "age": { "": 0.08, "twenties": 0.28, 
"thirties": 0.64, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.08, "male_masculine": 0.88, "female_feminine": 0.03, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1734, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 21, "healthcare": 1, "history_law_government": 0, "language_fundamentals": 3, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 19, "size": 32623129, "checksum": "f3350db48b467e7a9a09e7ff1b5317c67d5395aacffe0eac2d402922963690e9", "avgDurationSecs": 3.366, "validDurationSecs": 3510.791, "totalHrs": 1.64, "validHrs": 0.97 }, "zh-CN": { "buckets": { "dev": 10627, "test": 10627, "train": 29415, "validated": 185904, "invalidated": 58620, "other": 597654 }, "duration": 3828871140, "reportedSentences": 937, "validatedSentences": 58851, "unvalidatedSentences": 80, "clips": 842178, "splits": { "accent": {}, "age": { "": 0.93, "twenties": 0.04, "thirties": 0.01, "teens": 0.01, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.93, "male_masculine": 0.06, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 842043, "agriculture_food": 0, "automotive_transport": 8, "finance": 2, "service_retail": 6, "general": 84, "healthcare": 5, "history_law_government": 35, "language_fundamentals": 0, "media_entertainment": 5, "nature_environment": 2, "news_current_affairs": 4, "technology_robotics": 8 } }, "users": 7086, "size": 22671955881, "checksum": "6b5e4c14960f56b167300fa6ddc205bdfd13366dc570b2f5df8f1f532b7cfedd", "avgDurationSecs": 4.546, "validDurationSecs": 845192.418, "totalHrs": 1063.57, "validHrs": 234.77 }, "zh-HK": { "buckets": { "dev": 5597, "test": 5597, "train": 8435, "validated": 93199, "invalidated": 4476, "other": 
22335 }, "duration": 500984196, "reportedSentences": 704, "validatedSentences": 20014, "unvalidatedSentences": 26, "clips": 120010, "splits": { "accent": {}, "age": { "": 0.39, "twenties": 0.33, "thirties": 0.11, "teens": 0.02, "fourties": 0.12, "fifties": 0.02, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.36, "male_masculine": 0.42, "female_feminine": 0.22, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 119990, "agriculture_food": 0, "automotive_transport": 2, "finance": 1, "service_retail": 5, "general": 8, "healthcare": 2, "history_law_government": 2, "language_fundamentals": 0, "media_entertainment": 1, "nature_environment": 3, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 3045, "size": 3580167192, "checksum": "99f69a756c102242106a8b17634bb081b4b55a533726ba0ebf5561157b02ced2", "avgDurationSecs": 4.175, "validDurationSecs": 389061.129, "totalHrs": 139.16, "validHrs": 108.07 }, "zh-TW": { "buckets": { "dev": 5004, "test": 5004, "train": 7157, "validated": 82939, "invalidated": 4809, "other": 48646 }, "duration": 458913946, "reportedSentences": 161, "validatedSentences": 19478, "unvalidatedSentences": 32, "clips": 136394, "splits": { "accent": {}, "age": { "": 0.27, "twenties": 0.3, "thirties": 0.19, "teens": 0.06, "fourties": 0.09, "fifties": 0.09, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.28, "male_masculine": 0.5, "female_feminine": 0.22, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 136361, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 1, "general": 20, "healthcare": 0, "history_law_government": 3, "language_fundamentals": 0, "media_entertainment": 7, "nature_environment": 2, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 2212, "size": 3073595323, "checksum": "6dc7254a37d2d1b23a1349811f702591c379a919ebc1ccebcce69c8eab93d813", 
"avgDurationSecs": 3.365, "validDurationSecs": 279058.197, "totalHrs": 127.47, "validHrs": 77.51 }, "zu": { "buckets": { "dev": 0, "test": 0, "train": 1, "validated": 1, "invalidated": 0, "other": 34 }, "duration": 178344, "reportedSentences": 0, "validatedSentences": 1320, "unvalidatedSentences": 19380, "clips": 35, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 35, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1, "size": 2480948, "checksum": "9c86c4efc924798ecec4dcce5c82761cd137d373802ce36b885036723403dcfb", "avgDurationSecs": 5.096, "validDurationSecs": 5.096, "totalHrs": 0.04, "validHrs": 0 }, "zza": { "buckets": { "dev": 185, "test": 96, "train": 463, "validated": 744, "invalidated": 112, "other": 70 }, "duration": 4109796, "reportedSentences": 2027, "validatedSentences": 5673, "unvalidatedSentences": 1000, "clips": 926, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0, "thirties": 0.44, "teens": 0, "fourties": 0, "fifties": 0.03, "sixties": 0.29, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0.63, "female_feminine": 0.05, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 314, "agriculture_food": 2, "automotive_transport": 1, "finance": 1, "service_retail": 5, "general": 562, "healthcare": 13, "history_law_government": 0, "language_fundamentals": 22, "media_entertainment": 2, "nature_environment": 2, "news_current_affairs": 0, "technology_robotics": 2 } }, 
"users": 11, "size": 24581922, "checksum": "29de34c777992d4474de5bacd0d4b603e044fb1d4ee26cfa1461afbc81b80d74", "avgDurationSecs": 4.438, "validDurationSecs": 3302.039, "totalHrs": 1.14, "validHrs": 0.91 } }, "totalDuration": 115639805731, "totalValidDurationSecs": 75397609, "totalHrs": 32121, "totalValidHrs": 20943 } ================================================ FILE: datasets/scripted-speech/cv-corpus-18.0-delta-2024-06-14.json ================================================ { "locales": { "ab": { "duration": 101268, "reportedSentences": 0, "validatedSentences": 2, "unvalidatedSentences": 0, "clips": 20, "users": 1, "size": 681162, "checksum": "366a8953e6deb728f1ad9d955e5e2bf55341101f5af1843ac32503393b5e9e8c", "avgDurationSecs": 5.132, "validDurationSecs": 96.54, "totalHrs": 0.03, "validHrs": 0.03 }, "af": { "duration": 1639800, "reportedSentences": 0, "validatedSentences": 14, "unvalidatedSentences": -10, "clips": 248, "users": 11, "size": 9694912, "checksum": "04fb3c2718801cae018915dee307f46efc9611c5749099a8e7a76b2a49747b7d", "avgDurationSecs": 5.616, "validDurationSecs": 356.66, "totalHrs": 0.45, "validHrs": 0.1 }, "am": { "duration": 110916, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 1, "clips": 15, "users": 1, "size": 660283, "checksum": "65a24ac53d584e90743fcf0fd0e63441ccb71162daf196f33716a238740c4ee5", "avgDurationSecs": 6.211, "validDurationSecs": 178.18, "totalHrs": 0.03, "validHrs": 0.05 }, "ar": { "duration": 492840, "reportedSentences": 2, "validatedSentences": 10, "unvalidatedSentences": -4, "clips": 116, "users": 17, "size": 2952362, "checksum": "6b0648d0944bdffd9ba911e14d054a04b4cad16e1a17cc9979b267e44ec6f907", "avgDurationSecs": 4.158, "validDurationSecs": 613.19, "totalHrs": 0.13, "validHrs": 0.17 }, "as": { "duration": 0, "reportedSentences": 0, "validatedSentences": 26, "unvalidatedSentences": -26, "clips": 0, "users": 0, "size": 6578, "checksum": 
"ddd59e9ea1a88f9dc04300219bc471c7e2ec0b2d8c7bffb401b391502296f019", "avgDurationSecs": 5.691, "validDurationSecs": 113.81, "totalHrs": 0, "validHrs": 0.03 }, "ast": { "duration": 574992, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 135, "users": 1, "size": 3384326, "checksum": "1d65421230d6576fde81ab098a0d0403c6541012595c97919cbdb16b2a42f4e3", "avgDurationSecs": 4.434, "validDurationSecs": 87.41, "totalHrs": 0.16, "validHrs": 0.03 }, "az": { "duration": 2715264, "reportedSentences": 0, "validatedSentences": 5, "unvalidatedSentences": -5, "clips": 497, "users": 5, "size": 16003223, "checksum": "bf7596f1b497787dcf10f8cc90f2d08d110deaf832cc08f6541b7ac95203cfa0", "avgDurationSecs": 5.433, "validDurationSecs": 500.98, "totalHrs": 0.76, "validHrs": 0.14 }, "ba": { "duration": 257508, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 51, "users": 4, "size": 1683769, "checksum": "e9eed2645525f9f3a8a4ee23e02506596e73ab78a1f97b4d41c0db44d47b9b9b", "avgDurationSecs": 4.426, "validDurationSecs": 973.37, "totalHrs": 0.08, "validHrs": 0.27 }, "bas": { "duration": 49500, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 10, "users": 2, "size": 294611, "checksum": "aa42f11f6c4a1f43e6bdbaa9b415fb4e9e9c34c0469a7ebcc2172ed06df3dc33", "avgDurationSecs": 4.428, "validDurationSecs": 35.01, "totalHrs": 0.02, "validHrs": 0.01 }, "be": { "duration": 177421572, "reportedSentences": 3, "validatedSentences": 16, "unvalidatedSentences": 1, "clips": 36277, "users": 109, "size": 1024500480, "checksum": "4e9b63c2b16c17f7003ba856fc54658410573b4b0ebfd8162033f8ddc414b0bc", "avgDurationSecs": 4.765, "validDurationSecs": 176920.69, "totalHrs": 49.28, "validHrs": 49.15 }, "bg": { "duration": 398016, "reportedSentences": 1, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 74, "users": 2, "size": 2364709, "checksum": "0babc9036643a743f242d1bd1244db783b12076616dc772048af639787afb330", 
"avgDurationSecs": 5.471, "validDurationSecs": 859.02, "totalHrs": 0.12, "validHrs": 0.24 }, "bn": { "duration": 818892, "reportedSentences": 0, "validatedSentences": 8, "unvalidatedSentences": -5, "clips": 143, "users": 13, "size": 5350266, "checksum": "d35df335e50b942e73a55e92e6b2df30694e65ae7965b817e410f19bc0c746da", "avgDurationSecs": 4.367, "validDurationSecs": 396.81, "totalHrs": 0.23, "validHrs": 0.11 }, "br": { "duration": 381492, "reportedSentences": 1, "validatedSentences": 9, "unvalidatedSentences": 18, "clips": 146, "users": 2, "size": 2257650, "checksum": "83fb17e1b2a78bc3428e6ce25edf59cf551a3def1c5020b86b2f48e571148ce3", "avgDurationSecs": 3.178, "validDurationSecs": 21549.45, "totalHrs": 0.11, "validHrs": 5.99 }, "ca": { "duration": 274289148, "reportedSentences": 203, "validatedSentences": 74760, "unvalidatedSentences": 3, "clips": 61725, "users": 290, "size": 1580761552, "checksum": "d9f6a2f56852a264c34904a38d1c8e9137b49ca11896d5a03ce927707e04582d", "avgDurationSecs": 5.301, "validDurationSecs": 320727.86, "totalHrs": 76.19, "validHrs": 89.09 }, "ckb": { "duration": 52885404, "reportedSentences": 49, "validatedSentences": 331, "unvalidatedSentences": -136, "clips": 11274, "users": 75, "size": 304725126, "checksum": "41288edc8073cc35dfc71492134c66c8bbfda68f2e2a325784118f12c96d2c3b", "avgDurationSecs": 4.072, "validDurationSecs": 10912.27, "totalHrs": 14.69, "validHrs": 3.03 }, "cnh": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 6766, "checksum": "de5207e339c7c2f851b26ffad9c6d6d0e3b838aa096b738b182da162d053adbc", "avgDurationSecs": 3.516, "validDurationSecs": 3.51, "totalHrs": 0, "validHrs": 0 }, "cs": { "duration": 8222400, "reportedSentences": 20, "validatedSentences": 49, "unvalidatedSentences": -41, "clips": 1736, "users": 28, "size": 47719558, "checksum": "5f2d12019ab2adf15492b1090342f62f47f0843b6a607b7c2bb1589879a1e1db", "avgDurationSecs": 4.464, 
"validDurationSecs": 2886.85, "totalHrs": 2.29, "validHrs": 0.8 }, "cv": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 24392, "checksum": "12c80beecfe9bd7c73ce71df7cfbace2d9e4b548df0eeff2f128f6c9bbaec275", "avgDurationSecs": 5.059, "validDurationSecs": 5.06, "totalHrs": 0, "validHrs": 0 }, "cy": { "duration": 3028356, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 1, "clips": 462, "users": 9, "size": 17957977, "checksum": "7196f4a5b77928ff274d65932ddc2b3858e34e46d49be3ded90e1cf3eace0bd8", "avgDurationSecs": 4.91, "validDurationSecs": 1216.06, "totalHrs": 0.84, "validHrs": 0.34 }, "da": { "duration": 152712, "reportedSentences": 4, "validatedSentences": 7, "unvalidatedSentences": -1, "clips": 56, "users": 7, "size": 848775, "checksum": "fe5126bb8de26ec0a97829941fa75e069f1af535fcab50c7c35094a3eb076c25", "avgDurationSecs": 4.108, "validDurationSecs": 174.76, "totalHrs": 0.04, "validHrs": 0.05 }, "de": { "duration": 22357044, "reportedSentences": 139, "validatedSentences": 68, "unvalidatedSentences": 79, "clips": 4026, "users": 104, "size": 130612823, "checksum": "f502e72090d9a7c8b876b06087113c2b9b1cb524ff458d03ad39f55ed270cf84", "avgDurationSecs": 5.236, "validDurationSecs": 21747.55, "totalHrs": 6.21, "validHrs": 6.04 }, "dv": { "duration": 2600712, "reportedSentences": 3, "validatedSentences": 20, "unvalidatedSentences": -17, "clips": 530, "users": 9, "size": 15267860, "checksum": "a2927e8a9e4fce2d319e013e7b3a0df4172072fc8c56d55d75daaeb701207b66", "avgDurationSecs": 5.061, "validDurationSecs": 335.2, "totalHrs": 0.72, "validHrs": 0.09 }, "dyu": { "duration": 38088, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 5, "users": 1, "size": 225760, "checksum": "438f73de95261c445a7631584c9a4f166e12a969e52d686aabcb33563f093533", "avgDurationSecs": 6.204, "validDurationSecs": 11, "totalHrs": 0.01, "validHrs": 0.01 }, "el": { 
"duration": 72648, "reportedSentences": 0, "validatedSentences": 61, "unvalidatedSentences": -27, "clips": 16, "users": 4, "size": 394083, "checksum": "ca6d9c76e86d51ef11a35a6c50b1dfda2f98e394685286b55600c98cba0cf638", "avgDurationSecs": 4.144, "validDurationSecs": 256.52, "totalHrs": 0.02, "validHrs": 0.07 }, "en": { "duration": 166649364, "reportedSentences": 989, "validatedSentences": 755, "unvalidatedSentences": 190, "clips": 27188, "users": 954, "size": 976139262, "checksum": "8a71c2f5daa5443ee7513e5abbb1451098a3ebc6620aedeba26e0ae9c877ea4a", "avgDurationSecs": 5.242, "validDurationSecs": 88366.31, "totalHrs": 46.29, "validHrs": 24.55 }, "eo": { "duration": 16069752, "reportedSentences": 46, "validatedSentences": 480, "unvalidatedSentences": -345, "clips": 2471, "users": 29, "size": 95350019, "checksum": "b1dd7d3a23fab4d1e295e70ff86c9efea80e6e533e691ec575c4b50482974a4d", "avgDurationSecs": 6.068, "validDurationSecs": 3382.19, "totalHrs": 4.46, "validHrs": 0.94 }, "es": { "duration": 35091072, "reportedSentences": 27, "validatedSentences": 124, "unvalidatedSentences": -174, "clips": 7238, "users": 65, "size": 205228937, "checksum": "c17f95e1a8c2b694c5df41411044188798f2bef8efd0254071af20f995ca5b03", "avgDurationSecs": 4.872, "validDurationSecs": 35676.05, "totalHrs": 9.75, "validHrs": 9.91 }, "et": { "duration": 44820, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 5, "users": 0, "size": 289967, "checksum": "7a0cab555bd0ec106fbe86d6db7c7bb1b9669fb5c9f7b9e87e2e3bded0bbbaf9", "avgDurationSecs": 6.771, "validDurationSecs": 238.58, "totalHrs": 0.01, "validHrs": 0.06 }, "eu": { "duration": 55278216, "reportedSentences": 17, "validatedSentences": 838119, "unvalidatedSentences": -2, "clips": 10192, "users": 185, "size": 374452627, "checksum": "8d91e5c97cf8d9fd0f3d22ed5bb544a6bc3c1337b74e4984b9f9ecdd2242fc1d", "avgDurationSecs": 5.495, "validDurationSecs": 294998.76, "totalHrs": 15.35, "validHrs": 81.94 }, "fa": { "duration": 
8898768, "reportedSentences": 38, "validatedSentences": 326, "unvalidatedSentences": -180, "clips": 2633, "users": 23, "size": 51868053, "checksum": "f61c857b2a791823a4079746ffe286ffc289d80644f5d92111c9fc4e79a2ba5b", "avgDurationSecs": 3.976, "validDurationSecs": 7973.85, "totalHrs": 2.47, "validHrs": 2.22 }, "fi": { "duration": 1683252, "reportedSentences": 0, "validatedSentences": 85, "unvalidatedSentences": -85, "clips": 312, "users": 4, "size": 9813653, "checksum": "8d39c7a6c8c1f630ea1053a1de3edbc40f1421fe491d42926cb90e94c76a2940", "avgDurationSecs": 4.615, "validDurationSecs": 1128.46, "totalHrs": 0.47, "validHrs": 0.31 }, "fr": { "duration": 26556912, "reportedSentences": 21, "validatedSentences": 1118, "unvalidatedSentences": -743, "clips": 5213, "users": 162, "size": 155388411, "checksum": "2e44cb8ac73130aa98f54da1453e42a92a1e788f5f8bd1c0c23c8ac5cf60637d", "avgDurationSecs": 5.024, "validDurationSecs": 39655.8, "totalHrs": 7.38, "validHrs": 11.01 }, "fy-NL": { "duration": 2597328, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 550, "users": 22, "size": 14780547, "checksum": "4c2182435609eb33cc0fa17d63a6cb56dedd5028fddd0dc69e7eb8eaaf243ac3", "avgDurationSecs": 4.854, "validDurationSecs": 1816.11, "totalHrs": 0.72, "validHrs": 0.51 }, "ga-IE": { "duration": 88236, "reportedSentences": 0, "validatedSentences": 1, "unvalidatedSentences": 81, "clips": 20, "users": 3, "size": 514259, "checksum": "8055eca68d486f4eec40c5f492827b8439d76a54289c6d5be094e5f5f02388b3", "avgDurationSecs": 3.593, "validDurationSecs": 809.8, "totalHrs": 0.02, "validHrs": 0.23 }, "gl": { "duration": 32965848, "reportedSentences": 33, "validatedSentences": 533, "unvalidatedSentences": -456, "clips": 5387, "users": 50, "size": 194579540, "checksum": "077f59cb2d4444b1aa11f77b0641c06f9f021957699325a2cfe4eefb09905e7d", "avgDurationSecs": 5.21, "validDurationSecs": 36095.08, "totalHrs": 9.16, "validHrs": 10.03 }, "gn": { "duration": 0, "reportedSentences": 0, 
"validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 2794, "checksum": "cb7cc8a00e050c33a8ef33ed56cb52af2994571f5897326bed50f1f268da4c39", "avgDurationSecs": 4.541, "validDurationSecs": 18.16, "totalHrs": 0, "validHrs": 0.01 }, "ha": { "duration": 106704, "reportedSentences": 0, "validatedSentences": 11, "unvalidatedSentences": -11, "clips": 16, "users": 4, "size": 581939, "checksum": "ddc6a88a2fd6a31eaec09eca5f0c3f78bef626d3b388b4b34fac82da09545990", "avgDurationSecs": 4.337, "validDurationSecs": 38.13, "totalHrs": 0.03, "validHrs": 0.01 }, "he": { "duration": 162504, "reportedSentences": 2, "validatedSentences": 224, "unvalidatedSentences": -222, "clips": 27, "users": 4, "size": 899148, "checksum": "a255f29b8ab646eb93b18e06e0f6c3bef788d3521add36aef1e98bcc770ab3cb", "avgDurationSecs": 4.468, "validDurationSecs": 614.05, "totalHrs": 0.04, "validHrs": 0.17 }, "hi": { "duration": 1323540, "reportedSentences": 7, "validatedSentences": 62, "unvalidatedSentences": -4, "clips": 235, "users": 14, "size": 7753433, "checksum": "648965874a0f5089afd2f01acab8e267a6848b4d0ae620691c858afcd195d9c6", "avgDurationSecs": 4.929, "validDurationSecs": 2182.91, "totalHrs": 0.37, "validHrs": 0.6 }, "hsb": { "duration": 712404, "reportedSentences": 43, "validatedSentences": 5, "unvalidatedSentences": -4, "clips": 108, "users": 2, "size": 4220270, "checksum": "eefae8b0d6f562d94ce93128ba8692b584a3ddd953548f76e7878c441564d1d5", "avgDurationSecs": 6.185, "validDurationSecs": 110.05, "totalHrs": 0.19, "validHrs": 0.03 }, "ht": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 185, "checksum": "f6f86372f3671626bb5cd3678123dbbe536fc8d46570071fcbd8a2c40f14a1ee", "avgDurationSecs": 4.133, "validDurationSecs": 16.531, "totalHrs": 0, "validHrs": 0 }, "hu": { "duration": 7552080, "reportedSentences": 2, "validatedSentences": 1, "unvalidatedSentences": 0, "clips": 1281, "users": 5, "size": 
44436020, "checksum": "0c2c5133c2c875238efd34be1b22dacbe62489d6fc88bd4a4d713d5d36419a9c", "avgDurationSecs": 5.53, "validDurationSecs": 3016.28, "totalHrs": 2.1, "validHrs": 0.84 }, "hy-AM": { "duration": 1581408, "reportedSentences": 0, "validatedSentences": -56, "unvalidatedSentences": -58, "clips": 277, "users": 8, "size": 9074232, "checksum": "510a36a932fec32829a192ef6512c7759f9fe67faf4814b5a05bf7780cb20e7a", "avgDurationSecs": 5.415, "validDurationSecs": 3438.99, "totalHrs": 0.44, "validHrs": 0.95 }, "ia": { "duration": 39708, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 4, "clips": 5, "users": 1, "size": 138479, "checksum": "606fc8a7a07468aff48aee436eeb3f6d1862d9733a5742ac09440475fef076a0", "avgDurationSecs": 4.192, "validDurationSecs": 111.52, "totalHrs": 0.01, "validHrs": 0.03 }, "id": { "duration": 1262196, "reportedSentences": 4, "validatedSentences": 130, "unvalidatedSentences": -111, "clips": 263, "users": 19, "size": 7125534, "checksum": "220891c9228c1b9ce9d46b37b1bf142e7a0b82cb0d054c44ca7a1a189205b38c", "avgDurationSecs": 3.994, "validDurationSecs": 7714.82, "totalHrs": 0.35, "validHrs": 2.14 }, "ig": { "duration": 28548, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 6, "users": 2, "size": 176236, "checksum": "cf8eae99a5ffd294398f708ab2fb74811457ac747c2f38173abf88eebf665d30", "avgDurationSecs": 5.461, "validDurationSecs": 5.45, "totalHrs": 0.01, "validHrs": 0 }, "is": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 104, "checksum": "c0c9723d50d635eedb284eecae1a3a9c2f3929d398a3078cb78c262b744920a3", "avgDurationSecs": 7.101, "validDurationSecs": 7.1, "totalHrs": 0, "validHrs": 0 }, "it": { "duration": 22535688, "reportedSentences": 4, "validatedSentences": 318, "unvalidatedSentences": -278, "clips": 3793, "users": 44, "size": 132379177, "checksum": 
"4bcec1c73da03e73d1ffec3717d568ce7d78703e1138c4b7a9b8714b383d39a9", "avgDurationSecs": 5.405, "validDurationSecs": 6664.03, "totalHrs": 6.26, "validHrs": 1.85 }, "ja": { "duration": 1150524, "reportedSentences": 14, "validatedSentences": 2, "unvalidatedSentences": -2, "clips": 293, "users": 21, "size": 6501044, "checksum": "fb0c2ff889aad458e986ace5466cb7a73072741a38bb4c4bdc994e61746e68f4", "avgDurationSecs": 4.81, "validDurationSecs": 947.49, "totalHrs": 0.32, "validHrs": 0.26 }, "ka": { "duration": 238222476, "reportedSentences": 137, "validatedSentences": 880, "unvalidatedSentences": -23, "clips": 44480, "users": 673, "size": 1377630187, "checksum": "47635132f7713b2d86bfe755115c1634b160864e845671e6abb22adfd8b9ac43", "avgDurationSecs": 5.189, "validDurationSecs": 56495.75, "totalHrs": 66.17, "validHrs": 15.69 }, "kab": { "duration": 2588112, "reportedSentences": 80, "validatedSentences": 0, "unvalidatedSentences": 5, "clips": 715, "users": 6, "size": 13914584, "checksum": "1c168895b4e70296d98220ed16be876d7a5ad623422d8f715f21ba92f686e559", "avgDurationSecs": 3.339, "validDurationSecs": 3039.69, "totalHrs": 0.71, "validHrs": 0.84 }, "kk": { "duration": 14400, "reportedSentences": 0, "validatedSentences": 3, "unvalidatedSentences": -3, "clips": 2, "users": 1, "size": 85571, "checksum": "6b27afb80b6a96b6c4e5d7c2777ede87745b2f6af700c10a953e40746f910212", "avgDurationSecs": 4.921, "validDurationSecs": 37.27, "totalHrs": 0, "validHrs": 0.02 }, "kmr": { "duration": 1291068, "reportedSentences": 5, "validatedSentences": 3, "unvalidatedSentences": -1, "clips": 217, "users": 6, "size": 7445896, "checksum": "4f86959d7540278b323f0f9ef6d3fc27a3e5665bf13fcdf5784cb20d7243426c", "avgDurationSecs": 4.073, "validDurationSecs": 935.57, "totalHrs": 0.36, "validHrs": 0.26 }, "ko": { "duration": 1797876, "reportedSentences": 3, "validatedSentences": 75, "unvalidatedSentences": -75, "clips": 376, "users": 14, "size": 10488130, "checksum": 
"459126ad7b23a86a9dcfd42f5d5f9e6ae132d2107021028db72a679b585a7c5a", "avgDurationSecs": 5.828, "validDurationSecs": 495.76, "totalHrs": 0.5, "validHrs": 0.14 }, "ky": { "duration": 286416, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 75, "users": 4, "size": 1669317, "checksum": "5f271683e8df0ee0d2bc52f733d9c48b076b712c75ad6e455d161f6be8309113", "avgDurationSecs": 4.552, "validDurationSecs": 1.12, "totalHrs": 0.08, "validHrs": 0 }, "lg": { "duration": 25740, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 5, "users": 1, "size": 138428, "checksum": "abdf91598f56b50f11a312489b605df925e81bcbd30d151f711f024cd0a026bc", "avgDurationSecs": 5.784, "validDurationSecs": 9.08, "totalHrs": 0, "validHrs": 0 }, "lij": { "duration": 848232, "reportedSentences": 335, "validatedSentences": 85, "unvalidatedSentences": -10, "clips": 237, "users": 6, "size": 4968438, "checksum": "e7bdfdac6b1d81554863ed829ca03fbd7d805bc71c4129c1ba3575c6ebef1ceb", "avgDurationSecs": 3.626, "validDurationSecs": 868.01, "totalHrs": 0.23, "validHrs": 0.24 }, "lo": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -272, "checksum": "13c9d8fb87cedacb334d8f67995e5f54fbc308c8cb9019614b8fbbe6982beac7", "avgDurationSecs": 6.453, "validDurationSecs": 6.45, "totalHrs": 0, "validHrs": 0 }, "lt": { "duration": 10688904, "reportedSentences": 0, "validatedSentences": 27, "unvalidatedSentences": -27, "clips": 2078, "users": 15, "size": 62312740, "checksum": "fea7723995eadd7a1cdb89d2753f072f15a69b670620ec3917917b374f0177c4", "avgDurationSecs": 5.132, "validDurationSecs": 1962.98, "totalHrs": 2.97, "validHrs": 0.55 }, "ltg": { "duration": 12048120, "reportedSentences": 9, "validatedSentences": 1938, "unvalidatedSentences": 4110, "clips": 1922, "users": 71, "size": 68249051, "checksum": "204b3db5723997b2d8ac5fbcad343a85082c33f8b69c64f9e1fb7b38d8bd4c0f", "avgDurationSecs": 
4.506, "validDurationSecs": 13490.44, "totalHrs": 3.35, "validHrs": 3.75 }, "lv": { "duration": 56393100, "reportedSentences": 4, "validatedSentences": 3193, "unvalidatedSentences": -2211, "clips": 8367, "users": 374, "size": 321605704, "checksum": "1717cc3459e8831c5c1766cb3da38eacbbd01029d675b44a54cfdb60965dc557", "avgDurationSecs": 4.744, "validDurationSecs": 75215.44, "totalHrs": 15.67, "validHrs": 20.9 }, "mdf": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 407, "checksum": "41226dad3ac926f78dd7a5ed983ae5b506bb8f7d449df5da409b24cb5292fb28", "avgDurationSecs": 5.25, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "mhr": { "duration": 717120, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 135, "users": 0, "size": 3974399, "checksum": "3e5464e78e17037bc524981c58f53f38c73ae48235e88490385a6f73fa1c15e0", "avgDurationSecs": 4.623, "validDurationSecs": 1173.01, "totalHrs": 0.2, "validHrs": 0.33 }, "mk": { "duration": 160128, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 4, "clips": 30, "users": 0, "size": 936159, "checksum": "f44bd86f316e511f7f16ba905349465c05cd4aa8212181264c75bd64039bcffc", "avgDurationSecs": 4.328, "validDurationSecs": 2256.7, "totalHrs": 0.05, "validHrs": 0.63 }, "ml": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 1, "clips": 0, "users": 0, "size": -5034, "checksum": "ef4a05e53f4446d54aeb7d9af17dadaf2db40d1f570efc42dd961571ef146ac5", "avgDurationSecs": 4.178, "validDurationSecs": 208.88, "totalHrs": 0, "validHrs": 0.06 }, "mn": { "duration": 422483076, "reportedSentences": 11, "validatedSentences": -15, "unvalidatedSentences": 5, "clips": 80906, "users": 43, "size": 2469309710, "checksum": "f06a9c3abb2c63410d36edb819743dea7a4daac7860218b0ce97d4b08d428213", "avgDurationSecs": 5.253, "validDurationSecs": 126666.48, "totalHrs": 117.36, "validHrs": 35.18 }, 
"mr": { "duration": 300924, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 51, "users": 3, "size": 1438525, "checksum": "3e66c479e904d75d86519b5baeb050d98f101608b3e3ab7e9fc667daa2f38731", "avgDurationSecs": 6.193, "validDurationSecs": 51.74, "totalHrs": 0.09, "validHrs": 0.02 }, "mrj": { "duration": 187632, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 30, "users": 0, "size": 1082442, "checksum": "6bbf414603d4451b053f799940fb83e44334e46d8ea1b96dc8218aedde735b4b", "avgDurationSecs": 4.189, "validDurationSecs": 69.24, "totalHrs": 0.05, "validHrs": 0.02 }, "mt": { "duration": 796068, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 129, "users": 3, "size": 4714034, "checksum": "aeb4e78c6e5a073bccf36c908ce34b229478a4d13f8f8888a2a52efabcb7bb63", "avgDurationSecs": 4.754, "validDurationSecs": 589.24, "totalHrs": 0.22, "validHrs": 0.16 }, "myv": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -2301, "checksum": "49a232fd0729da121c1c9f3eb48777670f4744b325d65c09164a1790c7a4913a", "avgDurationSecs": 5.774, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "nan-tw": { "duration": 3977640, "reportedSentences": 2, "validatedSentences": 8, "unvalidatedSentences": -5, "clips": 1293, "users": 9, "size": 22864700, "checksum": "130452ea1604ddbc048b2872c8a1b8d4bf5043e40df50f818e5607bf637741a4", "avgDurationSecs": 2.62, "validDurationSecs": 3112.01, "totalHrs": 1.1, "validHrs": 0.86 }, "ne-NP": { "duration": 361944, "reportedSentences": 0, "validatedSentences": 37, "unvalidatedSentences": -36, "clips": 78, "users": 10, "size": 2129116, "checksum": "f143a3e117f4b536cb95930d3c0b01c930dfd54a21eaea1a26505211dbdccb18", "avgDurationSecs": 3.997, "validDurationSecs": 146.48, "totalHrs": 0.1, "validHrs": 0.04 }, "nhi": { "duration": 45900, "reportedSentences": 0, "validatedSentences": 0, 
"unvalidatedSentences": 0, "clips": 10, "users": 1, "size": 269219, "checksum": "e1d2fc2afe1ef116b68450171e27ec5b38ed53e49602b19aaa86e53e50b9af4f", "avgDurationSecs": 4.682, "validDurationSecs": -0.7, "totalHrs": 0.01, "validHrs": 0 }, "nl": { "duration": 2863692, "reportedSentences": 0, "validatedSentences": 38, "unvalidatedSentences": -15, "clips": 615, "users": 10, "size": 16892020, "checksum": "207e6cd7b199eb63758ac51b7da2fc7a9f4674014cd80aab24dc583684800232", "avgDurationSecs": 4.36, "validDurationSecs": 2377.42, "totalHrs": 0.8, "validHrs": 0.66 }, "nn-NO": { "duration": 0, "reportedSentences": 0, "validatedSentences": 4, "unvalidatedSentences": -4, "clips": 0, "users": 0, "size": -169, "checksum": "fd3b2fcab8b4063df89c43d62a0cdcd12c94a1d833917cb6ddb35933fb923047", "avgDurationSecs": 4.38, "validDurationSecs": 35.04, "totalHrs": 0, "validHrs": 0.01 }, "nso": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 236, "checksum": "deb504c75264141b937fe8eec2e4edc820bbd7538e804694a31cd86573f44e2e", "avgDurationSecs": 4.63, "validDurationSecs": 9.259, "totalHrs": 0, "validHrs": 0 }, "oc": { "duration": 364212, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 75, "users": 1, "size": 2143621, "checksum": "f87001cc29b5c7491f712bb41cfe2f612e73993ae9d982727e76fc5520c6a9ae", "avgDurationSecs": 4.872, "validDurationSecs": 667.22, "totalHrs": 0.1, "validHrs": 0.19 }, "or": { "duration": 112068, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 1, "clips": 20, "users": 1, "size": 656247, "checksum": "af92de040ec30c2d5c8b82c83cc4f25745bb9a247acc9a0330ddfebfa45f8b00", "avgDurationSecs": 5.015, "validDurationSecs": 44.26, "totalHrs": 0.03, "validHrs": 0.01 }, "os": { "duration": 469476, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 2, "clips": 104, "users": 3, "size": 2707925, "checksum": 
"f5837cb66e61edf95def1ae14c10d715f75775f1804c6e199adba989898dd4d0", "avgDurationSecs": 5.232, "validDurationSecs": -31.41, "totalHrs": 0.13, "validHrs": -0.01 }, "pa-IN": { "duration": 463248, "reportedSentences": 0, "validatedSentences": 6, "unvalidatedSentences": -5, "clips": 97, "users": 2, "size": 2727123, "checksum": "b4e25102c68280c05e78db807e0ae23c46ed15dbd7a137414f602851ceceec59", "avgDurationSecs": 4.826, "validDurationSecs": 1006.17, "totalHrs": 0.13, "validHrs": 0.28 }, "pl": { "duration": 5260428, "reportedSentences": 3, "validatedSentences": 78, "unvalidatedSentences": -74, "clips": 981, "users": 22, "size": 30964131, "checksum": "e9dc79f31d68e5a7586c02dda1e7b3ad6f40781ac58de8ac109dc62bb490e0b9", "avgDurationSecs": 4.53, "validDurationSecs": 2876.04, "totalHrs": 1.46, "validHrs": 0.8 }, "ps": { "duration": 265775472, "reportedSentences": 826, "validatedSentences": 4559, "unvalidatedSentences": 645, "clips": 53466, "users": 962, "size": 1463560374, "checksum": "d48a5aa53661bcc337ee463357bc9e874b9b527674ba42e00c4bdf77b2436449", "avgDurationSecs": 4.971, "validDurationSecs": 59829.5, "totalHrs": 73.83, "validHrs": 16.62 }, "pt": { "duration": 10374840, "reportedSentences": 19, "validatedSentences": 45, "unvalidatedSentences": 13, "clips": 1994, "users": 63, "size": 60867578, "checksum": "8682e4e996e13943c27dc4d0356e195a58dd41c1b1cc241ec9f8e5019178c27e", "avgDurationSecs": 4.119, "validDurationSecs": 5993.94, "totalHrs": 2.88, "validHrs": 1.66 }, "quy": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 19, "checksum": "19f3e331093943469d4191d7718f2a9fd8d650ed32f8014cdd56ee261688e865", "avgDurationSecs": 6.132, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "rm-sursilv": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 3553, "checksum": 
"d3d3c0e600296f7e4a9d4f32dc98e7ecccb3a6b499c7b84937bd757de2795347", "avgDurationSecs": 5.464, "validDurationSecs": 10.93, "totalHrs": 0, "validHrs": 0 }, "rm-vallader": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 341, "checksum": "faedc40f8353a39a6a9bd7ff5048bf0c8f173a62827e49d954d32a020538a4f2", "avgDurationSecs": 5.826, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ro": { "duration": 144648, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 1, "clips": 30, "users": 4, "size": 847236, "checksum": "edc7289111c4fbbec32e87dc4fac186e2d7ece67a8c24e5732f5972813d31800", "avgDurationSecs": 4.031, "validDurationSecs": 296.27, "totalHrs": 0.04, "validHrs": 0.09 }, "ru": { "duration": 8550612, "reportedSentences": 24, "validatedSentences": 182, "unvalidatedSentences": 782, "clips": 1600, "users": 35, "size": 50305591, "checksum": "2fb52e1f3f8b74af2041ffcfd99803d4f4d6bbe012deff6587818f3b8a34deb5", "avgDurationSecs": 5.168, "validDurationSecs": 9569.33, "totalHrs": 2.37, "validHrs": 2.66 }, "rw": { "duration": 121572, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 23, "users": 5, "size": 1600393, "checksum": "c32fa590ff0281dc9e4c39ec847c228ffadfe91829fa694f59eaa213f84929c7", "avgDurationSecs": 5.008, "validDurationSecs": 120.54, "totalHrs": 0.03, "validHrs": 0.04 }, "sah": { "duration": 234036, "reportedSentences": 0, "validatedSentences": -38, "unvalidatedSentences": -5, "clips": 35, "users": 3, "size": 1217906, "checksum": "031fc812b9b286620a13ec99fca2f84d3f4e6c47f79cd99fdc9307b3c4747beb", "avgDurationSecs": 6.137, "validDurationSecs": 116.98, "totalHrs": 0.06, "validHrs": 0.04 }, "sat": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -680, "checksum": "3665773b018c827a52af8cd62abbef27e37ddbac89f2ab55d346d9cfd72de8bc", "avgDurationSecs": 4.462, 
"validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "sc": { "duration": 264528, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 40, "users": 3, "size": 1521208, "checksum": "be0824b225751fead4c56c467fa79c25896769318b4a9f7d55bded09e6e197f5", "avgDurationSecs": 4.56, "validDurationSecs": 446.36, "totalHrs": 0.08, "validHrs": 0.12 }, "sk": { "duration": 634284, "reportedSentences": 128, "validatedSentences": 8, "unvalidatedSentences": -8, "clips": 168, "users": 7, "size": 3721446, "checksum": "ccf1054a78b87fd55e174b9c4d25e82b9ffd17a0e2273303851e81db34d746ee", "avgDurationSecs": 4.076, "validDurationSecs": 402.79, "totalHrs": 0.17, "validHrs": 0.11 }, "skr": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -4481, "checksum": "9c104eb726e8cfe53eeb13db8d63b8bf1e6cdea3e82c509dba8700c2c75dbd69", "avgDurationSecs": 4.143, "validDurationSecs": 198.87, "totalHrs": 0, "validHrs": 0.06 }, "sl": { "duration": 1301976, "reportedSentences": 0, "validatedSentences": 82, "unvalidatedSentences": -80, "clips": 314, "users": 23, "size": 7624745, "checksum": "5c5e547959157e9c539bcef23199786e38c3f391639aaf6cfe82a0744251a8f4", "avgDurationSecs": 3.795, "validDurationSecs": 1503.11, "totalHrs": 0.36, "validHrs": 0.41 }, "sq": { "duration": 3909096, "reportedSentences": 0, "validatedSentences": 48905, "unvalidatedSentences": 1206, "clips": 678, "users": 29, "size": 25364039, "checksum": "e531383d26ed0e6df337f12ecc04c4c12120fe913cb564aa8a95b27ed61e18cc", "avgDurationSecs": 4.946, "validDurationSecs": 3827.71, "totalHrs": 1.09, "validHrs": 1.06 }, "sr": { "duration": 2125260, "reportedSentences": 2, "validatedSentences": 280, "unvalidatedSentences": -267, "clips": 515, "users": 6, "size": 12500821, "checksum": "85b572e256dfc767ae864dad7be93187773706a95657441ff101e56e6543e9aa", "avgDurationSecs": 3.092, "validDurationSecs": 3790.29, "totalHrs": 0.59, "validHrs": 1.05 }, "sv-SE": 
{ "duration": 777456, "reportedSentences": 0, "validatedSentences": 40, "unvalidatedSentences": -44, "clips": 159, "users": 15, "size": 4554570, "checksum": "bed7ed0c83ddf17219c28e74a6c26b993873d85fbdf0b0164351cc0d7cdfe70d", "avgDurationSecs": 4.01, "validDurationSecs": 1957.2, "totalHrs": 0.22, "validHrs": 0.54 }, "sw": { "duration": 367632, "reportedSentences": 0, "validatedSentences": -1313, "unvalidatedSentences": -17, "clips": 58, "users": 4, "size": 2274381, "checksum": "2950a7d13020bdfa7d5c487453bd8bbd4b04806d93bedf33980f65f82803023e", "avgDurationSecs": 5.386, "validDurationSecs": 203.48, "totalHrs": 0.11, "validHrs": 0.06 }, "ta": { "duration": 43923996, "reportedSentences": 73, "validatedSentences": 36, "unvalidatedSentences": -33, "clips": 7259, "users": 22, "size": 259390234, "checksum": "1aff6191ee3dc782536c749481e10570f26c57f46b67b9332d0423dbed7b1467", "avgDurationSecs": 6.181, "validDurationSecs": 2981.27, "totalHrs": 12.2, "validHrs": 0.82 }, "te": { "duration": 131292, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 26, "users": 6, "size": 772493, "checksum": "c127b52f863afc6e6acd45b67d8949c3c9e64e0ac7b608d8d8bb17151ea1caa9", "avgDurationSecs": 4.2, "validDurationSecs": 19.31, "totalHrs": 0.04, "validHrs": 0 }, "th": { "duration": 2668860, "reportedSentences": 27, "validatedSentences": 66, "unvalidatedSentences": -63, "clips": 613, "users": 14, "size": 15297537, "checksum": "061f0d5e21303fed93644aee8ec3020700331ada15790a9b7a0ad0970964e51d", "avgDurationSecs": 4.191, "validDurationSecs": 631.37, "totalHrs": 0.75, "validHrs": 0.18 }, "ti": { "duration": 1094184, "reportedSentences": 0, "validatedSentences": 1, "unvalidatedSentences": -1, "clips": 225, "users": 3, "size": 6426751, "checksum": "6d328ab9007ce37bc5c3e5cafcb30880e6e57ceaca896da6f34edd37b034164a", "avgDurationSecs": 5.009, "validDurationSecs": -0.95, "totalHrs": 0.3, "validHrs": 0 }, "tig": { "duration": 376596, "reportedSentences": 0, 
"validatedSentences": 0, "unvalidatedSentences": 0, "clips": 57, "users": 5, "size": 1676015, "checksum": "4442d7baeedacf29637059577892858bba543043a3638a72f5c241ccb441a097", "avgDurationSecs": 6.162, "validDurationSecs": 133.48, "totalHrs": 0.1, "validHrs": 0.04 }, "tk": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 9, "clips": 0, "users": 0, "size": -3384, "checksum": "1d90c64655782c1fd76bf34db648bf706ff49a851330a888f7e58c15561c76d6", "avgDurationSecs": 5.41, "validDurationSecs": 54.1, "totalHrs": 0, "validHrs": 0.02 }, "tok": { "duration": 1673640, "reportedSentences": 2, "validatedSentences": 52, "unvalidatedSentences": -44, "clips": 411, "users": 5, "size": 9816124, "checksum": "6e083175a5f27e9b4cc0742e41d4dd70e1d19a2b0ee18985c97f76c7b084da37", "avgDurationSecs": 4.17, "validDurationSecs": 634.03, "totalHrs": 0.46, "validHrs": 0.18 }, "tr": { "duration": 10447920, "reportedSentences": 9, "validatedSentences": 37, "unvalidatedSentences": -43, "clips": 1738, "users": 36, "size": 61768363, "checksum": "10ff35a4793d15f7543a01129a5fe5be5dd3a506c9a03d153bcc70304536da43", "avgDurationSecs": 3.735, "validDurationSecs": 9979.29, "totalHrs": 2.9, "validHrs": 2.77 }, "tt": { "duration": 793980, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 1, "clips": 149, "users": 7, "size": 4527856, "checksum": "447d884ba85e352a234a47ba7578588a5abb9f9b1149c70233e02af6d5cf85f9", "avgDurationSecs": 3.784, "validDurationSecs": 251.81, "totalHrs": 0.22, "validHrs": 0.07 }, "tw": { "duration": 91584, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 18, "clips": 15, "users": 2, "size": 542203, "checksum": "2631eeb8099af0c63146888aa987728811d6c422087acc3769c86e1f0e44967c", "avgDurationSecs": 4.152, "validDurationSecs": 146.27, "totalHrs": 0.02, "validHrs": 0.04 }, "ug": { "duration": 501794100, "reportedSentences": 589, "validatedSentences": 149727, "unvalidatedSentences": -2780, "clips": 94828, 
"users": 128, "size": 2824998964, "checksum": "1585f30bba2e01165cee458a1d0f2cd789ad2521f5093b5f10efbb0a79a9a388", "avgDurationSecs": 5.668, "validDurationSecs": 119569.35, "totalHrs": 139.39, "validHrs": 33.21 }, "uk": { "duration": 4340196, "reportedSentences": 3, "validatedSentences": 1, "unvalidatedSentences": 0, "clips": 988, "users": 16, "size": 25389250, "checksum": "c6773d52e058f45795bb5c4d450f9808262ef6faa043785375bc0280d86284b5", "avgDurationSecs": 4.644, "validDurationSecs": 2607.15, "totalHrs": 1.2, "validHrs": 0.72 }, "ur": { "duration": 128700, "reportedSentences": 0, "validatedSentences": 26, "unvalidatedSentences": -26, "clips": 21, "users": 5, "size": 767738, "checksum": "42a3177e953c347334170192e9634a426df9f4d809917fba5be8158b2f8ce2e3", "avgDurationSecs": 4.246, "validDurationSecs": 609.53, "totalHrs": 0.04, "validHrs": 0.17 }, "uz": { "duration": 2752092, "reportedSentences": 0, "validatedSentences": 24, "unvalidatedSentences": -21, "clips": 559, "users": 19, "size": 16266871, "checksum": "3e6d39e86b2067d2f59df815790a793b2657b38ae65cd017cb5e8ed9051199a8", "avgDurationSecs": 4.152, "validDurationSecs": 632.36, "totalHrs": 0.77, "validHrs": 0.18 }, "vi": { "duration": 1135368, "reportedSentences": 2, "validatedSentences": 16, "unvalidatedSentences": -12, "clips": 328, "users": 6, "size": 6630204, "checksum": "98e3acf4551035b9785f192f83db046bbae333d2ffa1fe01addff820c6900028", "avgDurationSecs": 3.953, "validDurationSecs": 394.17, "totalHrs": 0.31, "validHrs": 0.11 }, "vot": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -11, "checksum": "97f07f07365e619d4494245acd859cd3a933ac3338b0a2aee816f5887dc1d83f", "avgDurationSecs": 2.412, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "yi": { "duration": 1493172, "reportedSentences": 0, "validatedSentences": 25, "unvalidatedSentences": -22, "clips": 351, "users": 32, "size": 8125181, "checksum": 
"fbcc20909e126bb13804d7067abbe58fee7c8729232583a5856f360f2ee981d1", "avgDurationSecs": 4.2, "validDurationSecs": 1374.8, "totalHrs": 0.41, "validHrs": 0.39 }, "yo": { "duration": 262476, "reportedSentences": 2, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 35, "users": 5, "size": 1454153, "checksum": "c5a4138b115c67bf72f5338f7fcc7ae486959ac1211d0f4f19920eb4e80c6c50", "avgDurationSecs": 5.948, "validDurationSecs": 168.66, "totalHrs": 0.07, "validHrs": 0.05 }, "yue": { "duration": 58296744, "reportedSentences": 110, "validatedSentences": 7058, "unvalidatedSentences": -228, "clips": 14115, "users": 166, "size": 340176455, "checksum": "388a13379401d59d9364c929d8f51414f23a055d5d9f0af888bdb26b9d3ce0d6", "avgDurationSecs": 3.893, "validDurationSecs": 108733.07, "totalHrs": 16.19, "validHrs": 30.21 }, "zgh": { "duration": 640152, "reportedSentences": 1, "validatedSentences": 48, "unvalidatedSentences": -34, "clips": 171, "users": 2, "size": 3696528, "checksum": "f3350db48b467e7a9a09e7ff1b5317c67d5395aacffe0eac2d402922963690e9", "avgDurationSecs": 3.366, "validDurationSecs": 1658.54, "totalHrs": 0.18, "validHrs": 0.46 }, "zh-CN": { "duration": 8105760, "reportedSentences": 75, "validatedSentences": 70, "unvalidatedSentences": 1, "clips": 1951, "users": 81, "size": 46953936, "checksum": "6b5e4c14960f56b167300fa6ddc205bdfd13366dc570b2f5df8f1f532b7cfedd", "avgDurationSecs": 4.546, "validDurationSecs": 3614.26, "totalHrs": 2.25, "validHrs": 1 }, "zh-HK": { "duration": 3361572, "reportedSentences": 21, "validatedSentences": 282, "unvalidatedSentences": 0, "clips": 835, "users": 20, "size": 19467408, "checksum": "99f69a756c102242106a8b17634bb081b4b55a533726ba0ebf5561157b02ced2", "avgDurationSecs": 4.175, "validDurationSecs": 2324.74, "totalHrs": 0.94, "validHrs": 0.65 }, "zh-TW": { "duration": 5164884, "reportedSentences": 10, "validatedSentences": 24, "unvalidatedSentences": 1, "clips": 1555, "users": 18, "size": 30130004, "checksum": 
"6dc7254a37d2d1b23a1349811f702591c379a919ebc1ccebcce69c8eab93d813", "avgDurationSecs": 3.365, "validDurationSecs": 1493.22, "totalHrs": 1.43, "validHrs": 0.41 }, "zu": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 154, "checksum": "9c86c4efc924798ecec4dcce5c82761cd137d373802ce36b885036723403dcfb", "avgDurationSecs": 5.096, "validDurationSecs": 5.096, "totalHrs": 0, "validHrs": 0 }, "zza": { "duration": 2502828, "reportedSentences": 1136, "validatedSentences": 561, "unvalidatedSentences": 57, "clips": 514, "users": 7, "size": 14801191, "checksum": "29de34c777992d4474de5bacd0d4b603e044fb1d4ee26cfa1461afbc81b80d74", "avgDurationSecs": 4.438, "validDurationSecs": 2085.11, "totalHrs": 0.7, "validHrs": 0.58 } }, "totalDuration": 3408609216, "totalValidDurationSecs": 1926662, "totalHrs": 946, "totalValidHrs": 535 } ================================================ FILE: datasets/scripted-speech/cv-corpus-19.0-2024-09-13.json ================================================ { "locales": { "ab": { "buckets": { "dev": 9160, "test": 9117, "train": 21027, "validated": 42053, "invalidated": 5282, "other": 15703 }, "duration": 325886184, "reportedSentences": 225, "validatedSentences": 1046778, "unvalidatedSentences": 5, "clips": 63038, "splits": { "accent": {}, "age": { "": 0.19, "twenties": 0.17, "thirties": 0.17, "teens": 0.26, "fourties": 0.08, "fifties": 0.06, "sixties": 0.05, "seventies": 0.01, "eighties": 0.01, "nineties": 0 }, "gender": { "": 0.19, "male_masculine": 0.17, "female_feminine": 0.65, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 63038, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 411, "size": 
1938017123, "checksum": "83103a47fd2fa6309ba398f9ae3fbaa7c5ded32e22c0b38e19f947fec1be2049", "avgDurationSecs": 5.17, "validDurationSecs": 217400.484, "totalHrs": 90.52, "validHrs": 60.38 }, "af": { "buckets": { "dev": 68, "test": 67, "train": 109, "validated": 244, "invalidated": 193, "other": 216 }, "duration": 3667140, "reportedSentences": 0, "validatedSentences": 4743, "unvalidatedSentences": 16519, "clips": 653, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0.44, "thirties": 0, "teens": 0.01, "fourties": 0.11, "fifties": 0.22, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.35, "male_masculine": 0.47, "female_feminine": 0.18, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 653, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 34, "size": 23110158, "checksum": "ee6cd907a2272386e72c250a5a0bb5d76b9b73fe144384178264e91761f35c8a", "avgDurationSecs": 5.616, "validDurationSecs": 1370.264, "totalHrs": 1.01, "validHrs": 0.38 }, "am": { "buckets": { "dev": 248, "test": 222, "train": 481, "validated": 951, "invalidated": 25, "other": 577 }, "duration": 9662868, "reportedSentences": 11, "validatedSentences": 2195, "unvalidatedSentences": 23, "clips": 1553, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0.15, "thirties": 0.63, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.22, "male_masculine": 0.78, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1553, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, 
"language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 33, "size": 57008856, "checksum": "8dc2d5ba1a1174401b4193b04a6a649fce8205daa1bb1adaca803d3d0219b464", "avgDurationSecs": 6.222, "validDurationSecs": 5917.184, "totalHrs": 2.68, "validHrs": 1.64 }, "ar": { "buckets": { "dev": 10352, "test": 10474, "train": 28553, "validated": 78473, "invalidated": 15138, "other": 41624 }, "duration": 562536630, "reportedSentences": 2126, "validatedSentences": 59408, "unvalidatedSentences": 1742080, "clips": 135235, "splits": { "accent": {}, "age": { "": 0.55, "twenties": 0.3, "thirties": 0.11, "teens": 0.03, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.55, "male_masculine": 0.28, "female_feminine": 0.17, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 135235, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1588, "size": 3464042001, "checksum": "9647389b84a2b999158c463321a997c3b9852211d53eabdc11fa1ead0b895cea", "avgDurationSecs": 4.16, "validDurationSecs": 326423.906, "totalHrs": 156.26, "validHrs": 90.67 }, "as": { "buckets": { "dev": 516, "test": 552, "train": 671, "validated": 1739, "invalidated": 265, "other": 2094 }, "duration": 24205349, "reportedSentences": 182, "validatedSentences": 6167, "unvalidatedSentences": 3434, "clips": 4098, "splits": { "accent": {}, "age": { "": 0.32, "twenties": 0.18, "thirties": 0.02, "teens": 0, "fourties": 0.46, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.79, "male_masculine": 0.21, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 
0 }, "sentence_domain": { "": 4098, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 50, "size": 147799612, "checksum": "fccbeb34f997c54cf4224b078377e270c0c8997353e46b8aa2bcde35eff47065", "avgDurationSecs": 5.907, "validDurationSecs": 10271.621, "totalHrs": 6.72, "validHrs": 2.85 }, "ast": { "buckets": { "dev": 113, "test": 171, "train": 399, "validated": 685, "invalidated": 16, "other": 978 }, "duration": 7444944, "reportedSentences": 0, "validatedSentences": 1642, "unvalidatedSentences": 289, "clips": 1679, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.1, "thirties": 0.12, "teens": 0.04, "fourties": 0.28, "fifties": 0.21, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male_masculine": 0.6, "female_feminine": 0.15, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1679, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 30, "size": 42477616, "checksum": "ca8d749d5b772d76f63d31a5e748c7713a3e21c3a715f503f6e56efd424e7a0b", "avgDurationSecs": 4.434, "validDurationSecs": 3037.395, "totalHrs": 2.06, "validHrs": 0.84 }, "az": { "buckets": { "dev": 54, "test": 78, "train": 104, "validated": 236, "invalidated": 49, "other": 522 }, "duration": 4385880, "reportedSentences": 5, "validatedSentences": 93159, "unvalidatedSentences": 1851, "clips": 807, "splits": { "accent": {}, "age": { "": 0.21, "twenties": 0.13, "thirties": 0.66, "teens": 0.01, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, 
"nineties": 0 }, "gender": { "": 0.79, "male_masculine": 0.21, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 807, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 33, "size": 31968647, "checksum": "f7fd36de6e305ca83a7a8b1334d13709ec377ddaecaccb670806b4f9acbb1566", "avgDurationSecs": 5.435, "validDurationSecs": 1282.612, "totalHrs": 1.21, "validHrs": 0.35 }, "ba": { "buckets": { "dev": 14492, "test": 14584, "train": 119125, "validated": 209884, "invalidated": 7966, "other": 308 }, "duration": 965672496, "reportedSentences": 867, "validatedSentences": 153957, "unvalidatedSentences": 14, "clips": 218158, "splits": { "accent": {}, "age": { "": 0.3, "twenties": 0.17, "thirties": 0.17, "teens": 0.04, "fourties": 0.06, "fifties": 0.05, "sixties": 0.19, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male_masculine": 0.3, "female_feminine": 0.39, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 218158, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 922, "size": 5441374729, "checksum": "08a33cbb933b524e38807a821109c0c5f7d4166277ff781489b6d897a508d801", "avgDurationSecs": 4.426, "validDurationSecs": 929047.783, "totalHrs": 268.24, "validHrs": 258.06 }, "bas": { "buckets": { "dev": 457, "test": 582, "train": 763, "validated": 1802, "invalidated": 525, "other": 95 }, "duration": 10705176, "reportedSentences": 8, "validatedSentences": 5015, "unvalidatedSentences": 245, "clips": 
2422, "splits": { "accent": {}, "age": { "": 0.98, "twenties": 0.01, "thirties": 0, "teens": 0.01, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.98, "male_masculine": 0.01, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2422, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 42, "size": 60079628, "checksum": "1f144860a7965c7efac261f4b6a5d23f208e72c22c4638f70af91d86efb376b1", "avgDurationSecs": 4.42, "validDurationSecs": 7964.792, "totalHrs": 2.97, "validHrs": 2.21 }, "be": { "buckets": { "dev": 15877, "test": 15878, "train": 347670, "validated": 1345234, "invalidated": 36545, "other": 8939 }, "duration": 6644825568, "reportedSentences": 3151, "validatedSentences": 379525, "unvalidatedSentences": 11, "clips": 1390718, "splits": { "accent": {}, "age": { "": 0.87, "twenties": 0.03, "thirties": 0.04, "teens": 0.01, "fourties": 0.04, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.87, "male_masculine": 0.06, "female_feminine": 0.07, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1390664, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 3, "general": 33, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 4, "news_current_affairs": 7, "technology_robotics": 7 } }, "users": 8444, "size": 37873861521, "checksum": "25ad9959b67aa92bdc773c6d21af975f9b52528ee96b9e2654725555a65ed8b7", "avgDurationSecs": 4.778, "validDurationSecs": 6427503.835, "totalHrs": 1845.78, "validHrs": 1785.41 }, "bg": { "buckets": { "dev": 2874, "test": 
3292, "train": 4919, "validated": 11101, "invalidated": 774, "other": 1899 }, "duration": 75376368, "reportedSentences": 188, "validatedSentences": 39570, "unvalidatedSentences": 3, "clips": 13774, "splits": { "accent": {}, "age": { "": 0.37, "twenties": 0.11, "thirties": 0.28, "teens": 0, "fourties": 0.23, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male_masculine": 0.59, "female_feminine": 0.04, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 13774, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 140, "size": 444171641, "checksum": "0e27fa2838331c27b5a5aeb1db155af655f63135a39742da1e92f6e8e1535b48", "avgDurationSecs": 5.472, "validDurationSecs": 60748.734, "totalHrs": 20.93, "validHrs": 16.87 }, "bn": { "buckets": { "dev": 9343, "test": 9343, "train": 21308, "validated": 44244, "invalidated": 7845, "other": 997721 }, "duration": 4584456144, "reportedSentences": 1889, "validatedSentences": 562743, "unvalidatedSentences": 94192, "clips": 1049810, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0.67, "thirties": 0.03, "teens": 0.07, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.22, "male_masculine": 0.54, "female_feminine": 0.23, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1049810, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 22931, "size": 26491800264, "checksum": 
"93f5537c1cce3a80baf97251d518225091a0b49d4f68540f906066618f75ab85", "avgDurationSecs": 4.367, "validDurationSecs": 193210.845, "totalHrs": 1273.46, "validHrs": 53.66 }, "br": { "buckets": { "dev": 2320, "test": 2356, "train": 2788, "validated": 28030, "invalidated": 1719, "other": 1302 }, "duration": 98928259, "reportedSentences": 293, "validatedSentences": 7887, "unvalidatedSentences": 199, "clips": 31051, "splits": { "accent": {}, "age": { "": 0.29, "twenties": 0.32, "thirties": 0.09, "teens": 0.01, "fourties": 0.06, "fifties": 0.06, "sixties": 0.13, "seventies": 0.04, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male_masculine": 0.66, "female_feminine": 0.05, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 31003, "agriculture_food": 1, "automotive_transport": 0, "finance": 0, "service_retail": 2, "general": 41, "healthcare": 1, "history_law_government": 2, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 1, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 209, "size": 688031626, "checksum": "994f854aaa691a0d6b6f408c27f3f5c140e411c8a11d936354fd7ccc91c72105", "avgDurationSecs": 3.186, "validDurationSecs": 89303.375, "totalHrs": 27.48, "validHrs": 24.8 }, "ca": { "buckets": { "dev": 16407, "test": 16407, "train": 1168790, "validated": 2076099, "invalidated": 127023, "other": 331027 }, "duration": 13360177782, "reportedSentences": 6811, "validatedSentences": 1288879, "unvalidatedSentences": 485, "clips": 2534149, "splits": { "accent": {}, "age": { "": 0.28, "twenties": 0.04, "thirties": 0.05, "teens": 0.01, "fourties": 0.12, "fifties": 0.18, "sixties": 0.29, "seventies": 0.03, "eighties": 0, "nineties": 0 }, "gender": { "": 0.28, "male_masculine": 0.52, "female_feminine": 0.2, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2533554, "agriculture_food": 12, "automotive_transport": 3, "finance": 0, "service_retail": 59, "general": 344, 
"healthcare": 19, "history_law_government": 32, "language_fundamentals": 15, "media_entertainment": 24, "nature_environment": 65, "news_current_affairs": 16, "technology_robotics": 14 } }, "users": 36145, "size": 80430502768, "checksum": "c1e22ca2739346039c414d548382141fbdb680cc2cc22860a5a5f75c667fa547", "avgDurationSecs": 5.272, "validDurationSecs": 10945312.108, "totalHrs": 3711.16, "validHrs": 3040.36 }, "ckb": { "buckets": { "dev": 5309, "test": 5309, "train": 7797, "validated": 118145, "invalidated": 9173, "other": 38933 }, "duration": 678012768, "reportedSentences": 2523, "validatedSentences": 18847, "unvalidatedSentences": 15575, "clips": 166251, "splits": { "accent": {}, "age": { "": 0.32, "twenties": 0.43, "thirties": 0.1, "teens": 0.02, "fourties": 0.02, "fifties": 0.01, "sixties": 0.1, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.32, "male_masculine": 0.63, "female_feminine": 0.05, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 166048, "agriculture_food": 0, "automotive_transport": 10, "finance": 0, "service_retail": 0, "general": 103, "healthcare": 0, "history_law_government": 10, "language_fundamentals": 10, "media_entertainment": 70, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1827, "size": 3744991755, "checksum": "054f41a5df99e9b8f08d642d0bbe2719d1dff7b3a6a548a7de2fca1223e5e7df", "avgDurationSecs": 4.078, "validDurationSecs": 481824.581, "totalHrs": 188.33, "validHrs": 133.84 }, "cnh": { "buckets": { "dev": 762, "test": 764, "train": 818, "validated": 2461, "invalidated": 437, "other": 3289 }, "duration": 21751656, "reportedSentences": 9, "validatedSentences": 5218, "unvalidatedSentences": -1, "clips": 6187, "splits": { "accent": {}, "age": { "": 0.48, "twenties": 0.4, "thirties": 0.07, "teens": 0.01, "fourties": 0.01, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.48, "male_masculine": 0.3, 
"female_feminine": 0.21, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6187, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 300, "size": 168180967, "checksum": "f2099a17273e9a66684c05d968381a5cb83fdd6d2df72c07e797e881ade45d98", "avgDurationSecs": 3.516, "validDurationSecs": 8652.146, "totalHrs": 6.04, "validHrs": 2.4 }, "cs": { "buckets": { "dev": 9191, "test": 9145, "train": 20597, "validated": 62377, "invalidated": 2313, "other": 149450 }, "duration": 955729623, "reportedSentences": 965, "validatedSentences": 451054, "unvalidatedSentences": 33, "clips": 214140, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.18, "thirties": 0.47, "teens": 0.02, "fourties": 0.07, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.24, "male_masculine": 0.54, "female_feminine": 0.22, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 214131, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 5, "healthcare": 0, "history_law_government": 1, "language_fundamentals": 0, "media_entertainment": 3, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 4 } }, "users": 1023, "size": 5884034872, "checksum": "0f06fbcd98c507e6f3d47645f419a6700388d2e86efe773af85f7f3830e6e832", "avgDurationSecs": 4.463, "validDurationSecs": 278395.193, "totalHrs": 265.48, "validHrs": 77.33 }, "cv": { "buckets": { "dev": 1246, "test": 1288, "train": 1452, "validated": 17428, "invalidated": 2200, "other": 697 }, "duration": 102493776, "reportedSentences": 144, "validatedSentences": 4380, "unvalidatedSentences": 6, "clips": 20325, "splits": { "accent": {}, "age": { "": 0.24, 
"twenties": 0.5, "thirties": 0.01, "teens": 0.18, "fourties": 0.06, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.22, "male_masculine": 0.52, "female_feminine": 0.27, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 20325, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 115, "size": 694137926, "checksum": "618499b806bb710003c645a9d00d922153adbc5932d966951a9767eb48d92bb3", "avgDurationSecs": 5.043, "validDurationSecs": 87884.946, "totalHrs": 28.47, "validHrs": 24.41 }, "cy": { "buckets": { "dev": 5391, "test": 5396, "train": 7991, "validated": 90631, "invalidated": 4459, "other": 20601 }, "duration": 568275417, "reportedSentences": 176, "validatedSentences": 118032, "unvalidatedSentences": 25, "clips": 115691, "splits": { "accent": {}, "age": { "": 0.42, "twenties": 0.13, "thirties": 0.09, "teens": 0.02, "fourties": 0.16, "fifties": 0.09, "sixties": 0.07, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.43, "male_masculine": 0.33, "female_feminine": 0.25, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 115691, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1898, "size": 4146988719, "checksum": "6943b7d5edd960895931beaf2354c2e8d05be4ab3cf7b6518a2ebbd33009bd77", "avgDurationSecs": 4.912, "validDurationSecs": 445180.432, "totalHrs": 157.85, "validHrs": 123.66 }, "da": { "buckets": { "dev": 2222, "test": 2516, "train": 3556, "validated": 10455, 
"invalidated": 417, "other": 239 }, "duration": 45609084, "reportedSentences": 426, "validatedSentences": 20467, "unvalidatedSentences": 10, "clips": 11111, "splits": { "accent": {}, "age": { "": 0.32, "twenties": 0.18, "thirties": 0.25, "teens": 0.01, "fourties": 0.2, "fifties": 0.03, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0.6, "female_feminine": 0.08, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 11111, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 270, "size": 263865846, "checksum": "10e28bc5831bd2cec70ecfd43cd1f2d078456a7b7e6ad58aa31e3714b837a443", "avgDurationSecs": 4.105, "validDurationSecs": 42916.297, "totalHrs": 12.66, "validHrs": 11.92 }, "dav": { "buckets": { "dev": 1136, "test": 878, "train": 1928, "validated": 7247, "invalidated": 661, "other": 41686 }, "duration": 201268188, "reportedSentences": 11, "validatedSentences": 11648, "unvalidatedSentences": 20173, "clips": 49594, "splits": { "accent": {}, "age": { "": 0.04, "twenties": 0.61, "thirties": 0.1, "teens": 0, "fourties": 0.01, "fifties": 0.01, "sixties": 0.23, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.73, "male_masculine": 0, "female_feminine": 0.27, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 970, "agriculture_food": 499, "automotive_transport": 100, "finance": 303, "service_retail": 1218, "general": 45289, "healthcare": 966, "history_law_government": 333, "language_fundamentals": 150, "media_entertainment": 68, "nature_environment": 871, "news_current_affairs": 86, "technology_robotics": 75 } }, "users": 24, "size": 1083164228, "checksum": 
"176a747610235c249c0c97898e30fdd8decce8b8b763e658769da7300e98d32a", "avgDurationSecs": 4.058, "validDurationSecs": 29410.625, "totalHrs": 55.9, "validHrs": 8.16 }, "de": { "buckets": { "dev": 16188, "test": 16188, "train": 595998, "validated": 924225, "invalidated": 55296, "other": 7242 }, "duration": 5168107097, "reportedSentences": 10906, "validatedSentences": 2056872, "unvalidatedSentences": 189, "clips": 986763, "splits": { "accent": {}, "age": { "": 0.32, "twenties": 0.18, "thirties": 0.16, "teens": 0.03, "fourties": 0.17, "fifties": 0.11, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0.59, "female_feminine": 0.08, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 986602, "agriculture_food": 0, "automotive_transport": 2, "finance": 0, "service_retail": 10, "general": 123, "healthcare": 10, "history_law_government": 10, "language_fundamentals": 3, "media_entertainment": 0, "nature_environment": 9, "news_current_affairs": 1, "technology_robotics": 5 } }, "users": 19225, "size": 36080413393, "checksum": "366c90d5a634c6406b8b78f54833ec8ed8618d5c7bc114601f15f3d8584ed555", "avgDurationSecs": 5.237, "validDurationSecs": 4840568.385, "totalHrs": 1435.58, "validHrs": 1344.6 }, "dv": { "buckets": { "dev": 2204, "test": 2237, "train": 2677, "validated": 26292, "invalidated": 1639, "other": 15131 }, "duration": 220322583, "reportedSentences": 59, "validatedSentences": 7162, "unvalidatedSentences": 15, "clips": 43062, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.11, "thirties": 0.39, "teens": 0.01, "fourties": 0.23, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.24, "male_masculine": 0.24, "female_feminine": 0.53, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 43062, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, 
"healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 376, "size": 1401685091, "checksum": "1df473e457b82f861cd2b79223a84a738520ca2da7bc847beeb1fe247c1eb3ac", "avgDurationSecs": 5.116, "validDurationSecs": 134520.49, "totalHrs": 61.2, "validHrs": 37.36 }, "dyu": { "buckets": { "dev": 50, "test": 59, "train": 78, "validated": 195, "invalidated": 13, "other": 80 }, "duration": 1786860, "reportedSentences": 0, "validatedSentences": 5028, "unvalidatedSentences": 41, "clips": 288, "splits": { "accent": {}, "age": { "": 0.31, "twenties": 0.67, "thirties": 0, "teens": 0.02, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.31, "male_masculine": 0.39, "female_feminine": 0.31, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 288, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 34, "size": 10351680, "checksum": "2797d268f11f342ee2e781c76502b7f6b9a86d5511501bf1800dab9188036e1d", "avgDurationSecs": 6.204, "validDurationSecs": 1209.853, "totalHrs": 0.49, "validHrs": 0.33 }, "el": { "buckets": { "dev": 1699, "test": 1707, "train": 1919, "validated": 16407, "invalidated": 846, "other": 10236 }, "duration": 114039322, "reportedSentences": 70, "validatedSentences": 6696, "unvalidatedSentences": 4645, "clips": 27489, "splits": { "accent": {}, "age": { "": 0.31, "twenties": 0.12, "thirties": 0.36, "teens": 0.01, "fourties": 0.17, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0.64, "female_feminine": 0.03, "transgender": 0, "non-binary": 0, 
"do_not_wish_to_say": 0 }, "sentence_domain": { "": 27489, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 412, "size": 759535201, "checksum": "5257da4476db0a5d2440b193935a665b25a6156dca46cc05e947837540609b72", "avgDurationSecs": 4.149, "validDurationSecs": 68065.159, "totalHrs": 31.67, "validHrs": 18.9 }, "en": { "buckets": { "dev": 16396, "test": 16396, "train": 1116566, "validated": 1822496, "invalidated": 298826, "other": 337807 }, "duration": 12910015591, "reportedSentences": 9146, "validatedSentences": 1674115, "unvalidatedSentences": 2389, "clips": 2459129, "splits": { "accent": {}, "age": { "": 0.36, "twenties": 0.25, "thirties": 0.14, "teens": 0.06, "fourties": 0.09, "fifties": 0.05, "sixties": 0.04, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male_masculine": 0.45, "female_feminine": 0.17, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2458495, "agriculture_food": 12, "automotive_transport": 5, "finance": 39, "service_retail": 24, "general": 414, "healthcare": 16, "history_law_government": 30, "language_fundamentals": 8, "media_entertainment": 49, "nature_environment": 38, "news_current_affairs": 8, "technology_robotics": 46 } }, "users": 93896, "size": 90144298040, "checksum": "d7496f0d49c5db16ca46effd2775ce9820b1e360403a0915b228768be4f9bf35", "avgDurationSecs": 5.25, "validDurationSecs": 9567798.914, "totalHrs": 3586.11, "validHrs": 2657.72 }, "eo": { "buckets": { "dev": 14895, "test": 14901, "train": 144268, "validated": 851216, "invalidated": 127461, "other": 155456 }, "duration": 6884026279, "reportedSentences": 2323, "validatedSentences": 180700, "unvalidatedSentences": 22166, "clips": 1134133, "splits": { "accent": {}, "age": { "": 0.2, 
"twenties": 0.56, "thirties": 0.12, "teens": 0.05, "fourties": 0.04, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0.01, "nineties": 0 }, "gender": { "": 0.2, "male_masculine": 0.69, "female_feminine": 0.11, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1134130, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 3, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1796, "size": 41145397654, "checksum": "67097912176fd841589ff38dfeab6731aae4c9d604760704c6e7b28ae737453d", "avgDurationSecs": 6.07, "validDurationSecs": 5166760.259, "totalHrs": 1912.22, "validHrs": 1435.21 }, "es": { "buckets": { "dev": 15877, "test": 15877, "train": 346059, "validated": 424773, "invalidated": 92252, "other": 1135298 }, "duration": 8052116130, "reportedSentences": 2591, "validatedSentences": 1080869, "unvalidatedSentences": 5605, "clips": 1652323, "splits": { "accent": {}, "age": { "": 0.13, "twenties": 0.53, "thirties": 0.09, "teens": 0.08, "fourties": 0.03, "fifties": 0.04, "sixties": 0.1, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.13, "male_masculine": 0.55, "female_feminine": 0.32, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1652278, "agriculture_food": 0, "automotive_transport": 2, "finance": 4, "service_retail": 2, "general": 12, "healthcare": 4, "history_law_government": 2, "language_fundamentals": 6, "media_entertainment": 2, "nature_environment": 3, "news_current_affairs": 6, "technology_robotics": 12 } }, "users": 26150, "size": 50803986940, "checksum": "8cce988566e86b011de98152ec42a1f5a06580f9a5ce4b8a7f1e2987bac90906", "avgDurationSecs": 4.873, "validDurationSecs": 2070007.816, "totalHrs": 2236.69, "validHrs": 575 }, "et": { "buckets": { "dev": 2796, "test": 2796, "train": 
3362, "validated": 26004, "invalidated": 7727, "other": 0 }, "duration": 227800190, "reportedSentences": 565, "validatedSentences": 10871, "unvalidatedSentences": 890, "clips": 33731, "splits": { "accent": {}, "age": { "": 0.18, "twenties": 0.7, "thirties": 0.08, "teens": 0, "fourties": 0.03, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.22, "male_masculine": 0.52, "female_feminine": 0.26, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 33278, "agriculture_food": 21, "automotive_transport": 25, "finance": 20, "service_retail": 4, "general": 294, "healthcare": 13, "history_law_government": 17, "language_fundamentals": 8, "media_entertainment": 14, "nature_environment": 59, "news_current_affairs": 7, "technology_robotics": 22 } }, "users": 988, "size": 1526668645, "checksum": "c4ac8706f5f829d7f810ee0bf226eb6dc91fceab9b442028761cce22d0f4d4e7", "avgDurationSecs": 6.753, "validDurationSecs": 175616.381, "totalHrs": 63.27, "validHrs": 48.78 }, "eu": { "buckets": { "dev": 14353, "test": 14353, "train": 104438, "validated": 237705, "invalidated": 41752, "other": 172642 }, "duration": 2484272343, "reportedSentences": 1248, "validatedSentences": 1017596, "unvalidatedSentences": 2068, "clips": 452099, "splits": { "accent": {}, "age": { "": 0.45, "twenties": 0.17, "thirties": 0.06, "teens": 0.01, "fourties": 0.16, "fifties": 0.1, "sixties": 0.04, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.46, "male_masculine": 0.25, "female_feminine": 0.28, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0, "intersex": 0 }, "sentence_domain": { "": 443685, "agriculture_food": 0, "automotive_transport": 5, "finance": 4, "service_retail": 7, "general": 8345, "healthcare": 22, "history_law_government": 0, "language_fundamentals": 2, "media_entertainment": 6, "nature_environment": 2, "news_current_affairs": 0, "technology_robotics": 23 } }, "users": 10733, "size": 15252058224, 
"checksum": "076cbfe342ed08fd9cc34a3e3a41018fec4bb7b0ccd21adce062211b062e0583", "avgDurationSecs": 5.495, "validDurationSecs": 1306182.843, "totalHrs": 690.07, "validHrs": 362.82 }, "fa": { "buckets": { "dev": 10596, "test": 10596, "train": 29175, "validated": 332169, "invalidated": 14761, "other": 32324 }, "duration": 1507530932, "reportedSentences": 2534, "validatedSentences": 56098, "unvalidatedSentences": 283093, "clips": 379254, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.33, "thirties": 0.36, "teens": 0.03, "fourties": 0.03, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.23, "male_masculine": 0.71, "female_feminine": 0.07, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 379248, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 6, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 4484, "size": 10931054581, "checksum": "14719e3dc7bbdc8f00f61469228c96a8cd5c1838659841b1856f05a27a70505b", "avgDurationSecs": 3.975, "validDurationSecs": 1320368.519, "totalHrs": 418.75, "validHrs": 366.76 }, "fi": { "buckets": { "dev": 1773, "test": 1787, "train": 2060, "validated": 10704, "invalidated": 308, "other": 6242 }, "duration": 79634199, "reportedSentences": 77, "validatedSentences": 5843, "unvalidatedSentences": 48875, "clips": 17254, "splits": { "accent": {}, "age": { "": 0.31, "twenties": 0.16, "thirties": 0.17, "teens": 0.01, "fourties": 0.31, "fifties": 0.03, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0.36, "female_feminine": 0.31, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 17254, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, 
"healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 226, "size": 476120554, "checksum": "57466ec0b4ecadfc32cea149bca6085bbe60c6730b32bce09654ecf88d845b3f", "avgDurationSecs": 4.615, "validDurationSecs": 49403.296, "totalHrs": 22.12, "validHrs": 13.72 }, "fr": { "buckets": { "dev": 16169, "test": 16169, "train": 570806, "validated": 739181, "invalidated": 65005, "other": 25469 }, "duration": 4167791326, "reportedSentences": 7369, "validatedSentences": 1647106, "unvalidatedSentences": 43327, "clips": 829655, "splits": { "accent": {}, "age": { "": 0.38, "twenties": 0.17, "thirties": 0.15, "teens": 0.03, "fourties": 0.13, "fifties": 0.09, "sixties": 0.03, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.32, "male_masculine": 0.58, "female_feminine": 0.11, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 829626, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 19, "healthcare": 1, "history_law_government": 11, "language_fundamentals": 0, "media_entertainment": 1, "nature_environment": 5, "news_current_affairs": 1, "technology_robotics": 2 } }, "users": 19710, "size": 29284823400, "checksum": "8910e681be476efc9eb35c6787a0c16d0cab80a4a52bdd4f85e3cab30a0c19c5", "avgDurationSecs": 5.024, "validDurationSecs": 3713293.068, "totalHrs": 1157.71, "validHrs": 1031.47 }, "fy-NL": { "buckets": { "dev": 3170, "test": 3172, "train": 3923, "validated": 51408, "invalidated": 4003, "other": 102451 }, "duration": 766185615, "reportedSentences": 543, "validatedSentences": 10351, "unvalidatedSentences": 1, "clips": 157862, "splits": { "accent": {}, "age": { "": 0.56, "twenties": 0.04, "thirties": 0.1, "teens": 0, "fourties": 0.06, "fifties": 0.1, "sixties": 0.12, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.57, "male_masculine": 
0.13, "female_feminine": 0.31, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 157862, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 2057, "size": 4594248083, "checksum": "06f15d0a9d22556e998ffe7b11098ce762cad4f27bd3fc372fb2e0fda4dec0f6", "avgDurationSecs": 4.854, "validDurationSecs": 249509.509, "totalHrs": 212.82, "validHrs": 69.3 }, "ga-IE": { "buckets": { "dev": 515, "test": 516, "train": 539, "validated": 5792, "invalidated": 954, "other": 3774 }, "duration": 37791243, "reportedSentences": 29, "validatedSentences": 1618, "unvalidatedSentences": 587, "clips": 10520, "splits": { "accent": {}, "age": { "": 0.38, "twenties": 0.25, "thirties": 0.24, "teens": 0.01, "fourties": 0.06, "fifties": 0.05, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male_masculine": 0.49, "female_feminine": 0.14, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 10519, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 1, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 195, "size": 256110207, "checksum": "635216593deef246227f4fab368e2f42a3abd8f1345ed67a49f4366b3b6e648e", "avgDurationSecs": 3.592, "validDurationSecs": 20806.738, "totalHrs": 10.49, "validHrs": 5.77 }, "gl": { "buckets": { "dev": 10795, "test": 10865, "train": 31440, "validated": 53892, "invalidated": 5006, "other": 44114 }, "duration": 535843908, "reportedSentences": 591, "validatedSentences": 693458, "unvalidatedSentences": 6313, "clips": 103012, "splits": { "accent": {}, "age": { 
"": 0.39, "twenties": 0.12, "thirties": 0.12, "teens": 0.01, "fourties": 0.14, "fifties": 0.2, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.44, "male_masculine": 0.32, "female_feminine": 0.24, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 103000, "agriculture_food": 1, "automotive_transport": 0, "finance": 4, "service_retail": 0, "general": 6, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 1, "nature_environment": 1, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 3303, "size": 3120066536, "checksum": "c4431211c01884b751519744fa0a4a02a264380543b1a6799a95a1024ee2e039", "avgDurationSecs": 5.202, "validDurationSecs": 280333.358, "totalHrs": 148.84, "validHrs": 77.87 }, "gn": { "buckets": { "dev": 360, "test": 860, "train": 1576, "validated": 2924, "invalidated": 164, "other": 18949 }, "duration": 100196712, "reportedSentences": 36, "validatedSentences": 5894, "unvalidatedSentences": 292, "clips": 22037, "splits": { "accent": {}, "age": { "": 0.13, "twenties": 0.58, "thirties": 0.19, "teens": 0.02, "fourties": 0.08, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.14, "male_masculine": 0.38, "female_feminine": 0.48, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 22037, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 187, "size": 574499179, "checksum": "dc1c72fc83c23cfcae8b47568970e3712dd5ce4d741e8cbcd653467a72bc25d1", "avgDurationSecs": 4.547, "validDurationSecs": 13294.695, "totalHrs": 27.83, "validHrs": 3.69 }, "ha": { "buckets": { "dev": 596, "test": 681, "train": 1922, "validated": 3326, 
"invalidated": 171, "other": 6660 }, "duration": 44082072, "reportedSentences": 33, "validatedSentences": 5390, "unvalidatedSentences": 298, "clips": 10157, "splits": { "accent": {}, "age": { "": 0.18, "twenties": 0.12, "thirties": 0.67, "teens": 0, "fourties": 0, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.16, "male_masculine": 0.59, "female_feminine": 0.25, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 10157, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 49, "size": 258537982, "checksum": "e5922e4323d11af5365a0899a6cb47f7012bc62076a22b02691d9f7f47459a4d", "avgDurationSecs": 4.34, "validDurationSecs": 14435.067, "totalHrs": 12.24, "validHrs": 4 }, "he": { "buckets": { "dev": 287, "test": 272, "train": 1113, "validated": 2046, "invalidated": 243, "other": 2717 }, "duration": 22443768, "reportedSentences": 33, "validatedSentences": 4051, "unvalidatedSentences": 6960, "clips": 5006, "splits": { "accent": {}, "age": { "": 0.03, "twenties": 0.77, "thirties": 0.16, "teens": 0, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0.03, "nineties": 0 }, "gender": { "": 0.03, "male_masculine": 0.96, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5006, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 35, "size": 130623045, "checksum": "3ba2789558b19dc244f0e941fa69a8789cc3c973a48a3b2c055e13787e530695", "avgDurationSecs": 
4.483, "validDurationSecs": 9172.982, "totalHrs": 6.23, "validHrs": 2.54 }, "hi": { "buckets": { "dev": 2687, "test": 3287, "train": 4849, "validated": 10899, "invalidated": 871, "other": 4072 }, "duration": 78704808, "reportedSentences": 145, "validatedSentences": 32204, "unvalidatedSentences": 9961, "clips": 15842, "splits": { "accent": {}, "age": { "": 0.33, "twenties": 0.33, "thirties": 0.25, "teens": 0.01, "fourties": 0.05, "fifties": 0.02, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.36, "male_masculine": 0.6, "female_feminine": 0.04, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 15842, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 438, "size": 467283926, "checksum": "e3c7477526f0e376afec842771b63412fac0be63ff6db2184280cfe48bf1bbf5", "avgDurationSecs": 4.968, "validDurationSecs": 54147.437, "totalHrs": 21.86, "validHrs": 15.04 }, "hsb": { "buckets": { "dev": 172, "test": 463, "train": 808, "validated": 1443, "invalidated": 247, "other": 218 }, "duration": 11915892, "reportedSentences": 130, "validatedSentences": 7177, "unvalidatedSentences": 30, "clips": 1908, "splits": { "accent": {}, "age": { "": 0.17, "twenties": 0.11, "thirties": 0.09, "teens": 0.07, "fourties": 0.48, "fifties": 0.03, "sixties": 0.02, "seventies": 0.03, "eighties": 0, "nineties": 0 }, "gender": { "": 0.21, "male_masculine": 0.78, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1908, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, 
"news_current_affairs": 0, "technology_robotics": 0 } }, "users": 24, "size": 90484961, "checksum": "893c7f65be18bed8e66984c85a3d545055277b8ffd9e5cae76f5df77e3ce2531", "avgDurationSecs": 6.245, "validDurationSecs": 9011.862, "totalHrs": 3.3, "validHrs": 2.5 }, "ht": { "buckets": { "dev": 0, "test": 0, "train": 5, "validated": 5, "invalidated": 0, "other": 0 }, "duration": 20664, "reportedSentences": 0, "validatedSentences": 72, "unvalidatedSentences": 1633, "clips": 5, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1, "size": 226968, "checksum": "d8d01c1cb0d035cfaeeb4440f6d9c53d6f629ce942569d8c31eca54066fdf29d", "avgDurationSecs": 4.133, "validDurationSecs": 20.664, "totalHrs": 0, "validHrs": 0 }, "hu": { "buckets": { "dev": 11487, "test": 11506, "train": 37768, "validated": 61201, "invalidated": 3169, "other": 50287 }, "duration": 634215097, "reportedSentences": 474, "validatedSentences": 326038, "unvalidatedSentences": 184, "clips": 114657, "splits": { "accent": {}, "age": { "": 0.28, "twenties": 0.11, "thirties": 0.12, "teens": 0.02, "fourties": 0.11, "fifties": 0.32, "sixties": 0.01, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male_masculine": 0.24, "female_feminine": 0.46, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 114657, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, 
"healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1634, "size": 3730103630, "checksum": "314de1b5642aaf942139d6e6c270b952f076436a9b77326fa3ada19c29d02423", "avgDurationSecs": 5.531, "validDurationSecs": 338527.941, "totalHrs": 176.17, "validHrs": 94.03 }, "hy-AM": { "buckets": { "dev": 4539, "test": 4518, "train": 6305, "validated": 15509, "invalidated": 1046, "other": 16007 }, "duration": 176735772, "reportedSentences": 169, "validatedSentences": 237393, "unvalidatedSentences": 1362, "clips": 32562, "splits": { "accent": {}, "age": { "": 0.15, "twenties": 0.59, "thirties": 0.08, "teens": 0.12, "fourties": 0.04, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.17, "male_masculine": 0.28, "female_feminine": 0.55, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 32560, "agriculture_food": 0, "automotive_transport": 0, "finance": 1, "service_retail": 0, "general": 1, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 403, "size": 1039227751, "checksum": "a5cfbbe9220657400a35962ada79304f7b83623bfb3cbffe639d9257ac71c7a8", "avgDurationSecs": 5.428, "validDurationSecs": 84177.725, "totalHrs": 49.09, "validHrs": 23.38 }, "ia": { "buckets": { "dev": 1836, "test": 1837, "train": 4951, "validated": 11918, "invalidated": 346, "other": 2386 }, "duration": 61415724, "reportedSentences": 277, "validatedSentences": 9069, "unvalidatedSentences": 38, "clips": 14650, "splits": { "accent": {}, "age": { "": 0.38, "twenties": 0.05, "thirties": 0.02, "teens": 0, "fourties": 0.3, "fifties": 0.03, "sixties": 0, "seventies": 0.22, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male_masculine": 0.61, "female_feminine": 
0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 14650, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 70, "size": 416648389, "checksum": "6a7f83cb6dc3a7a8782b642724ff90d16a25bd55d9999b285ea10b164bfd2479", "avgDurationSecs": 4.192, "validDurationSecs": 49962.635, "totalHrs": 17.05, "validHrs": 13.87 }, "id": { "buckets": { "dev": 3456, "test": 3687, "train": 4972, "validated": 29617, "invalidated": 2763, "other": 26361 }, "duration": 234762780, "reportedSentences": 354, "validatedSentences": 13189, "unvalidatedSentences": 11914, "clips": 58741, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.41, "thirties": 0.1, "teens": 0.22, "fourties": 0.02, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.29, "male_masculine": 0.45, "female_feminine": 0.26, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 58741, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 588, "size": 1477844248, "checksum": "a0cb0508c4c978a0d85555a919e9dd0cb618a0e95b88c22b87a996c4ad365f77", "avgDurationSecs": 3.997, "validDurationSecs": 118366.546, "totalHrs": 65.21, "validHrs": 32.87 }, "ig": { "buckets": { "dev": 2, "test": 5, "train": 9, "validated": 16, "invalidated": 3, "other": 5765 }, "duration": 31592808, "reportedSentences": 16, "validatedSentences": 14454, "unvalidatedSentences": 3857, "clips": 5784, "splits": { "accent": {}, "age": { "": 0.56, "twenties": 0.33, "thirties": 0.04, 
"teens": 0.05, "fourties": 0, "fifties": 0, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.56, "male_masculine": 0.14, "female_feminine": 0.3, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5784, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 117, "size": 185751835, "checksum": "8e743890338360354d3d68f1947e28533ca650f36658f2f09021ec98a0051cae", "avgDurationSecs": 5.462, "validDurationSecs": 87.394, "totalHrs": 8.77, "validHrs": 0.02 }, "is": { "buckets": { "dev": 0, "test": 1, "train": 14, "validated": 15, "invalidated": 1, "other": 24 }, "duration": 284040, "reportedSentences": 0, "validatedSentences": 4993, "unvalidatedSentences": 202, "clips": 40, "splits": { "accent": {}, "age": { "": 0.88, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0.13, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.88, "male_masculine": 0.13, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 40, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 4, "size": 2051325, "checksum": "2a6fe6c7f8e5ca401e27531fef46b2e0854c77d6c40c9192ff64393c4b0bcebe", "avgDurationSecs": 7.101, "validDurationSecs": 106.515, "totalHrs": 0.07, "validHrs": 0.02 }, "it": { "buckets": { "dev": 15162, "test": 15167, "train": 171388, "validated": 238341, "invalidated": 20123, "other": 12189 }, "duration": 1464519984, "reportedSentences": 5955, 
"validatedSentences": 924435, "unvalidatedSentences": 1463, "clips": 270653, "splits": { "accent": {}, "age": { "": 0.28, "twenties": 0.2, "thirties": 0.16, "teens": 0.01, "fourties": 0.13, "fifties": 0.14, "sixties": 0.07, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.28, "male_masculine": 0.62, "female_feminine": 0.1, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 270647, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 1, "general": 6, "healthcare": 1, "history_law_government": 0, "language_fundamentals": 2, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 7179, "size": 10006037052, "checksum": "633c5e41211a31f2fb32182bd715f6a21eb79217fb6f2923f55c45d9b10e7d90", "avgDurationSecs": 5.411, "validDurationSecs": 1289677.77, "totalHrs": 406.81, "validHrs": 358.24 }, "ja": { "buckets": { "dev": 6288, "test": 6288, "train": 10110, "validated": 93628, "invalidated": 13678, "other": 251131 }, "duration": 1723048053, "reportedSentences": 381, "validatedSentences": 29975, "unvalidatedSentences": 249, "clips": 358437, "splits": { "accent": {}, "age": { "": 0.16, "twenties": 0.53, "thirties": 0.08, "teens": 0.07, "fourties": 0.1, "fifties": 0.04, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.18, "male_masculine": 0.43, "female_feminine": 0.38, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 357567, "agriculture_food": 92, "automotive_transport": 0, "finance": 16, "service_retail": 11, "general": 481, "healthcare": 0, "history_law_government": 159, "language_fundamentals": 117, "media_entertainment": 11, "nature_environment": 4, "news_current_affairs": 1, "technology_robotics": 0 } }, "users": 4074, "size": 10159358783, "checksum": "07ef4dc984e0779531ac49095ad7d49f67a5199565b5f66a1cf902992e0ebc91", "avgDurationSecs": 4.807, "validDurationSecs": 
450080.609, "totalHrs": 478.62, "validHrs": 125.02 }, "ka": { "buckets": { "dev": 12753, "test": 12966, "train": 59215, "validated": 109430, "invalidated": 5683, "other": 95191 }, "duration": 1092783384, "reportedSentences": 962, "validatedSentences": 131974, "unvalidatedSentences": 1111, "clips": 210304, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0.21, "thirties": 0.35, "teens": 0.11, "fourties": 0.06, "fifties": 0.04, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.28, "male_masculine": 0.18, "female_feminine": 0.54, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 209888, "agriculture_food": 7, "automotive_transport": 4, "finance": 3, "service_retail": 7, "general": 182, "healthcare": 8, "history_law_government": 165, "language_fundamentals": 24, "media_entertainment": 15, "nature_environment": 7, "news_current_affairs": 8, "technology_robotics": 2 } }, "users": 2443, "size": 6392666547, "checksum": "0603655df7532e8b2b4604b20f159d8c06b6ce135b6ac0a7cda736bd5bb89c6d", "avgDurationSecs": 5.196, "validDurationSecs": 568621.071, "totalHrs": 303.55, "validHrs": 157.95 }, "kab": { "buckets": { "dev": 15001, "test": 15001, "train": 152282, "validated": 613198, "invalidated": 19940, "other": 113045 }, "duration": 2492900656, "reportedSentences": 9646, "validatedSentences": 182728, "unvalidatedSentences": 205435, "clips": 746183, "splits": { "accent": {}, "age": { "": 0.28, "twenties": 0.12, "thirties": 0.29, "teens": 0, "fourties": 0.09, "fifties": 0.19, "sixties": 0.03, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male_masculine": 0.53, "female_feminine": 0.2, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 746183, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, 
"nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1571, "size": 18620919551, "checksum": "bd5fd60b7fd5d7836670ed883e511b3bfd96380e42c1519870d733be0f2106a4", "avgDurationSecs": 3.341, "validDurationSecs": 2048615.013, "totalHrs": 692.47, "validHrs": 569.05 }, "kk": { "buckets": { "dev": 509, "test": 516, "train": 550, "validated": 1581, "invalidated": 228, "other": 722 }, "duration": 12454092, "reportedSentences": 42, "validatedSentences": 6168, "unvalidatedSentences": 299, "clips": 2531, "splits": { "accent": {}, "age": { "": 0.54, "twenties": 0.24, "thirties": 0.03, "teens": 0.06, "fourties": 0.08, "fifties": 0.05, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.55, "male_masculine": 0.34, "female_feminine": 0.11, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2531, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 167, "size": 71327545, "checksum": "d927003484ed013062eca32f28041cc26b39d94b1d88942cf8f2aea4d74dddd1", "avgDurationSecs": 4.921, "validDurationSecs": 7779.502, "totalHrs": 3.45, "validHrs": 2.16 }, "kln": { "buckets": { "dev": 6384, "test": 6036, "train": 11055, "validated": 32115, "invalidated": 535, "other": 37541 }, "duration": 330232284, "reportedSentences": 0, "validatedSentences": 29900, "unvalidatedSentences": 61, "clips": 70191, "splits": { "accent": {}, "age": { "": 0.02, "twenties": 0.59, "thirties": 0.05, "teens": 0.05, "fourties": 0.29, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.27, "male_masculine": 0.03, "female_feminine": 0.7, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 4, "agriculture_food": 5, 
"automotive_transport": 16, "finance": 32, "service_retail": 625, "general": 68944, "healthcare": 54, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 2, "nature_environment": 23, "news_current_affairs": 2, "technology_robotics": 484 } }, "users": 39, "size": 1800599728, "checksum": "50304fe541b13425a953eaf16b73d04b4c09cc277e06042009d8c5f76799553e", "avgDurationSecs": 4.705, "validDurationSecs": 151093.585, "totalHrs": 91.73, "validHrs": 41.97 }, "kmr": { "buckets": { "dev": 3930, "test": 3930, "train": 5149, "validated": 60226, "invalidated": 2568, "other": 26278 }, "duration": 364106484, "reportedSentences": 1058, "validatedSentences": 16796, "unvalidatedSentences": 687, "clips": 89072, "splits": { "accent": {}, "age": { "": 0.48, "twenties": 0.35, "thirties": 0.09, "teens": 0.01, "fourties": 0.03, "fifties": 0.04, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.48, "male_masculine": 0.42, "female_feminine": 0.09, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 89066, "agriculture_food": 1, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 3, "healthcare": 0, "history_law_government": 1, "language_fundamentals": 1, "media_entertainment": 1, "nature_environment": 2, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 573, "size": 1931160727, "checksum": "a027d3a4c6a21266765af001483f1bddd2c7207246912118dec0871ab6e23330", "avgDurationSecs": 4.088, "validDurationSecs": 246190.465, "totalHrs": 101.14, "validHrs": 68.38 }, "ko": { "buckets": { "dev": 316, "test": 352, "train": 533, "validated": 1202, "invalidated": 295, "other": 2416 }, "duration": 22639752, "reportedSentences": 29, "validatedSentences": 6910, "unvalidatedSentences": 2964, "clips": 3913, "splits": { "accent": {}, "age": { "": 0.15, "twenties": 0.43, "thirties": 0.33, "teens": 0.03, "fourties": 0.06, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, 
"nineties": 0 }, "gender": { "": 0.22, "male_masculine": 0.57, "female_feminine": 0.21, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 3913, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 111, "size": 132537966, "checksum": "a69d1b0cccf300d956243bd073ffce199d96c86abeefbff73228971266f91358", "avgDurationSecs": 5.786, "validDurationSecs": 6954.506, "totalHrs": 6.28, "validHrs": 1.93 }, "ky": { "buckets": { "dev": 1613, "test": 1614, "train": 1789, "validated": 30389, "invalidated": 5646, "other": 1670 }, "duration": 171646536, "reportedSentences": 39, "validatedSentences": 5027, "unvalidatedSentences": 4, "clips": 37705, "splits": { "accent": {}, "age": { "": 0.07, "twenties": 0.65, "thirties": 0.09, "teens": 0.17, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.12, "male_masculine": 0.54, "female_feminine": 0.34, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 37705, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 288, "size": 1104370377, "checksum": "5fa0c164f09ce8811f72d381b92857fec4fe6dc1b11964dadb44022f58653437", "avgDurationSecs": 4.552, "validDurationSecs": 138341.509, "totalHrs": 47.67, "validHrs": 38.42 }, "lg": { "buckets": { "dev": 13340, "test": 13362, "train": 71073, "validated": 271822, "invalidated": 39174, "other": 37086 }, "duration": 2013266727, "reportedSentences": 6451, "validatedSentences": 191407, "unvalidatedSentences": 173, 
"clips": 348082, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.38, "thirties": 0.22, "teens": 0.01, "fourties": 0.07, "fifties": 0.05, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.25, "male_masculine": 0.36, "female_feminine": 0.39, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 348082, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 659, "size": 11819759502, "checksum": "7badf1e48c436b4770d1fd89706d8bed72b4ec751db0b48be1024b39b9d440eb", "avgDurationSecs": 5.784, "validDurationSecs": 1572187.554, "totalHrs": 559.24, "validHrs": 436.71 }, "lij": { "buckets": { "dev": 487, "test": 788, "train": 1685, "validated": 3027, "invalidated": 473, "other": 5 }, "duration": 12707856, "reportedSentences": 371, "validatedSentences": 5384, "unvalidatedSentences": 12, "clips": 3505, "splits": { "accent": {}, "age": { "": 0.17, "twenties": 0.1, "thirties": 0.66, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0.07, "eighties": 0, "nineties": 0 }, "gender": { "": 0.17, "male_masculine": 0.66, "female_feminine": 0.18, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 3505, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 25, "size": 70989670, "checksum": "18dc88fdaaefef43e7d51b2994c22c099dda3f065a8574abaeeb58c86e204f94", "avgDurationSecs": 3.626, "validDurationSecs": 10974.802, "totalHrs": 3.52, "validHrs": 3.04 }, "lo": { "buckets": { "dev": 28, "test": 40, 
"train": 55, "validated": 123, "invalidated": 7, "other": 78 }, "duration": 1342152, "reportedSentences": 0, "validatedSentences": 5145, "unvalidatedSentences": 5925, "clips": 208, "splits": { "accent": {}, "age": { "": 0.07, "twenties": 0.33, "thirties": 0.6, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.07, "male_masculine": 0.93, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 208, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 13, "size": 8667245, "checksum": "ec3e6e4a839ac703f9c3adffc71d8cbe1d38100dbd372fa92a392362337c134b", "avgDurationSecs": 6.453, "validDurationSecs": 793.676, "totalHrs": 0.37, "validHrs": 0.22 }, "lt": { "buckets": { "dev": 4730, "test": 4925, "train": 7339, "validated": 17196, "invalidated": 845, "other": 3840 }, "duration": 112541048, "reportedSentences": 191, "validatedSentences": 129241, "unvalidatedSentences": 4787, "clips": 21881, "splits": { "accent": {}, "age": { "": 0.17, "twenties": 0.29, "thirties": 0.26, "teens": 0.02, "fourties": 0.21, "fifties": 0.04, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.35, "male_masculine": 0.47, "female_feminine": 0.18, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 21881, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 305, "size": 690853878, "checksum": 
"0fbd9936cd36af91d1f876f0ab762a5d90e8f0ba0c62d5556f4a372e25c4d7a5", "avgDurationSecs": 5.143, "validDurationSecs": 88444.58, "totalHrs": 31.26, "validHrs": 24.56 }, "ltg": { "buckets": { "dev": 3361, "test": 3373, "train": 4286, "validated": 19898, "invalidated": 1785, "other": 464 }, "duration": 101102580, "reportedSentences": 62, "validatedSentences": 11570, "unvalidatedSentences": 4225, "clips": 22147, "splits": { "accent": {}, "age": { "": 0.32, "twenties": 0.09, "thirties": 0.23, "teens": 0, "fourties": 0.34, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0.06, "female_feminine": 0.62, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 19130, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 3017, "healthcare": 0, "history_law_government": 291, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 331, "size": 581423398, "checksum": "b04a1f2c06f94f17021cc7c311ef7d16c83b4c2f7215334b84d8afbad3e1d459", "avgDurationSecs": 4.565, "validDurationSecs": 90835.74, "totalHrs": 28.08, "validHrs": 25.23 }, "luo": { "buckets": { "dev": 3060, "test": 2996, "train": 4499, "validated": 20336, "invalidated": 792, "other": 61578 }, "duration": 430450452, "reportedSentences": 8, "validatedSentences": 14690, "unvalidatedSentences": 13872, "clips": 82706, "splits": { "accent": {}, "age": { "": 0.36, "twenties": 0.33, "thirties": 0.28, "teens": 0, "fourties": 0, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.6, "male_masculine": 0, "female_feminine": 0.4, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 11, "agriculture_food": 1135, "automotive_transport": 341, "finance": 967, "service_retail": 2746, "general": 69461, "healthcare": 2236, 
"history_law_government": 864, "language_fundamentals": 555, "media_entertainment": 806, "nature_environment": 634, "news_current_affairs": 2709, "technology_robotics": 241 } }, "users": 43, "size": 2391314678, "checksum": "cad460a87e5eaf687327e51eaff215a04eab7de60039f0cd9cc2049446123b9e", "avgDurationSecs": 5.205, "validDurationSecs": 105840.452, "totalHrs": 119.56, "validHrs": 29.4 }, "lv": { "buckets": { "dev": 7459, "test": 7459, "train": 13522, "validated": 191086, "invalidated": 10965, "other": 21135 }, "duration": 1061245493, "reportedSentences": 3113, "validatedSentences": 33312, "unvalidatedSentences": 65858, "clips": 223186, "splits": { "accent": {}, "age": { "": 0.44, "twenties": 0.06, "thirties": 0.29, "teens": 0.01, "fourties": 0.09, "fifties": 0.08, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.45, "male_masculine": 0.25, "female_feminine": 0.3, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 222686, "agriculture_food": 5, "automotive_transport": 3, "finance": 2, "service_retail": 11, "general": 418, "healthcare": 19, "history_law_government": 3, "language_fundamentals": 8, "media_entertainment": 8, "nature_environment": 19, "news_current_affairs": 4, "technology_robotics": 8 } }, "users": 6158, "size": 5939867974, "checksum": "e094b09c34700fe27f48923397748580671759e4be6bb107ffe3460df34cd9ae", "avgDurationSecs": 4.755, "validDurationSecs": 908610.559, "totalHrs": 294.79, "validHrs": 252.39 }, "mdf": { "buckets": { "dev": 54, "test": 107, "train": 175, "validated": 336, "invalidated": 9, "other": 0 }, "duration": 1811340, "reportedSentences": 15, "validatedSentences": 6039, "unvalidatedSentences": 4, "clips": 345, "splits": { "accent": {}, "age": { "": 0.58, "twenties": 0.02, "thirties": 0, "teens": 0, "fourties": 0.34, "fifties": 0, "sixties": 0.06, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.58, "male_masculine": 0.08, "female_feminine": 0.34, 
"transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 345, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 11, "size": 11056943, "checksum": "b33a49f4c446d53d5e14c27e5b16bbbab78315917f7066629d1e37237b34f379", "avgDurationSecs": 5.25, "validDurationSecs": 1764.088, "totalHrs": 0.5, "validHrs": 0.49 }, "mhr": { "buckets": { "dev": 15254, "test": 15125, "train": 185917, "validated": 218660, "invalidated": 16021, "other": 55 }, "duration": 1085203404, "reportedSentences": 102, "validatedSentences": 328269, "unvalidatedSentences": 3, "clips": 234736, "splits": { "accent": {}, "age": { "": 0.12, "twenties": 0.33, "thirties": 0.21, "teens": 0.14, "fourties": 0.12, "fifties": 0.04, "sixties": 0.03, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.12, "male_masculine": 0.19, "female_feminine": 0.69, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 234736, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 496, "size": 6123092943, "checksum": "6feae35d46588cfca4d2be72dab19518d356050392b4b9f4882fde43df16e2be", "avgDurationSecs": 4.623, "validDurationSecs": 1010882.763, "totalHrs": 301.44, "validHrs": 280.8 }, "mk": { "buckets": { "dev": 1287, "test": 1170, "train": 1775, "validated": 7042, "invalidated": 271, "other": 11766 }, "duration": 82575972, "reportedSentences": 48, "validatedSentences": 5211, "unvalidatedSentences": 1379, "clips": 19079, "splits": { "accent": {}, "age": { "": 0.01, "twenties": 0.47, 
"thirties": 0.48, "teens": 0, "fourties": 0.04, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.01, "male_masculine": 0.58, "female_feminine": 0.41, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 19079, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 20, "size": 447304674, "checksum": "a035c4d04ae2c0a31626ffe328594f68d71d84f8794a7c188d2f3cfdd56031b5", "avgDurationSecs": 4.328, "validDurationSecs": 30478.536, "totalHrs": 22.93, "validHrs": 8.46 }, "ml": { "buckets": { "dev": 799, "test": 743, "train": 1254, "validated": 3076, "invalidated": 112, "other": 5573 }, "duration": 36680040, "reportedSentences": 198, "validatedSentences": 5465, "unvalidatedSentences": 4746, "clips": 8761, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0.57, "thirties": 0.09, "teens": 0, "fourties": 0.12, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.27, "male_masculine": 0.52, "female_feminine": 0.21, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8761, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 139, "size": 207316188, "checksum": "bf6680aea30453afb89345b49e6eeeb71f130665d4e3ccde2c36ee96919fd129", "avgDurationSecs": 4.187, "validDurationSecs": 12878.416, "totalHrs": 10.18, "validHrs": 3.57 }, "mn": { "buckets": { "dev": 1931, "test": 1932, "train": 2195, "validated": 33173, "invalidated": 3172, "other": 59902 }, "duration": 
505583828, "reportedSentences": 45, "validatedSentences": 6089, "unvalidatedSentences": 13, "clips": 96247, "splits": { "accent": {}, "age": { "": 0.04, "twenties": 0.92, "thirties": 0.03, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0.01 }, "gender": { "": 0.46, "male_masculine": 0.14, "female_feminine": 0.4, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 96232, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 15, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 576, "size": 3075576503, "checksum": "2a04dffb81c8883b04115090cb453408f035b2ea9f3fb0f179ba708e1411d33b", "avgDurationSecs": 5.253, "validDurationSecs": 174257.196, "totalHrs": 140.43, "validHrs": 48.4 }, "mr": { "buckets": { "dev": 1766, "test": 1792, "train": 2191, "validated": 10979, "invalidated": 2282, "other": 2768 }, "duration": 99291024, "reportedSentences": 59, "validatedSentences": 6715, "unvalidatedSentences": 12104, "clips": 16029, "splits": { "accent": {}, "age": { "": 0.05, "twenties": 0.29, "thirties": 0.13, "teens": 0.53, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.05, "male_masculine": 0.19, "female_feminine": 0.76, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 16029, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 95, "size": 583909047, "checksum": "8f88beb23d118b0784590f8bd2c112aafc9982a527cc2a2e1ecc8fa880973518", "avgDurationSecs": 6.194, "validDurationSecs": 68008.993, 
"totalHrs": 27.58, "validHrs": 18.89 }, "mrj": { "buckets": { "dev": 7314, "test": 7111, "train": 14325, "validated": 28920, "invalidated": 1249, "other": 1509 }, "duration": 132771780, "reportedSentences": 50, "validatedSentences": 54143, "unvalidatedSentences": 4, "clips": 31678, "splits": { "accent": {}, "age": { "": 0.1, "twenties": 0.35, "thirties": 0.22, "teens": 0.06, "fourties": 0.12, "fifties": 0.11, "sixties": 0.04, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.13, "male_masculine": 0.25, "female_feminine": 0.62, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 31678, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 60, "size": 746601715, "checksum": "72f06648f4c690814ac2c2e2ac573d75f3955867944ec840d30bd9e172115cb3", "avgDurationSecs": 4.191, "validDurationSecs": 121212.194, "totalHrs": 36.88, "validHrs": 33.67 }, "mt": { "buckets": { "dev": 1625, "test": 1649, "train": 1918, "validated": 6553, "invalidated": 325, "other": 6324 }, "duration": 62759880, "reportedSentences": 10, "validatedSentences": 5273, "unvalidatedSentences": 11773, "clips": 13202, "splits": { "accent": {}, "age": { "": 0.27, "twenties": 0.19, "thirties": 0.09, "teens": 0.03, "fourties": 0.17, "fifties": 0.25, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.28, "male_masculine": 0.25, "female_feminine": 0.47, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 13202, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, 
"technology_robotics": 0 } }, "users": 221, "size": 466315849, "checksum": "0296cf1cd1847997f0ff1caaa6a83bc6ed87ea83669a83d305369f63c73d333a", "avgDurationSecs": 4.754, "validDurationSecs": 31151.757, "totalHrs": 17.43, "validHrs": 8.65 }, "myv": { "buckets": { "dev": 239, "test": 478, "train": 1241, "validated": 1968, "invalidated": 26, "other": 1 }, "duration": 11519388, "reportedSentences": 31, "validatedSentences": 5066, "unvalidatedSentences": 7, "clips": 1995, "splits": { "accent": {}, "age": { "": 0.38, "twenties": 0.09, "thirties": 0.24, "teens": 0.01, "fourties": 0.02, "fifties": 0, "sixties": 0.26, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male_masculine": 0.54, "female_feminine": 0.07, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1995, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 13, "size": 67700687, "checksum": "092397b845e17dcc5f637118898ac964951e9742fc9e931698abd750c829e124", "avgDurationSecs": 5.774, "validDurationSecs": 11363.487, "totalHrs": 3.19, "validHrs": 3.15 }, "nan-tw": { "buckets": { "dev": 2785, "test": 2660, "train": 3613, "validated": 9379, "invalidated": 824, "other": 19350 }, "duration": 77669208, "reportedSentences": 216, "validatedSentences": 24587, "unvalidatedSentences": 219, "clips": 29553, "splits": { "accent": {}, "age": { "": 0.12, "twenties": 0.53, "thirties": 0.15, "teens": 0.01, "fourties": 0.16, "fifties": 0.01, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.24, "male_masculine": 0.65, "female_feminine": 0.11, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 29553, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, 
"service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 228, "size": 442070374, "checksum": "c1ec8b8c6076485166875958ac759ed521adff6623ece63c18ef6b0aac259af6", "avgDurationSecs": 2.628, "validDurationSecs": 24649.257, "totalHrs": 21.57, "validHrs": 6.84 }, "ne-NP": { "buckets": { "dev": 147, "test": 205, "train": 381, "validated": 781, "invalidated": 67, "other": 638 }, "duration": 5949612, "reportedSentences": 5, "validatedSentences": 1298, "unvalidatedSentences": 46, "clips": 1486, "splits": { "accent": {}, "age": { "": 0.28, "twenties": 0.6, "thirties": 0.11, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male_masculine": 0.69, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1486, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 44, "size": 34891701, "checksum": "dba4f5cd3f4e14950bad9a77e21b06e201c647066b9bcad64dae845002d66cab", "avgDurationSecs": 4.004, "validDurationSecs": 3126.95, "totalHrs": 1.65, "validHrs": 0.86 }, "nhi": { "buckets": { "dev": 0, "test": 5, "train": 16, "validated": 21, "invalidated": 0, "other": 14 }, "duration": 163872, "reportedSentences": 0, "validatedSentences": 758, "unvalidatedSentences": 9, "clips": 35, "splits": { "accent": {}, "age": { "": 0.43, "twenties": 0, "thirties": 0.57, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.43, "male_masculine": 0.57, "female_feminine": 0, "transgender": 0, "non-binary": 0, 
"do_not_wish_to_say": 0 }, "sentence_domain": { "": 35, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 3, "size": 1009292, "checksum": "be02ae48562f59c848ce6da422669d048019c4771c8902725f9ea3b93379b832", "avgDurationSecs": 4.682, "validDurationSecs": 98.323, "totalHrs": 0.04, "validHrs": 0.02 }, "nl": { "buckets": { "dev": 11431, "test": 11431, "train": 36520, "validated": 92424, "invalidated": 5666, "other": 2544 }, "duration": 438995665, "reportedSentences": 340, "validatedSentences": 258940, "unvalidatedSentences": 12, "clips": 100634, "splits": { "accent": {}, "age": { "": 0.4, "twenties": 0.2, "thirties": 0.13, "teens": 0.02, "fourties": 0.15, "fifties": 0.07, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.41, "male_masculine": 0.48, "female_feminine": 0.11, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 100629, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 4, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 1, "news_current_affairs": 0, "technology_robotics": 1 } }, "users": 1756, "size": 2991713246, "checksum": "cd09cdb595eb36b102d71f5d009a115983e374d1e650e0a212a411c33cc1a849", "avgDurationSecs": 4.362, "validDurationSecs": 403181.185, "totalHrs": 121.94, "validHrs": 111.99 }, "nn-NO": { "buckets": { "dev": 283, "test": 372, "train": 522, "validated": 1182, "invalidated": 66, "other": 121 }, "duration": 5995944, "reportedSentences": 23, "validatedSentences": 5201, "unvalidatedSentences": 123, "clips": 1369, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.28, "thirties": 0.41, "teens": 0.02, "fourties": 0.03, 
"fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.28, "male_masculine": 0.59, "female_feminine": 0.13, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1369, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 38, "size": 34853137, "checksum": "0a057655ae1e37607a0ae3dcaf226caa505a01b29d1960449bd1b32f61c7df71", "avgDurationSecs": 4.38, "validDurationSecs": 5176.922, "totalHrs": 1.66, "validHrs": 1.43 }, "nso": { "buckets": { "dev": 0, "test": 0, "train": 2, "validated": 2, "invalidated": 0, "other": 18 }, "duration": 92592, "reportedSentences": 0, "validatedSentences": 2244, "unvalidatedSentences": 14227, "clips": 20, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 20, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1, "size": 1552765, "checksum": "702a5131c32f040dd7219d084c457d67f034fabf129084d9f94f99f7dd0f7a57", "avgDurationSecs": 4.63, "validDurationSecs": 9.259, "totalHrs": 0.02, "validHrs": 0 }, "oc": { "buckets": { "dev": 274, "test": 275, "train": 281, "validated": 1900, "invalidated": 196, "other": 7649 }, "duration": 47490300, "reportedSentences": 10, "validatedSentences": 4929, "unvalidatedSentences": 10, "clips": 9745, "splits": { 
"accent": {}, "age": { "": 0.5, "twenties": 0.01, "thirties": 0.04, "teens": 0.01, "fourties": 0.17, "fifties": 0.16, "sixties": 0.12, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.51, "male_masculine": 0.09, "female_feminine": 0.4, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 9745, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 146, "size": 270187116, "checksum": "8742f2e242ebe6f871bf49b571f0863934fe3dd049cc57f6001280b4aae3f610", "avgDurationSecs": 4.873, "validDurationSecs": 9259.268, "totalHrs": 13.19, "validHrs": 2.57 }, "or": { "buckets": { "dev": 0, "test": 881, "train": 2134, "validated": 3654, "invalidated": 389, "other": 16019 }, "duration": 110016168, "reportedSentences": 96, "validatedSentences": 6195, "unvalidatedSentences": 37, "clips": 20062, "splits": { "accent": {}, "age": { "": 0.05, "twenties": 0.61, "thirties": 0.32, "teens": 0, "fourties": 0.01, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0.4, "female_feminine": 0.03, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.25 }, "sentence_domain": { "": 20058, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 4, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 152, "size": 688098908, "checksum": "73cc1a5d98d4ba60c1964e448806c0ef7abc60536dc865d740d0520256e5b254", "avgDurationSecs": 5.484, "validDurationSecs": 20037.837, "totalHrs": 30.56, "validHrs": 5.56 }, "os": { "buckets": { "dev": 43, "test": 104, "train": 207, "validated": 354, 
"invalidated": 36, "other": 24 }, "duration": 2205900, "reportedSentences": 2, "validatedSentences": 980, "unvalidatedSentences": 902, "clips": 414, "splits": { "accent": {}, "age": { "": 0.28, "twenties": 0.2, "thirties": 0.12, "teens": 0.05, "fourties": 0.35, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.61, "male_masculine": 0.39, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 414, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 18, "size": 12766474, "checksum": "89dd008b8dbcbbec1fbf3f48e0c10e4dabea1219950d22fab0908d34cc8af8e7", "avgDurationSecs": 5.328, "validDurationSecs": 1886.204, "totalHrs": 0.61, "validHrs": 0.52 }, "pa-IN": { "buckets": { "dev": 404, "test": 530, "train": 788, "validated": 1722, "invalidated": 90, "other": 1259 }, "duration": 14820614, "reportedSentences": 251, "validatedSentences": 14822, "unvalidatedSentences": 21, "clips": 3071, "splits": { "accent": {}, "age": { "": 0.29, "twenties": 0.23, "thirties": 0.39, "teens": 0, "fourties": 0.04, "fifties": 0.05, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.32, "male_masculine": 0.67, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 3071, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 70, "size": 104686805, "checksum": "87868cb9c4d0d2643c5f4c07ded7043cf35651f64d12d89d0bb87c2b6de6b051", "avgDurationSecs": 4.826, 
"validDurationSecs": 8310.354, "totalHrs": 4.11, "validHrs": 2.3 }, "pl": { "buckets": { "dev": 9326, "test": 9326, "train": 21221, "validated": 133353, "invalidated": 6687, "other": 2989 }, "duration": 650126882, "reportedSentences": 572, "validatedSentences": 229062, "unvalidatedSentences": 11432, "clips": 143029, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.27, "thirties": 0.34, "teens": 0.02, "fourties": 0.11, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0.01 }, "gender": { "": 0.27, "male_masculine": 0.59, "female_feminine": 0.14, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 143027, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 2, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 3347, "size": 4628488862, "checksum": "006281fec6f84154f48a9fae4740ae233ba1c6a5391930be5d85d11562820689", "avgDurationSecs": 4.545, "validDurationSecs": 606145.398, "totalHrs": 180.59, "validHrs": 168.37 }, "ps": { "buckets": { "dev": 1611, "test": 1612, "train": 1789, "validated": 15326, "invalidated": 1622, "other": 47715 }, "duration": 323015364, "reportedSentences": 863, "validatedSentences": 8645, "unvalidatedSentences": 825, "clips": 64663, "splits": { "accent": {}, "age": { "": 0.29, "twenties": 0.42, "thirties": 0.16, "teens": 0.04, "fourties": 0.06, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.98, "male_masculine": 0.02, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 39731, "agriculture_food": 589, "automotive_transport": 263, "finance": 90, "service_retail": 410, "general": 20172, "healthcare": 548, "history_law_government": 1825, "language_fundamentals": 1723, "media_entertainment": 762, 
"nature_environment": 597, "news_current_affairs": 560, "technology_robotics": 260 } }, "users": 1098, "size": 1794159003, "checksum": "509de45a51a3f18b5cba1dd3939b77d0d79d5442eb224f864f45d907bd68f6ef", "avgDurationSecs": 4.995, "validDurationSecs": 76558.982, "totalHrs": 89.72, "validHrs": 21.26 }, "pt": { "buckets": { "dev": 9517, "test": 9522, "train": 22269, "validated": 154712, "invalidated": 7560, "other": 25717 }, "duration": 775827943, "reportedSentences": 2833, "validatedSentences": 43294, "unvalidatedSentences": 89, "clips": 187989, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.37, "thirties": 0.18, "teens": 0.02, "fourties": 0.13, "fifties": 0.03, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male_masculine": 0.68, "female_feminine": 0.06, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 187667, "agriculture_food": 4, "automotive_transport": 9, "finance": 6, "service_retail": 6, "general": 242, "healthcare": 8, "history_law_government": 20, "language_fundamentals": 3, "media_entertainment": 21, "nature_environment": 8, "news_current_affairs": 0, "technology_robotics": 25 } }, "users": 3559, "size": 4881606161, "checksum": "03ff231421a520bf1546b2a7e9cf5a806816273c686db751f258381d1ea4af25", "avgDurationSecs": 4.127, "validDurationSecs": 638494.235, "totalHrs": 215.5, "validHrs": 177.35 }, "quy": { "buckets": { "dev": 0, "test": 0, "train": 1, "validated": 1, "invalidated": 1, "other": 4 }, "duration": 36792, "reportedSentences": 0, "validatedSentences": 751, "unvalidatedSentences": 289, "clips": 6, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6, "agriculture_food": 0, "automotive_transport": 
0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 2, "size": 281322, "checksum": "aa51c0a003ec1f16d7062dc526f0165ae95cdd16f60a7d2996e1f066377f39ab", "avgDurationSecs": 6.132, "validDurationSecs": 6.132, "totalHrs": 0.01, "validHrs": 0 }, "rm-sursilv": { "buckets": { "dev": 1365, "test": 1386, "train": 1585, "validated": 4351, "invalidated": 684, "other": 2469 }, "duration": 40806869, "reportedSentences": 17, "validatedSentences": 8780, "unvalidatedSentences": 7455, "clips": 7504, "splits": { "accent": {}, "age": { "": 0.61, "twenties": 0.09, "thirties": 0.07, "teens": 0.05, "fourties": 0.18, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.65, "male_masculine": 0.17, "female_feminine": 0.18, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7504, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 91, "size": 306289124, "checksum": "1791c595bc6234238701bd08cf3bd7b5632a71bf1130f12f4f863215dcebb97c", "avgDurationSecs": 5.438, "validDurationSecs": 23660.806, "totalHrs": 11.33, "validHrs": 6.57 }, "rm-vallader": { "buckets": { "dev": 500, "test": 462, "train": 557, "validated": 1526, "invalidated": 394, "other": 711 }, "duration": 15327554, "reportedSentences": 38, "validatedSentences": 5913, "unvalidatedSentences": 1172, "clips": 2631, "splits": { "accent": {}, "age": { "": 0.36, "twenties": 0.14, "thirties": 0.06, "teens": 0, "fourties": 0.41, "fifties": 0, "sixties": 0.03, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male_masculine": 
0.44, "female_feminine": 0.19, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2631, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 53, "size": 117350635, "checksum": "7fe612ef4b735a17a2a5a4d2b1fd9d86253c2b72c60d7760599f68103b7c93f0", "avgDurationSecs": 5.826, "validDurationSecs": 8890.098, "totalHrs": 4.25, "validHrs": 2.46 }, "ro": { "buckets": { "dev": 3897, "test": 3905, "train": 5134, "validated": 17965, "invalidated": 989, "other": 23129 }, "duration": 169684735, "reportedSentences": 419, "validatedSentences": 13741, "unvalidatedSentences": 89123, "clips": 42083, "splits": { "accent": {}, "age": { "": 0.15, "twenties": 0.61, "thirties": 0.13, "teens": 0.01, "fourties": 0.07, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.15, "male_masculine": 0.71, "female_feminine": 0.13, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 42083, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 435, "size": 1059027606, "checksum": "296f203873935921a145e03ee2de3f239c06a5b7dce0c00bf91795c0cc048e63", "avgDurationSecs": 4.032, "validDurationSecs": 72437.475, "totalHrs": 47.13, "validHrs": 20.12 }, "ru": { "buckets": { "dev": 10212, "test": 10215, "train": 26459, "validated": 167055, "invalidated": 10237, "other": 16095 }, "duration": 1000001952, "reportedSentences": 517, "validatedSentences": 47231, "unvalidatedSentences": 706, "clips": 193387, "splits": { "accent": {}, "age": { "": 
0.24, "twenties": 0.33, "thirties": 0.16, "teens": 0.1, "fourties": 0.14, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.24, "male_masculine": 0.6, "female_feminine": 0.15, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 193380, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 3, "general": 4, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 3296, "size": 6752316969, "checksum": "33200a55464e6c177c2928b9b4c252794cd787067955cc057cfe86daad206fdd", "avgDurationSecs": 5.171, "validDurationSecs": 863839.483, "totalHrs": 277.77, "validHrs": 239.95 }, "rw": { "buckets": { "dev": 15987, "test": 16213, "train": 1003030, "validated": 1438915, "invalidated": 227816, "other": 47288 }, "duration": 8582936477, "reportedSentences": 630, "validatedSentences": 1404853, "unvalidatedSentences": 7, "clips": 1714019, "splits": { "accent": {}, "age": { "": 0.05, "twenties": 0.61, "thirties": 0.12, "teens": 0.2, "fourties": 0.02, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.1, "male_masculine": 0.57, "female_feminine": 0.33, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1714019, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1145, "size": 61196339470, "checksum": "761d64abc17a83d74be5c2f70eb3f0e01897e9c280fab52823af206d541d7f21", "avgDurationSecs": 5.007, "validDurationSecs": 7205355.39, "totalHrs": 2384.14, "validHrs": 2001.48 }, "sah": { "buckets": { "dev": 1397, "test": 1472, "train": 1876, 
"validated": 4904, "invalidated": 152, "other": 2481 }, "duration": 46254288, "reportedSentences": 11, "validatedSentences": 5989, "unvalidatedSentences": 468, "clips": 7537, "splits": { "accent": {}, "age": { "": 0.36, "twenties": 0.25, "thirties": 0.24, "teens": 0.06, "fourties": 0.06, "fifties": 0.04, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.36, "male_masculine": 0.38, "female_feminine": 0.26, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7537, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 117, "size": 314035816, "checksum": "2a4f50ed7f7f898bb0c84d0c0b631ab1498743d3eebf51c1b9f8043cf839991a", "avgDurationSecs": 6.137, "validDurationSecs": 30095.665, "totalHrs": 12.84, "validHrs": 8.35 }, "sat": { "buckets": { "dev": 0, "test": 147, "train": 315, "validated": 462, "invalidated": 10, "other": 352 }, "duration": 3676320, "reportedSentences": 7, "validatedSentences": 5407, "unvalidatedSentences": 547, "clips": 824, "splits": { "accent": {}, "age": { "": 0.43, "twenties": 0.43, "thirties": 0.1, "teens": 0.01, "fourties": 0.01, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.41, "male_masculine": 0.58, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 824, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 13, "size": 20610407, "checksum": "cd2ddd84b5464adb558b463732923c220be471a3a8c76a37fc6885c3f997f2d3", 
"avgDurationSecs": 4.462, "validDurationSecs": 2061.238, "totalHrs": 1.02, "validHrs": 0.57 }, "sc": { "buckets": { "dev": 296, "test": 414, "train": 602, "validated": 1480, "invalidated": 49, "other": 441 }, "duration": 8986248, "reportedSentences": 8, "validatedSentences": 5231, "unvalidatedSentences": 211, "clips": 1970, "splits": { "accent": {}, "age": { "": 0.6, "twenties": 0.09, "thirties": 0.29, "teens": 0, "fourties": 0.02, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.61, "male_masculine": 0.14, "female_feminine": 0.25, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1970, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 22, "size": 52199163, "checksum": "9a2d34ec3b703c40a6471d936751c2a48f15b13938d7989303b8c7db2abc51e9", "avgDurationSecs": 4.562, "validDurationSecs": 6751.09, "totalHrs": 2.49, "validHrs": 1.87 }, "sd": { "buckets": { "dev": 0, "test": 0, "train": 0, "validated": 0, "invalidated": 0, "other": 5 }, "duration": 36684, "reportedSentences": 0, "validatedSentences": 13273, "unvalidatedSentences": 21, "clips": 5, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1, 
"size": 1058274, "checksum": "971473a9491c984a779d1ffe88fc6519b82d49dc94d8c87ec5debab0761a0cb0", "avgDurationSecs": 7.337, "validDurationSecs": 0, "totalHrs": 0.01, "validHrs": 0 }, "sk": { "buckets": { "dev": 2543, "test": 2627, "train": 3359, "validated": 19907, "invalidated": 874, "other": 3201 }, "duration": 97793604, "reportedSentences": 195, "validatedSentences": 9163, "unvalidatedSentences": 49, "clips": 23982, "splits": { "accent": {}, "age": { "": 0.43, "twenties": 0.08, "thirties": 0.21, "teens": 0.09, "fourties": 0.18, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.44, "male_masculine": 0.49, "female_feminine": 0.07, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 23982, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 245, "size": 552919820, "checksum": "127e91f36eef029e0570d258ce50d0fe32909f1aff2f870d6d28a88b88d70c17", "avgDurationSecs": 4.078, "validDurationSecs": 81176.602, "totalHrs": 27.16, "validHrs": 22.54 }, "skr": { "buckets": { "dev": 1138, "test": 1028, "train": 1538, "validated": 3716, "invalidated": 463, "other": 1548 }, "duration": 23728140, "reportedSentences": 17, "validatedSentences": 5994, "unvalidatedSentences": 104, "clips": 5727, "splits": { "accent": {}, "age": { "": 0.36, "twenties": 0.25, "thirties": 0.01, "teens": 0, "fourties": 0, "fifties": 0.38, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.36, "male_masculine": 0.64, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5727, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, 
"history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 57, "size": 136546963, "checksum": "6b5e1adbff47bd27520601c71f351968fe5cd13f3928ecaf9299db39729ebe5d", "avgDurationSecs": 4.143, "validDurationSecs": 15396.153, "totalHrs": 6.59, "validHrs": 4.27 }, "sl": { "buckets": { "dev": 1275, "test": 1272, "train": 1396, "validated": 11801, "invalidated": 296, "other": 2924 }, "duration": 57320794, "reportedSentences": 51, "validatedSentences": 4149, "unvalidatedSentences": 48, "clips": 15021, "splits": { "accent": {}, "age": { "": 0.19, "twenties": 0.4, "thirties": 0.04, "teens": 0.07, "fourties": 0.19, "fifties": 0.06, "sixties": 0.06, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.21, "male_masculine": 0.67, "female_feminine": 0.12, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 15021, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 248, "size": 390556393, "checksum": "ca35dc8cdf3035943ba18e56dd8592f6b4d4355e76268706d8eee52486e1b704", "avgDurationSecs": 3.816, "validDurationSecs": 45033.133, "totalHrs": 15.92, "validHrs": 12.5 }, "sq": { "buckets": { "dev": 1645, "test": 1782, "train": 2658, "validated": 6095, "invalidated": 145, "other": 0 }, "duration": 31738500, "reportedSentences": 132, "validatedSentences": 50001, "unvalidatedSentences": 2620, "clips": 6240, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.08, "thirties": 0.16, "teens": 0, "fourties": 0.17, "fifties": 0.36, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male_masculine": 0.49, "female_feminine": 0.15, "transgender": 0, "non-binary": 0, 
"do_not_wish_to_say": 0 }, "sentence_domain": { "": 1575, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 1777, "healthcare": 0, "history_law_government": 2870, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 18, "technology_robotics": 0 } }, "users": 129, "size": 189292127, "checksum": "5b427434430222a16741825ba65591867dd55300e4212a1940b13ea02d0c7aab", "avgDurationSecs": 5.086, "validDurationSecs": 31000.987, "totalHrs": 8.81, "validHrs": 8.61 }, "sr": { "buckets": { "dev": 1910, "test": 1900, "train": 2352, "validated": 8141, "invalidated": 424, "other": 4860 }, "duration": 43365384, "reportedSentences": 78, "validatedSentences": 7874, "unvalidatedSentences": 776, "clips": 13425, "splits": { "accent": {}, "age": { "": 0.14, "twenties": 0.25, "thirties": 0.11, "teens": 0, "fourties": 0.1, "fifties": 0.13, "sixties": 0.26, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.53, "male_masculine": 0.32, "female_feminine": 0.15, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 13387, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 12, "healthcare": 0, "history_law_government": 1, "language_fundamentals": 1, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 25 } }, "users": 171, "size": 243188430, "checksum": "8ed0b26b008b4ff7f73e87810f010412d6379cae4a54038dce27ff8a4fe069d0", "avgDurationSecs": 3.23, "validDurationSecs": 26297.027, "totalHrs": 12.04, "validHrs": 7.3 }, "sv-SE": { "buckets": { "dev": 5299, "test": 5324, "train": 7854, "validated": 41304, "invalidated": 1438, "other": 6435 }, "duration": 197251179, "reportedSentences": 592, "validatedSentences": 25877, "unvalidatedSentences": 7329, "clips": 49177, "splits": { "accent": {}, "age": { "": 0.19, "twenties": 0.12, "thirties": 0.25, "teens": 0.03, "fourties": 
0.38, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2, "male_masculine": 0.48, "female_feminine": 0.33, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 49177, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 856, "size": 1248274759, "checksum": "b38b317cc2060beecbdfa13e4b13c997f7d209ffad37bb6695b3c000005cbb12", "avgDurationSecs": 4.011, "validDurationSecs": 165672.219, "totalHrs": 54.79, "validHrs": 46.02 }, "sw": { "buckets": { "dev": 12248, "test": 12254, "train": 46511, "validated": 267065, "invalidated": 80628, "other": 377368 }, "duration": 3905516736, "reportedSentences": 1901, "validatedSentences": 134682, "unvalidatedSentences": 5801, "clips": 725061, "splits": { "accent": {}, "age": { "": 0.31, "twenties": 0.5, "thirties": 0.11, "teens": 0.01, "fourties": 0.03, "fifties": 0.04, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.29, "male_masculine": 0.34, "female_feminine": 0.37, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 725061, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1459, "size": 22795319111, "checksum": "c3c790200721824d886ccec10878138589271d0674894cb66e1dd96ef74526ec", "avgDurationSecs": 5.386, "validDurationSecs": 1438536.657, "totalHrs": 1084.86, "validHrs": 399.59 }, "ta": { "buckets": { "dev": 12111, "test": 12189, "train": 46068, "validated": 136081, "invalidated": 5705, "other": 105054 }, "duration": 
1530036024, "reportedSentences": 3437, "validatedSentences": 118205, "unvalidatedSentences": 629, "clips": 246840, "splits": { "accent": {}, "age": { "": 0.66, "twenties": 0.11, "thirties": 0.12, "teens": 0.04, "fourties": 0.03, "fifties": 0.02, "sixties": 0, "seventies": 0.02, "eighties": 0, "nineties": 0 }, "gender": { "": 0.66, "male_masculine": 0.19, "female_feminine": 0.15, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 246840, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 939, "size": 9172195398, "checksum": "6ffb1c85ba2f8c2fa6570933c93adf4fe0a16f80caeb2ea29829c2819641eb15", "avgDurationSecs": 6.198, "validDurationSecs": 843497.132, "totalHrs": 425.01, "validHrs": 234.3 }, "te": { "buckets": { "dev": 54, "test": 53, "train": 64, "validated": 260, "invalidated": 34, "other": 1955 }, "duration": 9375840, "reportedSentences": 15, "validatedSentences": 322, "unvalidatedSentences": 19298, "clips": 2249, "splits": { "accent": {}, "age": { "": 0.12, "twenties": 0.7, "thirties": 0.01, "teens": 0.07, "fourties": 0.01, "fifties": 0, "sixties": 0.08, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.13, "male_masculine": 0.7, "female_feminine": 0.18, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2249, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 46, "size": 56177577, "checksum": "7ae183fc43049daee6960ede819721824c362d202fe01742a798edcaf256ab39", "avgDurationSecs": 4.169, "validDurationSecs": 
1083.912, "totalHrs": 2.6, "validHrs": 0.3 }, "th": { "buckets": { "dev": 11046, "test": 11046, "train": 32855, "validated": 147484, "invalidated": 9284, "other": 207551 }, "duration": 1526810988, "reportedSentences": 4289, "validatedSentences": 64265, "unvalidatedSentences": 121871, "clips": 364319, "splits": { "accent": {}, "age": { "": 0.4, "twenties": 0.24, "thirties": 0.07, "teens": 0.04, "fourties": 0.04, "fifties": 0.2, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.41, "male_masculine": 0.42, "female_feminine": 0.17, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 364319, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 7909, "size": 8908312109, "checksum": "f116818d3310ea8eb531f4c6b45a3e968a684aaf7c8623b1ddf522ead6c57973", "avgDurationSecs": 4.191, "validDurationSecs": 618085.227, "totalHrs": 424.11, "validHrs": 171.69 }, "ti": { "buckets": { "dev": 5, "test": 7, "train": 14, "validated": 26, "invalidated": 1, "other": 268 }, "duration": 1474344, "reportedSentences": 3, "validatedSentences": 4532, "unvalidatedSentences": 138, "clips": 295, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0, "thirties": 0.03, "teens": 0, "fourties": 0.74, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.95, "male_masculine": 0.05, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 295, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } 
}, "users": 10, "size": 8940950, "checksum": "2a0d1a0121ee9a54dd21a7389889ba744cad82bbb8ed3b7ec07553baa964a172", "avgDurationSecs": 4.998, "validDurationSecs": 129.942, "totalHrs": 0.4, "validHrs": 0.03 }, "tig": { "buckets": { "dev": 104, "test": 185, "train": 369, "validated": 658, "invalidated": 16, "other": 956 }, "duration": 10043568, "reportedSentences": 2, "validatedSentences": 5382, "unvalidatedSentences": 20, "clips": 1630, "splits": { "accent": {}, "age": { "": 0.98, "twenties": 0.01, "thirties": 0.01, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.98, "male_masculine": 0.02, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1630, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 29, "size": 51991047, "checksum": "53323d4922aaf497fedb086d259baa5c9efbea6b16a52f94fa0ba00fe5f13d86", "avgDurationSecs": 6.162, "validDurationSecs": 4054.397, "totalHrs": 2.78, "validHrs": 1.12 }, "tk": { "buckets": { "dev": 431, "test": 501, "train": 760, "validated": 1864, "invalidated": 128, "other": 2397 }, "duration": 23740200, "reportedSentences": 27, "validatedSentences": 2456, "unvalidatedSentences": 236, "clips": 4389, "splits": { "accent": {}, "age": { "": 0.35, "twenties": 0.56, "thirties": 0.09, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.35, "male_masculine": 0.47, "female_feminine": 0.19, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 4389, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, 
"language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 114, "size": 133583470, "checksum": "cbfebd05c16c6c1e1fbe89e064a16eb6ccd8e60ee6201eb2e80f41a272fbcc3d", "avgDurationSecs": 5.409, "validDurationSecs": 10082.418, "totalHrs": 6.59, "validHrs": 2.8 }, "tn": { "buckets": { "dev": 0, "test": 0, "train": 4, "validated": 4, "invalidated": 1, "other": 1798 }, "duration": 7811964, "reportedSentences": 1, "validatedSentences": 1324, "unvalidatedSentences": 15769, "clips": 1803, "splits": { "accent": {}, "age": { "": 0.01, "twenties": 0, "thirties": 0.35, "teens": 0, "fourties": 0, "fifties": 0.64, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.65, "male_masculine": 0, "female_feminine": 0.35, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1803, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 4, "size": 40926570, "checksum": "f7a41f00c63807485e14b46847a0e32db21022b00cd949b3f35a86e4d3f593be", "avgDurationSecs": 4.333, "validDurationSecs": 17.331, "totalHrs": 2.16, "validHrs": 0 }, "tok": { "buckets": { "dev": 2184, "test": 2174, "train": 2570, "validated": 11983, "invalidated": 345, "other": 4384 }, "duration": 69712560, "reportedSentences": 145, "validatedSentences": 7257, "unvalidatedSentences": 2458, "clips": 16712, "splits": { "accent": {}, "age": { "": 0.32, "twenties": 0.2, "thirties": 0.07, "teens": 0.25, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0.17, "nineties": 0 }, "gender": { "": 0.44, "male_masculine": 0.51, "female_feminine": 0.06, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 16712, 
"agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 158, "size": 408148148, "checksum": "4c44f6171fe38cb9c4e6ed10a98ff20626be077cfb7e40f735f855a5daab09f3", "avgDurationSecs": 4.171, "validDurationSecs": 49985.975, "totalHrs": 19.36, "validHrs": 13.88 }, "tr": { "buckets": { "dev": 11566, "test": 11613, "train": 38508, "validated": 118077, "invalidated": 4710, "other": 116 }, "duration": 465363783, "reportedSentences": 427, "validatedSentences": 409976, "unvalidatedSentences": 3387, "clips": 122903, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.23, "thirties": 0.09, "teens": 0.02, "fourties": 0.03, "fifties": 0.07, "sixties": 0.2, "seventies": 0.03, "eighties": 0.1, "nineties": 0 }, "gender": { "": 0.25, "male_masculine": 0.42, "female_feminine": 0.32, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 122901, "agriculture_food": 0, "automotive_transport": 1, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 1, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1644, "size": 2843118717, "checksum": "7863c59321bc14ce7a1ec51f60677c4885f70b668aa1216b4521ec241d4532c0", "avgDurationSecs": 3.786, "validDurationSecs": 447090.465, "totalHrs": 129.26, "validHrs": 124.19 }, "ts": { "buckets": { "dev": 0, "test": 0, "train": 0, "validated": 0, "invalidated": 0, "other": 1 }, "duration": 8928, "reportedSentences": 0, "validatedSentences": 352, "unvalidatedSentences": 16012, "clips": 1, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, 
"gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1, "size": 1040832, "checksum": "266744d63dfb4beff1ad4cde549c7ca71eaf38b2017ad7103259c19083faede3", "avgDurationSecs": 8.928, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "tt": { "buckets": { "dev": 3693, "test": 4973, "train": 9304, "validated": 29180, "invalidated": 427, "other": 324 }, "duration": 113269274, "reportedSentences": 6, "validatedSentences": 18031, "unvalidatedSentences": 51, "clips": 29931, "splits": { "accent": {}, "age": { "": 0.2, "twenties": 0.05, "thirties": 0.72, "teens": 0, "fourties": 0, "fifties": 0.01, "sixties": 0, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2, "male_masculine": 0.78, "female_feminine": 0.02, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 29931, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 265, "size": 833136092, "checksum": "20ec24eee4576b696f1fb2aba624efa0bbb871599110e83763d85ce934559f0e", "avgDurationSecs": 3.784, "validDurationSecs": 110427.23, "totalHrs": 31.46, "validHrs": 30.67 }, "tw": { "buckets": { "dev": 0, "test": 12, "train": 178, "validated": 190, "invalidated": 3, "other": 79 }, "duration": 1172304, "reportedSentences": 3, "validatedSentences": 49159, "unvalidatedSentences": 18, "clips": 272, "splits": { "accent": {}, "age": { "": 0.19, "twenties": 0.02, 
"thirties": 0.79, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.19, "male_masculine": 0.81, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 272, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 11, "size": 9648767, "checksum": "de765b0863bc07cd616c1648ec886ad8fd2036a553df5aefa3592ae0af664168", "avgDurationSecs": 4.31, "validDurationSecs": 818.889, "totalHrs": 0.32, "validHrs": 0.22 }, "ug": { "buckets": { "dev": 12878, "test": 12878, "train": 56970, "validated": 198176, "invalidated": 10389, "other": 34085 }, "duration": 1376154288, "reportedSentences": 1349, "validatedSentences": 172116, "unvalidatedSentences": 725, "clips": 242650, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0.29, "thirties": 0.3, "teens": 0.05, "fourties": 0.11, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.48, "male_masculine": 0.31, "female_feminine": 0.2, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.01 }, "sentence_domain": { "": 240479, "agriculture_food": 1, "automotive_transport": 1, "finance": 1, "service_retail": 0, "general": 2158, "healthcare": 22, "history_law_government": 0, "language_fundamentals": 5, "media_entertainment": 11, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 3 } }, "users": 1413, "size": 7828564315, "checksum": "cd347627e036c363a61aa46323428c15a353577a55157c31c70d4f5ad79f9553", "avgDurationSecs": 5.671, "validDurationSecs": 1123926.446, "totalHrs": 382.26, "validHrs": 312.2 }, "uk": { "buckets": { "dev": 10099, "test": 10103, "train": 25722, "validated": 76261, "invalidated": 3252, "other": 
8434 }, "duration": 408323868, "reportedSentences": 652, "validatedSentences": 212602, "unvalidatedSentences": 9, "clips": 87947, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0.33, "thirties": 0.25, "teens": 0.09, "fourties": 0.1, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.24, "male_masculine": 0.59, "female_feminine": 0.17, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 87947, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1104, "size": 2687472082, "checksum": "8c12ea71f168323d0757789e1aecfd59979491221026042ffb5e8fee059c0054", "avgDurationSecs": 4.643, "validDurationSecs": 354067.637, "totalHrs": 113.42, "validHrs": 98.35 }, "ur": { "buckets": { "dev": 5014, "test": 5015, "train": 7183, "validated": 66973, "invalidated": 10429, "other": 174407 }, "duration": 1081428660, "reportedSentences": 325, "validatedSentences": 21451, "unvalidatedSentences": 48294, "clips": 251809, "splits": { "accent": {}, "age": { "": 0.06, "twenties": 0.92, "thirties": 0, "teens": 0, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.22, "male_masculine": 0.53, "female_feminine": 0.25, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 251703, "agriculture_food": 12, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 48, "healthcare": 46, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 454, "size": 6314052917, "checksum": "2ff5ba74d57078a78cf3222e04f1922a0042ee71c0344a3862f8d17d70c41eff", "avgDurationSecs": 
4.295, "validDurationSecs": 287624.833, "totalHrs": 300.39, "validHrs": 79.89 }, "uz": { "buckets": { "dev": 12137, "test": 12369, "train": 48546, "validated": 86600, "invalidated": 13985, "other": 128398 }, "duration": 950887944, "reportedSentences": 1832, "validatedSentences": 148681, "unvalidatedSentences": 137915, "clips": 228983, "splits": { "accent": {}, "age": { "": 0.41, "twenties": 0.4, "thirties": 0.01, "teens": 0.18, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.41, "male_masculine": 0.44, "female_feminine": 0.15, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 228983, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 2200, "size": 5335524403, "checksum": "c75bd18e5664bc84c80bd1e78280ad7f5812eb4dd7da77ff69b7174b6ccd0f45", "avgDurationSecs": 4.153, "validDurationSecs": 359620.129, "totalHrs": 264.13, "validHrs": 99.89 }, "vi": { "buckets": { "dev": 693, "test": 1327, "train": 2239, "validated": 5300, "invalidated": 387, "other": 11843 }, "duration": 69270992, "reportedSentences": 201, "validatedSentences": 5771, "unvalidatedSentences": 5318, "clips": 17530, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.2, "thirties": 0.02, "teens": 0.22, "fourties": 0.02, "fifties": 0, "sixties": 0.29, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.27, "male_masculine": 0.54, "female_feminine": 0.19, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 17530, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, 
"nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 296, "size": 403442932, "checksum": "e1854be8cf645bf0d518eff3fa6413cd18196cd13518db6b26c2f2f85122a95e", "avgDurationSecs": 3.952, "validDurationSecs": 20943.312, "totalHrs": 19.24, "validHrs": 5.81 }, "vot": { "buckets": { "dev": 0, "test": 7, "train": 96, "validated": 103, "invalidated": 324, "other": 0 }, "duration": 1029972, "reportedSentences": 0, "validatedSentences": 5437, "unvalidatedSentences": 1, "clips": 427, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.74, "thirties": 0, "teens": 0.01, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.25, "male_masculine": 0.75, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 427, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 6, "size": 8186748, "checksum": "d9f1003cf6efa48e96b1019df91e2df3932d56b5f3936613989bfc5e30328c48", "avgDurationSecs": 2.412, "validDurationSecs": 248.448, "totalHrs": 0.28, "validHrs": 0.06 }, "xh": { "buckets": { "dev": 0, "test": 0, "train": 1, "validated": 1, "invalidated": 0, "other": 14 }, "duration": 94140, "reportedSentences": 0, "validatedSentences": 1476, "unvalidatedSentences": 14922, "clips": 15, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 15, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, 
"healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 3, "size": 1691705, "checksum": "8e455d4546b10b24860fe6633034329b1cc1fe7948e3bb7358f6676f9869eada", "avgDurationSecs": 6.276, "validDurationSecs": 6.276, "totalHrs": 0.02, "validHrs": 0 }, "yi": { "buckets": { "dev": 97, "test": 115, "train": 151, "validated": 376, "invalidated": 25, "other": 5 }, "duration": 1702152, "reportedSentences": 1, "validatedSentences": 846, "unvalidatedSentences": 655, "clips": 406, "splits": { "accent": {}, "age": { "": 0.85, "twenties": 0.15, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 406, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 36, "size": 9441231, "checksum": "92f522ee0088e70cb3cab6a2bc87791bb236bc711747f0886810916fa09d5b1a", "avgDurationSecs": 4.192, "validDurationSecs": 1576.377, "totalHrs": 0.47, "validHrs": 0.43 }, "yo": { "buckets": { "dev": 874, "test": 1014, "train": 1246, "validated": 3136, "invalidated": 230, "other": 1181 }, "duration": 27191520, "reportedSentences": 21, "validatedSentences": 5419, "unvalidatedSentences": 14, "clips": 4547, "splits": { "accent": {}, "age": { "": 0.3, "twenties": 0.63, "thirties": 0.02, "teens": 0.04, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.31, "male_masculine": 0.32, "female_feminine": 0.37, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": 
{ "": 4547, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 121, "size": 159352552, "checksum": "7c1ec812ad2cffd5855ae9bda5ca678570fd121f5f668f00cf0809acf5204322", "avgDurationSecs": 5.98, "validDurationSecs": 18753.597, "totalHrs": 7.55, "validHrs": 5.2 }, "yue": { "buckets": { "dev": 5128, "test": 5128, "train": 7413, "validated": 190596, "invalidated": 8014, "other": 33321 }, "duration": 913569084, "reportedSentences": 2121, "validatedSentences": 17780, "unvalidatedSentences": 7956, "clips": 231931, "splits": { "accent": {}, "age": { "": 0.18, "twenties": 0.32, "thirties": 0.45, "teens": 0.01, "fourties": 0.04, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2, "male_masculine": 0.21, "female_feminine": 0.59, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 230295, "agriculture_food": 9, "automotive_transport": 28, "finance": 74, "service_retail": 349, "general": 964, "healthcare": 238, "history_law_government": 37, "language_fundamentals": 0, "media_entertainment": 113, "nature_environment": 111, "news_current_affairs": 63, "technology_robotics": 56 } }, "users": 1125, "size": 5281785733, "checksum": "f19073fb2bff609f8a9f4bb4e06fc342366fe441948680b3ecc6041f9c21e2b8", "avgDurationSecs": 3.939, "validDurationSecs": 750751.789, "totalHrs": 253.76, "validHrs": 208.54 }, "zgh": { "buckets": { "dev": 290, "test": 197, "train": 700, "validated": 1187, "invalidated": 77, "other": 541 }, "duration": 6105096, "reportedSentences": 5, "validatedSentences": 2724, "unvalidatedSentences": 359, "clips": 1805, "splits": { "accent": {}, "age": { "": 0.08, "twenties": 0.29, "thirties": 0.62, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 
0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.09, "male_masculine": 0.88, "female_feminine": 0.03, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1779, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 22, "healthcare": 1, "history_law_government": 0, "language_fundamentals": 3, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 23, "size": 33710311, "checksum": "ed94c480001878d57c916c02886bd93db4f2cb441b080fd78cfdbefc5012722a", "avgDurationSecs": 3.382, "validDurationSecs": 4014.819, "totalHrs": 1.69, "validHrs": 1.11 }, "zh-CN": { "buckets": { "dev": 10628, "test": 10628, "train": 29420, "validated": 186502, "invalidated": 58692, "other": 598336 }, "duration": 3834334572, "reportedSentences": 942, "validatedSentences": 58952, "unvalidatedSentences": 95, "clips": 843530, "splits": { "accent": {}, "age": { "": 0.93, "twenties": 0.05, "thirties": 0.01, "teens": 0.01, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.93, "male_masculine": 0.06, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 843227, "agriculture_food": 2, "automotive_transport": 10, "finance": 4, "service_retail": 13, "general": 143, "healthcare": 12, "history_law_government": 63, "language_fundamentals": 2, "media_entertainment": 68, "nature_environment": 3, "news_current_affairs": 6, "technology_robotics": 20 } }, "users": 7163, "size": 22704122823, "checksum": "775ed324eae93f72590e99bee0b45e5c1a923f102c24645a8f2e11ec070e81d5", "avgDurationSecs": 4.546, "validDurationSecs": 847760.087, "totalHrs": 1065.09, "validHrs": 235.48 }, "zh-HK": { "buckets": { "dev": 5598, "test": 5598, "train": 8439, "validated": 93403, "invalidated": 4484, "other": 23013 }, "duration": 504021552, "reportedSentences": 814, "validatedSentences": 
20021, "unvalidatedSentences": 20, "clips": 120900, "splits": { "accent": {}, "age": { "": 0.39, "twenties": 0.33, "thirties": 0.11, "teens": 0.02, "fourties": 0.12, "fifties": 0.02, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.36, "male_masculine": 0.41, "female_feminine": 0.22, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 120553, "agriculture_food": 0, "automotive_transport": 11, "finance": 14, "service_retail": 88, "general": 217, "healthcare": 27, "history_law_government": 17, "language_fundamentals": 29, "media_entertainment": 15, "nature_environment": 17, "news_current_affairs": 8, "technology_robotics": 7 } }, "users": 3053, "size": 3597926431, "checksum": "5a79b07d1f8310188483e43f96e3577b2264fd54ea18d519ba7d1bfed6308db4", "avgDurationSecs": 4.169, "validDurationSecs": 389388.958, "totalHrs": 140, "validHrs": 108.16 }, "zh-TW": { "buckets": { "dev": 5013, "test": 5013, "train": 7178, "validated": 83118, "invalidated": 4818, "other": 48865 }, "duration": 460151590, "reportedSentences": 164, "validatedSentences": 19479, "unvalidatedSentences": 29, "clips": 136801, "splits": { "accent": {}, "age": { "": 0.27, "twenties": 0.3, "thirties": 0.19, "teens": 0.06, "fourties": 0.09, "fifties": 0.09, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.28, "male_masculine": 0.49, "female_feminine": 0.22, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 136745, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 3, "general": 36, "healthcare": 0, "history_law_government": 5, "language_fundamentals": 2, "media_entertainment": 11, "nature_environment": 3, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 2223, "size": 3080834112, "checksum": "b6729c36ad10f1d2d004b48523e932b9ef68f4afa441fc05bad9d2f80c9c34fe", "avgDurationSecs": 3.364, "validDurationSecs": 279580.411, "totalHrs": 
127.81, "validHrs": 77.66 }, "zu": { "buckets": { "dev": 0, "test": 0, "train": 7, "validated": 7, "invalidated": 0, "other": 34 }, "duration": 220896, "reportedSentences": 0, "validatedSentences": 1320, "unvalidatedSentences": 19379, "clips": 41, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 41, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 3, "size": 2730877, "checksum": "2a08e35cca0090a0fbf4c18d8f24c51b8735660a5c6e6548e45dea2a75cadfd8", "avgDurationSecs": 5.388, "validDurationSecs": 37.714, "totalHrs": 0.06, "validHrs": 0.01 }, "zza": { "buckets": { "dev": 185, "test": 118, "train": 463, "validated": 766, "invalidated": 112, "other": 334 }, "duration": 5214708, "reportedSentences": 2043, "validatedSentences": 5593, "unvalidatedSentences": 957, "clips": 1212, "splits": { "accent": {}, "age": { "": 0.21, "twenties": 0, "thirties": 0.56, "teens": 0, "fourties": 0, "fifties": 0.02, "sixties": 0.22, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.49, "male_masculine": 0.48, "female_feminine": 0.03, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 596, "agriculture_food": 2, "automotive_transport": 1, "finance": 1, "service_retail": 5, "general": 566, "healthcare": 13, "history_law_government": 0, "language_fundamentals": 22, "media_entertainment": 2, "nature_environment": 2, "news_current_affairs": 0, "technology_robotics": 2 } }, "users": 11, "size": 31050851, "checksum": 
"4379fd88b639f55fe7c0decace26c9e5e1e3064db712e913b727e0c73f93d418", "avgDurationSecs": 4.303, "validDurationSecs": 3295.764, "totalHrs": 1.44, "validHrs": 0.91 } }, "totalDuration": 117305242375, "totalValidDurationSecs": 77735868, "totalHrs": 32584, "totalValidHrs": 21593 } ================================================ FILE: datasets/scripted-speech/cv-corpus-19.0-delta-2024-09-13.json ================================================ { "locales": { "ab": { "duration": 21893832, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 3808, "users": 7, "size": 122644380, "checksum": "83103a47fd2fa6309ba398f9ae3fbaa7c5ded32e22c0b38e19f947fec1be2049", "avgDurationSecs": 5.17, "validDurationSecs": 1783.01, "totalHrs": 6.08, "validHrs": 0.49 }, "af": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -944, "checksum": "ee6cd907a2272386e72c250a5a0bb5d76b9b73fe144384178264e91761f35c8a", "avgDurationSecs": 5.616, "validDurationSecs": 22.46, "totalHrs": 0, "validHrs": 0.01 }, "am": { "duration": 79848, "reportedSentences": 0, "validatedSentences": -3, "unvalidatedSentences": -11, "clips": 10, "users": 2, "size": 430484, "checksum": "8dc2d5ba1a1174401b4193b04a6a649fce8205daa1bb1adaca803d3d0219b464", "avgDurationSecs": 6.222, "validDurationSecs": 141.29, "totalHrs": 0.02, "validHrs": 0.04 }, "ar": { "duration": 1118772, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": -2, "clips": 227, "users": 10, "size": 6474285, "checksum": "9647389b84a2b999158c463321a997c3b9852211d53eabdc11fa1ead0b895cea", "avgDurationSecs": 4.16, "validDurationSecs": 812.53, "totalHrs": 0.32, "validHrs": 0.23 }, "as": { "duration": 12414564, "reportedSentences": 172, "validatedSentences": 957, "unvalidatedSentences": -466, "clips": 2026, "users": 4, "size": 73281477, "checksum": "fccbeb34f997c54cf4224b078377e270c0c8997353e46b8aa2bcde35eff47065", "avgDurationSecs": 5.907, 
"validDurationSecs": 335.95, "totalHrs": 3.45, "validHrs": 0.1 }, "ast": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": -1, "clips": 0, "users": 0, "size": 69, "checksum": "ca8d749d5b772d76f63d31a5e748c7713a3e21c3a715f503f6e56efd424e7a0b", "avgDurationSecs": 4.434, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "az": { "duration": 56124, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 1, "clips": 10, "users": 2, "size": 332024, "checksum": "f7fd36de6e305ca83a7a8b1334d13709ec377ddaecaccb670806b4f9acbb1566", "avgDurationSecs": 5.435, "validDurationSecs": 82.02, "totalHrs": 0.01, "validHrs": 0.02 }, "ba": { "duration": 5040, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": -2, "clips": 1, "users": 1, "size": -134274, "checksum": "08a33cbb933b524e38807a821109c0c5f7d4166277ff781489b6d897a508d801", "avgDurationSecs": 4.426, "validDurationSecs": 80.27, "totalHrs": 0, "validHrs": 0.02 }, "bas": { "duration": 516456, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 121, "users": 4, "size": 3014509, "checksum": "1f144860a7965c7efac261f4b6a5d23f208e72c22c4638f70af91d86efb376b1", "avgDurationSecs": 4.42, "validDurationSecs": 136.17, "totalHrs": 0.14, "validHrs": 0.04 }, "be": { "duration": 112495284, "reportedSentences": 5, "validatedSentences": 3, "unvalidatedSentences": 1, "clips": 19776, "users": 44, "size": 636829868, "checksum": "25ad9959b67aa92bdc773c6d21af975f9b52528ee96b9e2654725555a65ed8b7", "avgDurationSecs": 4.778, "validDurationSecs": 68847.5, "totalHrs": 31.25, "validHrs": 19.12 }, "bg": { "duration": 206928, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 35, "users": 4, "size": 1213954, "checksum": "0e27fa2838331c27b5a5aeb1db155af655f63135a39742da1e92f6e8e1535b48", "avgDurationSecs": 5.472, "validDurationSecs": 619.75, "totalHrs": 0.05, "validHrs": 0.17 }, "bn": { "duration": 1028304, 
"reportedSentences": 1, "validatedSentences": 11, "unvalidatedSentences": -19, "clips": 174, "users": 5, "size": 5416196, "checksum": "93f5537c1cce3a80baf97251d518225091a0b49d4f68540f906066618f75ab85", "avgDurationSecs": 4.367, "validDurationSecs": 159.78, "totalHrs": 0.29, "validHrs": 0.04 }, "br": { "duration": 1831212, "reportedSentences": 7, "validatedSentences": 156, "unvalidatedSentences": 103, "clips": 497, "users": 0, "size": 10638498, "checksum": "994f854aaa691a0d6b6f408c27f3f5c140e411c8a11d936354fd7ccc91c72105", "avgDurationSecs": 3.186, "validDurationSecs": 939.15, "totalHrs": 0.51, "validHrs": 0.26 }, "ca": { "duration": 174334104, "reportedSentences": 119, "validatedSentences": 223, "unvalidatedSentences": 150, "clips": 46810, "users": 104, "size": 983704491, "checksum": "c1e22ca2739346039c414d548382141fbdb680cc2cc22860a5a5f75c667fa547", "avgDurationSecs": 5.272, "validDurationSecs": 912581.34, "totalHrs": 48.43, "validHrs": 253.5 }, "ckb": { "duration": 5343876, "reportedSentences": 22, "validatedSentences": 17, "unvalidatedSentences": -78, "clips": 1039, "users": 25, "size": 30597932, "checksum": "054f41a5df99e9b8f08d642d0bbe2719d1dff7b3a6a548a7de2fca1223e5e7df", "avgDurationSecs": 4.078, "validDurationSecs": 2147.15, "totalHrs": 1.48, "validHrs": 0.6 }, "cnh": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -16782, "checksum": "f2099a17273e9a66684c05d968381a5cb83fdd6d2df72c07e797e881ade45d98", "avgDurationSecs": 3.516, "validDurationSecs": 7.03, "totalHrs": 0, "validHrs": 0 }, "cs": { "duration": 1898064, "reportedSentences": 12, "validatedSentences": -37, "unvalidatedSentences": 17, "clips": 484, "users": 12, "size": 10872950, "checksum": "0f06fbcd98c507e6f3d47645f419a6700388d2e86efe773af85f7f3830e6e832", "avgDurationSecs": 4.463, "validDurationSecs": 1575.29, "totalHrs": 0.53, "validHrs": 0.44 }, "cv": { "duration": 3449700, "reportedSentences": 0, "validatedSentences": 
2, "unvalidatedSentences": 2, "clips": 748, "users": 3, "size": 20244144, "checksum": "618499b806bb710003c645a9d00d922153adbc5932d966951a9767eb48d92bb3", "avgDurationSecs": 5.043, "validDurationSecs": 158.31, "totalHrs": 0.96, "validHrs": 0.05 }, "cy": { "duration": 1556640, "reportedSentences": 4, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 266, "users": 5, "size": 8866241, "checksum": "6943b7d5edd960895931beaf2354c2e8d05be4ab3cf7b6518a2ebbd33009bd77", "avgDurationSecs": 4.912, "validDurationSecs": 864.07, "totalHrs": 0.43, "validHrs": 0.24 }, "da": { "duration": 82980, "reportedSentences": 0, "validatedSentences": 23, "unvalidatedSentences": -2, "clips": 30, "users": 4, "size": 380029, "checksum": "10e28bc5831bd2cec70ecfd43cd1f2d078456a7b7e6ad58aa31e3714b837a443", "avgDurationSecs": 4.105, "validDurationSecs": 660.55, "totalHrs": 0.02, "validHrs": 0.19 }, "dav": { "duration": 94277232, "reportedSentences": 0, "validatedSentences": 6308, "unvalidatedSentences": -2841, "clips": 23505, "users": 5, "size": 498500713, "checksum": "176a747610235c249c0c97898e30fdd8decce8b8b763e658769da7300e98d32a", "avgDurationSecs": 4.058, "validDurationSecs": 24345.89, "totalHrs": 26.19, "validHrs": 6.76 }, "de": { "duration": 19716984, "reportedSentences": 65, "validatedSentences": 289, "unvalidatedSentences": -58, "clips": 3520, "users": 79, "size": 113199695, "checksum": "366c90d5a634c6406b8b78f54833ec8ed8618d5c7bc114601f15f3d8584ed555", "avgDurationSecs": 5.237, "validDurationSecs": 16650.9, "totalHrs": 5.48, "validHrs": 4.63 }, "dv": { "duration": -13444380, "reportedSentences": 0, "validatedSentences": 3, "unvalidatedSentences": 0, "clips": -3127, "users": 10, "size": -78583840, "checksum": "1df473e457b82f861cd2b79223a84a738520ca2da7bc847beeb1fe247c1eb3ac", "avgDurationSecs": 5.116, "validDurationSecs": -5752.85, "totalHrs": -3.73, "validHrs": -1.6 }, "dyu": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": -8, "clips": 0, 
"users": 0, "size": -435, "checksum": "2797d268f11f342ee2e781c76502b7f6b9a86d5511501bf1800dab9188036e1d", "avgDurationSecs": 6.204, "validDurationSecs": 12.41, "totalHrs": 0, "validHrs": 0 }, "el": { "duration": 576036, "reportedSentences": 0, "validatedSentences": 77, "unvalidatedSentences": -70, "clips": 107, "users": 12, "size": 3374231, "checksum": "5257da4476db0a5d2440b193935a665b25a6156dca46cc05e947837540609b72", "avgDurationSecs": 4.149, "validDurationSecs": 688.3, "totalHrs": 0.16, "validHrs": 0.19 }, "en": { "duration": 117381780, "reportedSentences": 229, "validatedSentences": -3073, "unvalidatedSentences": -95, "clips": 18533, "users": 617, "size": 689805811, "checksum": "d7496f0d49c5db16ca46effd2775ce9820b1e360403a0915b228768be4f9bf35", "avgDurationSecs": 5.25, "validDurationSecs": 66278.13, "totalHrs": 32.61, "validHrs": 18.41 }, "eo": { "duration": 10060452, "reportedSentences": 9, "validatedSentences": 11, "unvalidatedSentences": -55, "clips": 1299, "users": 28, "size": 58254088, "checksum": "67097912176fd841589ff38dfeab6731aae4c9d604760704c6e7b28ae737453d", "avgDurationSecs": 6.07, "validDurationSecs": 4201.57, "totalHrs": 2.79, "validHrs": 1.17 }, "es": { "duration": 27238464, "reportedSentences": 7, "validatedSentences": 109, "unvalidatedSentences": -118, "clips": 5070, "users": 43, "size": 160071394, "checksum": "8cce988566e86b011de98152ec42a1f5a06580f9a5ce4b8a7f1e2987bac90906", "avgDurationSecs": 4.873, "validDurationSecs": 11779.64, "totalHrs": 7.56, "validHrs": 3.27 }, "et": { "duration": 11853324, "reportedSentences": 16, "validatedSentences": 604, "unvalidatedSentences": 887, "clips": 1836, "users": 87, "size": 69852845, "checksum": "c4ac8706f5f829d7f810ee0bf226eb6dc91fceab9b442028761cce22d0f4d4e7", "avgDurationSecs": 6.753, "validDurationSecs": 10313.27, "totalHrs": 3.29, "validHrs": 2.87 }, "eu": { "duration": 8306208, "reportedSentences": 8, "validatedSentences": -1674, "unvalidatedSentences": -683, "clips": 1475, "users": 26, "size": 
47475902, "checksum": "076cbfe342ed08fd9cc34a3e3a41018fec4bb7b0ccd21adce062211b062e0583", "avgDurationSecs": 5.495, "validDurationSecs": 26644.66, "totalHrs": 2.31, "validHrs": 7.4 }, "fa": { "duration": 3321864, "reportedSentences": 17, "validatedSentences": 177, "unvalidatedSentences": 62, "clips": 922, "users": 23, "size": 19331432, "checksum": "14719e3dc7bbdc8f00f61469228c96a8cd5c1838659841b1856f05a27a70505b", "avgDurationSecs": 3.975, "validDurationSecs": 4064.32, "totalHrs": 0.92, "validHrs": 1.12 }, "fi": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 2, "clips": 0, "users": 0, "size": -8942, "checksum": "57466ec0b4ecadfc32cea149bca6085bbe60c6730b32bce09654ecf88d845b3f", "avgDurationSecs": 4.615, "validDurationSecs": 207.69, "totalHrs": 0, "validHrs": 0.06 }, "fr": { "duration": 10562292, "reportedSentences": 22, "validatedSentences": 337, "unvalidatedSentences": -345, "clips": 2133, "users": 89, "size": 61945399, "checksum": "8910e681be476efc9eb35c6787a0c16d0cab80a4a52bdd4f85e3cab30a0c19c5", "avgDurationSecs": 5.024, "validDurationSecs": 26725.21, "totalHrs": 2.93, "validHrs": 7.43 }, "fy-NL": { "duration": 79020, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 16, "users": 3, "size": 443782, "checksum": "06f15d0a9d22556e998ffe7b11098ce762cad4f27bd3fc372fb2e0fda4dec0f6", "avgDurationSecs": 4.854, "validDurationSecs": 238.27, "totalHrs": 0.02, "validHrs": 0.06 }, "ga-IE": { "duration": 8460, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": -10, "clips": 4, "users": 0, "size": 57269, "checksum": "635216593deef246227f4fab368e2f42a3abd8f1345ed67a49f4366b3b6e648e", "avgDurationSecs": 3.592, "validDurationSecs": 233.88, "totalHrs": 0, "validHrs": 0.06 }, "gl": { "duration": 29603772, "reportedSentences": 15, "validatedSentences": 1313, "unvalidatedSentences": -1337, "clips": 5844, "users": 128, "size": 172858454, "checksum": 
"c4431211c01884b751519744fa0a4a02a264380543b1a6799a95a1024ee2e039", "avgDurationSecs": 5.202, "validDurationSecs": 8170.93, "totalHrs": 8.22, "validHrs": 2.27 }, "gn": { "duration": 980676, "reportedSentences": 0, "validatedSentences": 2, "unvalidatedSentences": -6, "clips": 188, "users": 23, "size": 5669467, "checksum": "dc1c72fc83c23cfcae8b47568970e3712dd5ce4d741e8cbcd653467a72bc25d1", "avgDurationSecs": 4.547, "validDurationSecs": 66.8, "totalHrs": 0.27, "validHrs": 0.02 }, "ha": { "duration": 161352, "reportedSentences": 0, "validatedSentences": 14, "unvalidatedSentences": -14, "clips": 30, "users": 5, "size": 882628, "checksum": "e5922e4323d11af5365a0899a6cb47f7012bc62076a22b02691d9f7f47459a4d", "avgDurationSecs": 4.34, "validDurationSecs": 179.37, "totalHrs": 0.04, "validHrs": 0.05 }, "he": { "duration": 463464, "reportedSentences": 1, "validatedSentences": -2, "unvalidatedSentences": 2, "clips": 86, "users": 10, "size": 2728453, "checksum": "3ba2789558b19dc244f0e941fa69a8789cc3c973a48a3b2c055e13787e530695", "avgDurationSecs": 4.483, "validDurationSecs": 537.22, "totalHrs": 0.13, "validHrs": 0.15 }, "hi": { "duration": 2878056, "reportedSentences": 0, "validatedSentences": 22, "unvalidatedSentences": -21, "clips": 459, "users": 10, "size": 16923794, "checksum": "e3c7477526f0e376afec842771b63412fac0be63ff6db2184280cfe48bf1bbf5", "avgDurationSecs": 4.968, "validDurationSecs": 1162.86, "totalHrs": 0.8, "validHrs": 0.33 }, "hsb": { "duration": 331020, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": -1, "clips": 35, "users": 1, "size": 1941575, "checksum": "893c7f65be18bed8e66984c85a3d545055277b8ffd9e5cae76f5df77e3ce2531", "avgDurationSecs": 6.245, "validDurationSecs": 129.92, "totalHrs": 0.09, "validHrs": 0.04 }, "ht": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -23, "checksum": "d8d01c1cb0d035cfaeeb4440f6d9c53d6f629ce942569d8c31eca54066fdf29d", 
"avgDurationSecs": 4.133, "validDurationSecs": 4.13, "totalHrs": 0, "validHrs": 0 }, "hu": { "duration": 5204628, "reportedSentences": 0, "validatedSentences": -3, "unvalidatedSentences": 7, "clips": 908, "users": 15, "size": 30535061, "checksum": "314de1b5642aaf942139d6e6c270b952f076436a9b77326fa3ada19c29d02423", "avgDurationSecs": 5.531, "validDurationSecs": 1994.7, "totalHrs": 1.45, "validHrs": 0.55 }, "hy-AM": { "duration": 4647888, "reportedSentences": 0, "validatedSentences": 14, "unvalidatedSentences": -45, "clips": 783, "users": 5, "size": 24099377, "checksum": "a5cfbbe9220657400a35962ada79304f7b83623bfb3cbffe639d9257ac71c7a8", "avgDurationSecs": 5.428, "validDurationSecs": 562.48, "totalHrs": 1.29, "validHrs": 0.16 }, "ia": { "duration": 28620, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 3, "clips": 6, "users": 2, "size": 170713, "checksum": "6a7f83cb6dc3a7a8782b642724ff90d16a25bd55d9999b285ea10b164bfd2479", "avgDurationSecs": 4.192, "validDurationSecs": 417.83, "totalHrs": 0, "validHrs": 0.11 }, "id": { "duration": 1200528, "reportedSentences": 0, "validatedSentences": 58, "unvalidatedSentences": -54, "clips": 257, "users": 18, "size": 7054504, "checksum": "a0cb0508c4c978a0d85555a919e9dd0cb618a0e95b88c22b87a996c4ad365f77", "avgDurationSecs": 3.997, "validDurationSecs": 6481.59, "totalHrs": 0.34, "validHrs": 1.8 }, "ig": { "duration": 9576, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 1, "users": 1, "size": 45607, "checksum": "8e743890338360354d3d68f1947e28533ca650f36658f2f09021ec98a0051cae", "avgDurationSecs": 5.462, "validDurationSecs": 0.01, "totalHrs": 0, "validHrs": 0 }, "is": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -4, "checksum": "2a6fe6c7f8e5ca401e27531fef46b2e0854c77d6c40c9192ff64393c4b0bcebe", "avgDurationSecs": 7.101, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "it": { "duration": 
17914968, "reportedSentences": 8, "validatedSentences": 59, "unvalidatedSentences": -59, "clips": 2993, "users": 13, "size": 105752324, "checksum": "633c5e41211a31f2fb32182bd715f6a21eb79217fb6f2923f55c45d9b10e7d90", "avgDurationSecs": 5.411, "validDurationSecs": 5147, "totalHrs": 4.98, "validHrs": 1.43 }, "ja": { "duration": 4753368, "reportedSentences": 4, "validatedSentences": -241, "unvalidatedSentences": 45, "clips": 1223, "users": 26, "size": 27806776, "checksum": "07ef4dc984e0779531ac49095ad7d49f67a5199565b5f66a1cf902992e0ebc91", "avgDurationSecs": 4.807, "validDurationSecs": 1604.99, "totalHrs": 1.32, "validHrs": 0.45 }, "ka": { "duration": 83420712, "reportedSentences": 18, "validatedSentences": -127, "unvalidatedSentences": -98, "clips": 15779, "users": 91, "size": 488144244, "checksum": "0603655df7532e8b2b4604b20f159d8c06b6ce135b6ac0a7cda736bd5bb89c6d", "avgDurationSecs": 5.196, "validDurationSecs": 12422.16, "totalHrs": 23.18, "validHrs": 3.46 }, "kab": { "duration": 7139808, "reportedSentences": 100, "validatedSentences": -4, "unvalidatedSentences": 86, "clips": 1691, "users": 18, "size": 37338176, "checksum": "bd5fd60b7fd5d7836670ed883e511b3bfd96380e42c1519870d733be0f2106a4", "avgDurationSecs": 3.341, "validDurationSecs": 6229.18, "totalHrs": 1.99, "validHrs": 1.73 }, "kk": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": -1, "clips": 0, "users": 0, "size": 3949, "checksum": "d927003484ed013062eca32f28041cc26b39d94b1d88942cf8f2aea4d74dddd1", "avgDurationSecs": 4.921, "validDurationSecs": 39.37, "totalHrs": 0, "validHrs": 0.01 }, "kln": { "duration": 69564888, "reportedSentences": 0, "validatedSentences": -3893, "unvalidatedSentences": 62, "clips": 13613, "users": 4, "size": 398141702, "checksum": "50304fe541b13425a953eaf16b73d04b4c09cc277e06042009d8c5f76799553e", "avgDurationSecs": 4.705, "validDurationSecs": 66422.06, "totalHrs": 19.33, "validHrs": 18.46 }, "kmr": { "duration": 3557520, "reportedSentences": 0, 
"validatedSentences": 11, "unvalidatedSentences": 663, "clips": 554, "users": 6, "size": 20945397, "checksum": "a027d3a4c6a21266765af001483f1bddd2c7207246912118dec0871ab6e23330", "avgDurationSecs": 4.088, "validDurationSecs": 1991.64, "totalHrs": 0.99, "validHrs": 0.55 }, "ko": { "duration": 791568, "reportedSentences": 7, "validatedSentences": 254, "unvalidatedSentences": -197, "clips": 164, "users": 7, "size": 4663668, "checksum": "a69d1b0cccf300d956243bd073ffce199d96c86abeefbff73228971266f91358", "avgDurationSecs": 5.786, "validDurationSecs": 240.95, "totalHrs": 0.22, "validHrs": 0.07 }, "ky": { "duration": 6840, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 1, "users": 1, "size": 34604, "checksum": "5fa0c164f09ce8811f72d381b92857fec4fe6dc1b11964dadb44022f58653437", "avgDurationSecs": 4.552, "validDurationSecs": 20.05, "totalHrs": 0, "validHrs": 0 }, "lg": { "duration": 38808, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": -184, "clips": 5, "users": 1, "size": 52379, "checksum": "7badf1e48c436b4770d1fd89706d8bed72b4ec751db0b48be1024b39b9d440eb", "avgDurationSecs": 5.784, "validDurationSecs": 53.99, "totalHrs": 0.02, "validHrs": 0.01 }, "lij": { "duration": 15264, "reportedSentences": 3, "validatedSentences": -1, "unvalidatedSentences": 0, "clips": 5, "users": 0, "size": 88680, "checksum": "18dc88fdaaefef43e7d51b2994c22c099dda3f065a8574abaeeb58c86e204f94", "avgDurationSecs": 3.626, "validDurationSecs": -2.48, "totalHrs": 0, "validHrs": 0 }, "lo": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 1055, "checksum": "ec3e6e4a839ac703f9c3adffc71d8cbe1d38100dbd372fa92a392362337c134b", "avgDurationSecs": 6.453, "validDurationSecs": 38.72, "totalHrs": 0, "validHrs": 0.02 }, "lt": { "duration": 11101032, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 2115, "users": 1, "size": 65427901, 
"checksum": "0fbd9936cd36af91d1f876f0ab762a5d90e8f0ba0c62d5556f4a372e25c4d7a5", "avgDurationSecs": 5.143, "validDurationSecs": 1092.03, "totalHrs": 3.09, "validHrs": 0.3 }, "ltg": { "duration": 5157612, "reportedSentences": 17, "validatedSentences": 49, "unvalidatedSentences": -45, "clips": 853, "users": 10, "size": 29904092, "checksum": "b04a1f2c06f94f17021cc7c311ef7d16c83b4c2f7215334b84d8afbad3e1d459", "avgDurationSecs": 4.565, "validDurationSecs": 2919.98, "totalHrs": 1.43, "validHrs": 0.81 }, "luo": { "duration": 29093112, "reportedSentences": 0, "validatedSentences": -3129, "unvalidatedSentences": -2959, "clips": 3848, "users": 2, "size": 171381435, "checksum": "cad460a87e5eaf687327e51eaff215a04eab7de60039f0cd9cc2049446123b9e", "avgDurationSecs": 5.205, "validDurationSecs": 7081.44, "totalHrs": 8.08, "validHrs": 1.97 }, "lv": { "duration": 9072792, "reportedSentences": 1, "validatedSentences": 2009, "unvalidatedSentences": -1932, "clips": 1386, "users": 72, "size": 51671468, "checksum": "e094b09c34700fe27f48923397748580671759e4be6bb107ffe3460df34cd9ae", "avgDurationSecs": 4.755, "validDurationSecs": 32546.33, "totalHrs": 2.52, "validHrs": 9.04 }, "mdf": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -525, "checksum": "b33a49f4c446d53d5e14c27e5b16bbbab78315917f7066629d1e37237b34f379", "avgDurationSecs": 5.25, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "mhr": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 80, "checksum": "6feae35d46588cfca4d2be72dab19518d356050392b4b9f4882fde43df16e2be", "avgDurationSecs": 4.623, "validDurationSecs": 69.35, "totalHrs": 0, "validHrs": 0.02 }, "mk": { "duration": 19332, "reportedSentences": 0, "validatedSentences": 1, "unvalidatedSentences": -2, "clips": 5, "users": 1, "size": 106010, "checksum": "a035c4d04ae2c0a31626ffe328594f68d71d84f8794a7c188d2f3cfdd56031b5", 
"avgDurationSecs": 4.328, "validDurationSecs": 46.76, "totalHrs": 0, "validHrs": 0.01 }, "ml": { "duration": 288432, "reportedSentences": 0, "validatedSentences": -32, "unvalidatedSentences": -3, "clips": 50, "users": 5, "size": 1645122, "checksum": "bf6680aea30453afb89345b49e6eeeb71f130665d4e3ccde2c36ee96919fd129", "avgDurationSecs": 4.187, "validDurationSecs": 203.39, "totalHrs": 0.08, "validHrs": 0.05 }, "mn": { "duration": 23328, "reportedSentences": 0, "validatedSentences": -1, "unvalidatedSentences": -3, "clips": 2, "users": 1, "size": 33156, "checksum": "2a04dffb81c8883b04115090cb453408f035b2ea9f3fb0f179ba708e1411d33b", "avgDurationSecs": 5.253, "validDurationSecs": 162.01, "totalHrs": 0, "validHrs": 0.05 }, "mr": { "duration": 63756, "reportedSentences": 0, "validatedSentences": 24, "unvalidatedSentences": -25, "clips": 7, "users": 2, "size": 372765, "checksum": "8f88beb23d118b0784590f8bd2c112aafc9982a527cc2a2e1ecc8fa880973518", "avgDurationSecs": 6.194, "validDurationSecs": 435.11, "totalHrs": 0.02, "validHrs": 0.12 }, "mrj": { "duration": 211608, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 34, "users": 0, "size": 1252460, "checksum": "72f06648f4c690814ac2c2e2ac573d75f3955867944ec840d30bd9e172115cb3", "avgDurationSecs": 4.191, "validDurationSecs": 63.16, "totalHrs": 0.06, "validHrs": 0.02 }, "mt": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 32, "clips": 0, "users": 0, "size": -10161, "checksum": "0296cf1cd1847997f0ff1caaa6a83bc6ed87ea83669a83d305369f63c73d333a", "avgDurationSecs": 4.754, "validDurationSecs": 28.52, "totalHrs": 0, "validHrs": 0.01 }, "myv": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 2219, "checksum": "092397b845e17dcc5f637118898ac964951e9742fc9e931698abd750c829e124", "avgDurationSecs": 5.774, "validDurationSecs": -5.77, "totalHrs": 0, "validHrs": 0 }, "nan-tw": { "duration": 
924840, "reportedSentences": 0, "validatedSentences": 75, "unvalidatedSentences": -123, "clips": 258, "users": 6, "size": 5423312, "checksum": "c1ec8b8c6076485166875958ac759ed521adff6623ece63c18ef6b0aac259af6", "avgDurationSecs": 2.628, "validDurationSecs": 896.36, "totalHrs": 0.26, "validHrs": 0.25 }, "ne-NP": { "duration": 37368, "reportedSentences": 0, "validatedSentences": 47, "unvalidatedSentences": -47, "clips": 7, "users": 2, "size": 221016, "checksum": "dba4f5cd3f4e14950bad9a77e21b06e201c647066b9bcad64dae845002d66cab", "avgDurationSecs": 4.004, "validDurationSecs": 40.91, "totalHrs": 0.01, "validHrs": 0.01 }, "nhi": { "duration": 0, "reportedSentences": 0, "validatedSentences": -22, "unvalidatedSentences": -1, "clips": 0, "users": 0, "size": -31, "checksum": "be02ae48562f59c848ce6da422669d048019c4771c8902725f9ea3b93379b832", "avgDurationSecs": 4.682, "validDurationSecs": 9.36, "totalHrs": 0, "validHrs": 0 }, "nl": { "duration": 5582952, "reportedSentences": 0, "validatedSentences": 4, "unvalidatedSentences": 0, "clips": 1219, "users": 37, "size": 32408604, "checksum": "cd09cdb595eb36b102d71f5d009a115983e374d1e650e0a212a411c33cc1a849", "avgDurationSecs": 4.362, "validDurationSecs": 6646.59, "totalHrs": 1.55, "validHrs": 1.85 }, "nn-NO": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 975, "checksum": "0a057655ae1e37607a0ae3dcaf226caa505a01b29d1960449bd1b32f61c7df71", "avgDurationSecs": 4.38, "validDurationSecs": 4.38, "totalHrs": 0, "validHrs": 0 }, "nso": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -83, "checksum": "702a5131c32f040dd7219d084c457d67f034fabf129084d9f94f99f7dd0f7a57", "avgDurationSecs": 4.63, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "oc": { "duration": 931032, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 5, "clips": 188, "users": 0, "size": 
5467955, "checksum": "8742f2e242ebe6f871bf49b571f0863934fe3dd049cc57f6001280b4aae3f610", "avgDurationSecs": 4.873, "validDurationSecs": 465.77, "totalHrs": 0.26, "validHrs": 0.13 }, "or": { "duration": 64855368, "reportedSentences": 65, "validatedSentences": 104, "unvalidatedSentences": -83, "clips": 11056, "users": 26, "size": 375166713, "checksum": "73cc1a5d98d4ba60c1964e448806c0ef7abc60536dc865d740d0520256e5b254", "avgDurationSecs": 5.484, "validDurationSecs": 4146.81, "totalHrs": 18.02, "validHrs": 1.15 }, "os": { "duration": 604800, "reportedSentences": 1, "validatedSentences": 19, "unvalidatedSentences": -20, "clips": 108, "users": 7, "size": 3570090, "checksum": "89dd008b8dbcbbec1fbf3f48e0c10e4dabea1219950d22fab0908d34cc8af8e7", "avgDurationSecs": 5.328, "validDurationSecs": 886.82, "totalHrs": 0.17, "validHrs": 0.25 }, "pa-IN": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": -1, "clips": 0, "users": 0, "size": -2941, "checksum": "87868cb9c4d0d2643c5f4c07ded7043cf35651f64d12d89d0bb87c2b6de6b051", "avgDurationSecs": 4.826, "validDurationSecs": 38.61, "totalHrs": 0, "validHrs": 0.01 }, "pl": { "duration": 11818512, "reportedSentences": 11, "validatedSentences": 308, "unvalidatedSentences": -290, "clips": 2120, "users": 16, "size": 69230044, "checksum": "006281fec6f84154f48a9fae4740ae233ba1c6a5391930be5d85d11562820689", "avgDurationSecs": 4.545, "validDurationSecs": 3098.03, "totalHrs": 3.29, "validHrs": 0.86 }, "ps": { "duration": 49629492, "reportedSentences": 36, "validatedSentences": 2053, "unvalidatedSentences": -67, "clips": 9668, "users": 127, "size": 285610011, "checksum": "509de45a51a3f18b5cba1dd3939b77d0d79d5442eb224f864f45d907bd68f6ef", "avgDurationSecs": 4.995, "validDurationSecs": 10627.23, "totalHrs": 13.78, "validHrs": 2.95 }, "pt": { "duration": 5956056, "reportedSentences": 42, "validatedSentences": 113, "unvalidatedSentences": 6, "clips": 1095, "users": 43, "size": 32445341, "checksum": 
"03ff231421a520bf1546b2a7e9cf5a806816273c686db751f258381d1ea4af25", "avgDurationSecs": 4.127, "validDurationSecs": 5485.99, "totalHrs": 1.65, "validHrs": 1.52 }, "quy": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": -1, "clips": 0, "users": 0, "size": -18, "checksum": "aa51c0a003ec1f16d7062dc526f0165ae95cdd16f60a7d2996e1f066377f39ab", "avgDurationSecs": 6.132, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "rm-sursilv": { "duration": 1542852, "reportedSentences": 0, "validatedSentences": 75, "unvalidatedSentences": -76, "clips": 318, "users": 1, "size": 9081749, "checksum": "1791c595bc6234238701bd08cf3bd7b5632a71bf1130f12f4f863215dcebb97c", "avgDurationSecs": 5.438, "validDurationSecs": 127.53, "totalHrs": 0.43, "validHrs": 0.04 }, "rm-vallader": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -2023, "checksum": "7fe612ef4b735a17a2a5a4d2b1fd9d86253c2b72c60d7760599f68103b7c93f0", "avgDurationSecs": 5.826, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ro": { "duration": 1061280, "reportedSentences": 5, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 252, "users": 4, "size": 6075112, "checksum": "296f203873935921a145e03ee2de3f239c06a5b7dce0c00bf91795c0cc048e63", "avgDurationSecs": 4.032, "validDurationSecs": 652.28, "totalHrs": 0.3, "validHrs": 0.18 }, "ru": { "duration": 5431824, "reportedSentences": 8, "validatedSentences": 140, "unvalidatedSentences": -145, "clips": 926, "users": 55, "size": 31536534, "checksum": "33200a55464e6c177c2928b9b4c252794cd787067955cc057cfe86daad206fdd", "avgDurationSecs": 5.171, "validDurationSecs": 10185.87, "totalHrs": 1.51, "validHrs": 2.83 }, "rw": { "duration": 404424, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 2, "clips": 102, "users": 9, "size": 1420191, "checksum": "761d64abc17a83d74be5c2f70eb3f0e01897e9c280fab52823af206d541d7f21", "avgDurationSecs": 
5.007, "validDurationSecs": 376.42, "totalHrs": 0.11, "validHrs": 0.1 }, "sah": { "duration": 463428, "reportedSentences": 0, "validatedSentences": -22, "unvalidatedSentences": -174, "clips": 75, "users": 3, "size": 2471519, "checksum": "2a4f50ed7f7f898bb0c84d0c0b631ab1498743d3eebf51c1b9f8043cf839991a", "avgDurationSecs": 6.137, "validDurationSecs": 32.76, "totalHrs": 0.13, "validHrs": 0 }, "sat": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 121, "checksum": "cd2ddd84b5464adb558b463732923c220be471a3a8c76a37fc6885c3f997f2d3", "avgDurationSecs": 4.462, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "sc": { "duration": 1708920, "reportedSentences": 5, "validatedSentences": 0, "unvalidatedSentences": -18, "clips": 374, "users": 5, "size": 9641562, "checksum": "9a2d34ec3b703c40a6471d936751c2a48f15b13938d7989303b8c7db2abc51e9", "avgDurationSecs": 4.562, "validDurationSecs": 896.4, "totalHrs": 0.47, "validHrs": 0.25 }, "sk": { "duration": 352296, "reportedSentences": 1, "validatedSentences": 482, "unvalidatedSentences": -461, "clips": 76, "users": 7, "size": 2017649, "checksum": "127e91f36eef029e0570d258ce50d0fe32909f1aff2f870d6d28a88b88d70c17", "avgDurationSecs": 4.078, "validDurationSecs": 1196.96, "totalHrs": 0.1, "validHrs": 0.33 }, "skr": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": -2, "clips": 0, "users": 0, "size": -2385, "checksum": "6b5e1adbff47bd27520601c71f351968fe5cd13f3928ecaf9299db39729ebe5d", "avgDurationSecs": 4.143, "validDurationSecs": 58.01, "totalHrs": 0, "validHrs": 0.01 }, "sl": { "duration": 2072052, "reportedSentences": 2, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 462, "users": 71, "size": 12030533, "checksum": "ca35dc8cdf3035943ba18e56dd8592f6b4d4355e76268706d8eee52486e1b704", "avgDurationSecs": 3.816, "validDurationSecs": 2557.75, "totalHrs": 0.58, "validHrs": 0.71 }, "sq": { "duration": 
20749464, "reportedSentences": 120, "validatedSentences": -5276, "unvalidatedSentences": -6, "clips": 4018, "users": 45, "size": 122484972, "checksum": "5b427434430222a16741825ba65591867dd55300e4212a1940b13ea02d0c7aab", "avgDurationSecs": 5.086, "validDurationSecs": 20185.05, "totalHrs": 5.76, "validHrs": 5.61 }, "sr": { "duration": 16914492, "reportedSentences": 16, "validatedSentences": 1220, "unvalidatedSentences": -818, "clips": 4869, "users": 12, "size": 98974878, "checksum": "8ed0b26b008b4ff7f73e87810f010412d6379cae4a54038dce27ff8a4fe069d0", "avgDurationSecs": 3.23, "validDurationSecs": 4446.29, "totalHrs": 4.7, "validHrs": 1.24 }, "sv-SE": { "duration": 275400, "reportedSentences": 0, "validatedSentences": 1, "unvalidatedSentences": 0, "clips": 61, "users": 5, "size": 1626578, "checksum": "b38b317cc2060beecbdfa13e4b13c997f7d209ffad37bb6695b3c000005cbb12", "avgDurationSecs": 4.011, "validDurationSecs": 326.62, "totalHrs": 0.08, "validHrs": 0.1 }, "sw": { "duration": 158940, "reportedSentences": 0, "validatedSentences": 9, "unvalidatedSentences": -27, "clips": 25, "users": 3, "size": 733698, "checksum": "c3c790200721824d886ccec10878138589271d0674894cb66e1dd96ef74526ec", "avgDurationSecs": 5.386, "validDurationSecs": 170.54, "totalHrs": 0.04, "validHrs": 0.05 }, "ta": { "duration": 32298156, "reportedSentences": 6, "validatedSentences": 0, "unvalidatedSentences": 9, "clips": 4508, "users": 11, "size": 190791225, "checksum": "6ffb1c85ba2f8c2fa6570933c93adf4fe0a16f80caeb2ea29829c2819641eb15", "avgDurationSecs": 6.198, "validDurationSecs": 3187.4, "totalHrs": 8.98, "validHrs": 0.89 }, "te": { "duration": 975888, "reportedSentences": 12, "validatedSentences": 52, "unvalidatedSentences": 123, "clips": 249, "users": 1, "size": 5706484, "checksum": "7ae183fc43049daee6960ede819721824c362d202fe01742a798edcaf256ab39", "avgDurationSecs": 4.169, "validDurationSecs": 126.32, "totalHrs": 0.27, "validHrs": 0.04 }, "th": { "duration": 1488708, "reportedSentences": 12, 
"validatedSentences": 57, "unvalidatedSentences": -79, "clips": 344, "users": 15, "size": 8636876, "checksum": "f116818d3310ea8eb531f4c6b45a3e968a684aaf7c8623b1ddf522ead6c57973", "avgDurationSecs": 4.191, "validDurationSecs": 785.97, "totalHrs": 0.41, "validHrs": 0.22 }, "ti": { "duration": 1620, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": -2, "clips": 1, "users": 1, "size": 8754, "checksum": "2a0d1a0121ee9a54dd21a7389889ba744cad82bbb8ed3b7ec07553baa964a172", "avgDurationSecs": 4.998, "validDurationSecs": 4.71, "totalHrs": 0, "validHrs": 0 }, "tig": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -815, "checksum": "53323d4922aaf497fedb086d259baa5c9efbea6b16a52f94fa0ba00fe5f13d86", "avgDurationSecs": 6.162, "validDurationSecs": 18.49, "totalHrs": 0, "validHrs": 0 }, "tk": { "duration": 270468, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": -15, "clips": 51, "users": 2, "size": 1593344, "checksum": "cbfebd05c16c6c1e1fbe89e064a16eb6ccd8e60ee6201eb2e80f41a272fbcc3d", "avgDurationSecs": 5.409, "validDurationSecs": 170.81, "totalHrs": 0.08, "validHrs": 0.05 }, "tn": { "duration": 7795800, "reportedSentences": 1, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 1798, "users": 3, "size": 39790312, "checksum": "f7a41f00c63807485e14b46847a0e32db21022b00cd949b3f35a86e4d3f593be", "avgDurationSecs": 4.333, "validDurationSecs": 17.331, "totalHrs": 2.16, "validHrs": 0 }, "tok": { "duration": 128592, "reportedSentences": 0, "validatedSentences": 9, "unvalidatedSentences": -10, "clips": 25, "users": 4, "size": 730289, "checksum": "4c44f6171fe38cb9c4e6ed10a98ff20626be077cfb7e40f735f855a5daab09f3", "avgDurationSecs": 4.171, "validDurationSecs": 392.75, "totalHrs": 0.04, "validHrs": 0.11 }, "tr": { "duration": 15485364, "reportedSentences": 14, "validatedSentences": 5, "unvalidatedSentences": -2, "clips": 2462, "users": 20, "size": 91620532, 
"checksum": "7863c59321bc14ce7a1ec51f60677c4885f70b668aa1216b4521ec241d4532c0", "avgDurationSecs": 3.786, "validDurationSecs": 14883.56, "totalHrs": 4.3, "validHrs": 4.14 }, "tt": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 1, "clips": 0, "users": 0, "size": 394, "checksum": "20ec24eee4576b696f1fb2aba624efa0bbb871599110e83763d85ce934559f0e", "avgDurationSecs": 3.784, "validDurationSecs": 41.63, "totalHrs": 0, "validHrs": 0.01 }, "tw": { "duration": 105228, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": -2, "clips": 15, "users": 3, "size": 624237, "checksum": "de765b0863bc07cd616c1648ec886ad8fd2036a553df5aefa3592ae0af664168", "avgDurationSecs": 4.31, "validDurationSecs": 88.13, "totalHrs": 0.03, "validHrs": 0.02 }, "ug": { "duration": 22475916, "reportedSentences": 56, "validatedSentences": -389, "unvalidatedSentences": -239, "clips": 3825, "users": 27, "size": 123723220, "checksum": "cd347627e036c363a61aa46323428c15a353577a55157c31c70d4f5ad79f9553", "avgDurationSecs": 5.671, "validDurationSecs": 286292.43, "totalHrs": 6.24, "validHrs": 79.53 }, "uk": { "duration": 1821564, "reportedSentences": 2, "validatedSentences": -26, "unvalidatedSentences": 0, "clips": 415, "users": 13, "size": 10617845, "checksum": "8c12ea71f168323d0757789e1aecfd59979491221026042ffb5e8fee059c0054", "avgDurationSecs": 4.643, "validDurationSecs": 669.96, "totalHrs": 0.51, "validHrs": 0.19 }, "ur": { "duration": 246822624, "reportedSentences": 88, "validatedSentences": 3216, "unvalidatedSentences": -2657, "clips": 55251, "users": 100, "size": 1448382969, "checksum": "2ff5ba74d57078a78cf3222e04f1922a0042ee71c0344a3862f8d17d70c41eff", "avgDurationSecs": 4.295, "validDurationSecs": 58339.37, "totalHrs": 68.56, "validHrs": 16.2 }, "uz": { "duration": 487728, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 5, "clips": 80, "users": 11, "size": 2766397, "checksum": 
"c75bd18e5664bc84c80bd1e78280ad7f5812eb4dd7da77ff69b7174b6ccd0f45", "avgDurationSecs": 4.153, "validDurationSecs": 295.5, "totalHrs": 0.13, "validHrs": 0.08 }, "vi": { "duration": 589104, "reportedSentences": 12, "validatedSentences": 17, "unvalidatedSentences": -18, "clips": 157, "users": 6, "size": 3451471, "checksum": "e1854be8cf645bf0d518eff3fa6413cd18196cd13518db6b26c2f2f85122a95e", "avgDurationSecs": 3.952, "validDurationSecs": 199.98, "totalHrs": 0.17, "validHrs": 0.05 }, "vot": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 153, "checksum": "d9f1003cf6efa48e96b1019df91e2df3932d56b5f3936613989bfc5e30328c48", "avgDurationSecs": 2.412, "validDurationSecs": 2.41, "totalHrs": 0, "validHrs": 0 }, "xh": { "duration": 33732, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 5, "users": 1, "size": 199393, "checksum": "8e455d4546b10b24860fe6633034329b1cc1fe7948e3bb7358f6676f9869eada", "avgDurationSecs": 6.276, "validDurationSecs": 6.276, "totalHrs": 0.01, "validHrs": 0 }, "yi": { "duration": 22068, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 6, "users": 1, "size": 129526, "checksum": "92f522ee0088e70cb3cab6a2bc87791bb236bc711747f0886810916fa09d5b1a", "avgDurationSecs": 4.192, "validDurationSecs": 22.3, "totalHrs": 0.01, "validHrs": 0 }, "yo": { "duration": 696924, "reportedSentences": 0, "validatedSentences": 6, "unvalidatedSentences": -2, "clips": 93, "users": 8, "size": 4069982, "checksum": "7c1ec812ad2cffd5855ae9bda5ca678570fd121f5f668f00cf0809acf5204322", "avgDurationSecs": 5.98, "validDurationSecs": 319.21, "totalHrs": 0.2, "validHrs": 0.08 }, "yue": { "duration": 215351856, "reportedSentences": 315, "validatedSentences": 288, "unvalidatedSentences": -287, "clips": 52601, "users": 46, "size": 1262490904, "checksum": "f19073fb2bff609f8a9f4bb4e06fc342366fe441948680b3ecc6041f9c21e2b8", "avgDurationSecs": 3.939, 
"validDurationSecs": 557744.35, "totalHrs": 59.82, "validHrs": 154.93 }, "zgh": { "duration": 184212, "reportedSentences": 1, "validatedSentences": 8, "unvalidatedSentences": -7, "clips": 46, "users": 4, "size": 1087182, "checksum": "ed94c480001878d57c916c02886bd93db4f2cb441b080fd78cfdbefc5012722a", "avgDurationSecs": 3.382, "validDurationSecs": 504.03, "totalHrs": 0.05, "validHrs": 0.14 }, "zh-CN": { "duration": 5463432, "reportedSentences": 5, "validatedSentences": 101, "unvalidatedSentences": 15, "clips": 1352, "users": 77, "size": 32166942, "checksum": "775ed324eae93f72590e99bee0b45e5c1a923f102c24645a8f2e11ec070e81d5", "avgDurationSecs": 4.546, "validDurationSecs": 2567.67, "totalHrs": 1.52, "validHrs": 0.71 }, "zh-HK": { "duration": 3037356, "reportedSentences": 110, "validatedSentences": 7, "unvalidatedSentences": -6, "clips": 890, "users": 8, "size": 17759239, "checksum": "5a79b07d1f8310188483e43f96e3577b2264fd54ea18d519ba7d1bfed6308db4", "avgDurationSecs": 4.169, "validDurationSecs": 327.83, "totalHrs": 0.84, "validHrs": 0.09 }, "zh-TW": { "duration": 1237644, "reportedSentences": 3, "validatedSentences": 1, "unvalidatedSentences": -3, "clips": 407, "users": 11, "size": 7238789, "checksum": "b6729c36ad10f1d2d004b48523e932b9ef68f4afa441fc05bad9d2f80c9c34fe", "avgDurationSecs": 3.364, "validDurationSecs": 522.21, "totalHrs": 0.34, "validHrs": 0.15 }, "zu": { "duration": 42552, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": -1, "clips": 6, "users": 2, "size": 249929, "checksum": "2a08e35cca0090a0fbf4c18d8f24c51b8735660a5c6e6548e45dea2a75cadfd8", "avgDurationSecs": 5.388, "validDurationSecs": 32.62, "totalHrs": 0.02, "validHrs": 0.01 }, "zza": { "duration": 1104912, "reportedSentences": 16, "validatedSentences": -80, "unvalidatedSentences": -43, "clips": 286, "users": 0, "size": 6468929, "checksum": "4379fd88b639f55fe7c0decace26c9e5e1e3064db712e913b727e0c73f93d418", "avgDurationSecs": 4.303, "validDurationSecs": -6.28, "totalHrs": 0.3, 
"validHrs": 0 } }, "totalDuration": 1665436644, "totalValidDurationSecs": 2338259, "totalHrs": 463, "totalValidHrs": 650 } ================================================ FILE: datasets/scripted-speech/cv-corpus-2.json ================================================ { "date": "2019-06-11", "locales": { "en": { "clips": 895794, "splits": { "accent": { "": 0.52, "canada": 0.03, "england": 0.09, "us": 0.23, "indian": 0.04, "australia": 0.03, "malaysia": 0, "newzealand": 0.01, "african": 0.01, "ireland": 0, "philippines": 0, "singapore": 0, "scotland": 0.01, "hongkong": 0, "bermuda": 0, "southatlandtic": 0, "wales": 0, "other": 0.02 }, "age": { "": 0.41, "twenties": 0.21, "sixties": 0.03, "thirties": 0.15, "teens": 0.05, "seventies": 0.01, "fourties": 0.08, "fifties": 0.04, "eighties": 0, "nineties": 0 }, "gender": { "": 0.41, "male": 0.47, "female": 0.11, "other": 0.02 } }, "users": 39534, "duration": 3907985784, "buckets": { "dev": 13021, "invalidated": 78156, "other": 178524, "test": 13021, "train": 59875, "validated": 639080 }, "size": 30453965853 }, "de": { "clips": 292805, "splits": { "accent": { "germany": 0.65, "russia": 0, "": 0.3, "austria": 0.02, "liechtenstein": 0, "switzerland": 0.01, "france": 0, "other": 0, "poland": 0, "united_kingdom": 0, "hungary": 0, "netherlands": 0, "namibia": 0, "slovakia": 0, "united_states": 0, "italy": 0 }, "age": { "twenties": 0.28, "fourties": 0.17, "": 0.22, "thirties": 0.18, "teens": 0.04, "sixties": 0.02, "fifties": 0.09, "seventies": 0 }, "gender": { "male": 0.68, "": 0.22, "female": 0.1, "other": 0 } }, "users": 4987, "duration": 1221585696, "buckets": { "dev": 5633, "invalidated": 12568, "other": 2591, "test": 5633, "train": 8518, "validated": 277602 }, "size": 9607377541 }, "fr": { "clips": 160949, "splits": { "accent": { "": 0.29, "france": 0.64, "germany": 0, "belgium": 0.03, "switzerland": 0.01, "guadeloupe": 0, "reunion": 0, "monaco": 0, "tunisia": 0, "canada": 0.02, "other": 0, "mayotte": 0, "algeria": 0, 
"netherlands": 0, "senegal": 0, "martinique": 0, "portugal": 0, "united_states": 0, "cote_d_ivoire": 0, "st_pierre_et_miquelon": 0, "french_guiana": 0, "benin": 0, "italy": 0, "united_kingdom": 0, "madagascar": 0, "cameroon": 0 }, "age": { "twenties": 0.24, "thirties": 0.21, "": 0.22, "teens": 0.04, "fourties": 0.17, "fifties": 0.05, "sixties": 0.06, "seventies": 0 }, "gender": { "male": 0.7, "": 0.2, "female": 0.09, "other": 0.01 } }, "users": 2985, "duration": 658525248, "buckets": { "dev": 10244, "invalidated": 9880, "other": 1547, "test": 10244, "train": 26652, "validated": 149161 }, "size": 5187818129 }, "cy": { "clips": 38049, "splits": { "accent": { "united_kingdom": 0.43, "": 0.55, "other": 0.02 }, "age": { "fourties": 0.12, "twenties": 0.09, "sixties": 0.13, "fifties": 0.08, "": 0.49, "thirties": 0.07, "seventies": 0.01, "eighties": 0, "teens": 0.01 }, "gender": { "male": 0.33, "female": 0.18, "": 0.49 } }, "users": 738, "duration": 171799488, "buckets": { "dev": 484, "invalidated": 1657, "other": 3351, "test": 484, "train": 500, "validated": 32993 }, "size": 1342887417 }, "br": { "clips": 12155, "splits": { "accent": { "other": 0, "": 1 }, "age": { "twenties": 0.14, "": 0.5, "fifties": 0.08, "fourties": 0.11, "thirties": 0.13, "sixties": 0.04 }, "gender": { "male": 0.47, "": 0.5, "female": 0.03 } }, "users": 118, "duration": 36714648, "buckets": { "dev": 1138, "invalidated": 282, "other": 7563, "test": 1283, "train": 1603, "validated": 4310 }, "size": 284094160 }, "cv": { "clips": 2355, "splits": { "accent": { "other": 0.01, "": 0.99 }, "age": { "twenties": 0.04, "": 0.53, "fourties": 0.38, "thirties": 0.04, "teens": 0.01 }, "gender": { "male": 0.47, "": 0.53 } }, "users": 38, "duration": 10101114, "buckets": { "dev": 78, "invalidated": 108, "other": 1221, "test": 242, "train": 566, "validated": 1026 }, "size": 79983634 }, "tr": { "clips": 9700, "splits": { "accent": { "": 0.92, "other": 0.08 }, "age": { "": 0.19, "thirties": 0.33, "twenties": 0.42, 
"teens": 0.04, "fourties": 0.02, "fifties": 0 }, "gender": { "": 0.19, "male": 0.7, "female": 0.11 } }, "users": 344, "duration": 37589112, "buckets": { "dev": 1611, "invalidated": 838, "other": 71, "test": 1610, "train": 1790, "validated": 8791 }, "size": 292352864 }, "tt": { "clips": 25448, "splits": { "accent": { "": 1 }, "age": { "": 0.16, "thirties": 0.78, "twenties": 0.05, "sixties": 0, "fifties": 0.01, "teens": 0, "fourties": 0 }, "gender": { "": 0.16, "male": 0.82, "female": 0.02 } }, "users": 132, "duration": 93602928, "buckets": { "dev": 1949, "invalidated": 241, "other": 3311, "test": 4070, "train": 8077, "validated": 21896 }, "size": 710563115 }, "ky": { "clips": 15885, "splits": { "accent": { "": 0.91, "other": 0.09 }, "age": { "thirties": 0.15, "": 0.04, "fourties": 0.01, "twenties": 0.79, "teens": 0.01 }, "gender": { "male": 0.44, "": 0.09, "female": 0.47, "other": 0 } }, "users": 97, "duration": 72991440, "buckets": { "dev": 971, "invalidated": 779, "other": 8249, "test": 1501, "train": 2320, "validated": 6857 }, "size": 532536032 }, "ga-IE": { "clips": 3685, "splits": { "accent": { "": 0.29, "connachta": 0.41, "other": 0.1, "ulaidh": 0.09, "mumhain": 0.1 }, "age": { "twenties": 0.26, "": 0.24, "thirties": 0.39, "fourties": 0.08, "sixties": 0.01, "teens": 0.03 }, "gender": { "male": 0.62, "": 0.24, "female": 0.15 } }, "users": 63, "duration": 12648528, "buckets": { "dev": 409, "invalidated": 196, "other": 1088, "test": 474, "train": 558, "validated": 2382 }, "size": 97115621 }, "kab": { "clips": 191313, "splits": { "accent": { "": 0.79, "other": 0.21 }, "age": { "fourties": 0.12, "thirties": 0.32, "": 0.34, "twenties": 0.13, "fifties": 0.03, "eighties": 0, "teens": 0, "sixties": 0.05, "seventies": 0 }, "gender": { "male": 0.55, "": 0.3, "female": 0.15, "other": 0.01 } }, "users": 584, "duration": 690930408, "buckets": { "dev": 6646, "invalidated": 9659, "other": 2594, "test": 6647, "train": 11073, "validated": 179040 }, "size": 5441484242 }, "ca": { 
"clips": 95666, "splits": { "accent": { "valencian": 0.06, "central": 0.68, "": 0.21, "other": 0.01, "balearic": 0.01, "northwestern": 0.03, "northern": 0 }, "age": { "thirties": 0.12, "fifties": 0.28, "fourties": 0.25, "twenties": 0.1, "": 0.2, "sixties": 0.04, "teens": 0.02, "seventies": 0, "eighties": 0 }, "gender": { "male": 0.37, "": 0.2, "female": 0.43, "other": 0 } }, "users": 1832, "duration": 434567232, "buckets": { "dev": 8406, "invalidated": 4681, "other": 6498, "test": 8406, "train": 16991, "validated": 84487 }, "size": 3412793392 }, "zh-TW": { "clips": 52609, "splits": { "accent": { "": 0.65, "other": 0.14, "kaohsiung_city": 0.01, "new_taipei_city": 0.05, "yilan_county": 0.01, "taoyuan_city": 0, "hsinchu_city": 0.01, "taipei_city": 0.07, "chiayi_city": 0.01, "taichung_city": 0.01, "yunlin_county": 0, "nantou_county": 0, "tainan_city": 0.03, "changhua_county": 0, "pingtung_county": 0, "hsinchu_county": 0 }, "age": { "thirties": 0.32, "twenties": 0.38, "teens": 0.04, "": 0.23, "fifties": 0.01, "seventies": 0, "fourties": 0.02 }, "gender": { "male": 0.46, "": 0.23, "female": 0.29, "other": 0.03 } }, "users": 948, "duration": 154702608, "buckets": { "dev": 1539, "invalidated": 2367, "other": 10785, "test": 1539, "train": 1698, "validated": 39457 }, "size": 1214984798 }, "sl": { "clips": 4714, "splits": { "accent": { "other": 0.01, "": 0.99 }, "age": { "twenties": 0.75, "teens": 0, "": 0.05, "sixties": 0, "fifties": 0.17, "fourties": 0.02, "thirties": 0.01 }, "gender": { "female": 0.2, "male": 0.75, "": 0.05 } }, "users": 42, "duration": 18543912, "buckets": { "dev": 464, "invalidated": 148, "other": 2108, "test": 339, "train": 1159, "validated": 2458 }, "size": 146860630 }, "it": { "clips": 30204, "splits": { "accent": { "": 0.83, "other": 0.17 }, "age": { "thirties": 0.11, "twenties": 0.32, "": 0.2, "fifties": 0.19, "fourties": 0.1, "seventies": 0, "sixties": 0, "teens": 0.06 }, "gender": { "female": 0.18, "male": 0.62, "": 0.2, "other": 0 } }, "users": 
600, "duration": 146478288, "buckets": { "dev": 4551, "invalidated": 3211, "other": 140, "test": 4565, "train": 6368, "validated": 26843 }, "size": 1153374637 }, "nl": { "clips": 22227, "splits": { "accent": { "": 0.25, "netherlands": 0.65, "belgium": 0.1, "other": 0 }, "age": { "": 0.22, "twenties": 0.28, "fourties": 0.14, "thirties": 0.11, "teens": 0.04, "fifties": 0.19, "sixties": 0.01 }, "gender": { "": 0.26, "male": 0.72, "female": 0.02, "other": 0 } }, "users": 501, "duration": 84606576, "buckets": { "dev": 1688, "invalidated": 936, "other": 4694, "test": 1688, "train": 1879, "validated": 16597 }, "size": 659865776 }, "cnh": { "clips": 4746, "splits": { "accent": { "": 0.82, "other": 0.18 }, "age": { "": 0.56, "twenties": 0.28, "fourties": 0.01, "teens": 0.02, "thirties": 0.1, "fifties": 0.03 }, "gender": { "": 0.56, "male": 0.24, "female": 0.2 } }, "users": 280, "duration": 17936232, "buckets": { "dev": 686, "invalidated": 464, "other": 2039, "test": 690, "train": 750, "validated": 2243 }, "size": 141968674 }, "eo": { "clips": 13189, "splits": { "accent": { "": 0.51, "internacia": 0.47, "other": 0.02 }, "age": { "twenties": 0.65, "thirties": 0.05, "": 0.11, "fourties": 0.16, "fifties": 0.01, "seventies": 0, "teens": 0.02, "sixties": 0 }, "gender": { "male": 0.51, "": 0.38, "female": 0.11, "other": 0 } }, "users": 129, "duration": 60188640, "buckets": { "dev": 1898, "invalidated": 728, "other": 1717, "test": 1863, "train": 2195, "validated": 10744 }, "size": 472299674 }, "et": { "clips": 6930, "splits": { "accent": { "": 1 }, "age": { "": 0.05, "thirties": 0.11, "twenties": 0.7, "fourties": 0.13, "fifties": 0, "seventies": 0, "teens": 0 }, "gender": { "": 0.05, "male": 0.57, "female": 0.37, "other": 0 } }, "users": 225, "duration": 46332792, "buckets": { "dev": 1771, "invalidated": 426, "other": 62, "test": 1778, "train": 2015, "validated": 6442 }, "size": 361729763 }, "eu": { "clips": 58835, "splits": { "accent": { "mendebalekoa": 0.23, 
"erdialdekoa_nafarra": 0.35, "": 0.42, "nafarlapurtarra_zuberoatarra": 0.01 }, "age": { "fourties": 0.14, "thirties": 0.05, "fifties": 0.17, "twenties": 0.38, "": 0.23, "teens": 0.01, "sixties": 0.01, "seventies": 0 }, "gender": { "male": 0.53, "female": 0.22, "": 0.24, "other": 0.02 } }, "users": 508, "duration": 298847808, "buckets": { "dev": 1981, "invalidated": 2065, "other": 24799, "test": 1981, "train": 2250, "validated": 31971 }, "size": 2347679831 }, "es": { "clips": 27128, "splits": { "accent": { "nortepeninsular": 0.29, "": 0.29, "rioplatense": 0.07, "centrosurpeninsular": 0.04, "americacentral": 0.01, "surpeninsular": 0.03, "canario": 0.02, "andino": 0.12, "mexicano": 0.03, "chileno": 0.08, "caribe": 0.01, "filipinas": 0.01 }, "age": { "thirties": 0.12, "": 0.19, "fifties": 0.17, "twenties": 0.32, "teens": 0.02, "fourties": 0.16, "sixties": 0.02 }, "gender": { "male": 0.73, "": 0.16, "other": 0.01, "female": 0.09 } }, "users": 602, "duration": 112263528, "buckets": { "dev": 2724, "invalidated": 3533, "other": 116, "test": 2727, "train": 3275, "validated": 23405 }, "size": 872813432 }, "zh-CN": { "clips": 7009, "splits": { "accent": { "110000": 0.12, "130000": 0.01, "210000": 0.03, "220000": 0.01, "230000": 0.04, "310000": 0, "320000": 0.08, "330000": 0.07, "340000": 0.03, "350000": 0.03, "360000": 0.01, "370000": 0.03, "410000": 0.01, "420000": 0.01, "430000": 0.01, "440000": 0.03, "450000": 0.01, "500000": 0.01, "510000": 0.05, "520000": 0, "610000": 0.01, "620000": 0, "650000": 0.01, "710000": 0.02, "": 0.36 }, "age": { "": 0.23, "teens": 0.11, "twenties": 0.41, "thirties": 0.25, "fourties": 0 }, "gender": { "": 0.23, "male": 0.76, "female": 0 } }, "users": 289, "duration": 45810480, "buckets": { "dev": 1947, "invalidated": 777, "other": 19, "test": 1950, "train": 2301, "validated": 6213 }, "size": 362279087 }, "mn": { "clips": 6217, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.3, "": 0.33, "twenties": 0.22, "fourties": 0.02, "teens": 0.02, 
"nineties": 0.12 }, "gender": { "male": 0.34, "": 0.33, "female": 0.22, "other": 0.12 } }, "users": 227, "duration": 33959040, "buckets": { "dev": 1502, "invalidated": 456, "other": 514, "test": 1549, "train": 1822, "validated": 5247 }, "size": 268234867 }, "sah": { "clips": 3788, "splits": { "accent": { "": 1 }, "age": { "": 0.36, "twenties": 0.02, "fourties": 0.07, "thirties": 0.44, "teens": 0.11, "fifties": 0 }, "gender": { "": 0.36, "male": 0.54, "female": 0.1 } }, "users": 35, "duration": 22603752, "buckets": { "dev": 0, "invalidated": 51, "other": 1845, "test": 516, "train": 1374, "validated": 1892 }, "size": 174871319 }, "dv": { "clips": 5944, "splits": { "accent": { "": 1 }, "age": { "": 0.08, "twenties": 0.29, "thirties": 0.57, "fourties": 0.02, "teens": 0.04 }, "gender": { "": 0.08, "male": 0.27, "female": 0.65 } }, "users": 92, "duration": 32169912, "buckets": { "dev": 1094, "invalidated": 262, "other": 1863, "test": 1232, "train": 1483, "validated": 3819 }, "size": 250209674 }, "rw": { "clips": 1469, "splits": { "accent": { "": 1 }, "age": { "": 0.87, "twenties": 0.12, "thirties": 0.01 }, "gender": { "": 0.87, "male": 0.13 } }, "users": 31, "duration": 6803592, "buckets": { "dev": 35, "invalidated": 29, "other": 1329, "test": 33, "train": 43, "validated": 111 }, "size": 53737286 }, "sv-SE": { "clips": 4098, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.38, "": 0.21, "teens": 0.07, "fifties": 0.27, "twenties": 0.03, "fourties": 0.04 }, "gender": { "male": 0.76, "": 0.21, "female": 0.03 } }, "users": 43, "duration": 12464208, "buckets": { "dev": 1241, "invalidated": 117, "other": 1, "test": 1120, "train": 1602, "validated": 3980 }, "size": 90672899 }, "ru": { "clips": 14873, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.72, "teens": 0, "": 0.03, "fourties": 0.17, "thirties": 0.03, "fifties": 0.05 }, "gender": { "male": 0.63, "": 0.03, "other": 0, "female": 0.33 } }, "users": 55, "duration": 75811968, "buckets": { "dev": 1160, 
"invalidated": 270, "other": 6022, "test": 1561, "train": 2738, "validated": 7665 }, "size": 588154518 } }, "totalDuration": 8518564962, "totalValidDurationSecs": 6739368, "totalHrs": 2366, "totalValidHrs": 1872, "totalClips": 2007784 } ================================================ FILE: datasets/scripted-speech/cv-corpus-20.0-2024-12-06.json ================================================ { "locales": { "ab": { "buckets": { "dev": 9160, "test": 9117, "train": 21027, "validated": 42068, "invalidated": 5282, "other": 16637 }, "duration": 331319412, "reportedSentences": 225, "validatedSentences": 1046778, "unvalidatedSentences": 5, "clips": 63987, "splits": { "accent": {}, "age": { "": 0.2, "twenties": 0.17, "thirties": 0.17, "teens": 0.26, "fourties": 0.08, "fifties": 0.06, "sixties": 0.05, "seventies": 0.01, "eighties": 0.01, "nineties": 0 }, "gender": { "": 0.2, "male_masculine": 0.17, "female_feminine": 0.64, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 63987, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 415, "size": 1968561579, "checksum": "b08f789609e649f395561b107f5d2eacb6d51fa4888e34ebc92eec87617c93c3", "avgDurationSecs": 5.178, "validDurationSecs": 217824.637, "totalHrs": 92.03, "validHrs": 60.5 }, "af": { "buckets": { "dev": 88, "test": 97, "train": 147, "validated": 332, "invalidated": 197, "other": 274 }, "duration": 4621968, "reportedSentences": 0, "validatedSentences": 4779, "unvalidatedSentences": 16587, "clips": 803, "splits": { "accent": {}, "age": { "": 0.3, "twenties": 0.37, "thirties": 0, "teens": 0.01, "fourties": 0.14, "fifties": 0.18, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.46, "male_masculine": 0.38, "female_feminine": 
0.16, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 803, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 51, "size": 28608326, "checksum": "5cd09c6271b049e64fea831daed656d6ce2aa0a29ec4471c4d1b889b45332a19", "avgDurationSecs": 5.756, "validDurationSecs": 1910.951, "totalHrs": 1.28, "validHrs": 0.53 }, "am": { "buckets": { "dev": 248, "test": 232, "train": 496, "validated": 976, "invalidated": 25, "other": 562 }, "duration": 9740520, "reportedSentences": 11, "validatedSentences": 2195, "unvalidatedSentences": 23, "clips": 1563, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0.15, "thirties": 0.63, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.22, "male_masculine": 0.78, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1563, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 34, "size": 57468324, "checksum": "a2dadc69339073906db2df92b36d6cd3c362596fb439b15bc568c2c86c40794b", "avgDurationSecs": 6.232, "validDurationSecs": 6082.372, "totalHrs": 2.7, "validHrs": 1.68 }, "ar": { "buckets": { "dev": 10406, "test": 10497, "train": 28530, "validated": 78720, "invalidated": 15161, "other": 41444 }, "duration": 562952178, "reportedSentences": 2127, "validatedSentences": 59414, "unvalidatedSentences": 1742105, "clips": 135325, "splits": { "accent": {}, "age": { "": 0.55, "twenties": 0.3, "thirties": 0.11, "teens": 0.03, 
"fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.55, "male_masculine": 0.28, "female_feminine": 0.17, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 135325, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1602, "size": 3466362578, "checksum": "a648911b8ddc0155dc238a6941f19b4fd4fd6f8c0d837b6593a78fe71d0934e0", "avgDurationSecs": 4.16, "validDurationSecs": 327475.304, "totalHrs": 156.37, "validHrs": 90.96 }, "as": { "buckets": { "dev": 517, "test": 573, "train": 677, "validated": 1767, "invalidated": 280, "other": 2560 }, "duration": 27185465, "reportedSentences": 245, "validatedSentences": 6711, "unvalidatedSentences": 3788, "clips": 4607, "splits": { "accent": {}, "age": { "": 0.28, "twenties": 0.16, "thirties": 0.02, "teens": 0, "fourties": 0.52, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.81, "male_masculine": 0.18, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 4607, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 50, "size": 165459276, "checksum": "7debdabf59cea322a077dd7018023d0936ba6d1b065206afe93743c8c32e3365", "avgDurationSecs": 5.901, "validDurationSecs": 10426.897, "totalHrs": 7.55, "validHrs": 2.89 }, "ast": { "buckets": { "dev": 113, "test": 175, "train": 399, "validated": 691, "invalidated": 18, "other": 970 }, "duration": 7444944, "reportedSentences": 0, 
"validatedSentences": 1642, "unvalidatedSentences": 290, "clips": 1679, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.1, "thirties": 0.12, "teens": 0.04, "fourties": 0.28, "fifties": 0.21, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male_masculine": 0.6, "female_feminine": 0.15, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1679, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 30, "size": 42474506, "checksum": "b7d2c93a072bc82cf657601f7ec011f7abda8a1998890bdbb50ab971dcb019e3", "avgDurationSecs": 4.434, "validDurationSecs": 3064, "totalHrs": 2.06, "validHrs": 0.85 }, "az": { "buckets": { "dev": 85, "test": 84, "train": 109, "validated": 278, "invalidated": 49, "other": 480 }, "duration": 4385880, "reportedSentences": 5, "validatedSentences": 93159, "unvalidatedSentences": 1851, "clips": 807, "splits": { "accent": {}, "age": { "": 0.21, "twenties": 0.13, "thirties": 0.66, "teens": 0.01, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.79, "male_masculine": 0.21, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 807, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 33, "size": 31970697, "checksum": "36afc5d5fc14cdb091377c27d32b98a7330cf288c28783b6a0c7f3ec8c671f6b", "avgDurationSecs": 5.435, "validDurationSecs": 1510.873, "totalHrs": 1.21, "validHrs": 0.41 }, "ba": { "buckets": { 
"dev": 14492, "test": 14589, "train": 119120, "validated": 209914, "invalidated": 7967, "other": 297 }, "duration": 965795040, "reportedSentences": 867, "validatedSentences": 153957, "unvalidatedSentences": 14, "clips": 218178, "splits": { "accent": {}, "age": { "": 0.3, "twenties": 0.17, "thirties": 0.17, "teens": 0.04, "fourties": 0.06, "fifties": 0.05, "sixties": 0.19, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male_masculine": 0.3, "female_feminine": 0.39, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 218178, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 923, "size": 5442116836, "checksum": "058decc1fa22e29855f7673371ba38655d2ef35d9ffa4beaa4d9b843cee18869", "avgDurationSecs": 4.427, "validDurationSecs": 929213.303, "totalHrs": 268.27, "validHrs": 258.11 }, "bas": { "buckets": { "dev": 1328, "test": 1547, "train": 2109, "validated": 11134, "invalidated": 1217, "other": 102 }, "duration": 48872484, "reportedSentences": 8, "validatedSentences": 5226, "unvalidatedSentences": 105, "clips": 12453, "splits": { "accent": {}, "age": { "": 0.39, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0.61, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 12449, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 4, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 52, "size": 253347893, "checksum": 
"8c9780e4f91d8456447c5f991ec0044a7904301063eb309d42c6b6733b9f797c", "avgDurationSecs": 3.925, "validDurationSecs": 43695.996, "totalHrs": 13.57, "validHrs": 12.13 }, "be": { "buckets": { "dev": 15876, "test": 15880, "train": 347672, "validated": 1350744, "invalidated": 36642, "other": 12732 }, "duration": 6695903376, "reportedSentences": 3196, "validatedSentences": 379528, "unvalidatedSentences": 1938, "clips": 1400118, "splits": { "accent": {}, "age": { "": 0.87, "twenties": 0.03, "thirties": 0.04, "teens": 0.01, "fourties": 0.04, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.87, "male_masculine": 0.06, "female_feminine": 0.07, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1400059, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 3, "general": 33, "healthcare": 0, "history_law_government": 5, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 4, "news_current_affairs": 7, "technology_robotics": 7 } }, "users": 8521, "size": 38156117482, "checksum": "094871bb35673b61fa572f75486a8b0022f8368a6842ed2da4fd80e2a1804610", "avgDurationSecs": 4.782, "validDurationSecs": 6459777.897, "totalHrs": 1859.97, "validHrs": 1794.38 }, "bg": { "buckets": { "dev": 2888, "test": 3303, "train": 4924, "validated": 11131, "invalidated": 774, "other": 1870 }, "duration": 75384468, "reportedSentences": 188, "validatedSentences": 39570, "unvalidatedSentences": 5, "clips": 13775, "splits": { "accent": {}, "age": { "": 0.37, "twenties": 0.11, "thirties": 0.28, "teens": 0, "fourties": 0.23, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male_masculine": 0.59, "female_feminine": 0.04, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 13775, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, 
"history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 141, "size": 444221553, "checksum": "4d5e2f226f61a985b1d53a1e4b93242c6eba8c9aaa2eb080b7d842cd97c2059b", "avgDurationSecs": 5.473, "validDurationSecs": 60915.028, "totalHrs": 20.94, "validHrs": 16.92 }, "bn": { "buckets": { "dev": 9356, "test": 9356, "train": 21375, "validated": 44351, "invalidated": 7860, "other": 998046 }, "duration": 4587623640, "reportedSentences": 1890, "validatedSentences": 562755, "unvalidatedSentences": 94185, "clips": 1050257, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0.67, "thirties": 0.03, "teens": 0.07, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.22, "male_masculine": 0.54, "female_feminine": 0.23, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1050257, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 22946, "size": 26510270813, "checksum": "85801c1fcbd60d1527a490b908ab8c9c96698057032fdc7b3094ae5995108d58", "avgDurationSecs": 4.368, "validDurationSecs": 193729.436, "totalHrs": 1274.33, "validHrs": 53.81 }, "br": { "buckets": { "dev": 2400, "test": 2398, "train": 2902, "validated": 29629, "invalidated": 1792, "other": 332 }, "duration": 101344255, "reportedSentences": 301, "validatedSentences": 8107, "unvalidatedSentences": 55, "clips": 31753, "splits": { "accent": {}, "age": { "": 0.29, "twenties": 0.31, "thirties": 0.09, "teens": 0.01, "fourties": 0.06, "fifties": 0.08, "sixties": 0.13, "seventies": 0.03, "eighties": 0, "nineties": 0 }, "gender": { "": 0.29, "male_masculine": 0.65, "female_feminine": 0.06, 
"transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 31576, "agriculture_food": 1, "automotive_transport": 0, "finance": 0, "service_retail": 3, "general": 155, "healthcare": 5, "history_law_government": 4, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 6, "news_current_affairs": 0, "technology_robotics": 3 } }, "users": 215, "size": 702084050, "checksum": "599380a6168453bb13f71cf02f69fa069c19e752ccb06898fa56a49e01da00ba", "avgDurationSecs": 3.192, "validDurationSecs": 94565.204, "totalHrs": 28.15, "validHrs": 26.26 }, "ca": { "buckets": { "dev": 16412, "test": 16412, "train": 1193920, "validated": 2211337, "invalidated": 135581, "other": 225770 }, "duration": 13507730730, "reportedSentences": 6879, "validatedSentences": 1289263, "unvalidatedSentences": 461, "clips": 2572688, "splits": { "accent": {}, "age": { "": 0.27, "twenties": 0.04, "thirties": 0.05, "teens": 0.01, "fourties": 0.12, "fifties": 0.18, "sixties": 0.29, "seventies": 0.03, "eighties": 0, "nineties": 0 }, "gender": { "": 0.28, "male_masculine": 0.52, "female_feminine": 0.2, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2571882, "agriculture_food": 14, "automotive_transport": 9, "finance": 1, "service_retail": 61, "general": 488, "healthcare": 22, "history_law_government": 51, "language_fundamentals": 21, "media_entertainment": 31, "nature_environment": 81, "news_current_affairs": 27, "technology_robotics": 17 } }, "users": 36408, "size": 81268150185, "checksum": "48e79bd0f40b830a5ad84549f4128e692bd9fb03e4aee6eed87e189918ad58bc", "avgDurationSecs": 5.25, "validDurationSecs": 11610480.847, "totalHrs": 3752.14, "validHrs": 3225.13 }, "ckb": { "buckets": { "dev": 5313, "test": 5314, "train": 7811, "validated": 118701, "invalidated": 9191, "other": 39105 }, "duration": 681967152, "reportedSentences": 2524, "validatedSentences": 19003, "unvalidatedSentences": 15440, "clips": 166997, "splits": { 
"accent": {}, "age": { "": 0.32, "twenties": 0.43, "thirties": 0.1, "teens": 0.02, "fourties": 0.02, "fifties": 0.01, "sixties": 0.1, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.32, "male_masculine": 0.63, "female_feminine": 0.05, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 166783, "agriculture_food": 0, "automotive_transport": 10, "finance": 0, "service_retail": 0, "general": 106, "healthcare": 0, "history_law_government": 10, "language_fundamentals": 10, "media_entertainment": 78, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1869, "size": 3767644376, "checksum": "26fc9f595cddac9cd397d7df87db593cacdf632ea1cc97801ffd2fda75d3647f", "avgDurationSecs": 4.084, "validDurationSecs": 484740.342, "totalHrs": 189.43, "validHrs": 134.65 }, "cnh": { "buckets": { "dev": 763, "test": 765, "train": 818, "validated": 2463, "invalidated": 437, "other": 3287 }, "duration": 21751656, "reportedSentences": 9, "validatedSentences": 5218, "unvalidatedSentences": -1, "clips": 6187, "splits": { "accent": {}, "age": { "": 0.48, "twenties": 0.4, "thirties": 0.07, "teens": 0.01, "fourties": 0.01, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.48, "male_masculine": 0.3, "female_feminine": 0.21, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6187, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 300, "size": 168180623, "checksum": "595e9ba7eca9df6a041e359e2cea919d6992057270aed8355c221ea06eab5de3", "avgDurationSecs": 3.516, "validDurationSecs": 8659.177, "totalHrs": 6.04, "validHrs": 2.4 }, "cs": { "buckets": { "dev": 9215, "test": 9193, "train": 20830, 
"validated": 62777, "invalidated": 2352, "other": 149362 }, "duration": 957541827, "reportedSentences": 967, "validatedSentences": 451073, "unvalidatedSentences": 42, "clips": 214491, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.18, "thirties": 0.47, "teens": 0.02, "fourties": 0.07, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.24, "male_masculine": 0.54, "female_feminine": 0.22, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 214478, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 9, "healthcare": 0, "history_law_government": 1, "language_fundamentals": 0, "media_entertainment": 4, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 5 } }, "users": 1035, "size": 5894654317, "checksum": "d339a4f82201c8181993245fd563a67fb415e337e37ecacc99d4e7c12c92ddc3", "avgDurationSecs": 4.464, "validDurationSecs": 280252.334, "totalHrs": 265.98, "validHrs": 77.84 }, "cv": { "buckets": { "dev": 1242, "test": 1288, "train": 1456, "validated": 17488, "invalidated": 2210, "other": 712 }, "duration": 102844668, "reportedSentences": 144, "validatedSentences": 4382, "unvalidatedSentences": 4, "clips": 20410, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.5, "thirties": 0.01, "teens": 0.18, "fourties": 0.06, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.22, "male_masculine": 0.52, "female_feminine": 0.27, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 20410, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 122, "size": 696095939, "checksum": 
"83930665aca6eb4b0f2f7c9a69a7b9e70abe532b5ea5349bf75692492f41ba50", "avgDurationSecs": 5.039, "validDurationSecs": 88120.899, "totalHrs": 28.56, "validHrs": 24.47 }, "cy": { "buckets": { "dev": 5392, "test": 5399, "train": 8003, "validated": 90678, "invalidated": 4458, "other": 20613 }, "duration": 568646397, "reportedSentences": 176, "validatedSentences": 118032, "unvalidatedSentences": 25, "clips": 115749, "splits": { "accent": {}, "age": { "": 0.42, "twenties": 0.13, "thirties": 0.09, "teens": 0.02, "fourties": 0.16, "fifties": 0.09, "sixties": 0.07, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.43, "male_masculine": 0.33, "female_feminine": 0.25, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 115749, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1905, "size": 4148928578, "checksum": "8820cc4bf0dfbbf971d62f7ebf99ef779e3ebfff00bd5a7ee2e38dfa56b76d78", "avgDurationSecs": 4.913, "validDurationSecs": 445478.734, "totalHrs": 157.95, "validHrs": 123.74 }, "da": { "buckets": { "dev": 2222, "test": 2601, "train": 3582, "validated": 10583, "invalidated": 417, "other": 297 }, "duration": 46306764, "reportedSentences": 434, "validatedSentences": 20491, "unvalidatedSentences": 8, "clips": 11297, "splits": { "accent": {}, "age": { "": 0.32, "twenties": 0.19, "thirties": 0.24, "teens": 0.01, "fourties": 0.2, "fifties": 0.03, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0.59, "female_feminine": 0.07, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 11291, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 3, "healthcare": 0, 
"history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 3, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 272, "size": 267953152, "checksum": "ce47ab19e93a34a1c50de1daf0a69504f5c6663805826b49df58ce159d1acb0c", "avgDurationSecs": 4.099, "validDurationSecs": 43380.055, "totalHrs": 12.86, "validHrs": 12.05 }, "dav": { "buckets": { "dev": 1275, "test": 1002, "train": 2098, "validated": 8256, "invalidated": 663, "other": 40711 }, "duration": 201431016, "reportedSentences": 11, "validatedSentences": 11773, "unvalidatedSentences": 20090, "clips": 49630, "splits": { "accent": {}, "age": { "": 0.04, "twenties": 0.61, "thirties": 0.1, "teens": 0, "fourties": 0.01, "fifties": 0.01, "sixties": 0.23, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.73, "male_masculine": 0, "female_feminine": 0.27, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 971, "agriculture_food": 499, "automotive_transport": 100, "finance": 304, "service_retail": 1218, "general": 45323, "healthcare": 967, "history_law_government": 334, "language_fundamentals": 150, "media_entertainment": 68, "nature_environment": 873, "news_current_affairs": 86, "technology_robotics": 75 } }, "users": 24, "size": 1084109285, "checksum": "b819fd9a384f2b38dc64f5b699be5aac74b27070c094e9648cd06375cf762f9e", "avgDurationSecs": 4.059, "validDurationSecs": 33508.25, "totalHrs": 55.95, "validHrs": 9.3 }, "de": { "buckets": { "dev": 16191, "test": 16191, "train": 599647, "validated": 927893, "invalidated": 55669, "other": 7300 }, "duration": 5192667989, "reportedSentences": 11115, "validatedSentences": 2057198, "unvalidatedSentences": 1842, "clips": 990862, "splits": { "accent": {}, "age": { "": 0.32, "twenties": 0.18, "thirties": 0.16, "teens": 0.03, "fourties": 0.17, "fifties": 0.11, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0.59, 
"female_feminine": 0.08, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 990696, "agriculture_food": 0, "automotive_transport": 3, "finance": 0, "service_retail": 11, "general": 125, "healthcare": 10, "history_law_government": 10, "language_fundamentals": 3, "media_entertainment": 0, "nature_environment": 9, "news_current_affairs": 1, "technology_robotics": 7 } }, "users": 19348, "size": 36224820079, "checksum": "f5703ecaca978282a7727358d60c209be019cf3f4365830d410d6fec8c9170e5", "avgDurationSecs": 5.241, "validDurationSecs": 4862675.406, "totalHrs": 1442.4, "validHrs": 1350.74 }, "dv": { "buckets": { "dev": 2201, "test": 2253, "train": 2665, "validated": 26400, "invalidated": 1645, "other": 15072 }, "duration": 220671711, "reportedSentences": 59, "validatedSentences": 7172, "unvalidatedSentences": 6, "clips": 43117, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.11, "thirties": 0.39, "teens": 0.01, "fourties": 0.23, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.24, "male_masculine": 0.24, "female_feminine": 0.52, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 43116, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 1, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 384, "size": 1403728850, "checksum": "22fdb61e0fa87e0cbc410d737a798ddd087380b4531a37b1e7c13a2693209b18", "avgDurationSecs": 5.118, "validDurationSecs": 135114.53, "totalHrs": 61.29, "validHrs": 37.53 }, "dyu": { "buckets": { "dev": 50, "test": 59, "train": 78, "validated": 195, "invalidated": 13, "other": 80 }, "duration": 1786860, "reportedSentences": 0, "validatedSentences": 5028, "unvalidatedSentences": 41, "clips": 288, "splits": { "accent": {}, "age": { "": 0.31, "twenties": 
0.67, "thirties": 0, "teens": 0.02, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.31, "male_masculine": 0.39, "female_feminine": 0.31, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 288, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 34, "size": 10351657, "checksum": "c982c813da26cff20c40bcc10fb121d1bd03b6e7d2bb7d680b12515f522b336b", "avgDurationSecs": 6.204, "validDurationSecs": 1209.853, "totalHrs": 0.49, "validHrs": 0.33 }, "el": { "buckets": { "dev": 1674, "test": 1710, "train": 1942, "validated": 16474, "invalidated": 848, "other": 10232 }, "duration": 114299818, "reportedSentences": 70, "validatedSentences": 6764, "unvalidatedSentences": 4591, "clips": 27554, "splits": { "accent": {}, "age": { "": 0.31, "twenties": 0.12, "thirties": 0.36, "teens": 0.01, "fourties": 0.17, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0.63, "female_feminine": 0.03, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 27554, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 419, "size": 761057645, "checksum": "fcb4f8d8f1c660031ba2db2cd0206a9fd1380e9af108c031f93f3200931f35ea", "avgDurationSecs": 4.148, "validDurationSecs": 68337.635, "totalHrs": 31.74, "validHrs": 18.98 }, "en": { "buckets": { "dev": 16398, "test": 16398, "train": 1124486, "validated": 1834556, "invalidated": 301822, "other": 348975 
}, "duration": 13070886019, "reportedSentences": 9269, "validatedSentences": 1675165, "unvalidatedSentences": 33082, "clips": 2485353, "splits": { "accent": {}, "age": { "": 0.36, "twenties": 0.25, "thirties": 0.14, "teens": 0.06, "fourties": 0.09, "fifties": 0.05, "sixties": 0.04, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male_masculine": 0.44, "female_feminine": 0.18, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2484670, "agriculture_food": 16, "automotive_transport": 5, "finance": 41, "service_retail": 25, "general": 447, "healthcare": 18, "history_law_government": 36, "language_fundamentals": 8, "media_entertainment": 54, "nature_environment": 39, "news_current_affairs": 8, "technology_robotics": 49 } }, "users": 94911, "size": 91088615899, "checksum": "cc7ba69840e96bb55c9f71ee56b2ddf0df67a5996fb2c24499d282c5a8e74e8a", "avgDurationSecs": 5.259, "validDurationSecs": 9648236.034, "totalHrs": 3630.8, "validHrs": 2680.06 }, "eo": { "buckets": { "dev": 14917, "test": 14919, "train": 144259, "validated": 851434, "invalidated": 127470, "other": 156491 }, "duration": 6891248455, "reportedSentences": 2338, "validatedSentences": 180733, "unvalidatedSentences": 22223, "clips": 1135395, "splits": { "accent": {}, "age": { "": 0.2, "twenties": 0.56, "thirties": 0.12, "teens": 0.05, "fourties": 0.04, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0.01, "nineties": 0 }, "gender": { "": 0.2, "male_masculine": 0.69, "female_feminine": 0.11, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1135392, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 3, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1809, "size": 41188330316, "checksum": 
"bd0ffa5898c25d84746e48cca935b23e301337402228334ecc346a86efe33814", "avgDurationSecs": 6.069, "validDurationSecs": 5167755.043, "totalHrs": 1914.23, "validHrs": 1435.48 }, "es": { "buckets": { "dev": 15882, "test": 15882, "train": 348515, "validated": 427357, "invalidated": 92892, "other": 1137109 }, "duration": 8077638546, "reportedSentences": 2609, "validatedSentences": 1081170, "unvalidatedSentences": 5524, "clips": 1657358, "splits": { "accent": {}, "age": { "": 0.13, "twenties": 0.53, "thirties": 0.09, "teens": 0.08, "fourties": 0.03, "fifties": 0.04, "sixties": 0.1, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.13, "male_masculine": 0.55, "female_feminine": 0.32, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1657283, "agriculture_food": 0, "automotive_transport": 4, "finance": 4, "service_retail": 2, "general": 21, "healthcare": 4, "history_law_government": 17, "language_fundamentals": 6, "media_entertainment": 3, "nature_environment": 3, "news_current_affairs": 7, "technology_robotics": 18 } }, "users": 26290, "size": 50953372117, "checksum": "5b33a51daebcf91f7a4b86abb2c5265cb9e7068558cbe26de056ce0a38ab7401", "avgDurationSecs": 4.874, "validDurationSecs": 2082854.384, "totalHrs": 2243.78, "validHrs": 578.57 }, "et": { "buckets": { "dev": 2807, "test": 2807, "train": 3380, "validated": 26119, "invalidated": 7732, "other": 6 }, "duration": 228595646, "reportedSentences": 565, "validatedSentences": 10910, "unvalidatedSentences": 851, "clips": 33857, "splits": { "accent": {}, "age": { "": 0.18, "twenties": 0.7, "thirties": 0.08, "teens": 0, "fourties": 0.03, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.22, "male_masculine": 0.52, "female_feminine": 0.26, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 33363, "agriculture_food": 24, "automotive_transport": 25, "finance": 25, "service_retail": 4, "general": 320, 
"healthcare": 13, "history_law_government": 19, "language_fundamentals": 9, "media_entertainment": 15, "nature_environment": 66, "news_current_affairs": 8, "technology_robotics": 23 } }, "users": 995, "size": 1531311028, "checksum": "12c5baa1ae433c5e85c30fab4f8bc072b962a7ed05b97fcaf41e9784d4cb7c63", "avgDurationSecs": 6.752, "validDurationSecs": 176350.228, "totalHrs": 63.49, "validHrs": 48.98 }, "eu": { "buckets": { "dev": 14371, "test": 14371, "train": 105389, "validated": 239799, "invalidated": 42231, "other": 175551 }, "duration": 2514696339, "reportedSentences": 1249, "validatedSentences": 1017647, "unvalidatedSentences": 2031, "clips": 457581, "splits": { "accent": {}, "age": { "": 0.45, "twenties": 0.17, "thirties": 0.06, "teens": 0.01, "fourties": 0.16, "fifties": 0.11, "sixties": 0.04, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.46, "male_masculine": 0.25, "female_feminine": 0.28, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0, "intersex": 0 }, "sentence_domain": { "": 443685, "agriculture_food": 0, "automotive_transport": 5, "finance": 4, "service_retail": 7, "general": 13827, "healthcare": 22, "history_law_government": 0, "language_fundamentals": 2, "media_entertainment": 6, "nature_environment": 2, "news_current_affairs": 0, "technology_robotics": 23 } }, "users": 10810, "size": 15430027865, "checksum": "bdad6fdc620229a7e4e4109a737e2a562c0a751b7565240e7a4ee9f0fdbf3dd7", "avgDurationSecs": 5.496, "validDurationSecs": 1317846.824, "totalHrs": 698.52, "validHrs": 366.06 }, "fa": { "buckets": { "dev": 10629, "test": 10629, "train": 29422, "validated": 334567, "invalidated": 14949, "other": 34724 }, "duration": 1524765680, "reportedSentences": 2596, "validatedSentences": 57116, "unvalidatedSentences": 282331, "clips": 384240, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.33, "thirties": 0.36, "teens": 0.03, "fourties": 0.03, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": 
{ "": 0.23, "male_masculine": 0.71, "female_feminine": 0.06, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 384229, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 11, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 4535, "size": 11030639163, "checksum": "ef9d5f7f6007bd5d0453c72c7fbb44d71f63d00513127cbd23cf49e5ba8e3c14", "avgDurationSecs": 3.968, "validDurationSecs": 1327650.113, "totalHrs": 423.54, "validHrs": 368.79 }, "fi": { "buckets": { "dev": 1747, "test": 1770, "train": 2116, "validated": 11095, "invalidated": 371, "other": 5848 }, "duration": 79958235, "reportedSentences": 77, "validatedSentences": 5845, "unvalidatedSentences": 49167, "clips": 17314, "splits": { "accent": {}, "age": { "": 0.32, "twenties": 0.16, "thirties": 0.17, "teens": 0.01, "fourties": 0.31, "fifties": 0.03, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0.36, "female_feminine": 0.31, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 17314, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 234, "size": 478067393, "checksum": "5f113127d92f39a8ab4caf4d94072b06a0feab31efb39577f2ea423e19a0d5f0", "avgDurationSecs": 4.618, "validDurationSecs": 51238.109, "totalHrs": 22.21, "validHrs": 14.23 }, "fr": { "buckets": { "dev": 16175, "test": 16175, "train": 578480, "validated": 747206, "invalidated": 66042, "other": 21011 }, "duration": 4192077106, "reportedSentences": 7459, "validatedSentences": 1647872, "unvalidatedSentences": 44415, 
"clips": 834259, "splits": { "accent": {}, "age": { "": 0.38, "twenties": 0.17, "thirties": 0.15, "teens": 0.03, "fourties": 0.13, "fifties": 0.09, "sixties": 0.03, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.32, "male_masculine": 0.57, "female_feminine": 0.11, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 834203, "agriculture_food": 0, "automotive_transport": 1, "finance": 1, "service_retail": 0, "general": 26, "healthcare": 2, "history_law_government": 14, "language_fundamentals": 4, "media_entertainment": 14, "nature_environment": 6, "news_current_affairs": 2, "technology_robotics": 15 } }, "users": 19893, "size": 29426527761, "checksum": "c427bfa2ae2b710b4c085a7d4166cedbc362af67d8b889cee903792076239649", "avgDurationSecs": 5.025, "validDurationSecs": 3754643.541, "totalHrs": 1164.46, "validHrs": 1042.95 }, "fy-NL": { "buckets": { "dev": 3172, "test": 3173, "train": 3923, "validated": 51522, "invalidated": 4014, "other": 102339 }, "duration": 766292535, "reportedSentences": 543, "validatedSentences": 10351, "unvalidatedSentences": 1, "clips": 157875, "splits": { "accent": {}, "age": { "": 0.56, "twenties": 0.04, "thirties": 0.1, "teens": 0, "fourties": 0.06, "fifties": 0.1, "sixties": 0.12, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.57, "male_masculine": 0.13, "female_feminine": 0.31, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 157875, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 2060, "size": 4594783483, "checksum": "e6a0231b25fd6e838fcf3f91f794e469c19287bf0a548cc6051d1a57d5ee49f2", "avgDurationSecs": 4.854, "validDurationSecs": 250077.112, "totalHrs": 212.85, "validHrs": 69.46 }, "ga-IE": { 
"buckets": { "dev": 514, "test": 516, "train": 540, "validated": 5896, "invalidated": 959, "other": 3665 }, "duration": 37791243, "reportedSentences": 29, "validatedSentences": 1618, "unvalidatedSentences": 592, "clips": 10520, "splits": { "accent": {}, "age": { "": 0.38, "twenties": 0.25, "thirties": 0.24, "teens": 0.01, "fourties": 0.06, "fifties": 0.05, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male_masculine": 0.49, "female_feminine": 0.14, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 10519, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 1, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 195, "size": 256102483, "checksum": "289e8542a404d4a68a418fa2925e9d914842778ec5477d3e16b1fe1a1315916e", "avgDurationSecs": 3.592, "validDurationSecs": 21180.339, "totalHrs": 10.49, "validHrs": 5.88 }, "gl": { "buckets": { "dev": 12138, "test": 12151, "train": 45135, "validated": 70487, "invalidated": 7287, "other": 143449 }, "duration": 1126087560, "reportedSentences": 740, "validatedSentences": 695516, "unvalidatedSentences": 5214, "clips": 221223, "splits": { "accent": {}, "age": { "": 0.62, "twenties": 0.1, "thirties": 0.07, "teens": 0.01, "fourties": 0.09, "fifties": 0.11, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.69, "male_masculine": 0.15, "female_feminine": 0.16, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 220932, "agriculture_food": 13, "automotive_transport": 4, "finance": 34, "service_retail": 9, "general": 179, "healthcare": 1, "history_law_government": 28, "language_fundamentals": 4, "media_entertainment": 28, "nature_environment": 13, "news_current_affairs": 9, "technology_robotics": 3 } }, "users": 5972, 
"size": 6530813674, "checksum": "8b5c52fb10c0ceda8f0b7d74c31884373f6be692fef71451f543cff03caf3624", "avgDurationSecs": 5.09, "validDurationSecs": 358798.741, "totalHrs": 312.8, "validHrs": 99.66 }, "gn": { "buckets": { "dev": 360, "test": 863, "train": 1578, "validated": 2930, "invalidated": 164, "other": 19399 }, "duration": 102441348, "reportedSentences": 36, "validatedSentences": 5894, "unvalidatedSentences": 292, "clips": 22493, "splits": { "accent": {}, "age": { "": 0.13, "twenties": 0.57, "thirties": 0.19, "teens": 0.02, "fourties": 0.08, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.14, "male_masculine": 0.38, "female_feminine": 0.48, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 22493, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 203, "size": 587661297, "checksum": "911578f233b385f06e6e60b146503ab3205bd00dce9cb3132ed80782736806bb", "avgDurationSecs": 4.554, "validDurationSecs": 13344.292, "totalHrs": 28.45, "validHrs": 3.7 }, "ha": { "buckets": { "dev": 608, "test": 709, "train": 1916, "validated": 3374, "invalidated": 173, "other": 6620 }, "duration": 44144064, "reportedSentences": 33, "validatedSentences": 5445, "unvalidatedSentences": 244, "clips": 10167, "splits": { "accent": {}, "age": { "": 0.18, "twenties": 0.12, "thirties": 0.67, "teens": 0, "fourties": 0, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.17, "male_masculine": 0.59, "female_feminine": 0.25, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 10167, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, 
"history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 51, "size": 258897602, "checksum": "02485733363a6310960ea5e066b2344a291aea15fbf07c624188a3fdd2325eb8", "avgDurationSecs": 4.342, "validDurationSecs": 14649.56, "totalHrs": 12.26, "validHrs": 4.06 }, "he": { "buckets": { "dev": 296, "test": 333, "train": 1187, "validated": 2230, "invalidated": 258, "other": 2765 }, "duration": 23790348, "reportedSentences": 33, "validatedSentences": 4099, "unvalidatedSentences": 6913, "clips": 5253, "splits": { "accent": {}, "age": { "": 0.05, "twenties": 0.73, "thirties": 0.16, "teens": 0, "fourties": 0.02, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0.03, "nineties": 0 }, "gender": { "": 0.06, "male_masculine": 0.92, "female_feminine": 0.02, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5253, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 43, "size": 138522774, "checksum": "40a1afbe28bf0700b2f05ea01abf3da130faf7098feacab07004d863af5b2181", "avgDurationSecs": 4.529, "validDurationSecs": 10099.462, "totalHrs": 6.6, "validHrs": 2.8 }, "hi": { "buckets": { "dev": 2696, "test": 3326, "train": 4865, "validated": 10965, "invalidated": 883, "other": 4336 }, "duration": 81003048, "reportedSentences": 146, "validatedSentences": 32204, "unvalidatedSentences": 9961, "clips": 16184, "splits": { "accent": {}, "age": { "": 0.33, "twenties": 0.32, "thirties": 0.24, "teens": 0.01, "fourties": 0.07, "fifties": 0.02, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male_masculine": 0.59, "female_feminine": 0.04, "transgender": 0, "non-binary": 0, 
"do_not_wish_to_say": 0 }, "sentence_domain": { "": 16184, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 447, "size": 480804511, "checksum": "48b5728bc49cdadfb78a66c365fe1a15f6393dc38dae979c95302307d07fdf2f", "avgDurationSecs": 5.005, "validDurationSecs": 54881.267, "totalHrs": 22.5, "validHrs": 15.24 }, "hsb": { "buckets": { "dev": 172, "test": 463, "train": 808, "validated": 1443, "invalidated": 247, "other": 218 }, "duration": 11915892, "reportedSentences": 130, "validatedSentences": 7177, "unvalidatedSentences": 30, "clips": 1908, "splits": { "accent": {}, "age": { "": 0.17, "twenties": 0.11, "thirties": 0.09, "teens": 0.07, "fourties": 0.48, "fifties": 0.03, "sixties": 0.02, "seventies": 0.03, "eighties": 0, "nineties": 0 }, "gender": { "": 0.21, "male_masculine": 0.78, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1908, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 24, "size": 90483823, "checksum": "213b69c6e6ea39016c5ae2c32dffc4c61aa6a45b2b3960189d914ffba7ccf253", "avgDurationSecs": 6.245, "validDurationSecs": 9011.862, "totalHrs": 3.3, "validHrs": 2.5 }, "ht": { "buckets": { "dev": 0, "test": 0, "train": 5, "validated": 5, "invalidated": 0, "other": 15 }, "duration": 105336, "reportedSentences": 0, "validatedSentences": 72, "unvalidatedSentences": 1633, "clips": 20, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, 
"eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 20, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 2, "size": 727282, "checksum": "742355acc35502154314b3997926c0370de890d99ab7729d49736de3f67c8386", "avgDurationSecs": 5.267, "validDurationSecs": 26.334, "totalHrs": 0.02, "validHrs": 0 }, "hu": { "buckets": { "dev": 11476, "test": 11563, "train": 38452, "validated": 61941, "invalidated": 3221, "other": 50735 }, "duration": 641425645, "reportedSentences": 513, "validatedSentences": 326050, "unvalidatedSentences": 184, "clips": 115897, "splits": { "accent": {}, "age": { "": 0.28, "twenties": 0.11, "thirties": 0.12, "teens": 0.02, "fourties": 0.11, "fifties": 0.33, "sixties": 0.01, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male_masculine": 0.24, "female_feminine": 0.47, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 115897, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1640, "size": 3772546093, "checksum": "b5815581d9df2ab34817e6c0b78c9f7936be0d3f866b6e6c0c44f5e9a16eec41", "avgDurationSecs": 5.534, "validDurationSecs": 342809.097, "totalHrs": 178.17, "validHrs": 95.22 }, "hy-AM": { "buckets": { "dev": 5706, "test": 5856, "train": 9270, "validated": 21045, "invalidated": 1209, "other": 15254 }, "duration": 202233276, "reportedSentences": 183, "validatedSentences": 237402, 
"unvalidatedSentences": 1353, "clips": 37508, "splits": { "accent": {}, "age": { "": 0.14, "twenties": 0.63, "thirties": 0.07, "teens": 0.11, "fourties": 0.04, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2, "male_masculine": 0.24, "female_feminine": 0.56, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 37506, "agriculture_food": 0, "automotive_transport": 0, "finance": 1, "service_retail": 0, "general": 1, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 434, "size": 1189710521, "checksum": "3b6afc134d275edfc79d28d7ae7af830ccdd399b76ccef3971a73116aee860a0", "avgDurationSecs": 5.392, "validDurationSecs": 113469.108, "totalHrs": 56.17, "validHrs": 31.51 }, "ia": { "buckets": { "dev": 1838, "test": 1840, "train": 4947, "validated": 11934, "invalidated": 346, "other": 2390 }, "duration": 61506660, "reportedSentences": 277, "validatedSentences": 9069, "unvalidatedSentences": 41, "clips": 14670, "splits": { "accent": {}, "age": { "": 0.38, "twenties": 0.05, "thirties": 0.02, "teens": 0, "fourties": 0.3, "fifties": 0.03, "sixties": 0, "seventies": 0.22, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male_masculine": 0.61, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 14670, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 70, "size": 417180982, "checksum": "0c3ed2626118551bbdc2fc6b6298438ca3b98b5ea905384994fb9be9ba85fb45", "avgDurationSecs": 4.193, "validDurationSecs": 50035.479, "totalHrs": 17.08, "validHrs": 13.89 }, "id": { 
"buckets": { "dev": 3459, "test": 3690, "train": 4973, "validated": 29784, "invalidated": 2769, "other": 26455 }, "duration": 236103528, "reportedSentences": 354, "validatedSentences": 13241, "unvalidatedSentences": 11913, "clips": 59008, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.41, "thirties": 0.1, "teens": 0.22, "fourties": 0.02, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.29, "male_masculine": 0.45, "female_feminine": 0.26, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 59008, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 604, "size": 1485674627, "checksum": "45ec154d8fdcebac0dbfd205569d75d2816e44aa078d1377ad05828765f22bd9", "avgDurationSecs": 4.001, "validDurationSecs": 119172.103, "totalHrs": 65.58, "validHrs": 33.1 }, "ig": { "buckets": { "dev": 2, "test": 5, "train": 9, "validated": 16, "invalidated": 3, "other": 5766 }, "duration": 31597848, "reportedSentences": 16, "validatedSentences": 14454, "unvalidatedSentences": 3857, "clips": 5785, "splits": { "accent": {}, "age": { "": 0.56, "twenties": 0.32, "thirties": 0.04, "teens": 0.05, "fourties": 0, "fifties": 0, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.56, "male_masculine": 0.14, "female_feminine": 0.3, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5785, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 118, "size": 185779133, "checksum": 
"28591f6c82628485e06f53091bfb9fea174d97c2126e49bfac204146f73f2dd3", "avgDurationSecs": 5.462, "validDurationSecs": 87.392, "totalHrs": 8.77, "validHrs": 0.02 }, "is": { "buckets": { "dev": 9, "test": 9, "train": 16, "validated": 34, "invalidated": 5, "other": 1 }, "duration": 284040, "reportedSentences": 0, "validatedSentences": 4993, "unvalidatedSentences": 202, "clips": 40, "splits": { "accent": {}, "age": { "": 0.88, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0.13, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.88, "male_masculine": 0.13, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 40, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 4, "size": 2052856, "checksum": "8ec476f3c955dc2e2079544e297cbdb96d82b67ab44e4599526ad203f215b9f0", "avgDurationSecs": 7.101, "validDurationSecs": 241.434, "totalHrs": 0.07, "validHrs": 0.06 }, "it": { "buckets": { "dev": 15162, "test": 15170, "train": 171780, "validated": 238736, "invalidated": 20194, "other": 13680 }, "duration": 1476230136, "reportedSentences": 5963, "validatedSentences": 924551, "unvalidatedSentences": 1430, "clips": 272610, "splits": { "accent": {}, "age": { "": 0.28, "twenties": 0.2, "thirties": 0.16, "teens": 0.01, "fourties": 0.13, "fifties": 0.14, "sixties": 0.08, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.28, "male_masculine": 0.62, "female_feminine": 0.1, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 272601, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 1, "general": 8, "healthcare": 1, "history_law_government": 2, "language_fundamentals": 2, 
"media_entertainment": 0, "nature_environment": 2, "news_current_affairs": 1, "technology_robotics": 0 } }, "users": 7218, "size": 10074274491, "checksum": "9c578e0e3a99b171e1ff160d19e39f7c8bb6d121cd7012b63aa9a24e8313d2a2", "avgDurationSecs": 5.415, "validDurationSecs": 1292796.588, "totalHrs": 410.06, "validHrs": 359.11 }, "ja": { "buckets": { "dev": 7786, "test": 7786, "train": 14633, "validated": 163010, "invalidated": 28191, "other": 249547 }, "duration": 2006489661, "reportedSentences": 680, "validatedSentences": 35907, "unvalidatedSentences": 4344, "clips": 440748, "splits": { "accent": {}, "age": { "": 0.15, "twenties": 0.56, "thirties": 0.07, "teens": 0.08, "fourties": 0.09, "fifties": 0.04, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male_masculine": 0.35, "female_feminine": 0.38, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 413286, "agriculture_food": 4229, "automotive_transport": 803, "finance": 753, "service_retail": 340, "general": 13375, "healthcare": 871, "history_law_government": 2081, "language_fundamentals": 2205, "media_entertainment": 3186, "nature_environment": 1322, "news_current_affairs": 556, "technology_robotics": 274 } }, "users": 5085, "size": 11816167814, "checksum": "843e09a81524af61f55813173d08f8c401795f84630e3cb30860b70c9c699ee7", "avgDurationSecs": 4.552, "validDurationSecs": 742097.252, "totalHrs": 557.35, "validHrs": 206.13 }, "ka": { "buckets": { "dev": 12835, "test": 13002, "train": 59860, "validated": 110923, "invalidated": 5922, "other": 95476 }, "duration": 1103913864, "reportedSentences": 962, "validatedSentences": 132312, "unvalidatedSentences": 1370, "clips": 212321, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0.21, "thirties": 0.34, "teens": 0.11, "fourties": 0.06, "fifties": 0.04, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.28, "male_masculine": 0.18, "female_feminine": 0.54, 
"transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 211880, "agriculture_food": 7, "automotive_transport": 5, "finance": 3, "service_retail": 7, "general": 201, "healthcare": 8, "history_law_government": 167, "language_fundamentals": 25, "media_entertainment": 16, "nature_environment": 8, "news_current_affairs": 8, "technology_robotics": 3 } }, "users": 2482, "size": 6458272763, "checksum": "712f5476dff0173c737475936ad30f7fb35299f6cded5039c37fa80f1ea07892", "avgDurationSecs": 5.199, "validDurationSecs": 576718.448, "totalHrs": 306.64, "validHrs": 160.19 }, "kab": { "buckets": { "dev": 15002, "test": 15002, "train": 152338, "validated": 613531, "invalidated": 19965, "other": 113160 }, "duration": 2494645324, "reportedSentences": 9718, "validatedSentences": 182746, "unvalidatedSentences": 205429, "clips": 746656, "splits": { "accent": {}, "age": { "": 0.28, "twenties": 0.12, "thirties": 0.29, "teens": 0, "fourties": 0.09, "fifties": 0.18, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male_masculine": 0.53, "female_feminine": 0.2, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 746656, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1579, "size": 18630647461, "checksum": "4a4d246eeb0715bcfe3243bb6c77ca81626e799975f7fe545d0228cc3dba82d3", "avgDurationSecs": 3.341, "validDurationSecs": 2049862.641, "totalHrs": 692.95, "validHrs": 569.4 }, "kk": { "buckets": { "dev": 483, "test": 521, "train": 607, "validated": 1617, "invalidated": 235, "other": 720 }, "duration": 12682440, "reportedSentences": 42, "validatedSentences": 6176, "unvalidatedSentences": 292, "clips": 2572, "splits": { "accent": {}, "age": { "": 0.54, 
"twenties": 0.24, "thirties": 0.03, "teens": 0.06, "fourties": 0.08, "fifties": 0.05, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.55, "male_masculine": 0.33, "female_feminine": 0.12, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2572, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 173, "size": 72514593, "checksum": "eb2b77e90f2caa679101131d3d9764a424fc76a93b79214f110f8bc3fbaa3774", "avgDurationSecs": 4.931, "validDurationSecs": 7973.369, "totalHrs": 3.52, "validHrs": 2.21 }, "kln": { "buckets": { "dev": 6408, "test": 6073, "train": 11064, "validated": 32296, "invalidated": 536, "other": 37366 }, "duration": 330271956, "reportedSentences": 0, "validatedSentences": 29900, "unvalidatedSentences": 61, "clips": 70198, "splits": { "accent": {}, "age": { "": 0.02, "twenties": 0.59, "thirties": 0.05, "teens": 0.05, "fourties": 0.29, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.27, "male_masculine": 0.03, "female_feminine": 0.7, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 4, "agriculture_food": 5, "automotive_transport": 16, "finance": 32, "service_retail": 625, "general": 68951, "healthcare": 54, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 2, "nature_environment": 23, "news_current_affairs": 2, "technology_robotics": 484 } }, "users": 41, "size": 1800757998, "checksum": "0381c6c550886ccebaa9a2075cae4f010ea7358cc3a6b1afd2dea40a44073511", "avgDurationSecs": 4.705, "validDurationSecs": 151948.248, "totalHrs": 91.74, "validHrs": 42.2 }, "kmr": { "buckets": { "dev": 3970, "test": 3964, "train": 5263, "validated": 61268, "invalidated": 
2584, "other": 25615 }, "duration": 366336108, "reportedSentences": 1058, "validatedSentences": 16805, "unvalidatedSentences": 945, "clips": 89467, "splits": { "accent": {}, "age": { "": 0.48, "twenties": 0.35, "thirties": 0.09, "teens": 0.01, "fourties": 0.03, "fifties": 0.04, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.48, "male_masculine": 0.42, "female_feminine": 0.09, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 89459, "agriculture_food": 1, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 4, "healthcare": 0, "history_law_government": 2, "language_fundamentals": 1, "media_entertainment": 1, "nature_environment": 2, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 578, "size": 1943758770, "checksum": "e4b6e980a3f8ac30ed97838823358ab3c2b8978de661716021f8c3159007a7d5", "avgDurationSecs": 4.095, "validDurationSecs": 250871.055, "totalHrs": 101.76, "validHrs": 69.68 }, "ko": { "buckets": { "dev": 425, "test": 447, "train": 502, "validated": 1375, "invalidated": 361, "other": 3598 }, "duration": 28641240, "reportedSentences": 29, "validatedSentences": 7165, "unvalidatedSentences": 2709, "clips": 5334, "splits": { "accent": {}, "age": { "": 0.13, "twenties": 0.53, "thirties": 0.24, "teens": 0.04, "fourties": 0.05, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.32, "male_masculine": 0.42, "female_feminine": 0.26, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5331, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 1, "general": 0, "healthcare": 0, "history_law_government": 1, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 1 } }, "users": 137, "size": 167578592, "checksum": "025b56e2b4354aae15256949ba7fe118b462fab8524d907fa4040bace748d4b6", 
"avgDurationSecs": 5.37, "validDurationSecs": 7383.147, "totalHrs": 7.95, "validHrs": 2.05 }, "ky": { "buckets": { "dev": 1615, "test": 1615, "train": 1789, "validated": 30608, "invalidated": 5672, "other": 1539 }, "duration": 172213716, "reportedSentences": 39, "validatedSentences": 5027, "unvalidatedSentences": 259, "clips": 37819, "splits": { "accent": {}, "age": { "": 0.07, "twenties": 0.65, "thirties": 0.09, "teens": 0.17, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.12, "male_masculine": 0.54, "female_feminine": 0.34, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 37819, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 291, "size": 1107733812, "checksum": "3b97cfaa402c77f675a95c7c62bfc20774a80372195c442d84c352034f00984f", "avgDurationSecs": 4.554, "validDurationSecs": 139377.493, "totalHrs": 47.83, "validHrs": 38.71 }, "lg": { "buckets": { "dev": 13344, "test": 13366, "train": 71074, "validated": 271832, "invalidated": 39178, "other": 37072 }, "duration": 2013266727, "reportedSentences": 6451, "validatedSentences": 191409, "unvalidatedSentences": 172, "clips": 348082, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.38, "thirties": 0.22, "teens": 0.01, "fourties": 0.07, "fifties": 0.05, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.25, "male_masculine": 0.36, "female_feminine": 0.39, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 348082, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, 
"nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 659, "size": 11819889167, "checksum": "5c869b0c3bbd5836b3a2e44b27eedf39d04bea4da499c76f75aee1c86ce19a9f", "avgDurationSecs": 5.784, "validDurationSecs": 1572245.393, "totalHrs": 559.24, "validHrs": 436.73 }, "lij": { "buckets": { "dev": 1259, "test": 1344, "train": 1725, "validated": 4409, "invalidated": 561, "other": 213 }, "duration": 20307384, "reportedSentences": 373, "validatedSentences": 5678, "unvalidatedSentences": 446, "clips": 5183, "splits": { "accent": {}, "age": { "": 0.18, "twenties": 0.07, "thirties": 0.46, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0.17, "seventies": 0.12, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male_masculine": 0.45, "female_feminine": 0.29, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 4848, "agriculture_food": 0, "automotive_transport": 1, "finance": 0, "service_retail": 0, "general": 335, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 1, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 35, "size": 112976667, "checksum": "34a98990e45ef4c923b1174f210ca42cabcce29405889839d5f3c88d649cba14", "avgDurationSecs": 3.918, "validDurationSecs": 17274.794, "totalHrs": 5.64, "validHrs": 4.79 }, "lo": { "buckets": { "dev": 24, "test": 23, "train": 83, "validated": 130, "invalidated": 8, "other": 70 }, "duration": 1342152, "reportedSentences": 0, "validatedSentences": 5145, "unvalidatedSentences": 5925, "clips": 208, "splits": { "accent": {}, "age": { "": 0.07, "twenties": 0.33, "thirties": 0.6, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.07, "male_masculine": 0.93, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 208, "agriculture_food": 0, "automotive_transport": 
0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 13, "size": 8667647, "checksum": "7acda03333851716f329ea07e1057a2c77476317beb1ab4fbec8ad43ab5b2d5c", "avgDurationSecs": 6.453, "validDurationSecs": 838.845, "totalHrs": 0.37, "validHrs": 0.23 }, "lt": { "buckets": { "dev": 4815, "test": 4930, "train": 7339, "validated": 17286, "invalidated": 849, "other": 3771 }, "duration": 112688972, "reportedSentences": 191, "validatedSentences": 129253, "unvalidatedSentences": 4810, "clips": 21906, "splits": { "accent": {}, "age": { "": 0.17, "twenties": 0.29, "thirties": 0.26, "teens": 0.02, "fourties": 0.21, "fifties": 0.04, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.35, "male_masculine": 0.47, "female_feminine": 0.18, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 21906, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 309, "size": 691721794, "checksum": "787d852ec6503f38f7c578dfee566ac6d165475d1b69157d8708d9044dfa37cf", "avgDurationSecs": 5.144, "validDurationSecs": 88922.741, "totalHrs": 31.3, "validHrs": 24.7 }, "ltg": { "buckets": { "dev": 3377, "test": 3421, "train": 4403, "validated": 20299, "invalidated": 1795, "other": 781 }, "duration": 105267708, "reportedSentences": 66, "validatedSentences": 11634, "unvalidatedSentences": 4171, "clips": 22875, "splits": { "accent": {}, "age": { "": 0.32, "twenties": 0.09, "thirties": 0.24, "teens": 0, "fourties": 0.35, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.32, 
"male_masculine": 0.06, "female_feminine": 0.63, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 19139, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 3736, "healthcare": 0, "history_law_government": 402, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 335, "size": 606016722, "checksum": "50a0f1bd115d298ef956716428ce9744182aea57171eecf009e43a23153e25a5", "avgDurationSecs": 4.602, "validDurationSecs": 93413.299, "totalHrs": 29.24, "validHrs": 25.94 }, "luo": { "buckets": { "dev": 3060, "test": 3000, "train": 4498, "validated": 20342, "invalidated": 792, "other": 61577 }, "duration": 430492824, "reportedSentences": 8, "validatedSentences": 14692, "unvalidatedSentences": 13870, "clips": 82711, "splits": { "accent": {}, "age": { "": 0.36, "twenties": 0.33, "thirties": 0.28, "teens": 0, "fourties": 0, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.6, "male_masculine": 0, "female_feminine": 0.4, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 11, "agriculture_food": 1135, "automotive_transport": 341, "finance": 967, "service_retail": 2746, "general": 69465, "healthcare": 2236, "history_law_government": 864, "language_fundamentals": 555, "media_entertainment": 806, "nature_environment": 634, "news_current_affairs": 2710, "technology_robotics": 241 } }, "users": 44, "size": 2391434805, "checksum": "49cacebc580834ea7b7090df3b891bffd9461c4064670d7be5bb891046de508d", "avgDurationSecs": 5.205, "validDurationSecs": 105875.7, "totalHrs": 119.58, "validHrs": 29.4 }, "lv": { "buckets": { "dev": 7578, "test": 7578, "train": 13916, "validated": 193876, "invalidated": 11218, "other": 19839 }, "duration": 1072375325, "reportedSentences": 3131, "validatedSentences": 33449, "unvalidatedSentences": 65835, "clips": 
224933, "splits": { "accent": {}, "age": { "": 0.44, "twenties": 0.06, "thirties": 0.29, "teens": 0.01, "fourties": 0.09, "fifties": 0.08, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.45, "male_masculine": 0.25, "female_feminine": 0.3, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 224344, "agriculture_food": 5, "automotive_transport": 3, "finance": 2, "service_retail": 11, "general": 496, "healthcare": 22, "history_law_government": 5, "language_fundamentals": 8, "media_entertainment": 9, "nature_environment": 27, "news_current_affairs": 4, "technology_robotics": 8 } }, "users": 6299, "size": 6003287936, "checksum": "733246bafe7947cdf680b463e620d7384bc776984320929e16b6e1c15432bb02", "avgDurationSecs": 4.768, "validDurationSecs": 924310.077, "totalHrs": 297.88, "validHrs": 256.75 }, "mdf": { "buckets": { "dev": 54, "test": 107, "train": 175, "validated": 336, "invalidated": 9, "other": 0 }, "duration": 1811340, "reportedSentences": 15, "validatedSentences": 6039, "unvalidatedSentences": 4, "clips": 345, "splits": { "accent": {}, "age": { "": 0.58, "twenties": 0.02, "thirties": 0, "teens": 0, "fourties": 0.34, "fifties": 0, "sixties": 0.06, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.58, "male_masculine": 0.08, "female_feminine": 0.34, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 345, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 11, "size": 11056920, "checksum": "56d35bda3d7f478844ff8663c32066c5cc993a4c85f03317cd08190a1278f352", "avgDurationSecs": 5.25, "validDurationSecs": 1764.088, "totalHrs": 0.5, "validHrs": 0.49 }, "mhr": { "buckets": { "dev": 14663, "test": 15129, "train": 186565, 
"validated": 218721, "invalidated": 16025, "other": 25 }, "duration": 1085385744, "reportedSentences": 102, "validatedSentences": 328269, "unvalidatedSentences": 3, "clips": 234771, "splits": { "accent": {}, "age": { "": 0.12, "twenties": 0.33, "thirties": 0.21, "teens": 0.14, "fourties": 0.12, "fifties": 0.04, "sixties": 0.03, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.12, "male_masculine": 0.19, "female_feminine": 0.69, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 234771, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 499, "size": 6124145587, "checksum": "0ce20cc001f9df9cea34b29ffdcec18d2aabc0b98b66e88416f7f27dd1f33a88", "avgDurationSecs": 4.623, "validDurationSecs": 1011183.9, "totalHrs": 301.49, "validHrs": 280.88 }, "mk": { "buckets": { "dev": 1478, "test": 1301, "train": 2130, "validated": 10110, "invalidated": 588, "other": 21145 }, "duration": 149614560, "reportedSentences": 93, "validatedSentences": 5759, "unvalidatedSentences": 890, "clips": 31843, "splits": { "accent": {}, "age": { "": 0.08, "twenties": 0.44, "thirties": 0.32, "teens": 0.01, "fourties": 0.09, "fifties": 0.02, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.22, "male_masculine": 0.35, "female_feminine": 0.43, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 31744, "agriculture_food": 12, "automotive_transport": 0, "finance": 6, "service_retail": 0, "general": 71, "healthcare": 13, "history_law_government": 2, "language_fundamentals": 0, "media_entertainment": 6, "nature_environment": 7, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 278, "size": 827555491, "checksum": 
"b427d5f4020a113ca46f784b2894f3df7150a7aa12aa2c9d73c337b9aa444ed3", "avgDurationSecs": 4.699, "validDurationSecs": 47501.906, "totalHrs": 41.55, "validHrs": 13.19 }, "ml": { "buckets": { "dev": 915, "test": 858, "train": 1235, "validated": 3401, "invalidated": 119, "other": 5960 }, "duration": 39971268, "reportedSentences": 243, "validatedSentences": 5485, "unvalidatedSentences": 4728, "clips": 9480, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0.59, "thirties": 0.08, "teens": 0, "fourties": 0.11, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0.48, "female_feminine": 0.19, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 9480, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 144, "size": 226627888, "checksum": "f6966dbfcfcd463c2d9195745fe563e05508aa884229993a0d6889f076d31d88", "avgDurationSecs": 4.216, "validDurationSecs": 14339.903, "totalHrs": 11.1, "validHrs": 3.98 }, "mn": { "buckets": { "dev": 1932, "test": 1932, "train": 2194, "validated": 33259, "invalidated": 3180, "other": 59814 }, "duration": 505642436, "reportedSentences": 46, "validatedSentences": 6091, "unvalidatedSentences": 12, "clips": 96253, "splits": { "accent": {}, "age": { "": 0.04, "twenties": 0.92, "thirties": 0.03, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0.01 }, "gender": { "": 0.46, "male_masculine": 0.14, "female_feminine": 0.4, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 96238, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 15, "healthcare": 0, "history_law_government": 0, 
"language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 580, "size": 3075868424, "checksum": "fe0a314f977ff391c4433bfe9c68865e3572945951b884852872c651317903b5", "avgDurationSecs": 5.253, "validDurationSecs": 174718.313, "totalHrs": 140.45, "validHrs": 48.53 }, "mr": { "buckets": { "dev": 1767, "test": 1793, "train": 2191, "validated": 10985, "invalidated": 2287, "other": 2772 }, "duration": 99429120, "reportedSentences": 59, "validatedSentences": 6715, "unvalidatedSentences": 12104, "clips": 16044, "splits": { "accent": {}, "age": { "": 0.05, "twenties": 0.29, "thirties": 0.13, "teens": 0.53, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.05, "male_masculine": 0.19, "female_feminine": 0.75, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 16044, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 98, "size": 584717636, "checksum": "6838278c3827ba1fe563694bdf0ba9439115deb0a6cb404d84817c3f5af55aa3", "avgDurationSecs": 6.197, "validDurationSecs": 68077.093, "totalHrs": 27.61, "validHrs": 18.91 }, "mrj": { "buckets": { "dev": 7314, "test": 7111, "train": 14325, "validated": 28920, "invalidated": 1249, "other": 1509 }, "duration": 132771780, "reportedSentences": 50, "validatedSentences": 54143, "unvalidatedSentences": 4, "clips": 31678, "splits": { "accent": {}, "age": { "": 0.1, "twenties": 0.35, "thirties": 0.22, "teens": 0.06, "fourties": 0.12, "fifties": 0.11, "sixties": 0.04, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.13, "male_masculine": 0.25, "female_feminine": 0.62, "transgender": 0, "non-binary": 0, 
"do_not_wish_to_say": 0 }, "sentence_domain": { "": 31678, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 60, "size": 746586546, "checksum": "fc9913b6456d9b6c86714267dfd6569e256576f79f1c9b54d9f345fbb6fc54fe", "avgDurationSecs": 4.191, "validDurationSecs": 121212.194, "totalHrs": 36.88, "validHrs": 33.67 }, "mt": { "buckets": { "dev": 1627, "test": 1654, "train": 1913, "validated": 6576, "invalidated": 326, "other": 6305 }, "duration": 62787780, "reportedSentences": 10, "validatedSentences": 5273, "unvalidatedSentences": 11773, "clips": 13207, "splits": { "accent": {}, "age": { "": 0.27, "twenties": 0.19, "thirties": 0.09, "teens": 0.03, "fourties": 0.17, "fifties": 0.25, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.28, "male_masculine": 0.25, "female_feminine": 0.47, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 13207, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 222, "size": 466484659, "checksum": "4f832199e35c088b8da578ee404e87f6451998bd49372b6a522ee82c2c68fe84", "avgDurationSecs": 4.754, "validDurationSecs": 31263.151, "totalHrs": 17.44, "validHrs": 8.68 }, "myv": { "buckets": { "dev": 239, "test": 478, "train": 1241, "validated": 1968, "invalidated": 26, "other": 1 }, "duration": 11519388, "reportedSentences": 31, "validatedSentences": 5066, "unvalidatedSentences": 7, "clips": 1995, "splits": { "accent": {}, "age": { "": 0.38, "twenties": 0.09, "thirties": 0.24, "teens": 0.01, "fourties": 0.02, "fifties": 
0, "sixties": 0.26, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male_masculine": 0.54, "female_feminine": 0.07, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1995, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 13, "size": 67697560, "checksum": "c8c899378b233bd0a9e9a0fe59ad5883c1423d0d0869cddf101c9edca345f0f1", "avgDurationSecs": 5.774, "validDurationSecs": 11363.487, "totalHrs": 3.19, "validHrs": 3.15 }, "nan-tw": { "buckets": { "dev": 5334, "test": 5918, "train": 10529, "validated": 25071, "invalidated": 742, "other": 4589 }, "duration": 80175888, "reportedSentences": 226, "validatedSentences": 24804, "unvalidatedSentences": 75, "clips": 30402, "splits": { "accent": {}, "age": { "": 0.12, "twenties": 0.51, "thirties": 0.15, "teens": 0.01, "fourties": 0.17, "fifties": 0.01, "sixties": 0.03, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.25, "male_masculine": 0.64, "female_feminine": 0.11, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 30336, "agriculture_food": 2, "automotive_transport": 0, "finance": 0, "service_retail": 1, "general": 17, "healthcare": 1, "history_law_government": 0, "language_fundamentals": 46, "media_entertainment": 0, "nature_environment": 46, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 257, "size": 457238267, "checksum": "8fe8ef56fa85d95d04f23411626ee5b2b397002f0501013a56afeebee53ab40c", "avgDurationSecs": 2.637, "validDurationSecs": 66117.022, "totalHrs": 22.27, "validHrs": 18.36 }, "ne-NP": { "buckets": { "dev": 245, "test": 240, "train": 274, "validated": 825, "invalidated": 72, "other": 630 }, "duration": 6130224, "reportedSentences": 5, 
"validatedSentences": 1300, "unvalidatedSentences": 45, "clips": 1527, "splits": { "accent": {}, "age": { "": 0.28, "twenties": 0.59, "thirties": 0.13, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.32, "male_masculine": 0.67, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1527, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 46, "size": 35948203, "checksum": "f2f20ff7c7d0b23b9c16dffcf886cffbfbcf66bbdc2c1eaadb3b26329a7c104e", "avgDurationSecs": 4.015, "validDurationSecs": 3312.007, "totalHrs": 1.7, "validHrs": 0.92 }, "nhi": { "buckets": { "dev": 10, "test": 5, "train": 22, "validated": 37, "invalidated": 0, "other": 53 }, "duration": 406548, "reportedSentences": 0, "validatedSentences": 758, "unvalidatedSentences": 9, "clips": 90, "splits": { "accent": {}, "age": { "": 0.61, "twenties": 0, "thirties": 0.39, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.61, "male_masculine": 0.39, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 90, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 3, "size": 2433668, "checksum": "d66f031a9260f9417679a64b0127884e3d880f87dde7711b52416421f6d7bc29", "avgDurationSecs": 4.517, "validDurationSecs": 167.136, "totalHrs": 0.11, "validHrs": 0.04 }, "nl": { "buckets": { "dev": 11501, "test": 11501, 
"train": 37240, "validated": 93288, "invalidated": 5694, "other": 2828 }, "duration": 444606517, "reportedSentences": 344, "validatedSentences": 259007, "unvalidatedSentences": 685, "clips": 101810, "splits": { "accent": {}, "age": { "": 0.4, "twenties": 0.2, "thirties": 0.13, "teens": 0.02, "fourties": 0.15, "fifties": 0.08, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.41, "male_masculine": 0.48, "female_feminine": 0.11, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 101800, "agriculture_food": 0, "automotive_transport": 4, "finance": 0, "service_retail": 0, "general": 4, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 2, "news_current_affairs": 0, "technology_robotics": 1 } }, "users": 1769, "size": 3024675526, "checksum": "5598af8d7aa7350a6e327ad97175aed67005c8ad8a6b43ed23c69e653b1eef59", "avgDurationSecs": 4.367, "validDurationSecs": 407390.755, "totalHrs": 123.5, "validHrs": 113.16 }, "nn-NO": { "buckets": { "dev": 283, "test": 385, "train": 545, "validated": 1218, "invalidated": 70, "other": 86 }, "duration": 6016860, "reportedSentences": 23, "validatedSentences": 5272, "unvalidatedSentences": 52, "clips": 1374, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.28, "thirties": 0.41, "teens": 0.02, "fourties": 0.03, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.28, "male_masculine": 0.58, "female_feminine": 0.13, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1374, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 39, "size": 34977181, "checksum": 
"d949c3dceaaa91bce2ce1fc5adb21adc3f4e479c03af30883a826ee767c1d67a", "avgDurationSecs": 4.379, "validDurationSecs": 5333.723, "totalHrs": 1.67, "validHrs": 1.48 }, "nr": { "buckets": { "dev": 0, "test": 0, "train": 0, "validated": 0, "invalidated": 0, "other": 2 }, "duration": 19476, "reportedSentences": 0, "validatedSentences": 319, "unvalidatedSentences": 7468, "clips": 2, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1, "size": 637951, "checksum": "8067759f20184db51c0e89fb5787e971aa35049ba8db7199675b83d2b8a99037", "avgDurationSecs": 9.738, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "nso": { "buckets": { "dev": 0, "test": 0, "train": 2, "validated": 2, "invalidated": 0, "other": 18 }, "duration": 92592, "reportedSentences": 0, "validatedSentences": 2244, "unvalidatedSentences": 14227, "clips": 20, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 20, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } 
}, "users": 1, "size": 1552773, "checksum": "3ca2fa8a5b5e79540b3c5a83e65036f2c118fedf3027a86e2265e47bbaa7396c", "avgDurationSecs": 4.63, "validDurationSecs": 9.259, "totalHrs": 0.02, "validHrs": 0 }, "oc": { "buckets": { "dev": 266, "test": 274, "train": 305, "validated": 1955, "invalidated": 202, "other": 7693 }, "duration": 47998944, "reportedSentences": 10, "validatedSentences": 4929, "unvalidatedSentences": 10, "clips": 9850, "splits": { "accent": {}, "age": { "": 0.5, "twenties": 0.01, "thirties": 0.04, "teens": 0.01, "fourties": 0.17, "fifties": 0.16, "sixties": 0.11, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.52, "male_masculine": 0.08, "female_feminine": 0.4, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 9850, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 146, "size": 273188337, "checksum": "bdb0b2bfdd95b4ff2f9ad15b7d2e93dc3f9e1436f259a9fc7850d83804ff9980", "avgDurationSecs": 4.873, "validDurationSecs": 9526.694, "totalHrs": 13.33, "validHrs": 2.64 }, "or": { "buckets": { "dev": 630, "test": 474, "train": 2149, "validated": 4041, "invalidated": 431, "other": 19421 }, "duration": 133095660, "reportedSentences": 100, "validatedSentences": 6195, "unvalidatedSentences": 89, "clips": 23893, "splits": { "accent": {}, "age": { "": 0.06, "twenties": 0.66, "thirties": 0.27, "teens": 0, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.42, "male_masculine": 0.34, "female_feminine": 0.03, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.21 }, "sentence_domain": { "": 23886, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 7, "healthcare": 0, 
"history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 156, "size": 823685635, "checksum": "98230b027614a6ce946bec172e7261c77ac647a5a0dca09a55b1e7174f98eadc", "avgDurationSecs": 5.57, "validDurationSecs": 22510.34, "totalHrs": 36.97, "validHrs": 6.25 }, "os": { "buckets": { "dev": 84, "test": 121, "train": 257, "validated": 465, "invalidated": 42, "other": 221 }, "duration": 3975948, "reportedSentences": 3, "validatedSentences": 1155, "unvalidatedSentences": 727, "clips": 728, "splits": { "accent": {}, "age": { "": 0.2, "twenties": 0.12, "thirties": 0.45, "teens": 0.03, "fourties": 0.2, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.77, "male_masculine": 0.23, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 728, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 20, "size": 23196502, "checksum": "5dab24f662aa5ed4123b4c28b2cf346140754b4efe69111642cb572663e2a6b4", "avgDurationSecs": 5.461, "validDurationSecs": 2539.582, "totalHrs": 1.1, "validHrs": 0.7 }, "pa-IN": { "buckets": { "dev": 404, "test": 536, "train": 788, "validated": 1728, "invalidated": 91, "other": 1252 }, "duration": 14820614, "reportedSentences": 251, "validatedSentences": 14822, "unvalidatedSentences": 21, "clips": 3071, "splits": { "accent": {}, "age": { "": 0.29, "twenties": 0.23, "thirties": 0.39, "teens": 0, "fourties": 0.04, "fifties": 0.05, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.32, "male_masculine": 0.67, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, 
"sentence_domain": { "": 3071, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 70, "size": 104687359, "checksum": "89dc50097a37a43bddaf320aed2df54632304f1ddd1ec87d20be235e30a9a1ca", "avgDurationSecs": 4.826, "validDurationSecs": 8339.31, "totalHrs": 4.11, "validHrs": 2.31 }, "pl": { "buckets": { "dev": 9667, "test": 9667, "train": 23071, "validated": 135886, "invalidated": 6822, "other": 1743 }, "duration": 658321310, "reportedSentences": 586, "validatedSentences": 229712, "unvalidatedSentences": 10981, "clips": 144451, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.27, "thirties": 0.34, "teens": 0.02, "fourties": 0.11, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0.01 }, "gender": { "": 0.28, "male_masculine": 0.58, "female_feminine": 0.14, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 144449, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 2, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 3374, "size": 4676855775, "checksum": "daed31407d5b2c886d18ccb89c7520238961755ebef80f0032058698da0b2065", "avgDurationSecs": 4.557, "validDurationSecs": 619287.16, "totalHrs": 182.86, "validHrs": 172.02 }, "ps": { "buckets": { "dev": 2801, "test": 2801, "train": 3368, "validated": 46538, "invalidated": 3625, "other": 33077 }, "duration": 414484236, "reportedSentences": 990, "validatedSentences": 15151, "unvalidatedSentences": 2515, "clips": 83240, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.37, "thirties": 0.17, "teens": 0.08, "fourties": 0.11, "fifties": 
0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.98, "male_masculine": 0.02, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 47484, "agriculture_food": 629, "automotive_transport": 304, "finance": 107, "service_retail": 436, "general": 30457, "healthcare": 652, "history_law_government": 2025, "language_fundamentals": 1921, "media_entertainment": 1036, "nature_environment": 757, "news_current_affairs": 686, "technology_robotics": 305 } }, "users": 1231, "size": 2309933190, "checksum": "386b25cdfa49881d7b863d537faf200bb791915a8048a8bc541e73b254878df2", "avgDurationSecs": 4.979, "validDurationSecs": 231730.747, "totalHrs": 115.13, "validHrs": 64.36 }, "pt": { "buckets": { "dev": 9577, "test": 9579, "train": 22580, "validated": 156753, "invalidated": 7633, "other": 25639 }, "duration": 788421715, "reportedSentences": 2835, "validatedSentences": 43329, "unvalidatedSentences": 99, "clips": 190025, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.37, "thirties": 0.18, "teens": 0.02, "fourties": 0.13, "fifties": 0.03, "sixties": 0.03, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.27, "male_masculine": 0.68, "female_feminine": 0.06, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 189333, "agriculture_food": 18, "automotive_transport": 28, "finance": 14, "service_retail": 13, "general": 505, "healthcare": 27, "history_law_government": 36, "language_fundamentals": 14, "media_entertainment": 39, "nature_environment": 17, "news_current_affairs": 0, "technology_robotics": 66 } }, "users": 3616, "size": 4955762330, "checksum": "0323c6b99d1618868fa558c5022a1765612b7f67c99d238eebb73de09b37625f", "avgDurationSecs": 4.149, "validDurationSecs": 650374.788, "totalHrs": 219, "validHrs": 180.65 }, "quy": { "buckets": { "dev": 0, "test": 1, "train": 3, "validated": 4, "invalidated": 2, "other": 15 }, "duration": 113364, 
"reportedSentences": 0, "validatedSentences": 751, "unvalidatedSentences": 289, "clips": 21, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 21, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 4, "size": 731949, "checksum": "4ead29b9b0e0c69924f93e263f2154c852450eb66a126f5ba1e3179bfe1fd20b", "avgDurationSecs": 5.398, "validDurationSecs": 21.593, "totalHrs": 0.03, "validHrs": 0 }, "rm-sursilv": { "buckets": { "dev": 1365, "test": 1387, "train": 1589, "validated": 4356, "invalidated": 684, "other": 2464 }, "duration": 40806869, "reportedSentences": 17, "validatedSentences": 8780, "unvalidatedSentences": 7455, "clips": 7504, "splits": { "accent": {}, "age": { "": 4550, "twenties": 705, "thirties": 529, "teens": 390, "fourties": 1330, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 4893, "male_masculine": 1277, "female_feminine": 1334, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7504, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 91, "size": 306292643, "checksum": "f25dea28d2ce77b641fe127f131810d72e8367a509bc33a45d6cea517ac4a224", "avgDurationSecs": 5.438, "validDurationSecs": 23687.996, "totalHrs": 11.33, "validHrs": 6.57 }, 
"rm-vallader": { "buckets": { "dev": 500, "test": 462, "train": 557, "validated": 1526, "invalidated": 394, "other": 711 }, "duration": 15327554, "reportedSentences": 38, "validatedSentences": 5913, "unvalidatedSentences": 1172, "clips": 2631, "splits": { "accent": {}, "age": { "": 949, "twenties": 367, "thirties": 150, "teens": 0, "fourties": 1080, "fifties": 10, "sixties": 75, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 964, "male_masculine": 1161, "female_feminine": 506, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2631, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 53, "size": 117356544, "checksum": "070e9cb69a81106c943434193c40fe98022058bf2f52fc9de124438e92d4420a", "avgDurationSecs": 5.826, "validDurationSecs": 8890.098, "totalHrs": 4.25, "validHrs": 2.46 }, "ro": { "buckets": { "dev": 3903, "test": 3905, "train": 5141, "validated": 18169, "invalidated": 999, "other": 23104 }, "duration": 170373163, "reportedSentences": 420, "validatedSentences": 13743, "unvalidatedSentences": 89199, "clips": 42272, "splits": { "accent": {}, "age": { "": 0.15, "twenties": 0.61, "thirties": 0.13, "teens": 0.01, "fourties": 0.08, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.15, "male_masculine": 0.72, "female_feminine": 0.13, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 42272, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 437, "size": 1063042869, 
"checksum": "8edc3b2768a332b8d465f2f71ad622f72304573060f82f77e000d3e9532fcf48", "avgDurationSecs": 4.03, "validDurationSecs": 73228.378, "totalHrs": 47.32, "validHrs": 20.34 }, "ru": { "buckets": { "dev": 10217, "test": 10220, "train": 26498, "validated": 168633, "invalidated": 10342, "other": 16240 }, "duration": 1010110032, "reportedSentences": 541, "validatedSentences": 47531, "unvalidatedSentences": 486, "clips": 195215, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.33, "thirties": 0.17, "teens": 0.09, "fourties": 0.14, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.25, "male_masculine": 0.6, "female_feminine": 0.15, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 195197, "agriculture_food": 0, "automotive_transport": 1, "finance": 1, "service_retail": 6, "general": 10, "healthcare": 2, "history_law_government": 4, "language_fundamentals": 0, "media_entertainment": 1, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 3365, "size": 6810860947, "checksum": "525461b8ab3df2ef98d14a7c8f26a74497aa3bc471f1f17159a8d43a93c88375", "avgDurationSecs": 5.174, "validDurationSecs": 872565.556, "totalHrs": 280.58, "validHrs": 242.37 }, "rw": { "buckets": { "dev": 15987, "test": 16213, "train": 1003033, "validated": 1438996, "invalidated": 227819, "other": 47293 }, "duration": 8583228617, "reportedSentences": 630, "validatedSentences": 1404856, "unvalidatedSentences": 4, "clips": 1714108, "splits": { "accent": {}, "age": { "": 0.05, "twenties": 0.61, "thirties": 0.12, "teens": 0.2, "fourties": 0.02, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.1, "male_masculine": 0.57, "female_feminine": 0.33, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1714108, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, 
"healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1154, "size": 61198191971, "checksum": "f3b00094579052557cf63908f0152d57d7ed3d834a73255d2def0c9cc8ee91a9", "avgDurationSecs": 5.007, "validDurationSecs": 7205632.111, "totalHrs": 2384.23, "validHrs": 2001.56 }, "sah": { "buckets": { "dev": 1398, "test": 1478, "train": 1875, "validated": 4912, "invalidated": 153, "other": 2472 }, "duration": 46254288, "reportedSentences": 11, "validatedSentences": 5989, "unvalidatedSentences": 468, "clips": 7537, "splits": { "accent": {}, "age": { "": 0.36, "twenties": 0.25, "thirties": 0.24, "teens": 0.06, "fourties": 0.06, "fifties": 0.04, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.36, "male_masculine": 0.38, "female_feminine": 0.26, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7537, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 117, "size": 314028821, "checksum": "4db2ef24526a0f5b9eacffe46616e5a43970243998abceee84d0fb711f91b585", "avgDurationSecs": 6.137, "validDurationSecs": 30144.761, "totalHrs": 12.84, "validHrs": 8.37 }, "sat": { "buckets": { "dev": 63, "test": 125, "train": 332, "validated": 520, "invalidated": 17, "other": 312 }, "duration": 3801708, "reportedSentences": 7, "validatedSentences": 5421, "unvalidatedSentences": 540, "clips": 849, "splits": { "accent": {}, "age": { "": 0.42, "twenties": 0.42, "thirties": 0.12, "teens": 0.01, "fourties": 0.01, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.4, "male_masculine": 0.59, "female_feminine": 0.01, "transgender": 0, 
"non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 849, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 13, "size": 21352800, "checksum": "bbdd77609991531d17ccefd5739cdc1767b54c46f477045a29ff14b1c0623cfa", "avgDurationSecs": 4.478, "validDurationSecs": 2328.49, "totalHrs": 1.05, "validHrs": 0.64 }, "sc": { "buckets": { "dev": 496, "test": 448, "train": 928, "validated": 2045, "invalidated": 64, "other": 140 }, "duration": 10414512, "reportedSentences": 8, "validatedSentences": 5403, "unvalidatedSentences": 76, "clips": 2249, "splits": { "accent": {}, "age": { "": 0.53, "twenties": 0.09, "thirties": 0.3, "teens": 0, "fourties": 0.01, "fifties": 0.04, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.6, "male_masculine": 0.15, "female_feminine": 0.25, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2249, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 30, "size": 60582724, "checksum": "c76f2fdf79a1bdef91594479380e3753a973fe0aa9472a2ec75aaa409a3cee8d", "avgDurationSecs": 4.631, "validDurationSecs": 9469.843, "totalHrs": 2.89, "validHrs": 2.63 }, "sd": { "buckets": { "dev": 0, "test": 36, "train": 253, "validated": 289, "invalidated": 68, "other": 10384 }, "duration": 47441412, "reportedSentences": 2, "validatedSentences": 13330, "unvalidatedSentences": 24, "clips": 10741, "splits": { "accent": {}, "age": { "": 0.03, "twenties": 0, "thirties": 0.16, "teens": 0, "fourties": 0.8, "fifties": 0, "sixties": 0, 
"seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.84, "male_masculine": 0.16, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 10715, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 25, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 1, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 17, "size": 244071581, "checksum": "d60ebc56a39e67340278525515f8ebec524d61cf1c21724d248270f355699189", "avgDurationSecs": 4.417, "validDurationSecs": 1276.47, "totalHrs": 13.17, "validHrs": 0.35 }, "sk": { "buckets": { "dev": 2619, "test": 2679, "train": 3264, "validated": 20547, "invalidated": 917, "other": 2583 }, "duration": 98055720, "reportedSentences": 196, "validatedSentences": 9384, "unvalidatedSentences": 172, "clips": 24047, "splits": { "accent": {}, "age": { "": 0.43, "twenties": 0.08, "thirties": 0.21, "teens": 0.09, "fourties": 0.18, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.44, "male_masculine": 0.49, "female_feminine": 0.07, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 24044, "agriculture_food": 0, "automotive_transport": 1, "finance": 0, "service_retail": 0, "general": 2, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 1, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 249, "size": 554418764, "checksum": "e4f76256f8480090bad4d981f871b6f2bca945fa05e5ec971e0eda1d47dcab0c", "avgDurationSecs": 4.078, "validDurationSecs": 83783.877, "totalHrs": 27.23, "validHrs": 23.27 }, "skr": { "buckets": { "dev": 1138, "test": 1028, "train": 1540, "validated": 3718, "invalidated": 465, "other": 1544 }, "duration": 23728140, "reportedSentences": 17, "validatedSentences": 6006, 
"unvalidatedSentences": 92, "clips": 5727, "splits": { "accent": {}, "age": { "": 0.36, "twenties": 0.25, "thirties": 0.01, "teens": 0, "fourties": 0, "fifties": 0.38, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.36, "male_masculine": 0.64, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5727, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 57, "size": 136547875, "checksum": "855d65e3cb3c2b5906429b9e93a7d7b1f2762be224672a71e9f28366ce086eb4", "avgDurationSecs": 4.143, "validDurationSecs": 15404.439, "totalHrs": 6.59, "validHrs": 4.27 }, "sl": { "buckets": { "dev": 1323, "test": 1328, "train": 1453, "validated": 14189, "invalidated": 394, "other": 3170 }, "duration": 70166782, "reportedSentences": 51, "validatedSentences": 4149, "unvalidatedSentences": 76, "clips": 17753, "splits": { "accent": {}, "age": { "": 0.32, "twenties": 0.34, "thirties": 0.04, "teens": 0.06, "fourties": 0.16, "fifties": 0.05, "sixties": 0.05, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0.57, "female_feminine": 0.1, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 17753, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 723, "size": 465546317, "checksum": "54ba3265e466235833cc5dddd31e08ac5d9b622761e92b737d23cdef8b30590f", "avgDurationSecs": 3.952, "validDurationSecs": 56080.464, "totalHrs": 19.49, "validHrs": 15.57 }, "sq": { "buckets": { "dev": 
1645, "test": 1853, "train": 2658, "validated": 6166, "invalidated": 165, "other": 60 }, "duration": 32578344, "reportedSentences": 133, "validatedSentences": 50002, "unvalidatedSentences": 2640, "clips": 6391, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0.08, "thirties": 0.15, "teens": 0, "fourties": 0.18, "fifties": 0.35, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male_masculine": 0.48, "female_feminine": 0.14, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1576, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 1786, "healthcare": 0, "history_law_government": 3009, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 20, "technology_robotics": 0 } }, "users": 140, "size": 194216270, "checksum": "a69dd796f2262cdd97a88b02ff8957b1bcd41603dbb1ede44d32777e442da29d", "avgDurationSecs": 5.098, "validDurationSecs": 31431.399, "totalHrs": 9.04, "validHrs": 8.73 }, "sr": { "buckets": { "dev": 1909, "test": 1926, "train": 2347, "validated": 8177, "invalidated": 424, "other": 4867 }, "duration": 43529040, "reportedSentences": 78, "validatedSentences": 7898, "unvalidatedSentences": 753, "clips": 13468, "splits": { "accent": {}, "age": { "": 0.14, "twenties": 0.25, "thirties": 0.11, "teens": 0, "fourties": 0.1, "fifties": 0.13, "sixties": 0.26, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.53, "male_masculine": 0.32, "female_feminine": 0.15, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 13428, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 13, "healthcare": 0, "history_law_government": 1, "language_fundamentals": 1, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 26 } }, "users": 175, "size": 244155973, "checksum": 
"e045df293cd4b6c6784ec879dc84e2fe369dfa8112be33913dbaa161f8f2afcb", "avgDurationSecs": 3.232, "validDurationSecs": 26428.346, "totalHrs": 12.09, "validHrs": 7.34 }, "st": { "buckets": { "dev": 0, "test": 0, "train": 0, "validated": 0, "invalidated": 0, "other": 60 }, "duration": 118296, "reportedSentences": 0, "validatedSentences": 2337, "unvalidatedSentences": 14838, "clips": 60, "splits": { "accent": {}, "age": { "": 0, "twenties": 1, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 60, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1, "size": 1722449, "checksum": "5b7d11fd3556446185b25e93f8d5f2d5b874b9e5b0c8f4dccef751b8ed8042ee", "avgDurationSecs": 1.972, "validDurationSecs": 0, "totalHrs": 0.03, "validHrs": 0 }, "sv-SE": { "buckets": { "dev": 5292, "test": 5345, "train": 7952, "validated": 41501, "invalidated": 1563, "other": 6722 }, "duration": 199965399, "reportedSentences": 592, "validatedSentences": 25912, "unvalidatedSentences": 7369, "clips": 49786, "splits": { "accent": {}, "age": { "": 0.19, "twenties": 0.12, "thirties": 0.25, "teens": 0.03, "fourties": 0.37, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.21, "male_masculine": 0.47, "female_feminine": 0.32, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 49785, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 1, 
"nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 859, "size": 1264214007, "checksum": "ab865dea0eddb088a87980d7c400cd15dcb168b267fdb28e630e711d9eb79488", "avgDurationSecs": 4.016, "validDurationSecs": 166688.708, "totalHrs": 55.54, "validHrs": 46.3 }, "sw": { "buckets": { "dev": 12252, "test": 12253, "train": 46511, "validated": 267093, "invalidated": 80638, "other": 377360 }, "duration": 3905713836, "reportedSentences": 1902, "validatedSentences": 134698, "unvalidatedSentences": 5785, "clips": 725091, "splits": { "accent": {}, "age": { "": 0.31, "twenties": 0.5, "thirties": 0.11, "teens": 0.01, "fourties": 0.03, "fifties": 0.04, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.29, "male_masculine": 0.34, "female_feminine": 0.37, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 725091, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1467, "size": 22796331909, "checksum": "726ec386b9448b683e3cf37cc8e44e5b6c1033395d3297b3ad6a2afa46c58b8a", "avgDurationSecs": 5.387, "validDurationSecs": 1438700.557, "totalHrs": 1084.92, "validHrs": 399.63 }, "ta": { "buckets": { "dev": 12112, "test": 12203, "train": 46095, "validated": 136128, "invalidated": 5707, "other": 105052 }, "duration": 1530376512, "reportedSentences": 3437, "validatedSentences": 118206, "unvalidatedSentences": 628, "clips": 246887, "splits": { "accent": {}, "age": { "": 0.66, "twenties": 0.11, "thirties": 0.12, "teens": 0.04, "fourties": 0.03, "fifties": 0.02, "sixties": 0, "seventies": 0.02, "eighties": 0, "nineties": 0 }, "gender": { "": 0.66, "male_masculine": 0.19, "female_feminine": 0.15, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, 
"sentence_domain": { "": 246887, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 947, "size": 9174059266, "checksum": "f4d01a296f1bfa3575299e34db103a0c6b9470438f6e1e83177d3097c7a7cfc5", "avgDurationSecs": 6.199, "validDurationSecs": 843815.567, "totalHrs": 425.1, "validHrs": 234.39 }, "te": { "buckets": { "dev": 58, "test": 58, "train": 67, "validated": 287, "invalidated": 39, "other": 1965 }, "duration": 9572868, "reportedSentences": 15, "validatedSentences": 322, "unvalidatedSentences": 19298, "clips": 2291, "splits": { "accent": {}, "age": { "": 0.13, "twenties": 0.69, "thirties": 0.02, "teens": 0.07, "fourties": 0.01, "fifties": 0, "sixties": 0.08, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.14, "male_masculine": 0.68, "female_feminine": 0.18, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2291, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 52, "size": 57336099, "checksum": "8c05337553e2f98fc0fdf898a6b94501ff70d688548d154a18b0d6f26f35bbaa", "avgDurationSecs": 4.178, "validDurationSecs": 1199.22, "totalHrs": 2.65, "validHrs": 0.33 }, "th": { "buckets": { "dev": 11052, "test": 11052, "train": 32914, "validated": 148057, "invalidated": 9317, "other": 207438 }, "duration": 1528989132, "reportedSentences": 4296, "validatedSentences": 64447, "unvalidatedSentences": 121693, "clips": 364812, "splits": { "accent": {}, "age": { "": 0.4, "twenties": 0.24, "thirties": 0.07, "teens": 0.04, "fourties": 0.04, "fifties": 0.2, "sixties": 0, 
"seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.41, "male_masculine": 0.42, "female_feminine": 0.17, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 364812, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 7938, "size": 8921149759, "checksum": "ae332f94c69a60d62265cd136cbe037c312ecd52058d8d2ccec109111c0be71c", "avgDurationSecs": 4.191, "validDurationSecs": 620532.066, "totalHrs": 424.71, "validHrs": 172.37 }, "ti": { "buckets": { "dev": 5, "test": 7, "train": 14, "validated": 26, "invalidated": 1, "other": 268 }, "duration": 1474344, "reportedSentences": 3, "validatedSentences": 4532, "unvalidatedSentences": 138, "clips": 295, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0, "thirties": 0.03, "teens": 0, "fourties": 0.74, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.95, "male_masculine": 0.05, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 295, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 10, "size": 8940852, "checksum": "ba60abc72a9c372a7f2b375774a06c48566dd703df0ed7f4056673f60e14d8ae", "avgDurationSecs": 4.998, "validDurationSecs": 129.942, "totalHrs": 0.4, "validHrs": 0.03 }, "tig": { "buckets": { "dev": 104, "test": 187, "train": 370, "validated": 661, "invalidated": 16, "other": 953 }, "duration": 10043568, "reportedSentences": 2, "validatedSentences": 5382, "unvalidatedSentences": 20, "clips": 1630, 
"splits": { "accent": {}, "age": { "": 0.98, "twenties": 0.01, "thirties": 0.01, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.98, "male_masculine": 0.02, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1630, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 29, "size": 51992836, "checksum": "e7041d1eb07073c10b6cad3c7fb3cefa1b06a832d960fd4b4b96bca1ad60f43f", "avgDurationSecs": 6.162, "validDurationSecs": 4072.882, "totalHrs": 2.78, "validHrs": 1.13 }, "tk": { "buckets": { "dev": 430, "test": 506, "train": 758, "validated": 1869, "invalidated": 128, "other": 2433 }, "duration": 24022008, "reportedSentences": 27, "validatedSentences": 2456, "unvalidatedSentences": 236, "clips": 4430, "splits": { "accent": {}, "age": { "": 0.35, "twenties": 0.56, "thirties": 0.09, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.35, "male_masculine": 0.46, "female_feminine": 0.19, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 4430, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 118, "size": 135090376, "checksum": "84bcc6e35b9059055953f8a981a32d564dba5601a5a256c205f513d4561b1ba3", "avgDurationSecs": 5.423, "validDurationSecs": 10134.793, "totalHrs": 6.67, "validHrs": 2.81 }, "tn": { "buckets": { "dev": 317, "test": 369, "train": 1073, "validated": 3495, "invalidated": 
11, "other": 543 }, "duration": 17693028, "reportedSentences": 1, "validatedSentences": 1765, "unvalidatedSentences": 15342, "clips": 4049, "splits": { "accent": {}, "age": { "": 0.03, "twenties": 0.02, "thirties": 0.51, "teens": 0, "fourties": 0, "fifties": 0.43, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.48, "male_masculine": 0, "female_feminine": 0.48, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.04 }, "sentence_domain": { "": 4049, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 17, "size": 89816201, "checksum": "8cdc306c671bdd7674857754c5474daa33dcbe6cd9f2d7e9fa0bb7eaf2673bbf", "avgDurationSecs": 4.37, "validDurationSecs": 15272.199, "totalHrs": 4.91, "validHrs": 4.24 }, "tok": { "buckets": { "dev": 2198, "test": 2186, "train": 2553, "validated": 12031, "invalidated": 347, "other": 4379 }, "duration": 69909120, "reportedSentences": 145, "validatedSentences": 7347, "unvalidatedSentences": 2374, "clips": 16757, "splits": { "accent": {}, "age": { "": 0.32, "twenties": 0.2, "thirties": 0.07, "teens": 0.25, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0.17, "nineties": 0 }, "gender": { "": 0.44, "male_masculine": 0.51, "female_feminine": 0.06, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 16757, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 160, "size": 409300752, "checksum": "4ac83b57790016aaee91513abbea484f64ae33eb3411595c616f30cc5def9855", "avgDurationSecs": 4.172, 
"validDurationSecs": 50192.554, "totalHrs": 19.41, "validHrs": 13.94 }, "tr": { "buckets": { "dev": 11646, "test": 11660, "train": 38992, "validated": 118688, "invalidated": 4756, "other": 116 }, "duration": 469295163, "reportedSentences": 444, "validatedSentences": 410385, "unvalidatedSentences": 3436, "clips": 123560, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.23, "thirties": 0.09, "teens": 0.02, "fourties": 0.03, "fifties": 0.07, "sixties": 0.2, "seventies": 0.03, "eighties": 0.1, "nineties": 0 }, "gender": { "": 0.26, "male_masculine": 0.42, "female_feminine": 0.32, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 123551, "agriculture_food": 0, "automotive_transport": 1, "finance": 0, "service_retail": 0, "general": 1, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 6, "media_entertainment": 1, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1668, "size": 2866153999, "checksum": "b16a43fc1fa8700c602da57ab56dc07d303b9b7baf685e2b28e9e273d30cba1e", "avgDurationSecs": 3.798, "validDurationSecs": 450790.744, "totalHrs": 130.35, "validHrs": 125.21 }, "ts": { "buckets": { "dev": 0, "test": 0, "train": 0, "validated": 0, "invalidated": 0, "other": 1 }, "duration": 8928, "reportedSentences": 0, "validatedSentences": 352, "unvalidatedSentences": 16012, "clips": 1, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 
0 } }, "users": 1, "size": 1040829, "checksum": "c4d33e7851d31b8980bcc242ea1d56bc85f7cfa343e49c322331647ae0b473f1", "avgDurationSecs": 8.928, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "tt": { "buckets": { "dev": 3980, "test": 4915, "train": 9078, "validated": 29466, "invalidated": 440, "other": 792 }, "duration": 117416006, "reportedSentences": 6, "validatedSentences": 18040, "unvalidatedSentences": 43, "clips": 30698, "splits": { "accent": {}, "age": { "": 0.21, "twenties": 0.05, "thirties": 0.72, "teens": 0, "fourties": 0, "fifties": 0.01, "sixties": 0.01, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.21, "male_masculine": 0.77, "female_feminine": 0.03, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 30696, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 2, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 273, "size": 857602085, "checksum": "80b3c6bdbec06c10aa009d89292f89b138f2a0734dd3b4e227238de7c5e1ffa3", "avgDurationSecs": 3.825, "validDurationSecs": 112703.76, "totalHrs": 32.61, "validHrs": 31.3 }, "tw": { "buckets": { "dev": 0, "test": 13, "train": 179, "validated": 192, "invalidated": 3, "other": 77 }, "duration": 1172304, "reportedSentences": 3, "validatedSentences": 49159, "unvalidatedSentences": 18, "clips": 272, "splits": { "accent": {}, "age": { "": 0.19, "twenties": 0.02, "thirties": 0.79, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.19, "male_masculine": 0.81, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 272, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, 
"language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 11, "size": 9648880, "checksum": "e937fb87a4f4d6fd289447ce860e82e3d02fdc8361379a668f99b953e4dfa52a", "avgDurationSecs": 4.31, "validDurationSecs": 827.509, "totalHrs": 0.32, "validHrs": 0.22 }, "ug": { "buckets": { "dev": 13758, "test": 13758, "train": 79418, "validated": 223660, "invalidated": 17017, "other": 2789 }, "duration": 1381223844, "reportedSentences": 1350, "validatedSentences": 172148, "unvalidatedSentences": 761, "clips": 243466, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0.29, "thirties": 0.3, "teens": 0.05, "fourties": 0.11, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.48, "male_masculine": 0.32, "female_feminine": 0.2, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.01, "intersex": 0 }, "sentence_domain": { "": 241277, "agriculture_food": 1, "automotive_transport": 1, "finance": 1, "service_retail": 0, "general": 2175, "healthcare": 22, "history_law_government": 0, "language_fundamentals": 6, "media_entertainment": 12, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 3 } }, "users": 1424, "size": 7859930867, "checksum": "0a5e3a672d815499f4c9570604c934bd5496a1b03b125bb891aa3473567b67d1", "avgDurationSecs": 5.673, "validDurationSecs": 1268861.052, "totalHrs": 383.67, "validHrs": 352.46 }, "uk": { "buckets": { "dev": 10132, "test": 10133, "train": 25917, "validated": 76522, "invalidated": 3262, "other": 8523 }, "duration": 409935732, "reportedSentences": 653, "validatedSentences": 212603, "unvalidatedSentences": 1659, "clips": 88307, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0.33, "thirties": 0.25, "teens": 0.09, "fourties": 0.1, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.24, "male_masculine": 0.59, "female_feminine": 0.17, "transgender": 0, 
"non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 88307, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1120, "size": 2697092870, "checksum": "785ce2c0c3ad8004fde1559a8cb560d22b5fcde35dd55cd3804256824a1b4465", "avgDurationSecs": 4.642, "validDurationSecs": 355227.808, "totalHrs": 113.87, "validHrs": 98.67 }, "ur": { "buckets": { "dev": 5025, "test": 5026, "train": 7205, "validated": 67147, "invalidated": 10494, "other": 174289 }, "duration": 1082072628, "reportedSentences": 325, "validatedSentences": 21527, "unvalidatedSentences": 48219, "clips": 251930, "splits": { "accent": {}, "age": { "": 0.06, "twenties": 0.92, "thirties": 0, "teens": 0, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.22, "male_masculine": 0.53, "female_feminine": 0.25, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 251824, "agriculture_food": 12, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 48, "healthcare": 46, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 464, "size": 6317563187, "checksum": "9b06700606a2356ea9d4f52b8c4bdf9f6350630d8f145776055ac3967a064980", "avgDurationSecs": 4.295, "validDurationSecs": 288405.235, "totalHrs": 300.57, "validHrs": 80.11 }, "uz": { "buckets": { "dev": 12186, "test": 12348, "train": 48606, "validated": 86731, "invalidated": 14014, "other": 128437 }, "duration": 952015716, "reportedSentences": 1833, "validatedSentences": 148681, "unvalidatedSentences": 137916, "clips": 229182, "splits": { "accent": {}, "age": { "": 0.41, "twenties": 0.4, 
"thirties": 0.01, "teens": 0.18, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.41, "male_masculine": 0.44, "female_feminine": 0.15, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 229182, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 2211, "size": 5342141478, "checksum": "4eb855cc571f325615927e6a41b445567064448d2be9a15937c738883d6f9078", "avgDurationSecs": 4.154, "validDurationSecs": 360278.185, "totalHrs": 264.44, "validHrs": 100.07 }, "vi": { "buckets": { "dev": 719, "test": 1356, "train": 2219, "validated": 5388, "invalidated": 398, "other": 11929 }, "duration": 69963416, "reportedSentences": 201, "validatedSentences": 5784, "unvalidatedSentences": 5310, "clips": 17715, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.2, "thirties": 0.02, "teens": 0.22, "fourties": 0.02, "fifties": 0, "sixties": 0.28, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.27, "male_masculine": 0.53, "female_feminine": 0.19, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 17715, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 303, "size": 407488121, "checksum": "0bfb9be27efbf15566ee2a9feb61a13b7793a20c78934708655814faf2029609", "avgDurationSecs": 3.949, "validDurationSecs": 21279.305, "totalHrs": 19.43, "validHrs": 5.91 }, "vot": { "buckets": { "dev": 0, "test": 7, "train": 96, "validated": 103, "invalidated": 324, "other": 0 }, "duration": 
1029972, "reportedSentences": 0, "validatedSentences": 5437, "unvalidatedSentences": 1, "clips": 427, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.74, "thirties": 0, "teens": 0.01, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.25, "male_masculine": 0.75, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 427, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 6, "size": 8186669, "checksum": "1458a620c79a215e01f9be52f5af32d2d5564ed7a74f2e443e7aaff67996dfcf", "avgDurationSecs": 2.412, "validDurationSecs": 248.448, "totalHrs": 0.28, "validHrs": 0.06 }, "xh": { "buckets": { "dev": 0, "test": 0, "train": 1, "validated": 1, "invalidated": 0, "other": 14 }, "duration": 94140, "reportedSentences": 0, "validatedSentences": 1476, "unvalidatedSentences": 14922, "clips": 15, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 15, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 3, "size": 1691677, "checksum": "8edd101c64b46ac47847d5b566567c79f118276b9944e686b36a8b6636fc3611", "avgDurationSecs": 6.276, "validDurationSecs": 6.276, "totalHrs": 0.02, "validHrs": 0 }, "yi": { "buckets": { "dev": 97, "test": 122, 
"train": 151, "validated": 383, "invalidated": 26, "other": 8 }, "duration": 1753524, "reportedSentences": 1, "validatedSentences": 862, "unvalidatedSentences": 640, "clips": 417, "splits": { "accent": {}, "age": { "": 0.85, "twenties": 0.15, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 417, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 39, "size": 9744049, "checksum": "f7597af16c97a8fa0f0b6538093c07df85a9deaacbd0e2c6b43e4ecf47cfb689", "avgDurationSecs": 4.205, "validDurationSecs": 1610.551, "totalHrs": 0.48, "validHrs": 0.44 }, "yo": { "buckets": { "dev": 880, "test": 1024, "train": 1257, "validated": 3163, "invalidated": 234, "other": 1159 }, "duration": 27253224, "reportedSentences": 21, "validatedSentences": 5419, "unvalidatedSentences": 16, "clips": 4556, "splits": { "accent": {}, "age": { "": 0.31, "twenties": 0.63, "thirties": 0.02, "teens": 0.04, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.31, "male_masculine": 0.32, "female_feminine": 0.37, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 4556, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 125, "size": 159723826, "checksum": "ed89244ffac4c189bd6f90b2e55bc7a2a793faee665c0211e94dfccc61c4efa2", 
"avgDurationSecs": 5.982, "validDurationSecs": 18920.533, "totalHrs": 7.57, "validHrs": 5.25 }, "yue": { "buckets": { "dev": 5128, "test": 5128, "train": 7415, "validated": 190766, "invalidated": 8023, "other": 78963 }, "duration": 1099056384, "reportedSentences": 2245, "validatedSentences": 17800, "unvalidatedSentences": 7936, "clips": 277752, "splits": { "accent": {}, "age": { "": 0.19, "twenties": 0.29, "thirties": 0.47, "teens": 0.01, "fourties": 0.04, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2, "male_masculine": 0.17, "female_feminine": 0.63, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 274961, "agriculture_food": 16, "automotive_transport": 48, "finance": 126, "service_retail": 596, "general": 1645, "healthcare": 405, "history_law_government": 64, "language_fundamentals": 0, "media_entertainment": 188, "nature_environment": 187, "news_current_affairs": 112, "technology_robotics": 93 } }, "users": 1145, "size": 6369157154, "checksum": "8e4b414dcb0b973e347b0e76bcf52faf19a3f641cf753cd1e6a96f0384203d42", "avgDurationSecs": 3.957, "validDurationSecs": 754855.375, "totalHrs": 305.29, "validHrs": 209.68 }, "zgh": { "buckets": { "dev": 294, "test": 213, "train": 807, "validated": 1314, "invalidated": 83, "other": 591 }, "duration": 6804180, "reportedSentences": 6, "validatedSentences": 2730, "unvalidatedSentences": 353, "clips": 1988, "splits": { "accent": {}, "age": { "": 0.08, "twenties": 0.35, "thirties": 0.56, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.09, "male_masculine": 0.88, "female_feminine": 0.03, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1955, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 27, "healthcare": 1, "history_law_government": 0, "language_fundamentals": 4, "media_entertainment": 0, 
"nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 1 } }, "users": 25, "size": 36946206, "checksum": "78f93bee259abf989ed91cfdea0b34ecbf1b7240356c2c12e066f7d651f57109", "avgDurationSecs": 3.423, "validDurationSecs": 4497.33, "totalHrs": 1.89, "validHrs": 1.24 }, "zh-CN": { "buckets": { "dev": 10630, "test": 10630, "train": 29438, "validated": 187414, "invalidated": 58849, "other": 600247 }, "duration": 3846056316, "reportedSentences": 1003, "validatedSentences": 58974, "unvalidatedSentences": 888, "clips": 846510, "splits": { "accent": {}, "age": { "": 0.92, "twenties": 0.05, "thirties": 0.01, "teens": 0.01, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.93, "male_masculine": 0.06, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 845401, "agriculture_food": 12, "automotive_transport": 26, "finance": 8, "service_retail": 31, "general": 381, "healthcare": 45, "history_law_government": 137, "language_fundamentals": 25, "media_entertainment": 499, "nature_environment": 12, "news_current_affairs": 25, "technology_robotics": 71 } }, "users": 7245, "size": 22772407323, "checksum": "e71104ee0b8246ef412b7d08fe0965552d4f6990ce5e0ece236b234d983391de", "avgDurationSecs": 4.543, "validDurationSecs": 851501.811, "totalHrs": 1068.34, "validHrs": 236.52 }, "zh-HK": { "buckets": { "dev": 5599, "test": 5599, "train": 8439, "validated": 93487, "invalidated": 4532, "other": 23320 }, "duration": 505446720, "reportedSentences": 814, "validatedSentences": 20021, "unvalidatedSentences": 27, "clips": 121339, "splits": { "accent": {}, "age": { "": 0.39, "twenties": 0.33, "thirties": 0.11, "teens": 0.02, "fourties": 0.12, "fifties": 0.02, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male_masculine": 0.41, "female_feminine": 0.22, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, 
"sentence_domain": { "": 120839, "agriculture_food": 0, "automotive_transport": 12, "finance": 22, "service_retail": 118, "general": 326, "healthcare": 44, "history_law_government": 23, "language_fundamentals": 41, "media_entertainment": 22, "nature_environment": 21, "news_current_affairs": 14, "technology_robotics": 10 } }, "users": 3070, "size": 3606132545, "checksum": "7425e31e97793615cf9401976355738cd286e95aead930da1f601da4083dc8d8", "avgDurationSecs": 4.166, "validDurationSecs": 389427.122, "totalHrs": 140.4, "validHrs": 108.17 }, "zh-TW": { "buckets": { "dev": 5043, "test": 5043, "train": 7238, "validated": 83703, "invalidated": 4847, "other": 48635 }, "duration": 461453674, "reportedSentences": 172, "validatedSentences": 19573, "unvalidatedSentences": 34, "clips": 137185, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.3, "thirties": 0.19, "teens": 0.06, "fourties": 0.09, "fifties": 0.09, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.28, "male_masculine": 0.49, "female_feminine": 0.22, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 137078, "agriculture_food": 1, "automotive_transport": 15, "finance": 0, "service_retail": 18, "general": 57, "healthcare": 2, "history_law_government": 9, "language_fundamentals": 3, "media_entertainment": 21, "nature_environment": 4, "news_current_affairs": 4, "technology_robotics": 2 } }, "users": 2231, "size": 3088335271, "checksum": "1a28bd64b6cea58c04e688ac2d33772c3c3b0384dc87d47eb4ff39c97b278fc1", "avgDurationSecs": 3.364, "validDurationSecs": 281554.52, "totalHrs": 128.18, "validHrs": 78.2 }, "zu": { "buckets": { "dev": 0, "test": 0, "train": 7, "validated": 7, "invalidated": 0, "other": 34 }, "duration": 220896, "reportedSentences": 0, "validatedSentences": 1320, "unvalidatedSentences": 19379, "clips": 41, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, 
"seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 41, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 3, "size": 2730844, "checksum": "bc4da09091af54178f1f3fcaa51b087d2e04875c2d2226a4d16930d8c7d2b48a", "avgDurationSecs": 5.388, "validDurationSecs": 37.714, "totalHrs": 0.06, "validHrs": 0.01 }, "zza": { "buckets": { "dev": 463, "test": 373, "train": 680, "validated": 1516, "invalidated": 141, "other": 518 }, "duration": 8926596, "reportedSentences": 2321, "validatedSentences": 5636, "unvalidatedSentences": 981, "clips": 2175, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0, "thirties": 0.65, "teens": 0, "fourties": 0, "fifties": 0.01, "sixties": 0.12, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.71, "male_masculine": 0.27, "female_feminine": 0.02, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1456, "agriculture_food": 3, "automotive_transport": 1, "finance": 1, "service_retail": 8, "general": 657, "healthcare": 16, "history_law_government": 0, "language_fundamentals": 27, "media_entertainment": 2, "nature_environment": 3, "news_current_affairs": 0, "technology_robotics": 2 } }, "users": 17, "size": 52288733, "checksum": "1955369eea1132514b3841ae63f9a6fe3c01cb5864869b74927107d53a716c3f", "avgDurationSecs": 4.104, "validDurationSecs": 6221.94, "totalHrs": 2.47, "validHrs": 1.72 } }, "totalDuration": 119356481119, "totalValidDurationSecs": 79584947, "totalHrs": 33154, "totalValidHrs": 22106 } ================================================ FILE: datasets/scripted-speech/cv-corpus-20.0-delta-2024-12-06.json 
================================================ { "locales": { "ab": { "duration": 5433228, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 949, "users": 4, "size": 30544456, "checksum": "b08f789609e649f395561b107f5d2eacb6d51fa4888e34ebc92eec87617c93c3", "avgDurationSecs": 5.178, "validDurationSecs": 424.15, "totalHrs": 1.51, "validHrs": 0.12 }, "af": { "duration": 954828, "reportedSentences": 0, "validatedSentences": 36, "unvalidatedSentences": 68, "clips": 150, "users": 17, "size": 5498168, "checksum": "5cd09c6271b049e64fea831daed656d6ce2aa0a29ec4471c4d1b889b45332a19", "avgDurationSecs": 5.756, "validDurationSecs": 540.69, "totalHrs": 0.27, "validHrs": 0.15 }, "am": { "duration": 77652, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 10, "users": 1, "size": 459468, "checksum": "a2dadc69339073906db2df92b36d6cd3c362596fb439b15bc568c2c86c40794b", "avgDurationSecs": 6.232, "validDurationSecs": 165.19, "totalHrs": 0.02, "validHrs": 0.04 }, "ar": { "duration": 415548, "reportedSentences": 1, "validatedSentences": 6, "unvalidatedSentences": 25, "clips": 90, "users": 14, "size": 2320577, "checksum": "a648911b8ddc0155dc238a6941f19b4fd4fd6f8c0d837b6593a78fe71d0934e0", "avgDurationSecs": 4.16, "validDurationSecs": 1051.4, "totalHrs": 0.11, "validHrs": 0.29 }, "as": { "duration": 2980116, "reportedSentences": 63, "validatedSentences": 544, "unvalidatedSentences": 354, "clips": 509, "users": 0, "size": 17659664, "checksum": "7debdabf59cea322a077dd7018023d0936ba6d1b065206afe93743c8c32e3365", "avgDurationSecs": 5.901, "validDurationSecs": 155.28, "totalHrs": 0.83, "validHrs": 0.04 }, "ast": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 1, "clips": 0, "users": 0, "size": -3110, "checksum": "b7d2c93a072bc82cf657601f7ec011f7abda8a1998890bdbb50ab971dcb019e3", "avgDurationSecs": 4.434, "validDurationSecs": 26.61, "totalHrs": 0, "validHrs": 0.01 }, "az": { 
"duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 2050, "checksum": "36afc5d5fc14cdb091377c27d32b98a7330cf288c28783b6a0c7f3ec8c671f6b", "avgDurationSecs": 5.435, "validDurationSecs": 228.26, "totalHrs": 0, "validHrs": 0.06 }, "ba": { "duration": 122544, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 20, "users": 1, "size": 742107, "checksum": "058decc1fa22e29855f7673371ba38655d2ef35d9ffa4beaa4d9b843cee18869", "avgDurationSecs": 4.427, "validDurationSecs": 165.52, "totalHrs": 0.03, "validHrs": 0.05 }, "bas": { "duration": 38167308, "reportedSentences": 0, "validatedSentences": 211, "unvalidatedSentences": -140, "clips": 10031, "users": 10, "size": 193268265, "checksum": "8c9780e4f91d8456447c5f991ec0044a7904301063eb309d42c6b6733b9f797c", "avgDurationSecs": 3.925, "validDurationSecs": 35731.2, "totalHrs": 10.6, "validHrs": 9.92 }, "be": { "duration": 51077808, "reportedSentences": 45, "validatedSentences": 3, "unvalidatedSentences": 1927, "clips": 9400, "users": 77, "size": 282255961, "checksum": "094871bb35673b61fa572f75486a8b0022f8368a6842ed2da4fd80e2a1804610", "avgDurationSecs": 4.782, "validDurationSecs": 32274.06, "totalHrs": 14.19, "validHrs": 8.97 }, "bg": { "duration": 8100, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 2, "clips": 1, "users": 1, "size": 49912, "checksum": "4d5e2f226f61a985b1d53a1e4b93242c6eba8c9aaa2eb080b7d842cd97c2059b", "avgDurationSecs": 5.473, "validDurationSecs": 166.29, "totalHrs": 0.01, "validHrs": 0.05 }, "bn": { "duration": 3167496, "reportedSentences": 1, "validatedSentences": 12, "unvalidatedSentences": -7, "clips": 447, "users": 15, "size": 18470549, "checksum": "85801c1fcbd60d1527a490b908ab8c9c96698057032fdc7b3094ae5995108d58", "avgDurationSecs": 4.368, "validDurationSecs": 518.59, "totalHrs": 0.87, "validHrs": 0.15 }, "br": { "duration": 2415996, "reportedSentences": 8, "validatedSentences": 
220, "unvalidatedSentences": -144, "clips": 702, "users": 6, "size": 14052424, "checksum": "599380a6168453bb13f71cf02f69fa069c19e752ccb06898fa56a49e01da00ba", "avgDurationSecs": 3.192, "validDurationSecs": 5261.83, "totalHrs": 0.67, "validHrs": 1.46 }, "ca": { "duration": 147552948, "reportedSentences": 68, "validatedSentences": 384, "unvalidatedSentences": -24, "clips": 38539, "users": 263, "size": 837647417, "checksum": "48e79bd0f40b830a5ad84549f4128e692bd9fb03e4aee6eed87e189918ad58bc", "avgDurationSecs": 5.25, "validDurationSecs": 665168.74, "totalHrs": 40.98, "validHrs": 184.77 }, "ckb": { "duration": 3954384, "reportedSentences": 1, "validatedSentences": 156, "unvalidatedSentences": -135, "clips": 746, "users": 42, "size": 22652621, "checksum": "26fc9f595cddac9cd397d7df87db593cacdf632ea1cc97801ffd2fda75d3647f", "avgDurationSecs": 4.084, "validDurationSecs": 2915.76, "totalHrs": 1.1, "validHrs": 0.81 }, "cnh": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -344, "checksum": "595e9ba7eca9df6a041e359e2cea919d6992057270aed8355c221ea06eab5de3", "avgDurationSecs": 3.516, "validDurationSecs": 7.03, "totalHrs": 0, "validHrs": 0 }, "cs": { "duration": 1812204, "reportedSentences": 2, "validatedSentences": 19, "unvalidatedSentences": 9, "clips": 351, "users": 12, "size": 10619445, "checksum": "d339a4f82201c8181993245fd563a67fb415e337e37ecacc99d4e7c12c92ddc3", "avgDurationSecs": 4.464, "validDurationSecs": 1857.14, "totalHrs": 0.5, "validHrs": 0.51 }, "cv": { "duration": 350892, "reportedSentences": 0, "validatedSentences": 2, "unvalidatedSentences": -2, "clips": 85, "users": 7, "size": 1958013, "checksum": "83930665aca6eb4b0f2f7c9a69a7b9e70abe532b5ea5349bf75692492f41ba50", "avgDurationSecs": 5.039, "validDurationSecs": 235.95, "totalHrs": 0.09, "validHrs": 0.06 }, "cy": { "duration": 370980, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 58, "users": 7, 
"size": 1939859, "checksum": "8820cc4bf0dfbbf971d62f7ebf99ef779e3ebfff00bd5a7ee2e38dfa56b76d78", "avgDurationSecs": 4.913, "validDurationSecs": 298.3, "totalHrs": 0.1, "validHrs": 0.08 }, "da": { "duration": 697680, "reportedSentences": 8, "validatedSentences": 24, "unvalidatedSentences": -2, "clips": 186, "users": 2, "size": 4087306, "checksum": "ce47ab19e93a34a1c50de1daf0a69504f5c6663805826b49df58ce159d1acb0c", "avgDurationSecs": 4.099, "validDurationSecs": 463.76, "totalHrs": 0.2, "validHrs": 0.13 }, "dav": { "duration": 162828, "reportedSentences": 0, "validatedSentences": 125, "unvalidatedSentences": -83, "clips": 36, "users": 0, "size": 945057, "checksum": "b819fd9a384f2b38dc64f5b699be5aac74b27070c094e9648cd06375cf762f9e", "avgDurationSecs": 4.059, "validDurationSecs": 4097.63, "totalHrs": 0.05, "validHrs": 1.14 }, "de": { "duration": 24560892, "reportedSentences": 209, "validatedSentences": 326, "unvalidatedSentences": 1653, "clips": 4099, "users": 123, "size": 144406686, "checksum": "f5703ecaca978282a7727358d60c209be019cf3f4365830d410d6fec8c9170e5", "avgDurationSecs": 5.241, "validDurationSecs": 22107.02, "totalHrs": 6.82, "validHrs": 6.14 }, "dv": { "duration": 349128, "reportedSentences": 0, "validatedSentences": 10, "unvalidatedSentences": -9, "clips": 55, "users": 8, "size": 2043759, "checksum": "22fdb61e0fa87e0cbc410d737a798ddd087380b4531a37b1e7c13a2693209b18", "avgDurationSecs": 5.118, "validDurationSecs": 594.04, "totalHrs": 0.09, "validHrs": 0.17 }, "dyu": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -23, "checksum": "c982c813da26cff20c40bcc10fb121d1bd03b6e7d2bb7d680b12515f522b336b", "avgDurationSecs": 6.204, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "el": { "duration": 260496, "reportedSentences": 0, "validatedSentences": 68, "unvalidatedSentences": -54, "clips": 65, "users": 7, "size": 1522444, "checksum": 
"fcb4f8d8f1c660031ba2db2cd0206a9fd1380e9af108c031f93f3200931f35ea", "avgDurationSecs": 4.148, "validDurationSecs": 272.48, "totalHrs": 0.07, "validHrs": 0.08 }, "en": { "duration": 160870428, "reportedSentences": 123, "validatedSentences": 1050, "unvalidatedSentences": 30693, "clips": 26224, "users": 1015, "size": 944317859, "checksum": "cc7ba69840e96bb55c9f71ee56b2ddf0df67a5996fb2c24499d282c5a8e74e8a", "avgDurationSecs": 5.259, "validDurationSecs": 80437.12, "totalHrs": 44.69, "validHrs": 22.34 }, "eo": { "duration": 7222176, "reportedSentences": 15, "validatedSentences": 33, "unvalidatedSentences": 57, "clips": 1262, "users": 13, "size": 42932662, "checksum": "bd0ffa5898c25d84746e48cca935b23e301337402228334ecc346a86efe33814", "avgDurationSecs": 6.069, "validDurationSecs": 994.78, "totalHrs": 2.01, "validHrs": 0.27 }, "es": { "duration": 25522416, "reportedSentences": 18, "validatedSentences": 301, "unvalidatedSentences": -81, "clips": 5035, "users": 140, "size": 149385177, "checksum": "5b33a51daebcf91f7a4b86abb2c5265cb9e7068558cbe26de056ce0a38ab7401", "avgDurationSecs": 4.874, "validDurationSecs": 12846.57, "totalHrs": 7.09, "validHrs": 3.57 }, "et": { "duration": 795456, "reportedSentences": 0, "validatedSentences": 39, "unvalidatedSentences": -39, "clips": 126, "users": 7, "size": 4642383, "checksum": "12c5baa1ae433c5e85c30fab4f8bc072b962a7ed05b97fcaf41e9784d4cb7c63", "avgDurationSecs": 6.752, "validDurationSecs": 733.85, "totalHrs": 0.22, "validHrs": 0.2 }, "eu": { "duration": 30423996, "reportedSentences": 1, "validatedSentences": 51, "unvalidatedSentences": -37, "clips": 5482, "users": 77, "size": 177969641, "checksum": "bdad6fdc620229a7e4e4109a737e2a562c0a751b7565240e7a4ee9f0fdbf3dd7", "avgDurationSecs": 5.496, "validDurationSecs": 11663.98, "totalHrs": 8.45, "validHrs": 3.24 }, "fa": { "duration": 17234748, "reportedSentences": 62, "validatedSentences": 1018, "unvalidatedSentences": -762, "clips": 4986, "users": 51, "size": 99584582, "checksum": 
"ef9d5f7f6007bd5d0453c72c7fbb44d71f63d00513127cbd23cf49e5ba8e3c14", "avgDurationSecs": 3.968, "validDurationSecs": 7281.59, "totalHrs": 4.79, "validHrs": 2.03 }, "fi": { "duration": 324036, "reportedSentences": 0, "validatedSentences": 2, "unvalidatedSentences": 292, "clips": 60, "users": 8, "size": 1946839, "checksum": "5f113127d92f39a8ab4caf4d94072b06a0feab31efb39577f2ea423e19a0d5f0", "avgDurationSecs": 4.618, "validDurationSecs": 1834.81, "totalHrs": 0.09, "validHrs": 0.51 }, "fr": { "duration": 24285780, "reportedSentences": 90, "validatedSentences": 766, "unvalidatedSentences": 1088, "clips": 4604, "users": 183, "size": 141704361, "checksum": "c427bfa2ae2b710b4c085a7d4166cedbc362af67d8b889cee903792076239649", "avgDurationSecs": 5.025, "validDurationSecs": 41350.47, "totalHrs": 6.75, "validHrs": 11.48 }, "fy-NL": { "duration": 106920, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 13, "users": 3, "size": 535400, "checksum": "e6a0231b25fd6e838fcf3f91f794e469c19287bf0a548cc6051d1a57d5ee49f2", "avgDurationSecs": 4.854, "validDurationSecs": 567.6, "totalHrs": 0.03, "validHrs": 0.16 }, "ga-IE": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 5, "clips": 0, "users": 0, "size": -7724, "checksum": "289e8542a404d4a68a418fa2925e9d914842778ec5477d3e16b1fe1a1315916e", "avgDurationSecs": 3.592, "validDurationSecs": 373.6, "totalHrs": 0, "validHrs": 0.11 }, "gl": { "duration": 590243652, "reportedSentences": 149, "validatedSentences": 2058, "unvalidatedSentences": -1099, "clips": 118211, "users": 2669, "size": 3410747138, "checksum": "8b5c52fb10c0ceda8f0b7d74c31884373f6be692fef71451f543cff03caf3624", "avgDurationSecs": 5.09, "validDurationSecs": 78465.38, "totalHrs": 163.96, "validHrs": 21.79 }, "gn": { "duration": 2244636, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 456, "users": 16, "size": 13162118, "checksum": 
"911578f233b385f06e6e60b146503ab3205bd00dce9cb3132ed80782736806bb", "avgDurationSecs": 4.554, "validDurationSecs": 49.6, "totalHrs": 0.62, "validHrs": 0.01 }, "ha": { "duration": 61992, "reportedSentences": 0, "validatedSentences": 55, "unvalidatedSentences": -54, "clips": 10, "users": 2, "size": 359620, "checksum": "02485733363a6310960ea5e066b2344a291aea15fbf07c624188a3fdd2325eb8", "avgDurationSecs": 4.342, "validDurationSecs": 214.49, "totalHrs": 0.02, "validHrs": 0.06 }, "he": { "duration": 1346580, "reportedSentences": 0, "validatedSentences": 48, "unvalidatedSentences": -47, "clips": 247, "users": 8, "size": 7899729, "checksum": "40a1afbe28bf0700b2f05ea01abf3da130faf7098feacab07004d863af5b2181", "avgDurationSecs": 4.529, "validDurationSecs": 926.48, "totalHrs": 0.37, "validHrs": 0.26 }, "hi": { "duration": 2298240, "reportedSentences": 1, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 342, "users": 9, "size": 13520585, "checksum": "48b5728bc49cdadfb78a66c365fe1a15f6393dc38dae979c95302307d07fdf2f", "avgDurationSecs": 5.005, "validDurationSecs": 733.83, "totalHrs": 0.64, "validHrs": 0.2 }, "hsb": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -1138, "checksum": "213b69c6e6ea39016c5ae2c32dffc4c61aa6a45b2b3960189d914ffba7ccf253", "avgDurationSecs": 6.245, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ht": { "duration": 84672, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 15, "users": 1, "size": 500314, "checksum": "742355acc35502154314b3997926c0370de890d99ab7729d49736de3f67c8386", "avgDurationSecs": 5.267, "validDurationSecs": 5.67, "totalHrs": 0.02, "validHrs": 0 }, "hu": { "duration": 7210548, "reportedSentences": 39, "validatedSentences": 12, "unvalidatedSentences": 0, "clips": 1240, "users": 6, "size": 42442463, "checksum": "b5815581d9df2ab34817e6c0b78c9f7936be0d3f866b6e6c0c44f5e9a16eec41", "avgDurationSecs": 5.534, 
"validDurationSecs": 4281.16, "totalHrs": 2, "validHrs": 1.19 }, "hy-AM": { "duration": 25497504, "reportedSentences": 14, "validatedSentences": 9, "unvalidatedSentences": -9, "clips": 4946, "users": 31, "size": 150482770, "checksum": "3b6afc134d275edfc79d28d7ae7af830ccdd399b76ccef3971a73116aee860a0", "avgDurationSecs": 5.392, "validDurationSecs": 29291.38, "totalHrs": 7.08, "validHrs": 8.13 }, "ia": { "duration": 90936, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 3, "clips": 20, "users": 0, "size": 532593, "checksum": "0c3ed2626118551bbdc2fc6b6298438ca3b98b5ea905384994fb9be9ba85fb45", "avgDurationSecs": 4.193, "validDurationSecs": 72.84, "totalHrs": 0.03, "validHrs": 0.02 }, "id": { "duration": 1340748, "reportedSentences": 0, "validatedSentences": 52, "unvalidatedSentences": -1, "clips": 267, "users": 16, "size": 7830379, "checksum": "45ec154d8fdcebac0dbfd205569d75d2816e44aa078d1377ad05828765f22bd9", "avgDurationSecs": 4.001, "validDurationSecs": 805.56, "totalHrs": 0.37, "validHrs": 0.23 }, "ig": { "duration": 5040, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 1, "users": 1, "size": 27298, "checksum": "28591f6c82628485e06f53091bfb9fea174d97c2126e49bfac204146f73f2dd3", "avgDurationSecs": 5.462, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "is": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 1531, "checksum": "8ec476f3c955dc2e2079544e297cbdb96d82b67ab44e4599526ad203f215b9f0", "avgDurationSecs": 7.101, "validDurationSecs": 134.92, "totalHrs": 0, "validHrs": 0.04 }, "it": { "duration": 11710152, "reportedSentences": 8, "validatedSentences": 116, "unvalidatedSentences": -33, "clips": 1957, "users": 39, "size": 68237439, "checksum": "9c578e0e3a99b171e1ff160d19e39f7c8bb6d121cd7012b63aa9a24e8313d2a2", "avgDurationSecs": 5.415, "validDurationSecs": 3118.82, "totalHrs": 3.25, "validHrs": 0.87 }, "ja": { "duration": 
283441608, "reportedSentences": 299, "validatedSentences": 5932, "unvalidatedSentences": 4095, "clips": 82311, "users": 1011, "size": 1656809031, "checksum": "843e09a81524af61f55813173d08f8c401795f84630e3cb30860b70c9c699ee7", "avgDurationSecs": 4.552, "validDurationSecs": 292016.64, "totalHrs": 78.73, "validHrs": 81.11 }, "ka": { "duration": 11130480, "reportedSentences": 0, "validatedSentences": 338, "unvalidatedSentences": 259, "clips": 2017, "users": 39, "size": 65606216, "checksum": "712f5476dff0173c737475936ad30f7fb35299f6cded5039c37fa80f1ea07892", "avgDurationSecs": 5.199, "validDurationSecs": 8097.38, "totalHrs": 3.09, "validHrs": 2.24 }, "kab": { "duration": 1744668, "reportedSentences": 72, "validatedSentences": 18, "unvalidatedSentences": -6, "clips": 473, "users": 8, "size": 9727910, "checksum": "4a4d246eeb0715bcfe3243bb6c77ca81626e799975f7fe545d0228cc3dba82d3", "avgDurationSecs": 3.341, "validDurationSecs": 1247.63, "totalHrs": 0.48, "validHrs": 0.35 }, "kk": { "duration": 228348, "reportedSentences": 0, "validatedSentences": 8, "unvalidatedSentences": -7, "clips": 41, "users": 6, "size": 1187048, "checksum": "eb2b77e90f2caa679101131d3d9764a424fc76a93b79214f110f8bc3fbaa3774", "avgDurationSecs": 4.931, "validDurationSecs": 193.87, "totalHrs": 0.07, "validHrs": 0.05 }, "kln": { "duration": 39672, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 7, "users": 2, "size": 158270, "checksum": "0381c6c550886ccebaa9a2075cae4f010ea7358cc3a6b1afd2dea40a44073511", "avgDurationSecs": 4.705, "validDurationSecs": 854.66, "totalHrs": 0.01, "validHrs": 0.23 }, "kmr": { "duration": 2229624, "reportedSentences": 0, "validatedSentences": 9, "unvalidatedSentences": 258, "clips": 395, "users": 5, "size": 12598043, "checksum": "e4b6e980a3f8ac30ed97838823358ab3c2b8978de661716021f8c3159007a7d5", "avgDurationSecs": 4.095, "validDurationSecs": 4680.59, "totalHrs": 0.62, "validHrs": 1.3 }, "ko": { "duration": 6001488, "reportedSentences": 0, 
"validatedSentences": 255, "unvalidatedSentences": -255, "clips": 1421, "users": 26, "size": 35040626, "checksum": "025b56e2b4354aae15256949ba7fe118b462fab8524d907fa4040bace748d4b6", "avgDurationSecs": 5.37, "validDurationSecs": 428.64, "totalHrs": 1.67, "validHrs": 0.12 }, "ky": { "duration": 567180, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 255, "clips": 114, "users": 3, "size": 3363435, "checksum": "3b97cfaa402c77f675a95c7c62bfc20774a80372195c442d84c352034f00984f", "avgDurationSecs": 4.554, "validDurationSecs": 1035.98, "totalHrs": 0.16, "validHrs": 0.29 }, "lg": { "duration": 0, "reportedSentences": 0, "validatedSentences": 2, "unvalidatedSentences": -1, "clips": 0, "users": 0, "size": 129665, "checksum": "5c869b0c3bbd5836b3a2e44b27eedf39d04bea4da499c76f75aee1c86ce19a9f", "avgDurationSecs": 5.784, "validDurationSecs": 57.84, "totalHrs": 0, "validHrs": 0.02 }, "lij": { "duration": 7599528, "reportedSentences": 2, "validatedSentences": 294, "unvalidatedSentences": 434, "clips": 1678, "users": 10, "size": 41986997, "checksum": "34a98990e45ef4c923b1174f210ca42cabcce29405889839d5f3c88d649cba14", "avgDurationSecs": 3.918, "validDurationSecs": 6299.99, "totalHrs": 2.12, "validHrs": 1.75 }, "lo": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 402, "checksum": "7acda03333851716f329ea07e1057a2c77476317beb1ab4fbec8ad43ab5b2d5c", "avgDurationSecs": 6.453, "validDurationSecs": 45.17, "totalHrs": 0, "validHrs": 0.01 }, "lt": { "duration": 147924, "reportedSentences": 0, "validatedSentences": 12, "unvalidatedSentences": 23, "clips": 25, "users": 4, "size": 867916, "checksum": "787d852ec6503f38f7c578dfee566ac6d165475d1b69157d8708d9044dfa37cf", "avgDurationSecs": 5.144, "validDurationSecs": 478.16, "totalHrs": 0.04, "validHrs": 0.14 }, "ltg": { "duration": 4165128, "reportedSentences": 4, "validatedSentences": 64, "unvalidatedSentences": -54, "clips": 728, "users": 4, 
"size": 24593324, "checksum": "50a0f1bd115d298ef956716428ce9744182aea57171eecf009e43a23153e25a5", "avgDurationSecs": 4.602, "validDurationSecs": 2577.56, "totalHrs": 1.16, "validHrs": 0.71 }, "luo": { "duration": 42372, "reportedSentences": 0, "validatedSentences": 2, "unvalidatedSentences": -2, "clips": 5, "users": 1, "size": 120127, "checksum": "49cacebc580834ea7b7090df3b891bffd9461c4064670d7be5bb891046de508d", "avgDurationSecs": 5.205, "validDurationSecs": 35.25, "totalHrs": 0.02, "validHrs": 0 }, "lv": { "duration": 11129832, "reportedSentences": 18, "validatedSentences": 137, "unvalidatedSentences": -23, "clips": 1747, "users": 141, "size": 63419962, "checksum": "733246bafe7947cdf680b463e620d7384bc776984320929e16b6e1c15432bb02", "avgDurationSecs": 4.768, "validDurationSecs": 15699.52, "totalHrs": 3.09, "validHrs": 4.36 }, "mdf": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -23, "checksum": "56d35bda3d7f478844ff8663c32066c5cc993a4c85f03317cd08190a1278f352", "avgDurationSecs": 5.25, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "mhr": { "duration": 182340, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 35, "users": 3, "size": 1052644, "checksum": "0ce20cc001f9df9cea34b29ffdcec18d2aabc0b98b66e88416f7f27dd1f33a88", "avgDurationSecs": 4.623, "validDurationSecs": 301.14, "totalHrs": 0.05, "validHrs": 0.08 }, "mk": { "duration": 67038588, "reportedSentences": 45, "validatedSentences": 548, "unvalidatedSentences": -489, "clips": 12764, "users": 258, "size": 380250817, "checksum": "b427d5f4020a113ca46f784b2894f3df7150a7aa12aa2c9d73c337b9aa444ed3", "avgDurationSecs": 4.699, "validDurationSecs": 17023.37, "totalHrs": 18.62, "validHrs": 4.73 }, "ml": { "duration": 3291228, "reportedSentences": 45, "validatedSentences": 20, "unvalidatedSentences": -18, "clips": 719, "users": 5, "size": 19311700, "checksum": 
"f6966dbfcfcd463c2d9195745fe563e05508aa884229993a0d6889f076d31d88", "avgDurationSecs": 4.216, "validDurationSecs": 1461.49, "totalHrs": 0.92, "validHrs": 0.41 }, "mn": { "duration": 58608, "reportedSentences": 1, "validatedSentences": 2, "unvalidatedSentences": -1, "clips": 6, "users": 4, "size": 291921, "checksum": "fe0a314f977ff391c4433bfe9c68865e3572945951b884852872c651317903b5", "avgDurationSecs": 5.253, "validDurationSecs": 461.12, "totalHrs": 0.02, "validHrs": 0.13 }, "mr": { "duration": 138096, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 15, "users": 3, "size": 808589, "checksum": "6838278c3827ba1fe563694bdf0ba9439115deb0a6cb404d84817c3f5af55aa3", "avgDurationSecs": 6.197, "validDurationSecs": 68.1, "totalHrs": 0.03, "validHrs": 0.02 }, "mrj": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -15169, "checksum": "fc9913b6456d9b6c86714267dfd6569e256576f79f1c9b54d9f345fbb6fc54fe", "avgDurationSecs": 4.191, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "mt": { "duration": 27900, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 5, "users": 1, "size": 168810, "checksum": "4f832199e35c088b8da578ee404e87f6451998bd49372b6a522ee82c2c68fe84", "avgDurationSecs": 4.754, "validDurationSecs": 111.39, "totalHrs": 0.01, "validHrs": 0.03 }, "myv": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -3127, "checksum": "c8c899378b233bd0a9e9a0fe59ad5883c1423d0d0869cddf101c9edca345f0f1", "avgDurationSecs": 5.774, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "nan-tw": { "duration": 2506680, "reportedSentences": 10, "validatedSentences": 217, "unvalidatedSentences": -144, "clips": 849, "users": 29, "size": 15167893, "checksum": "8fe8ef56fa85d95d04f23411626ee5b2b397002f0501013a56afeebee53ab40c", "avgDurationSecs": 2.637, 
"validDurationSecs": 41467.76, "totalHrs": 0.7, "validHrs": 11.52 }, "ne-NP": { "duration": 180612, "reportedSentences": 0, "validatedSentences": 2, "unvalidatedSentences": -1, "clips": 41, "users": 2, "size": 1056502, "checksum": "f2f20ff7c7d0b23b9c16dffcf886cffbfbcf66bbdc2c1eaadb3b26329a7c104e", "avgDurationSecs": 4.015, "validDurationSecs": 185.06, "totalHrs": 0.05, "validHrs": 0.06 }, "nhi": { "duration": 242676, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 55, "users": 0, "size": 1424376, "checksum": "d66f031a9260f9417679a64b0127884e3d880f87dde7711b52416421f6d7bc29", "avgDurationSecs": 4.517, "validDurationSecs": 68.81, "totalHrs": 0.07, "validHrs": 0.02 }, "nl": { "duration": 5610852, "reportedSentences": 4, "validatedSentences": 67, "unvalidatedSentences": 673, "clips": 1176, "users": 13, "size": 32962280, "checksum": "5598af8d7aa7350a6e327ad97175aed67005c8ad8a6b43ed23c69e653b1eef59", "avgDurationSecs": 4.367, "validDurationSecs": 4209.57, "totalHrs": 1.56, "validHrs": 1.17 }, "nn-NO": { "duration": 20916, "reportedSentences": 0, "validatedSentences": 71, "unvalidatedSentences": -71, "clips": 5, "users": 1, "size": 124044, "checksum": "d949c3dceaaa91bce2ce1fc5adb21adc3f4e479c03af30883a826ee767c1d67a", "avgDurationSecs": 4.379, "validDurationSecs": 156.8, "totalHrs": 0.01, "validHrs": 0.05 }, "nso": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 8, "checksum": "3ca2fa8a5b5e79540b3c5a83e65036f2c118fedf3027a86e2265e47bbaa7396c", "avgDurationSecs": 4.63, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "oc": { "duration": 508644, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 105, "users": 0, "size": 3001221, "checksum": "bdb0b2bfdd95b4ff2f9ad15b7d2e93dc3f9e1436f259a9fc7850d83804ff9980", "avgDurationSecs": 4.873, "validDurationSecs": 267.43, "totalHrs": 0.14, "validHrs": 0.07 }, "or": { "duration": 
23079492, "reportedSentences": 4, "validatedSentences": 0, "unvalidatedSentences": 52, "clips": 3831, "users": 4, "size": 135586727, "checksum": "98230b027614a6ce946bec172e7261c77ac647a5a0dca09a55b1e7174f98eadc", "avgDurationSecs": 5.57, "validDurationSecs": 2472.5, "totalHrs": 6.41, "validHrs": 0.69 }, "os": { "duration": 1770048, "reportedSentences": 1, "validatedSentences": 175, "unvalidatedSentences": -175, "clips": 314, "users": 2, "size": 10430028, "checksum": "5dab24f662aa5ed4123b4c28b2cf346140754b4efe69111642cb572663e2a6b4", "avgDurationSecs": 5.461, "validDurationSecs": 653.38, "totalHrs": 0.49, "validHrs": 0.18 }, "pa-IN": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 554, "checksum": "89dc50097a37a43bddaf320aed2df54632304f1ddd1ec87d20be235e30a9a1ca", "avgDurationSecs": 4.826, "validDurationSecs": 28.96, "totalHrs": 0, "validHrs": 0.01 }, "pl": { "duration": 8194428, "reportedSentences": 14, "validatedSentences": 650, "unvalidatedSentences": -451, "clips": 1422, "users": 27, "size": 48366913, "checksum": "daed31407d5b2c886d18ccb89c7520238961755ebef80f0032058698da0b2065", "avgDurationSecs": 4.557, "validDurationSecs": 13141.76, "totalHrs": 2.27, "validHrs": 3.65 }, "ps": { "duration": 91468872, "reportedSentences": 127, "validatedSentences": 6506, "unvalidatedSentences": 1690, "clips": 18577, "users": 133, "size": 515774187, "checksum": "386b25cdfa49881d7b863d537faf200bb791915a8048a8bc541e73b254878df2", "avgDurationSecs": 4.979, "validDurationSecs": 155171.77, "totalHrs": 25.41, "validHrs": 43.1 }, "pt": { "duration": 12593772, "reportedSentences": 2, "validatedSentences": 35, "unvalidatedSentences": 10, "clips": 2036, "users": 57, "size": 74156169, "checksum": "0323c6b99d1618868fa558c5022a1765612b7f67c99d238eebb73de09b37625f", "avgDurationSecs": 4.149, "validDurationSecs": 11880.55, "totalHrs": 3.5, "validHrs": 3.3 }, "quy": { "duration": 76572, "reportedSentences": 0, 
"validatedSentences": 0, "unvalidatedSentences": 0, "clips": 15, "users": 2, "size": 450627, "checksum": "4ead29b9b0e0c69924f93e263f2154c852450eb66a126f5ba1e3179bfe1fd20b", "avgDurationSecs": 5.398, "validDurationSecs": 15.46, "totalHrs": 0.02, "validHrs": 0 }, "ro": { "duration": 688428, "reportedSentences": 1, "validatedSentences": 2, "unvalidatedSentences": 76, "clips": 189, "users": 2, "size": 4015263, "checksum": "8edc3b2768a332b8d465f2f71ad622f72304573060f82f77e000d3e9532fcf48", "avgDurationSecs": 4.03, "validDurationSecs": 790.9, "totalHrs": 0.19, "validHrs": 0.22 }, "ru": { "duration": 10108080, "reportedSentences": 24, "validatedSentences": 300, "unvalidatedSentences": -220, "clips": 1828, "users": 69, "size": 58543978, "checksum": "525461b8ab3df2ef98d14a7c8f26a74497aa3bc471f1f17159a8d43a93c88375", "avgDurationSecs": 5.174, "validDurationSecs": 8726.07, "totalHrs": 2.81, "validHrs": 2.42 }, "rw": { "duration": 292140, "reportedSentences": 0, "validatedSentences": 3, "unvalidatedSentences": -3, "clips": 89, "users": 9, "size": 1852501, "checksum": "f3b00094579052557cf63908f0152d57d7ed3d834a73255d2def0c9cc8ee91a9", "avgDurationSecs": 5.007, "validDurationSecs": 276.72, "totalHrs": 0.09, "validHrs": 0.08 }, "sah": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -6995, "checksum": "4db2ef24526a0f5b9eacffe46616e5a43970243998abceee84d0fb711f91b585", "avgDurationSecs": 6.137, "validDurationSecs": 49.1, "totalHrs": 0, "validHrs": 0.02 }, "sat": { "duration": 125388, "reportedSentences": 0, "validatedSentences": 14, "unvalidatedSentences": -7, "clips": 25, "users": 0, "size": 742393, "checksum": "bbdd77609991531d17ccefd5739cdc1767b54c46f477045a29ff14b1c0623cfa", "avgDurationSecs": 4.478, "validDurationSecs": 267.25, "totalHrs": 0.03, "validHrs": 0.07 }, "sc": { "duration": 1428264, "reportedSentences": 0, "validatedSentences": 172, "unvalidatedSentences": -135, "clips": 279, "users": 8, 
"size": 8383561, "checksum": "c76f2fdf79a1bdef91594479380e3753a973fe0aa9472a2ec75aaa409a3cee8d", "avgDurationSecs": 4.631, "validDurationSecs": 2718.75, "totalHrs": 0.4, "validHrs": 0.76 }, "sd": { "duration": 47404728, "reportedSentences": 2, "validatedSentences": 57, "unvalidatedSentences": 3, "clips": 10736, "users": 16, "size": 243013307, "checksum": "d60ebc56a39e67340278525515f8ebec524d61cf1c21724d248270f355699189", "avgDurationSecs": 4.417, "validDurationSecs": 1276.47, "totalHrs": 13.16, "validHrs": 0.35 }, "sk": { "duration": 262116, "reportedSentences": 1, "validatedSentences": 221, "unvalidatedSentences": 123, "clips": 65, "users": 4, "size": 1498944, "checksum": "e4f76256f8480090bad4d981f871b6f2bca945fa05e5ec971e0eda1d47dcab0c", "avgDurationSecs": 4.078, "validDurationSecs": 2607.27, "totalHrs": 0.07, "validHrs": 0.73 }, "skr": { "duration": 0, "reportedSentences": 0, "validatedSentences": 12, "unvalidatedSentences": -12, "clips": 0, "users": 0, "size": 912, "checksum": "855d65e3cb3c2b5906429b9e93a7d7b1f2762be224672a71e9f28366ce086eb4", "avgDurationSecs": 4.143, "validDurationSecs": 8.29, "totalHrs": 0, "validHrs": 0 }, "sl": { "duration": 12845988, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 28, "clips": 2732, "users": 475, "size": 74989924, "checksum": "54ba3265e466235833cc5dddd31e08ac5d9b622761e92b737d23cdef8b30590f", "avgDurationSecs": 3.952, "validDurationSecs": 11047.33, "totalHrs": 3.57, "validHrs": 3.07 }, "sq": { "duration": 839844, "reportedSentences": 1, "validatedSentences": 1, "unvalidatedSentences": 20, "clips": 151, "users": 11, "size": 4924143, "checksum": "a69dd796f2262cdd97a88b02ff8957b1bcd41603dbb1ede44d32777e442da29d", "avgDurationSecs": 5.098, "validDurationSecs": 430.41, "totalHrs": 0.23, "validHrs": 0.12 }, "sr": { "duration": 163656, "reportedSentences": 0, "validatedSentences": 24, "unvalidatedSentences": -23, "clips": 43, "users": 4, "size": 967543, "checksum": 
"e045df293cd4b6c6784ec879dc84e2fe369dfa8112be33913dbaa161f8f2afcb", "avgDurationSecs": 3.232, "validDurationSecs": 131.32, "totalHrs": 0.05, "validHrs": 0.04 }, "sv-SE": { "duration": 2714220, "reportedSentences": 0, "validatedSentences": 35, "unvalidatedSentences": 40, "clips": 609, "users": 3, "size": 15939248, "checksum": "ab865dea0eddb088a87980d7c400cd15dcb168b267fdb28e630e711d9eb79488", "avgDurationSecs": 4.016, "validDurationSecs": 1016.49, "totalHrs": 0.75, "validHrs": 0.28 }, "sw": { "duration": 197100, "reportedSentences": 1, "validatedSentences": 16, "unvalidatedSentences": -16, "clips": 30, "users": 8, "size": 1012798, "checksum": "726ec386b9448b683e3cf37cc8e44e5b6c1033395d3297b3ad6a2afa46c58b8a", "avgDurationSecs": 5.387, "validDurationSecs": 163.9, "totalHrs": 0.06, "validHrs": 0.04 }, "ta": { "duration": 340488, "reportedSentences": 0, "validatedSentences": 1, "unvalidatedSentences": -1, "clips": 47, "users": 8, "size": 1863868, "checksum": "f4d01a296f1bfa3575299e34db103a0c6b9470438f6e1e83177d3097c7a7cfc5", "avgDurationSecs": 6.199, "validDurationSecs": 318.44, "totalHrs": 0.09, "validHrs": 0.09 }, "te": { "duration": 197028, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 42, "users": 6, "size": 1158522, "checksum": "8c05337553e2f98fc0fdf898a6b94501ff70d688548d154a18b0d6f26f35bbaa", "avgDurationSecs": 4.178, "validDurationSecs": 115.31, "totalHrs": 0.05, "validHrs": 0.03 }, "th": { "duration": 2178144, "reportedSentences": 7, "validatedSentences": 182, "unvalidatedSentences": -178, "clips": 493, "users": 29, "size": 12837650, "checksum": "ae332f94c69a60d62265cd136cbe037c312ecd52058d8d2ccec109111c0be71c", "avgDurationSecs": 4.191, "validDurationSecs": 2446.84, "totalHrs": 0.6, "validHrs": 0.68 }, "ti": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -98, "checksum": "ba60abc72a9c372a7f2b375774a06c48566dd703df0ed7f4056673f60e14d8ae", 
"avgDurationSecs": 4.998, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "tig": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 1789, "checksum": "e7041d1eb07073c10b6cad3c7fb3cefa1b06a832d960fd4b4b96bca1ad60f43f", "avgDurationSecs": 6.162, "validDurationSecs": 18.49, "totalHrs": 0, "validHrs": 0.01 }, "tk": { "duration": 281808, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 41, "users": 4, "size": 1506906, "checksum": "84bcc6e35b9059055953f8a981a32d564dba5601a5a256c205f513d4561b1ba3", "avgDurationSecs": 5.423, "validDurationSecs": 52.38, "totalHrs": 0.08, "validHrs": 0.01 }, "tn": { "duration": 9881064, "reportedSentences": 0, "validatedSentences": 441, "unvalidatedSentences": -427, "clips": 2246, "users": 13, "size": 48889631, "checksum": "8cdc306c671bdd7674857754c5474daa33dcbe6cd9f2d7e9fa0bb7eaf2673bbf", "avgDurationSecs": 4.37, "validDurationSecs": 15254.87, "totalHrs": 2.75, "validHrs": 4.24 }, "tok": { "duration": 196560, "reportedSentences": 0, "validatedSentences": 90, "unvalidatedSentences": -84, "clips": 45, "users": 2, "size": 1152604, "checksum": "4ac83b57790016aaee91513abbea484f64ae33eb3411595c616f30cc5def9855", "avgDurationSecs": 4.172, "validDurationSecs": 206.58, "totalHrs": 0.05, "validHrs": 0.06 }, "tr": { "duration": 3931380, "reportedSentences": 17, "validatedSentences": 409, "unvalidatedSentences": 49, "clips": 657, "users": 24, "size": 23035282, "checksum": "b16a43fc1fa8700c602da57ab56dc07d303b9b7baf685e2b28e9e273d30cba1e", "avgDurationSecs": 3.798, "validDurationSecs": 3700.28, "totalHrs": 1.09, "validHrs": 1.02 }, "ts": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -3, "checksum": "c4d33e7851d31b8980bcc242ea1d56bc85f7cfa343e49c322331647ae0b473f1", "avgDurationSecs": 8.928, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "tt": { 
"duration": 4146732, "reportedSentences": 0, "validatedSentences": 9, "unvalidatedSentences": -8, "clips": 767, "users": 8, "size": 24465993, "checksum": "80b3c6bdbec06c10aa009d89292f89b138f2a0734dd3b4e227238de7c5e1ffa3", "avgDurationSecs": 3.825, "validDurationSecs": 2276.53, "totalHrs": 1.15, "validHrs": 0.63 }, "tw": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 113, "checksum": "e937fb87a4f4d6fd289447ce860e82e3d02fdc8361379a668f99b953e4dfa52a", "avgDurationSecs": 4.31, "validDurationSecs": 8.62, "totalHrs": 0, "validHrs": 0 }, "ug": { "duration": 5069556, "reportedSentences": 1, "validatedSentences": 32, "unvalidatedSentences": 36, "clips": 816, "users": 11, "size": 31366552, "checksum": "0a5e3a672d815499f4c9570604c934bd5496a1b03b125bb891aa3473567b67d1", "avgDurationSecs": 5.673, "validDurationSecs": 144934.61, "totalHrs": 1.41, "validHrs": 40.26 }, "uk": { "duration": 1611864, "reportedSentences": 1, "validatedSentences": 1, "unvalidatedSentences": 1650, "clips": 360, "users": 16, "size": 9620788, "checksum": "785ce2c0c3ad8004fde1559a8cb560d22b5fcde35dd55cd3804256824a1b4465", "avgDurationSecs": 4.642, "validDurationSecs": 1160.17, "totalHrs": 0.45, "validHrs": 0.32 }, "ur": { "duration": 643968, "reportedSentences": 0, "validatedSentences": 76, "unvalidatedSentences": -75, "clips": 121, "users": 10, "size": 3510270, "checksum": "9b06700606a2356ea9d4f52b8c4bdf9f6350630d8f145776055ac3967a064980", "avgDurationSecs": 4.295, "validDurationSecs": 780.4, "totalHrs": 0.18, "validHrs": 0.22 }, "uz": { "duration": 1127772, "reportedSentences": 1, "validatedSentences": 0, "unvalidatedSentences": 1, "clips": 199, "users": 11, "size": 6617075, "checksum": "4eb855cc571f325615927e6a41b445567064448d2be9a15937c738883d6f9078", "avgDurationSecs": 4.154, "validDurationSecs": 658.06, "totalHrs": 0.31, "validHrs": 0.18 }, "vi": { "duration": 692424, "reportedSentences": 0, "validatedSentences": 13, 
"unvalidatedSentences": -8, "clips": 185, "users": 7, "size": 4045189, "checksum": "0bfb9be27efbf15566ee2a9feb61a13b7793a20c78934708655814faf2029609", "avgDurationSecs": 3.949, "validDurationSecs": 335.99, "totalHrs": 0.19, "validHrs": 0.1 }, "vot": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -79, "checksum": "1458a620c79a215e01f9be52f5af32d2d5564ed7a74f2e443e7aaff67996dfcf", "avgDurationSecs": 2.412, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "xh": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -28, "checksum": "8edd101c64b46ac47847d5b566567c79f118276b9944e686b36a8b6636fc3611", "avgDurationSecs": 6.276, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "yi": { "duration": 51372, "reportedSentences": 0, "validatedSentences": 16, "unvalidatedSentences": -15, "clips": 11, "users": 3, "size": 302818, "checksum": "f7597af16c97a8fa0f0b6538093c07df85a9deaacbd0e2c6b43e4ecf47cfb689", "avgDurationSecs": 4.205, "validDurationSecs": 34.17, "totalHrs": 0.01, "validHrs": 0.01 }, "yo": { "duration": 61704, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 2, "clips": 9, "users": 4, "size": 371274, "checksum": "ed89244ffac4c189bd6f90b2e55bc7a2a793faee665c0211e94dfccc61c4efa2", "avgDurationSecs": 5.982, "validDurationSecs": 166.94, "totalHrs": 0.02, "validHrs": 0.05 }, "yue": { "duration": 185487300, "reportedSentences": 124, "validatedSentences": 20, "unvalidatedSentences": -20, "clips": 45821, "users": 20, "size": 1087371421, "checksum": "8e4b414dcb0b973e347b0e76bcf52faf19a3f641cf753cd1e6a96f0384203d42", "avgDurationSecs": 3.957, "validDurationSecs": 4103.59, "totalHrs": 51.53, "validHrs": 1.14 }, "zgh": { "duration": 699084, "reportedSentences": 1, "validatedSentences": 6, "unvalidatedSentences": -6, "clips": 183, "users": 2, "size": 3235895, "checksum": 
"78f93bee259abf989ed91cfdea0b34ecbf1b7240356c2c12e066f7d651f57109", "avgDurationSecs": 3.423, "validDurationSecs": 482.51, "totalHrs": 0.2, "validHrs": 0.13 }, "zh-CN": { "duration": 11721744, "reportedSentences": 61, "validatedSentences": 22, "unvalidatedSentences": 793, "clips": 2980, "users": 82, "size": 68284500, "checksum": "e71104ee0b8246ef412b7d08fe0965552d4f6990ce5e0ece236b234d983391de", "avgDurationSecs": 4.543, "validDurationSecs": 3741.72, "totalHrs": 3.25, "validHrs": 1.04 }, "zh-HK": { "duration": 1425168, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 7, "clips": 439, "users": 17, "size": 8206114, "checksum": "7425e31e97793615cf9401976355738cd286e95aead930da1f601da4083dc8d8", "avgDurationSecs": 4.166, "validDurationSecs": 38.16, "totalHrs": 0.4, "validHrs": 0.01 }, "zh-TW": { "duration": 1302084, "reportedSentences": 8, "validatedSentences": 94, "unvalidatedSentences": 5, "clips": 384, "users": 8, "size": 7501159, "checksum": "1a28bd64b6cea58c04e688ac2d33772c3c3b0384dc87d47eb4ff39c97b278fc1", "avgDurationSecs": 3.364, "validDurationSecs": 1974.11, "totalHrs": 0.37, "validHrs": 0.54 }, "zu": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -33, "checksum": "bc4da09091af54178f1f3fcaa51b087d2e04875c2d2226a4d16930d8c7d2b48a", "avgDurationSecs": 5.388, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "zza": { "duration": 3711888, "reportedSentences": 278, "validatedSentences": 43, "unvalidatedSentences": 24, "clips": 963, "users": 6, "size": 21237882, "checksum": "1955369eea1132514b3841ae63f9a6fe3c01cb5864869b74927107d53a716c3f", "avgDurationSecs": 4.104, "validDurationSecs": 2926.18, "totalHrs": 1.03, "validHrs": 0.81 } }, "totalDuration": 2037326885, "totalValidDurationSecs": 1853557, "totalHrs": 566, "totalValidHrs": 515 } ================================================ FILE: datasets/scripted-speech/cv-corpus-21.0-2025-03-14.json 
================================================ { "locales": { "ab": { "buckets": { "dev": 9150, "test": 9117, "train": 21037, "validated": 42091, "invalidated": 5283, "other": 16628 }, "duration": 331373232, "reportedSentences": 225, "validatedSentences": 1046778, "unvalidatedSentences": 5, "clips": 64002, "splits": { "accent": {}, "age": { "": 0.2, "twenties": 0.17, "thirties": 0.17, "teens": 0.26, "fourties": 0.08, "fifties": 0.06, "sixties": 0.05, "seventies": 0.01, "eighties": 0.01, "nineties": 0 }, "gender": { "": 0.2, "male_masculine": 0.17, "female_feminine": 0.64, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 64002, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 417, "size": 1968855358, "checksum": "ba64a435f1413e9eaa328ec6d148b55131f23f60cc168ef1fa5b4df8d421da7f", "avgDurationSecs": 5.178, "validDurationSecs": 217928.045, "totalHrs": 92.04, "validHrs": 60.53 }, "af": { "buckets": { "dev": 123, "test": 117, "train": 131, "validated": 371, "invalidated": 197, "other": 301 }, "duration": 5108004, "reportedSentences": 0, "validatedSentences": 4779, "unvalidatedSentences": 16587, "clips": 869, "splits": { "accent": {}, "age": { "": 0.35, "twenties": 0.35, "thirties": 0, "teens": 0.01, "fourties": 0.14, "fifties": 0.16, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.5, "male_masculine": 0.35, "female_feminine": 0.14, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 869, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, 
"news_current_affairs": 0, "technology_robotics": 0 } }, "users": 58, "size": 31166300, "checksum": "07e6c1ec1a4dc4044d0ef564349a11ae132744c0bba521b765ce0878dfcaac1b", "avgDurationSecs": 5.878, "validDurationSecs": 2180.747, "totalHrs": 1.41, "validHrs": 0.6 }, "am": { "buckets": { "dev": 248, "test": 246, "train": 514, "validated": 1008, "invalidated": 28, "other": 563 }, "duration": 10004400, "reportedSentences": 11, "validatedSentences": 2205, "unvalidatedSentences": 15, "clips": 1599, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0.15, "thirties": 0.61, "teens": 0, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.24, "male_masculine": 0.76, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1599, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 40, "size": 58959697, "checksum": "96e19ee23235d139efaba48bca825a38be5f947f891fcc058066f097901b2cd9", "avgDurationSecs": 6.257, "validDurationSecs": 6306.714, "totalHrs": 2.77, "validHrs": 1.75 }, "ar": { "buckets": { "dev": 10466, "test": 10496, "train": 28526, "validated": 78917, "invalidated": 15182, "other": 41494 }, "duration": 564156342, "reportedSentences": 2131, "validatedSentences": 59434, "unvalidatedSentences": 1742100, "clips": 135593, "splits": { "accent": {}, "age": { "": 0.55, "twenties": 0.29, "thirties": 0.11, "teens": 0.03, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.55, "male_masculine": 0.28, "female_feminine": 0.17, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 135593, "agriculture_food": 0, "automotive_transport": 0, 
"finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1620, "size": 3473267222, "checksum": "d417e72f24c42c8d55c3e229636da56198beb6bcbf6323957f38340d4b49702f", "avgDurationSecs": 4.161, "validDurationSecs": 328346.788, "totalHrs": 156.71, "validHrs": 91.2 }, "as": { "buckets": { "dev": 485, "test": 366, "train": 950, "validated": 1801, "invalidated": 283, "other": 2572 }, "duration": 27454637, "reportedSentences": 247, "validatedSentences": 7142, "unvalidatedSentences": 3359, "clips": 4656, "splits": { "accent": {}, "age": { "": 0.28, "twenties": 0.16, "thirties": 0.02, "teens": 0, "fourties": 0.52, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.82, "male_masculine": 0.18, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 4650, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 6, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 6 } }, "users": 51, "size": 166974285, "checksum": "1f2646de2a7c11fffcc7cc683dd6197af953600d8eab9c39083a3dc3658b3a54", "avgDurationSecs": 5.897, "validDurationSecs": 10619.803, "totalHrs": 7.62, "validHrs": 2.94 }, "ast": { "buckets": { "dev": 113, "test": 199, "train": 430, "validated": 749, "invalidated": 31, "other": 899 }, "duration": 7444944, "reportedSentences": 0, "validatedSentences": 1642, "unvalidatedSentences": 290, "clips": 1679, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.1, "thirties": 0.12, "teens": 0.04, "fourties": 0.28, "fifties": 0.21, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male_masculine": 0.6, 
"female_feminine": 0.15, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1679, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 30, "size": 42479346, "checksum": "88c410b36a62f85a0dfb7b0326f8e1455377d239a28a406a95b3d1163659acf6", "avgDurationSecs": 4.434, "validDurationSecs": 3321.181, "totalHrs": 2.06, "validHrs": 0.92 }, "az": { "buckets": { "dev": 71, "test": 92, "train": 146, "validated": 309, "invalidated": 50, "other": 515 }, "duration": 4753116, "reportedSentences": 5, "validatedSentences": 93161, "unvalidatedSentences": 1849, "clips": 874, "splits": { "accent": {}, "age": { "": 0.2, "twenties": 0.12, "thirties": 0.67, "teens": 0.01, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.81, "male_masculine": 0.19, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 874, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 36, "size": 34052238, "checksum": "b205514ce6bfdbed3fbc67445c23fde0ec886032dd6a9bd51b43b713e0f51e1a", "avgDurationSecs": 5.438, "validDurationSecs": 1680.449, "totalHrs": 1.32, "validHrs": 0.46 }, "ba": { "buckets": { "dev": 14545, "test": 14575, "train": 119093, "validated": 210118, "invalidated": 7992, "other": 113 }, "duration": 966014964, "reportedSentences": 868, "validatedSentences": 153966, "unvalidatedSentences": 7, "clips": 218223, "splits": { "accent": {}, "age": { "": 0.3, "twenties": 0.17, "thirties": 0.17, 
"teens": 0.04, "fourties": 0.06, "fifties": 0.05, "sixties": 0.19, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male_masculine": 0.3, "female_feminine": 0.39, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 218223, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 924, "size": 5443356471, "checksum": "8978e8f586c025fb35ca3abc9616acaa1c4f348ba301df6ddaf227ad812030e8", "avgDurationSecs": 4.427, "validDurationSecs": 930136.293, "totalHrs": 268.33, "validHrs": 258.37 }, "bas": { "buckets": { "dev": 1328, "test": 1547, "train": 2109, "validated": 11134, "invalidated": 1217, "other": 102 }, "duration": 48872484, "reportedSentences": 8, "validatedSentences": 5226, "unvalidatedSentences": 105, "clips": 12453, "splits": { "accent": {}, "age": { "": 0.39, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0.61, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 12449, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 4, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 52, "size": 253326878, "checksum": "e731cb7a6408106dad1ce9c138348852cb91738b54789ed4f2a9568ad6a20ffe", "avgDurationSecs": 3.925, "validDurationSecs": 43695.996, "totalHrs": 13.57, "validHrs": 12.13 }, "be": { "buckets": { "dev": 15879, "test": 15877, "train": 347673, "validated": 1354857, "invalidated": 36746, "other": 16367 }, "duration": 6740607024, 
"reportedSentences": 3197, "validatedSentences": 379528, "unvalidatedSentences": 1946, "clips": 1407970, "splits": { "accent": {}, "age": { "": 0.87, "twenties": 0.03, "thirties": 0.04, "teens": 0.01, "fourties": 0.04, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.87, "male_masculine": 0.06, "female_feminine": 0.07, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1407908, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 3, "general": 33, "healthcare": 0, "history_law_government": 8, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 4, "news_current_affairs": 7, "technology_robotics": 7 } }, "users": 8560, "size": 38396801804, "checksum": "887e3191c4c9315cc16d8fb496c70eecd6ee5474b81a946ba8bf4b24bb834d52", "avgDurationSecs": 4.787, "validDurationSecs": 6486330.398, "totalHrs": 1872.39, "validHrs": 1801.75 }, "bg": { "buckets": { "dev": 2901, "test": 3318, "train": 4927, "validated": 11162, "invalidated": 775, "other": 1848 }, "duration": 75432564, "reportedSentences": 188, "validatedSentences": 39572, "unvalidatedSentences": 3, "clips": 13785, "splits": { "accent": {}, "age": { "": 0.37, "twenties": 0.11, "thirties": 0.28, "teens": 0, "fourties": 0.23, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male_masculine": 0.59, "female_feminine": 0.04, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 13785, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 142, "size": 444502531, "checksum": "72ce07c237caa9a83391b1770cd7c04193d83aed77d68b6590500bf0ebf9501f", "avgDurationSecs": 5.472, "validDurationSecs": 
61079.309, "totalHrs": 20.95, "validHrs": 16.96 }, "bn": { "buckets": { "dev": 9363, "test": 9363, "train": 21409, "validated": 44405, "invalidated": 7874, "other": 998064 }, "duration": 4588172352, "reportedSentences": 1890, "validatedSentences": 562771, "unvalidatedSentences": 94169, "clips": 1050343, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0.67, "thirties": 0.03, "teens": 0.07, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.22, "male_masculine": 0.54, "female_feminine": 0.23, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1050343, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 22958, "size": 26513562897, "checksum": "019d31ffe72d5f6dccc2d00e4fff59eb3d12854d023fc2ae5786afc0eb855df3", "avgDurationSecs": 4.368, "validDurationSecs": 193972.629, "totalHrs": 1274.49, "validHrs": 53.88 }, "br": { "buckets": { "dev": 2400, "test": 2407, "train": 2901, "validated": 29716, "invalidated": 1961, "other": 105 }, "duration": 101443687, "reportedSentences": 301, "validatedSentences": 8129, "unvalidatedSentences": 35, "clips": 31782, "splits": { "accent": {}, "age": { "": 0.28, "twenties": 0.31, "thirties": 0.09, "teens": 0.01, "fourties": 0.06, "fifties": 0.08, "sixties": 0.13, "seventies": 0.03, "eighties": 0, "nineties": 0 }, "gender": { "": 0.29, "male_masculine": 0.65, "female_feminine": 0.06, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 31600, "agriculture_food": 1, "automotive_transport": 0, "finance": 0, "service_retail": 3, "general": 159, "healthcare": 5, "history_law_government": 5, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 6, 
"news_current_affairs": 0, "technology_robotics": 3 } }, "users": 216, "size": 702645563, "checksum": "d55de8106570dc0a0a4ab51d764ed23127ddab8b0c7a3355890bf23dd4c16c10", "avgDurationSecs": 3.192, "validDurationSecs": 94849.305, "totalHrs": 28.17, "validHrs": 26.34 }, "ca": { "buckets": { "dev": 16413, "test": 16413, "train": 1200540, "validated": 2240433, "invalidated": 138496, "other": 227254 }, "duration": 13633444170, "reportedSentences": 7345, "validatedSentences": 1290554, "unvalidatedSentences": 712, "clips": 2606183, "splits": { "accent": {}, "age": { "": 0.27, "twenties": 0.04, "thirties": 0.05, "teens": 0.01, "fourties": 0.12, "fifties": 0.18, "sixties": 0.29, "seventies": 0.03, "eighties": 0, "nineties": 0 }, "gender": { "": 0.28, "male_masculine": 0.52, "female_feminine": 0.2, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0, "intersex": 0 }, "sentence_domain": { "": 2605186, "agriculture_food": 38, "automotive_transport": 24, "finance": 4, "service_retail": 62, "general": 579, "healthcare": 30, "history_law_government": 63, "language_fundamentals": 28, "media_entertainment": 31, "nature_environment": 110, "news_current_affairs": 31, "technology_robotics": 18 } }, "users": 36567, "size": 81994007088, "checksum": "95e4326db4f1900d1aeeb628dba0bfff9270b03ceecf6089f784df42a9b7a7c7", "avgDurationSecs": 5.231, "validDurationSecs": 11720135.624, "totalHrs": 3787.06, "validHrs": 3255.59 }, "ckb": { "buckets": { "dev": 5340, "test": 5340, "train": 7869, "validated": 119212, "invalidated": 9232, "other": 39258 }, "duration": 685979676, "reportedSentences": 2526, "validatedSentences": 19015, "unvalidatedSentences": 15448, "clips": 167702, "splits": { "accent": {}, "age": { "": 0.32, "twenties": 0.43, "thirties": 0.1, "teens": 0.02, "fourties": 0.02, "fifties": 0.01, "sixties": 0.1, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.32, "male_masculine": 0.63, "female_feminine": 0.05, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 
0 }, "sentence_domain": { "": 167480, "agriculture_food": 0, "automotive_transport": 10, "finance": 0, "service_retail": 0, "general": 110, "healthcare": 0, "history_law_government": 10, "language_fundamentals": 10, "media_entertainment": 82, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1904, "size": 3790908411, "checksum": "bcb94fe2c37bba8d9904e4ee2836f89c7754b30b3c4c73ddd99e9cfd09255b7d", "avgDurationSecs": 4.09, "validDurationSecs": 487632.879, "totalHrs": 190.54, "validHrs": 135.45 }, "cnh": { "buckets": { "dev": 763, "test": 765, "train": 818, "validated": 2463, "invalidated": 437, "other": 3287 }, "duration": 21751656, "reportedSentences": 9, "validatedSentences": 5218, "unvalidatedSentences": -1, "clips": 6187, "splits": { "accent": {}, "age": { "": 0.48, "twenties": 0.4, "thirties": 0.07, "teens": 0.01, "fourties": 0.01, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.48, "male_masculine": 0.3, "female_feminine": 0.21, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6187, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 300, "size": 168182322, "checksum": "de263297edb0be56224cb2b932826eaf71b3a1287c7ffa0ca3b42c34b20f63d6", "avgDurationSecs": 3.516, "validDurationSecs": 8659.177, "totalHrs": 6.04, "validHrs": 2.4 }, "cs": { "buckets": { "dev": 9337, "test": 9371, "train": 21533, "validated": 64072, "invalidated": 2510, "other": 149173 }, "duration": 962903667, "reportedSentences": 969, "validatedSentences": 451093, "unvalidatedSentences": 250, "clips": 215755, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.19, "thirties": 0.47, "teens": 0.02, "fourties": 0.07, "fifties": 0.01, 
"sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.24, "male_masculine": 0.54, "female_feminine": 0.22, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 215734, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 15, "healthcare": 0, "history_law_government": 1, "language_fundamentals": 0, "media_entertainment": 4, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 7 } }, "users": 1074, "size": 5926000616, "checksum": "35506b6a8db390de138e8dd71830ddf3a71ee102c033faa6d15bb0ab2fa58e0b", "avgDurationSecs": 4.463, "validDurationSecs": 285950.1, "totalHrs": 267.47, "validHrs": 79.43 }, "cv": { "buckets": { "dev": 1242, "test": 1288, "train": 1456, "validated": 17489, "invalidated": 2214, "other": 753 }, "duration": 103130544, "reportedSentences": 144, "validatedSentences": 4382, "unvalidatedSentences": 4, "clips": 20456, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.5, "thirties": 0.01, "teens": 0.18, "fourties": 0.06, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.22, "male_masculine": 0.51, "female_feminine": 0.27, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 20455, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 1 } }, "users": 122, "size": 697765872, "checksum": "da0aa4a1cd3f7fbd7f7affc70ea23087c22089852a224cbb12f1def4c01af7f9", "avgDurationSecs": 5.042, "validDurationSecs": 88172.179, "totalHrs": 28.64, "validHrs": 24.49 }, "cy": { "buckets": { "dev": 5403, "test": 5406, "train": 8014, "validated": 90758, "invalidated": 4462, "other": 20570 }, "duration": 568928061, "reportedSentences": 176, 
"validatedSentences": 118044, "unvalidatedSentences": 14, "clips": 115790, "splits": { "accent": {}, "age": { "": 0.42, "twenties": 0.13, "thirties": 0.09, "teens": 0.02, "fourties": 0.16, "fifties": 0.09, "sixties": 0.07, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.43, "male_masculine": 0.33, "female_feminine": 0.25, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 115790, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1909, "size": 4150668083, "checksum": "be3980a2ae1faf4854ff8fa039fef86b82765f64f26a9e078247f1e54a9157ad", "avgDurationSecs": 4.913, "validDurationSecs": 445934.649, "totalHrs": 158.03, "validHrs": 123.87 }, "da": { "buckets": { "dev": 2629, "test": 2750, "train": 3602, "validated": 11271, "invalidated": 437, "other": 1985 }, "duration": 55629648, "reportedSentences": 631, "validatedSentences": 20493, "unvalidatedSentences": 527, "clips": 13693, "splits": { "accent": {}, "age": { "": 0.39, "twenties": 0.16, "thirties": 0.22, "teens": 0.01, "fourties": 0.17, "fifties": 0.04, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.45, "male_masculine": 0.49, "female_feminine": 0.06, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 13672, "agriculture_food": 1, "automotive_transport": 0, "finance": 0, "service_retail": 1, "general": 11, "healthcare": 1, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 8, "nature_environment": 1, "news_current_affairs": 0, "technology_robotics": 1 } }, "users": 285, "size": 322530448, "checksum": "eb495dfdedaf84db5d8c631bdef854ecd9b1545e13558c6ab7269dc9df0c3949", "avgDurationSecs": 4.063, "validDurationSecs": 45789.948, 
"totalHrs": 15.45, "validHrs": 12.71 }, "dav": { "buckets": { "dev": 1275, "test": 1003, "train": 2098, "validated": 8259, "invalidated": 663, "other": 40708 }, "duration": 201431016, "reportedSentences": 11, "validatedSentences": 11773, "unvalidatedSentences": 20119, "clips": 49630, "splits": { "accent": {}, "age": { "": 0.04, "twenties": 0.61, "thirties": 0.1, "teens": 0, "fourties": 0.01, "fifties": 0.01, "sixties": 0.23, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.73, "male_masculine": 0, "female_feminine": 0.27, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 971, "agriculture_food": 499, "automotive_transport": 100, "finance": 304, "service_retail": 1218, "general": 45323, "healthcare": 967, "history_law_government": 334, "language_fundamentals": 150, "media_entertainment": 68, "nature_environment": 873, "news_current_affairs": 86, "technology_robotics": 75 } }, "users": 24, "size": 1084130696, "checksum": "6032f6de3c4da75162ec1118e99e589fe4b2d07814b7aaa7ef6c7b0ad7d160e4", "avgDurationSecs": 4.059, "validDurationSecs": 33520.426, "totalHrs": 55.95, "validHrs": 9.31 }, "de": { "buckets": { "dev": 16195, "test": 16195, "train": 604554, "validated": 932816, "invalidated": 55874, "other": 8208 }, "duration": 5227131581, "reportedSentences": 11232, "validatedSentences": 2057827, "unvalidatedSentences": 1328, "clips": 996898, "splits": { "accent": {}, "age": { "": 0.32, "twenties": 0.18, "thirties": 0.16, "teens": 0.03, "fourties": 0.17, "fifties": 0.11, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0.59, "female_feminine": 0.08, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 996710, "agriculture_food": 0, "automotive_transport": 5, "finance": 0, "service_retail": 11, "general": 133, "healthcare": 14, "history_law_government": 14, "language_fundamentals": 3, "media_entertainment": 1, "nature_environment": 
11, "news_current_affairs": 4, "technology_robotics": 9 } }, "users": 19438, "size": 36427381082, "checksum": "a9fa0c62e0135c7d034ed9ec3a50e6a63b0e866acbfb9da5e292f039f8f4e70a", "avgDurationSecs": 5.243, "validDurationSecs": 4891124.24, "totalHrs": 1451.98, "validHrs": 1358.64 }, "dv": { "buckets": { "dev": 2242, "test": 2218, "train": 2659, "validated": 26458, "invalidated": 1651, "other": 15123 }, "duration": 221364675, "reportedSentences": 59, "validatedSentences": 7172, "unvalidatedSentences": 9, "clips": 43232, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.11, "thirties": 0.39, "teens": 0.01, "fourties": 0.23, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.24, "male_masculine": 0.24, "female_feminine": 0.52, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 43231, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 1, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 387, "size": 1407847838, "checksum": "1169afebc94a80df739de0a09e90d8531713dda94726d46b117a8ecd4c9c4bb8", "avgDurationSecs": 5.12, "validDurationSecs": 135475.263, "totalHrs": 61.49, "validHrs": 37.63 }, "dyu": { "buckets": { "dev": 50, "test": 63, "train": 86, "validated": 207, "invalidated": 14, "other": 74 }, "duration": 1861236, "reportedSentences": 0, "validatedSentences": 5028, "unvalidatedSentences": 41, "clips": 295, "splits": { "accent": {}, "age": { "": 0.31, "twenties": 0.66, "thirties": 0, "teens": 0.02, "fourties": 0.02, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.31, "male_masculine": 0.39, "female_feminine": 0.3, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 295, "agriculture_food": 0, "automotive_transport": 0, 
"finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 35, "size": 10793129, "checksum": "ab6606b9336a0905f2b39e0bf1bb2dc2c4a21ac5434edea83629e82673555b6e", "avgDurationSecs": 6.309, "validDurationSecs": 1306.02, "totalHrs": 0.51, "validHrs": 0.36 }, "el": { "buckets": { "dev": 1710, "test": 1707, "train": 1917, "validated": 16645, "invalidated": 857, "other": 10263 }, "duration": 115194742, "reportedSentences": 70, "validatedSentences": 6845, "unvalidatedSentences": 4513, "clips": 27765, "splits": { "accent": {}, "age": { "": 0.31, "twenties": 0.12, "thirties": 0.36, "teens": 0.01, "fourties": 0.17, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0.63, "female_feminine": 0.03, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 27765, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 422, "size": 766306669, "checksum": "8d3f99a1caf3c647ecbff802b558568be0b2310eab9af39b6eff803f84d80835", "avgDurationSecs": 4.149, "validDurationSecs": 69058.76, "totalHrs": 31.99, "validHrs": 19.18 }, "en": { "buckets": { "dev": 16399, "test": 16399, "train": 1131718, "validated": 1845369, "invalidated": 304560, "other": 357220 }, "duration": 13210515979, "reportedSentences": 9384, "validatedSentences": 1678488, "unvalidatedSentences": 32223, "clips": 2507149, "splits": { "accent": {}, "age": { "": 0.36, "twenties": 0.25, "thirties": 0.14, "teens": 0.06, "fourties": 0.09, "fifties": 0.05, "sixties": 0.04, "seventies": 0.01, "eighties": 0, "nineties": 0 }, 
"gender": { "": 0.38, "male_masculine": 0.44, "female_feminine": 0.18, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2506316, "agriculture_food": 20, "automotive_transport": 6, "finance": 41, "service_retail": 25, "general": 517, "healthcare": 20, "history_law_government": 59, "language_fundamentals": 8, "media_entertainment": 80, "nature_environment": 40, "news_current_affairs": 8, "technology_robotics": 76 } }, "users": 95904, "size": 91907809533, "checksum": "9c368b384739ff7ae1a7ca67cc11cb475e877b2d4f10a27f2707c077d685d9f4", "avgDurationSecs": 5.269, "validDurationSecs": 9723505.329, "totalHrs": 3669.58, "validHrs": 2700.97 }, "eo": { "buckets": { "dev": 14897, "test": 14918, "train": 144307, "validated": 851677, "invalidated": 127487, "other": 156981 }, "duration": 6896446315, "reportedSentences": 2355, "validatedSentences": 180733, "unvalidatedSentences": 22291, "clips": 1136145, "splits": { "accent": {}, "age": { "": 0.2, "twenties": 0.56, "thirties": 0.12, "teens": 0.05, "fourties": 0.04, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0.01, "nineties": 0 }, "gender": { "": 0.2, "male_masculine": 0.68, "female_feminine": 0.11, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1136142, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 3, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1838, "size": 41218983982, "checksum": "c6cf0d4936053c58f0787d976ff6e529500a435bd08ea6f46d3155579e7b8cc9", "avgDurationSecs": 6.07, "validDurationSecs": 5169713.996, "totalHrs": 1915.67, "validHrs": 1436.03 }, "es": { "buckets": { "dev": 15888, "test": 15888, "train": 351294, "validated": 430259, "invalidated": 93722, "other": 1139310 }, "duration": 8107639614, "reportedSentences": 2622, 
"validatedSentences": 1081601, "unvalidatedSentences": 5527, "clips": 1663291, "splits": { "accent": {}, "age": { "": 0.13, "twenties": 0.53, "thirties": 0.09, "teens": 0.08, "fourties": 0.03, "fifties": 0.04, "sixties": 0.1, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.13, "male_masculine": 0.55, "female_feminine": 0.31, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1663186, "agriculture_food": 0, "automotive_transport": 4, "finance": 4, "service_retail": 2, "general": 31, "healthcare": 4, "history_law_government": 23, "language_fundamentals": 7, "media_entertainment": 5, "nature_environment": 11, "news_current_affairs": 13, "technology_robotics": 20 } }, "users": 26413, "size": 51129492189, "checksum": "81dabff368ec7292ecd34b6f09f4872802f6eed17077b67e6f09cfb425b1582d", "avgDurationSecs": 4.874, "validDurationSecs": 2097278.776, "totalHrs": 2252.12, "validHrs": 582.57 }, "et": { "buckets": { "dev": 2808, "test": 2808, "train": 3381, "validated": 26083, "invalidated": 7740, "other": 171 }, "duration": 229492874, "reportedSentences": 565, "validatedSentences": 10910, "unvalidatedSentences": 855, "clips": 33994, "splits": { "accent": {}, "age": { "": 0.18, "twenties": 0.7, "thirties": 0.08, "teens": 0, "fourties": 0.03, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.22, "male_masculine": 0.52, "female_feminine": 0.26, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 33464, "agriculture_food": 25, "automotive_transport": 27, "finance": 27, "service_retail": 5, "general": 338, "healthcare": 15, "history_law_government": 21, "language_fundamentals": 9, "media_entertainment": 17, "nature_environment": 75, "news_current_affairs": 8, "technology_robotics": 25 } }, "users": 999, "size": 1536616905, "checksum": "5957bb0b5fc83a381d78af1e2ba386e52805d6cddf32d9c3c8fdb7ba6425cf6a", "avgDurationSecs": 6.751, "validDurationSecs": 
176085.857, "totalHrs": 63.74, "validHrs": 48.91 }, "eu": { "buckets": { "dev": 14383, "test": 14383, "train": 106004, "validated": 241076, "invalidated": 43131, "other": 175864 }, "duration": 2531019567, "reportedSentences": 1265, "validatedSentences": 1017846, "unvalidatedSentences": 7379, "clips": 460071, "splits": { "accent": {}, "age": { "": 0.45, "twenties": 0.17, "thirties": 0.06, "teens": 0.01, "fourties": 0.16, "fifties": 0.11, "sixties": 0.04, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.46, "male_masculine": 0.25, "female_feminine": 0.29, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0, "intersex": 0 }, "sentence_domain": { "": 443660, "agriculture_food": 1, "automotive_transport": 6, "finance": 4, "service_retail": 7, "general": 16351, "healthcare": 22, "history_law_government": 0, "language_fundamentals": 2, "media_entertainment": 6, "nature_environment": 2, "news_current_affairs": 0, "technology_robotics": 23 } }, "users": 10885, "size": 15523864711, "checksum": "55263915d92928214e6a3fed4d662efbfbcc8f260b17e02d063241702437fba0", "avgDurationSecs": 5.501, "validDurationSecs": 1326247.629, "totalHrs": 703.06, "validHrs": 368.4 }, "fa": { "buckets": { "dev": 10668, "test": 10668, "train": 29729, "validated": 337669, "invalidated": 15195, "other": 33978 }, "duration": 1532418632, "reportedSentences": 2695, "validatedSentences": 57377, "unvalidatedSentences": 282475, "clips": 386842, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.33, "thirties": 0.35, "teens": 0.03, "fourties": 0.02, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.23, "male_masculine": 0.71, "female_feminine": 0.06, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 386822, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 20, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, 
"media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 4574, "size": 11074651003, "checksum": "1262bbb7a735d05463cf834323733f079115e212b37ab414e5719ba31517b7c2", "avgDurationSecs": 3.961, "validDurationSecs": 1337626.905, "totalHrs": 425.67, "validHrs": 371.56 }, "fi": { "buckets": { "dev": 1811, "test": 1794, "train": 2059, "validated": 11865, "invalidated": 408, "other": 5121 }, "duration": 80392179, "reportedSentences": 77, "validatedSentences": 5848, "unvalidatedSentences": 49164, "clips": 17394, "splits": { "accent": {}, "age": { "": 0.32, "twenties": 0.16, "thirties": 0.17, "teens": 0.01, "fourties": 0.31, "fifties": 0.03, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0.36, "female_feminine": 0.31, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 17394, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 236, "size": 480598412, "checksum": "0295ec92158874ff7318cb0f8aebecfb44ad260626c15718059d763c944968f6", "avgDurationSecs": 4.622, "validDurationSecs": 54838.059, "totalHrs": 22.33, "validHrs": 15.23 }, "fr": { "buckets": { "dev": 16182, "test": 16182, "train": 586763, "validated": 755819, "invalidated": 66717, "other": 17442 }, "duration": 4220888518, "reportedSentences": 7515, "validatedSentences": 1648434, "unvalidatedSentences": 44164, "clips": 839978, "splits": { "accent": {}, "age": { "": 0.38, "twenties": 0.17, "thirties": 0.15, "teens": 0.03, "fourties": 0.13, "fifties": 0.1, "sixties": 0.03, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.32, "male_masculine": 0.57, "female_feminine": 0.11, "transgender": 0, "non-binary": 0, 
"do_not_wish_to_say": 0 }, "sentence_domain": { "": 839889, "agriculture_food": 0, "automotive_transport": 1, "finance": 1, "service_retail": 0, "general": 52, "healthcare": 5, "history_law_government": 17, "language_fundamentals": 7, "media_entertainment": 16, "nature_environment": 8, "news_current_affairs": 2, "technology_robotics": 18 } }, "users": 20046, "size": 29594425711, "checksum": "bd362910a453ad08ffc6dd212cebe219841ef297ae88b118af305fa08ed71e55", "avgDurationSecs": 5.025, "validDurationSecs": 3797989.636, "totalHrs": 1172.46, "validHrs": 1054.99 }, "fy-NL": { "buckets": { "dev": 3173, "test": 3173, "train": 3922, "validated": 51537, "invalidated": 4014, "other": 102328 }, "duration": 766328211, "reportedSentences": 543, "validatedSentences": 10351, "unvalidatedSentences": 1, "clips": 157879, "splits": { "accent": {}, "age": { "": 0.56, "twenties": 0.04, "thirties": 0.1, "teens": 0, "fourties": 0.06, "fifties": 0.1, "sixties": 0.12, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.57, "male_masculine": 0.13, "female_feminine": 0.31, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 157879, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 2060, "size": 4594941239, "checksum": "64217b0db1411e2b1bfc504a1b50eb5c9e058d184fa97108a64450351bdd68e1", "avgDurationSecs": 4.854, "validDurationSecs": 250155.227, "totalHrs": 212.86, "validHrs": 69.48 }, "ga-IE": { "buckets": { "dev": 518, "test": 518, "train": 536, "validated": 6063, "invalidated": 966, "other": 3492 }, "duration": 37793223, "reportedSentences": 29, "validatedSentences": 1618, "unvalidatedSentences": 915, "clips": 10521, "splits": { "accent": {}, "age": { "": 0.38, "twenties": 0.25, "thirties": 0.24, "teens": 
0.01, "fourties": 0.06, "fifties": 0.05, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male_masculine": 0.49, "female_feminine": 0.14, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 10520, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 2, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 195, "size": 256136000, "checksum": "d1feaf2c198a58f75f8cde0485f610a787751f0477e5b5b4ad21df8789a4a59f", "avgDurationSecs": 3.592, "validDurationSecs": 21779.328, "totalHrs": 10.49, "validHrs": 6.04 }, "gl": { "buckets": { "dev": 13228, "test": 13257, "train": 65240, "validated": 93156, "invalidated": 9363, "other": 146193 }, "duration": 1250330904, "reportedSentences": 756, "validatedSentences": 696056, "unvalidatedSentences": 4767, "clips": 248712, "splits": { "accent": {}, "age": { "": 0.58, "twenties": 0.13, "thirties": 0.07, "teens": 0.01, "fourties": 0.09, "fifties": 0.1, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.66, "male_masculine": 0.14, "female_feminine": 0.2, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 248347, "agriculture_food": 28, "automotive_transport": 4, "finance": 38, "service_retail": 14, "general": 216, "healthcare": 1, "history_law_government": 43, "language_fundamentals": 7, "media_entertainment": 37, "nature_environment": 14, "news_current_affairs": 11, "technology_robotics": 3 } }, "users": 6153, "size": 7245076780, "checksum": "2aab68c0e17bc948017349cff57ee37296ab2ac45b4cbacf42c5854e6ae20bf6", "avgDurationSecs": 5.027, "validDurationSecs": 468316.067, "totalHrs": 347.31, "validHrs": 130.08 }, "gn": { "buckets": { "dev": 361, "test": 883, "train": 1593, "validated": 2979, "invalidated": 166, "other": 19359 }, 
"duration": 102495456, "reportedSentences": 36, "validatedSentences": 5894, "unvalidatedSentences": 293, "clips": 22504, "splits": { "accent": {}, "age": { "": 0.13, "twenties": 0.57, "thirties": 0.19, "teens": 0.02, "fourties": 0.08, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.14, "male_masculine": 0.38, "female_feminine": 0.48, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 22504, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 205, "size": 587952257, "checksum": "596ece1eb88a9066391fa13d511e47d487a049f6afe46b88aa423f7fc3d0c42f", "avgDurationSecs": 4.555, "validDurationSecs": 13567.986, "totalHrs": 28.47, "validHrs": 3.76 }, "ha": { "buckets": { "dev": 613, "test": 718, "train": 1912, "validated": 3389, "invalidated": 174, "other": 6660 }, "duration": 44485776, "reportedSentences": 34, "validatedSentences": 5445, "unvalidatedSentences": 244, "clips": 10223, "splits": { "accent": {}, "age": { "": 0.18, "twenties": 0.13, "thirties": 0.67, "teens": 0, "fourties": 0, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.17, "male_masculine": 0.59, "female_feminine": 0.24, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 10223, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 56, "size": 260699102, "checksum": "e1f107a81ce4b422566557c6bf2657b5625a94176544eb245fe03a7f33506771", "avgDurationSecs": 4.352, "validDurationSecs": 
14747.363, "totalHrs": 12.35, "validHrs": 4.09 }, "he": { "buckets": { "dev": 301, "test": 377, "train": 1230, "validated": 2342, "invalidated": 259, "other": 2675 }, "duration": 23926788, "reportedSentences": 33, "validatedSentences": 4104, "unvalidatedSentences": 6908, "clips": 5276, "splits": { "accent": {}, "age": { "": 0.06, "twenties": 0.73, "thirties": 0.16, "teens": 0, "fourties": 0.02, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0.03, "nineties": 0 }, "gender": { "": 0.07, "male_masculine": 0.92, "female_feminine": 0.02, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5276, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 48, "size": 139302236, "checksum": "46def9d7f7670c444ab6bf74436a037916fcbaaa219402c0ccafd1c581a70f69", "avgDurationSecs": 4.535, "validDurationSecs": 10621.027, "totalHrs": 6.64, "validHrs": 2.95 }, "hi": { "buckets": { "dev": 2698, "test": 3337, "train": 4865, "validated": 10979, "invalidated": 887, "other": 4393 }, "duration": 81454740, "reportedSentences": 146, "validatedSentences": 32204, "unvalidatedSentences": 9963, "clips": 16259, "splits": { "accent": {}, "age": { "": 0.33, "twenties": 0.32, "thirties": 0.24, "teens": 0.01, "fourties": 0.07, "fifties": 0.02, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male_masculine": 0.59, "female_feminine": 0.04, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 16259, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, 
"technology_robotics": 0 } }, "users": 454, "size": 483443153, "checksum": "6fd2ba711746ce502271a35fdabe0314716c8dd106465ca7395cd331813c6b0e", "avgDurationSecs": 5.01, "validDurationSecs": 55002.866, "totalHrs": 22.62, "validHrs": 15.27 }, "hsb": { "buckets": { "dev": 172, "test": 476, "train": 809, "validated": 1457, "invalidated": 247, "other": 204 }, "duration": 11915892, "reportedSentences": 130, "validatedSentences": 7177, "unvalidatedSentences": 30, "clips": 1908, "splits": { "accent": {}, "age": { "": 0.17, "twenties": 0.11, "thirties": 0.09, "teens": 0.07, "fourties": 0.48, "fifties": 0.03, "sixties": 0.02, "seventies": 0.03, "eighties": 0, "nineties": 0 }, "gender": { "": 0.21, "male_masculine": 0.78, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1908, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 24, "size": 90485413, "checksum": "0a39241105eb510fa552c3d3a949479726298994a7fecb5f17a44d1cf54f76d9", "avgDurationSecs": 6.245, "validDurationSecs": 9099.295, "totalHrs": 3.3, "validHrs": 2.52 }, "ht": { "buckets": { "dev": 0, "test": 5, "train": 11, "validated": 17, "invalidated": 0, "other": 13 }, "duration": 166968, "reportedSentences": 0, "validatedSentences": 76, "unvalidatedSentences": 1629, "clips": 30, "splits": { "accent": {}, "age": { "": 0.67, "twenties": 0, "thirties": 0.33, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 30, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, 
"history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 3, "size": 1091347, "checksum": "438e8bee117965d87dbfedd1c2d7411c79c85b594288b2963e8fb8352098b862", "avgDurationSecs": 5.566, "validDurationSecs": 94.615, "totalHrs": 0.04, "validHrs": 0.02 }, "hu": { "buckets": { "dev": 11608, "test": 11638, "train": 38753, "validated": 62457, "invalidated": 3244, "other": 50847 }, "duration": 645619753, "reportedSentences": 554, "validatedSentences": 326060, "unvalidatedSentences": 556, "clips": 116548, "splits": { "accent": {}, "age": { "": 0.28, "twenties": 0.11, "thirties": 0.13, "teens": 0.02, "fourties": 0.11, "fifties": 0.32, "sixties": 0.01, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male_masculine": 0.24, "female_feminine": 0.47, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 116543, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 2, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 3 } }, "users": 1656, "size": 3797167908, "checksum": "418125d0794a0022b2da0be531b101f45566b86b9ac166b1ef16c146eb88bf02", "avgDurationSecs": 5.54, "validDurationSecs": 345981.681, "totalHrs": 179.33, "validHrs": 96.1 }, "hy-AM": { "buckets": { "dev": 5856, "test": 5818, "train": 9300, "validated": 21188, "invalidated": 1221, "other": 15102 }, "duration": 202252752, "reportedSentences": 183, "validatedSentences": 237419, "unvalidatedSentences": 1338, "clips": 37511, "splits": { "accent": {}, "age": { "": 0.14, "twenties": 0.63, "thirties": 0.07, "teens": 0.11, "fourties": 0.04, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2, "male_masculine": 0.24, "female_feminine": 0.56, 
"transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 37509, "agriculture_food": 0, "automotive_transport": 0, "finance": 1, "service_retail": 0, "general": 1, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 436, "size": 1189829396, "checksum": "087eea26efb201b1f81bddb44bb01b23e5d28334c0f2ba1d2d291a3b563d3314", "avgDurationSecs": 5.392, "validDurationSecs": 114241.991, "totalHrs": 56.18, "validHrs": 31.73 }, "ia": { "buckets": { "dev": 1842, "test": 1841, "train": 4943, "validated": 11945, "invalidated": 346, "other": 2469 }, "duration": 62016204, "reportedSentences": 279, "validatedSentences": 9069, "unvalidatedSentences": 75, "clips": 14760, "splits": { "accent": {}, "age": { "": 0.38, "twenties": 0.05, "thirties": 0.02, "teens": 0, "fourties": 0.29, "fifties": 0.03, "sixties": 0, "seventies": 0.22, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male_masculine": 0.61, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 14760, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 72, "size": 420133884, "checksum": "3fafab7c04c5e83df42efb07703f8a75fd3eb90d3047b88b72f0ba5b59841128", "avgDurationSecs": 4.202, "validDurationSecs": 50188.588, "totalHrs": 17.22, "validHrs": 13.94 }, "id": { "buckets": { "dev": 3466, "test": 3687, "train": 4973, "validated": 29985, "invalidated": 2782, "other": 26516 }, "duration": 237294120, "reportedSentences": 354, "validatedSentences": 13314, "unvalidatedSentences": 11862, "clips": 59283, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.41, "thirties": 
0.1, "teens": 0.22, "fourties": 0.02, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male_masculine": 0.45, "female_feminine": 0.26, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0, "intersex": 0 }, "sentence_domain": { "": 59283, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 626, "size": 1492369934, "checksum": "9168779499ea6d1a14b6d56fefc53e5c5a2e896b2c27cdf18b0f3504d7f1aaf0", "avgDurationSecs": 4.003, "validDurationSecs": 120021.999, "totalHrs": 65.91, "validHrs": 33.33 }, "ig": { "buckets": { "dev": 3, "test": 5, "train": 9, "validated": 17, "invalidated": 3, "other": 5784 }, "duration": 31711896, "reportedSentences": 16, "validatedSentences": 14459, "unvalidatedSentences": 3858, "clips": 5804, "splits": { "accent": {}, "age": { "": 0.56, "twenties": 0.32, "thirties": 0.04, "teens": 0.05, "fourties": 0, "fifties": 0, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.56, "male_masculine": 0.14, "female_feminine": 0.3, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5804, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 120, "size": 186461753, "checksum": "dde8f480a74c870d3b86c230c01a39d8f6139f47c2d961b51a42d7f2a5d8b443", "avgDurationSecs": 5.464, "validDurationSecs": 92.885, "totalHrs": 8.8, "validHrs": 0.02 }, "is": { "buckets": { "dev": 9, "test": 9, "train": 17, "validated": 35, "invalidated": 5, "other": 10 }, "duration": 349704, "reportedSentences": 
0, "validatedSentences": 4993, "unvalidatedSentences": 203, "clips": 50, "splits": { "accent": {}, "age": { "": 0.7, "twenties": 0, "thirties": 0.2, "teens": 0, "fourties": 0.1, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.7, "male_masculine": 0.3, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 50, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 5, "size": 2441855, "checksum": "8793978aaf1756438f8750434d264297037244a030eef847ca97a96cb951f99c", "avgDurationSecs": 6.994, "validDurationSecs": 244.793, "totalHrs": 0.09, "validHrs": 0.06 }, "it": { "buckets": { "dev": 15175, "test": 15176, "train": 172469, "validated": 239445, "invalidated": 20367, "other": 15644 }, "duration": 1492934136, "reportedSentences": 5976, "validatedSentences": 924688, "unvalidatedSentences": 1303, "clips": 275456, "splits": { "accent": {}, "age": { "": 0.27, "twenties": 0.2, "thirties": 0.16, "teens": 0.01, "fourties": 0.13, "fifties": 0.14, "sixties": 0.08, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.28, "male_masculine": 0.62, "female_feminine": 0.1, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 275441, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 1, "general": 11, "healthcare": 1, "history_law_government": 5, "language_fundamentals": 3, "media_entertainment": 1, "nature_environment": 3, "news_current_affairs": 1, "technology_robotics": 0 } }, "users": 7249, "size": 10170331649, "checksum": "5b36aeebc1df11b438f22a8801ac10b55db59974db5274dc7c08128696590798", "avgDurationSecs": 5.42, "validDurationSecs": 1297759.403, "totalHrs": 414.7, "validHrs": 
360.48 }, "ja": { "buckets": { "dev": 8001, "test": 8001, "train": 15411, "validated": 177537, "invalidated": 31465, "other": 263944 }, "duration": 2147955225, "reportedSentences": 783, "validatedSentences": 37288, "unvalidatedSentences": 4389, "clips": 472946, "splits": { "accent": {}, "age": { "": 0.15, "twenties": 0.55, "thirties": 0.08, "teens": 0.08, "fourties": 0.09, "fifties": 0.04, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.28, "male_masculine": 0.33, "female_feminine": 0.39, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.01 }, "sentence_domain": { "": 436780, "agriculture_food": 5727, "automotive_transport": 1225, "finance": 1395, "service_retail": 385, "general": 16549, "healthcare": 1339, "history_law_government": 3192, "language_fundamentals": 2721, "media_entertainment": 4065, "nature_environment": 1749, "news_current_affairs": 724, "technology_robotics": 571 } }, "users": 5512, "size": 12643733140, "checksum": "66b1b0ce45457c67400b17f8180fbd4863e3a1953cb81af4a259c68773a8f1fa", "avgDurationSecs": 4.542, "validDurationSecs": 806310.925, "totalHrs": 596.65, "validHrs": 223.97 }, "ka": { "buckets": { "dev": 12966, "test": 12970, "train": 59908, "validated": 111306, "invalidated": 5971, "other": 95660 }, "duration": 1107114192, "reportedSentences": 1033, "validatedSentences": 132366, "unvalidatedSentences": 1928, "clips": 212937, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0.21, "thirties": 0.34, "teens": 0.11, "fourties": 0.06, "fifties": 0.04, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.28, "male_masculine": 0.18, "female_feminine": 0.54, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 212478, "agriculture_food": 7, "automotive_transport": 6, "finance": 3, "service_retail": 7, "general": 215, "healthcare": 8, "history_law_government": 168, "language_fundamentals": 25, "media_entertainment": 17, 
"nature_environment": 9, "news_current_affairs": 8, "technology_robotics": 3 } }, "users": 2500, "size": 6476991066, "checksum": "d58e6c3580ebd7ace4df740151d6307bc3b428b9d1f48ce9cbca3316647ce80d", "avgDurationSecs": 5.199, "validDurationSecs": 578708.502, "totalHrs": 307.53, "validHrs": 160.75 }, "kab": { "buckets": { "dev": 15003, "test": 15003, "train": 152426, "validated": 614153, "invalidated": 20024, "other": 112681 }, "duration": 2495557240, "reportedSentences": 9732, "validatedSentences": 182774, "unvalidatedSentences": 205402, "clips": 746858, "splits": { "accent": {}, "age": { "": 0.28, "twenties": 0.12, "thirties": 0.29, "teens": 0, "fourties": 0.09, "fifties": 0.18, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male_masculine": 0.53, "female_feminine": 0.2, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 746858, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1583, "size": 18635849513, "checksum": "244def9071ad445cf05960e4526fab495db796054265f2b81d70bf49326c5101", "avgDurationSecs": 3.341, "validDurationSecs": 2052135.701, "totalHrs": 693.21, "validHrs": 570.03 }, "kk": { "buckets": { "dev": 492, "test": 534, "train": 603, "validated": 1636, "invalidated": 246, "other": 735 }, "duration": 12879504, "reportedSentences": 42, "validatedSentences": 6204, "unvalidatedSentences": 264, "clips": 2617, "splits": { "accent": {}, "age": { "": 0.54, "twenties": 0.23, "thirties": 0.04, "teens": 0.06, "fourties": 0.08, "fifties": 0.05, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.56, "male_masculine": 0.33, "female_feminine": 0.11, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 
2617, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 178, "size": 73673057, "checksum": "d7aa1514f0220027c273918bf73f52ee483e5490accfd1af8e68e6fd611a7bbb", "avgDurationSecs": 4.921, "validDurationSecs": 8051.536, "totalHrs": 3.57, "validHrs": 2.23 }, "kln": { "buckets": { "dev": 6412, "test": 6075, "train": 11064, "validated": 32311, "invalidated": 537, "other": 37352 }, "duration": 330293556, "reportedSentences": 0, "validatedSentences": 29900, "unvalidatedSentences": 61, "clips": 70200, "splits": { "accent": {}, "age": { "": 0.02, "twenties": 0.59, "thirties": 0.05, "teens": 0.05, "fourties": 0.29, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.27, "male_masculine": 0.03, "female_feminine": 0.7, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 4, "agriculture_food": 5, "automotive_transport": 16, "finance": 32, "service_retail": 626, "general": 68952, "healthcare": 54, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 2, "nature_environment": 23, "news_current_affairs": 2, "technology_robotics": 484 } }, "users": 41, "size": 1800912369, "checksum": "436f8158ae9b69c4cc6daedcb2d28925fef65236b922cf093d5a50fe89236848", "avgDurationSecs": 4.705, "validDurationSecs": 152024.431, "totalHrs": 91.74, "validHrs": 42.22 }, "kmr": { "buckets": { "dev": 3973, "test": 3970, "train": 5264, "validated": 61326, "invalidated": 2585, "other": 25576 }, "duration": 366447348, "reportedSentences": 1058, "validatedSentences": 16806, "unvalidatedSentences": 1248, "clips": 89487, "splits": { "accent": {}, "age": { "": 0.48, "twenties": 0.35, "thirties": 0.09, "teens": 0.01, "fourties": 0.03, "fifties": 0.04, "sixties": 0, "seventies": 
0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.48, "male_masculine": 0.42, "female_feminine": 0.09, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 89479, "agriculture_food": 1, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 4, "healthcare": 0, "history_law_government": 2, "language_fundamentals": 1, "media_entertainment": 1, "nature_environment": 2, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 582, "size": 1944433113, "checksum": "86dbe0c5d140a15c87efea3161009cc5bb7865b92e35c8e7f2dd9e531ecfee34", "avgDurationSecs": 4.095, "validDurationSecs": 251128.656, "totalHrs": 101.79, "validHrs": 69.75 }, "ko": { "buckets": { "dev": 466, "test": 451, "train": 512, "validated": 1430, "invalidated": 373, "other": 3707 }, "duration": 29464740, "reportedSentences": 29, "validatedSentences": 7256, "unvalidatedSentences": 2618, "clips": 5510, "splits": { "accent": {}, "age": { "": 0.12, "twenties": 0.53, "thirties": 0.25, "teens": 0.03, "fourties": 0.05, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.34, "male_masculine": 0.4, "female_feminine": 0.26, "transgender": 0, "non-binary": 0.01, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5507, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 1, "general": 0, "healthcare": 0, "history_law_government": 1, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 1 } }, "users": 143, "size": 172412757, "checksum": "e6ab1764ec24c9c6e6f5334cbf4151a5c7b043848fa163414a806664baffbd8d", "avgDurationSecs": 5.348, "validDurationSecs": 7646.929, "totalHrs": 8.18, "validHrs": 2.12 }, "ky": { "buckets": { "dev": 1615, "test": 1615, "train": 1789, "validated": 30652, "invalidated": 5674, "other": 1505 }, "duration": 172250292, "reportedSentences": 39, "validatedSentences": 5045, "unvalidatedSentences": 
242, "clips": 37831, "splits": { "accent": {}, "age": { "": 0.07, "twenties": 0.65, "thirties": 0.09, "teens": 0.17, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.12, "male_masculine": 0.54, "female_feminine": 0.34, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 37831, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 294, "size": 1107842955, "checksum": "63f5dc543f120470eb721f745392716b240f7d30a2488cf26dac728e5a14c9bf", "avgDurationSecs": 4.553, "validDurationSecs": 139563.214, "totalHrs": 47.84, "validHrs": 38.76 }, "lg": { "buckets": { "dev": 13349, "test": 13377, "train": 71082, "validated": 271856, "invalidated": 39180, "other": 37056 }, "duration": 2013326487, "reportedSentences": 6451, "validatedSentences": 191409, "unvalidatedSentences": 173, "clips": 348092, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.38, "thirties": 0.22, "teens": 0.01, "fourties": 0.07, "fifties": 0.05, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.25, "male_masculine": 0.36, "female_feminine": 0.39, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 348092, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 660, "size": 11820261417, "checksum": "28d1bc133862304041206abce842d693a3600222610d4a735da4aadc125821ee", "avgDurationSecs": 5.784, "validDurationSecs": 1572385.707, "totalHrs": 559.25, "validHrs": 436.77 }, "lij": { 
"buckets": { "dev": 943, "test": 1344, "train": 2304, "validated": 4672, "invalidated": 584, "other": 0 }, "duration": 20616156, "reportedSentences": 378, "validatedSentences": 6079, "unvalidatedSentences": 50, "clips": 5256, "splits": { "accent": {}, "age": { "": 0.19, "twenties": 0.07, "thirties": 0.45, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0.17, "seventies": 0.11, "eighties": 0, "nineties": 0 }, "gender": { "": 0.27, "male_masculine": 0.45, "female_feminine": 0.28, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 4887, "agriculture_food": 10, "automotive_transport": 1, "finance": 0, "service_retail": 0, "general": 358, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 1, "nature_environment": 0, "news_current_affairs": 1, "technology_robotics": 0 } }, "users": 38, "size": 114798448, "checksum": "6d53c9067bace3015326f87def7a9601b9f89993a88b1ce7562ad6cd45707eb7", "avgDurationSecs": 3.922, "validDurationSecs": 18325.472, "totalHrs": 5.72, "validHrs": 5.09 }, "lo": { "buckets": { "dev": 28, "test": 23, "train": 85, "validated": 136, "invalidated": 9, "other": 73 }, "duration": 1414764, "reportedSentences": 0, "validatedSentences": 5145, "unvalidatedSentences": 5925, "clips": 218, "splits": { "accent": {}, "age": { "": 0.11, "twenties": 0.31, "thirties": 0.57, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.11, "male_masculine": 0.89, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 218, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 15, "size": 9097460, "checksum": 
"92f60adc5c3cd095a119122da7794f94a0b03c71db4d098467f73c4696907096", "avgDurationSecs": 6.49, "validDurationSecs": 882.605, "totalHrs": 0.39, "validHrs": 0.24 }, "lt": { "buckets": { "dev": 5236, "test": 5236, "train": 7827, "validated": 18501, "invalidated": 878, "other": 3155 }, "duration": 115861256, "reportedSentences": 192, "validatedSentences": 129301, "unvalidatedSentences": 4817, "clips": 22534, "splits": { "accent": {}, "age": { "": 0.17, "twenties": 0.28, "thirties": 0.25, "teens": 0.03, "fourties": 0.23, "fifties": 0.04, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male_masculine": 0.45, "female_feminine": 0.18, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 22534, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 317, "size": 710446536, "checksum": "f055e91109653fe4f9dd1ef4f31c3f0bea6c13f090fc14616ed573a139e26616", "avgDurationSecs": 5.142, "validDurationSecs": 95125.104, "totalHrs": 32.18, "validHrs": 26.42 }, "ltg": { "buckets": { "dev": 3402, "test": 3510, "train": 4576, "validated": 21429, "invalidated": 1829, "other": 74 }, "duration": 108388512, "reportedSentences": 68, "validatedSentences": 11634, "unvalidatedSentences": 4182, "clips": 23332, "splits": { "accent": {}, "age": { "": 0.32, "twenties": 0.09, "thirties": 0.24, "teens": 0, "fourties": 0.34, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.32, "male_masculine": 0.05, "female_feminine": 0.62, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 19142, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 4190, "healthcare": 0, 
"history_law_government": 459, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 353, "size": 623993390, "checksum": "d2eced2c7b061333eef99433d388ad463833885e6a59af89d4b0767d2dff0905", "avgDurationSecs": 4.645, "validDurationSecs": 99548.149, "totalHrs": 30.1, "validHrs": 27.65 }, "luo": { "buckets": { "dev": 3060, "test": 3000, "train": 4498, "validated": 20342, "invalidated": 792, "other": 61592 }, "duration": 430567524, "reportedSentences": 8, "validatedSentences": 14692, "unvalidatedSentences": 13870, "clips": 82726, "splits": { "accent": {}, "age": { "": 0.36, "twenties": 0.33, "thirties": 0.28, "teens": 0, "fourties": 0, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.6, "male_masculine": 0, "female_feminine": 0.4, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 11, "agriculture_food": 1135, "automotive_transport": 341, "finance": 967, "service_retail": 2747, "general": 69479, "healthcare": 2236, "history_law_government": 864, "language_fundamentals": 555, "media_entertainment": 806, "nature_environment": 634, "news_current_affairs": 2710, "technology_robotics": 241 } }, "users": 45, "size": 2391875128, "checksum": "98d3aa30ebd196fcc73442a504f86219cacf698e2ab69e1668ed6cae7f46396d", "avgDurationSecs": 5.205, "validDurationSecs": 105874.871, "totalHrs": 119.6, "validHrs": 29.4 }, "lv": { "buckets": { "dev": 7656, "test": 7656, "train": 14180, "validated": 195581, "invalidated": 11389, "other": 20604 }, "duration": 1088840969, "reportedSentences": 3139, "validatedSentences": 33815, "unvalidatedSentences": 65568, "clips": 227574, "splits": { "accent": {}, "age": { "": 0.44, "twenties": 0.06, "thirties": 0.29, "teens": 0.01, "fourties": 0.09, "fifties": 0.08, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.45, "male_masculine": 0.25, 
"female_feminine": 0.3, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 226844, "agriculture_food": 6, "automotive_transport": 3, "finance": 2, "service_retail": 12, "general": 610, "healthcare": 29, "history_law_government": 8, "language_fundamentals": 9, "media_entertainment": 16, "nature_environment": 36, "news_current_affairs": 5, "technology_robotics": 11 } }, "users": 6352, "size": 6098874632, "checksum": "ce64ede5482162c1055839412b4c2749885541629b57373261ff6ba661fa9692", "avgDurationSecs": 4.785, "validDurationSecs": 935768.61, "totalHrs": 302.45, "validHrs": 259.93 }, "mdf": { "buckets": { "dev": 54, "test": 107, "train": 175, "validated": 336, "invalidated": 9, "other": 0 }, "duration": 1811340, "reportedSentences": 15, "validatedSentences": 6039, "unvalidatedSentences": 4, "clips": 345, "splits": { "accent": {}, "age": { "": 0.58, "twenties": 0.02, "thirties": 0, "teens": 0, "fourties": 0.34, "fifties": 0, "sixties": 0.06, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.58, "male_masculine": 0.08, "female_feminine": 0.34, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 345, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 11, "size": 11056899, "checksum": "a50c266e83e671ac05226d1db4ca2388a8bdabcaa9fbce22df8803369b79eada", "avgDurationSecs": 5.25, "validDurationSecs": 1764.088, "totalHrs": 0.5, "validHrs": 0.49 }, "mhr": { "buckets": { "dev": 14663, "test": 15135, "train": 186565, "validated": 218727, "invalidated": 16025, "other": 19 }, "duration": 1085385744, "reportedSentences": 102, "validatedSentences": 328269, "unvalidatedSentences": 3, "clips": 234771, "splits": { "accent": {}, "age": { "": 0.12, "twenties": 0.33, 
"thirties": 0.21, "teens": 0.14, "fourties": 0.12, "fifties": 0.04, "sixties": 0.03, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.12, "male_masculine": 0.19, "female_feminine": 0.69, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 234771, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 499, "size": 6124190382, "checksum": "1ff9621cb4301aa75f89ccc14740fe6aa2ec1cdfcfa95693d75b407c99e9803c", "avgDurationSecs": 4.623, "validDurationSecs": 1011211.639, "totalHrs": 301.49, "validHrs": 280.89 }, "mk": { "buckets": { "dev": 1762, "test": 1762, "train": 2006, "validated": 15183, "invalidated": 890, "other": 24670 }, "duration": 195803388, "reportedSentences": 161, "validatedSentences": 5976, "unvalidatedSentences": 712, "clips": 40743, "splits": { "accent": {}, "age": { "": 0.1, "twenties": 0.35, "thirties": 0.27, "teens": 0.03, "fourties": 0.2, "fifties": 0.02, "sixties": 0.03, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0.28, "female_feminine": 0.39, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 40482, "agriculture_food": 14, "automotive_transport": 0, "finance": 7, "service_retail": 0, "general": 207, "healthcare": 26, "history_law_government": 7, "language_fundamentals": 0, "media_entertainment": 7, "nature_environment": 21, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 457, "size": 1074132812, "checksum": "b1f223833f1131156d0e5c5bdc94dd689a17db38f5cbfeb2cf75d6c7db207fd2", "avgDurationSecs": 4.806, "validDurationSecs": 72966.714, "totalHrs": 54.38, "validHrs": 20.26 }, "ml": { "buckets": { "dev": 917, "test": 858, "train": 1235, "validated": 3404, "invalidated": 119, 
"other": 5968 }, "duration": 40032576, "reportedSentences": 243, "validatedSentences": 5485, "unvalidatedSentences": 4728, "clips": 9491, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0.59, "thirties": 0.08, "teens": 0, "fourties": 0.11, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0.48, "female_feminine": 0.19, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 9491, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 147, "size": 226964878, "checksum": "276e30b9dfb35451692bc1f21264b5532e6986073f49e850b1d9fc6183ffcacd", "avgDurationSecs": 4.218, "validDurationSecs": 14357.906, "totalHrs": 11.12, "validHrs": 3.98 }, "mn": { "buckets": { "dev": 1934, "test": 1934, "train": 2190, "validated": 33591, "invalidated": 3221, "other": 59454 }, "duration": 505751012, "reportedSentences": 46, "validatedSentences": 6091, "unvalidatedSentences": 16, "clips": 96266, "splits": { "accent": {}, "age": { "": 0.04, "twenties": 0.92, "thirties": 0.03, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0.01 }, "gender": { "": 0.46, "male_masculine": 0.14, "female_feminine": 0.4, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 96238, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 22, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 6, "technology_robotics": 0 } }, "users": 588, "size": 3076519286, "checksum": "5d62ff9dc45773179bc41a1990b57aa5ee5b04be7976411b24edaa0194b6306c", "avgDurationSecs": 5.254, 
"validDurationSecs": 176476.453, "totalHrs": 140.48, "validHrs": 49.02 }, "mr": { "buckets": { "dev": 1767, "test": 1793, "train": 2191, "validated": 10987, "invalidated": 2287, "other": 2785 }, "duration": 99542232, "reportedSentences": 59, "validatedSentences": 6715, "unvalidatedSentences": 12104, "clips": 16059, "splits": { "accent": {}, "age": { "": 0.05, "twenties": 0.29, "thirties": 0.13, "teens": 0.53, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.05, "male_masculine": 0.19, "female_feminine": 0.75, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 16059, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 100, "size": 585206123, "checksum": "5d3d8e70a49e616e5da65046a9ed246a0a42caa4f6c4255dc6fa91fe22d424e0", "avgDurationSecs": 6.199, "validDurationSecs": 68103.276, "totalHrs": 27.65, "validHrs": 18.91 }, "mrj": { "buckets": { "dev": 7314, "test": 7157, "train": 14325, "validated": 28966, "invalidated": 1261, "other": 1466 }, "duration": 132879852, "reportedSentences": 50, "validatedSentences": 54143, "unvalidatedSentences": 4, "clips": 31693, "splits": { "accent": {}, "age": { "": 0.1, "twenties": 0.35, "thirties": 0.22, "teens": 0.06, "fourties": 0.12, "fifties": 0.11, "sixties": 0.04, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.13, "male_masculine": 0.25, "female_feminine": 0.62, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 31693, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, 
"news_current_affairs": 0, "technology_robotics": 0 } }, "users": 60, "size": 747254932, "checksum": "170b733d3e33865389fb02615aa5b13d9afd5969d7143d8d6d539c631280a1ac", "avgDurationSecs": 4.193, "validDurationSecs": 121446.307, "totalHrs": 36.91, "validHrs": 33.73 }, "mt": { "buckets": { "dev": 1625, "test": 1657, "train": 1912, "validated": 6583, "invalidated": 326, "other": 6298 }, "duration": 62787780, "reportedSentences": 10, "validatedSentences": 5273, "unvalidatedSentences": 11773, "clips": 13207, "splits": { "accent": {}, "age": { "": 0.27, "twenties": 0.19, "thirties": 0.09, "teens": 0.03, "fourties": 0.17, "fifties": 0.25, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.28, "male_masculine": 0.25, "female_feminine": 0.47, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 13207, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 222, "size": 466476952, "checksum": "1e66e25bac431f8b90b9fecaa948e990615ef17f164f954f68eb1eddf64b7451", "avgDurationSecs": 4.754, "validDurationSecs": 31296.43, "totalHrs": 17.44, "validHrs": 8.69 }, "myv": { "buckets": { "dev": 239, "test": 478, "train": 1241, "validated": 1968, "invalidated": 26, "other": 1 }, "duration": 11519388, "reportedSentences": 31, "validatedSentences": 5066, "unvalidatedSentences": 7, "clips": 1995, "splits": { "accent": {}, "age": { "": 0.38, "twenties": 0.09, "thirties": 0.24, "teens": 0.01, "fourties": 0.02, "fifties": 0, "sixties": 0.26, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male_masculine": 0.54, "female_feminine": 0.07, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1995, "agriculture_food": 0, "automotive_transport": 0, 
"finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 13, "size": 67700633, "checksum": "05efbcabe6947a53a85fe5dc2afa85d6561dcda01f231f8b16c6595f4d009b53", "avgDurationSecs": 5.774, "validDurationSecs": 11363.487, "totalHrs": 3.19, "validHrs": 3.15 }, "nan-tw": { "buckets": { "dev": 5809, "test": 6154, "train": 10642, "validated": 26535, "invalidated": 759, "other": 3945 }, "duration": 82564308, "reportedSentences": 226, "validatedSentences": 24805, "unvalidatedSentences": 76, "clips": 31239, "splits": { "accent": {}, "age": { "": 0.13, "twenties": 0.51, "thirties": 0.15, "teens": 0.01, "fourties": 0.16, "fifties": 0.01, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.27, "male_masculine": 0.62, "female_feminine": 0.11, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 31155, "agriculture_food": 4, "automotive_transport": 0, "finance": 0, "service_retail": 1, "general": 25, "healthcare": 1, "history_law_government": 0, "language_fundamentals": 54, "media_entertainment": 0, "nature_environment": 54, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 271, "size": 469935589, "checksum": "5e28e258cb9c062cc55c4191f92264597aace19c2cc146adfcd2967e7a3e491f", "avgDurationSecs": 2.643, "validDurationSecs": 70131.692, "totalHrs": 22.93, "validHrs": 19.48 }, "nb-NO": { "buckets": { "dev": 24, "test": 25, "train": 63, "validated": 112, "invalidated": 5, "other": 208 }, "duration": 1339488, "reportedSentences": 0, "validatedSentences": 4343, "unvalidatedSentences": 3722, "clips": 325, "splits": { "accent": {}, "age": { "": 0.2, "twenties": 0, "thirties": 0.8, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.28, "male_masculine": 0.72, 
"female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 140, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 182, "healthcare": 0, "history_law_government": 2, "language_fundamentals": 1, "media_entertainment": 0, "nature_environment": 2, "news_current_affairs": 1, "technology_robotics": 0 } }, "users": 8, "size": 8235502, "checksum": "4ba27bf01abbb41a7edeea207bd546749e7f0b72e87a7633f3617851cba708d9", "avgDurationSecs": 4.122, "validDurationSecs": 461.608, "totalHrs": 0.37, "validHrs": 0.12 }, "ne-NP": { "buckets": { "dev": 307, "test": 272, "train": 365, "validated": 1154, "invalidated": 99, "other": 350 }, "duration": 6495948, "reportedSentences": 5, "validatedSentences": 1311, "unvalidatedSentences": 35, "clips": 1603, "splits": { "accent": {}, "age": { "": 0.27, "twenties": 0.6, "thirties": 0.13, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.35, "male_masculine": 0.64, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1603, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 50, "size": 38107384, "checksum": "ada7bab888865185bc8724522c5bd513d0f5e2dce943b3e6f124e8562d125191", "avgDurationSecs": 4.052, "validDurationSecs": 4676.434, "totalHrs": 1.8, "validHrs": 1.29 }, "nhi": { "buckets": { "dev": 10, "test": 5, "train": 22, "validated": 37, "invalidated": 0, "other": 365 }, "duration": 2035908, "reportedSentences": 0, "validatedSentences": 758, "unvalidatedSentences": 9, "clips": 402, "splits": { "accent": {}, "age": { "": 0.14, "twenties": 0.78, "thirties": 0.09, "teens": 0, "fourties": 0, 
"fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.14, "male_masculine": 0.09, "female_feminine": 0.78, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 402, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 4, "size": 12027094, "checksum": "1f8a9fd928cfb3ec16dcd55a46fa6033875ab91ce9d1c132cf63152b303824b8", "avgDurationSecs": 5.064, "validDurationSecs": 187.385, "totalHrs": 0.56, "validHrs": 0.05 }, "nl": { "buckets": { "dev": 11571, "test": 11571, "train": 37984, "validated": 94175, "invalidated": 5713, "other": 2775 }, "duration": 448800373, "reportedSentences": 349, "validatedSentences": 259173, "unvalidatedSentences": 703, "clips": 102663, "splits": { "accent": {}, "age": { "": 0.4, "twenties": 0.2, "thirties": 0.13, "teens": 0.02, "fourties": 0.15, "fifties": 0.08, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.41, "male_masculine": 0.48, "female_feminine": 0.11, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 102604, "agriculture_food": 1, "automotive_transport": 10, "finance": 0, "service_retail": 0, "general": 5, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 34, "media_entertainment": 0, "nature_environment": 10, "news_current_affairs": 0, "technology_robotics": 2 } }, "users": 1796, "size": 3048281117, "checksum": "8d21a9809f783156e43c6c1815f25a0ff6b637125bc44a144cad81c657a08fbd", "avgDurationSecs": 4.372, "validDurationSecs": 411694.331, "totalHrs": 124.66, "validHrs": 114.35 }, "nn-NO": { "buckets": { "dev": 405, "test": 412, "train": 458, "validated": 1280, "invalidated": 67, "other": 32 }, "duration": 6050232, "reportedSentences": 23, 
"validatedSentences": 5318, "unvalidatedSentences": 6, "clips": 1379, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.28, "thirties": 0.4, "teens": 0.02, "fourties": 0.03, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.29, "male_masculine": 0.58, "female_feminine": 0.13, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1379, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 40, "size": 35178707, "checksum": "1ae55f50522ff0e0f02bcc309f85daa4fe85acc683117f8becf47d2068f8f1cc", "avgDurationSecs": 4.387, "validDurationSecs": 5615.879, "totalHrs": 1.68, "validHrs": 1.55 }, "nr": { "buckets": { "dev": 0, "test": 0, "train": 0, "validated": 0, "invalidated": 0, "other": 2 }, "duration": 19476, "reportedSentences": 0, "validatedSentences": 319, "unvalidatedSentences": 7468, "clips": 2, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1, "size": 637956, "checksum": "f6b8794d89584a349ef3ddb991aa53784d551a553982adff8ffca6895d602fac", "avgDurationSecs": 9.738, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "nso": { "buckets": { "dev": 0, "test": 0, "train": 2, "validated": 2, 
"invalidated": 0, "other": 18 }, "duration": 92592, "reportedSentences": 0, "validatedSentences": 2244, "unvalidatedSentences": 14227, "clips": 20, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 20, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1, "size": 1552766, "checksum": "92078343a6414dc5f7821f9ac27d86f524eebc05ff46a938fc0a472ba92c74b8", "avgDurationSecs": 4.63, "validDurationSecs": 9.259, "totalHrs": 0.02, "validHrs": 0 }, "oc": { "buckets": { "dev": 266, "test": 274, "train": 305, "validated": 1963, "invalidated": 201, "other": 7711 }, "duration": 48105396, "reportedSentences": 13, "validatedSentences": 4934, "unvalidatedSentences": 14, "clips": 9875, "splits": { "accent": {}, "age": { "": 0.5, "twenties": 0.02, "thirties": 0.04, "teens": 0.01, "fourties": 0.17, "fifties": 0.16, "sixties": 0.11, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.52, "male_masculine": 0.08, "female_feminine": 0.4, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 9875, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 147, "size": 273807730, "checksum": "9e6f01a51bdcda57894b22e1ef9b4f98f1ea0a78ef4cd90d8e666a8b7c2f30bd", "avgDurationSecs": 4.871, "validDurationSecs": 9562.622, 
"totalHrs": 13.36, "validHrs": 2.65 }, "or": { "buckets": { "dev": 630, "test": 474, "train": 2149, "validated": 4041, "invalidated": 432, "other": 19480 }, "duration": 133425492, "reportedSentences": 100, "validatedSentences": 6195, "unvalidatedSentences": 89, "clips": 23953, "splits": { "accent": {}, "age": { "": 0.06, "twenties": 0.66, "thirties": 0.27, "teens": 0, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.42, "male_masculine": 0.34, "female_feminine": 0.03, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.21 }, "sentence_domain": { "": 23946, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 7, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 159, "size": 825597647, "checksum": "ff39a23ce6c1fc2f941f04ecc116e98148d2297fb743a16f2956237747b27a43", "avgDurationSecs": 5.57, "validDurationSecs": 22509.599, "totalHrs": 37.06, "validHrs": 6.25 }, "os": { "buckets": { "dev": 84, "test": 130, "train": 347, "validated": 573, "invalidated": 52, "other": 167 }, "duration": 4349376, "reportedSentences": 3, "validatedSentences": 1665, "unvalidatedSentences": 224, "clips": 792, "splits": { "accent": {}, "age": { "": 0.27, "twenties": 0.11, "thirties": 0.41, "teens": 0.03, "fourties": 0.19, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.79, "male_masculine": 0.21, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 792, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 35, 
"size": 25410744, "checksum": "90fdbf08ee5d8145f2dae6b0ac33ec2eadb5a2b6847781e0710cd57fae401631", "avgDurationSecs": 5.492, "validDurationSecs": 3146.708, "totalHrs": 1.2, "validHrs": 0.87 }, "pa-IN": { "buckets": { "dev": 406, "test": 579, "train": 796, "validated": 1781, "invalidated": 98, "other": 1257 }, "duration": 15117506, "reportedSentences": 251, "validatedSentences": 14823, "unvalidatedSentences": 22, "clips": 3136, "splits": { "accent": {}, "age": { "": 0.28, "twenties": 0.22, "thirties": 0.41, "teens": 0, "fourties": 0.04, "fifties": 0.04, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0.66, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 3135, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 1, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 71, "size": 106437637, "checksum": "bc47a95e637e48a2d3e79b1bec84d61dcbb0831bc28ba74d208bee3ab5621d40", "avgDurationSecs": 4.821, "validDurationSecs": 8585.548, "totalHrs": 4.19, "validHrs": 2.38 }, "pl": { "buckets": { "dev": 9775, "test": 9775, "train": 23697, "validated": 136729, "invalidated": 6870, "other": 2578 }, "duration": 666885530, "reportedSentences": 595, "validatedSentences": 229777, "unvalidatedSentences": 11298, "clips": 146177, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.27, "thirties": 0.34, "teens": 0.02, "fourties": 0.11, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0.01 }, "gender": { "": 0.28, "male_masculine": 0.58, "female_feminine": 0.14, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 146172, "agriculture_food": 0, "automotive_transport": 0, "finance": 2, "service_retail": 0, "general": 3, "healthcare": 0, 
"history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 3397, "size": 4727408160, "checksum": "53d09d99a5579c550f4566384addd607f2a54a06f3047d18aa53d2050a5ab073", "avgDurationSecs": 4.562, "validDurationSecs": 623782.07, "totalHrs": 185.24, "validHrs": 173.27 }, "ps": { "buckets": { "dev": 3410, "test": 3410, "train": 4292, "validated": 57763, "invalidated": 4870, "other": 37348 }, "duration": 494395596, "reportedSentences": 1123, "validatedSentences": 22750, "unvalidatedSentences": 10395, "clips": 99981, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0.37, "thirties": 0.16, "teens": 0.07, "fourties": 0.14, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.98, "male_masculine": 0.02, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 58861, "agriculture_food": 780, "automotive_transport": 308, "finance": 107, "service_retail": 438, "general": 35413, "healthcare": 695, "history_law_government": 2095, "language_fundamentals": 1942, "media_entertainment": 1179, "nature_environment": 859, "news_current_affairs": 736, "technology_robotics": 457 } }, "users": 1435, "size": 2737822537, "checksum": "0d5c73befca334a128d6a833d31c453e28e9d8a140b2eacb4a2fa6e851840db5", "avgDurationSecs": 4.945, "validDurationSecs": 285631.998, "totalHrs": 137.33, "validHrs": 79.34 }, "pt": { "buckets": { "dev": 9604, "test": 9606, "train": 22729, "validated": 157808, "invalidated": 7630, "other": 25188 }, "duration": 791797363, "reportedSentences": 2841, "validatedSentences": 43437, "unvalidatedSentences": 151, "clips": 190626, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.37, "thirties": 0.18, "teens": 0.02, "fourties": 0.13, "fifties": 0.03, "sixties": 0.03, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.27, "male_masculine": 0.67, 
"female_feminine": 0.06, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 189705, "agriculture_food": 26, "automotive_transport": 33, "finance": 14, "service_retail": 14, "general": 685, "healthcare": 46, "history_law_government": 49, "language_fundamentals": 16, "media_entertainment": 45, "nature_environment": 23, "news_current_affairs": 2, "technology_robotics": 77 } }, "users": 3668, "size": 4974448476, "checksum": "a3d14e8524c8be0028ad6bc1de792db66c1f2d6559fee8187bc68bdf08a421d2", "avgDurationSecs": 4.154, "validDurationSecs": 655482.244, "totalHrs": 219.94, "validHrs": 182.07 }, "quy": { "buckets": { "dev": 0, "test": 7, "train": 25, "validated": 32, "invalidated": 7, "other": 1261 }, "duration": 11227752, "reportedSentences": 0, "validatedSentences": 755, "unvalidatedSentences": 285, "clips": 1300, "splits": { "accent": {}, "age": { "": 0.02, "twenties": 0.71, "thirties": 0, "teens": 0, "fourties": 0.02, "fifties": 0, "sixties": 0.25, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.82, "male_masculine": 0, "female_feminine": 0.18, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1300, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 8, "size": 65533532, "checksum": "208c246ef361082bf1aeac25622ae00626c97710aa1a5396568e557563f449aa", "avgDurationSecs": 8.637, "validDurationSecs": 276.375, "totalHrs": 3.11, "validHrs": 0.07 }, "rm-sursilv": { "buckets": { "dev": 1365, "test": 1387, "train": 1589, "validated": 4356, "invalidated": 684, "other": 2464 }, "duration": 40806869, "reportedSentences": 17, "validatedSentences": 8780, "unvalidatedSentences": 7455, "clips": 7504, "splits": { "accent": {}, "age": { "": 0.61, "twenties": 0.09, 
"thirties": 0.07, "teens": 0.05, "fourties": 0.18, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.65, "male_masculine": 0.17, "female_feminine": 0.18, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7504, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 91, "size": 306276924, "checksum": "b8310d3b8fc425cceea4a179c3c89f1a8d71b226a50c738d1a0e13a40dbd6b86", "avgDurationSecs": 5.438, "validDurationSecs": 23687.996, "totalHrs": 11.33, "validHrs": 6.57 }, "rm-vallader": { "buckets": { "dev": 500, "test": 462, "train": 557, "validated": 1526, "invalidated": 394, "other": 711 }, "duration": 15327554, "reportedSentences": 38, "validatedSentences": 5913, "unvalidatedSentences": 1172, "clips": 2631, "splits": { "accent": {}, "age": { "": 0.36, "twenties": 0.14, "thirties": 0.06, "teens": 0, "fourties": 0.41, "fifties": 0, "sixties": 0.03, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male_masculine": 0.44, "female_feminine": 0.19, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2631, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 53, "size": 117352112, "checksum": "6bbb0306405f33ac09614a5c7166f5a314b12b63a660af0df6e08cafa303109f", "avgDurationSecs": 5.826, "validDurationSecs": 8890.098, "totalHrs": 4.25, "validHrs": 2.46 }, "ro": { "buckets": { "dev": 3924, "test": 3929, "train": 5168, "validated": 19342, "invalidated": 1045, "other": 22961 }, 
"duration": 175014391, "reportedSentences": 481, "validatedSentences": 13749, "unvalidatedSentences": 89195, "clips": 43348, "splits": { "accent": {}, "age": { "": 0.14, "twenties": 0.6, "thirties": 0.13, "teens": 0.01, "fourties": 0.09, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.17, "male_masculine": 0.7, "female_feminine": 0.13, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 43348, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 443, "size": 1090315851, "checksum": "e6b80fc9b80a798e908d6715d13475de552b3532c618e581c9e449de1b908260", "avgDurationSecs": 4.037, "validDurationSecs": 78091.915, "totalHrs": 48.61, "validHrs": 21.69 }, "ru": { "buckets": { "dev": 10229, "test": 10229, "train": 26564, "validated": 170004, "invalidated": 10424, "other": 17821 }, "duration": 1026001908, "reportedSentences": 561, "validatedSentences": 47635, "unvalidatedSentences": 390, "clips": 198249, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.33, "thirties": 0.18, "teens": 0.09, "fourties": 0.14, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.25, "male_masculine": 0.6, "female_feminine": 0.15, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 198177, "agriculture_food": 0, "automotive_transport": 4, "finance": 6, "service_retail": 10, "general": 30, "healthcare": 4, "history_law_government": 35, "language_fundamentals": 2, "media_entertainment": 8, "nature_environment": 4, "news_current_affairs": 0, "technology_robotics": 1 } }, "users": 3434, "size": 6899531386, "checksum": "279afe356de2bb85e772192ce542dbbca36eb3b84939277ec51bbad6ef143e49", 
"avgDurationSecs": 5.175, "validDurationSecs": 879825.01, "totalHrs": 285, "validHrs": 244.39 }, "rw": { "buckets": { "dev": 15987, "test": 16213, "train": 1003033, "validated": 1439078, "invalidated": 227830, "other": 47246 }, "duration": 8583409265, "reportedSentences": 630, "validatedSentences": 1404857, "unvalidatedSentences": 3, "clips": 1714154, "splits": { "accent": {}, "age": { "": 0.05, "twenties": 0.61, "thirties": 0.12, "teens": 0.2, "fourties": 0.02, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.1, "male_masculine": 0.57, "female_feminine": 0.33, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1714154, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1166, "size": 61199365353, "checksum": "b8bb8f23f27e28f96d4bd6e22f4ad93527de145cbc513cf3985e06c823014764", "avgDurationSecs": 5.007, "validDurationSecs": 7206001, "totalHrs": 2384.28, "validHrs": 2001.66 }, "sah": { "buckets": { "dev": 1448, "test": 1574, "train": 2060, "validated": 5508, "invalidated": 178, "other": 2282 }, "duration": 49445724, "reportedSentences": 13, "validatedSentences": 6024, "unvalidatedSentences": 433, "clips": 7968, "splits": { "accent": {}, "age": { "": 0.37, "twenties": 0.24, "thirties": 0.23, "teens": 0.05, "fourties": 0.06, "fifties": 0.05, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male_masculine": 0.37, "female_feminine": 0.26, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7968, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 
0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 128, "size": 332859535, "checksum": "f39bf958968c051890b78b59470bad2d5140f737d173963103a685a6f90c34b2", "avgDurationSecs": 6.206, "validDurationSecs": 34180.101, "totalHrs": 13.73, "validHrs": 9.49 }, "sat": { "buckets": { "dev": 64, "test": 125, "train": 332, "validated": 521, "invalidated": 17, "other": 311 }, "duration": 3801708, "reportedSentences": 7, "validatedSentences": 5421, "unvalidatedSentences": 540, "clips": 849, "splits": { "accent": {}, "age": { "": 0.42, "twenties": 0.42, "thirties": 0.12, "teens": 0.01, "fourties": 0.01, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.4, "male_masculine": 0.59, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 849, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 13, "size": 21352220, "checksum": "c235188e204db353dfff87e8d5991dcf1616aa4e37e5b239b627f3b95d583aca", "avgDurationSecs": 4.478, "validDurationSecs": 2332.968, "totalHrs": 1.05, "validHrs": 0.64 }, "sc": { "buckets": { "dev": 545, "test": 589, "train": 925, "validated": 2233, "invalidated": 104, "other": 408 }, "duration": 12878280, "reportedSentences": 36, "validatedSentences": 5445, "unvalidatedSentences": 388, "clips": 2745, "splits": { "accent": {}, "age": { "": 0.44, "twenties": 0.09, "thirties": 0.29, "teens": 0, "fourties": 0.01, "fifties": 0.15, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.54, "male_masculine": 0.13, "female_feminine": 0.33, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2736, "agriculture_food": 2, 
"automotive_transport": 1, "finance": 1, "service_retail": 0, "general": 3, "healthcare": 1, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 1, "news_current_affairs": 2, "technology_robotics": 0 } }, "users": 32, "size": 74890923, "checksum": "1650a6f2a72cf7a80089423aca6aee77f964bf3f46f83b63597f16840463cd72", "avgDurationSecs": 4.692, "validDurationSecs": 10476.211, "totalHrs": 3.57, "validHrs": 2.91 }, "sd": { "buckets": { "dev": 0, "test": 40, "train": 270, "validated": 310, "invalidated": 101, "other": 17462 }, "duration": 75791124, "reportedSentences": 2, "validatedSentences": 13345, "unvalidatedSentences": 43, "clips": 17873, "splits": { "accent": {}, "age": { "": 0.03, "twenties": 0, "thirties": 0.12, "teens": 0, "fourties": 0.86, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.88, "male_masculine": 0.12, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 17830, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 41, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 2, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 20, "size": 385276465, "checksum": "f0266a4d37fe85475d89883af48397e8af656becca968813b4b28b873fab21a8", "avgDurationSecs": 4.241, "validDurationSecs": 1314.567, "totalHrs": 21.05, "validHrs": 0.36 }, "sk": { "buckets": { "dev": 4387, "test": 4472, "train": 6262, "validated": 39380, "invalidated": 1184, "other": 10 }, "duration": 171437076, "reportedSentences": 337, "validatedSentences": 16599, "unvalidatedSentences": 6087, "clips": 40574, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.07, "thirties": 0.19, "teens": 0.06, "fourties": 0.14, "fifties": 0.28, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.58, 
"male_masculine": 0.38, "female_feminine": 0.04, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 33933, "agriculture_food": 2, "automotive_transport": 262, "finance": 2, "service_retail": 0, "general": 6414, "healthcare": 4, "history_law_government": 4015, "language_fundamentals": 90, "media_entertainment": 85, "nature_environment": 170, "news_current_affairs": 0, "technology_robotics": 569 } }, "users": 263, "size": 986223363, "checksum": "428ae17939c30c183145a0603c21a3f81aa4bb7156e45250f313bffe99865522", "avgDurationSecs": 4.225, "validDurationSecs": 166392.075, "totalHrs": 47.62, "validHrs": 46.22 }, "skr": { "buckets": { "dev": 1140, "test": 1033, "train": 1552, "validated": 3737, "invalidated": 467, "other": 1633 }, "duration": 24307020, "reportedSentences": 17, "validatedSentences": 6011, "unvalidatedSentences": 93, "clips": 5837, "splits": { "accent": {}, "age": { "": 0.35, "twenties": 0.24, "thirties": 0.01, "teens": 0.02, "fourties": 0, "fifties": 0.37, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male_masculine": 0.63, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5837, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 60, "size": 139958799, "checksum": "4d06afaf431bd339e79b8631699006cffc1d903aa7e580ca638f4719668d56fa", "avgDurationSecs": 4.164, "validDurationSecs": 15561.99, "totalHrs": 6.75, "validHrs": 4.32 }, "sl": { "buckets": { "dev": 1328, "test": 1338, "train": 1467, "validated": 15240, "invalidated": 467, "other": 3369 }, "duration": 76103758, "reportedSentences": 51, "validatedSentences": 4180, "unvalidatedSentences": 53, "clips": 19076, "splits": { "accent": {}, "age": 
{ "": 0.36, "twenties": 0.31, "thirties": 0.03, "teens": 0.05, "fourties": 0.15, "fifties": 0.05, "sixties": 0.04, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male_masculine": 0.53, "female_feminine": 0.1, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 19076, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 936, "size": 500157836, "checksum": "8c106ae3a59fb839a9bcad389883d48840f6b99cb4c5c787b2bfc81ad7cbe825", "avgDurationSecs": 3.99, "validDurationSecs": 60800.025, "totalHrs": 21.13, "validHrs": 16.88 }, "sq": { "buckets": { "dev": 1645, "test": 1916, "train": 2658, "validated": 6229, "invalidated": 176, "other": 1 }, "duration": 32672628, "reportedSentences": 133, "validatedSentences": 50008, "unvalidatedSentences": 2634, "clips": 6406, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.08, "thirties": 0.15, "teens": 0, "fourties": 0.18, "fifties": 0.35, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male_masculine": 0.48, "female_feminine": 0.14, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1576, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 1788, "healthcare": 0, "history_law_government": 3022, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 20, "technology_robotics": 0 } }, "users": 142, "size": 194768494, "checksum": "2319a5bbae777309312d3ec54d8a15134e05382856d3dc24355e0547c25a0595", "avgDurationSecs": 5.1, "validDurationSecs": 31769.872, "totalHrs": 9.07, "validHrs": 8.82 }, "sr": { "buckets": { "dev": 1908, "test": 1960, "train": 2340, "validated": 8217, 
"invalidated": 422, "other": 4849 }, "duration": 43639020, "reportedSentences": 78, "validatedSentences": 7898, "unvalidatedSentences": 753, "clips": 13488, "splits": { "accent": {}, "age": { "": 0.14, "twenties": 0.25, "thirties": 0.11, "teens": 0, "fourties": 0.1, "fifties": 0.13, "sixties": 0.26, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.53, "male_masculine": 0.32, "female_feminine": 0.15, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 13447, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 13, "healthcare": 0, "history_law_government": 1, "language_fundamentals": 1, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 27 } }, "users": 176, "size": 244777288, "checksum": "02b0d5ce77aa28cf89fef4172b79917f317c31b953451e31c7d70fd526ad94a0", "avgDurationSecs": 3.235, "validDurationSecs": 26585.248, "totalHrs": 12.12, "validHrs": 7.38 }, "st": { "buckets": { "dev": 0, "test": 0, "train": 0, "validated": 0, "invalidated": 0, "other": 209 }, "duration": 673668, "reportedSentences": 0, "validatedSentences": 2339, "unvalidatedSentences": 14836, "clips": 209, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.29, "thirties": 0.71, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.29, "male_masculine": 0.71, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 209, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 2, "size": 4961171, "checksum": "5869752f9d38c81605d9b444b815611825d27edced24bbdf88569a0052d0c2c4", "avgDurationSecs": 3.223, 
"validDurationSecs": 0, "totalHrs": 0.18, "validHrs": 0 }, "sv-SE": { "buckets": { "dev": 5389, "test": 5399, "train": 8014, "validated": 41872, "invalidated": 1581, "other": 6414 }, "duration": 200398011, "reportedSentences": 592, "validatedSentences": 26198, "unvalidatedSentences": 7105, "clips": 49867, "splits": { "accent": {}, "age": { "": 0.19, "twenties": 0.12, "thirties": 0.25, "teens": 0.03, "fourties": 0.37, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.21, "male_masculine": 0.47, "female_feminine": 0.32, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 49866, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 1, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 867, "size": 1266809146, "checksum": "35250e119fedf34fa10ca0a6ca4a0c5b658a47d485cc1ed58429617c83057bf5", "avgDurationSecs": 4.019, "validDurationSecs": 168268.906, "totalHrs": 55.66, "validHrs": 46.74 }, "sw": { "buckets": { "dev": 12254, "test": 12254, "train": 46529, "validated": 267169, "invalidated": 80653, "other": 377290 }, "duration": 3905876088, "reportedSentences": 1902, "validatedSentences": 134709, "unvalidatedSentences": 5774, "clips": 725112, "splits": { "accent": {}, "age": { "": 0.31, "twenties": 0.5, "thirties": 0.11, "teens": 0.01, "fourties": 0.03, "fifties": 0.04, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.29, "male_masculine": 0.34, "female_feminine": 0.37, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 725112, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, 
"news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1471, "size": 22797283903, "checksum": "37aca0c8b7ab50ee50ebe54d68981efba8a872c853682103d9517cc96a7de41d", "avgDurationSecs": 5.387, "validDurationSecs": 1439128.036, "totalHrs": 1084.96, "validHrs": 399.75 }, "ta": { "buckets": { "dev": 12113, "test": 12215, "train": 46141, "validated": 136193, "invalidated": 5712, "other": 105200 }, "duration": 1531686228, "reportedSentences": 3437, "validatedSentences": 118209, "unvalidatedSentences": 626, "clips": 247105, "splits": { "accent": {}, "age": { "": 0.66, "twenties": 0.11, "thirties": 0.12, "teens": 0.04, "fourties": 0.03, "fifties": 0.02, "sixties": 0, "seventies": 0.02, "eighties": 0, "nineties": 0 }, "gender": { "": 0.66, "male_masculine": 0.19, "female_feminine": 0.15, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 247105, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 964, "size": 9181721454, "checksum": "a02b0f38d7833a8f78a49501760fb516f474b177ffb08e6fd94346ed04d4471f", "avgDurationSecs": 6.199, "validDurationSecs": 844195.554, "totalHrs": 425.46, "validHrs": 234.49 }, "te": { "buckets": { "dev": 58, "test": 61, "train": 73, "validated": 313, "invalidated": 50, "other": 1948 }, "duration": 9673704, "reportedSentences": 15, "validatedSentences": 330, "unvalidatedSentences": 19295, "clips": 2311, "splits": { "accent": {}, "age": { "": 0.13, "twenties": 0.68, "thirties": 0.03, "teens": 0.07, "fourties": 0.01, "fifties": 0, "sixties": 0.08, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.14, "male_masculine": 0.68, "female_feminine": 0.18, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2311, "agriculture_food": 0, 
"automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 54, "size": 57931701, "checksum": "2c1d69f3ead2dbe3697f0b1e9b3eca247e1c40937a16668376c43b2840d86d50", "avgDurationSecs": 4.186, "validDurationSecs": 1310.199, "totalHrs": 2.68, "validHrs": 0.36 }, "th": { "buckets": { "dev": 11055, "test": 11055, "train": 32942, "validated": 148363, "invalidated": 9337, "other": 207197 }, "duration": 1529408532, "reportedSentences": 4296, "validatedSentences": 64477, "unvalidatedSentences": 121664, "clips": 364897, "splits": { "accent": {}, "age": { "": 0.4, "twenties": 0.24, "thirties": 0.07, "teens": 0.04, "fourties": 0.04, "fifties": 0.2, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.41, "male_masculine": 0.42, "female_feminine": 0.17, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 364897, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 7953, "size": 8923550838, "checksum": "340f52987151731e4ae80706aac9149f0952ea0535a3b80ad0e92f9abf6af026", "avgDurationSecs": 4.191, "validDurationSecs": 621840.24, "totalHrs": 424.83, "validHrs": 172.73 }, "ti": { "buckets": { "dev": 5, "test": 8, "train": 14, "validated": 27, "invalidated": 2, "other": 272 }, "duration": 1510488, "reportedSentences": 3, "validatedSentences": 4532, "unvalidatedSentences": 138, "clips": 301, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0, "thirties": 0.03, "teens": 0, "fourties": 0.73, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 
0.95, "male_masculine": 0.05, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 301, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 12, "size": 9155127, "checksum": "140ee609858b28a717db42f81d67adaa9d68e79b2d7052b789fe1f845b41d028", "avgDurationSecs": 5.018, "validDurationSecs": 135.492, "totalHrs": 0.41, "validHrs": 0.03 }, "tig": { "buckets": { "dev": 1203, "test": 1248, "train": 1539, "validated": 4112, "invalidated": 127, "other": 1244 }, "duration": 35118324, "reportedSentences": 31, "validatedSentences": 5388, "unvalidatedSentences": 104, "clips": 5483, "splits": { "accent": {}, "age": { "": 0.73, "twenties": 0.24, "thirties": 0.04, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.99, "male_masculine": 0.01, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5481, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 1, "healthcare": 0, "history_law_government": 1, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 119, "size": 188277874, "checksum": "f737f3ce620fb3d5707545b13d495fe00128461608e3e82710c6fe6f2cf40301", "avgDurationSecs": 6.405, "validDurationSecs": 26337.142, "totalHrs": 9.75, "validHrs": 7.31 }, "tk": { "buckets": { "dev": 438, "test": 550, "train": 747, "validated": 1936, "invalidated": 130, "other": 2506 }, "duration": 25078788, "reportedSentences": 27, "validatedSentences": 2494, "unvalidatedSentences": 203, "clips": 4572, "splits": { "accent": {}, "age": { "": 0.35, "twenties": 
0.54, "thirties": 0.09, "teens": 0, "fourties": 0.03, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male_masculine": 0.45, "female_feminine": 0.18, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 4572, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 122, "size": 141242943, "checksum": "8e859627956e5fc9701484e4e3362ae60ab5c4c4d652eed9d0a8e5c01ee45372", "avgDurationSecs": 5.485, "validDurationSecs": 10619.539, "totalHrs": 6.96, "validHrs": 2.94 }, "tn": { "buckets": { "dev": 317, "test": 369, "train": 1073, "validated": 3495, "invalidated": 11, "other": 543 }, "duration": 17693028, "reportedSentences": 1, "validatedSentences": 1765, "unvalidatedSentences": 15342, "clips": 4049, "splits": { "accent": {}, "age": { "": 0.03, "twenties": 0.02, "thirties": 0.51, "teens": 0, "fourties": 0, "fifties": 0.43, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.48, "male_masculine": 0, "female_feminine": 0.48, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.04 }, "sentence_domain": { "": 4049, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 17, "size": 89812251, "checksum": "aac60412140ab79e7ee4b353c6ae339bc61c6e5eb699df5f223221a8fe8eb731", "avgDurationSecs": 4.37, "validDurationSecs": 15272.199, "totalHrs": 4.91, "validHrs": 4.24 }, "tok": { "buckets": { "dev": 2147, "test": 2162, "train": 2653, "validated": 12232, "invalidated": 356, "other": 4728 }, "duration": 72152316, 
"reportedSentences": 148, "validatedSentences": 7454, "unvalidatedSentences": 2269, "clips": 17316, "splits": { "accent": {}, "age": { "": 0.32, "twenties": 0.21, "thirties": 0.07, "teens": 0.24, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0.16, "nineties": 0 }, "gender": { "": 0.44, "male_masculine": 0.49, "female_feminine": 0.05, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.02 }, "sentence_domain": { "": 17315, "agriculture_food": 0, "automotive_transport": 0, "finance": 1, "service_retail": 0, "general": 1, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 165, "size": 422235791, "checksum": "2378c93e48dedc499309a4e1649fc4573149ce3251b597d6c84094667ab8ae8b", "avgDurationSecs": 4.167, "validDurationSecs": 50968.303, "totalHrs": 20.04, "validHrs": 14.15 }, "tr": { "buckets": { "dev": 11713, "test": 11729, "train": 39777, "validated": 119611, "invalidated": 4839, "other": 116 }, "duration": 475310043, "reportedSentences": 455, "validatedSentences": 410395, "unvalidatedSentences": 3441, "clips": 124566, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.23, "thirties": 0.09, "teens": 0.02, "fourties": 0.03, "fifties": 0.07, "sixties": 0.2, "seventies": 0.03, "eighties": 0.1, "nineties": 0 }, "gender": { "": 0.26, "male_masculine": 0.42, "female_feminine": 0.32, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 124552, "agriculture_food": 0, "automotive_transport": 1, "finance": 0, "service_retail": 0, "general": 1, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 11, "media_entertainment": 1, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1718, "size": 2901491605, "checksum": "cd704ec5d6779bf4a66af948ece7686b4654e1976d71db12c5c6e4bc780803d6", "avgDurationSecs": 3.816, 
"validDurationSecs": 456403.108, "totalHrs": 132.03, "validHrs": 126.77 }, "ts": { "buckets": { "dev": 0, "test": 0, "train": 0, "validated": 0, "invalidated": 0, "other": 1 }, "duration": 8928, "reportedSentences": 0, "validatedSentences": 352, "unvalidatedSentences": 16012, "clips": 1, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1, "size": 1040824, "checksum": "c1d517b55d65474acee5a1d3a3fd493d5602e42c12c97af67eb46b3fa180eec5", "avgDurationSecs": 8.928, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "tt": { "buckets": { "dev": 3976, "test": 5102, "train": 8899, "validated": 29700, "invalidated": 453, "other": 716 }, "duration": 118249550, "reportedSentences": 6, "validatedSentences": 18043, "unvalidatedSentences": 40, "clips": 30869, "splits": { "accent": {}, "age": { "": 0.21, "twenties": 0.05, "thirties": 0.71, "teens": 0, "fourties": 0, "fifties": 0.01, "sixties": 0.01, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.21, "male_masculine": 0.76, "female_feminine": 0.03, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 30858, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 11, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 280, "size": 862331884, 
"checksum": "4f5113bd1fafdb415c40da4ea93fcb3f4e51085dce54ee8c40286ca89006b91a", "avgDurationSecs": 3.831, "validDurationSecs": 113771.474, "totalHrs": 32.84, "validHrs": 31.6 }, "tw": { "buckets": { "dev": 0, "test": 14, "train": 185, "validated": 199, "invalidated": 3, "other": 70 }, "duration": 1172304, "reportedSentences": 3, "validatedSentences": 49159, "unvalidatedSentences": 19, "clips": 272, "splits": { "accent": {}, "age": { "": 0.19, "twenties": 0.02, "thirties": 0.79, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.19, "male_masculine": 0.81, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 272, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 11, "size": 9649466, "checksum": "b57274d4562efc697530c971b51401d1ad6fed4ba130ce04d9419c3749cb229c", "avgDurationSecs": 4.31, "validDurationSecs": 857.678, "totalHrs": 0.32, "validHrs": 0.23 }, "ug": { "buckets": { "dev": 13791, "test": 13791, "train": 80528, "validated": 224869, "invalidated": 17231, "other": 35033 }, "duration": 1613716092, "reportedSentences": 1412, "validatedSentences": 230610, "unvalidatedSentences": 8884, "clips": 277133, "splits": { "accent": {}, "age": { "": 0.21, "twenties": 0.31, "thirties": 0.3, "teens": 0.06, "fourties": 0.11, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.49, "male_masculine": 0.29, "female_feminine": 0.21, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.01, "intersex": 0 }, "sentence_domain": { "": 274916, "agriculture_food": 2, "automotive_transport": 1, "finance": 1, "service_retail": 0, "general": 2191, "healthcare": 26, 
"history_law_government": 3, "language_fundamentals": 9, "media_entertainment": 12, "nature_environment": 1, "news_current_affairs": 0, "technology_robotics": 9 } }, "users": 1533, "size": 9177696075, "checksum": "85281f6fe2a8c118e1984b54b01a046072473518ed447a43083d234f38c01540", "avgDurationSecs": 5.823, "validDurationSecs": 1309388.358, "totalHrs": 448.25, "validHrs": 363.71 }, "uk": { "buckets": { "dev": 10237, "test": 10238, "train": 26616, "validated": 77432, "invalidated": 3327, "other": 8329 }, "duration": 413552040, "reportedSentences": 654, "validatedSentences": 212666, "unvalidatedSentences": 1599, "clips": 89088, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0.34, "thirties": 0.25, "teens": 0.09, "fourties": 0.09, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.24, "male_masculine": 0.59, "female_feminine": 0.17, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 89088, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1136, "size": 2718404222, "checksum": "93a41db3f4ab24a94281fd51b61641c159f7407403f53dc770d306229523fce6", "avgDurationSecs": 4.642, "validDurationSecs": 359444.163, "totalHrs": 114.87, "validHrs": 99.84 }, "ur": { "buckets": { "dev": 5080, "test": 5080, "train": 7313, "validated": 68049, "invalidated": 10945, "other": 173486 }, "duration": 1084864500, "reportedSentences": 400, "validatedSentences": 21699, "unvalidatedSentences": 48263, "clips": 252480, "splits": { "accent": {}, "age": { "": 0.06, "twenties": 0.92, "thirties": 0, "teens": 0, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.23, "male_masculine": 0.52, "female_feminine": 0.25, 
"transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 252373, "agriculture_food": 12, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 48, "healthcare": 47, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 483, "size": 6333322259, "checksum": "39ba21f8531464082da2a029d6baa7e7e329988adb9282e4b435667715fd2b72", "avgDurationSecs": 4.297, "validDurationSecs": 292395.217, "totalHrs": 301.35, "validHrs": 81.22 }, "uz": { "buckets": { "dev": 12254, "test": 12347, "train": 48676, "validated": 86963, "invalidated": 14084, "other": 128328 }, "duration": 953054352, "reportedSentences": 1833, "validatedSentences": 148728, "unvalidatedSentences": 137876, "clips": 229375, "splits": { "accent": {}, "age": { "": 0.41, "twenties": 0.4, "thirties": 0.01, "teens": 0.18, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.41, "male_masculine": 0.44, "female_feminine": 0.15, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 229375, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 2235, "size": 5348304170, "checksum": "f1ceb3a2d412b660c01d7abdcb4fa4c293efded8e41ddb312ae53137f94e0e05", "avgDurationSecs": 4.155, "validDurationSecs": 361331.73, "totalHrs": 264.73, "validHrs": 100.36 }, "vi": { "buckets": { "dev": 732, "test": 1382, "train": 2204, "validated": 5453, "invalidated": 403, "other": 11954 }, "duration": 70336016, "reportedSentences": 201, "validatedSentences": 5830, "unvalidatedSentences": 5275, "clips": 17810, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 
0.2, "thirties": 0.02, "teens": 0.22, "fourties": 0.02, "fifties": 0, "sixties": 0.28, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.28, "male_masculine": 0.53, "female_feminine": 0.19, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 17810, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 314, "size": 409504824, "checksum": "b09847f1162dbfb3cf6fbef16a243838e062be13e3cae2b7f1bc1f59d2929b5f", "avgDurationSecs": 3.949, "validDurationSecs": 21535.222, "totalHrs": 19.53, "validHrs": 5.98 }, "vot": { "buckets": { "dev": 0, "test": 7, "train": 96, "validated": 103, "invalidated": 324, "other": 0 }, "duration": 1029972, "reportedSentences": 0, "validatedSentences": 5437, "unvalidatedSentences": 1, "clips": 427, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.74, "thirties": 0, "teens": 0.01, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.25, "male_masculine": 0.75, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 427, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 6, "size": 8186690, "checksum": "8ccca0da441a208fa77fe1a63abc25d0ba3a315b9234cb9843847856370d63b2", "avgDurationSecs": 2.412, "validDurationSecs": 248.448, "totalHrs": 0.28, "validHrs": 0.06 }, "xh": { "buckets": { "dev": 0, "test": 0, "train": 1, "validated": 1, "invalidated": 0, "other": 14 }, "duration": 94140, "reportedSentences": 0, 
"validatedSentences": 1476, "unvalidatedSentences": 14922, "clips": 15, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 15, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 3, "size": 1691682, "checksum": "2d64bc7fad61267b9648870b8afa40134862b411f0ee3a4d73ded7f4403602d7", "avgDurationSecs": 6.276, "validDurationSecs": 6.276, "totalHrs": 0.02, "validHrs": 0 }, "yi": { "buckets": { "dev": 144, "test": 143, "train": 150, "validated": 452, "invalidated": 29, "other": 26 }, "duration": 2096172, "reportedSentences": 1, "validatedSentences": 896, "unvalidatedSentences": 606, "clips": 507, "splits": { "accent": {}, "age": { "": 0.87, "twenties": 0.13, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 507, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 45, "size": 11615488, "checksum": "3a60ccc5436d2acd7303dea922df1d6bbe68c9ca5e12e4d2e87ccbc38925b192", "avgDurationSecs": 4.134, "validDurationSecs": 1868.777, "totalHrs": 0.58, "validHrs": 0.51 }, "yo": { "buckets": { "dev": 904, "test": 1083, "train": 1361, 
"validated": 3350, "invalidated": 252, "other": 1071 }, "duration": 27991800, "reportedSentences": 21, "validatedSentences": 5419, "unvalidatedSentences": 16, "clips": 4673, "splits": { "accent": {}, "age": { "": 0.3, "twenties": 0.63, "thirties": 0.02, "teens": 0.04, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.31, "male_masculine": 0.31, "female_feminine": 0.38, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 4673, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 127, "size": 164093912, "checksum": "308d0ad0b81d06f926a39d9cccba25e28257a4e9bcba3d0f23d87ff166943799", "avgDurationSecs": 5.99, "validDurationSecs": 20066.88, "totalHrs": 7.77, "validHrs": 5.57 }, "yue": { "buckets": { "dev": 5129, "test": 5129, "train": 7417, "validated": 190963, "invalidated": 8036, "other": 79931 }, "duration": 1106476488, "reportedSentences": 2257, "validatedSentences": 18091, "unvalidatedSentences": 7660, "clips": 278930, "splits": { "accent": {}, "age": { "": 0.19, "twenties": 0.29, "thirties": 0.47, "teens": 0.01, "fourties": 0.04, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2, "male_masculine": 0.17, "female_feminine": 0.63, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 276109, "agriculture_food": 17, "automotive_transport": 51, "finance": 127, "service_retail": 603, "general": 1659, "healthcare": 412, "history_law_government": 64, "language_fundamentals": 0, "media_entertainment": 191, "nature_environment": 187, "news_current_affairs": 113, "technology_robotics": 95 } }, "users": 1152, "size": 6412827047, "checksum": 
"7d8e4a36868f2936ac66a1c9ca30fc478f48a51af9e3dda9f7e9333e7a21817d", "avgDurationSecs": 3.967, "validDurationSecs": 757523.642, "totalHrs": 307.35, "validHrs": 210.42 }, "zgh": { "buckets": { "dev": 295, "test": 213, "train": 815, "validated": 1323, "invalidated": 84, "other": 618 }, "duration": 6942492, "reportedSentences": 6, "validatedSentences": 2730, "unvalidatedSentences": 354, "clips": 2025, "splits": { "accent": {}, "age": { "": 0.08, "twenties": 0.36, "thirties": 0.55, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.09, "male_masculine": 0.88, "female_feminine": 0.03, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1989, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 29, "healthcare": 2, "history_law_government": 0, "language_fundamentals": 4, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 1 } }, "users": 27, "size": 37754847, "checksum": "c361c41b3866629ce01fdd0bb8428fc364a2acd5f039db75069ab25fd6408fcc", "avgDurationSecs": 3.428, "validDurationSecs": 4535.761, "totalHrs": 1.92, "validHrs": 1.25 }, "zh-CN": { "buckets": { "dev": 10631, "test": 10631, "train": 29444, "validated": 187958, "invalidated": 58952, "other": 600619 }, "duration": 3850067076, "reportedSentences": 1033, "validatedSentences": 59026, "unvalidatedSentences": 854, "clips": 847529, "splits": { "accent": {}, "age": { "": 0.92, "twenties": 0.05, "thirties": 0.01, "teens": 0.01, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.93, "male_masculine": 0.06, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 846226, "agriculture_food": 13, "automotive_transport": 33, "finance": 13, "service_retail": 35, "general": 436, "healthcare": 56, "history_law_government": 
160, "language_fundamentals": 30, "media_entertainment": 596, "nature_environment": 14, "news_current_affairs": 34, "technology_robotics": 83 } }, "users": 7330, "size": 22794842832, "checksum": "abfd6abdb9151ca9f778c811289b5128d57df16e4f8c38c8e4089e3027740e09", "avgDurationSecs": 4.543, "validDurationSecs": 853836.161, "totalHrs": 1069.46, "validHrs": 237.17 }, "zh-HK": { "buckets": { "dev": 5601, "test": 5601, "train": 8445, "validated": 93760, "invalidated": 4610, "other": 23610 }, "duration": 507608016, "reportedSentences": 815, "validatedSentences": 20072, "unvalidatedSentences": 76, "clips": 121980, "splits": { "accent": {}, "age": { "": 0.39, "twenties": 0.33, "thirties": 0.11, "teens": 0.02, "fourties": 0.12, "fifties": 0.02, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male_masculine": 0.41, "female_feminine": 0.22, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 121199, "agriculture_food": 0, "automotive_transport": 16, "finance": 31, "service_retail": 178, "general": 517, "healthcare": 65, "history_law_government": 37, "language_fundamentals": 56, "media_entertainment": 38, "nature_environment": 33, "news_current_affairs": 20, "technology_robotics": 20 } }, "users": 3084, "size": 3618608716, "checksum": "fc7f490da1ccd5c61f06cea2f642364769e0f89798ef5a3c77141b71e7e32ee8", "avgDurationSecs": 4.161, "validDurationSecs": 390173.205, "totalHrs": 141, "validHrs": 108.38 }, "zh-TW": { "buckets": { "dev": 5068, "test": 5068, "train": 7290, "validated": 84157, "invalidated": 4865, "other": 49173 }, "duration": 464773990, "reportedSentences": 179, "validatedSentences": 19621, "unvalidatedSentences": 39, "clips": 138195, "splits": { "accent": {}, "age": { "": 0.27, "twenties": 0.3, "thirties": 0.19, "teens": 0.06, "fourties": 0.09, "fifties": 0.09, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.28, "male_masculine": 0.49, "female_feminine": 0.22, 
"transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 137885, "agriculture_food": 5, "automotive_transport": 130, "finance": 0, "service_retail": 72, "general": 110, "healthcare": 7, "history_law_government": 85, "language_fundamentals": 5, "media_entertainment": 90, "nature_environment": 7, "news_current_affairs": 20, "technology_robotics": 11 } }, "users": 2259, "size": 3106654241, "checksum": "530a5ebdc4b59ac2050ca5e34048b332ed1bc4c0dc8756589599aeffaf150484", "avgDurationSecs": 3.363, "validDurationSecs": 283034.731, "totalHrs": 129.1, "validHrs": 78.62 }, "zu": { "buckets": { "dev": 0, "test": 0, "train": 7, "validated": 7, "invalidated": 0, "other": 103 }, "duration": 361872, "reportedSentences": 0, "validatedSentences": 1320, "unvalidatedSentences": 19379, "clips": 110, "splits": { "accent": {}, "age": { "": 0.37, "twenties": 0.63, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.63 }, "sentence_domain": { "": 110, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 4, "size": 3549202, "checksum": "9bff362c407572a92d10e2c8a0dac54f3056281e9f19264bd783084db30a678d", "avgDurationSecs": 3.29, "validDurationSecs": 23.028, "totalHrs": 0.1, "validHrs": 0 }, "zza": { "buckets": { "dev": 463, "test": 388, "train": 707, "validated": 1558, "invalidated": 146, "other": 485 }, "duration": 8982468, "reportedSentences": 2321, "validatedSentences": 5644, "unvalidatedSentences": 982, "clips": 2189, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0, "thirties": 0.65, "teens": 0, "fourties": 0, "fifties": 0.01, 
"sixties": 0.12, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.71, "male_masculine": 0.27, "female_feminine": 0.02, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1456, "agriculture_food": 3, "automotive_transport": 1, "finance": 1, "service_retail": 9, "general": 670, "healthcare": 16, "history_law_government": 0, "language_fundamentals": 27, "media_entertainment": 2, "nature_environment": 3, "news_current_affairs": 0, "technology_robotics": 2 } }, "users": 18, "size": 52617187, "checksum": "55c62144221e77c80de50f1a11ae7956f4e3d4510669bfcf59127ec5784aee66", "avgDurationSecs": 4.103, "validDurationSecs": 6393.186, "totalHrs": 2.49, "validHrs": 1.77 } }, "totalDuration": 120725265847, "totalValidDurationSecs": 80439450, "totalHrs": 33534, "totalValidHrs": 22344 } ================================================ FILE: datasets/scripted-speech/cv-corpus-21.0-delta-2025-03-14.json ================================================ { "locales": { "ab": { "duration": 53820, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 15, "users": 2, "size": 293779, "checksum": "ba64a435f1413e9eaa328ec6d148b55131f23f60cc168ef1fa5b4df8d421da7f", "avgDurationSecs": 5.178, "validDurationSecs": 103.41, "totalHrs": 0.01, "validHrs": 0.03 }, "af": { "duration": 486036, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 66, "users": 7, "size": 2557974, "checksum": "07e6c1ec1a4dc4044d0ef564349a11ae132744c0bba521b765ce0878dfcaac1b", "avgDurationSecs": 5.878, "validDurationSecs": 269.8, "totalHrs": 0.13, "validHrs": 0.07 }, "am": { "duration": 263880, "reportedSentences": 0, "validatedSentences": 10, "unvalidatedSentences": -8, "clips": 36, "users": 6, "size": 1491373, "checksum": "96e19ee23235d139efaba48bca825a38be5f947f891fcc058066f097901b2cd9", "avgDurationSecs": 6.257, "validDurationSecs": 224.34, "totalHrs": 0.07, "validHrs": 0.07 }, "ar": { "duration": 1204164, 
"reportedSentences": 4, "validatedSentences": 20, "unvalidatedSentences": -5, "clips": 268, "users": 18, "size": 6904644, "checksum": "d417e72f24c42c8d55c3e229636da56198beb6bcbf6323957f38340d4b49702f", "avgDurationSecs": 4.161, "validDurationSecs": 871.48, "totalHrs": 0.34, "validHrs": 0.24 }, "as": { "duration": 269172, "reportedSentences": 2, "validatedSentences": 431, "unvalidatedSentences": -429, "clips": 49, "users": 1, "size": 1515009, "checksum": "1f2646de2a7c11fffcc7cc683dd6197af953600d8eab9c39083a3dc3658b3a54", "avgDurationSecs": 5.897, "validDurationSecs": 192.91, "totalHrs": 0.07, "validHrs": 0.05 }, "ast": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 4840, "checksum": "88c410b36a62f85a0dfb7b0326f8e1455377d239a28a406a95b3d1163659acf6", "avgDurationSecs": 4.434, "validDurationSecs": 257.18, "totalHrs": 0, "validHrs": 0.07 }, "az": { "duration": 367236, "reportedSentences": 0, "validatedSentences": 2, "unvalidatedSentences": -2, "clips": 67, "users": 3, "size": 2081541, "checksum": "b205514ce6bfdbed3fbc67445c23fde0ec886032dd6a9bd51b43b713e0f51e1a", "avgDurationSecs": 5.438, "validDurationSecs": 169.58, "totalHrs": 0.11, "validHrs": 0.05 }, "ba": { "duration": 219924, "reportedSentences": 1, "validatedSentences": 9, "unvalidatedSentences": -7, "clips": 45, "users": 1, "size": 1239635, "checksum": "8978e8f586c025fb35ca3abc9616acaa1c4f348ba301df6ddaf227ad812030e8", "avgDurationSecs": 4.427, "validDurationSecs": 922.99, "totalHrs": 0.06, "validHrs": 0.26 }, "bas": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -21015, "checksum": "e731cb7a6408106dad1ce9c138348852cb91738b54789ed4f2a9568ad6a20ffe", "avgDurationSecs": 3.925, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "be": { "duration": 44703648, "reportedSentences": 1, "validatedSentences": 0, "unvalidatedSentences": 8, "clips": 7852, "users": 
39, "size": 240684322, "checksum": "887e3191c4c9315cc16d8fb496c70eecd6ee5474b81a946ba8bf4b24bb834d52", "avgDurationSecs": 4.787, "validDurationSecs": 26552.5, "totalHrs": 12.42, "validHrs": 7.37 }, "bg": { "duration": 48096, "reportedSentences": 0, "validatedSentences": 2, "unvalidatedSentences": -2, "clips": 10, "users": 1, "size": 280978, "checksum": "72ce07c237caa9a83391b1770cd7c04193d83aed77d68b6590500bf0ebf9501f", "avgDurationSecs": 5.472, "validDurationSecs": 164.28, "totalHrs": 0.01, "validHrs": 0.04 }, "bn": { "duration": 548712, "reportedSentences": 0, "validatedSentences": 16, "unvalidatedSentences": -16, "clips": 86, "users": 12, "size": 3292084, "checksum": "019d31ffe72d5f6dccc2d00e4fff59eb3d12854d023fc2ae5786afc0eb855df3", "avgDurationSecs": 4.368, "validDurationSecs": 243.19, "totalHrs": 0.16, "validHrs": 0.07 }, "br": { "duration": 99432, "reportedSentences": 0, "validatedSentences": 22, "unvalidatedSentences": -20, "clips": 29, "users": 1, "size": 561513, "checksum": "d55de8106570dc0a0a4ab51d764ed23127ddab8b0c7a3355890bf23dd4c16c10", "avgDurationSecs": 3.192, "validDurationSecs": 284.1, "totalHrs": 0.02, "validHrs": 0.08 }, "ca": { "duration": 125713440, "reportedSentences": 466, "validatedSentences": 1291, "unvalidatedSentences": 251, "clips": 33495, "users": 159, "size": 725856903, "checksum": "95e4326db4f1900d1aeeb628dba0bfff9270b03ceecf6089f784df42a9b7a7c7", "avgDurationSecs": 5.231, "validDurationSecs": 109654.78, "totalHrs": 34.92, "validHrs": 30.46 }, "ckb": { "duration": 4012524, "reportedSentences": 2, "validatedSentences": 12, "unvalidatedSentences": 8, "clips": 705, "users": 35, "size": 23264035, "checksum": "bcb94fe2c37bba8d9904e4ee2836f89c7754b30b3c4c73ddd99e9cfd09255b7d", "avgDurationSecs": 4.09, "validDurationSecs": 2892.54, "totalHrs": 1.11, "validHrs": 0.8 }, "cnh": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 1699, "checksum": 
"de263297edb0be56224cb2b932826eaf71b3a1287c7ffa0ca3b42c34b20f63d6", "avgDurationSecs": 3.516, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "cs": { "duration": 5361840, "reportedSentences": 2, "validatedSentences": 20, "unvalidatedSentences": 208, "clips": 1264, "users": 39, "size": 31346299, "checksum": "35506b6a8db390de138e8dd71830ddf3a71ee102c033faa6d15bb0ab2fa58e0b", "avgDurationSecs": 4.463, "validDurationSecs": 5697.77, "totalHrs": 1.49, "validHrs": 1.59 }, "cv": { "duration": 285876, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 46, "users": 0, "size": 1669933, "checksum": "da0aa4a1cd3f7fbd7f7affc70ea23087c22089852a224cbb12f1def4c01af7f9", "avgDurationSecs": 5.042, "validDurationSecs": 51.28, "totalHrs": 0.08, "validHrs": 0.02 }, "cy": { "duration": 281664, "reportedSentences": 0, "validatedSentences": 12, "unvalidatedSentences": -11, "clips": 41, "users": 4, "size": 1739505, "checksum": "be3980a2ae1faf4854ff8fa039fef86b82765f64f26a9e078247f1e54a9157ad", "avgDurationSecs": 4.913, "validDurationSecs": 455.91, "totalHrs": 0.08, "validHrs": 0.13 }, "da": { "duration": 9322884, "reportedSentences": 197, "validatedSentences": 2, "unvalidatedSentences": 519, "clips": 2396, "users": 13, "size": 54577296, "checksum": "eb495dfdedaf84db5d8c631bdef854ecd9b1545e13558c6ab7269dc9df0c3949", "avgDurationSecs": 4.063, "validDurationSecs": 2409.89, "totalHrs": 2.59, "validHrs": 0.66 }, "dav": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 29, "clips": 0, "users": 0, "size": 21411, "checksum": "6032f6de3c4da75162ec1118e99e589fe4b2d07814b7aaa7ef6c7b0ad7d160e4", "avgDurationSecs": 4.059, "validDurationSecs": 12.18, "totalHrs": 0, "validHrs": 0.01 }, "de": { "duration": 34463592, "reportedSentences": 117, "validatedSentences": 629, "unvalidatedSentences": -514, "clips": 6036, "users": 90, "size": 202561003, "checksum": "a9fa0c62e0135c7d034ed9ec3a50e6a63b0e866acbfb9da5e292f039f8f4e70a", 
"avgDurationSecs": 5.243, "validDurationSecs": 28448.83, "totalHrs": 9.58, "validHrs": 7.9 }, "dv": { "duration": 692964, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 3, "clips": 115, "users": 3, "size": 4118988, "checksum": "1169afebc94a80df739de0a09e90d8531713dda94726d46b117a8ecd4c9c4bb8", "avgDurationSecs": 5.12, "validDurationSecs": 360.73, "totalHrs": 0.2, "validHrs": 0.1 }, "dyu": { "duration": 74376, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 7, "users": 1, "size": 441472, "checksum": "ab6606b9336a0905f2b39e0bf1bb2dc2c4a21ac5434edea83629e82673555b6e", "avgDurationSecs": 6.309, "validDurationSecs": 96.17, "totalHrs": 0.02, "validHrs": 0.03 }, "el": { "duration": 894924, "reportedSentences": 0, "validatedSentences": 81, "unvalidatedSentences": -78, "clips": 211, "users": 3, "size": 5249024, "checksum": "8d3f99a1caf3c647ecbff802b558568be0b2310eab9af39b6eff803f84d80835", "avgDurationSecs": 4.149, "validDurationSecs": 721.13, "totalHrs": 0.25, "validHrs": 0.2 }, "en": { "duration": 139629960, "reportedSentences": 115, "validatedSentences": 3323, "unvalidatedSentences": -859, "clips": 21796, "users": 993, "size": 819193634, "checksum": "9c368b384739ff7ae1a7ca67cc11cb475e877b2d4f10a27f2707c077d685d9f4", "avgDurationSecs": 5.269, "validDurationSecs": 75269.29, "totalHrs": 38.78, "validHrs": 20.91 }, "eo": { "duration": 5197860, "reportedSentences": 17, "validatedSentences": 0, "unvalidatedSentences": 68, "clips": 750, "users": 29, "size": 30653666, "checksum": "c6cf0d4936053c58f0787d976ff6e529500a435bd08ea6f46d3155579e7b8cc9", "avgDurationSecs": 6.07, "validDurationSecs": 1958.95, "totalHrs": 1.44, "validHrs": 0.55 }, "es": { "duration": 30001068, "reportedSentences": 13, "validatedSentences": 431, "unvalidatedSentences": 3, "clips": 5933, "users": 123, "size": 176120072, "checksum": "81dabff368ec7292ecd34b6f09f4872802f6eed17077b67e6f09cfb425b1582d", "avgDurationSecs": 4.874, "validDurationSecs": 
14424.39, "totalHrs": 8.34, "validHrs": 4 }, "et": { "duration": 897228, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 4, "clips": 137, "users": 4, "size": 5305877, "checksum": "5957bb0b5fc83a381d78af1e2ba386e52805d6cddf32d9c3c8fdb7ba6425cf6a", "avgDurationSecs": 6.751, "validDurationSecs": -264.37, "totalHrs": 0.25, "validHrs": -0.07 }, "eu": { "duration": 16323228, "reportedSentences": 16, "validatedSentences": 199, "unvalidatedSentences": 5348, "clips": 2490, "users": 75, "size": 93836846, "checksum": "55263915d92928214e6a3fed4d662efbfbcc8f260b17e02d063241702437fba0", "avgDurationSecs": 5.501, "validDurationSecs": 8400.8, "totalHrs": 4.54, "validHrs": 2.34 }, "fa": { "duration": 7652952, "reportedSentences": 99, "validatedSentences": 261, "unvalidatedSentences": 144, "clips": 2602, "users": 39, "size": 44011840, "checksum": "1262bbb7a735d05463cf834323733f079115e212b37ab414e5719ba31517b7c2", "avgDurationSecs": 3.961, "validDurationSecs": 9976.79, "totalHrs": 2.13, "validHrs": 2.77 }, "fi": { "duration": 433944, "reportedSentences": 0, "validatedSentences": 3, "unvalidatedSentences": -3, "clips": 80, "users": 2, "size": 2531019, "checksum": "0295ec92158874ff7318cb0f8aebecfb44ad260626c15718059d763c944968f6", "avgDurationSecs": 4.622, "validDurationSecs": 3599.95, "totalHrs": 0.12, "validHrs": 1 }, "fr": { "duration": 28811412, "reportedSentences": 56, "validatedSentences": 562, "unvalidatedSentences": -251, "clips": 5719, "users": 153, "size": 167897950, "checksum": "bd362910a453ad08ffc6dd212cebe219841ef297ae88b118af305fa08ed71e55", "avgDurationSecs": 5.025, "validDurationSecs": 43346.09, "totalHrs": 8, "validHrs": 12.04 }, "fy-NL": { "duration": 35676, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 4, "users": 0, "size": 157756, "checksum": "64217b0db1411e2b1bfc504a1b50eb5c9e058d184fa97108a64450351bdd68e1", "avgDurationSecs": 4.854, "validDurationSecs": 78.12, "totalHrs": 0.01, "validHrs": 0.02 }, 
"ga-IE": { "duration": 1980, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 323, "clips": 1, "users": 0, "size": 33517, "checksum": "d1feaf2c198a58f75f8cde0485f610a787751f0477e5b5b4ad21df8789a4a59f", "avgDurationSecs": 3.592, "validDurationSecs": 598.99, "totalHrs": 0, "validHrs": 0.16 }, "gl": { "duration": 124243344, "reportedSentences": 16, "validatedSentences": 540, "unvalidatedSentences": -447, "clips": 27489, "users": 181, "size": 714263106, "checksum": "2aab68c0e17bc948017349cff57ee37296ab2ac45b4cbacf42c5854e6ae20bf6", "avgDurationSecs": 5.027, "validDurationSecs": 109517.33, "totalHrs": 34.51, "validHrs": 30.42 }, "gn": { "duration": 54108, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 1, "clips": 11, "users": 2, "size": 290960, "checksum": "596ece1eb88a9066391fa13d511e47d487a049f6afe46b88aa423f7fc3d0c42f", "avgDurationSecs": 4.555, "validDurationSecs": 223.69, "totalHrs": 0.02, "validHrs": 0.06 }, "ha": { "duration": 341712, "reportedSentences": 1, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 56, "users": 5, "size": 1801500, "checksum": "e1f107a81ce4b422566557c6bf2657b5625a94176544eb245fe03a7f33506771", "avgDurationSecs": 4.352, "validDurationSecs": 97.8, "totalHrs": 0.09, "validHrs": 0.03 }, "he": { "duration": 136440, "reportedSentences": 0, "validatedSentences": 5, "unvalidatedSentences": -5, "clips": 23, "users": 5, "size": 779462, "checksum": "46def9d7f7670c444ab6bf74436a037916fcbaaa219402c0ccafd1c581a70f69", "avgDurationSecs": 4.535, "validDurationSecs": 521.57, "totalHrs": 0.04, "validHrs": 0.15 }, "hi": { "duration": 451692, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 2, "clips": 75, "users": 7, "size": 2638642, "checksum": "6fd2ba711746ce502271a35fdabe0314716c8dd106465ca7395cd331813c6b0e", "avgDurationSecs": 5.01, "validDurationSecs": 121.6, "totalHrs": 0.12, "validHrs": 0.03 }, "hsb": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, 
"unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 1590, "checksum": "0a39241105eb510fa552c3d3a949479726298994a7fecb5f17a44d1cf54f76d9", "avgDurationSecs": 6.245, "validDurationSecs": 87.43, "totalHrs": 0, "validHrs": 0.02 }, "ht": { "duration": 61632, "reportedSentences": 0, "validatedSentences": 4, "unvalidatedSentences": -4, "clips": 10, "users": 1, "size": 364065, "checksum": "438e8bee117965d87dbfedd1c2d7411c79c85b594288b2963e8fb8352098b862", "avgDurationSecs": 5.566, "validDurationSecs": 68.28, "totalHrs": 0.02, "validHrs": 0.02 }, "hu": { "duration": 4194108, "reportedSentences": 41, "validatedSentences": 10, "unvalidatedSentences": 372, "clips": 651, "users": 16, "size": 24621815, "checksum": "418125d0794a0022b2da0be531b101f45566b86b9ac166b1ef16c146eb88bf02", "avgDurationSecs": 5.54, "validDurationSecs": 3172.58, "totalHrs": 1.16, "validHrs": 0.88 }, "hy-AM": { "duration": 19476, "reportedSentences": 0, "validatedSentences": 17, "unvalidatedSentences": -15, "clips": 3, "users": 2, "size": 118875, "checksum": "087eea26efb201b1f81bddb44bb01b23e5d28334c0f2ba1d2d291a3b563d3314", "avgDurationSecs": 5.392, "validDurationSecs": 772.88, "totalHrs": 0.01, "validHrs": 0.22 }, "ia": { "duration": 509544, "reportedSentences": 2, "validatedSentences": 0, "unvalidatedSentences": 34, "clips": 90, "users": 2, "size": 2952902, "checksum": "3fafab7c04c5e83df42efb07703f8a75fd3eb90d3047b88b72f0ba5b59841128", "avgDurationSecs": 4.202, "validDurationSecs": 153.11, "totalHrs": 0.14, "validHrs": 0.05 }, "id": { "duration": 1190592, "reportedSentences": 0, "validatedSentences": 73, "unvalidatedSentences": -51, "clips": 275, "users": 22, "size": 6695307, "checksum": "9168779499ea6d1a14b6d56fefc53e5c5a2e896b2c27cdf18b0f3504d7f1aaf0", "avgDurationSecs": 4.003, "validDurationSecs": 849.9, "totalHrs": 0.33, "validHrs": 0.23 }, "ig": { "duration": 114048, "reportedSentences": 0, "validatedSentences": 5, "unvalidatedSentences": 1, "clips": 19, "users": 2, "size": 682620, 
"checksum": "dde8f480a74c870d3b86c230c01a39d8f6139f47c2d961b51a42d7f2a5d8b443", "avgDurationSecs": 5.464, "validDurationSecs": 5.49, "totalHrs": 0.03, "validHrs": 0 }, "is": { "duration": 65664, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 1, "clips": 10, "users": 1, "size": 388999, "checksum": "8793978aaf1756438f8750434d264297037244a030eef847ca97a96cb951f99c", "avgDurationSecs": 6.994, "validDurationSecs": 3.36, "totalHrs": 0.02, "validHrs": 0 }, "it": { "duration": 16704000, "reportedSentences": 13, "validatedSentences": 137, "unvalidatedSentences": -127, "clips": 2846, "users": 31, "size": 96057158, "checksum": "5b36aeebc1df11b438f22a8801ac10b55db59974db5274dc7c08128696590798", "avgDurationSecs": 5.42, "validDurationSecs": 4962.81, "totalHrs": 4.64, "validHrs": 1.37 }, "ja": { "duration": 141465564, "reportedSentences": 103, "validatedSentences": 1381, "unvalidatedSentences": 45, "clips": 32198, "users": 427, "size": 827565326, "checksum": "66b1b0ce45457c67400b17f8180fbd4863e3a1953cb81af4a259c68773a8f1fa", "avgDurationSecs": 4.542, "validDurationSecs": 64213.67, "totalHrs": 39.3, "validHrs": 17.84 }, "ka": { "duration": 3200328, "reportedSentences": 71, "validatedSentences": 54, "unvalidatedSentences": 558, "clips": 616, "users": 18, "size": 18718303, "checksum": "d58e6c3580ebd7ace4df740151d6307bc3b428b9d1f48ce9cbca3316647ce80d", "avgDurationSecs": 5.199, "validDurationSecs": 1990.05, "totalHrs": 0.89, "validHrs": 0.56 }, "kab": { "duration": 911916, "reportedSentences": 14, "validatedSentences": 28, "unvalidatedSentences": -27, "clips": 202, "users": 4, "size": 5202052, "checksum": "244def9071ad445cf05960e4526fab495db796054265f2b81d70bf49326c5101", "avgDurationSecs": 3.341, "validDurationSecs": 2273.06, "totalHrs": 0.26, "validHrs": 0.63 }, "kk": { "duration": 197064, "reportedSentences": 0, "validatedSentences": 28, "unvalidatedSentences": -28, "clips": 45, "users": 5, "size": 1158464, "checksum": 
"d7aa1514f0220027c273918bf73f52ee483e5490accfd1af8e68e6fd611a7bbb", "avgDurationSecs": 4.921, "validDurationSecs": 78.17, "totalHrs": 0.05, "validHrs": 0.02 }, "kln": { "duration": 21600, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 2, "users": 0, "size": 154371, "checksum": "436f8158ae9b69c4cc6daedcb2d28925fef65236b922cf093d5a50fe89236848", "avgDurationSecs": 4.705, "validDurationSecs": 76.18, "totalHrs": 0, "validHrs": 0.02 }, "kmr": { "duration": 111240, "reportedSentences": 0, "validatedSentences": 1, "unvalidatedSentences": 303, "clips": 20, "users": 4, "size": 674343, "checksum": "86dbe0c5d140a15c87efea3161009cc5bb7865b92e35c8e7f2dd9e531ecfee34", "avgDurationSecs": 4.095, "validDurationSecs": 257.6, "totalHrs": 0.03, "validHrs": 0.07 }, "ko": { "duration": 823500, "reportedSentences": 0, "validatedSentences": 91, "unvalidatedSentences": -91, "clips": 176, "users": 6, "size": 4834165, "checksum": "e6ab1764ec24c9c6e6f5334cbf4151a5c7b043848fa163414a806664baffbd8d", "avgDurationSecs": 5.348, "validDurationSecs": 263.78, "totalHrs": 0.23, "validHrs": 0.07 }, "ky": { "duration": 36576, "reportedSentences": 0, "validatedSentences": 18, "unvalidatedSentences": -17, "clips": 12, "users": 3, "size": 109143, "checksum": "63f5dc543f120470eb721f745392716b240f7d30a2488cf26dac728e5a14c9bf", "avgDurationSecs": 4.553, "validDurationSecs": 185.72, "totalHrs": 0.01, "validHrs": 0.05 }, "lg": { "duration": 59760, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 1, "clips": 10, "users": 1, "size": 372250, "checksum": "28d1bc133862304041206abce842d693a3600222610d4a735da4aadc125821ee", "avgDurationSecs": 5.784, "validDurationSecs": 140.31, "totalHrs": 0.01, "validHrs": 0.04 }, "lij": { "duration": 308772, "reportedSentences": 5, "validatedSentences": 401, "unvalidatedSentences": -396, "clips": 73, "users": 3, "size": 1821781, "checksum": "6d53c9067bace3015326f87def7a9601b9f89993a88b1ce7562ad6cd45707eb7", 
"avgDurationSecs": 3.922, "validDurationSecs": 1050.68, "totalHrs": 0.08, "validHrs": 0.3 }, "lo": { "duration": 72612, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 10, "users": 2, "size": 429813, "checksum": "92f60adc5c3cd095a119122da7794f94a0b03c71db4d098467f73c4696907096", "avgDurationSecs": 6.49, "validDurationSecs": 43.76, "totalHrs": 0.02, "validHrs": 0.01 }, "lt": { "duration": 3172284, "reportedSentences": 1, "validatedSentences": 48, "unvalidatedSentences": 7, "clips": 628, "users": 8, "size": 18724742, "checksum": "f055e91109653fe4f9dd1ef4f31c3f0bea6c13f090fc14616ed573a139e26616", "avgDurationSecs": 5.142, "validDurationSecs": 6202.36, "totalHrs": 0.88, "validHrs": 1.72 }, "ltg": { "duration": 3120804, "reportedSentences": 2, "validatedSentences": 0, "unvalidatedSentences": 11, "clips": 457, "users": 18, "size": 17976668, "checksum": "d2eced2c7b061333eef99433d388ad463833885e6a59af89d4b0767d2dff0905", "avgDurationSecs": 4.645, "validDurationSecs": 6134.85, "totalHrs": 0.86, "validHrs": 1.71 }, "luo": { "duration": 74700, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 15, "users": 1, "size": 440323, "checksum": "98d3aa30ebd196fcc73442a504f86219cacf698e2ab69e1668ed6cae7f46396d", "avgDurationSecs": 5.205, "validDurationSecs": -0.83, "totalHrs": 0.02, "validHrs": 0 }, "lv": { "duration": 16465644, "reportedSentences": 8, "validatedSentences": 366, "unvalidatedSentences": -267, "clips": 2641, "users": 53, "size": 95586696, "checksum": "ce64ede5482162c1055839412b4c2749885541629b57373261ff6ba661fa9692", "avgDurationSecs": 4.785, "validDurationSecs": 11458.53, "totalHrs": 4.57, "validHrs": 3.18 }, "mdf": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -21, "checksum": "a50c266e83e671ac05226d1db4ca2388a8bdabcaa9fbce22df8803369b79eada", "avgDurationSecs": 5.25, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 
}, "mhr": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 44795, "checksum": "1ff9621cb4301aa75f89ccc14740fe6aa2ec1cdfcfa95693d75b407c99e9803c", "avgDurationSecs": 4.623, "validDurationSecs": 27.74, "totalHrs": 0, "validHrs": 0.01 }, "mk": { "duration": 46188828, "reportedSentences": 68, "validatedSentences": 217, "unvalidatedSentences": -178, "clips": 8900, "users": 179, "size": 246577321, "checksum": "b1f223833f1131156d0e5c5bdc94dd689a17db38f5cbfeb2cf75d6c7db207fd2", "avgDurationSecs": 4.806, "validDurationSecs": 25464.81, "totalHrs": 12.83, "validHrs": 7.07 }, "ml": { "duration": 61308, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 11, "users": 3, "size": 336990, "checksum": "276e30b9dfb35451692bc1f21264b5532e6986073f49e850b1d9fc6183ffcacd", "avgDurationSecs": 4.218, "validDurationSecs": 18, "totalHrs": 0.02, "validHrs": 0 }, "mn": { "duration": 108576, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 4, "clips": 13, "users": 8, "size": 650862, "checksum": "5d62ff9dc45773179bc41a1990b57aa5ee5b04be7976411b24edaa0194b6306c", "avgDurationSecs": 5.254, "validDurationSecs": 1758.14, "totalHrs": 0.03, "validHrs": 0.49 }, "mr": { "duration": 113112, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 15, "users": 2, "size": 488487, "checksum": "5d3d8e70a49e616e5da65046a9ed246a0a42caa4f6c4255dc6fa91fe22d424e0", "avgDurationSecs": 6.199, "validDurationSecs": 26.18, "totalHrs": 0.04, "validHrs": 0 }, "mrj": { "duration": 108072, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 15, "users": 0, "size": 668386, "checksum": "170b733d3e33865389fb02615aa5b13d9afd5969d7143d8d6d539c631280a1ac", "avgDurationSecs": 4.193, "validDurationSecs": 234.11, "totalHrs": 0.03, "validHrs": 0.06 }, "mt": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, 
"unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -7707, "checksum": "1e66e25bac431f8b90b9fecaa948e990615ef17f164f954f68eb1eddf64b7451", "avgDurationSecs": 4.754, "validDurationSecs": 33.28, "totalHrs": 0, "validHrs": 0.01 }, "myv": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 3073, "checksum": "05efbcabe6947a53a85fe5dc2afa85d6561dcda01f231f8b16c6595f4d009b53", "avgDurationSecs": 5.774, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "nan-tw": { "duration": 2388420, "reportedSentences": 0, "validatedSentences": 1, "unvalidatedSentences": 1, "clips": 837, "users": 14, "size": 12697322, "checksum": "5e28e258cb9c062cc55c4191f92264597aace19c2cc146adfcd2967e7a3e491f", "avgDurationSecs": 2.643, "validDurationSecs": 4014.67, "totalHrs": 0.66, "validHrs": 1.12 }, "ne-NP": { "duration": 365724, "reportedSentences": 0, "validatedSentences": 11, "unvalidatedSentences": -10, "clips": 76, "users": 4, "size": 2159181, "checksum": "ada7bab888865185bc8724522c5bd513d0f5e2dce943b3e6f124e8562d125191", "avgDurationSecs": 4.052, "validDurationSecs": 1364.43, "totalHrs": 0.1, "validHrs": 0.37 }, "nhi": { "duration": 1629360, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 312, "users": 1, "size": 9593426, "checksum": "1f8a9fd928cfb3ec16dcd55a46fa6033875ab91ce9d1c132cf63152b303824b8", "avgDurationSecs": 5.064, "validDurationSecs": 20.25, "totalHrs": 0.45, "validHrs": 0.01 }, "nl": { "duration": 4193856, "reportedSentences": 5, "validatedSentences": 166, "unvalidatedSentences": 18, "clips": 853, "users": 27, "size": 23605591, "checksum": "8d21a9809f783156e43c6c1815f25a0ff6b637125bc44a144cad81c657a08fbd", "avgDurationSecs": 4.372, "validDurationSecs": 4303.58, "totalHrs": 1.16, "validHrs": 1.19 }, "nn-NO": { "duration": 33372, "reportedSentences": 0, "validatedSentences": 46, "unvalidatedSentences": -46, "clips": 5, "users": 1, "size": 201526, "checksum": 
"1ae55f50522ff0e0f02bcc309f85daa4fe85acc683117f8becf47d2068f8f1cc", "avgDurationSecs": 4.387, "validDurationSecs": 282.16, "totalHrs": 0.01, "validHrs": 0.07 }, "nr": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 5, "checksum": "f6b8794d89584a349ef3ddb991aa53784d551a553982adff8ffca6895d602fac", "avgDurationSecs": 9.738, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "nso": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -7, "checksum": "92078343a6414dc5f7821f9ac27d86f524eebc05ff46a938fc0a472ba92c74b8", "avgDurationSecs": 4.63, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "oc": { "duration": 106452, "reportedSentences": 3, "validatedSentences": 5, "unvalidatedSentences": 4, "clips": 25, "users": 1, "size": 619393, "checksum": "9e6f01a51bdcda57894b22e1ef9b4f98f1ea0a78ef4cd90d8e666a8b7c2f30bd", "avgDurationSecs": 4.871, "validDurationSecs": 35.93, "totalHrs": 0.03, "validHrs": 0.01 }, "or": { "duration": 329832, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 60, "users": 3, "size": 1912012, "checksum": "ff39a23ce6c1fc2f941f04ecc116e98148d2297fb743a16f2956237747b27a43", "avgDurationSecs": 5.57, "validDurationSecs": -0.74, "totalHrs": 0.09, "validHrs": 0 }, "os": { "duration": 373428, "reportedSentences": 0, "validatedSentences": 510, "unvalidatedSentences": -503, "clips": 64, "users": 15, "size": 2214242, "checksum": "90fdbf08ee5d8145f2dae6b0ac33ec2eadb5a2b6847781e0710cd57fae401631", "avgDurationSecs": 5.492, "validDurationSecs": 607.13, "totalHrs": 0.1, "validHrs": 0.17 }, "pa-IN": { "duration": 296892, "reportedSentences": 0, "validatedSentences": 1, "unvalidatedSentences": 1, "clips": 65, "users": 1, "size": 1750278, "checksum": "bc47a95e637e48a2d3e79b1bec84d61dcbb0831bc28ba74d208bee3ab5621d40", "avgDurationSecs": 4.821, "validDurationSecs": 246.24, 
"totalHrs": 0.08, "validHrs": 0.07 }, "pl": { "duration": 8564220, "reportedSentences": 9, "validatedSentences": 65, "unvalidatedSentences": 317, "clips": 1726, "users": 23, "size": 50552385, "checksum": "53d09d99a5579c550f4566384addd607f2a54a06f3047d18aa53d2050a5ab073", "avgDurationSecs": 4.562, "validDurationSecs": 4494.91, "totalHrs": 2.38, "validHrs": 1.25 }, "ps": { "duration": 79911360, "reportedSentences": 133, "validatedSentences": 7599, "unvalidatedSentences": 7880, "clips": 16741, "users": 204, "size": 427889347, "checksum": "0d5c73befca334a128d6a833d31c453e28e9d8a140b2eacb4a2fa6e851840db5", "avgDurationSecs": 4.945, "validDurationSecs": 53901.25, "totalHrs": 22.2, "validHrs": 14.98 }, "pt": { "duration": 3375648, "reportedSentences": 6, "validatedSentences": 108, "unvalidatedSentences": 52, "clips": 601, "users": 52, "size": 18686146, "checksum": "a3d14e8524c8be0028ad6bc1de792db66c1f2d6559fee8187bc68bdf08a421d2", "avgDurationSecs": 4.154, "validDurationSecs": 5107.46, "totalHrs": 0.94, "validHrs": 1.42 }, "quy": { "duration": 11114388, "reportedSentences": 0, "validatedSentences": 4, "unvalidatedSentences": -4, "clips": 1279, "users": 4, "size": 64801583, "checksum": "208c246ef361082bf1aeac25622ae00626c97710aa1a5396568e557563f449aa", "avgDurationSecs": 8.637, "validDurationSecs": 254.78, "totalHrs": 3.08, "validHrs": 0.07 }, "rm-sursilv": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -15719, "checksum": "b8310d3b8fc425cceea4a179c3c89f1a8d71b226a50c738d1a0e13a40dbd6b86", "avgDurationSecs": 5.438, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "rm-vallader": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -4432, "checksum": "6bbb0306405f33ac09614a5c7166f5a314b12b63a660af0df6e08cafa303109f", "avgDurationSecs": 5.826, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ro": { "duration": 
4641228, "reportedSentences": 61, "validatedSentences": 6, "unvalidatedSentences": -4, "clips": 1076, "users": 6, "size": 27272982, "checksum": "e6b80fc9b80a798e908d6715d13475de552b3532c618e581c9e449de1b908260", "avgDurationSecs": 4.037, "validDurationSecs": 4863.54, "totalHrs": 1.29, "validHrs": 1.35 }, "ru": { "duration": 15891876, "reportedSentences": 20, "validatedSentences": 104, "unvalidatedSentences": -96, "clips": 3034, "users": 69, "size": 88670439, "checksum": "279afe356de2bb85e772192ce542dbbca36eb3b84939277ec51bbad6ef143e49", "avgDurationSecs": 5.175, "validDurationSecs": 7259.45, "totalHrs": 4.42, "validHrs": 2.02 }, "rw": { "duration": 180648, "reportedSentences": 0, "validatedSentences": 1, "unvalidatedSentences": -1, "clips": 46, "users": 12, "size": 1173382, "checksum": "b8bb8f23f27e28f96d4bd6e22f4ad93527de145cbc513cf3985e06c823014764", "avgDurationSecs": 5.007, "validDurationSecs": 368.89, "totalHrs": 0.05, "validHrs": 0.1 }, "sah": { "duration": 3191436, "reportedSentences": 2, "validatedSentences": 35, "unvalidatedSentences": -35, "clips": 431, "users": 11, "size": 18830714, "checksum": "f39bf958968c051890b78b59470bad2d5140f737d173963103a685a6f90c34b2", "avgDurationSecs": 6.206, "validDurationSecs": 4035.34, "totalHrs": 0.89, "validHrs": 1.12 }, "sat": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -580, "checksum": "c235188e204db353dfff87e8d5991dcf1616aa4e37e5b239b627f3b95d583aca", "avgDurationSecs": 4.478, "validDurationSecs": 4.48, "totalHrs": 0, "validHrs": 0 }, "sc": { "duration": 2463768, "reportedSentences": 28, "validatedSentences": 42, "unvalidatedSentences": 312, "clips": 496, "users": 2, "size": 14308199, "checksum": "1650a6f2a72cf7a80089423aca6aee77f964bf3f46f83b63597f16840463cd72", "avgDurationSecs": 4.692, "validDurationSecs": 1006.37, "totalHrs": 0.68, "validHrs": 0.28 }, "sd": { "duration": 28349712, "reportedSentences": 0, "validatedSentences": 15, 
"unvalidatedSentences": 19, "clips": 7132, "users": 3, "size": 141204884, "checksum": "f0266a4d37fe85475d89883af48397e8af656becca968813b4b28b873fab21a8", "avgDurationSecs": 4.241, "validDurationSecs": 38.1, "totalHrs": 7.88, "validHrs": 0.01 }, "sk": { "duration": 73381356, "reportedSentences": 141, "validatedSentences": 7215, "unvalidatedSentences": 5915, "clips": 16527, "users": 14, "size": 431804599, "checksum": "428ae17939c30c183145a0603c21a3f81aa4bb7156e45250f313bffe99865522", "avgDurationSecs": 4.225, "validDurationSecs": 82608.2, "totalHrs": 20.39, "validHrs": 22.95 }, "skr": { "duration": 578880, "reportedSentences": 0, "validatedSentences": 5, "unvalidatedSentences": 1, "clips": 110, "users": 3, "size": 3410924, "checksum": "4d06afaf431bd339e79b8631699006cffc1d903aa7e580ca638f4719668d56fa", "avgDurationSecs": 4.164, "validDurationSecs": 157.55, "totalHrs": 0.16, "validHrs": 0.05 }, "sl": { "duration": 5936976, "reportedSentences": 0, "validatedSentences": 31, "unvalidatedSentences": -23, "clips": 1323, "users": 213, "size": 34611519, "checksum": "8c106ae3a59fb839a9bcad389883d48840f6b99cb4c5c787b2bfc81ad7cbe825", "avgDurationSecs": 3.99, "validDurationSecs": 4719.56, "totalHrs": 1.64, "validHrs": 1.31 }, "sq": { "duration": 94284, "reportedSentences": 0, "validatedSentences": 6, "unvalidatedSentences": -6, "clips": 15, "users": 2, "size": 552224, "checksum": "2319a5bbae777309312d3ec54d8a15134e05382856d3dc24355e0547c25a0595", "avgDurationSecs": 5.1, "validDurationSecs": 338.47, "totalHrs": 0.03, "validHrs": 0.09 }, "sr": { "duration": 109980, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 20, "users": 1, "size": 621315, "checksum": "02b0d5ce77aa28cf89fef4172b79917f317c31b953451e31c7d70fd526ad94a0", "avgDurationSecs": 3.235, "validDurationSecs": 156.9, "totalHrs": 0.03, "validHrs": 0.04 }, "st": { "duration": 555372, "reportedSentences": 0, "validatedSentences": 2, "unvalidatedSentences": -2, "clips": 149, "users": 1, 
"size": 3238722, "checksum": "5869752f9d38c81605d9b444b815611825d27edced24bbdf88569a0052d0c2c4", "avgDurationSecs": 3.223, "validDurationSecs": 0, "totalHrs": 0.15, "validHrs": 0 }, "sv-SE": { "duration": 432612, "reportedSentences": 0, "validatedSentences": 286, "unvalidatedSentences": -264, "clips": 81, "users": 8, "size": 2595139, "checksum": "35250e119fedf34fa10ca0a6ca4a0c5b658a47d485cc1ed58429617c83057bf5", "avgDurationSecs": 4.019, "validDurationSecs": 1580.2, "totalHrs": 0.12, "validHrs": 0.44 }, "sw": { "duration": 162252, "reportedSentences": 0, "validatedSentences": 11, "unvalidatedSentences": -11, "clips": 21, "users": 4, "size": 951994, "checksum": "37aca0c8b7ab50ee50ebe54d68981efba8a872c853682103d9517cc96a7de41d", "avgDurationSecs": 5.387, "validDurationSecs": 427.48, "totalHrs": 0.04, "validHrs": 0.12 }, "ta": { "duration": 1309716, "reportedSentences": 0, "validatedSentences": 3, "unvalidatedSentences": -2, "clips": 218, "users": 17, "size": 7662188, "checksum": "a02b0f38d7833a8f78a49501760fb516f474b177ffb08e6fd94346ed04d4471f", "avgDurationSecs": 6.199, "validDurationSecs": 379.99, "totalHrs": 0.36, "validHrs": 0.1 }, "te": { "duration": 100836, "reportedSentences": 0, "validatedSentences": 8, "unvalidatedSentences": -3, "clips": 20, "users": 2, "size": 595602, "checksum": "2c1d69f3ead2dbe3697f0b1e9b3eca247e1c40937a16668376c43b2840d86d50", "avgDurationSecs": 4.186, "validDurationSecs": 110.98, "totalHrs": 0.03, "validHrs": 0.03 }, "th": { "duration": 419400, "reportedSentences": 0, "validatedSentences": 30, "unvalidatedSentences": -29, "clips": 85, "users": 15, "size": 2401079, "checksum": "340f52987151731e4ae80706aac9149f0952ea0535a3b80ad0e92f9abf6af026", "avgDurationSecs": 4.191, "validDurationSecs": 1308.17, "totalHrs": 0.12, "validHrs": 0.36 }, "ti": { "duration": 36144, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 6, "users": 2, "size": 214275, "checksum": 
"140ee609858b28a717db42f81d67adaa9d68e79b2d7052b789fe1f845b41d028", "avgDurationSecs": 5.018, "validDurationSecs": 5.55, "totalHrs": 0.01, "validHrs": 0 }, "tig": { "duration": 25074756, "reportedSentences": 29, "validatedSentences": 6, "unvalidatedSentences": 84, "clips": 3853, "users": 90, "size": 136285038, "checksum": "f737f3ce620fb3d5707545b13d495fe00128461608e3e82710c6fe6f2cf40301", "avgDurationSecs": 6.405, "validDurationSecs": 22264.26, "totalHrs": 6.97, "validHrs": 6.18 }, "tk": { "duration": 1056780, "reportedSentences": 0, "validatedSentences": 38, "unvalidatedSentences": -33, "clips": 142, "users": 4, "size": 6152567, "checksum": "8e859627956e5fc9701484e4e3362ae60ab5c4c4d652eed9d0a8e5c01ee45372", "avgDurationSecs": 5.485, "validDurationSecs": 484.75, "totalHrs": 0.29, "validHrs": 0.13 }, "tn": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -3950, "checksum": "aac60412140ab79e7ee4b353c6ae339bc61c6e5eb699df5f223221a8fe8eb731", "avgDurationSecs": 4.37, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "tok": { "duration": 2243196, "reportedSentences": 3, "validatedSentences": 107, "unvalidatedSentences": -105, "clips": 559, "users": 5, "size": 12935039, "checksum": "2378c93e48dedc499309a4e1649fc4573149ce3251b597d6c84094667ab8ae8b", "avgDurationSecs": 4.167, "validDurationSecs": 775.75, "totalHrs": 0.63, "validHrs": 0.21 }, "tr": { "duration": 6014880, "reportedSentences": 11, "validatedSentences": 10, "unvalidatedSentences": 5, "clips": 1006, "users": 50, "size": 35337606, "checksum": "cd704ec5d6779bf4a66af948ece7686b4654e1976d71db12c5c6e4bc780803d6", "avgDurationSecs": 3.816, "validDurationSecs": 5612.36, "totalHrs": 1.68, "validHrs": 1.56 }, "ts": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -5, "checksum": "c1d517b55d65474acee5a1d3a3fd493d5602e42c12c97af67eb46b3fa180eec5", "avgDurationSecs": 
8.928, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "tt": { "duration": 833544, "reportedSentences": 0, "validatedSentences": 3, "unvalidatedSentences": -3, "clips": 171, "users": 7, "size": 4729799, "checksum": "4f5113bd1fafdb415c40da4ea93fcb3f4e51085dce54ee8c40286ca89006b91a", "avgDurationSecs": 3.831, "validDurationSecs": 1067.71, "totalHrs": 0.23, "validHrs": 0.3 }, "tw": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 1, "clips": 0, "users": 0, "size": 586, "checksum": "b57274d4562efc697530c971b51401d1ad6fed4ba130ce04d9419c3749cb229c", "avgDurationSecs": 4.31, "validDurationSecs": 30.17, "totalHrs": 0, "validHrs": 0.01 }, "ug": { "duration": 232492248, "reportedSentences": 62, "validatedSentences": 58462, "unvalidatedSentences": 8123, "clips": 33667, "users": 109, "size": 1317765208, "checksum": "85281f6fe2a8c118e1984b54b01a046072473518ed447a43083d234f38c01540", "avgDurationSecs": 5.823, "validDurationSecs": 40527.31, "totalHrs": 64.58, "validHrs": 11.25 }, "uk": { "duration": 3616308, "reportedSentences": 1, "validatedSentences": 63, "unvalidatedSentences": -60, "clips": 781, "users": 16, "size": 21311352, "checksum": "93a41db3f4ab24a94281fd51b61641c159f7407403f53dc770d306229523fce6", "avgDurationSecs": 4.642, "validDurationSecs": 4216.35, "totalHrs": 1, "validHrs": 1.17 }, "ur": { "duration": 2791872, "reportedSentences": 75, "validatedSentences": 172, "unvalidatedSentences": 44, "clips": 550, "users": 19, "size": 15759072, "checksum": "39ba21f8531464082da2a029d6baa7e7e329988adb9282e4b435667715fd2b72", "avgDurationSecs": 4.297, "validDurationSecs": 3989.98, "totalHrs": 0.78, "validHrs": 1.11 }, "uz": { "duration": 1038636, "reportedSentences": 0, "validatedSentences": 47, "unvalidatedSentences": -40, "clips": 193, "users": 24, "size": 6162692, "checksum": "f1ceb3a2d412b660c01d7abdcb4fa4c293efded8e41ddb312ae53137f94e0e05", "avgDurationSecs": 4.155, "validDurationSecs": 1053.54, "totalHrs": 0.29, "validHrs": 
0.29 }, "vi": { "duration": 372600, "reportedSentences": 0, "validatedSentences": 46, "unvalidatedSentences": -35, "clips": 95, "users": 11, "size": 2016703, "checksum": "b09847f1162dbfb3cf6fbef16a243838e062be13e3cae2b7f1bc1f59d2929b5f", "avgDurationSecs": 3.949, "validDurationSecs": 255.92, "totalHrs": 0.1, "validHrs": 0.07 }, "vot": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 21, "checksum": "8ccca0da441a208fa77fe1a63abc25d0ba3a315b9234cb9843847856370d63b2", "avgDurationSecs": 2.412, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "xh": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 5, "checksum": "2d64bc7fad61267b9648870b8afa40134862b411f0ee3a4d73ded7f4403602d7", "avgDurationSecs": 6.276, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "yi": { "duration": 342648, "reportedSentences": 0, "validatedSentences": 34, "unvalidatedSentences": -34, "clips": 90, "users": 6, "size": 1871439, "checksum": "3a60ccc5436d2acd7303dea922df1d6bbe68c9ca5e12e4d2e87ccbc38925b192", "avgDurationSecs": 4.134, "validDurationSecs": 258.23, "totalHrs": 0.1, "validHrs": 0.07 }, "yo": { "duration": 738576, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 117, "users": 2, "size": 4370086, "checksum": "308d0ad0b81d06f926a39d9cccba25e28257a4e9bcba3d0f23d87ff166943799", "avgDurationSecs": 5.99, "validDurationSecs": 1146.35, "totalHrs": 0.2, "validHrs": 0.32 }, "yue": { "duration": 7420104, "reportedSentences": 12, "validatedSentences": 291, "unvalidatedSentences": -276, "clips": 1178, "users": 7, "size": 43669893, "checksum": "7d8e4a36868f2936ac66a1c9ca30fc478f48a51af9e3dda9f7e9333e7a21817d", "avgDurationSecs": 3.967, "validDurationSecs": 2668.27, "totalHrs": 2.06, "validHrs": 0.74 }, "zgh": { "duration": 138312, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 1, 
"clips": 37, "users": 2, "size": 808641, "checksum": "c361c41b3866629ce01fdd0bb8428fc364a2acd5f039db75069ab25fd6408fcc", "avgDurationSecs": 3.428, "validDurationSecs": 38.43, "totalHrs": 0.03, "validHrs": 0.01 }, "zh-CN": { "duration": 4010760, "reportedSentences": 30, "validatedSentences": 52, "unvalidatedSentences": -34, "clips": 1019, "users": 85, "size": 22435509, "checksum": "abfd6abdb9151ca9f778c811289b5128d57df16e4f8c38c8e4089e3027740e09", "avgDurationSecs": 4.543, "validDurationSecs": 2334.35, "totalHrs": 1.12, "validHrs": 0.65 }, "zh-HK": { "duration": 2161296, "reportedSentences": 1, "validatedSentences": 51, "unvalidatedSentences": 49, "clips": 641, "users": 14, "size": 12476171, "checksum": "fc7f490da1ccd5c61f06cea2f642364769e0f89798ef5a3c77141b71e7e32ee8", "avgDurationSecs": 4.161, "validDurationSecs": 746.08, "totalHrs": 0.6, "validHrs": 0.21 }, "zh-TW": { "duration": 3320316, "reportedSentences": 7, "validatedSentences": 48, "unvalidatedSentences": 5, "clips": 1010, "users": 28, "size": 18318970, "checksum": "530a5ebdc4b59ac2050ca5e34048b332ed1bc4c0dc8756589599aeffaf150484", "avgDurationSecs": 3.363, "validDurationSecs": 1480.21, "totalHrs": 0.92, "validHrs": 0.42 }, "zu": { "duration": 140976, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 69, "users": 1, "size": 818358, "checksum": "9bff362c407572a92d10e2c8a0dac54f3056281e9f19264bd783084db30a678d", "avgDurationSecs": 3.29, "validDurationSecs": -14.69, "totalHrs": 0.04, "validHrs": -0.01 }, "zza": { "duration": 55872, "reportedSentences": 0, "validatedSentences": 8, "unvalidatedSentences": 1, "clips": 14, "users": 1, "size": 328454, "checksum": "55c62144221e77c80de50f1a11ae7956f4e3d4510669bfcf59127ec5784aee66", "avgDurationSecs": 4.103, "validDurationSecs": 171.25, "totalHrs": 0.02, "validHrs": 0.05 } }, "totalDuration": 1887175728, "totalValidDurationSecs": 1315574, "totalHrs": 524, "totalValidHrs": 366 } ================================================ FILE: 
datasets/scripted-speech/cv-corpus-22.0-2025-06-20.json ================================================ { "locales": { "ab": { "buckets": { "dev": 9152, "test": 9132, "train": 21037, "validated": 42159, "invalidated": 5290, "other": 16738 }, "duration": 332347536, "reportedSentences": 225, "validatedSentences": 1046779, "unvalidatedSentences": 5, "clips": 64187, "splits": { "accent": {}, "age": { "": 0.2, "twenties": 0.17, "thirties": 0.17, "teens": 0.26, "fourties": 0.08, "fifties": 0.06, "sixties": 0.05, "seventies": 0.01, "eighties": 0.01, "nineties": 0 }, "gender": { "": 0.2, "male_masculine": 0.17, "female_feminine": 0.64, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 64187, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 421, "size": 1974557460, "checksum": "13d4f651aa803c93dd03c300826032c3ee91bc5030f1a480b8f4be3183e1018c", "avgDurationSecs": 5.178, "validDurationSecs": 218290.928, "totalHrs": 92.31, "validHrs": 60.63 }, "af": { "buckets": { "dev": 125, "test": 131, "train": 139, "validated": 395, "invalidated": 198, "other": 306 }, "duration": 5331492, "reportedSentences": 73, "validatedSentences": 4779, "unvalidatedSentences": 16587, "clips": 899, "splits": { "accent": {}, "age": { "": 0.35, "twenties": 0.33, "thirties": 0, "teens": 0.01, "fourties": 0.15, "fifties": 0.16, "sixties": 0, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.5, "male_masculine": 0.35, "female_feminine": 0.16, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 899, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, 
"media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 62, "size": 32493660, "checksum": "cec6c7776b5dc27e0ff70ddce2ca87e41406d7c3f1834d1420d9653fa1e16163", "avgDurationSecs": 5.93, "validDurationSecs": 2342.535, "totalHrs": 1.48, "validHrs": 0.65 }, "am": { "buckets": { "dev": 248, "test": 252, "train": 523, "validated": 1023, "invalidated": 29, "other": 579 }, "duration": 10270872, "reportedSentences": 11, "validatedSentences": 2208, "unvalidatedSentences": 12, "clips": 1631, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0.14, "thirties": 0.61, "teens": 0, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.25, "male_masculine": 0.75, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1631, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 45, "size": 60530441, "checksum": "f8ff75b265a870c48af974fa9ac830d4f5420342b059b51f801d524199741889", "avgDurationSecs": 6.297, "validDurationSecs": 6442.123, "totalHrs": 2.85, "validHrs": 1.78 }, "ar": { "buckets": { "dev": 10503, "test": 10500, "train": 28531, "validated": 79117, "invalidated": 15202, "other": 41364 }, "duration": 564577110, "reportedSentences": 2134, "validatedSentences": 59434, "unvalidatedSentences": 1742106, "clips": 135683, "splits": { "accent": {}, "age": { "": 0.55, "twenties": 0.29, "thirties": 0.11, "teens": 0.03, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.55, "male_masculine": 0.28, "female_feminine": 0.17, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 135683, 
"agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1632, "size": 3475769391, "checksum": "17cd613f88d19fce8f8f171fff5b78b67d491b617906163e876b6caecccdeb8c", "avgDurationSecs": 4.161, "validDurationSecs": 329205.923, "totalHrs": 156.82, "validHrs": 91.44 }, "as": { "buckets": { "dev": 485, "test": 379, "train": 952, "validated": 1816, "invalidated": 282, "other": 2558 }, "duration": 27454637, "reportedSentences": 247, "validatedSentences": 7146, "unvalidatedSentences": 3355, "clips": 4656, "splits": { "accent": {}, "age": { "": 0.28, "twenties": 0.16, "thirties": 0.02, "teens": 0, "fourties": 0.52, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.82, "male_masculine": 0.18, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 4650, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 6, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 6 } }, "users": 51, "size": 166969829, "checksum": "74c3cb9660ff9868bb52d084dc38a1e6ec91d1ee1cffa88ebf61686e59172ade", "avgDurationSecs": 5.897, "validDurationSecs": 10708.252, "totalHrs": 7.62, "validHrs": 2.97 }, "ast": { "buckets": { "dev": 113, "test": 203, "train": 434, "validated": 758, "invalidated": 31, "other": 890 }, "duration": 7444944, "reportedSentences": 0, "validatedSentences": 1642, "unvalidatedSentences": 290, "clips": 1679, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.1, "thirties": 0.12, "teens": 0.04, "fourties": 0.28, "fifties": 0.21, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, 
"gender": { "": 0.26, "male_masculine": 0.6, "female_feminine": 0.15, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1679, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 30, "size": 42480836, "checksum": "06a7bfbc078e9cedc0b9c7ee665aa85d13b8dfac65616faa4d281bd59808626f", "avgDurationSecs": 4.434, "validDurationSecs": 3361.088, "totalHrs": 2.06, "validHrs": 0.93 }, "az": { "buckets": { "dev": 78, "test": 95, "train": 157, "validated": 330, "invalidated": 50, "other": 529 }, "duration": 4927428, "reportedSentences": 5, "validatedSentences": 93161, "unvalidatedSentences": 1850, "clips": 909, "splits": { "accent": {}, "age": { "": 0.2, "twenties": 0.14, "thirties": 0.65, "teens": 0.01, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.82, "male_masculine": 0.18, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 909, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 39, "size": 35074269, "checksum": "f10464cf2fb94db9b835d8ccfcfa56bdd2bf8fb09d422b6ff68d9ce9accc4461", "avgDurationSecs": 5.421, "validDurationSecs": 1788.835, "totalHrs": 1.36, "validHrs": 0.49 }, "ba": { "buckets": { "dev": 14580, "test": 14555, "train": 119088, "validated": 210259, "invalidated": 8007, "other": 190 }, "duration": 967108860, "reportedSentences": 868, "validatedSentences": 153968, "unvalidatedSentences": 5, "clips": 218456, "splits": { "accent": {}, "age": { 
"": 0.3, "twenties": 0.17, "thirties": 0.17, "teens": 0.04, "fourties": 0.06, "fifties": 0.05, "sixties": 0.19, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male_masculine": 0.3, "female_feminine": 0.39, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 218456, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 928, "size": 5449655505, "checksum": "40ab8fd7e126ec97d34fa472ec5ce8b9b2e20de97314debf6a2290c1441fda4e", "avgDurationSecs": 4.427, "validDurationSecs": 930820.585, "totalHrs": 268.64, "validHrs": 258.56 }, "bas": { "buckets": { "dev": 1328, "test": 1547, "train": 2109, "validated": 11134, "invalidated": 1217, "other": 108 }, "duration": 48894552, "reportedSentences": 8, "validatedSentences": 5226, "unvalidatedSentences": 105, "clips": 12459, "splits": { "accent": {}, "age": { "": 0.39, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0.61, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 12455, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 4, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 53, "size": 253447020, "checksum": "f21925f1da52acaf420cbf5c24ae6fbf3d07d9dc61aa71ecc1b467b1bf31172b", "avgDurationSecs": 3.924, "validDurationSecs": 43694.674, "totalHrs": 13.58, "validHrs": 12.13 }, "be": { "buckets": { "dev": 15879, "test": 15880, "train": 347672, "validated": 1357522, "invalidated": 36842, 
"other": 17002 }, "duration": 6759210924, "reportedSentences": 3197, "validatedSentences": 379541, "unvalidatedSentences": 1938, "clips": 1411366, "splits": { "accent": {}, "age": { "": 0.87, "twenties": 0.03, "thirties": 0.04, "teens": 0.01, "fourties": 0.04, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.87, "male_masculine": 0.06, "female_feminine": 0.07, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1411296, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 3, "general": 33, "healthcare": 0, "history_law_government": 16, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 4, "news_current_affairs": 7, "technology_robotics": 7 } }, "users": 8576, "size": 38494901481, "checksum": "7f7a43b2b2c047f301b8b7d3bac6fd3bd0c8eff92c5e47a3d09de3939ebf9d9b", "avgDurationSecs": 4.789, "validDurationSecs": 6501345.173, "totalHrs": 1877.55, "validHrs": 1805.92 }, "bg": { "buckets": { "dev": 2932, "test": 3354, "train": 4952, "validated": 11255, "invalidated": 773, "other": 1787 }, "duration": 75593484, "reportedSentences": 191, "validatedSentences": 39572, "unvalidatedSentences": 3, "clips": 13815, "splits": { "accent": {}, "age": { "": 0.37, "twenties": 0.11, "thirties": 0.28, "teens": 0, "fourties": 0.23, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male_masculine": 0.58, "female_feminine": 0.04, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 13815, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 144, "size": 445465721, "checksum": "6bd641e34c84e9074e4580f85d427a587e2377f3f464a0866dcd88e89572e4f1", 
"avgDurationSecs": 5.472, "validDurationSecs": 61585.571, "totalHrs": 20.99, "validHrs": 17.1 }, "bn": { "buckets": { "dev": 9382, "test": 9382, "train": 21514, "validated": 44560, "invalidated": 7903, "other": 999246 }, "duration": 4596066180, "reportedSentences": 1890, "validatedSentences": 562826, "unvalidatedSentences": 94116, "clips": 1051709, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0.67, "thirties": 0.03, "teens": 0.07, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.23, "male_masculine": 0.54, "female_feminine": 0.23, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1051709, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 22976, "size": 26559796958, "checksum": "c873aff20fad4c95da1d39c475c6f678e362d1e585b4b90dbbf21b91d8ae02f2", "avgDurationSecs": 4.37, "validDurationSecs": 194731.346, "totalHrs": 1276.68, "validHrs": 54.09 }, "br": { "buckets": { "dev": 2823, "test": 2865, "train": 3510, "validated": 31451, "invalidated": 2317, "other": 43 }, "duration": 109015063, "reportedSentences": 301, "validatedSentences": 22182, "unvalidatedSentences": 92, "clips": 33811, "splits": { "accent": {}, "age": { "": 0.27, "twenties": 0.3, "thirties": 0.1, "teens": 0.01, "fourties": 0.06, "fifties": 0.08, "sixties": 0.12, "seventies": 0.06, "eighties": 0, "nineties": 0 }, "gender": { "": 0.31, "male_masculine": 0.63, "female_feminine": 0.06, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 31633, "agriculture_food": 1, "automotive_transport": 0, "finance": 1, "service_retail": 4, "general": 2145, "healthcare": 9, "history_law_government": 7, "language_fundamentals": 0, 
"media_entertainment": 5, "nature_environment": 6, "news_current_affairs": 0, "technology_robotics": 3 } }, "users": 233, "size": 747582103, "checksum": "dbb53180dfd4433e16136627248a1b89fa234d183a81162a316a5d718ab6730f", "avgDurationSecs": 3.224, "validDurationSecs": 101405.837, "totalHrs": 30.28, "validHrs": 28.16 }, "ca": { "buckets": { "dev": 16414, "test": 16414, "train": 1208213, "validated": 2270597, "invalidated": 140375, "other": 223305 }, "duration": 13729860558, "reportedSentences": 7797, "validatedSentences": 1302361, "unvalidatedSentences": 3140, "clips": 2634277, "splits": { "accent": {}, "age": { "": 0.27, "twenties": 0.04, "thirties": 0.05, "teens": 0.01, "fourties": 0.12, "fifties": 0.19, "sixties": 0.29, "seventies": 0.03, "eighties": 0, "nineties": 0 }, "gender": { "": 0.28, "male_masculine": 0.52, "female_feminine": 0.2, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0, "intersex": 0 }, "sentence_domain": { "": 2633234, "agriculture_food": 41, "automotive_transport": 26, "finance": 4, "service_retail": 62, "general": 611, "healthcare": 34, "history_law_government": 64, "language_fundamentals": 29, "media_entertainment": 32, "nature_environment": 113, "news_current_affairs": 32, "technology_robotics": 20 } }, "users": 36740, "size": 82555798249, "checksum": "74c793966e00e656a2d2b2194c8cb02260815b53f5830ce5b0e7046a2c8dc69e", "avgDurationSecs": 5.212, "validDurationSecs": 11834359.178, "totalHrs": 3813.85, "validHrs": 3287.32 }, "ckb": { "buckets": { "dev": 5345, "test": 5344, "train": 7878, "validated": 119569, "invalidated": 9253, "other": 39286 }, "duration": 688258908, "reportedSentences": 2527, "validatedSentences": 19056, "unvalidatedSentences": 15418, "clips": 168108, "splits": { "accent": {}, "age": { "": 0.32, "twenties": 0.43, "thirties": 0.1, "teens": 0.02, "fourties": 0.02, "fifties": 0.01, "sixties": 0.1, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.32, "male_masculine": 0.63, "female_feminine": 0.05, 
"transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 167876, "agriculture_food": 0, "automotive_transport": 10, "finance": 0, "service_retail": 0, "general": 114, "healthcare": 0, "history_law_government": 10, "language_fundamentals": 10, "media_entertainment": 88, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1938, "size": 3803792664, "checksum": "e8c4e65fbaced436603ad6291453ac8641f2f0f74758c5e06b10f66b23e82d66", "avgDurationSecs": 4.094, "validDurationSecs": 489533.094, "totalHrs": 191.18, "validHrs": 135.98 }, "cnh": { "buckets": { "dev": 763, "test": 765, "train": 818, "validated": 2463, "invalidated": 437, "other": 3287 }, "duration": 21751656, "reportedSentences": 9, "validatedSentences": 5218, "unvalidatedSentences": -1, "clips": 6187, "splits": { "accent": {}, "age": { "": 0.48, "twenties": 0.4, "thirties": 0.07, "teens": 0.01, "fourties": 0.01, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.48, "male_masculine": 0.3, "female_feminine": 0.21, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6187, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 300, "size": 168181932, "checksum": "1edff706113c0b0443eae2aac81a7098ff0261397b7415e9542c3793e4493a0c", "avgDurationSecs": 3.516, "validDurationSecs": 8659.177, "totalHrs": 6.04, "validHrs": 2.4 }, "cs": { "buckets": { "dev": 9410, "test": 9421, "train": 21731, "validated": 64565, "invalidated": 2556, "other": 149113 }, "duration": 964742727, "reportedSentences": 969, "validatedSentences": 451100, "unvalidatedSentences": 258, "clips": 216234, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.19, "thirties": 
0.47, "teens": 0.02, "fourties": 0.07, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.25, "male_masculine": 0.54, "female_feminine": 0.22, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 216212, "agriculture_food": 0, "automotive_transport": 1, "finance": 0, "service_retail": 0, "general": 15, "healthcare": 0, "history_law_government": 1, "language_fundamentals": 0, "media_entertainment": 4, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 7 } }, "users": 1105, "size": 5936664423, "checksum": "8efa762eb69b351481220321e1bdd8d459f7c1f339c67c8f239ff09f63734701", "avgDurationSecs": 4.462, "validDurationSecs": 288061.148, "totalHrs": 267.98, "validHrs": 80.01 }, "cv": { "buckets": { "dev": 1242, "test": 1288, "train": 1456, "validated": 17497, "invalidated": 2215, "other": 744 }, "duration": 103130544, "reportedSentences": 144, "validatedSentences": 4382, "unvalidatedSentences": 4, "clips": 20456, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.5, "thirties": 0.01, "teens": 0.18, "fourties": 0.06, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.22, "male_masculine": 0.51, "female_feminine": 0.27, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 20455, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 1 } }, "users": 122, "size": 697770198, "checksum": "73a8ede271c067bc0fa9e5b8fa6f120b806525d8ba1b4ea28b44077e125c0f46", "avgDurationSecs": 5.042, "validDurationSecs": 88212.511, "totalHrs": 28.64, "validHrs": 24.5 }, "cy": { "buckets": { "dev": 5408, "test": 5408, "train": 8014, "validated": 90776, "invalidated": 4463, "other": 20676 }, 
"duration": 569722221, "reportedSentences": 176, "validatedSentences": 118044, "unvalidatedSentences": 14, "clips": 115915, "splits": { "accent": {}, "age": { "": 0.43, "twenties": 0.13, "thirties": 0.09, "teens": 0.02, "fourties": 0.16, "fifties": 0.09, "sixties": 0.07, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.43, "male_masculine": 0.33, "female_feminine": 0.25, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 115915, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1916, "size": 4155292390, "checksum": "3430fe213780804f53a36417ecf15fbf3675ad825088666cd83dd6d8f5dca665", "avgDurationSecs": 4.915, "validDurationSecs": 446164.037, "totalHrs": 158.25, "validHrs": 123.93 }, "da": { "buckets": { "dev": 2630, "test": 2758, "train": 3602, "validated": 11312, "invalidated": 438, "other": 2215 }, "duration": 56748708, "reportedSentences": 685, "validatedSentences": 20779, "unvalidatedSentences": 302, "clips": 13965, "splits": { "accent": {}, "age": { "": 0.39, "twenties": 0.16, "thirties": 0.23, "teens": 0.01, "fourties": 0.17, "fifties": 0.04, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.46, "male_masculine": 0.48, "female_feminine": 0.06, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 13942, "agriculture_food": 1, "automotive_transport": 0, "finance": 0, "service_retail": 1, "general": 13, "healthcare": 1, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 8, "nature_environment": 2, "news_current_affairs": 0, "technology_robotics": 1 } }, "users": 293, "size": 329093591, "checksum": "5e54de7822335ebcf96fe83ee4c02afacbd9874c99929c24d9a0bed81c94d9c8", 
"avgDurationSecs": 4.064, "validDurationSecs": 45967.876, "totalHrs": 15.76, "validHrs": 12.76 }, "dav": { "buckets": { "dev": 1275, "test": 1004, "train": 2097, "validated": 8260, "invalidated": 663, "other": 40707 }, "duration": 201431016, "reportedSentences": 11, "validatedSentences": 11773, "unvalidatedSentences": 20119, "clips": 49630, "splits": { "accent": {}, "age": { "": 0.04, "twenties": 0.61, "thirties": 0.1, "teens": 0, "fourties": 0.01, "fifties": 0.01, "sixties": 0.23, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.73, "male_masculine": 0, "female_feminine": 0.27, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 971, "agriculture_food": 499, "automotive_transport": 100, "finance": 304, "service_retail": 1218, "general": 45323, "healthcare": 967, "history_law_government": 334, "language_fundamentals": 150, "media_entertainment": 68, "nature_environment": 873, "news_current_affairs": 86, "technology_robotics": 75 } }, "users": 24, "size": 1084115337, "checksum": "74bb24434e03a92b2346c5f99c6dbe751cbde56928dc278fe2dc41906daf98fa", "avgDurationSecs": 4.059, "validDurationSecs": 33524.485, "totalHrs": 55.95, "validHrs": 9.31 }, "de": { "buckets": { "dev": 16197, "test": 16197, "train": 607871, "validated": 936146, "invalidated": 56237, "other": 17260 }, "duration": 5311921841, "reportedSentences": 11305, "validatedSentences": 2057917, "unvalidatedSentences": 1331, "clips": 1009643, "splits": { "accent": {}, "age": { "": 0.33, "twenties": 0.18, "thirties": 0.16, "teens": 0.03, "fourties": 0.17, "fifties": 0.11, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.34, "male_masculine": 0.58, "female_feminine": 0.08, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1009446, "agriculture_food": 0, "automotive_transport": 6, "finance": 0, "service_retail": 12, "general": 138, "healthcare": 15, "history_law_government": 14, 
"language_fundamentals": 3, "media_entertainment": 1, "nature_environment": 12, "news_current_affairs": 4, "technology_robotics": 9 } }, "users": 20266, "size": 36913535185, "checksum": "f8aead37e27b76b781bcd050c78d61190467564420def922946d0436f62b6526", "avgDurationSecs": 5.261, "validDurationSecs": 4925240.292, "totalHrs": 1475.53, "validHrs": 1368.12 }, "dv": { "buckets": { "dev": 2243, "test": 2222, "train": 2654, "validated": 26486, "invalidated": 1652, "other": 15104 }, "duration": 221431995, "reportedSentences": 59, "validatedSentences": 7174, "unvalidatedSentences": 9, "clips": 43242, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.11, "thirties": 0.39, "teens": 0.01, "fourties": 0.23, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.24, "male_masculine": 0.24, "female_feminine": 0.52, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 43241, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 1, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 388, "size": 1408214479, "checksum": "90b9f4168d2443b2ba5c5c6782c7c4388227dd556f6188518d9e713f10da397c", "avgDurationSecs": 5.121, "validDurationSecs": 135628.505, "totalHrs": 61.5, "validHrs": 37.67 }, "dyu": { "buckets": { "dev": 50, "test": 63, "train": 88, "validated": 209, "invalidated": 14, "other": 72 }, "duration": 1861236, "reportedSentences": 0, "validatedSentences": 5028, "unvalidatedSentences": 41, "clips": 295, "splits": { "accent": {}, "age": { "": 0.31, "twenties": 0.66, "thirties": 0, "teens": 0.02, "fourties": 0.02, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.31, "male_masculine": 0.39, "female_feminine": 0.3, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, 
"sentence_domain": { "": 295, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 35, "size": 10793370, "checksum": "0a72e30231529f9cc68dd52c2f2c90ea8c44b39a9b59a448bc5c0344ac4410d0", "avgDurationSecs": 6.309, "validDurationSecs": 1318.638, "totalHrs": 0.51, "validHrs": 0.36 }, "el": { "buckets": { "dev": 1694, "test": 1711, "train": 1934, "validated": 16761, "invalidated": 864, "other": 10351 }, "duration": 116123218, "reportedSentences": 72, "validatedSentences": 6856, "unvalidatedSentences": 4502, "clips": 27976, "splits": { "accent": {}, "age": { "": 0.31, "twenties": 0.12, "thirties": 0.36, "teens": 0.01, "fourties": 0.17, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.34, "male_masculine": 0.63, "female_feminine": 0.03, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 27976, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 445, "size": 771648305, "checksum": "782d4886e6db4e27eeae375a1f98773471aeb82c2a8c63b6fc421a772c9a2eb6", "avgDurationSecs": 4.151, "validDurationSecs": 69571.821, "totalHrs": 32.25, "validHrs": 19.32 }, "en": { "buckets": { "dev": 16401, "test": 16401, "train": 1138760, "validated": 1855619, "invalidated": 306854, "other": 370737 }, "duration": 13382611387, "reportedSentences": 9575, "validatedSentences": 1680014, "unvalidatedSentences": 35403, "clips": 2533210, "splits": { "accent": {}, "age": { "": 0.36, "twenties": 0.25, "thirties": 0.14, "teens": 0.06, "fourties": 0.09, "fifties": 
0.05, "sixties": 0.04, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.39, "male_masculine": 0.44, "female_feminine": 0.18, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2532173, "agriculture_food": 147, "automotive_transport": 7, "finance": 42, "service_retail": 28, "general": 551, "healthcare": 21, "history_law_government": 69, "language_fundamentals": 9, "media_entertainment": 97, "nature_environment": 41, "news_current_affairs": 10, "technology_robotics": 87 } }, "users": 97925, "size": 92908028734, "checksum": "075ca496a0adc0b16bac6da3f77b92893a2b79ea524db0e8442f7dde11a87009", "avgDurationSecs": 5.283, "validDurationSecs": 9802988.287, "totalHrs": 3717.39, "validHrs": 2723.05 }, "eo": { "buckets": { "dev": 14899, "test": 14920, "train": 144331, "validated": 851888, "invalidated": 127495, "other": 157492 }, "duration": 6901347679, "reportedSentences": 2362, "validatedSentences": 180764, "unvalidatedSentences": 22260, "clips": 1136875, "splits": { "accent": {}, "age": { "": 0.2, "twenties": 0.56, "thirties": 0.12, "teens": 0.05, "fourties": 0.04, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0.01, "nineties": 0 }, "gender": { "": 0.2, "male_masculine": 0.68, "female_feminine": 0.11, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1136872, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 3, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1849, "size": 41247589724, "checksum": "3b58a9dd888e072f93452b224f064ea92182ee021c076a4efb0edb932459f3a6", "avgDurationSecs": 6.07, "validDurationSecs": 5171347.133, "totalHrs": 1917.04, "validHrs": 1436.48 }, "es": { "buckets": { "dev": 15893, "test": 15893, "train": 353701, "validated": 432774, "invalidated": 94323, "other": 
1142321 }, "duration": 8143477110, "reportedSentences": 2632, "validatedSentences": 1081921, "unvalidatedSentences": 5260, "clips": 1669418, "splits": { "accent": {}, "age": { "": 0.13, "twenties": 0.53, "thirties": 0.09, "teens": 0.08, "fourties": 0.03, "fifties": 0.04, "sixties": 0.1, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.13, "male_masculine": 0.55, "female_feminine": 0.31, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1669299, "agriculture_food": 1, "automotive_transport": 4, "finance": 4, "service_retail": 2, "general": 37, "healthcare": 4, "history_law_government": 28, "language_fundamentals": 7, "media_entertainment": 6, "nature_environment": 11, "news_current_affairs": 18, "technology_robotics": 21 } }, "users": 26738, "size": 51337759400, "checksum": "6460828a35d4f7056db26153a624644c7f346e91e478d667ccc0752ab187cca6", "avgDurationSecs": 4.878, "validDurationSecs": 2111086.117, "totalHrs": 2262.07, "validHrs": 586.41 }, "et": { "buckets": { "dev": 2823, "test": 2823, "train": 3402, "validated": 26253, "invalidated": 7766, "other": 107 }, "duration": 230185118, "reportedSentences": 566, "validatedSentences": 10910, "unvalidatedSentences": 855, "clips": 34126, "splits": { "accent": {}, "age": { "": 0.18, "twenties": 0.7, "thirties": 0.08, "teens": 0, "fourties": 0.03, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.22, "male_masculine": 0.52, "female_feminine": 0.26, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 33558, "agriculture_food": 28, "automotive_transport": 27, "finance": 28, "service_retail": 6, "general": 362, "healthcare": 15, "history_law_government": 25, "language_fundamentals": 10, "media_entertainment": 22, "nature_environment": 79, "news_current_affairs": 10, "technology_robotics": 25 } }, "users": 1002, "size": 1540697509, "checksum": 
"8523341b5dd4c3df5bdd5e8eacf0c9937c3ea1a08dfa9b68fc78c716cbb7f99d", "avgDurationSecs": 6.745, "validDurationSecs": 177080.522, "totalHrs": 63.94, "validHrs": 49.18 }, "eu": { "buckets": { "dev": 14753, "test": 14753, "train": 130043, "validated": 299986, "invalidated": 47850, "other": 115454 }, "duration": 2549345907, "reportedSentences": 1272, "validatedSentences": 1018097, "unvalidatedSentences": 7926, "clips": 463290, "splits": { "accent": {}, "age": { "": 0.45, "twenties": 0.17, "thirties": 0.06, "teens": 0.02, "fourties": 0.16, "fifties": 0.11, "sixties": 0.04, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.47, "male_masculine": 0.25, "female_feminine": 0.29, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0, "intersex": 0 }, "sentence_domain": { "": 443661, "agriculture_food": 6, "automotive_transport": 6, "finance": 4, "service_retail": 7, "general": 19560, "healthcare": 22, "history_law_government": 0, "language_fundamentals": 2, "media_entertainment": 6, "nature_environment": 6, "news_current_affairs": 0, "technology_robotics": 23 } }, "users": 11027, "size": 15632143990, "checksum": "b19c78b8fcd2237b1fc9e6ffc3b3cc357ac003c50081435f4ef7e0141f4796a7", "avgDurationSecs": 5.503, "validDurationSecs": 1650732.978, "totalHrs": 708.15, "validHrs": 458.53 }, "fa": { "buckets": { "dev": 10676, "test": 10676, "train": 29789, "validated": 338056, "invalidated": 15232, "other": 34503 }, "duration": 1535700104, "reportedSentences": 2736, "validatedSentences": 57484, "unvalidatedSentences": 283199, "clips": 387791, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.33, "thirties": 0.35, "teens": 0.03, "fourties": 0.02, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.23, "male_masculine": 0.7, "female_feminine": 0.06, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 387767, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, 
"service_retail": 0, "general": 24, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 4593, "size": 11093241698, "checksum": "883ae1240c336f9a9c727fd2ddf5f540fb9e1733d78942b89b644ee9e6fd292a", "avgDurationSecs": 3.96, "validDurationSecs": 1338743.381, "totalHrs": 426.58, "validHrs": 371.87 }, "fi": { "buckets": { "dev": 1767, "test": 1806, "train": 2093, "validated": 11924, "invalidated": 418, "other": 5078 }, "duration": 80540067, "reportedSentences": 78, "validatedSentences": 5848, "unvalidatedSentences": 49166, "clips": 17420, "splits": { "accent": {}, "age": { "": 0.32, "twenties": 0.16, "thirties": 0.17, "teens": 0.01, "fourties": 0.31, "fifties": 0.03, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0.36, "female_feminine": 0.31, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 17420, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 240, "size": 481454747, "checksum": "d329700a4e0d4a03e754b835b29e4ec0aaf45d180f72ce069b214c2295937fe1", "avgDurationSecs": 4.623, "validDurationSecs": 55129.722, "totalHrs": 22.37, "validHrs": 15.31 }, "fr": { "buckets": { "dev": 16186, "test": 16186, "train": 593066, "validated": 762418, "invalidated": 67073, "other": 18829 }, "duration": 4270378078, "reportedSentences": 7532, "validatedSentences": 1648822, "unvalidatedSentences": 43898, "clips": 848320, "splits": { "accent": {}, "age": { "": 0.38, "twenties": 0.17, "thirties": 0.15, "teens": 0.03, "fourties": 0.13, "fifties": 0.1, "sixties": 0.03, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { 
"": 0.33, "male_masculine": 0.57, "female_feminine": 0.11, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 848217, "agriculture_food": 0, "automotive_transport": 1, "finance": 1, "service_retail": 0, "general": 66, "healthcare": 5, "history_law_government": 19, "language_fundamentals": 7, "media_entertainment": 16, "nature_environment": 8, "news_current_affairs": 2, "technology_robotics": 18 } }, "users": 20709, "size": 29882902416, "checksum": "0b31c9b6d8e47f92d7c96bcaf3c8b2be652acc475993a5a73f86021b7309150c", "avgDurationSecs": 5.034, "validDurationSecs": 3837953.972, "totalHrs": 1186.21, "validHrs": 1066.09 }, "fy-NL": { "buckets": { "dev": 3172, "test": 3173, "train": 3924, "validated": 51648, "invalidated": 4021, "other": 102210 }, "duration": 766328211, "reportedSentences": 543, "validatedSentences": 10351, "unvalidatedSentences": 9, "clips": 157879, "splits": { "accent": {}, "age": { "": 0.56, "twenties": 0.04, "thirties": 0.1, "teens": 0, "fourties": 0.06, "fifties": 0.1, "sixties": 0.12, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.57, "male_masculine": 0.13, "female_feminine": 0.31, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 157879, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 2060, "size": 4594946611, "checksum": "5c3ee054f8bd6732fb948ec8171212fece144a0a9fcce08e2325b52daa0a7ab0", "avgDurationSecs": 4.854, "validDurationSecs": 250694.009, "totalHrs": 212.86, "validHrs": 69.63 }, "ga-IE": { "buckets": { "dev": 526, "test": 527, "train": 546, "validated": 8534, "invalidated": 1083, "other": 2747 }, "duration": 44210043, "reportedSentences": 33, "validatedSentences": 1675, "unvalidatedSentences": 1345, "clips": 
12364, "splits": { "accent": {}, "age": { "": 0.33, "twenties": 0.36, "thirties": 0.2, "teens": 0.01, "fourties": 0.05, "fifties": 0.04, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.34, "male_masculine": 0.41, "female_feminine": 0.25, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 12331, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 36, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 2, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 211, "size": 293316404, "checksum": "e1b6b3fd44a5bbb76e15399629d75ea30aab300ca32b80fbfb1129b82dd04374", "avgDurationSecs": 3.576, "validDurationSecs": 30515.085, "totalHrs": 12.28, "validHrs": 8.47 }, "gl": { "buckets": { "dev": 13443, "test": 13443, "train": 70039, "validated": 98430, "invalidated": 10122, "other": 153848 }, "duration": 1312366248, "reportedSentences": 768, "validatedSentences": 696448, "unvalidatedSentences": 4510, "clips": 262400, "splits": { "accent": {}, "age": { "": 0.55, "twenties": 0.16, "thirties": 0.07, "teens": 0.01, "fourties": 0.09, "fifties": 0.1, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.65, "male_masculine": 0.13, "female_feminine": 0.22, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 261976, "agriculture_food": 33, "automotive_transport": 4, "finance": 40, "service_retail": 16, "general": 246, "healthcare": 3, "history_law_government": 52, "language_fundamentals": 18, "media_entertainment": 39, "nature_environment": 15, "news_current_affairs": 14, "technology_robotics": 4 } }, "users": 6229, "size": 7600189023, "checksum": "f04aebb37d72290957d6173a2df20cbc6a8dbdeec93321f7493f25de40c77545", "avgDurationSecs": 5.001, "validDurationSecs": 492287.385, "totalHrs": 364.54, "validHrs": 136.74 }, "gn": { "buckets": { 
"dev": 361, "test": 960, "train": 1641, "validated": 3160, "invalidated": 178, "other": 20070 }, "duration": 106981920, "reportedSentences": 36, "validatedSentences": 5894, "unvalidatedSentences": 298, "clips": 23408, "splits": { "accent": {}, "age": { "": 0.15, "twenties": 0.56, "thirties": 0.18, "teens": 0.02, "fourties": 0.07, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.16, "male_masculine": 0.37, "female_feminine": 0.47, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 23408, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 255, "size": 613399687, "checksum": "3a9c8f2e4591a6eaaa9b6e6f84a37f02cf8feda1a6241eacbc7152132910e139", "avgDurationSecs": 4.57, "validDurationSecs": 14442.194, "totalHrs": 29.71, "validHrs": 4.01 }, "ha": { "buckets": { "dev": 623, "test": 750, "train": 1908, "validated": 3441, "invalidated": 174, "other": 6668 }, "duration": 44786268, "reportedSentences": 34, "validatedSentences": 5448, "unvalidatedSentences": 241, "clips": 10283, "splits": { "accent": {}, "age": { "": 0.18, "twenties": 0.13, "thirties": 0.66, "teens": 0, "fourties": 0, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.17, "male_masculine": 0.58, "female_feminine": 0.24, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 10283, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 57, "size": 262151011, "checksum": 
"f38d16d790de83f735db828614661d5bb63d559a2d997a3c721c39e79d5f677d", "avgDurationSecs": 4.355, "validDurationSecs": 14986.828, "totalHrs": 12.44, "validHrs": 4.16 }, "he": { "buckets": { "dev": 672, "test": 392, "train": 1011, "validated": 2568, "invalidated": 291, "other": 2473 }, "duration": 24334668, "reportedSentences": 33, "validatedSentences": 4104, "unvalidatedSentences": 6911, "clips": 5332, "splits": { "accent": {}, "age": { "": 0.07, "twenties": 0.72, "thirties": 0.15, "teens": 0, "fourties": 0.02, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0.03, "nineties": 0 }, "gender": { "": 0.08, "male_masculine": 0.91, "female_feminine": 0.02, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5332, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 54, "size": 141631110, "checksum": "0292672d5c1dcd4ad4783f2b5d5b69c3e0effe2ae24daa18e1019e4b93c631eb", "avgDurationSecs": 4.564, "validDurationSecs": 11720.073, "totalHrs": 6.75, "validHrs": 3.25 }, "hi": { "buckets": { "dev": 2700, "test": 3343, "train": 4869, "validated": 10991, "invalidated": 889, "other": 4449 }, "duration": 81896496, "reportedSentences": 146, "validatedSentences": 32204, "unvalidatedSentences": 9963, "clips": 16329, "splits": { "accent": {}, "age": { "": 0.33, "twenties": 0.32, "thirties": 0.24, "teens": 0.01, "fourties": 0.07, "fifties": 0.02, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male_masculine": 0.58, "female_feminine": 0.03, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 16329, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, 
"language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 461, "size": 486046342, "checksum": "85885a33e07d006a55a4f3c5920c907fb821d59759527ecfaa0f36ccaf31ec6f", "avgDurationSecs": 5.015, "validDurationSecs": 55124.281, "totalHrs": 22.74, "validHrs": 15.31 }, "hsb": { "buckets": { "dev": 172, "test": 479, "train": 809, "validated": 1460, "invalidated": 247, "other": 216 }, "duration": 12046572, "reportedSentences": 130, "validatedSentences": 7177, "unvalidatedSentences": 30, "clips": 1923, "splits": { "accent": {}, "age": { "": 0.18, "twenties": 0.1, "thirties": 0.09, "teens": 0.07, "fourties": 0.48, "fifties": 0.03, "sixties": 0.02, "seventies": 0.03, "eighties": 0, "nineties": 0 }, "gender": { "": 0.22, "male_masculine": 0.77, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1923, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 26, "size": 91261964, "checksum": "25c5dcc2c728a0ad904d73a65b851fe19893dccafd97d97d6d16b19ee522d4b3", "avgDurationSecs": 6.264, "validDurationSecs": 9146.123, "totalHrs": 3.34, "validHrs": 2.54 }, "ht": { "buckets": { "dev": 0, "test": 5, "train": 11, "validated": 17, "invalidated": 0, "other": 13 }, "duration": 166968, "reportedSentences": 0, "validatedSentences": 76, "unvalidatedSentences": 1629, "clips": 30, "splits": { "accent": {}, "age": { "": 0.67, "twenties": 0, "thirties": 0.33, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 30, 
"agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 3, "size": 1091305, "checksum": "83e08f8823780debaa601ecee7b10abd12d60ca6dd187f30dd6497fe6a11c024", "avgDurationSecs": 5.566, "validDurationSecs": 94.615, "totalHrs": 0.04, "validHrs": 0.02 }, "hu": { "buckets": { "dev": 11604, "test": 11659, "train": 39270, "validated": 62999, "invalidated": 3324, "other": 50475 }, "duration": 646987213, "reportedSentences": 554, "validatedSentences": 326221, "unvalidatedSentences": 398, "clips": 116798, "splits": { "accent": {}, "age": { "": 0.28, "twenties": 0.11, "thirties": 0.13, "teens": 0.02, "fourties": 0.11, "fifties": 0.32, "sixties": 0.01, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male_masculine": 0.24, "female_feminine": 0.47, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 116787, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 7, "healthcare": 0, "history_law_government": 1, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 4 } }, "users": 1663, "size": 3805231580, "checksum": "ed96c84f3ca0be1ab700afdd5cff3b3acf9ebbfd5831187a041c45959a289bc3", "avgDurationSecs": 5.539, "validDurationSecs": 348974.704, "totalHrs": 179.71, "validHrs": 96.93 }, "hy-AM": { "buckets": { "dev": 5859, "test": 5823, "train": 9303, "validated": 21199, "invalidated": 1220, "other": 15157 }, "duration": 202606236, "reportedSentences": 183, "validatedSentences": 237419, "unvalidatedSentences": 1338, "clips": 37576, "splits": { "accent": {}, "age": { "": 0.14, "twenties": 0.63, "thirties": 0.08, "teens": 0.11, "fourties": 0.04, "fifties": 0.01, "sixties": 0, 
"seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2, "male_masculine": 0.25, "female_feminine": 0.55, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 37574, "agriculture_food": 0, "automotive_transport": 0, "finance": 1, "service_retail": 0, "general": 1, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 438, "size": 1191878793, "checksum": "7c79542e55bab99548a0539aca15ad5e4085b784334071f5ffad53d0cecbadea", "avgDurationSecs": 5.392, "validDurationSecs": 114303.002, "totalHrs": 56.27, "validHrs": 31.75 }, "ia": { "buckets": { "dev": 1865, "test": 1862, "train": 4909, "validated": 12061, "invalidated": 351, "other": 2348 }, "duration": 62016204, "reportedSentences": 279, "validatedSentences": 9069, "unvalidatedSentences": 75, "clips": 14760, "splits": { "accent": {}, "age": { "": 0.38, "twenties": 0.05, "thirties": 0.02, "teens": 0, "fourties": 0.29, "fifties": 0.03, "sixties": 0, "seventies": 0.22, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male_masculine": 0.61, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 14760, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 72, "size": 420127933, "checksum": "6db05f61adc6f5d93e10e39e693984afd24a33a3e7ecc0c260866507211ea964", "avgDurationSecs": 4.202, "validDurationSecs": 50675.978, "totalHrs": 17.22, "validHrs": 14.07 }, "id": { "buckets": { "dev": 3467, "test": 3690, "train": 4973, "validated": 30098, "invalidated": 2787, "other": 26585 }, "duration": 238048644, "reportedSentences": 362, "validatedSentences": 13318, 
"unvalidatedSentences": 11859, "clips": 59470, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.41, "thirties": 0.1, "teens": 0.22, "fourties": 0.02, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male_masculine": 0.44, "female_feminine": 0.26, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0, "intersex": 0 }, "sentence_domain": { "": 59470, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 639, "size": 1496679235, "checksum": "8fcb715da58a4b12d33d77dca472c0eddf754d94f0ed7b7bb571bc9328bb81df", "avgDurationSecs": 4.003, "validDurationSecs": 120477.351, "totalHrs": 66.12, "validHrs": 33.46 }, "ig": { "buckets": { "dev": 3, "test": 5, "train": 9, "validated": 17, "invalidated": 3, "other": 5784 }, "duration": 31711896, "reportedSentences": 16, "validatedSentences": 14459, "unvalidatedSentences": 3858, "clips": 5804, "splits": { "accent": {}, "age": { "": 0.56, "twenties": 0.32, "thirties": 0.04, "teens": 0.05, "fourties": 0, "fifties": 0, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.56, "male_masculine": 0.14, "female_feminine": 0.3, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5804, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 120, "size": 186451514, "checksum": "613290cd15397a26fd42e9fb58e512c768b111bf869fe145286eb9426829b3d4", "avgDurationSecs": 5.464, "validDurationSecs": 92.885, "totalHrs": 8.8, "validHrs": 0.02 }, "is": { "buckets": { 
"dev": 9, "test": 9, "train": 17, "validated": 35, "invalidated": 5, "other": 25 }, "duration": 465984, "reportedSentences": 1, "validatedSentences": 4993, "unvalidatedSentences": 203, "clips": 65, "splits": { "accent": {}, "age": { "": 0.54, "twenties": 0, "thirties": 0.38, "teens": 0, "fourties": 0.08, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.77, "male_masculine": 0.23, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 65, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 6, "size": 3060293, "checksum": "3d901d13ef269ec6f2c96413eda338f1f4c5cdd093b94ae3499b9440decc9ec6", "avgDurationSecs": 7.169, "validDurationSecs": 250.914, "totalHrs": 0.12, "validHrs": 0.06 }, "it": { "buckets": { "dev": 15179, "test": 15177, "train": 172828, "validated": 239810, "invalidated": 20632, "other": 17384 }, "duration": 1507133292, "reportedSentences": 5983, "validatedSentences": 924883, "unvalidatedSentences": 1112, "clips": 277826, "splits": { "accent": {}, "age": { "": 0.27, "twenties": 0.2, "thirties": 0.16, "teens": 0.01, "fourties": 0.12, "fifties": 0.14, "sixties": 0.09, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.28, "male_masculine": 0.62, "female_feminine": 0.1, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 277811, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 1, "general": 11, "healthcare": 1, "history_law_government": 5, "language_fundamentals": 3, "media_entertainment": 1, "nature_environment": 3, "news_current_affairs": 1, "technology_robotics": 0 } }, "users": 7287, "size": 10254010104, "checksum": 
"15314a3e90501d28db8f63432332a07d61e0fd6594b7ff65582f62c6548d5dd4", "avgDurationSecs": 5.425, "validDurationSecs": 1300906.448, "totalHrs": 418.64, "validHrs": 361.36 }, "ja": { "buckets": { "dev": 8004, "test": 8004, "train": 15425, "validated": 177864, "invalidated": 31567, "other": 263670 }, "duration": 2148883125, "reportedSentences": 783, "validatedSentences": 37288, "unvalidatedSentences": 4389, "clips": 473101, "splits": { "accent": {}, "age": { "": 0.15, "twenties": 0.55, "thirties": 0.08, "teens": 0.08, "fourties": 0.09, "fifties": 0.04, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.28, "male_masculine": 0.33, "female_feminine": 0.39, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.01 }, "sentence_domain": { "": 436896, "agriculture_food": 5729, "automotive_transport": 1225, "finance": 1397, "service_retail": 385, "general": 16560, "healthcare": 1341, "history_law_government": 3214, "language_fundamentals": 2721, "media_entertainment": 4065, "nature_environment": 1749, "news_current_affairs": 724, "technology_robotics": 571 } }, "users": 5530, "size": 12649186598, "checksum": "3571b4d00f1ee0a4f77d35140dd70cdf8599669d26cc42290bf33a376ff4d839", "avgDurationSecs": 4.542, "validDurationSecs": 807880.237, "totalHrs": 596.91, "validHrs": 224.41 }, "ka": { "buckets": { "dev": 12952, "test": 13104, "train": 62537, "validated": 116586, "invalidated": 6632, "other": 97093 }, "duration": 1148972004, "reportedSentences": 1097, "validatedSentences": 132553, "unvalidatedSentences": 1807, "clips": 220311, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.2, "thirties": 0.34, "teens": 0.11, "fourties": 0.07, "fifties": 0.04, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.29, "male_masculine": 0.17, "female_feminine": 0.54, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 219791, "agriculture_food": 8, "automotive_transport": 9, "finance": 4, 
"service_retail": 7, "general": 239, "healthcare": 8, "history_law_government": 189, "language_fundamentals": 31, "media_entertainment": 22, "nature_environment": 16, "news_current_affairs": 10, "technology_robotics": 3 } }, "users": 2700, "size": 6719242923, "checksum": "577b3c16eff42d54201b5386946176da97eb393f5c9fc5bab0b52b772b68e651", "avgDurationSecs": 5.215, "validDurationSecs": 608022.523, "totalHrs": 319.15, "validHrs": 168.89 }, "kab": { "buckets": { "dev": 15002, "test": 15003, "train": 152479, "validated": 614502, "invalidated": 20042, "other": 112446 }, "duration": 2496062104, "reportedSentences": 9740, "validatedSentences": 182794, "unvalidatedSentences": 205832, "clips": 746990, "splits": { "accent": {}, "age": { "": 0.28, "twenties": 0.12, "thirties": 0.29, "teens": 0, "fourties": 0.09, "fifties": 0.18, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male_masculine": 0.53, "female_feminine": 0.2, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 746990, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1588, "size": 18638653109, "checksum": "d7083bef2fd342b36610935c50c1d9956f874f84d16d5189c59495d58d92bdb3", "avgDurationSecs": 3.341, "validDurationSecs": 2053354.335, "totalHrs": 693.35, "validHrs": 570.37 }, "kk": { "buckets": { "dev": 513, "test": 536, "train": 605, "validated": 1661, "invalidated": 247, "other": 730 }, "duration": 13010112, "reportedSentences": 42, "validatedSentences": 6204, "unvalidatedSentences": 266, "clips": 2638, "splits": { "accent": {}, "age": { "": 0.54, "twenties": 0.23, "thirties": 0.04, "teens": 0.06, "fourties": 0.08, "fifties": 0.05, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { 
"": 0.56, "male_masculine": 0.32, "female_feminine": 0.11, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2638, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 182, "size": 74446768, "checksum": "b3fba646b58c2f915d138197ad953ba84dee7892d2a4dbbca91d425ee0f5cfbb", "avgDurationSecs": 4.932, "validDurationSecs": 8191.735, "totalHrs": 3.61, "validHrs": 2.27 }, "kln": { "buckets": { "dev": 6412, "test": 6075, "train": 11064, "validated": 32311, "invalidated": 537, "other": 37352 }, "duration": 330293556, "reportedSentences": 0, "validatedSentences": 29900, "unvalidatedSentences": 61, "clips": 70200, "splits": { "accent": {}, "age": { "": 0.02, "twenties": 0.59, "thirties": 0.05, "teens": 0.05, "fourties": 0.29, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.27, "male_masculine": 0.03, "female_feminine": 0.7, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 4, "agriculture_food": 5, "automotive_transport": 16, "finance": 32, "service_retail": 626, "general": 68952, "healthcare": 54, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 2, "nature_environment": 23, "news_current_affairs": 2, "technology_robotics": 484 } }, "users": 41, "size": 1800908311, "checksum": "d0cb5b8854dcdce628d3f41c74c213c36123037aeb4c800151bb3aa146457f31", "avgDurationSecs": 4.705, "validDurationSecs": 152024.431, "totalHrs": 91.74, "validHrs": 42.22 }, "kmr": { "buckets": { "dev": 3999, "test": 3991, "train": 5277, "validated": 61612, "invalidated": 2582, "other": 25416 }, "duration": 367096068, "reportedSentences": 1058, "validatedSentences": 16807, "unvalidatedSentences": 1263, "clips": 89610, "splits": { 
"accent": {}, "age": { "": 0.48, "twenties": 0.35, "thirties": 0.09, "teens": 0.01, "fourties": 0.03, "fifties": 0.04, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.48, "male_masculine": 0.42, "female_feminine": 0.09, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 89600, "agriculture_food": 1, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 5, "healthcare": 0, "history_law_government": 2, "language_fundamentals": 1, "media_entertainment": 2, "nature_environment": 3, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 589, "size": 1948012626, "checksum": "947d5faba8675fa7a016f4b04cb143dd724e0132dfbc51645f3953c57a7843e3", "avgDurationSecs": 4.097, "validDurationSecs": 252399.542, "totalHrs": 101.97, "validHrs": 70.11 }, "ko": { "buckets": { "dev": 474, "test": 472, "train": 519, "validated": 1466, "invalidated": 399, "other": 3813 }, "duration": 30334428, "reportedSentences": 29, "validatedSentences": 7322, "unvalidatedSentences": 2701, "clips": 5678, "splits": { "accent": {}, "age": { "": 0.13, "twenties": 0.52, "thirties": 0.24, "teens": 0.03, "fourties": 0.05, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.35, "male_masculine": 0.39, "female_feminine": 0.25, "transgender": 0, "non-binary": 0.01, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5675, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 1, "general": 0, "healthcare": 0, "history_law_government": 1, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 1 } }, "users": 154, "size": 177355549, "checksum": "8bf375a20aebe7f01df9026eba32a064e98b5355b64265f44d2048fdc9415d10", "avgDurationSecs": 5.342, "validDurationSecs": 7832.031, "totalHrs": 8.42, "validHrs": 2.17 }, "ky": { "buckets": { "dev": 1614, "test": 1615, "train": 1790, 
"validated": 30670, "invalidated": 5677, "other": 1494 }, "duration": 172292988, "reportedSentences": 39, "validatedSentences": 5045, "unvalidatedSentences": 242, "clips": 37841, "splits": { "accent": {}, "age": { "": 0.07, "twenties": 0.65, "thirties": 0.09, "teens": 0.17, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.12, "male_masculine": 0.54, "female_feminine": 0.34, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 37841, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 297, "size": 1108069420, "checksum": "14d360f456cba2b8e63a06a009a6bcd240a25696b3b594cf488d903b1668cecd", "avgDurationSecs": 4.553, "validDurationSecs": 139642.873, "totalHrs": 47.85, "validHrs": 38.78 }, "lg": { "buckets": { "dev": 13352, "test": 13388, "train": 71087, "validated": 271879, "invalidated": 39182, "other": 37692 }, "duration": 2017197063, "reportedSentences": 6485, "validatedSentences": 191409, "unvalidatedSentences": 174, "clips": 348753, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.38, "thirties": 0.22, "teens": 0.01, "fourties": 0.07, "fifties": 0.05, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male_masculine": 0.36, "female_feminine": 0.39, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 348753, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 664, "size": 11841919354, "checksum": 
"cc58a4e4df241679e743fe625cf433259fb25f2d8b4ce026112808389addd0af", "avgDurationSecs": 5.784, "validDurationSecs": 1572555.706, "totalHrs": 560.33, "validHrs": 436.82 }, "lij": { "buckets": { "dev": 943, "test": 1358, "train": 2304, "validated": 4686, "invalidated": 585, "other": 0 }, "duration": 20676960, "reportedSentences": 378, "validatedSentences": 6079, "unvalidatedSentences": 50, "clips": 5271, "splits": { "accent": {}, "age": { "": 0.19, "twenties": 0.07, "thirties": 0.45, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0.17, "seventies": 0.11, "eighties": 0, "nineties": 0 }, "gender": { "": 0.27, "male_masculine": 0.45, "female_feminine": 0.28, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 4890, "agriculture_food": 14, "automotive_transport": 1, "finance": 0, "service_retail": 0, "general": 366, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 1, "nature_environment": 0, "news_current_affairs": 1, "technology_robotics": 0 } }, "users": 39, "size": 115157438, "checksum": "0384ccbcd83b731fbe136125a03620942b69ab39183f0540663f9fa0571d4439", "avgDurationSecs": 3.923, "validDurationSecs": 18382.135, "totalHrs": 5.74, "validHrs": 5.1 }, "lo": { "buckets": { "dev": 28, "test": 26, "train": 98, "validated": 152, "invalidated": 10, "other": 61 }, "duration": 1456560, "reportedSentences": 0, "validatedSentences": 5145, "unvalidatedSentences": 5925, "clips": 223, "splits": { "accent": {}, "age": { "": 0.11, "twenties": 0.33, "thirties": 0.56, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.11, "male_masculine": 0.89, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 223, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, 
"media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 16, "size": 9346357, "checksum": "a878f9b617683c7df588fbd3b907381e511ad30b5a69156d5dc43290299faeaa", "avgDurationSecs": 6.532, "validDurationSecs": 992.812, "totalHrs": 0.4, "validHrs": 0.27 }, "lt": { "buckets": { "dev": 5111, "test": 5384, "train": 8299, "validated": 18996, "invalidated": 887, "other": 2736 }, "duration": 116345636, "reportedSentences": 192, "validatedSentences": 129304, "unvalidatedSentences": 4814, "clips": 22619, "splits": { "accent": {}, "age": { "": 0.17, "twenties": 0.28, "thirties": 0.25, "teens": 0.03, "fourties": 0.22, "fifties": 0.04, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male_masculine": 0.45, "female_feminine": 0.18, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 22619, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 322, "size": 713240096, "checksum": "2c5046fa9d1ddab6732e84acfee3c4b9ee5239a0042e806858b90572b40bb63d", "avgDurationSecs": 5.144, "validDurationSecs": 97709.965, "totalHrs": 32.31, "validHrs": 27.14 }, "ltg": { "buckets": { "dev": 3506, "test": 3531, "train": 4596, "validated": 22220, "invalidated": 1857, "other": 545 }, "duration": 117025056, "reportedSentences": 74, "validatedSentences": 14996, "unvalidatedSentences": 1822, "clips": 24622, "splits": { "accent": {}, "age": { "": 0.34, "twenties": 0.09, "thirties": 0.23, "teens": 0, "fourties": 0.33, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.34, "male_masculine": 0.05, "female_feminine": 0.61, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { 
"": 19189, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 5432, "healthcare": 1, "history_law_government": 616, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 373, "size": 674801571, "checksum": "b8cec3cadb4f219b53ef8655dd17125c3ee510b78c34cb3d126cc6a93e5efcf4", "avgDurationSecs": 4.753, "validDurationSecs": 105608.673, "totalHrs": 32.5, "validHrs": 29.33 }, "luo": { "buckets": { "dev": 3060, "test": 3000, "train": 4498, "validated": 20342, "invalidated": 792, "other": 61594 }, "duration": 430582284, "reportedSentences": 8, "validatedSentences": 14692, "unvalidatedSentences": 13870, "clips": 82728, "splits": { "accent": {}, "age": { "": 0.36, "twenties": 0.33, "thirties": 0.28, "teens": 0, "fourties": 0, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.6, "male_masculine": 0, "female_feminine": 0.4, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 11, "agriculture_food": 1135, "automotive_transport": 341, "finance": 967, "service_retail": 2748, "general": 69480, "healthcare": 2236, "history_law_government": 864, "language_fundamentals": 555, "media_entertainment": 806, "nature_environment": 634, "news_current_affairs": 2710, "technology_robotics": 241 } }, "users": 46, "size": 2391970692, "checksum": "9b0037747957bae0ead68e57a96fd6b8aaeceb2fd25cd4da87e5a802da463b01", "avgDurationSecs": 5.205, "validDurationSecs": 105875.941, "totalHrs": 119.6, "validHrs": 29.4 }, "lv": { "buckets": { "dev": 7705, "test": 7705, "train": 14354, "validated": 196618, "invalidated": 11455, "other": 21115 }, "duration": 1098957941, "reportedSentences": 3143, "validatedSentences": 34333, "unvalidatedSentences": 65167, "clips": 229188, "splits": { "accent": {}, "age": { "": 0.44, "twenties": 0.06, "thirties": 0.29, "teens": 0.01, "fourties": 0.09, "fifties": 
0.08, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.45, "male_masculine": 0.25, "female_feminine": 0.3, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 228382, "agriculture_food": 8, "automotive_transport": 3, "finance": 3, "service_retail": 15, "general": 672, "healthcare": 33, "history_law_government": 8, "language_fundamentals": 9, "media_entertainment": 16, "nature_environment": 39, "news_current_affairs": 5, "technology_robotics": 12 } }, "users": 6430, "size": 6156962992, "checksum": "bff89d8119a3b3eee21502e550db87b6b10d5bb1293067036cf770e29671ceb3", "avgDurationSecs": 4.795, "validDurationSecs": 942784.581, "totalHrs": 305.26, "validHrs": 261.88 }, "mdf": { "buckets": { "dev": 54, "test": 107, "train": 175, "validated": 336, "invalidated": 9, "other": 0 }, "duration": 1811340, "reportedSentences": 15, "validatedSentences": 6039, "unvalidatedSentences": 4, "clips": 345, "splits": { "accent": {}, "age": { "": 0.58, "twenties": 0.02, "thirties": 0, "teens": 0, "fourties": 0.34, "fifties": 0, "sixties": 0.06, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.58, "male_masculine": 0.08, "female_feminine": 0.34, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 345, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 11, "size": 11056973, "checksum": "b958bc1ae355adf5a87d3739e9e75e5dcdd60df9cbcdb474ddc08880362eb86f", "avgDurationSecs": 5.25, "validDurationSecs": 1764.088, "totalHrs": 0.5, "validHrs": 0.49 }, "mhr": { "buckets": { "dev": 14663, "test": 15135, "train": 186565, "validated": 218727, "invalidated": 16025, "other": 19 }, "duration": 1085385744, "reportedSentences": 102, "validatedSentences": 
328269, "unvalidatedSentences": 3, "clips": 234771, "splits": { "accent": {}, "age": { "": 0.12, "twenties": 0.33, "thirties": 0.21, "teens": 0.14, "fourties": 0.12, "fifties": 0.04, "sixties": 0.03, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.12, "male_masculine": 0.19, "female_feminine": 0.69, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 234771, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 499, "size": 6124161612, "checksum": "af5b260aee7cd25f038f7ebe16f39ffad3b8f5292f386cbc4314d8af04b4b8eb", "avgDurationSecs": 4.623, "validDurationSecs": 1011211.639, "totalHrs": 301.49, "validHrs": 280.89 }, "mk": { "buckets": { "dev": 1776, "test": 1754, "train": 2049, "validated": 16111, "invalidated": 921, "other": 23864 }, "duration": 196670412, "reportedSentences": 161, "validatedSentences": 6014, "unvalidatedSentences": 679, "clips": 40896, "splits": { "accent": {}, "age": { "": 0.1, "twenties": 0.36, "thirties": 0.27, "teens": 0.03, "fourties": 0.2, "fifties": 0.02, "sixties": 0.03, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0.28, "female_feminine": 0.39, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 40632, "agriculture_food": 14, "automotive_transport": 0, "finance": 7, "service_retail": 0, "general": 210, "healthcare": 26, "history_law_government": 7, "language_fundamentals": 0, "media_entertainment": 7, "nature_environment": 21, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 462, "size": 1079250767, "checksum": "afcb173cc7505b5cbcf2864811c2cf13565d8dcff70598426048298aa3e91d6b", "avgDurationSecs": 4.809, "validDurationSecs": 77478.409, "totalHrs": 54.63, 
"validHrs": 21.52 }, "ml": { "buckets": { "dev": 926, "test": 873, "train": 1235, "validated": 3440, "invalidated": 119, "other": 5968 }, "duration": 40198932, "reportedSentences": 243, "validatedSentences": 5485, "unvalidatedSentences": 4728, "clips": 9527, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0.59, "thirties": 0.08, "teens": 0, "fourties": 0.11, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0.48, "female_feminine": 0.19, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 9527, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 149, "size": 227897904, "checksum": "eaab49423798bb65a3f0a2907d07240cf5eb119853dbf02dc0a6d2afe3394577", "avgDurationSecs": 4.219, "validDurationSecs": 14514.992, "totalHrs": 11.16, "validHrs": 4.03 }, "mn": { "buckets": { "dev": 1932, "test": 1933, "train": 2193, "validated": 33641, "invalidated": 3232, "other": 59420 }, "duration": 505948580, "reportedSentences": 46, "validatedSentences": 6092, "unvalidatedSentences": 17, "clips": 96293, "splits": { "accent": {}, "age": { "": 0.04, "twenties": 0.92, "thirties": 0.03, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0.01 }, "gender": { "": 0.46, "male_masculine": 0.14, "female_feminine": 0.4, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 96246, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 31, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 16, "technology_robotics": 0 } }, "users": 601, "size": 
3077682997, "checksum": "136fccf3f29b6743dc4ffb0f41dc15e3dd8f055f2dcfa05408fc7a3815b20dcf", "avgDurationSecs": 5.254, "validDurationSecs": 176758.603, "totalHrs": 140.54, "validHrs": 49.09 }, "mr": { "buckets": { "dev": 1766, "test": 1796, "train": 2189, "validated": 10992, "invalidated": 2289, "other": 2796 }, "duration": 99674352, "reportedSentences": 59, "validatedSentences": 6715, "unvalidatedSentences": 12104, "clips": 16077, "splits": { "accent": {}, "age": { "": 0.06, "twenties": 0.28, "thirties": 0.13, "teens": 0.53, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.06, "male_masculine": 0.19, "female_feminine": 0.75, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 16077, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 102, "size": 585970557, "checksum": "4f9737ee3858f422e59a798f14d15e1174b9ba4859f180787f53b6339a9a751e", "avgDurationSecs": 6.2, "validDurationSecs": 68148.316, "totalHrs": 27.68, "validHrs": 18.93 }, "mrj": { "buckets": { "dev": 7314, "test": 7164, "train": 14325, "validated": 28973, "invalidated": 1261, "other": 1459 }, "duration": 132879852, "reportedSentences": 50, "validatedSentences": 54143, "unvalidatedSentences": 4, "clips": 31693, "splits": { "accent": {}, "age": { "": 0.1, "twenties": 0.35, "thirties": 0.22, "teens": 0.06, "fourties": 0.12, "fifties": 0.11, "sixties": 0.04, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.13, "male_masculine": 0.25, "female_feminine": 0.62, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 31693, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, 
"history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 60, "size": 747248369, "checksum": "db3d01e832885e76b692473ec4fc7ba6b289244e66d3d7e6ecaa1f8745456b03", "avgDurationSecs": 4.193, "validDurationSecs": 121475.656, "totalHrs": 36.91, "validHrs": 33.74 }, "mt": { "buckets": { "dev": 1625, "test": 1660, "train": 1910, "validated": 6593, "invalidated": 326, "other": 6288 }, "duration": 62787780, "reportedSentences": 10, "validatedSentences": 5273, "unvalidatedSentences": 11773, "clips": 13207, "splits": { "accent": {}, "age": { "": 0.27, "twenties": 0.19, "thirties": 0.09, "teens": 0.03, "fourties": 0.17, "fifties": 0.25, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.28, "male_masculine": 0.25, "female_feminine": 0.47, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 13207, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 222, "size": 466479210, "checksum": "1fb863d9418f3429f11dc907aeccdbb01a942fdb6bae87c532b71726b35ad831", "avgDurationSecs": 4.754, "validDurationSecs": 31343.972, "totalHrs": 17.44, "validHrs": 8.7 }, "myv": { "buckets": { "dev": 239, "test": 481, "train": 1241, "validated": 1971, "invalidated": 24, "other": 0 }, "duration": 11519388, "reportedSentences": 31, "validatedSentences": 5066, "unvalidatedSentences": 7, "clips": 1995, "splits": { "accent": {}, "age": { "": 0.38, "twenties": 0.09, "thirties": 0.24, "teens": 0.01, "fourties": 0.02, "fifties": 0, "sixties": 0.26, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male_masculine": 0.54, "female_feminine": 0.07, "transgender": 0, "non-binary": 0, 
"do_not_wish_to_say": 0 }, "sentence_domain": { "": 1995, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 13, "size": 67699874, "checksum": "7d0f7a2e52d42026ab3782deac261a1a9e2bc7eab7eab930e9c89ffa3718eb56", "avgDurationSecs": 5.774, "validDurationSecs": 11380.809, "totalHrs": 3.19, "validHrs": 3.16 }, "nan-tw": { "buckets": { "dev": 5648, "test": 6440, "train": 11608, "validated": 28995, "invalidated": 570, "other": 2266 }, "duration": 84081780, "reportedSentences": 226, "validatedSentences": 25621, "unvalidatedSentences": 76, "clips": 31831, "splits": { "accent": {}, "age": { "": 0.13, "twenties": 0.5, "thirties": 0.15, "teens": 0.01, "fourties": 0.17, "fifties": 0.01, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male_masculine": 0.63, "female_feminine": 0.11, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 31563, "agriculture_food": 4, "automotive_transport": 0, "finance": 0, "service_retail": 2, "general": 207, "healthcare": 1, "history_law_government": 0, "language_fundamentals": 55, "media_entertainment": 0, "nature_environment": 55, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 288, "size": 478515596, "checksum": "d3aedf4f12eccbab360ecb94ee403f897e57e2ef22bcddf3ad1dc7a4feba8d50", "avgDurationSecs": 2.642, "validDurationSecs": 76590.469, "totalHrs": 23.35, "validHrs": 21.27 }, "nb-NO": { "buckets": { "dev": 33, "test": 116, "train": 227, "validated": 376, "invalidated": 16, "other": 59 }, "duration": 1913112, "reportedSentences": 0, "validatedSentences": 4372, "unvalidatedSentences": 3695, "clips": 451, "splits": { "accent": {}, "age": { "": 0.31, "twenties": 0, "thirties": 0.62, "teens": 0, "fourties": 0.07, "fifties": 
0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.47, "male_masculine": 0.53, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 196, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 2, "general": 248, "healthcare": 0, "history_law_government": 2, "language_fundamentals": 2, "media_entertainment": 1, "nature_environment": 4, "news_current_affairs": 2, "technology_robotics": 0 } }, "users": 20, "size": 11618895, "checksum": "fdc512f7e961d56c90edff56c0fd8759dfc732324dd6b13b38d9fba05e2aeb37", "avgDurationSecs": 4.242, "validDurationSecs": 1594.967, "totalHrs": 0.53, "validHrs": 0.44 }, "ne-NP": { "buckets": { "dev": 314, "test": 287, "train": 353, "validated": 1172, "invalidated": 100, "other": 362 }, "duration": 6667992, "reportedSentences": 5, "validatedSentences": 1316, "unvalidatedSentences": 30, "clips": 1634, "splits": { "accent": {}, "age": { "": 0.29, "twenties": 0.59, "thirties": 0.13, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.36, "male_masculine": 0.63, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1634, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 57, "size": 39123305, "checksum": "45129a700890d9a834b659524867863b52a8d4dba682235dc05efc0d4420a690", "avgDurationSecs": 4.081, "validDurationSecs": 4782.672, "totalHrs": 1.85, "validHrs": 1.32 }, "nhi": { "buckets": { "dev": 10, "test": 5, "train": 23, "validated": 38, "invalidated": 0, "other": 384 }, "duration": 2145780, "reportedSentences": 0, "validatedSentences": 758, "unvalidatedSentences": 11, "clips": 
422, "splits": { "accent": {}, "age": { "": 0.18, "twenties": 0.74, "thirties": 0.08, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.18, "male_masculine": 0.08, "female_feminine": 0.74, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 422, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 5, "size": 12595518, "checksum": "bf85be412ecd2d5e697f18273120ebd36fb32c26b5e6e13e0058636938c92b39", "avgDurationSecs": 5.085, "validDurationSecs": 193.222, "totalHrs": 0.59, "validHrs": 0.05 }, "nl": { "buckets": { "dev": 12032, "test": 12033, "train": 43458, "validated": 100580, "invalidated": 5899, "other": 2396 }, "duration": 475704973, "reportedSentences": 389, "validatedSentences": 259729, "unvalidatedSentences": 920, "clips": 108875, "splits": { "accent": {}, "age": { "": 0.38, "twenties": 0.19, "thirties": 0.13, "teens": 0.02, "fourties": 0.19, "fifties": 0.07, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.4, "male_masculine": 0.49, "female_feminine": 0.1, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 108582, "agriculture_food": 5, "automotive_transport": 19, "finance": 1, "service_retail": 0, "general": 36, "healthcare": 20, "history_law_government": 8, "language_fundamentals": 164, "media_entertainment": 1, "nature_environment": 49, "news_current_affairs": 1, "technology_robotics": 3 } }, "users": 1840, "size": 3203951585, "checksum": "00b20b5cfaf907e86d7674c8b3fe61c5947e425966bd5ca41ed0a20cbb25a5e4", "avgDurationSecs": 4.369, "validDurationSecs": 439461.825, "totalHrs": 132.14, "validHrs": 122.07 }, "nn-NO": { "buckets": { "dev": 405, "test": 423, 
"train": 464, "validated": 1297, "invalidated": 64, "other": 18 }, "duration": 6050232, "reportedSentences": 23, "validatedSentences": 5318, "unvalidatedSentences": 6, "clips": 1379, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.28, "thirties": 0.4, "teens": 0.02, "fourties": 0.03, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.29, "male_masculine": 0.58, "female_feminine": 0.13, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1379, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 40, "size": 35178556, "checksum": "80382ca5eb7bdd90d5c5223459da560dffabb5cadb3c2538524c1ed8d9b87720", "avgDurationSecs": 4.387, "validDurationSecs": 5690.465, "totalHrs": 1.68, "validHrs": 1.58 }, "nr": { "buckets": { "dev": 0, "test": 0, "train": 0, "validated": 0, "invalidated": 0, "other": 2 }, "duration": 19476, "reportedSentences": 0, "validatedSentences": 319, "unvalidatedSentences": 7468, "clips": 2, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1, "size": 637949, "checksum": "ed5250c98b204bf22e182c69042a45172937171f186b4c3d331a4a072446566f", "avgDurationSecs": 9.738, 
"validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "nso": { "buckets": { "dev": 0, "test": 0, "train": 2, "validated": 2, "invalidated": 0, "other": 18 }, "duration": 92592, "reportedSentences": 0, "validatedSentences": 2244, "unvalidatedSentences": 14227, "clips": 20, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 20, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1, "size": 1552786, "checksum": "14f1c22a0b7a8d548baf627e6e38daf93aab5e7e08f1cb42c39d922670d509a3", "avgDurationSecs": 4.63, "validDurationSecs": 9.259, "totalHrs": 0.02, "validHrs": 0 }, "oc": { "buckets": { "dev": 267, "test": 274, "train": 304, "validated": 1967, "invalidated": 201, "other": 7707 }, "duration": 48105396, "reportedSentences": 13, "validatedSentences": 4934, "unvalidatedSentences": 14, "clips": 9875, "splits": { "accent": {}, "age": { "": 0.5, "twenties": 0.02, "thirties": 0.04, "teens": 0.01, "fourties": 0.17, "fifties": 0.16, "sixties": 0.11, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.52, "male_masculine": 0.08, "female_feminine": 0.4, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 9875, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 147, "size": 273810668, "checksum": 
"ec58c8ef1677ca532e3483bb5cc817375f0b4093a123fd31c6f7c5be4e17d03b", "avgDurationSecs": 4.871, "validDurationSecs": 9582.108, "totalHrs": 13.36, "validHrs": 2.66 }, "or": { "buckets": { "dev": 640, "test": 479, "train": 2151, "validated": 4085, "invalidated": 437, "other": 19622 }, "duration": 134612700, "reportedSentences": 100, "validatedSentences": 6197, "unvalidatedSentences": 87, "clips": 24144, "splits": { "accent": {}, "age": { "": 0.06, "twenties": 0.65, "thirties": 0.27, "teens": 0, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.42, "male_masculine": 0.34, "female_feminine": 0.03, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.21 }, "sentence_domain": { "": 24137, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 7, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 163, "size": 832572110, "checksum": "84f84dd6cfef84a6a4be4f92c4fbaf57da8c49ce4d6aa4ea8784f1c87599f538", "avgDurationSecs": 5.575, "validDurationSecs": 22775.55, "totalHrs": 37.39, "validHrs": 6.32 }, "os": { "buckets": { "dev": 228, "test": 219, "train": 414, "validated": 892, "invalidated": 34, "other": 0 }, "duration": 5109120, "reportedSentences": 3, "validatedSentences": 1888, "unvalidatedSentences": 1, "clips": 926, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.09, "thirties": 0.48, "teens": 0.02, "fourties": 0.17, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.8, "male_masculine": 0.2, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 925, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 1, "language_fundamentals": 0, 
"media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 36, "size": 29900752, "checksum": "cc3dc20c8437991d9b1f4213e2882579ad96186fd7d28a1e71ab4934fd63d742", "avgDurationSecs": 5.517, "validDurationSecs": 4921.528, "totalHrs": 1.41, "validHrs": 1.36 }, "pa-IN": { "buckets": { "dev": 406, "test": 587, "train": 800, "validated": 1793, "invalidated": 100, "other": 1243 }, "duration": 15117506, "reportedSentences": 251, "validatedSentences": 14823, "unvalidatedSentences": 22, "clips": 3136, "splits": { "accent": {}, "age": { "": 0.28, "twenties": 0.22, "thirties": 0.41, "teens": 0, "fourties": 0.04, "fifties": 0.04, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0.66, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 3135, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 1, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 71, "size": 106436536, "checksum": "80599faa974bb6e233d93333bd4ee5caf6bd1cc5dcc827b723a6f42cee01b8bf", "avgDurationSecs": 4.821, "validDurationSecs": 8643.395, "totalHrs": 4.19, "validHrs": 2.4 }, "pl": { "buckets": { "dev": 9856, "test": 9856, "train": 24173, "validated": 137367, "invalidated": 6906, "other": 2446 }, "duration": 670083698, "reportedSentences": 600, "validatedSentences": 229899, "unvalidatedSentences": 11176, "clips": 146719, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.27, "thirties": 0.34, "teens": 0.02, "fourties": 0.11, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0.01 }, "gender": { "": 0.28, "male_masculine": 0.58, "female_feminine": 0.14, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 
146708, "agriculture_food": 0, "automotive_transport": 0, "finance": 8, "service_retail": 0, "general": 3, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 2 } }, "users": 3434, "size": 4746045604, "checksum": "8b766d3a5e58aa37fed2ee3d0465816c8d1b1d06ba57e7ec7a62e77f20be86d8", "avgDurationSecs": 4.567, "validDurationSecs": 627371.965, "totalHrs": 186.13, "validHrs": 174.26 }, "ps": { "buckets": { "dev": 3610, "test": 3610, "train": 4611, "validated": 59745, "invalidated": 5095, "other": 41324 }, "duration": 521048484, "reportedSentences": 1136, "validatedSentences": 24320, "unvalidatedSentences": 35008, "clips": 106164, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0.38, "thirties": 0.15, "teens": 0.07, "fourties": 0.15, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.98, "male_masculine": 0.01, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 63766, "agriculture_food": 821, "automotive_transport": 310, "finance": 107, "service_retail": 439, "general": 36564, "healthcare": 709, "history_law_government": 2119, "language_fundamentals": 1958, "media_entertainment": 1220, "nature_environment": 895, "news_current_affairs": 756, "technology_robotics": 513 } }, "users": 1463, "size": 2880313215, "checksum": "c9247548a7f0c910a8bcae31165d6a58d7b85cc7d2ef273cb5c659d0540b019e", "avgDurationSecs": 4.908, "validDurationSecs": 293225.968, "totalHrs": 144.73, "validHrs": 81.45 }, "pt": { "buckets": { "dev": 9640, "test": 9641, "train": 22923, "validated": 159315, "invalidated": 7686, "other": 27354 }, "duration": 812233339, "reportedSentences": 2847, "validatedSentences": 43526, "unvalidatedSentences": 97, "clips": 194355, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.36, "thirties": 0.18, "teens": 0.02, "fourties": 0.13, 
"fifties": 0.03, "sixties": 0.04, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.27, "male_masculine": 0.68, "female_feminine": 0.06, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 192802, "agriculture_food": 106, "automotive_transport": 35, "finance": 24, "service_retail": 21, "general": 1178, "healthcare": 162, "history_law_government": 64, "language_fundamentals": 17, "media_entertainment": 66, "nature_environment": 41, "news_current_affairs": 4, "technology_robotics": 149 } }, "users": 3759, "size": 5093479274, "checksum": "dee1da586d8e7703c6cb50f2fdc436bf914dd6d1d8fda261e728eb382b00c3ab", "avgDurationSecs": 4.179, "validDurationSecs": 665796.889, "totalHrs": 225.62, "validHrs": 184.94 }, "quy": { "buckets": { "dev": 0, "test": 10, "train": 26, "validated": 37, "invalidated": 7, "other": 1256 }, "duration": 11227752, "reportedSentences": 0, "validatedSentences": 759, "unvalidatedSentences": 282, "clips": 1300, "splits": { "accent": {}, "age": { "": 0.02, "twenties": 0.71, "thirties": 0, "teens": 0, "fourties": 0.02, "fifties": 0, "sixties": 0.25, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.82, "male_masculine": 0, "female_feminine": 0.18, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1300, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 8, "size": 65532116, "checksum": "911a6bb2393ed7eec5417019c644540014c58663be3eef04854e5724fb55b9e3", "avgDurationSecs": 8.637, "validDurationSecs": 319.559, "totalHrs": 3.11, "validHrs": 0.08 }, "rm-sursilv": { "buckets": { "dev": 1366, "test": 1391, "train": 1591, "validated": 4364, "invalidated": 684, "other": 2456 }, "duration": 40806869, "reportedSentences": 17, 
"validatedSentences": 8780, "unvalidatedSentences": 7455, "clips": 7504, "splits": { "accent": {}, "age": { "": 0.61, "twenties": 0.09, "thirties": 0.07, "teens": 0.05, "fourties": 0.18, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.65, "male_masculine": 0.17, "female_feminine": 0.18, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7504, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 91, "size": 306276176, "checksum": "0db054e42ace768072c6858dcefbea606d7da67f9adc9c6cb4e168cdc1636d02", "avgDurationSecs": 5.438, "validDurationSecs": 23731.5, "totalHrs": 11.33, "validHrs": 6.59 }, "rm-vallader": { "buckets": { "dev": 500, "test": 463, "train": 557, "validated": 1527, "invalidated": 394, "other": 740 }, "duration": 15519074, "reportedSentences": 38, "validatedSentences": 5913, "unvalidatedSentences": 1172, "clips": 2661, "splits": { "accent": {}, "age": { "": 0.37, "twenties": 0.14, "thirties": 0.06, "teens": 0, "fourties": 0.41, "fifties": 0, "sixties": 0.03, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male_masculine": 0.44, "female_feminine": 0.19, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2661, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 55, "size": 118486250, "checksum": "b693365e13e8f7b8baf21ebf39f9018de281a9bd3573a8f87023ce1933189fcd", "avgDurationSecs": 5.832, "validDurationSecs": 8905.534, "totalHrs": 4.31, "validHrs": 2.47 
}, "ro": { "buckets": { "dev": 3918, "test": 3930, "train": 5178, "validated": 19459, "invalidated": 1084, "other": 23002 }, "duration": 175938115, "reportedSentences": 487, "validatedSentences": 13753, "unvalidatedSentences": 89202, "clips": 43545, "splits": { "accent": {}, "age": { "": 0.15, "twenties": 0.59, "thirties": 0.13, "teens": 0.02, "fourties": 0.09, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.17, "male_masculine": 0.7, "female_feminine": 0.13, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 43545, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 452, "size": 1095649564, "checksum": "37c2335f7da449b23410e92e28b36830908054c3e3aae72bf7afe741b8c2dbe2", "avgDurationSecs": 4.04, "validDurationSecs": 78621.651, "totalHrs": 48.87, "validHrs": 21.83 }, "ru": { "buckets": { "dev": 10243, "test": 10244, "train": 26654, "validated": 171606, "invalidated": 10469, "other": 17601 }, "duration": 1035024192, "reportedSentences": 570, "validatedSentences": 47819, "unvalidatedSentences": 227, "clips": 199676, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.33, "thirties": 0.18, "teens": 0.09, "fourties": 0.14, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male_masculine": 0.6, "female_feminine": 0.15, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 199582, "agriculture_food": 4, "automotive_transport": 4, "finance": 8, "service_retail": 12, "general": 33, "healthcare": 4, "history_law_government": 42, "language_fundamentals": 4, "media_entertainment": 11, "nature_environment": 4, "news_current_affairs": 0, "technology_robotics": 5 } }, 
"users": 3551, "size": 6951943071, "checksum": "804b57b724599373805a444b698cb3f6676731a93bb1b48911d00f3555a7b494", "avgDurationSecs": 5.184, "validDurationSecs": 889522.834, "totalHrs": 287.5, "validHrs": 247.08 }, "rup": { "buckets": { "dev": 0, "test": 0, "train": 0, "validated": 0, "invalidated": 0, "other": 2 }, "duration": 18936, "reportedSentences": 0, "validatedSentences": 3507, "unvalidatedSentences": -1, "clips": 2, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1, "size": 343060, "checksum": "dd0097cd0c7b8e540a9da0ca76e70998918b3a3ce0e9967e275b9415bb6a9430", "avgDurationSecs": 9.468, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "rw": { "buckets": { "dev": 15987, "test": 16213, "train": 1003033, "validated": 1439114, "invalidated": 227836, "other": 47243 }, "duration": 8583547217, "reportedSentences": 634, "validatedSentences": 1404857, "unvalidatedSentences": 7, "clips": 1714193, "splits": { "accent": {}, "age": { "": 0.05, "twenties": 0.61, "thirties": 0.12, "teens": 0.2, "fourties": 0.02, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.1, "male_masculine": 0.57, "female_feminine": 0.33, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1714193, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, 
"language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1174, "size": 61199343897, "checksum": "778bb12a199a048cf242875c1c9e617cc51fc1a62fe24b7dc6b06c10386c2df1", "avgDurationSecs": 5.007, "validDurationSecs": 7206133.131, "totalHrs": 2384.31, "validHrs": 2001.7 }, "sah": { "buckets": { "dev": 1823, "test": 1831, "train": 2195, "validated": 9115, "invalidated": 462, "other": 3349 }, "duration": 81909696, "reportedSentences": 20, "validatedSentences": 6064, "unvalidatedSentences": 397, "clips": 12926, "splits": { "accent": {}, "age": { "": 0.31, "twenties": 0.23, "thirties": 0.15, "teens": 0.03, "fourties": 0.1, "fifties": 0.17, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0.23, "female_feminine": 0.45, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 12926, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 244, "size": 521123861, "checksum": "ca9336f7fad58cc8477aea3921549333b286955a30e4e940ea00b90fe3e9653a", "avgDurationSecs": 6.337, "validDurationSecs": 57760.087, "totalHrs": 22.75, "validHrs": 16.04 }, "sat": { "buckets": { "dev": 68, "test": 127, "train": 333, "validated": 528, "invalidated": 18, "other": 303 }, "duration": 3801708, "reportedSentences": 7, "validatedSentences": 5421, "unvalidatedSentences": 540, "clips": 849, "splits": { "accent": {}, "age": { "": 0.42, "twenties": 0.42, "thirties": 0.12, "teens": 0.01, "fourties": 0.01, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.4, "male_masculine": 0.59, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, 
"sentence_domain": { "": 849, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 13, "size": 21352309, "checksum": "0ddeb729ca8044be4ebcb38d1a74bab69dcb05f994cf852eea1f5ed93fca7cf8", "avgDurationSecs": 4.478, "validDurationSecs": 2364.313, "totalHrs": 1.05, "validHrs": 0.65 }, "sc": { "buckets": { "dev": 546, "test": 619, "train": 925, "validated": 2264, "invalidated": 118, "other": 383 }, "duration": 12976020, "reportedSentences": 36, "validatedSentences": 5473, "unvalidatedSentences": 361, "clips": 2765, "splits": { "accent": {}, "age": { "": 0.44, "twenties": 0.09, "thirties": 0.29, "teens": 0, "fourties": 0.01, "fifties": 0.15, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.54, "male_masculine": 0.13, "female_feminine": 0.34, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2756, "agriculture_food": 2, "automotive_transport": 1, "finance": 1, "service_retail": 0, "general": 3, "healthcare": 1, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 1, "news_current_affairs": 2, "technology_robotics": 0 } }, "users": 33, "size": 75465170, "checksum": "7b08a6f3cc703fd46648715ddc2eb1cf71167d5490a2f2a48a34630d35a38851", "avgDurationSecs": 4.693, "validDurationSecs": 10624.85, "totalHrs": 3.6, "validHrs": 2.95 }, "sd": { "buckets": { "dev": 0, "test": 40, "train": 271, "validated": 311, "invalidated": 144, "other": 24556 }, "duration": 104872140, "reportedSentences": 2, "validatedSentences": 13345, "unvalidatedSentences": 74, "clips": 25011, "splits": { "accent": {}, "age": { "": 0.02, "twenties": 0, "thirties": 0.14, "teens": 0, "fourties": 0.83, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, 
"nineties": 0 }, "gender": { "": 0.86, "male_masculine": 0.14, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 24946, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 63, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 2, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 21, "size": 539906431, "checksum": "52782e840d6a1f344c356b8098a6c5119831a21270567bd5088a8fb47632fb93", "avgDurationSecs": 4.193, "validDurationSecs": 1304.036, "totalHrs": 29.13, "validHrs": 0.36 }, "sk": { "buckets": { "dev": 5007, "test": 5053, "train": 7354, "validated": 43066, "invalidated": 1211, "other": 358 }, "duration": 190297548, "reportedSentences": 343, "validatedSentences": 17876, "unvalidatedSentences": 5110, "clips": 44635, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.07, "thirties": 0.19, "teens": 0.05, "fourties": 0.13, "fifties": 0.33, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.6, "male_masculine": 0.36, "female_feminine": 0.04, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 33937, "agriculture_food": 2, "automotive_transport": 262, "finance": 2, "service_retail": 0, "general": 10415, "healthcare": 4, "history_law_government": 6462, "language_fundamentals": 90, "media_entertainment": 334, "nature_environment": 229, "news_current_affairs": 237, "technology_robotics": 823 } }, "users": 265, "size": 1097229583, "checksum": "a060e586bc7f841f9d332094a65b4f0e5c0afed91bc5238c3b7cba9ee37687a3", "avgDurationSecs": 4.263, "validDurationSecs": 183608.249, "totalHrs": 52.86, "validHrs": 51 }, "skr": { "buckets": { "dev": 1143, "test": 1036, "train": 1556, "validated": 3747, "invalidated": 467, "other": 1623 }, "duration": 24307020, "reportedSentences": 17, "validatedSentences": 6011, "unvalidatedSentences": 
93, "clips": 5837, "splits": { "accent": {}, "age": { "": 0.35, "twenties": 0.24, "thirties": 0.01, "teens": 0.02, "fourties": 0, "fifties": 0.37, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male_masculine": 0.63, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5837, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 60, "size": 139958564, "checksum": "d64811f4b50822850df855b91a0927a58efe010c564a0d7014e8732e11698b4f", "avgDurationSecs": 4.164, "validDurationSecs": 15603.633, "totalHrs": 6.75, "validHrs": 4.33 }, "sl": { "buckets": { "dev": 1331, "test": 1340, "train": 1469, "validated": 15575, "invalidated": 506, "other": 3409 }, "duration": 77975254, "reportedSentences": 55, "validatedSentences": 4206, "unvalidatedSentences": 31, "clips": 19490, "splits": { "accent": {}, "age": { "": 0.37, "twenties": 0.31, "thirties": 0.03, "teens": 0.05, "fourties": 0.15, "fifties": 0.05, "sixties": 0.04, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male_masculine": 0.52, "female_feminine": 0.1, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 19489, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 1, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 998, "size": 510967894, "checksum": "5d83371bddf988e6d9f6c7b6328a2498ea2f838b4cbe35651f319a8a584f3b27", "avgDurationSecs": 4.001, "validDurationSecs": 62312.19, "totalHrs": 21.65, "validHrs": 17.3 }, "sq": { "buckets": { "dev": 1645, "test": 1917, 
"train": 2658, "validated": 6230, "invalidated": 176, "other": 0 }, "duration": 32672628, "reportedSentences": 133, "validatedSentences": 50008, "unvalidatedSentences": 2635, "clips": 6406, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.08, "thirties": 0.15, "teens": 0, "fourties": 0.18, "fifties": 0.35, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male_masculine": 0.48, "female_feminine": 0.14, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1576, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 1788, "healthcare": 0, "history_law_government": 3022, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 20, "technology_robotics": 0 } }, "users": 142, "size": 194776092, "checksum": "94e939778c5376a987c3307c55ff85f6507f76c357a49647a5b414618c256c78", "avgDurationSecs": 5.1, "validDurationSecs": 31774.972, "totalHrs": 9.07, "validHrs": 8.82 }, "sr": { "buckets": { "dev": 1908, "test": 1977, "train": 2336, "validated": 8238, "invalidated": 424, "other": 4846 }, "duration": 43738236, "reportedSentences": 78, "validatedSentences": 7898, "unvalidatedSentences": 753, "clips": 13508, "splits": { "accent": {}, "age": { "": 0.15, "twenties": 0.25, "thirties": 0.11, "teens": 0, "fourties": 0.1, "fifties": 0.13, "sixties": 0.26, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.53, "male_masculine": 0.32, "female_feminine": 0.15, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 13467, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 13, "healthcare": 0, "history_law_government": 1, "language_fundamentals": 1, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 27 } }, "users": 180, "size": 245390476, "checksum": 
"c1dd2f0329172999113089abfd66c7e207aa2d0cf82ed32ffad585bd4e05712a", "avgDurationSecs": 3.238, "validDurationSecs": 26674.237, "totalHrs": 12.14, "validHrs": 7.4 }, "st": { "buckets": { "dev": 0, "test": 0, "train": 0, "validated": 0, "invalidated": 0, "other": 209 }, "duration": 673668, "reportedSentences": 0, "validatedSentences": 2339, "unvalidatedSentences": 14836, "clips": 209, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.29, "thirties": 0.71, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.29, "male_masculine": 0.71, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 209, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 2, "size": 4961032, "checksum": "c871ed4065e84e405d611971f6951fb8031764c8ac89f32c763fdd201f65761d", "avgDurationSecs": 3.223, "validDurationSecs": 0, "totalHrs": 0.18, "validHrs": 0 }, "sv-SE": { "buckets": { "dev": 5420, "test": 5441, "train": 8150, "validated": 42230, "invalidated": 1588, "other": 6250 }, "duration": 201406263, "reportedSentences": 597, "validatedSentences": 26202, "unvalidatedSentences": 7101, "clips": 50068, "splits": { "accent": {}, "age": { "": 0.19, "twenties": 0.12, "thirties": 0.25, "teens": 0.03, "fourties": 0.37, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.21, "male_masculine": 0.47, "female_feminine": 0.32, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 50065, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 2, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, 
"media_entertainment": 1, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 879, "size": 1272678095, "checksum": "4c9ca58f99a6b8bf9f05a260d6b70685aea8fbe1eb75c582b3edf75b224b3b84", "avgDurationSecs": 4.023, "validDurationSecs": 169876.697, "totalHrs": 55.94, "validHrs": 47.18 }, "sw": { "buckets": { "dev": 12253, "test": 12256, "train": 46534, "validated": 267197, "invalidated": 80665, "other": 377313 }, "duration": 3906322308, "reportedSentences": 1902, "validatedSentences": 134725, "unvalidatedSentences": 5758, "clips": 725175, "splits": { "accent": {}, "age": { "": 0.31, "twenties": 0.5, "thirties": 0.11, "teens": 0.01, "fourties": 0.03, "fifties": 0.04, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.29, "male_masculine": 0.34, "female_feminine": 0.37, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 725175, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1481, "size": 22799952135, "checksum": "9b77dfc836fccd0e282f0476a1d7e0d7d26f6f9544cd2b4319f4d5fa4437d655", "avgDurationSecs": 5.387, "validDurationSecs": 1439318.236, "totalHrs": 1085.08, "validHrs": 399.81 }, "ta": { "buckets": { "dev": 12150, "test": 12237, "train": 46390, "validated": 136550, "invalidated": 5741, "other": 105197 }, "duration": 1533771312, "reportedSentences": 3438, "validatedSentences": 118260, "unvalidatedSentences": 575, "clips": 247488, "splits": { "accent": {}, "age": { "": 0.66, "twenties": 0.11, "thirties": 0.12, "teens": 0.04, "fourties": 0.03, "fifties": 0.02, "sixties": 0, "seventies": 0.02, "eighties": 0, "nineties": 0 }, "gender": { "": 0.66, "male_masculine": 0.19, "female_feminine": 0.15, "transgender": 0, "non-binary": 0, 
"do_not_wish_to_say": 0 }, "sentence_domain": { "": 247488, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 970, "size": 9193569383, "checksum": "ee0244f13ae3c4838872ac86a2cb01b93ae36c5ede206292c503d930f0a7e644", "avgDurationSecs": 6.197, "validDurationSecs": 846249.001, "totalHrs": 426.04, "validHrs": 235.06 }, "te": { "buckets": { "dev": 67, "test": 66, "train": 69, "validated": 342, "invalidated": 54, "other": 2034 }, "duration": 10150920, "reportedSentences": 15, "validatedSentences": 330, "unvalidatedSentences": 19295, "clips": 2430, "splits": { "accent": {}, "age": { "": 0.13, "twenties": 0.69, "thirties": 0.02, "teens": 0.07, "fourties": 0.01, "fifties": 0, "sixties": 0.07, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.18, "male_masculine": 0.65, "female_feminine": 0.17, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2430, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 57, "size": 60732345, "checksum": "6cc309b0752b9925e8e8a199a8e09126bc404aed2ec5b96a2ebde43e7d4a711d", "avgDurationSecs": 4.177, "validDurationSecs": 1428.648, "totalHrs": 2.81, "validHrs": 0.39 }, "tg": { "buckets": { "dev": 90, "test": 69, "train": 123, "validated": 282, "invalidated": 41, "other": 0 }, "duration": 1581876, "reportedSentences": 18, "validatedSentences": 3695, "unvalidatedSentences": 7076, "clips": 323, "splits": { "accent": {}, "age": { "": 0.03, "twenties": 0.97, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, 
"seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.67, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.33 }, "sentence_domain": { "": 27, "agriculture_food": 4, "automotive_transport": 8, "finance": 10, "service_retail": 10, "general": 21, "healthcare": 10, "history_law_government": 17, "language_fundamentals": 143, "media_entertainment": 115, "nature_environment": 4, "news_current_affairs": 18, "technology_robotics": 7 } }, "users": 6, "size": 9696127, "checksum": "11bdb6276afa89a16659a58f99b9a13fb47ce1c8b0bd1050416cb866da688b16", "avgDurationSecs": 4.897, "validDurationSecs": 1381.081, "totalHrs": 0.43, "validHrs": 0.38 }, "th": { "buckets": { "dev": 11057, "test": 11057, "train": 32959, "validated": 148585, "invalidated": 9348, "other": 208030 }, "duration": 1534655424, "reportedSentences": 4317, "validatedSentences": 64490, "unvalidatedSentences": 121698, "clips": 365963, "splits": { "accent": {}, "age": { "": 0.4, "twenties": 0.24, "thirties": 0.07, "teens": 0.04, "fourties": 0.04, "fifties": 0.2, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.41, "male_masculine": 0.42, "female_feminine": 0.17, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 365963, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 7973, "size": 8954733888, "checksum": "a28a23a9096c6a573c91da3fda41b38916ecea488b077961906b334c3968f9e4", "avgDurationSecs": 4.193, "validDurationSecs": 623086.968, "totalHrs": 426.29, "validHrs": 173.07 }, "ti": { "buckets": { "dev": 12, "test": 11, "train": 20, "validated": 43, "invalidated": 3, "other": 267 }, "duration": 1582884, "reportedSentences": 3, "validatedSentences": 4535, 
"unvalidatedSentences": 137, "clips": 313, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0, "thirties": 0.04, "teens": 0, "fourties": 0.7, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.95, "male_masculine": 0.05, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 313, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 14, "size": 9586108, "checksum": "7c45d649eadd751fa3089280682e59280bfb7f5e758065dbf6a4197336cff1e4", "avgDurationSecs": 5.057, "validDurationSecs": 217.457, "totalHrs": 0.43, "validHrs": 0.06 }, "tig": { "buckets": { "dev": 1618, "test": 1606, "train": 1990, "validated": 6425, "invalidated": 244, "other": 73 }, "duration": 42776676, "reportedSentences": 37, "validatedSentences": 5417, "unvalidatedSentences": 88, "clips": 6742, "splits": { "accent": {}, "age": { "": 0.7, "twenties": 0.21, "thirties": 0.1, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.99, "male_masculine": 0.01, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6730, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 6, "healthcare": 0, "history_law_government": 5, "language_fundamentals": 1, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 146, "size": 229375450, "checksum": "0923bdf7db441e1482bb52b8ad0e00db02bbd1c9626b5a98d28f6c2592a40223", "avgDurationSecs": 6.345, "validDurationSecs": 40765.373, "totalHrs": 11.88, "validHrs": 11.32 }, "tk": { "buckets": { "dev": 447, "test": 570, "train": 
741, "validated": 1972, "invalidated": 131, "other": 2593 }, "duration": 25710156, "reportedSentences": 27, "validatedSentences": 2494, "unvalidatedSentences": 238, "clips": 4696, "splits": { "accent": {}, "age": { "": 0.34, "twenties": 0.55, "thirties": 0.08, "teens": 0, "fourties": 0.02, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.39, "male_masculine": 0.43, "female_feminine": 0.18, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 4696, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 126, "size": 144959234, "checksum": "ebd5200da295e5589e6b588a9817496e0743c00a9241e5cf4f1b71e5a2a12769", "avgDurationSecs": 5.475, "validDurationSecs": 10796.514, "totalHrs": 7.14, "validHrs": 2.99 }, "tn": { "buckets": { "dev": 312, "test": 369, "train": 1078, "validated": 3503, "invalidated": 11, "other": 535 }, "duration": 17693028, "reportedSentences": 1, "validatedSentences": 1765, "unvalidatedSentences": 15342, "clips": 4049, "splits": { "accent": {}, "age": { "": 0.03, "twenties": 0.02, "thirties": 0.51, "teens": 0, "fourties": 0, "fifties": 0.43, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.48, "male_masculine": 0, "female_feminine": 0.48, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.04 }, "sentence_domain": { "": 4049, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 17, "size": 89822314, "checksum": "341de72016cc2665a994bbbfd70e9b04ee7ce21e4a402513b203d4d72efe13ca", 
"avgDurationSecs": 4.37, "validDurationSecs": 15307.157, "totalHrs": 4.91, "validHrs": 4.25 }, "tok": { "buckets": { "dev": 2217, "test": 2166, "train": 2630, "validated": 12390, "invalidated": 358, "other": 4997 }, "duration": 74633256, "reportedSentences": 148, "validatedSentences": 7465, "unvalidatedSentences": 2278, "clips": 17745, "splits": { "accent": {}, "age": { "": 0.32, "twenties": 0.2, "thirties": 0.07, "teens": 0.23, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0.18, "nineties": 0 }, "gender": { "": 0.43, "male_masculine": 0.49, "female_feminine": 0.05, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.03 }, "sentence_domain": { "": 17744, "agriculture_food": 0, "automotive_transport": 0, "finance": 1, "service_retail": 0, "general": 1, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 166, "size": 436348235, "checksum": "66677e7cc4635162ce59b6d66edd1ea4f265102f80362d331204ce6287c6bc5d", "avgDurationSecs": 4.206, "validDurationSecs": 52110.794, "totalHrs": 20.73, "validHrs": 14.47 }, "tr": { "buckets": { "dev": 11783, "test": 11784, "train": 40377, "validated": 120345, "invalidated": 4893, "other": 116 }, "duration": 480034287, "reportedSentences": 458, "validatedSentences": 410398, "unvalidatedSentences": 3446, "clips": 125354, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.23, "thirties": 0.09, "teens": 0.02, "fourties": 0.03, "fifties": 0.07, "sixties": 0.2, "seventies": 0.03, "eighties": 0.1, "nineties": 0 }, "gender": { "": 0.26, "male_masculine": 0.42, "female_feminine": 0.32, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 125339, "agriculture_food": 0, "automotive_transport": 1, "finance": 0, "service_retail": 0, "general": 1, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 12, "media_entertainment": 
1, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1767, "size": 2929389524, "checksum": "f346ae93cddc9c6fd619ac85454e6bc3d310510b8d4525b11f68bb2f9fab1033", "avgDurationSecs": 3.829, "validDurationSecs": 460852.675, "totalHrs": 133.34, "validHrs": 128.01 }, "ts": { "buckets": { "dev": 0, "test": 0, "train": 0, "validated": 0, "invalidated": 1, "other": 0 }, "duration": 8928, "reportedSentences": 0, "validatedSentences": 352, "unvalidatedSentences": 16012, "clips": 1, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1, "size": 1040817, "checksum": "8f97c9bfbeb77f64bb8b6291e3b8532d72467b0221396d92e7afddac903382f7", "avgDurationSecs": 8.928, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "tt": { "buckets": { "dev": 3989, "test": 5117, "train": 8871, "validated": 29738, "invalidated": 456, "other": 680 }, "duration": 118273202, "reportedSentences": 6, "validatedSentences": 18043, "unvalidatedSentences": 40, "clips": 30874, "splits": { "accent": {}, "age": { "": 0.21, "twenties": 0.05, "thirties": 0.71, "teens": 0, "fourties": 0, "fifties": 0.01, "sixties": 0.01, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.21, "male_masculine": 0.76, "female_feminine": 0.03, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 30863, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, 
"general": 11, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 281, "size": 862474472, "checksum": "1a07944c2af78f7990edc486f0cca36a395640022b5a4aa75b5143e5faab5662", "avgDurationSecs": 3.831, "validDurationSecs": 113921.373, "totalHrs": 32.85, "validHrs": 31.64 }, "tw": { "buckets": { "dev": 0, "test": 20, "train": 205, "validated": 225, "invalidated": 1, "other": 46 }, "duration": 1172304, "reportedSentences": 3, "validatedSentences": 49159, "unvalidatedSentences": 19, "clips": 272, "splits": { "accent": {}, "age": { "": 0.19, "twenties": 0.02, "thirties": 0.79, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.19, "male_masculine": 0.81, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 272, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 11, "size": 9651568, "checksum": "e0f4e3f9315550317c4a2f3fae74fe0578a0fc27614c6e51342df70be58ac58a", "avgDurationSecs": 4.31, "validDurationSecs": 969.737, "totalHrs": 0.32, "validHrs": 0.26 }, "ug": { "buckets": { "dev": 14412, "test": 14412, "train": 107646, "validated": 253429, "invalidated": 19346, "other": 21716 }, "duration": 1732252608, "reportedSentences": 1490, "validatedSentences": 230706, "unvalidatedSentences": 8860, "clips": 294491, "splits": { "accent": {}, "age": { "": 0.2, "twenties": 0.34, "thirties": 0.28, "teens": 0.06, "fourties": 0.1, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.52, "male_masculine": 0.27, "female_feminine": 0.2, "transgender": 0, 
"non-binary": 0, "do_not_wish_to_say": 0.01, "intersex": 0 }, "sentence_domain": { "": 292084, "agriculture_food": 2, "automotive_transport": 1, "finance": 1, "service_retail": 0, "general": 2380, "healthcare": 27, "history_law_government": 3, "language_fundamentals": 9, "media_entertainment": 12, "nature_environment": 1, "news_current_affairs": 0, "technology_robotics": 9 } }, "users": 1599, "size": 9869301655, "checksum": "abe193521d310e78b5da6332e28da12d5143cb66b389f0451aa056d74df9da19", "avgDurationSecs": 5.882, "validDurationSecs": 1490718.04, "totalHrs": 481.18, "validHrs": 414.08 }, "uk": { "buckets": { "dev": 10253, "test": 10259, "train": 26773, "validated": 77626, "invalidated": 3336, "other": 8286 }, "duration": 414308760, "reportedSentences": 655, "validatedSentences": 212667, "unvalidatedSentences": 1602, "clips": 89248, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0.34, "thirties": 0.25, "teens": 0.09, "fourties": 0.09, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.24, "male_masculine": 0.59, "female_feminine": 0.17, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 89248, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1146, "size": 2722319924, "checksum": "d5a8a7d126da081055108c0d08d9c3c511e3ab83f9bbae2d768b75ceb93a7e8f", "avgDurationSecs": 4.642, "validDurationSecs": 360356.891, "totalHrs": 115.08, "validHrs": 100.09 }, "ur": { "buckets": { "dev": 5082, "test": 5082, "train": 7326, "validated": 68102, "invalidated": 10955, "other": 173438 }, "duration": 1084937076, "reportedSentences": 400, "validatedSentences": 21699, "unvalidatedSentences": 48263, "clips": 252495, "splits": { "accent": {}, "age": { "": 0.06, 
"twenties": 0.92, "thirties": 0, "teens": 0, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.23, "male_masculine": 0.52, "female_feminine": 0.25, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 252388, "agriculture_food": 12, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 48, "healthcare": 47, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 486, "size": 6333774741, "checksum": "6845fa00738c84d89aa9a25f3dc0712d1dd42b33fc00ca0a223e68ab0d2fc8e9", "avgDurationSecs": 4.297, "validDurationSecs": 292625.14, "totalHrs": 301.37, "validHrs": 81.28 }, "uz": { "buckets": { "dev": 12261, "test": 12365, "train": 48733, "validated": 87085, "invalidated": 14114, "other": 128457 }, "duration": 954607716, "reportedSentences": 1833, "validatedSentences": 148734, "unvalidatedSentences": 137878, "clips": 229656, "splits": { "accent": {}, "age": { "": 0.41, "twenties": 0.4, "thirties": 0.01, "teens": 0.18, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.41, "male_masculine": 0.44, "female_feminine": 0.15, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 229656, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 2262, "size": 5357279008, "checksum": "5a9331d6477037363462fa134f0b8aa3c6cdc7eb61aa7c35075af8731599c9a5", "avgDurationSecs": 4.157, "validDurationSecs": 361984.938, "totalHrs": 265.16, "validHrs": 100.55 }, "ve": { "buckets": { "dev": 0, "test": 0, "train": 0, "validated": 0, "invalidated": 0, 
"other": 5 }, "duration": 27684, "reportedSentences": 0, "validatedSentences": 1727, "unvalidatedSentences": 7849, "clips": 5, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1, "size": 764883, "checksum": "64fd9f9edc129f49dd174bae49cf76e567e2e0f4ed68030c10b42431913b839a", "avgDurationSecs": 5.537, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "vi": { "buckets": { "dev": 931, "test": 1367, "train": 2104, "validated": 5760, "invalidated": 433, "other": 12568 }, "duration": 74430080, "reportedSentences": 201, "validatedSentences": 6275, "unvalidatedSentences": 5464, "clips": 18761, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.21, "thirties": 0.05, "teens": 0.21, "fourties": 0.02, "fifties": 0, "sixties": 0.27, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male_masculine": 0.5, "female_feminine": 0.19, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 18450, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 311, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 354, "size": 433392299, "checksum": "e281c6ea8e8516b07ae3e3ffb3e0116399dcae336bb74c9b61aca23f698d90ac", "avgDurationSecs": 3.967, "validDurationSecs": 22851.514, "totalHrs": 20.67, 
"validHrs": 6.34 }, "vot": { "buckets": { "dev": 0, "test": 7, "train": 96, "validated": 103, "invalidated": 324, "other": 0 }, "duration": 1029972, "reportedSentences": 0, "validatedSentences": 5437, "unvalidatedSentences": 1, "clips": 427, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.74, "thirties": 0, "teens": 0.01, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.25, "male_masculine": 0.75, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 427, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 6, "size": 8186628, "checksum": "4f6fb207960f0b55489e98aec261ea157521aca4e7d72a95315361174975659e", "avgDurationSecs": 2.412, "validDurationSecs": 248.448, "totalHrs": 0.28, "validHrs": 0.06 }, "xh": { "buckets": { "dev": 0, "test": 0, "train": 7, "validated": 7, "invalidated": 0, "other": 23 }, "duration": 179712, "reportedSentences": 0, "validatedSentences": 1476, "unvalidatedSentences": 14922, "clips": 30, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 30, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 4, "size": 2197690, "checksum": 
"b2029fd5507e05909b5d9afa48a9fddc2ee96aaad3bce5013e918fe5a25b6dd4", "avgDurationSecs": 5.99, "validDurationSecs": 41.933, "totalHrs": 0.04, "validHrs": 0.01 }, "yi": { "buckets": { "dev": 168, "test": 222, "train": 320, "validated": 748, "invalidated": 43, "other": 58 }, "duration": 3327408, "reportedSentences": 4, "validatedSentences": 1440, "unvalidatedSentences": 92, "clips": 849, "splits": { "accent": {}, "age": { "": 0.63, "twenties": 0.11, "thirties": 0, "teens": 0.26, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 848, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 1, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 57, "size": 18782267, "checksum": "41684592981aed4ee1e8f34fd78193dd02666eb26a34b492f3235bf3a287335b", "avgDurationSecs": 3.919, "validDurationSecs": 2931.568, "totalHrs": 0.92, "validHrs": 0.81 }, "yo": { "buckets": { "dev": 913, "test": 1113, "train": 1404, "validated": 3432, "invalidated": 259, "other": 1156 }, "duration": 29280600, "reportedSentences": 21, "validatedSentences": 5419, "unvalidatedSentences": 17, "clips": 4847, "splits": { "accent": {}, "age": { "": 0.29, "twenties": 0.61, "thirties": 0.03, "teens": 0.04, "fourties": 0, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0.3, "female_feminine": 0.37, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 4847, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, 
"nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 131, "size": 170646521, "checksum": "e9316cdfc2c057b33d0d2d17d1e3c117106ff92b1fbdaa6e9825cb73e4a0dd2f", "avgDurationSecs": 6.041, "validDurationSecs": 20732.622, "totalHrs": 8.13, "validHrs": 5.75 }, "yue": { "buckets": { "dev": 5130, "test": 5130, "train": 7419, "validated": 191152, "invalidated": 8075, "other": 79908 }, "duration": 1107369936, "reportedSentences": 2257, "validatedSentences": 18531, "unvalidatedSentences": 10192, "clips": 279135, "splits": { "accent": {}, "age": { "": 0.19, "twenties": 0.29, "thirties": 0.47, "teens": 0.01, "fourties": 0.04, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2, "male_masculine": 0.17, "female_feminine": 0.62, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 276294, "agriculture_food": 18, "automotive_transport": 51, "finance": 127, "service_retail": 606, "general": 1674, "healthcare": 416, "history_law_government": 67, "language_fundamentals": 0, "media_entertainment": 191, "nature_environment": 188, "news_current_affairs": 114, "technology_robotics": 95 } }, "users": 1163, "size": 6418092185, "checksum": "82628321e0e7e0872e8f61e0b927b59da7df9b18a7d9a3e128afcb808bb977c7", "avgDurationSecs": 3.967, "validDurationSecs": 758328.329, "totalHrs": 307.6, "validHrs": 210.64 }, "zgh": { "buckets": { "dev": 297, "test": 228, "train": 842, "validated": 1367, "invalidated": 88, "other": 648 }, "duration": 7261884, "reportedSentences": 6, "validatedSentences": 2731, "unvalidatedSentences": 355, "clips": 2103, "splits": { "accent": {}, "age": { "": 0.09, "twenties": 0.37, "thirties": 0.54, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.12, "male_masculine": 0.85, "female_feminine": 0.03, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2064, 
"agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 31, "healthcare": 2, "history_law_government": 0, "language_fundamentals": 5, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 1 } }, "users": 31, "size": 39631735, "checksum": "0dd6449e6255af6283c308939f7376b26d92ce15ed1606b92ad28908cd805f1b", "avgDurationSecs": 3.453, "validDurationSecs": 4720.397, "totalHrs": 2.01, "validHrs": 1.31 }, "zh-CN": { "buckets": { "dev": 10635, "test": 10635, "train": 29470, "validated": 188380, "invalidated": 59020, "other": 601017 }, "duration": 3853377996, "reportedSentences": 1047, "validatedSentences": 59060, "unvalidatedSentences": 851, "clips": 848417, "splits": { "accent": {}, "age": { "": 0.92, "twenties": 0.05, "thirties": 0.01, "teens": 0.01, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.93, "male_masculine": 0.06, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 846937, "agriculture_food": 14, "automotive_transport": 38, "finance": 21, "service_retail": 37, "general": 489, "healthcare": 62, "history_law_government": 187, "language_fundamentals": 41, "media_entertainment": 674, "nature_environment": 18, "news_current_affairs": 50, "technology_robotics": 93 } }, "users": 7415, "size": 22814184978, "checksum": "fa27b9366c3ef5bc9b2683da131105b890f6a263196c3c5dbb7de42dbe4fbbca", "avgDurationSecs": 4.542, "validDurationSecs": 855592.647, "totalHrs": 1070.38, "validHrs": 237.66 }, "zh-HK": { "buckets": { "dev": 5603, "test": 5603, "train": 8449, "validated": 93937, "invalidated": 4622, "other": 24636 }, "duration": 512154492, "reportedSentences": 824, "validatedSentences": 20132, "unvalidatedSentences": 35, "clips": 123195, "splits": { "accent": {}, "age": { "": 0.38, "twenties": 0.33, "thirties": 0.11, "teens": 0.02, "fourties": 0.12, "fifties": 0.03, 
"sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male_masculine": 0.41, "female_feminine": 0.22, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 122085, "agriculture_food": 0, "automotive_transport": 25, "finance": 39, "service_retail": 252, "general": 735, "healthcare": 81, "history_law_government": 55, "language_fundamentals": 79, "media_entertainment": 47, "nature_environment": 46, "news_current_affairs": 23, "technology_robotics": 34 } }, "users": 3089, "size": 3645187886, "checksum": "563fbc0cd3710f4d24cebede8bdf1be8cb449abec5039c1cc3a237673d1a1990", "avgDurationSecs": 4.157, "validDurationSecs": 390521.178, "totalHrs": 142.26, "validHrs": 108.47 }, "zh-TW": { "buckets": { "dev": 5087, "test": 5087, "train": 7327, "validated": 84515, "invalidated": 4880, "other": 49926 }, "duration": 468620086, "reportedSentences": 179, "validatedSentences": 19626, "unvalidatedSentences": 43, "clips": 139321, "splits": { "accent": {}, "age": { "": 0.27, "twenties": 0.3, "thirties": 0.19, "teens": 0.06, "fourties": 0.09, "fifties": 0.09, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.29, "male_masculine": 0.49, "female_feminine": 0.22, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 138740, "agriculture_food": 12, "automotive_transport": 270, "finance": 1, "service_retail": 149, "general": 207, "healthcare": 21, "history_law_government": 164, "language_fundamentals": 8, "media_entertainment": 163, "nature_environment": 11, "news_current_affairs": 42, "technology_robotics": 13 } }, "users": 2280, "size": 3129076001, "checksum": "a1ebf7a80ae720bfa83c410ae1688cf9983217a420a38c0c092014aacf721366", "avgDurationSecs": 3.364, "validDurationSecs": 284274.636, "totalHrs": 130.17, "validHrs": 78.96 }, "zu": { "buckets": { "dev": 0, "test": 1, "train": 12, "validated": 13, "invalidated": 1, "other": 106 }, "duration": 422136, 
"reportedSentences": 0, "validatedSentences": 1320, "unvalidatedSentences": 19379, "clips": 120, "splits": { "accent": {}, "age": { "": 0.42, "twenties": 0.57, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.42, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.57 }, "sentence_domain": { "": 120, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 6, "size": 3905799, "checksum": "f9c182c7f4235175080116c955d0ee32bdcf53890a8079a7752090c219e9b693", "avgDurationSecs": 3.518, "validDurationSecs": 45.731, "totalHrs": 0.11, "validHrs": 0.01 }, "zza": { "buckets": { "dev": 463, "test": 392, "train": 734, "validated": 1589, "invalidated": 146, "other": 454 }, "duration": 8982468, "reportedSentences": 2321, "validatedSentences": 5654, "unvalidatedSentences": 972, "clips": 2189, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0, "thirties": 0.65, "teens": 0, "fourties": 0, "fifties": 0.01, "sixties": 0.12, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.71, "male_masculine": 0.27, "female_feminine": 0.02, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1456, "agriculture_food": 3, "automotive_transport": 1, "finance": 1, "service_retail": 9, "general": 670, "healthcare": 16, "history_law_government": 0, "language_fundamentals": 27, "media_entertainment": 2, "nature_environment": 3, "news_current_affairs": 0, "technology_robotics": 2 } }, "users": 18, "size": 52619113, "checksum": "4e063cc3b9d11340c2e6a0b1fe3ae968fa1fee2942be26002ab50b9cadeb1911", "avgDurationSecs": 4.103, "validDurationSecs": 6520.394, "totalHrs": 2.49, "validHrs": 1.81 } 
}, "totalDuration": 121737597979, "totalValidDurationSecs": 81507729, "totalHrs": 33815, "totalValidHrs": 22640 } ================================================ FILE: datasets/scripted-speech/cv-corpus-22.0-delta-2025-06-20.json ================================================ { "locales": { "ab": { "duration": 974304, "reportedSentences": 0, "validatedSentences": 1, "unvalidatedSentences": 0, "clips": 185, "users": 4, "size": 5702102, "checksum": "13d4f651aa803c93dd03c300826032c3ee91bc5030f1a480b8f4be3183e1018c", "avgDurationSecs": 5.178, "validDurationSecs": 362.88, "totalHrs": 0.27, "validHrs": 0.1 }, "af": { "duration": 223488, "reportedSentences": 73, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 30, "users": 4, "size": 1327360, "checksum": "cec6c7776b5dc27e0ff70ddce2ca87e41406d7c3f1834d1420d9653fa1e16163", "avgDurationSecs": 5.93, "validDurationSecs": 161.79, "totalHrs": 0.07, "validHrs": 0.05 }, "am": { "duration": 266472, "reportedSentences": 0, "validatedSentences": 3, "unvalidatedSentences": -3, "clips": 32, "users": 5, "size": 1570744, "checksum": "f8ff75b265a870c48af974fa9ac830d4f5420342b059b51f801d524199741889", "avgDurationSecs": 6.297, "validDurationSecs": 135.41, "totalHrs": 0.08, "validHrs": 0.03 }, "ar": { "duration": 420768, "reportedSentences": 3, "validatedSentences": 0, "unvalidatedSentences": 6, "clips": 90, "users": 12, "size": 2502169, "checksum": "17cd613f88d19fce8f8f171fff5b78b67d491b617906163e876b6caecccdeb8c", "avgDurationSecs": 4.161, "validDurationSecs": 859.14, "totalHrs": 0.11, "validHrs": 0.24 }, "as": { "duration": 0, "reportedSentences": 0, "validatedSentences": 4, "unvalidatedSentences": -4, "clips": 0, "users": 0, "size": -4456, "checksum": "74c3cb9660ff9868bb52d084dc38a1e6ec91d1ee1cffa88ebf61686e59172ade", "avgDurationSecs": 5.897, "validDurationSecs": 88.45, "totalHrs": 0, "validHrs": 0.03 }, "ast": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, 
"users": 0, "size": 1490, "checksum": "06a7bfbc078e9cedc0b9c7ee665aa85d13b8dfac65616faa4d281bd59808626f", "avgDurationSecs": 4.434, "validDurationSecs": 39.91, "totalHrs": 0, "validHrs": 0.01 }, "az": { "duration": 174312, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 1, "clips": 35, "users": 3, "size": 1022031, "checksum": "f10464cf2fb94db9b835d8ccfcfa56bdd2bf8fb09d422b6ff68d9ce9accc4461", "avgDurationSecs": 5.421, "validDurationSecs": 108.39, "totalHrs": 0.04, "validHrs": 0.03 }, "ba": { "duration": 1093896, "reportedSentences": 0, "validatedSentences": 2, "unvalidatedSentences": -2, "clips": 233, "users": 4, "size": 6299034, "checksum": "40ab8fd7e126ec97d34fa472ec5ce8b9b2e20de97314debf6a2290c1441fda4e", "avgDurationSecs": 4.427, "validDurationSecs": 684.29, "totalHrs": 0.31, "validHrs": 0.19 }, "bas": { "duration": 22068, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 6, "users": 1, "size": 120142, "checksum": "f21925f1da52acaf420cbf5c24ae6fbf3d07d9dc61aa71ecc1b467b1bf31172b", "avgDurationSecs": 3.924, "validDurationSecs": -1.32, "totalHrs": 0.01, "validHrs": 0 }, "be": { "duration": 18603900, "reportedSentences": 0, "validatedSentences": 13, "unvalidatedSentences": -8, "clips": 3396, "users": 16, "size": 98099677, "checksum": "7f7a43b2b2c047f301b8b7d3bac6fd3bd0c8eff92c5e47a3d09de3939ebf9d9b", "avgDurationSecs": 4.789, "validDurationSecs": 15014.78, "totalHrs": 5.16, "validHrs": 4.17 }, "bg": { "duration": 160920, "reportedSentences": 3, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 30, "users": 2, "size": 963190, "checksum": "6bd641e34c84e9074e4580f85d427a587e2377f3f464a0866dcd88e89572e4f1", "avgDurationSecs": 5.472, "validDurationSecs": 506.26, "totalHrs": 0.04, "validHrs": 0.14 }, "bn": { "duration": 7893828, "reportedSentences": 0, "validatedSentences": 55, "unvalidatedSentences": -53, "clips": 1366, "users": 18, "size": 46234061, "checksum": 
"c873aff20fad4c95da1d39c475c6f678e362d1e585b4b90dbbf21b91d8ae02f2", "avgDurationSecs": 4.37, "validDurationSecs": 758.72, "totalHrs": 2.19, "validHrs": 0.21 }, "br": { "duration": 7571376, "reportedSentences": 0, "validatedSentences": 14053, "unvalidatedSentences": 57, "clips": 2029, "users": 17, "size": 44936540, "checksum": "dbb53180dfd4433e16136627248a1b89fa234d183a81162a316a5d718ab6730f", "avgDurationSecs": 3.224, "validDurationSecs": 6556.53, "totalHrs": 2.11, "validHrs": 1.82 }, "ca": { "duration": 96416388, "reportedSentences": 452, "validatedSentences": 11807, "unvalidatedSentences": 2428, "clips": 28094, "users": 173, "size": 561791161, "checksum": "74c793966e00e656a2d2b2194c8cb02260815b53f5830ce5b0e7046a2c8dc69e", "avgDurationSecs": 5.212, "validDurationSecs": 114223.55, "totalHrs": 26.79, "validHrs": 31.73 }, "ckb": { "duration": 2279232, "reportedSentences": 1, "validatedSentences": 41, "unvalidatedSentences": -30, "clips": 406, "users": 34, "size": 12884253, "checksum": "e8c4e65fbaced436603ad6291453ac8641f2f0f74758c5e06b10f66b23e82d66", "avgDurationSecs": 4.094, "validDurationSecs": 1900.21, "totalHrs": 0.64, "validHrs": 0.53 }, "cnh": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -390, "checksum": "1edff706113c0b0443eae2aac81a7098ff0261397b7415e9542c3793e4493a0c", "avgDurationSecs": 3.516, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "cs": { "duration": 1839060, "reportedSentences": 0, "validatedSentences": 7, "unvalidatedSentences": 8, "clips": 479, "users": 31, "size": 10663807, "checksum": "8efa762eb69b351481220321e1bdd8d459f7c1f339c67c8f239ff09f63734701", "avgDurationSecs": 4.462, "validDurationSecs": 2111.05, "totalHrs": 0.51, "validHrs": 0.58 }, "cv": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 4326, "checksum": 
"73a8ede271c067bc0fa9e5b8fa6f120b806525d8ba1b4ea28b44077e125c0f46", "avgDurationSecs": 5.042, "validDurationSecs": 40.33, "totalHrs": 0, "validHrs": 0.01 }, "cy": { "duration": 794160, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 125, "users": 7, "size": 4624307, "checksum": "3430fe213780804f53a36417ecf15fbf3675ad825088666cd83dd6d8f5dca665", "avgDurationSecs": 4.915, "validDurationSecs": 229.39, "totalHrs": 0.22, "validHrs": 0.06 }, "da": { "duration": 1119060, "reportedSentences": 54, "validatedSentences": 286, "unvalidatedSentences": -225, "clips": 272, "users": 8, "size": 6563143, "checksum": "5e54de7822335ebcf96fe83ee4c02afacbd9874c99929c24d9a0bed81c94d9c8", "avgDurationSecs": 4.064, "validDurationSecs": 177.93, "totalHrs": 0.31, "validHrs": 0.05 }, "dav": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -15359, "checksum": "74bb24434e03a92b2346c5f99c6dbe751cbde56928dc278fe2dc41906daf98fa", "avgDurationSecs": 4.059, "validDurationSecs": 4.06, "totalHrs": 0, "validHrs": 0 }, "de": { "duration": 84790260, "reportedSentences": 73, "validatedSentences": 90, "unvalidatedSentences": 3, "clips": 12745, "users": 828, "size": 486154103, "checksum": "f8aead37e27b76b781bcd050c78d61190467564420def922946d0436f62b6526", "avgDurationSecs": 5.261, "validDurationSecs": 34116.05, "totalHrs": 23.55, "validHrs": 9.48 }, "dv": { "duration": 67320, "reportedSentences": 0, "validatedSentences": 2, "unvalidatedSentences": 0, "clips": 10, "users": 1, "size": 366641, "checksum": "90b9f4168d2443b2ba5c5c6782c7c4388227dd556f6188518d9e713f10da397c", "avgDurationSecs": 5.121, "validDurationSecs": 153.24, "totalHrs": 0.01, "validHrs": 0.04 }, "dyu": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 241, "checksum": "0a72e30231529f9cc68dd52c2f2c90ea8c44b39a9b59a448bc5c0344ac4410d0", "avgDurationSecs": 
6.309, "validDurationSecs": 12.62, "totalHrs": 0, "validHrs": 0 }, "el": { "duration": 928476, "reportedSentences": 2, "validatedSentences": 11, "unvalidatedSentences": -11, "clips": 211, "users": 23, "size": 5341636, "checksum": "782d4886e6db4e27eeae375a1f98773471aeb82c2a8c63b6fc421a772c9a2eb6", "avgDurationSecs": 4.151, "validDurationSecs": 513.06, "totalHrs": 0.26, "validHrs": 0.14 }, "en": { "duration": 172095408, "reportedSentences": 191, "validatedSentences": 1526, "unvalidatedSentences": 3180, "clips": 26061, "users": 2021, "size": 1000219201, "checksum": "075ca496a0adc0b16bac6da3f77b92893a2b79ea524db0e8442f7dde11a87009", "avgDurationSecs": 5.283, "validDurationSecs": 79482.96, "totalHrs": 47.81, "validHrs": 22.08 }, "eo": { "duration": 4901364, "reportedSentences": 7, "validatedSentences": 31, "unvalidatedSentences": -31, "clips": 730, "users": 11, "size": 28605742, "checksum": "3b58a9dd888e072f93452b224f064ea92182ee021c076a4efb0edb932459f3a6", "avgDurationSecs": 6.07, "validDurationSecs": 1633.14, "totalHrs": 1.37, "validHrs": 0.45 }, "es": { "duration": 35837496, "reportedSentences": 10, "validatedSentences": 320, "unvalidatedSentences": -267, "clips": 6127, "users": 325, "size": 208267211, "checksum": "6460828a35d4f7056db26153a624644c7f346e91e478d667ccc0752ab187cca6", "avgDurationSecs": 4.878, "validDurationSecs": 13807.34, "totalHrs": 9.95, "validHrs": 3.84 }, "et": { "duration": 692244, "reportedSentences": 1, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 132, "users": 3, "size": 4080604, "checksum": "8523341b5dd4c3df5bdd5e8eacf0c9937c3ea1a08dfa9b68fc78c716cbb7f99d", "avgDurationSecs": 6.745, "validDurationSecs": 994.67, "totalHrs": 0.2, "validHrs": 0.27 }, "eu": { "duration": 18326340, "reportedSentences": 7, "validatedSentences": 251, "unvalidatedSentences": 547, "clips": 3219, "users": 142, "size": 108279279, "checksum": "b19c78b8fcd2237b1fc9e6ffc3b3cc357ac003c50081435f4ef7e0141f4796a7", "avgDurationSecs": 5.503, "validDurationSecs": 
324485.35, "totalHrs": 5.09, "validHrs": 90.13 }, "fa": { "duration": 3281472, "reportedSentences": 41, "validatedSentences": 107, "unvalidatedSentences": 724, "clips": 949, "users": 19, "size": 18590695, "checksum": "883ae1240c336f9a9c727fd2ddf5f540fb9e1733d78942b89b644ee9e6fd292a", "avgDurationSecs": 3.96, "validDurationSecs": 1116.48, "totalHrs": 0.91, "validHrs": 0.31 }, "fi": { "duration": 147888, "reportedSentences": 1, "validatedSentences": 0, "unvalidatedSentences": 2, "clips": 26, "users": 4, "size": 856335, "checksum": "d329700a4e0d4a03e754b835b29e4ec0aaf45d180f72ce069b214c2295937fe1", "avgDurationSecs": 4.623, "validDurationSecs": 291.66, "totalHrs": 0.04, "validHrs": 0.08 }, "fr": { "duration": 49489560, "reportedSentences": 17, "validatedSentences": 388, "unvalidatedSentences": -266, "clips": 8342, "users": 663, "size": 288476705, "checksum": "0b31c9b6d8e47f92d7c96bcaf3c8b2be652acc475993a5a73f86021b7309150c", "avgDurationSecs": 5.034, "validDurationSecs": 39964.34, "totalHrs": 13.75, "validHrs": 11.1 }, "fy-NL": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 8, "clips": 0, "users": 0, "size": 5372, "checksum": "5c3ee054f8bd6732fb948ec8171212fece144a0a9fcce08e2325b52daa0a7ab0", "avgDurationSecs": 4.854, "validDurationSecs": 538.78, "totalHrs": 0, "validHrs": 0.15 }, "ga-IE": { "duration": 6416820, "reportedSentences": 4, "validatedSentences": 57, "unvalidatedSentences": 430, "clips": 1843, "users": 16, "size": 37180404, "checksum": "e1b6b3fd44a5bbb76e15399629d75ea30aab300ca32b80fbfb1129b82dd04374", "avgDurationSecs": 3.576, "validDurationSecs": 8735.76, "totalHrs": 1.79, "validHrs": 2.43 }, "gl": { "duration": 62035344, "reportedSentences": 12, "validatedSentences": 392, "unvalidatedSentences": -257, "clips": 13688, "users": 76, "size": 355112243, "checksum": "f04aebb37d72290957d6173a2df20cbc6a8dbdeec93321f7493f25de40c77545", "avgDurationSecs": 5.001, "validDurationSecs": 23971.32, "totalHrs": 17.23, 
"validHrs": 6.66 }, "gn": { "duration": 4486464, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 5, "clips": 904, "users": 50, "size": 25447430, "checksum": "3a9c8f2e4591a6eaaa9b6e6f84a37f02cf8feda1a6241eacbc7152132910e139", "avgDurationSecs": 4.57, "validDurationSecs": 874.21, "totalHrs": 1.24, "validHrs": 0.25 }, "ha": { "duration": 300492, "reportedSentences": 0, "validatedSentences": 3, "unvalidatedSentences": -3, "clips": 60, "users": 1, "size": 1451909, "checksum": "f38d16d790de83f735db828614661d5bb63d559a2d997a3c721c39e79d5f677d", "avgDurationSecs": 4.355, "validDurationSecs": 239.47, "totalHrs": 0.09, "validHrs": 0.07 }, "he": { "duration": 407880, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 3, "clips": 56, "users": 6, "size": 2328874, "checksum": "0292672d5c1dcd4ad4783f2b5d5b69c3e0effe2ae24daa18e1019e4b93c631eb", "avgDurationSecs": 4.564, "validDurationSecs": 1099.05, "totalHrs": 0.11, "validHrs": 0.3 }, "hi": { "duration": 441756, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 70, "users": 7, "size": 2603189, "checksum": "85885a33e07d006a55a4f3c5920c907fb821d59759527ecfaa0f36ccaf31ec6f", "avgDurationSecs": 5.015, "validDurationSecs": 121.42, "totalHrs": 0.12, "validHrs": 0.04 }, "hsb": { "duration": 130680, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 15, "users": 2, "size": 776551, "checksum": "25c5dcc2c728a0ad904d73a65b851fe19893dccafd97d97d6d16b19ee522d4b3", "avgDurationSecs": 6.264, "validDurationSecs": 46.83, "totalHrs": 0.04, "validHrs": 0.02 }, "ht": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -42, "checksum": "83e08f8823780debaa601ecee7b10abd12d60ca6dd187f30dd6497fe6a11c024", "avgDurationSecs": 5.566, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "hu": { "duration": 1367460, "reportedSentences": 0, "validatedSentences": 161, 
"unvalidatedSentences": -158, "clips": 250, "users": 7, "size": 8063672, "checksum": "ed96c84f3ca0be1ab700afdd5cff3b3acf9ebbfd5831187a041c45959a289bc3", "avgDurationSecs": 5.539, "validDurationSecs": 2993.02, "totalHrs": 0.38, "validHrs": 0.83 }, "hy-AM": { "duration": 353484, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 65, "users": 2, "size": 2049397, "checksum": "7c79542e55bab99548a0539aca15ad5e4085b784334071f5ffad53d0cecbadea", "avgDurationSecs": 5.392, "validDurationSecs": 61.01, "totalHrs": 0.09, "validHrs": 0.02 }, "ia": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -5951, "checksum": "6db05f61adc6f5d93e10e39e693984afd24a33a3e7ecc0c260866507211ea964", "avgDurationSecs": 4.202, "validDurationSecs": 487.39, "totalHrs": 0, "validHrs": 0.13 }, "id": { "duration": 754524, "reportedSentences": 8, "validatedSentences": 4, "unvalidatedSentences": -3, "clips": 187, "users": 13, "size": 4309301, "checksum": "8fcb715da58a4b12d33d77dca472c0eddf754d94f0ed7b7bb571bc9328bb81df", "avgDurationSecs": 4.003, "validDurationSecs": 455.35, "totalHrs": 0.21, "validHrs": 0.13 }, "ig": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -10239, "checksum": "613290cd15397a26fd42e9fb58e512c768b111bf869fe145286eb9426829b3d4", "avgDurationSecs": 5.464, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "is": { "duration": 116280, "reportedSentences": 1, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 15, "users": 1, "size": 618438, "checksum": "3d901d13ef269ec6f2c96413eda338f1f4c5cdd093b94ae3499b9440decc9ec6", "avgDurationSecs": 7.169, "validDurationSecs": 6.12, "totalHrs": 0.03, "validHrs": 0 }, "it": { "duration": 14199156, "reportedSentences": 7, "validatedSentences": 195, "unvalidatedSentences": -191, "clips": 2370, "users": 38, "size": 83678455, "checksum": 
"15314a3e90501d28db8f63432332a07d61e0fd6594b7ff65582f62c6548d5dd4", "avgDurationSecs": 5.425, "validDurationSecs": 3147.05, "totalHrs": 3.94, "validHrs": 0.88 }, "ja": { "duration": 927900, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 155, "users": 18, "size": 5453458, "checksum": "3571b4d00f1ee0a4f77d35140dd70cdf8599669d26cc42290bf33a376ff4d839", "avgDurationSecs": 4.542, "validDurationSecs": 1569.31, "totalHrs": 0.26, "validHrs": 0.44 }, "ka": { "duration": 41857812, "reportedSentences": 64, "validatedSentences": 187, "unvalidatedSentences": -121, "clips": 7374, "users": 200, "size": 242251857, "checksum": "577b3c16eff42d54201b5386946176da97eb393f5c9fc5bab0b52b772b68e651", "avgDurationSecs": 5.215, "validDurationSecs": 29314.02, "totalHrs": 11.62, "validHrs": 8.14 }, "kab": { "duration": 504864, "reportedSentences": 8, "validatedSentences": 20, "unvalidatedSentences": 430, "clips": 132, "users": 5, "size": 2803596, "checksum": "d7083bef2fd342b36610935c50c1d9956f874f84d16d5189c59495d58d92bdb3", "avgDurationSecs": 3.341, "validDurationSecs": 1218.63, "totalHrs": 0.14, "validHrs": 0.34 }, "kk": { "duration": 130608, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 2, "clips": 21, "users": 4, "size": 773711, "checksum": "b3fba646b58c2f915d138197ad953ba84dee7892d2a4dbbca91d425ee0f5cfbb", "avgDurationSecs": 4.932, "validDurationSecs": 140.2, "totalHrs": 0.04, "validHrs": 0.04 }, "kln": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -4058, "checksum": "d0cb5b8854dcdce628d3f41c74c213c36123037aeb4c800151bb3aa146457f31", "avgDurationSecs": 4.705, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "kmr": { "duration": 648720, "reportedSentences": 0, "validatedSentences": 1, "unvalidatedSentences": 15, "clips": 123, "users": 7, "size": 3579513, "checksum": "947d5faba8675fa7a016f4b04cb143dd724e0132dfbc51645f3953c57a7843e3", 
"avgDurationSecs": 4.097, "validDurationSecs": 1270.89, "totalHrs": 0.18, "validHrs": 0.36 }, "ko": { "duration": 869688, "reportedSentences": 0, "validatedSentences": 66, "unvalidatedSentences": 83, "clips": 168, "users": 11, "size": 4942792, "checksum": "8bf375a20aebe7f01df9026eba32a064e98b5355b64265f44d2048fdc9415d10", "avgDurationSecs": 5.342, "validDurationSecs": 185.1, "totalHrs": 0.24, "validHrs": 0.05 }, "ky": { "duration": 42696, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 10, "users": 3, "size": 226465, "checksum": "14d360f456cba2b8e63a06a009a6bcd240a25696b3b594cf488d903b1668cecd", "avgDurationSecs": 4.553, "validDurationSecs": 79.66, "totalHrs": 0.01, "validHrs": 0.02 }, "lg": { "duration": 3870576, "reportedSentences": 34, "validatedSentences": 0, "unvalidatedSentences": 1, "clips": 661, "users": 4, "size": 21657937, "checksum": "cc58a4e4df241679e743fe625cf433259fb25f2d8b4ce026112808389addd0af", "avgDurationSecs": 5.784, "validDurationSecs": 170, "totalHrs": 1.08, "validHrs": 0.05 }, "lij": { "duration": 60804, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 15, "users": 1, "size": 358990, "checksum": "0384ccbcd83b731fbe136125a03620942b69ab39183f0540663f9fa0571d4439", "avgDurationSecs": 3.923, "validDurationSecs": 56.66, "totalHrs": 0.02, "validHrs": 0.01 }, "lo": { "duration": 41796, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 5, "users": 1, "size": 248897, "checksum": "a878f9b617683c7df588fbd3b907381e511ad30b5a69156d5dc43290299faeaa", "avgDurationSecs": 6.532, "validDurationSecs": 110.21, "totalHrs": 0.01, "validHrs": 0.03 }, "lt": { "duration": 484380, "reportedSentences": 0, "validatedSentences": 3, "unvalidatedSentences": -3, "clips": 85, "users": 5, "size": 2793560, "checksum": "2c5046fa9d1ddab6732e84acfee3c4b9ee5239a0042e806858b90572b40bb63d", "avgDurationSecs": 5.144, "validDurationSecs": 2584.86, "totalHrs": 0.13, "validHrs": 
0.72 }, "ltg": { "duration": 8636544, "reportedSentences": 6, "validatedSentences": 3362, "unvalidatedSentences": -2360, "clips": 1290, "users": 20, "size": 50808181, "checksum": "b8cec3cadb4f219b53ef8655dd17125c3ee510b78c34cb3d126cc6a93e5efcf4", "avgDurationSecs": 4.753, "validDurationSecs": 6060.52, "totalHrs": 2.4, "validHrs": 1.68 }, "luo": { "duration": 14760, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 2, "users": 1, "size": 95564, "checksum": "9b0037747957bae0ead68e57a96fd6b8aaeceb2fd25cd4da87e5a802da463b01", "avgDurationSecs": 5.205, "validDurationSecs": 1.07, "totalHrs": 0, "validHrs": 0 }, "lv": { "duration": 10116972, "reportedSentences": 4, "validatedSentences": 518, "unvalidatedSentences": -401, "clips": 1614, "users": 78, "size": 58088360, "checksum": "bff89d8119a3b3eee21502e550db87b6b10d5bb1293067036cf770e29671ceb3", "avgDurationSecs": 4.795, "validDurationSecs": 7015.97, "totalHrs": 2.81, "validHrs": 1.95 }, "mdf": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 74, "checksum": "b958bc1ae355adf5a87d3739e9e75e5dcdd60df9cbcdb474ddc08880362eb86f", "avgDurationSecs": 5.25, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "mhr": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -28770, "checksum": "af5b260aee7cd25f038f7ebe16f39ffad3b8f5292f386cbc4314d8af04b4b8eb", "avgDurationSecs": 4.623, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "mk": { "duration": 867024, "reportedSentences": 0, "validatedSentences": 38, "unvalidatedSentences": -33, "clips": 153, "users": 5, "size": 5117955, "checksum": "afcb173cc7505b5cbcf2864811c2cf13565d8dcff70598426048298aa3e91d6b", "avgDurationSecs": 4.809, "validDurationSecs": 4511.69, "totalHrs": 0.25, "validHrs": 1.26 }, "ml": { "duration": 166356, "reportedSentences": 0, "validatedSentences": 0, 
"unvalidatedSentences": 0, "clips": 36, "users": 2, "size": 933026, "checksum": "eaab49423798bb65a3f0a2907d07240cf5eb119853dbf02dc0a6d2afe3394577", "avgDurationSecs": 4.219, "validDurationSecs": 157.09, "totalHrs": 0.04, "validHrs": 0.05 }, "mn": { "duration": 197568, "reportedSentences": 0, "validatedSentences": 1, "unvalidatedSentences": 1, "clips": 27, "users": 13, "size": 1163711, "checksum": "136fccf3f29b6743dc4ffb0f41dc15e3dd8f055f2dcfa05408fc7a3815b20dcf", "avgDurationSecs": 5.254, "validDurationSecs": 282.15, "totalHrs": 0.06, "validHrs": 0.07 }, "mr": { "duration": 132120, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 18, "users": 2, "size": 764434, "checksum": "4f9737ee3858f422e59a798f14d15e1174b9ba4859f180787f53b6339a9a751e", "avgDurationSecs": 6.2, "validDurationSecs": 45.04, "totalHrs": 0.03, "validHrs": 0.02 }, "mrj": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -6563, "checksum": "db3d01e832885e76b692473ec4fc7ba6b289244e66d3d7e6ecaa1f8745456b03", "avgDurationSecs": 4.193, "validDurationSecs": 29.35, "totalHrs": 0, "validHrs": 0.01 }, "mt": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 2258, "checksum": "1fb863d9418f3429f11dc907aeccdbb01a942fdb6bae87c532b71726b35ad831", "avgDurationSecs": 4.754, "validDurationSecs": 47.54, "totalHrs": 0, "validHrs": 0.01 }, "myv": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -759, "checksum": "7d0f7a2e52d42026ab3782deac261a1a9e2bc7eab7eab930e9c89ffa3718eb56", "avgDurationSecs": 5.774, "validDurationSecs": 17.32, "totalHrs": 0, "validHrs": 0.01 }, "nan-tw": { "duration": 1517472, "reportedSentences": 0, "validatedSentences": 816, "unvalidatedSentences": 0, "clips": 592, "users": 17, "size": 8580007, "checksum": 
"d3aedf4f12eccbab360ecb94ee403f897e57e2ef22bcddf3ad1dc7a4feba8d50", "avgDurationSecs": 2.642, "validDurationSecs": 6458.78, "totalHrs": 0.42, "validHrs": 1.79 }, "nb-NO": { "duration": 573624, "reportedSentences": 0, "validatedSentences": 29, "unvalidatedSentences": -27, "clips": 126, "users": 12, "size": 3383393, "checksum": "fdc512f7e961d56c90edff56c0fd8759dfc732324dd6b13b38d9fba05e2aeb37", "avgDurationSecs": 4.242, "validDurationSecs": 1133.36, "totalHrs": 0.16, "validHrs": 0.32 }, "ne-NP": { "duration": 172044, "reportedSentences": 0, "validatedSentences": 5, "unvalidatedSentences": -5, "clips": 31, "users": 7, "size": 1015921, "checksum": "45129a700890d9a834b659524867863b52a8d4dba682235dc05efc0d4420a690", "avgDurationSecs": 4.081, "validDurationSecs": 106.24, "totalHrs": 0.05, "validHrs": 0.03 }, "nhi": { "duration": 109872, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 2, "clips": 20, "users": 1, "size": 568424, "checksum": "bf85be412ecd2d5e697f18273120ebd36fb32c26b5e6e13e0058636938c92b39", "avgDurationSecs": 5.085, "validDurationSecs": 5.84, "totalHrs": 0.03, "validHrs": 0 }, "nl": { "duration": 26904600, "reportedSentences": 40, "validatedSentences": 556, "unvalidatedSentences": 217, "clips": 6212, "users": 44, "size": 155670468, "checksum": "00b20b5cfaf907e86d7674c8b3fe61c5947e425966bd5ca41ed0a20cbb25a5e4", "avgDurationSecs": 4.369, "validDurationSecs": 27767.49, "totalHrs": 7.48, "validHrs": 7.72 }, "nn-NO": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -151, "checksum": "80382ca5eb7bdd90d5c5223459da560dffabb5cadb3c2538524c1ed8d9b87720", "avgDurationSecs": 4.387, "validDurationSecs": 74.59, "totalHrs": 0, "validHrs": 0.03 }, "nr": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -7, "checksum": "ed5250c98b204bf22e182c69042a45172937171f186b4c3d331a4a072446566f", 
"avgDurationSecs": 9.738, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "nso": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 20, "checksum": "14f1c22a0b7a8d548baf627e6e38daf93aab5e7e08f1cb42c39d922670d509a3", "avgDurationSecs": 4.63, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "oc": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 2938, "checksum": "ec58c8ef1677ca532e3483bb5cc817375f0b4093a123fd31c6f7c5be4e17d03b", "avgDurationSecs": 4.871, "validDurationSecs": 19.49, "totalHrs": 0, "validHrs": 0.01 }, "or": { "duration": 1187208, "reportedSentences": 0, "validatedSentences": 2, "unvalidatedSentences": -2, "clips": 191, "users": 4, "size": 6974463, "checksum": "84f84dd6cfef84a6a4be4f92c4fbaf57da8c49ce4d6aa4ea8784f1c87599f538", "avgDurationSecs": 5.575, "validDurationSecs": 265.95, "totalHrs": 0.33, "validHrs": 0.07 }, "os": { "duration": 759744, "reportedSentences": 0, "validatedSentences": 223, "unvalidatedSentences": -223, "clips": 134, "users": 1, "size": 4490008, "checksum": "cc3dc20c8437991d9b1f4213e2882579ad96186fd7d28a1e71ab4934fd63d742", "avgDurationSecs": 5.517, "validDurationSecs": 1774.82, "totalHrs": 0.21, "validHrs": 0.49 }, "pa-IN": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -1101, "checksum": "80599faa974bb6e233d93333bd4ee5caf6bd1cc5dcc827b723a6f42cee01b8bf", "avgDurationSecs": 4.821, "validDurationSecs": 57.85, "totalHrs": 0, "validHrs": 0.02 }, "pl": { "duration": 3198168, "reportedSentences": 5, "validatedSentences": 122, "unvalidatedSentences": -122, "clips": 542, "users": 37, "size": 18637444, "checksum": "8b766d3a5e58aa37fed2ee3d0465816c8d1b1d06ba57e7ec7a62e77f20be86d8", "avgDurationSecs": 4.567, "validDurationSecs": 3589.9, "totalHrs": 0.89, "validHrs": 0.99 }, "ps": { "duration": 
26652888, "reportedSentences": 13, "validatedSentences": 1570, "unvalidatedSentences": 24613, "clips": 6183, "users": 28, "size": 142490678, "checksum": "c9247548a7f0c910a8bcae31165d6a58d7b85cc7d2ef273cb5c659d0540b019e", "avgDurationSecs": 4.908, "validDurationSecs": 7593.97, "totalHrs": 7.4, "validHrs": 2.11 }, "pt": { "duration": 20435976, "reportedSentences": 6, "validatedSentences": 89, "unvalidatedSentences": -54, "clips": 3729, "users": 91, "size": 119030798, "checksum": "dee1da586d8e7703c6cb50f2fdc436bf914dd6d1d8fda261e728eb382b00c3ab", "avgDurationSecs": 4.179, "validDurationSecs": 10314.65, "totalHrs": 5.68, "validHrs": 2.87 }, "quy": { "duration": 0, "reportedSentences": 0, "validatedSentences": 4, "unvalidatedSentences": -3, "clips": 0, "users": 0, "size": -1416, "checksum": "911a6bb2393ed7eec5417019c644540014c58663be3eef04854e5724fb55b9e3", "avgDurationSecs": 8.637, "validDurationSecs": 43.18, "totalHrs": 0, "validHrs": 0.01 }, "rm-sursilv": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -748, "checksum": "0db054e42ace768072c6858dcefbea606d7da67f9adc9c6cb4e168cdc1636d02", "avgDurationSecs": 5.438, "validDurationSecs": 43.5, "totalHrs": 0, "validHrs": 0.02 }, "rm-vallader": { "duration": 191520, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 30, "users": 2, "size": 1134138, "checksum": "b693365e13e8f7b8baf21ebf39f9018de281a9bd3573a8f87023ce1933189fcd", "avgDurationSecs": 5.832, "validDurationSecs": 15.44, "totalHrs": 0.06, "validHrs": 0.01 }, "ro": { "duration": 923724, "reportedSentences": 6, "validatedSentences": 4, "unvalidatedSentences": 7, "clips": 197, "users": 9, "size": 5333713, "checksum": "37c2335f7da449b23410e92e28b36830908054c3e3aae72bf7afe741b8c2dbe2", "avgDurationSecs": 4.04, "validDurationSecs": 529.74, "totalHrs": 0.26, "validHrs": 0.14 }, "ru": { "duration": 9022284, "reportedSentences": 9, "validatedSentences": 184, 
"unvalidatedSentences": -163, "clips": 1427, "users": 117, "size": 52411685, "checksum": "804b57b724599373805a444b698cb3f6676731a93bb1b48911d00f3555a7b494", "avgDurationSecs": 5.184, "validDurationSecs": 9697.82, "totalHrs": 2.5, "validHrs": 2.69 }, "rw": { "duration": 137952, "reportedSentences": 4, "validatedSentences": 0, "unvalidatedSentences": 4, "clips": 39, "users": 8, "size": -21456, "checksum": "778bb12a199a048cf242875c1c9e617cc51fc1a62fe24b7dc6b06c10386c2df1", "avgDurationSecs": 5.007, "validDurationSecs": 132.13, "totalHrs": 0.03, "validHrs": 0.04 }, "sah": { "duration": 32463972, "reportedSentences": 7, "validatedSentences": 40, "unvalidatedSentences": -36, "clips": 4958, "users": 116, "size": 188264326, "checksum": "ca9336f7fad58cc8477aea3921549333b286955a30e4e940ea00b90fe3e9653a", "avgDurationSecs": 6.337, "validDurationSecs": 23579.99, "totalHrs": 9.02, "validHrs": 6.55 }, "sat": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 89, "checksum": "0ddeb729ca8044be4ebcb38d1a74bab69dcb05f994cf852eea1f5ed93fca7cf8", "avgDurationSecs": 4.478, "validDurationSecs": 31.35, "totalHrs": 0, "validHrs": 0.01 }, "sc": { "duration": 97740, "reportedSentences": 0, "validatedSentences": 28, "unvalidatedSentences": -27, "clips": 20, "users": 1, "size": 574247, "checksum": "7b08a6f3cc703fd46648715ddc2eb1cf71167d5490a2f2a48a34630d35a38851", "avgDurationSecs": 4.693, "validDurationSecs": 148.64, "totalHrs": 0.03, "validHrs": 0.04 }, "sd": { "duration": 29081016, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 31, "clips": 7138, "users": 1, "size": 154629966, "checksum": "52782e840d6a1f344c356b8098a6c5119831a21270567bd5088a8fb47632fb93", "avgDurationSecs": 4.193, "validDurationSecs": -10.53, "totalHrs": 8.08, "validHrs": 0 }, "sk": { "duration": 18860472, "reportedSentences": 6, "validatedSentences": 1277, "unvalidatedSentences": -977, "clips": 4061, "users": 2, "size": 
111006220, "checksum": "a060e586bc7f841f9d332094a65b4f0e5c0afed91bc5238c3b7cba9ee37687a3", "avgDurationSecs": 4.263, "validDurationSecs": 17216.17, "totalHrs": 5.24, "validHrs": 4.78 }, "skr": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -235, "checksum": "d64811f4b50822850df855b91a0927a58efe010c564a0d7014e8732e11698b4f", "avgDurationSecs": 4.164, "validDurationSecs": 41.64, "totalHrs": 0, "validHrs": 0.01 }, "sl": { "duration": 1871496, "reportedSentences": 4, "validatedSentences": 26, "unvalidatedSentences": -22, "clips": 414, "users": 62, "size": 10810058, "checksum": "5d83371bddf988e6d9f6c7b6328a2498ea2f838b4cbe35651f319a8a584f3b27", "avgDurationSecs": 4.001, "validDurationSecs": 1512.17, "totalHrs": 0.52, "validHrs": 0.42 }, "sq": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 1, "clips": 0, "users": 0, "size": 7598, "checksum": "94e939778c5376a987c3307c55ff85f6507f76c357a49647a5b414618c256c78", "avgDurationSecs": 5.1, "validDurationSecs": 5.1, "totalHrs": 0, "validHrs": 0 }, "sr": { "duration": 99216, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 20, "users": 4, "size": 613188, "checksum": "c1dd2f0329172999113089abfd66c7e207aa2d0cf82ed32ffad585bd4e05712a", "avgDurationSecs": 3.238, "validDurationSecs": 88.99, "totalHrs": 0.02, "validHrs": 0.02 }, "st": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -139, "checksum": "c871ed4065e84e405d611971f6951fb8031764c8ac89f32c763fdd201f65761d", "avgDurationSecs": 3.223, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "sv-SE": { "duration": 1008252, "reportedSentences": 5, "validatedSentences": 4, "unvalidatedSentences": -4, "clips": 201, "users": 12, "size": 5868949, "checksum": "4c9ca58f99a6b8bf9f05a260d6b70685aea8fbe1eb75c582b3edf75b224b3b84", "avgDurationSecs": 4.023, 
"validDurationSecs": 1607.79, "totalHrs": 0.28, "validHrs": 0.44 }, "sw": { "duration": 446220, "reportedSentences": 0, "validatedSentences": 16, "unvalidatedSentences": -16, "clips": 63, "users": 10, "size": 2668232, "checksum": "9b77dfc836fccd0e282f0476a1d7e0d7d26f6f9544cd2b4319f4d5fa4437d655", "avgDurationSecs": 5.387, "validDurationSecs": 190.2, "totalHrs": 0.12, "validHrs": 0.06 }, "ta": { "duration": 2085084, "reportedSentences": 1, "validatedSentences": 51, "unvalidatedSentences": -51, "clips": 383, "users": 6, "size": 11847929, "checksum": "ee0244f13ae3c4838872ac86a2cb01b93ae36c5ede206292c503d930f0a7e644", "avgDurationSecs": 6.197, "validDurationSecs": 2053.45, "totalHrs": 0.58, "validHrs": 0.57 }, "te": { "duration": 477216, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 119, "users": 3, "size": 2800644, "checksum": "6cc309b0752b9925e8e8a199a8e09126bc404aed2ec5b96a2ebde43e7d4a711d", "avgDurationSecs": 4.177, "validDurationSecs": 118.45, "totalHrs": 0.13, "validHrs": 0.03 }, "th": { "duration": 5246892, "reportedSentences": 21, "validatedSentences": 13, "unvalidatedSentences": 34, "clips": 1066, "users": 20, "size": 31183050, "checksum": "a28a23a9096c6a573c91da3fda41b38916ecea488b077961906b334c3968f9e4", "avgDurationSecs": 4.193, "validDurationSecs": 1246.73, "totalHrs": 1.46, "validHrs": 0.34 }, "ti": { "duration": 72396, "reportedSentences": 0, "validatedSentences": 3, "unvalidatedSentences": -1, "clips": 12, "users": 2, "size": 430981, "checksum": "7c45d649eadd751fa3089280682e59280bfb7f5e758065dbf6a4197336cff1e4", "avgDurationSecs": 5.057, "validDurationSecs": 81.97, "totalHrs": 0.02, "validHrs": 0.03 }, "tig": { "duration": 7658352, "reportedSentences": 6, "validatedSentences": 29, "unvalidatedSentences": -16, "clips": 1259, "users": 27, "size": 41097576, "checksum": "0923bdf7db441e1482bb52b8ad0e00db02bbd1c9626b5a98d28f6c2592a40223", "avgDurationSecs": 6.345, "validDurationSecs": 14428.23, "totalHrs": 2.13, 
"validHrs": 4.01 }, "tk": { "duration": 631368, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 35, "clips": 124, "users": 4, "size": 3716291, "checksum": "ebd5200da295e5589e6b588a9817496e0743c00a9241e5cf4f1b71e5a2a12769", "avgDurationSecs": 5.475, "validDurationSecs": 176.97, "totalHrs": 0.18, "validHrs": 0.05 }, "tn": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 10063, "checksum": "341de72016cc2665a994bbbfd70e9b04ee7ce21e4a402513b203d4d72efe13ca", "avgDurationSecs": 4.37, "validDurationSecs": 34.96, "totalHrs": 0, "validHrs": 0.01 }, "tok": { "duration": 2480940, "reportedSentences": 0, "validatedSentences": 11, "unvalidatedSentences": 9, "clips": 429, "users": 1, "size": 14112444, "checksum": "66677e7cc4635162ce59b6d66edd1ea4f265102f80362d331204ce6287c6bc5d", "avgDurationSecs": 4.206, "validDurationSecs": 1142.49, "totalHrs": 0.69, "validHrs": 0.32 }, "tr": { "duration": 4724244, "reportedSentences": 3, "validatedSentences": 3, "unvalidatedSentences": 5, "clips": 788, "users": 49, "size": 27897919, "checksum": "f346ae93cddc9c6fd619ac85454e6bc3d310510b8d4525b11f68bb2f9fab1033", "avgDurationSecs": 3.829, "validDurationSecs": 4449.57, "totalHrs": 1.31, "validHrs": 1.24 }, "ts": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -7, "checksum": "8f97c9bfbeb77f64bb8b6291e3b8532d72467b0221396d92e7afddac903382f7", "avgDurationSecs": 8.928, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "tt": { "duration": 23652, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 5, "users": 1, "size": 142588, "checksum": "1a07944c2af78f7990edc486f0cca36a395640022b5a4aa75b5143e5faab5662", "avgDurationSecs": 3.831, "validDurationSecs": 149.9, "totalHrs": 0.01, "validHrs": 0.04 }, "tw": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, 
"unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 2102, "checksum": "e0f4e3f9315550317c4a2f3fae74fe0578a0fc27614c6e51342df70be58ac58a", "avgDurationSecs": 4.31, "validDurationSecs": 112.06, "totalHrs": 0, "validHrs": 0.03 }, "ug": { "duration": 118536516, "reportedSentences": 78, "validatedSentences": 96, "unvalidatedSentences": -24, "clips": 17358, "users": 66, "size": 691605580, "checksum": "abe193521d310e78b5da6332e28da12d5143cb66b389f0451aa056d74df9da19", "avgDurationSecs": 5.882, "validDurationSecs": 181329.68, "totalHrs": 32.93, "validHrs": 50.37 }, "uk": { "duration": 756720, "reportedSentences": 1, "validatedSentences": 1, "unvalidatedSentences": 3, "clips": 160, "users": 10, "size": 3915702, "checksum": "d5a8a7d126da081055108c0d08d9c3c511e3ab83f9bbae2d768b75ceb93a7e8f", "avgDurationSecs": 4.642, "validDurationSecs": 912.73, "totalHrs": 0.21, "validHrs": 0.25 }, "ur": { "duration": 72576, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 15, "users": 3, "size": 452482, "checksum": "6845fa00738c84d89aa9a25f3dc0712d1dd42b33fc00ca0a223e68ab0d2fc8e9", "avgDurationSecs": 4.297, "validDurationSecs": 229.92, "totalHrs": 0.02, "validHrs": 0.06 }, "uz": { "duration": 1553364, "reportedSentences": 0, "validatedSentences": 6, "unvalidatedSentences": 2, "clips": 281, "users": 27, "size": 8974838, "checksum": "5a9331d6477037363462fa134f0b8aa3c6cdc7eb61aa7c35075af8731599c9a5", "avgDurationSecs": 4.157, "validDurationSecs": 653.21, "totalHrs": 0.43, "validHrs": 0.19 }, "vi": { "duration": 4094064, "reportedSentences": 0, "validatedSentences": 445, "unvalidatedSentences": 189, "clips": 951, "users": 40, "size": 23887475, "checksum": "e281c6ea8e8516b07ae3e3ffb3e0116399dcae336bb74c9b61aca23f698d90ac", "avgDurationSecs": 3.967, "validDurationSecs": 1316.29, "totalHrs": 1.14, "validHrs": 0.36 }, "vot": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -62, 
"checksum": "4f6fb207960f0b55489e98aec261ea157521aca4e7d72a95315361174975659e", "avgDurationSecs": 2.412, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "xh": { "duration": 85572, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 15, "users": 1, "size": 506008, "checksum": "b2029fd5507e05909b5d9afa48a9fddc2ee96aaad3bce5013e918fe5a25b6dd4", "avgDurationSecs": 5.99, "validDurationSecs": 35.66, "totalHrs": 0.02, "validHrs": 0.01 }, "yi": { "duration": 1231236, "reportedSentences": 3, "validatedSentences": 544, "unvalidatedSentences": -514, "clips": 342, "users": 12, "size": 7166779, "checksum": "41684592981aed4ee1e8f34fd78193dd02666eb26a34b492f3235bf3a287335b", "avgDurationSecs": 3.919, "validDurationSecs": 1062.79, "totalHrs": 0.34, "validHrs": 0.3 }, "yo": { "duration": 1288800, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 1, "clips": 174, "users": 4, "size": 6552609, "checksum": "e9316cdfc2c057b33d0d2d17d1e3c117106ff92b1fbdaa6e9825cb73e4a0dd2f", "avgDurationSecs": 6.041, "validDurationSecs": 665.74, "totalHrs": 0.36, "validHrs": 0.18 }, "yue": { "duration": 893448, "reportedSentences": 0, "validatedSentences": 440, "unvalidatedSentences": 2532, "clips": 205, "users": 11, "size": 5265138, "checksum": "82628321e0e7e0872e8f61e0b927b59da7df9b18a7d9a3e128afcb808bb977c7", "avgDurationSecs": 3.967, "validDurationSecs": 804.69, "totalHrs": 0.25, "validHrs": 0.22 }, "zgh": { "duration": 319392, "reportedSentences": 0, "validatedSentences": 1, "unvalidatedSentences": 1, "clips": 78, "users": 4, "size": 1876888, "checksum": "0dd6449e6255af6283c308939f7376b26d92ce15ed1606b92ad28908cd805f1b", "avgDurationSecs": 3.453, "validDurationSecs": 184.64, "totalHrs": 0.09, "validHrs": 0.06 }, "zh-CN": { "duration": 3310920, "reportedSentences": 14, "validatedSentences": 34, "unvalidatedSentences": -3, "clips": 888, "users": 85, "size": 19342146, "checksum": 
"fa27b9366c3ef5bc9b2683da131105b890f6a263196c3c5dbb7de42dbe4fbbca", "avgDurationSecs": 4.542, "validDurationSecs": 1756.49, "totalHrs": 0.92, "validHrs": 0.49 }, "zh-HK": { "duration": 4546476, "reportedSentences": 9, "validatedSentences": 60, "unvalidatedSentences": -41, "clips": 1215, "users": 5, "size": 26579170, "checksum": "563fbc0cd3710f4d24cebede8bdf1be8cb449abec5039c1cc3a237673d1a1990", "avgDurationSecs": 4.157, "validDurationSecs": 347.97, "totalHrs": 1.26, "validHrs": 0.09 }, "zh-TW": { "duration": 3846096, "reportedSentences": 0, "validatedSentences": 5, "unvalidatedSentences": 4, "clips": 1126, "users": 21, "size": 22421760, "checksum": "a1ebf7a80ae720bfa83c410ae1688cf9983217a420a38c0c092014aacf721366", "avgDurationSecs": 3.364, "validDurationSecs": 1239.9, "totalHrs": 1.07, "validHrs": 0.34 }, "zu": { "duration": 60264, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 10, "users": 2, "size": 356597, "checksum": "f9c182c7f4235175080116c955d0ee32bdcf53890a8079a7752090c219e9b693", "avgDurationSecs": 3.518, "validDurationSecs": 22.7, "totalHrs": 0.01, "validHrs": 0.01 }, "zza": { "duration": 0, "reportedSentences": 0, "validatedSentences": 10, "unvalidatedSentences": -10, "clips": 0, "users": 0, "size": 1926, "checksum": "4e063cc3b9d11340c2e6a0b1fe3ae968fa1fee2942be26002ab50b9cadeb1911", "avgDurationSecs": 4.103, "validDurationSecs": 127.21, "totalHrs": 0, "validHrs": 0.04 } }, "totalDuration": 1012332132, "totalValidDurationSecs": 1068279, "totalHrs": 281, "totalValidHrs": 296 } ================================================ FILE: datasets/scripted-speech/cv-corpus-23.0-2025-09-05.json ================================================ { "locales": { "ab": { "buckets": { "dev": 9153, "test": 9133, "train": 21038, "validated": 42177, "invalidated": 5291, "other": 16746 }, "duration": 332442828, "reportedSentences": 225, "validatedSentences": 1046779, "unvalidatedSentences": 5, "clips": 64214, "splits": { "accent": {}, 
"age": { "": 0.2, "twenties": 0.17, "thirties": 0.17, "teens": 0.26, "fourties": 0.08, "fifties": 0.06, "sixties": 0.05, "seventies": 0.01, "eighties": 0.01, "nineties": 0 }, "gender": { "": 0.2, "male_masculine": 0.17, "female_feminine": 0.63, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 64214, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 423, "size": 1975076512, "checksum": "59d052e0dcec3c55f1684b24853b081e91787548326267717824170965fecfb4", "avgDurationSecs": 5.177, "validDurationSecs": 218354.894, "totalHrs": 92.34, "validHrs": 60.65 }, "abb": { "buckets": { "dev": 293, "test": 313, "train": 390, "validated": 7887, "invalidated": 35, "other": 0 }, "duration": 40474692, "reportedSentences": 0, "validatedSentences": 996, "unvalidatedSentences": -1, "clips": 7922, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7922, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 22, "size": 238185615, "checksum": "ee3eeb26e84496e7f7c3f5fe175bfb6cb97e0dfaa7803b266c0edf7e76c09ba7", "avgDurationSecs": 5.109, "validDurationSecs": 40295.872, "totalHrs": 11.24, "validHrs": 11.19 }, "ady": { "buckets": { "dev": 2433, "test": 2460, "train": 2929, "validated": 24954, "invalidated": 883, "other": 
8404 }, "duration": 172064520, "reportedSentences": 580, "validatedSentences": 9370, "unvalidatedSentences": 2218, "clips": 34241, "splits": { "accent": {}, "age": { "": 0.11, "twenties": 0.28, "thirties": 0.19, "teens": 0.04, "fourties": 0.15, "fifties": 0.17, "sixties": 0.06, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.35, "male_masculine": 0.03, "female_feminine": 0.61, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.01 }, "sentence_domain": { "": 260, "agriculture_food": 15, "automotive_transport": 0, "finance": 0, "service_retail": 52, "general": 76241, "healthcare": 40, "history_law_government": 0, "language_fundamentals": 378, "media_entertainment": 72, "nature_environment": 6, "news_current_affairs": 80, "technology_robotics": 0 } }, "users": 109, "size": 979732501, "checksum": "c8f47ec303a24670137bf03ca179696e0cb80062e99820c5fa16b6fee79653ba", "avgDurationSecs": 5.025, "validDurationSecs": 125396.397, "totalHrs": 47.79, "validHrs": 34.83 }, "af": { "buckets": { "dev": 118, "test": 125, "train": 181, "validated": 424, "invalidated": 200, "other": 345 }, "duration": 5884884, "reportedSentences": 73, "validatedSentences": 4794, "unvalidatedSentences": 16572, "clips": 969, "splits": { "accent": {}, "age": { "": 0.32, "twenties": 0.31, "thirties": 0, "teens": 0.01, "fourties": 0.21, "fifties": 0.15, "sixties": 0, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.46, "male_masculine": 0.32, "female_feminine": 0.22, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 969, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 63, "size": 35649182, "checksum": "5f64c4111802e497891bfb099b084b9128f7ff3e8066fb87bb15c9a34de7fbcf", "avgDurationSecs": 
6.073, "validDurationSecs": 2575.016, "totalHrs": 1.63, "validHrs": 0.71 }, "ajg": { "buckets": { "dev": 560, "test": 774, "train": 1873, "validated": 19153, "invalidated": 210, "other": 35 }, "duration": 46342044, "reportedSentences": 0, "validatedSentences": 3213, "unvalidatedSentences": -1, "clips": 19398, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 19398, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 23, "size": 263856109, "checksum": "c6a1f253f9de2c5dd792585f975c2bd2f80db5be55c22bc93d2a2f0007d055b2", "avgDurationSecs": 2.389, "validDurationSecs": 45756.736, "totalHrs": 12.87, "validHrs": 12.71 }, "am": { "buckets": { "dev": 248, "test": 257, "train": 530, "validated": 1035, "invalidated": 29, "other": 567 }, "duration": 10270872, "reportedSentences": 11, "validatedSentences": 2208, "unvalidatedSentences": 12, "clips": 1631, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0.14, "thirties": 0.61, "teens": 0, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.25, "male_masculine": 0.75, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1631, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, 
"users": 45, "size": 60528708, "checksum": "05d9fe4f4451a961093e71178564c26dde4a111a026ebd70294e72d6b87f06e7", "avgDurationSecs": 6.297, "validDurationSecs": 6517.69, "totalHrs": 2.85, "validHrs": 1.81 }, "an": { "buckets": { "dev": 2872, "test": 2920, "train": 4190, "validated": 13277, "invalidated": 758, "other": 0 }, "duration": 64174644, "reportedSentences": 8, "validatedSentences": 10454, "unvalidatedSentences": 357, "clips": 14035, "splits": { "accent": {}, "age": { "": 0.19, "twenties": 0.02, "thirties": 0.2, "teens": 0, "fourties": 0.32, "fifties": 0.11, "sixties": 0.12, "seventies": 0.04, "eighties": 0, "nineties": 0 }, "gender": { "": 0.71, "male_masculine": 0.15, "female_feminine": 0.14, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 456, "agriculture_food": 0, "automotive_transport": 0, "finance": 2, "service_retail": 2, "general": 13633, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 52, "size": 374116173, "checksum": "21995ae5097121bc7f83e701f8bf2763e84f5f283baf68bb984cadbd8bcf7844", "avgDurationSecs": 4.572, "validDurationSecs": 60708.71, "totalHrs": 17.82, "validHrs": 16.86 }, "ar": { "buckets": { "dev": 10181, "test": 10508, "train": 28881, "validated": 79255, "invalidated": 15230, "other": 41320 }, "duration": 565138530, "reportedSentences": 2136, "validatedSentences": 59434, "unvalidatedSentences": 1742110, "clips": 135805, "splits": { "accent": {}, "age": { "": 0.55, "twenties": 0.29, "thirties": 0.11, "teens": 0.03, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.55, "male_masculine": 0.28, "female_feminine": 0.17, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 135805, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, 
"general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1646, "size": 3478851759, "checksum": "22d224c4f8e8d24f518eb7cb7143f34016e11f7d75384b19041e5c3fe66eaff4", "avgDurationSecs": 4.161, "validDurationSecs": 329811.525, "totalHrs": 156.98, "validHrs": 91.61 }, "as": { "buckets": { "dev": 485, "test": 394, "train": 953, "validated": 1832, "invalidated": 282, "other": 2567 }, "duration": 27604649, "reportedSentences": 249, "validatedSentences": 7153, "unvalidatedSentences": 3362, "clips": 4681, "splits": { "accent": {}, "age": { "": 0.28, "twenties": 0.16, "thirties": 0.02, "teens": 0, "fourties": 0.52, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.82, "male_masculine": 0.18, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 4670, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 11, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 11 } }, "users": 51, "size": 167863073, "checksum": "5818719cd64da0151645ead75f7e31316136a365482f88f377c0cff4caadaa3c", "avgDurationSecs": 5.897, "validDurationSecs": 10803.614, "totalHrs": 7.66, "validHrs": 3 }, "ast": { "buckets": { "dev": 113, "test": 203, "train": 434, "validated": 758, "invalidated": 31, "other": 890 }, "duration": 7444944, "reportedSentences": 0, "validatedSentences": 1642, "unvalidatedSentences": 290, "clips": 1679, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.1, "thirties": 0.12, "teens": 0.04, "fourties": 0.28, "fifties": 0.21, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male_masculine": 0.6, "female_feminine": 0.15, "transgender": 0, 
"non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1679, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 30, "size": 42479856, "checksum": "44272c9c6275d7fc6cb9eb90e07a2d521b8c118e4c9e1683a2814a54ee55d7fe", "avgDurationSecs": 4.434, "validDurationSecs": 3361.088, "totalHrs": 2.06, "validHrs": 0.93 }, "az": { "buckets": { "dev": 87, "test": 104, "train": 164, "validated": 355, "invalidated": 50, "other": 554 }, "duration": 5233896, "reportedSentences": 5, "validatedSentences": 93161, "unvalidatedSentences": 1850, "clips": 959, "splits": { "accent": {}, "age": { "": 0.21, "twenties": 0.13, "thirties": 0.64, "teens": 0.02, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.81, "male_masculine": 0.18, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 959, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 43, "size": 36871900, "checksum": "4f0a9e3b1eef4aeccbf582477b78427a1d1561230a637b81ee57fb0c3ebf4582", "avgDurationSecs": 5.458, "validDurationSecs": 1937.469, "totalHrs": 1.45, "validHrs": 0.53 }, "ba": { "buckets": { "dev": 14525, "test": 14569, "train": 119138, "validated": 210457, "invalidated": 8011, "other": 45 }, "duration": 967369932, "reportedSentences": 868, "validatedSentences": 153968, "unvalidatedSentences": 5, "clips": 218513, "splits": { "accent": {}, "age": { "": 0.3, "twenties": 0.17, "thirties": 0.17, "teens": 0.04, "fourties": 0.06, 
"fifties": 0.05, "sixties": 0.19, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male_masculine": 0.3, "female_feminine": 0.39, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 218513, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 929, "size": 5451267289, "checksum": "e7c87fe3df3b695bec6aa8dba3fd57c5e680baf50ce61d0d54ff08984130285f", "avgDurationSecs": 4.427, "validDurationSecs": 931705.545, "totalHrs": 268.71, "validHrs": 258.8 }, "bag": { "buckets": { "dev": 261, "test": 324, "train": 421, "validated": 7027, "invalidated": 49, "other": 36 }, "duration": 40652964, "reportedSentences": 1, "validatedSentences": 1006, "unvalidatedSentences": 5, "clips": 7112, "splits": { "accent": {}, "age": { "": 0.84, "twenties": 0, "thirties": 0.02, "teens": 0, "fourties": 0.14, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.84, "male_masculine": 0, "female_feminine": 0.02, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.14 }, "sentence_domain": { "": 7112, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 14, "size": 228628900, "checksum": "9ca1b390418e7aa8375ff24b0697c4d7fae044c9cc79d92e9b4079f52cb7c580", "avgDurationSecs": 5.716, "validDurationSecs": 40167.095, "totalHrs": 11.29, "validHrs": 11.15 }, "bas": { "buckets": { "dev": 1324, "test": 1550, "train": 2112, "validated": 11137, "invalidated": 1217, "other": 120 }, "duration": 48972240, "reportedSentences": 8, "validatedSentences": 
5226, "unvalidatedSentences": 105, "clips": 12474, "splits": { "accent": {}, "age": { "": 0.39, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0.61, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 12469, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 5, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 54, "size": 253911069, "checksum": "b3cd82259bb616e37e69153f9730e5a3f01194098261b71492d1bb2d5ebad6de", "avgDurationSecs": 3.926, "validDurationSecs": 43723.251, "totalHrs": 13.6, "validHrs": 12.14 }, "bax": { "buckets": { "dev": 319, "test": 338, "train": 373, "validated": 7777, "invalidated": 80, "other": 829 }, "duration": 44092836, "reportedSentences": 0, "validatedSentences": 1030, "unvalidatedSentences": -1, "clips": 8686, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8686, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 13, "size": 251909099, "checksum": "f3a2705a2f817804b57496d6be696dfd03aa3a2071e34f6be81439a15535fd64", "avgDurationSecs": 5.076, "validDurationSecs": 39478.469, "totalHrs": 12.24, "validHrs": 10.96 }, "bba": { "buckets": { "dev": 251, "test": 236, "train": 292, 
"validated": 6273, "invalidated": 2279, "other": 0 }, "duration": 54094284, "reportedSentences": 94, "validatedSentences": 971, "unvalidatedSentences": -1, "clips": 8552, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8552, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 14, "size": 297930314, "checksum": "2cb1bcdeb524baf43fd3761c11c8317c7c8d08a163ef33ccc3afa8305d93ffbd", "avgDurationSecs": 6.325, "validDurationSecs": 39678.84, "totalHrs": 15.02, "validHrs": 11.02 }, "bbj": { "buckets": { "dev": 321, "test": 329, "train": 346, "validated": 7235, "invalidated": 17, "other": 272 }, "duration": 46645812, "reportedSentences": 1, "validatedSentences": 996, "unvalidatedSentences": 1, "clips": 7524, "splits": { "accent": {}, "age": { "": 0.85, "twenties": 0.02, "thirties": 0.13, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.86, "male_masculine": 0, "female_feminine": 0.14, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7524, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 31, "size": 272473063, "checksum": "3d274a2ab019c918804e3ee9148a3a4f25b17b552134170e91ccb7af5a818796", "avgDurationSecs": 6.2, 
"validDurationSecs": 44854.127, "totalHrs": 12.95, "validHrs": 12.45 }, "bbl": { "buckets": { "dev": 377, "test": 377, "train": 407, "validated": 4580, "invalidated": 130, "other": 82 }, "duration": 42232860, "reportedSentences": 0, "validatedSentences": 1661, "unvalidatedSentences": 454, "clips": 4792, "splits": { "accent": {}, "age": { "": 0.02, "twenties": 0, "thirties": 0.15, "teens": 0, "fourties": 0, "fifties": 0.04, "sixties": 0.55, "seventies": 0.24, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0, "female_feminine": 0.67, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 242, "agriculture_food": 14, "automotive_transport": 28, "finance": 0, "service_retail": 0, "general": 4520, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 20, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 22, "size": 250654925, "checksum": "2f2ddfe4cca1cb85f26f8c9e4e11516c4e3b0b58a4988847102d2e38afc01b7f", "avgDurationSecs": 8.813, "validDurationSecs": 40364.461, "totalHrs": 11.73, "validHrs": 11.21 }, "bce": { "buckets": { "dev": 320, "test": 322, "train": 336, "validated": 6117, "invalidated": 24, "other": 0 }, "duration": 36181656, "reportedSentences": 0, "validatedSentences": 986, "unvalidatedSentences": -1, "clips": 6141, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6141, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, 
"users": 14, "size": 146585266, "checksum": "5329890d39d0df87ac83755174acebed2ebb41992248fc65d8075e198eebdfe5", "avgDurationSecs": 5.892, "validDurationSecs": 36040.252, "totalHrs": 10.05, "validHrs": 10.01 }, "bci": { "buckets": { "dev": 267, "test": 290, "train": 319, "validated": 5620, "invalidated": 141, "other": 1417 }, "duration": 53374860, "reportedSentences": 1, "validatedSentences": 876, "unvalidatedSentences": -1, "clips": 7178, "splits": { "accent": {}, "age": { "": 0.86, "twenties": 0.14, "thirties": 0.01, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.93, "male_masculine": 0, "female_feminine": 0.07, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7178, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 16, "size": 309033530, "checksum": "f1979a11cc4a9ec292144a6f739c3a0d8f4c6c14f32c6ec1b038fb0817fdc5e8", "avgDurationSecs": 7.436, "validDurationSecs": 41789.734, "totalHrs": 14.82, "validHrs": 11.6 }, "be": { "buckets": { "dev": 15877, "test": 15875, "train": 347681, "validated": 1359476, "invalidated": 36903, "other": 18571 }, "duration": 6779513448, "reportedSentences": 3200, "validatedSentences": 379544, "unvalidatedSentences": 1935, "clips": 1414950, "splits": { "accent": {}, "age": { "": 0.87, "twenties": 0.03, "thirties": 0.04, "teens": 0.01, "fourties": 0.04, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.87, "male_masculine": 0.06, "female_feminine": 0.07, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1414868, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 3, "general": 33, 
"healthcare": 0, "history_law_government": 28, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 4, "news_current_affairs": 7, "technology_robotics": 7 } }, "users": 8586, "size": 38607016835, "checksum": "389eddb83229528b9d958b18453e5aad07bb09fbfee1243f278fb1d34a291ef3", "avgDurationSecs": 4.791, "validDurationSecs": 6513718.382, "totalHrs": 1883.19, "validHrs": 1809.36 }, "beb": { "buckets": { "dev": 324, "test": 331, "train": 350, "validated": 6742, "invalidated": 737, "other": 2 }, "duration": 40845996, "reportedSentences": 0, "validatedSentences": 1005, "unvalidatedSentences": -1, "clips": 7481, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7481, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 33, "size": 235854066, "checksum": "680afc27337d48cfddd3d4a524aad2f4c1ef17c856d833452f094c2934e1302f", "avgDurationSecs": 5.46, "validDurationSecs": 36811.082, "totalHrs": 11.34, "validHrs": 10.22 }, "bfd": { "buckets": { "dev": 252, "test": 254, "train": 260, "validated": 6454, "invalidated": 53, "other": 673 }, "duration": 40598964, "reportedSentences": 0, "validatedSentences": 766, "unvalidatedSentences": -1, "clips": 7180, "splits": { "accent": {}, "age": { "": 0.99, "twenties": 0, "thirties": 0.01, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.99, "male_masculine": 0, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, 
"sentence_domain": { "": 7180, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 36, "size": 232039870, "checksum": "7c11165891522a35e335879882ffd999ee8bff34e52b257137346b1789ae0ff1", "avgDurationSecs": 5.654, "validDurationSecs": 36493.832, "totalHrs": 11.27, "validHrs": 10.13 }, "bft": { "buckets": { "dev": 1833, "test": 1799, "train": 2211, "validated": 10053, "invalidated": 489, "other": 16 }, "duration": 65071008, "reportedSentences": 37, "validatedSentences": 7968, "unvalidatedSentences": 12, "clips": 10558, "splits": { "accent": {}, "age": { "": 0.27, "twenties": 0.37, "thirties": 0.35, "teens": 0.01, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.78, "male_masculine": 0, "female_feminine": 0.13, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.1 }, "sentence_domain": { "": 10558, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 154, "size": 377801796, "checksum": "f4b525333362c12e7eba034c1c2aa31b9869129cb1684314f3c0f2b9b57698d3", "avgDurationSecs": 6.163, "validDurationSecs": 61958.595, "totalHrs": 18.07, "validHrs": 17.21 }, "bg": { "buckets": { "dev": 2940, "test": 3368, "train": 4962, "validated": 11287, "invalidated": 776, "other": 1794 }, "duration": 75819816, "reportedSentences": 191, "validatedSentences": 39572, "unvalidatedSentences": 3, "clips": 13857, "splits": { "accent": {}, "age": { "": 0.37, "twenties": 0.11, "thirties": 0.28, "teens": 0, "fourties": 0.23, "fifties": 0, "sixties": 0, "seventies": 0, 
"eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male_masculine": 0.58, "female_feminine": 0.04, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 13857, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 147, "size": 446800810, "checksum": "4d6ebc09c9e977ff694286d3d7a41b0ae472afe76e37c71fa51170a2f5c16340", "avgDurationSecs": 5.472, "validDurationSecs": 61757.831, "totalHrs": 21.06, "validHrs": 17.15 }, "bgp": { "buckets": { "dev": 1105, "test": 2135, "train": 3540, "validated": 7598, "invalidated": 266, "other": 39 }, "duration": 48108024, "reportedSentences": 0, "validatedSentences": 6997, "unvalidatedSentences": -1, "clips": 7903, "splits": { "accent": {}, "age": { "": 0.95, "twenties": 0.05, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.99, "male_masculine": 0, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7903, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 25, "size": 284078832, "checksum": "f4fd0ec272a352a1e7436b020c2026a2b091eadd0f3c805d047ac1df79a8447a", "avgDurationSecs": 6.087, "validDurationSecs": 46251.394, "totalHrs": 13.36, "validHrs": 12.84 }, "bkh": { "buckets": { "dev": 305, "test": 304, "train": 377, "validated": 5614, "invalidated": 1949, "other": 0 }, "duration": 52500744, "reportedSentences": 2, "validatedSentences": 986, "unvalidatedSentences": 1, "clips": 7563, "splits": { 
"accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7563, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 13, "size": 289012007, "checksum": "d4d62b4afbbceb405d56b09195182960b25192e357c4fb68980baf9b8c5192df", "avgDurationSecs": 6.942, "validDurationSecs": 38971.199, "totalHrs": 14.58, "validHrs": 10.82 }, "bkm": { "buckets": { "dev": 304, "test": 310, "train": 326, "validated": 7573, "invalidated": 260, "other": 0 }, "duration": 45229536, "reportedSentences": 0, "validatedSentences": 940, "unvalidatedSentences": -1, "clips": 7833, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7833, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 27, "size": 266199999, "checksum": "ab39e1d141b7c39132bc9c66d6fa5fbaabdcd4b2e1c8a9ee051ec0b859819ea5", "avgDurationSecs": 5.774, "validDurationSecs": 43728.236, "totalHrs": 12.56, "validHrs": 12.14 }, "bn": { "buckets": { "dev": 9391, "test": 9391, "train": 21560, "validated": 44634, "invalidated": 7926, "other": 999488 }, "duration": 
4598208216, "reportedSentences": 1890, "validatedSentences": 562827, "unvalidatedSentences": 94116, "clips": 1052048, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0.67, "thirties": 0.03, "teens": 0.07, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.23, "male_masculine": 0.54, "female_feminine": 0.23, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1052048, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 22994, "size": 26572531188, "checksum": "050cc3d7765c3b2b5021c22102c7d4579dc61257d44bb620b04f3fcae72e289b", "avgDurationSecs": 4.371, "validDurationSecs": 195082.758, "totalHrs": 1277.28, "validHrs": 54.18 }, "bnm": { "buckets": { "dev": 282, "test": 331, "train": 412, "validated": 7675, "invalidated": 83, "other": 216 }, "duration": 60507684, "reportedSentences": 2, "validatedSentences": 1025, "unvalidatedSentences": -1, "clips": 7974, "splits": { "accent": {}, "age": { "": 0.65, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0.06, "fifties": 0.12, "sixties": 0.05, "seventies": 0.12, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7974, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 21, "size": 357640381, "checksum": "9692dd832e5b9112f021665e41f3191da504f888cafbb97328631ac9a0547638", "avgDurationSecs": 7.588, "validDurationSecs": 
58238.836, "totalHrs": 16.8, "validHrs": 16.17 }, "bnn": { "buckets": { "dev": 958, "test": 960, "train": 1040, "validated": 7274, "invalidated": 946, "other": 0 }, "duration": 41843016, "reportedSentences": 3, "validatedSentences": 3167, "unvalidatedSentences": 8, "clips": 8220, "splits": { "accent": {}, "age": { "": 0.6, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0.06, "sixties": 0.34, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.18, "male_masculine": 0.06, "female_feminine": 0.41, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.35 }, "sentence_domain": { "": 273, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 10094, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 18, "size": 246702378, "checksum": "d3caa0b1f0034ddd2c0ec9fdb6962615db58e1c65404f0609846acada05b0ede", "avgDurationSecs": 5.09, "validDurationSecs": 37027.506, "totalHrs": 11.62, "validHrs": 10.28 }, "br": { "buckets": { "dev": 3111, "test": 3090, "train": 3884, "validated": 32453, "invalidated": 2353, "other": 31 }, "duration": 113054803, "reportedSentences": 303, "validatedSentences": 22349, "unvalidatedSentences": 87, "clips": 34837, "splits": { "accent": {}, "age": { "": 0.27, "twenties": 0.3, "thirties": 0.11, "teens": 0.01, "fourties": 0.06, "fifties": 0.08, "sixties": 0.12, "seventies": 0.06, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0.61, "female_feminine": 0.06, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 31703, "agriculture_food": 2, "automotive_transport": 1, "finance": 1, "service_retail": 4, "general": 3094, "healthcare": 10, "history_law_government": 9, "language_fundamentals": 0, "media_entertainment": 8, "nature_environment": 6, "news_current_affairs": 0, 
"technology_robotics": 4 } }, "users": 258, "size": 770745520, "checksum": "724240bd5d6d99dd60f7642ee0dc8c48bf066eec0f44c6d4711d73a97e88b3f1", "avgDurationSecs": 3.245, "validDurationSecs": 105318.125, "totalHrs": 31.4, "validHrs": 29.25 }, "brh": { "buckets": { "dev": 504, "test": 882, "train": 1705, "validated": 5082, "invalidated": 4, "other": 22 }, "duration": 37155096, "reportedSentences": 0, "validatedSentences": 3095, "unvalidatedSentences": -1, "clips": 5108, "splits": { "accent": {}, "age": { "": 0.81, "twenties": 0, "thirties": 0.19, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5108, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 18, "size": 215880237, "checksum": "edccfa0e214b21e4a965ea87aec2789dac8d8ae7a948578a4f0bbd31e557ae31", "avgDurationSecs": 7.274, "validDurationSecs": 36965.975, "totalHrs": 10.32, "validHrs": 10.26 }, "bri": { "buckets": { "dev": 203, "test": 298, "train": 699, "validated": 8838, "invalidated": 234, "other": 122 }, "duration": 39939480, "reportedSentences": 12, "validatedSentences": 1200, "unvalidatedSentences": -1, "clips": 9194, "splits": { "accent": {}, "age": { "": 0.86, "twenties": 0.01, "thirties": 0.13, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.87, "male_masculine": 0, "female_feminine": 0.13, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 9194, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, 
"history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 15, "size": 197701250, "checksum": "f0efb6e506fde6ff3006f434f645c17e65a99e0888633269957f979e451e3e80", "avgDurationSecs": 4.344, "validDurationSecs": 38392.987, "totalHrs": 11.09, "validHrs": 10.66 }, "bsh": { "buckets": { "dev": 515, "test": 693, "train": 1420, "validated": 6765, "invalidated": 60, "other": 37 }, "duration": 37426032, "reportedSentences": 2, "validatedSentences": 2646, "unvalidatedSentences": 40, "clips": 6862, "splits": { "accent": {}, "age": { "": 0.15, "twenties": 0.57, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0.28, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6862, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 14, "size": 220730792, "checksum": "7cd4026b22f4f341748df65c55e78e5533850e0915fb4809f5f5a49173e1525f", "avgDurationSecs": 5.454, "validDurationSecs": 36896.984, "totalHrs": 10.39, "validHrs": 10.24 }, "bsk": { "buckets": { "dev": 348, "test": 733, "train": 1187, "validated": 8615, "invalidated": 686, "other": 1 }, "duration": 39648312, "reportedSentences": 2, "validatedSentences": 2447, "unvalidatedSentences": 22, "clips": 9302, "splits": { "accent": {}, "age": { "": 0.3, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0.46, "fifties": 0.24, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { 
"": 9248, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 54, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 27, "size": 229277556, "checksum": "77ddb92222b2000d723b0638f4b3db6b0f7aad6dd28f2a54469b341f4485942f", "avgDurationSecs": 4.262, "validDurationSecs": 36720.083, "totalHrs": 11.01, "validHrs": 10.2 }, "btv": { "buckets": { "dev": 222, "test": 329, "train": 452, "validated": 9105, "invalidated": 127, "other": 51 }, "duration": 37660860, "reportedSentences": 263, "validatedSentences": 1053, "unvalidatedSentences": 9, "clips": 9283, "splits": { "accent": {}, "age": { "": 0.47, "twenties": 0.32, "thirties": 0, "teens": 0, "fourties": 0.21, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.89, "male_masculine": 0.11, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 9274, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 8, "healthcare": 1, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 16, "size": 215819049, "checksum": "e246f2b6f4fe3cca8a94d14e1a78573b80aa91bea28426455c35ec19edf7b25d", "avgDurationSecs": 4.057, "validDurationSecs": 36938.719, "totalHrs": 10.46, "validHrs": 10.26 }, "bum": { "buckets": { "dev": 269, "test": 285, "train": 347, "validated": 7751, "invalidated": 177, "other": 29 }, "duration": 37052424, "reportedSentences": 0, "validatedSentences": 901, "unvalidatedSentences": -1, "clips": 7957, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, 
"male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7957, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 22, "size": 210581757, "checksum": "2d9debba1a63ddc9edef9e1fa425e9a73838d37c6a47f94ebb71649d69b7a83f", "avgDurationSecs": 4.657, "validDurationSecs": 36093.168, "totalHrs": 10.29, "validHrs": 10.02 }, "byv": { "buckets": { "dev": 327, "test": 327, "train": 336, "validated": 7719, "invalidated": 0, "other": 0 }, "duration": 47684376, "reportedSentences": 0, "validatedSentences": 990, "unvalidatedSentences": 1027, "clips": 7719, "splits": { "accent": {}, "age": { "": 0.81, "twenties": 0.06, "thirties": 0.12, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.94, "male_masculine": 0, "female_feminine": 0.06, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7719, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 96, "size": 276990023, "checksum": "57fdd7c4c48682f523651b294f865624324ac8aa379f3ab0667e3e98089a447c", "avgDurationSecs": 6.178, "validDurationSecs": 47684.376, "totalHrs": 13.24, "validHrs": 13.24 }, "ca": { "buckets": { "dev": 16415, "test": 16415, "train": 1212809, "validated": 2281974, "invalidated": 139992, "other": 236128 }, "duration": 13835384874, "reportedSentences": 8864, "validatedSentences": 1302505, "unvalidatedSentences": 4835, "clips": 2658094, "splits": { "accent": {}, "age": { "": 
0.27, "twenties": 0.04, "thirties": 0.05, "teens": 0.01, "fourties": 0.12, "fifties": 0.19, "sixties": 0.29, "seventies": 0.04, "eighties": 0, "nineties": 0 }, "gender": { "": 0.27, "male_masculine": 0.52, "female_feminine": 0.2, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0, "intersex": 0 }, "sentence_domain": { "": 2657032, "agriculture_food": 44, "automotive_transport": 27, "finance": 4, "service_retail": 63, "general": 618, "healthcare": 34, "history_law_government": 65, "language_fundamentals": 32, "media_entertainment": 33, "nature_environment": 114, "news_current_affairs": 32, "technology_robotics": 21 } }, "users": 36816, "size": 83174802177, "checksum": "52e902e6d0fe22eb6bac3966683dc7fe9f650d3e35af2e0be25047fa264531d9", "avgDurationSecs": 5.205, "validDurationSecs": 11877679.481, "totalHrs": 3843.16, "validHrs": 3299.35 }, "cjk": { "buckets": { "dev": 377, "test": 377, "train": 387, "validated": 7333, "invalidated": 1195, "other": 0 }, "duration": 50697864, "reportedSentences": 0, "validatedSentences": 1141, "unvalidatedSentences": 1, "clips": 8528, "splits": { "accent": {}, "age": { "": 0.11, "twenties": 0.12, "thirties": 0.26, "teens": 0, "fourties": 0.49, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.62, "male_masculine": 0.03, "female_feminine": 0.35, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8528, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 45, "size": 279650901, "checksum": "ebf916048226c1150c54cf76e2156dcf6a817471fd6d8d6327304b6e4d708c88", "avgDurationSecs": 5.945, "validDurationSecs": 43593.743, "totalHrs": 14.08, "validHrs": 12.1 }, "ckb": { "buckets": { "dev": 5345, "test": 5345, "train": 7882, 
"validated": 119699, "invalidated": 9263, "other": 39443 }, "duration": 689481828, "reportedSentences": 2531, "validatedSentences": 19067, "unvalidatedSentences": 15418, "clips": 168405, "splits": { "accent": {}, "age": { "": 0.32, "twenties": 0.43, "thirties": 0.1, "teens": 0.02, "fourties": 0.02, "fifties": 0.01, "sixties": 0.1, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.32, "male_masculine": 0.62, "female_feminine": 0.05, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 168163, "agriculture_food": 0, "automotive_transport": 10, "finance": 0, "service_retail": 0, "general": 120, "healthcare": 0, "history_law_government": 10, "language_fundamentals": 10, "media_entertainment": 92, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1961, "size": 3810623767, "checksum": "9c8d043e11206cab5958f605ea7c175b283da58db2d510698278d088da10d54c", "avgDurationSecs": 4.094, "validDurationSecs": 490070.279, "totalHrs": 191.52, "validHrs": 136.13 }, "cnh": { "buckets": { "dev": 763, "test": 765, "train": 818, "validated": 2463, "invalidated": 437, "other": 3287 }, "duration": 21751656, "reportedSentences": 9, "validatedSentences": 5218, "unvalidatedSentences": -1, "clips": 6187, "splits": { "accent": {}, "age": { "": 0.48, "twenties": 0.4, "thirties": 0.07, "teens": 0.01, "fourties": 0.01, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.48, "male_masculine": 0.3, "female_feminine": 0.21, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6187, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 300, "size": 168183595, "checksum": 
"4bc9652d94b2d69dcb13cf7bd11088c9bc8c13ab8587684d6a72871ce7c2f6fd", "avgDurationSecs": 3.516, "validDurationSecs": 8659.177, "totalHrs": 6.04, "validHrs": 2.4 }, "cpy": { "buckets": { "dev": 188, "test": 294, "train": 518, "validated": 8162, "invalidated": 26, "other": 84 }, "duration": 36567612, "reportedSentences": 0, "validatedSentences": 1000, "unvalidatedSentences": 15, "clips": 8272, "splits": { "accent": {}, "age": { "": 0.01, "twenties": 0.12, "thirties": 0.12, "teens": 0.24, "fourties": 0.39, "fifties": 0, "sixties": 0.12, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.77, "male_masculine": 0, "female_feminine": 0.23, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8272, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 15, "size": 210673200, "checksum": "26f762d9edec665133c915d761732463ecae6a6075ea86d1ca1a0e179bac6c39", "avgDurationSecs": 4.421, "validDurationSecs": 36081.341, "totalHrs": 10.15, "validHrs": 10.02 }, "cs": { "buckets": { "dev": 9441, "test": 9383, "train": 21883, "validated": 64756, "invalidated": 2574, "other": 149393 }, "duration": 966694467, "reportedSentences": 972, "validatedSentences": 451146, "unvalidatedSentences": 212, "clips": 216723, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.19, "thirties": 0.47, "teens": 0.02, "fourties": 0.07, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.25, "male_masculine": 0.53, "female_feminine": 0.22, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 216701, "agriculture_food": 0, "automotive_transport": 1, "finance": 0, "service_retail": 0, "general": 15, "healthcare": 0, "history_law_government": 1, 
"language_fundamentals": 0, "media_entertainment": 4, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 7 } }, "users": 1117, "size": 5947871827, "checksum": "07b7b40ad5cd78f7b65f9596758776d76299d50ee6a5efa857bc7851fd7967fc", "avgDurationSecs": 4.461, "validDurationSecs": 288844.594, "totalHrs": 268.52, "validHrs": 80.23 }, "cut": { "buckets": { "dev": 290, "test": 325, "train": 378, "validated": 5481, "invalidated": 5, "other": 130 }, "duration": 37239084, "reportedSentences": 0, "validatedSentences": 994, "unvalidatedSentences": 1, "clips": 5616, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0, "thirties": 0.42, "teens": 0, "fourties": 0.17, "fifties": 0.16, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.32, "male_masculine": 0, "female_feminine": 0.68, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5616, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 15, "size": 219706810, "checksum": "be3aefaf588801c24af19af6b97f548f9a070eabe60dba5e6d73eeae605d90d0", "avgDurationSecs": 6.631, "validDurationSecs": 36343.914, "totalHrs": 10.34, "validHrs": 10.09 }, "cux": { "buckets": { "dev": 647, "test": 494, "train": 1107, "validated": 9016, "invalidated": 325, "other": 9 }, "duration": 38339748, "reportedSentences": 1, "validatedSentences": 2251, "unvalidatedSentences": 18, "clips": 9350, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.36, "thirties": 0.23, "teens": 0.41, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.35, "male_masculine": 0, "female_feminine": 0.65, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 9350, 
"agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 11, "size": 174512106, "checksum": "9689eba64168706e09c51e29d6f7c8bb6b9222a6b0d2e5d3b65dab2c7497807d", "avgDurationSecs": 4.101, "validDurationSecs": 36970.178, "totalHrs": 10.64, "validHrs": 10.26 }, "cv": { "buckets": { "dev": 1242, "test": 1288, "train": 1456, "validated": 17497, "invalidated": 2215, "other": 744 }, "duration": 103130544, "reportedSentences": 144, "validatedSentences": 4382, "unvalidatedSentences": 4, "clips": 20456, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.5, "thirties": 0.01, "teens": 0.18, "fourties": 0.06, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.22, "male_masculine": 0.51, "female_feminine": 0.27, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 20455, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 1 } }, "users": 122, "size": 697772480, "checksum": "ded895e904272a247cb6162e520873814924e91a4e251255e08ff4b01f785443", "avgDurationSecs": 5.042, "validDurationSecs": 88212.511, "totalHrs": 28.64, "validHrs": 24.5 }, "cy": { "buckets": { "dev": 5403, "test": 5409, "train": 8024, "validated": 90804, "invalidated": 4469, "other": 20678 }, "duration": 569951901, "reportedSentences": 176, "validatedSentences": 118044, "unvalidatedSentences": 14, "clips": 115951, "splits": { "accent": {}, "age": { "": 0.43, "twenties": 0.13, "thirties": 0.09, "teens": 0.02, "fourties": 0.16, "fifties": 0.09, "sixties": 0.07, "seventies": 0.01, 
"eighties": 0, "nineties": 0 }, "gender": { "": 0.43, "male_masculine": 0.33, "female_feminine": 0.25, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 115950, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 1, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1922, "size": 4156589636, "checksum": "8d12d0d874f126560d2877038dcdeff7961eb28b4b02cb1ab7570318005a7a63", "avgDurationSecs": 4.915, "validDurationSecs": 446342.959, "totalHrs": 158.31, "validHrs": 123.98 }, "da": { "buckets": { "dev": 2630, "test": 2758, "train": 3602, "validated": 11320, "invalidated": 439, "other": 2296 }, "duration": 57112272, "reportedSentences": 685, "validatedSentences": 20807, "unvalidatedSentences": 274, "clips": 14055, "splits": { "accent": {}, "age": { "": 0.4, "twenties": 0.16, "thirties": 0.22, "teens": 0.01, "fourties": 0.16, "fifties": 0.04, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.46, "male_masculine": 0.48, "female_feminine": 0.06, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 14028, "agriculture_food": 1, "automotive_transport": 3, "finance": 0, "service_retail": 1, "general": 16, "healthcare": 1, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 9, "nature_environment": 2, "news_current_affairs": 0, "technology_robotics": 1 } }, "users": 294, "size": 331224563, "checksum": "b81caa13e72bf4a417b53a594bf5042a8f10b0c5e499422359bbd9f31f22083e", "avgDurationSecs": 4.063, "validDurationSecs": 45998.642, "totalHrs": 15.86, "validHrs": 12.77 }, "dag": { "buckets": { "dev": 379, "test": 395, "train": 422, "validated": 8051, "invalidated": 109, "other": 12574 }, "duration": 89165448, "reportedSentences": 0, "validatedSentences": 5003, 
"unvalidatedSentences": 15474, "clips": 20734, "splits": { "accent": {}, "age": { "": 0.37, "twenties": 0.38, "thirties": 0.23, "teens": 0, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.8, "male_masculine": 0.02, "female_feminine": 0.17, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.01 }, "sentence_domain": { "": 20721, "agriculture_food": 3, "automotive_transport": 3, "finance": 6, "service_retail": 0, "general": 7, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 56, "size": 495093952, "checksum": "aed9102b9e927a4055338bdd778070583243317dce70b802334eb8ac11de13a9", "avgDurationSecs": 4.3, "validDurationSecs": 34622.891, "totalHrs": 24.76, "validHrs": 9.61 }, "dar": { "buckets": { "dev": 188, "test": 228, "train": 585, "validated": 1013, "invalidated": 14, "other": 360 }, "duration": 6731208, "reportedSentences": 0, "validatedSentences": 2025, "unvalidatedSentences": 340, "clips": 1387, "splits": { "accent": {}, "age": { "": 0.02, "twenties": 0.74, "thirties": 0, "teens": 0.02, "fourties": 0, "fifties": 0.22, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.85, "male_masculine": 0, "female_feminine": 0.15, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 0, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 904, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 410, "nature_environment": 0, "news_current_affairs": 74, "technology_robotics": 0 } }, "users": 12, "size": 39739444, "checksum": "4455b631a51fae9d776a5c44781591caecddfb3185b4e5130c9a18fa82f6e53e", "avgDurationSecs": 4.853, "validDurationSecs": 4916.16, "totalHrs": 1.86, "validHrs": 1.36 }, "dav": { "buckets": { "dev": 1276, "test": 
1004, "train": 2098, "validated": 8264, "invalidated": 663, "other": 40703 }, "duration": 201431016, "reportedSentences": 11, "validatedSentences": 11773, "unvalidatedSentences": 20119, "clips": 49630, "splits": { "accent": {}, "age": { "": 0.04, "twenties": 0.61, "thirties": 0.1, "teens": 0, "fourties": 0.01, "fifties": 0.01, "sixties": 0.23, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.73, "male_masculine": 0, "female_feminine": 0.27, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 971, "agriculture_food": 499, "automotive_transport": 100, "finance": 304, "service_retail": 1218, "general": 45323, "healthcare": 967, "history_law_government": 334, "language_fundamentals": 150, "media_entertainment": 68, "nature_environment": 873, "news_current_affairs": 86, "technology_robotics": 75 } }, "users": 24, "size": 1084122785, "checksum": "0ba35ecbe65a58b7b771ff0e4eabf694211472ce8474771522eebff3713aa862", "avgDurationSecs": 4.059, "validDurationSecs": 33540.72, "totalHrs": 55.95, "validHrs": 9.31 }, "de": { "buckets": { "dev": 16198, "test": 16198, "train": 609238, "validated": 937519, "invalidated": 56529, "other": 16575 }, "duration": 5317885385, "reportedSentences": 11328, "validatedSentences": 2057955, "unvalidatedSentences": 1368, "clips": 1010623, "splits": { "accent": {}, "age": { "": 0.33, "twenties": 0.17, "thirties": 0.16, "teens": 0.03, "fourties": 0.17, "fifties": 0.11, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.34, "male_masculine": 0.58, "female_feminine": 0.08, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1010426, "agriculture_food": 0, "automotive_transport": 6, "finance": 0, "service_retail": 12, "general": 138, "healthcare": 15, "history_law_government": 14, "language_fundamentals": 3, "media_entertainment": 1, "nature_environment": 12, "news_current_affairs": 4, "technology_robotics": 9 } }, "users": 20323, 
"size": 36944261537, "checksum": "2bfd93244cd180ddd71dbb64baa6e29dc8cd52280c70dafd11f9ff0e12f80a4b", "avgDurationSecs": 5.262, "validDurationSecs": 4933213.066, "totalHrs": 1477.19, "validHrs": 1370.33 }, "dmk": { "buckets": { "dev": 0, "test": 938, "train": 3200, "validated": 10648, "invalidated": 123, "other": 21 }, "duration": 38273472, "reportedSentences": 732, "validatedSentences": 4139, "unvalidatedSentences": 194, "clips": 10792, "splits": { "accent": {}, "age": { "": 0.04, "twenties": 0.01, "thirties": 0.15, "teens": 0, "fourties": 0.81, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.95, "male_masculine": 0, "female_feminine": 0.05, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 10790, "agriculture_food": 2, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 2, "healthcare": 2, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 10, "size": 223097723, "checksum": "2ae376c6f53696204925b46fd7202530c86c7b11f4869fa24d372e15da69abc1", "avgDurationSecs": 3.546, "validDurationSecs": 37762.781, "totalHrs": 10.63, "validHrs": 10.48 }, "dml": { "buckets": { "dev": 1101, "test": 626, "train": 3190, "validated": 6115, "invalidated": 73, "other": 299 }, "duration": 39238596, "reportedSentences": 0, "validatedSentences": 5670, "unvalidatedSentences": 676, "clips": 6487, "splits": { "accent": {}, "age": { "": 0.08, "twenties": 0.09, "thirties": 0, "teens": 0, "fourties": 0.79, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6473, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 7, "healthcare": 0, "history_law_government": 
3, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 4, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 5, "size": 232074663, "checksum": "a209d8e74b0cae17ab645f668a50e3896eec47f433e0afca40c59a0b487e2bc1", "avgDurationSecs": 6.049, "validDurationSecs": 36988.441, "totalHrs": 10.89, "validHrs": 10.27 }, "dru": { "buckets": { "dev": 933, "test": 957, "train": 1074, "validated": 6576, "invalidated": 116, "other": 0 }, "duration": 37984392, "reportedSentences": 2, "validatedSentences": 3867, "unvalidatedSentences": 477, "clips": 6692, "splits": { "accent": {}, "age": { "": 0.88, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0.12, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0, "male_masculine": 0, "female_feminine": 0.12, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.87 }, "sentence_domain": { "": 70, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 10031, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 1759, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 20, "size": 223957938, "checksum": "362ad78e66907967287787c84056df1bd303a814716d66abae0d98129c75f020", "avgDurationSecs": 5.676, "validDurationSecs": 37325.966, "totalHrs": 10.55, "validHrs": 10.36 }, "dua": { "buckets": { "dev": 326, "test": 311, "train": 366, "validated": 7018, "invalidated": 1102, "other": 4 }, "duration": 55392156, "reportedSentences": 1, "validatedSentences": 1003, "unvalidatedSentences": 1, "clips": 8124, "splits": { "accent": {}, "age": { "": 0.77, "twenties": 0.11, "thirties": 0.1, "teens": 0, "fourties": 0.02, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.89, "male_masculine": 0, "female_feminine": 0.11, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8124, 
"agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 13, "size": 312394050, "checksum": "9605361cc89cfb67a9a11cd582aac3740e36574e4af0ca779cfe5d23ccc33ffe", "avgDurationSecs": 6.818, "validDurationSecs": 47851.077, "totalHrs": 15.38, "validHrs": 13.29 }, "dv": { "buckets": { "dev": 2242, "test": 2228, "train": 2649, "validated": 26533, "invalidated": 1659, "other": 15111 }, "duration": 221700519, "reportedSentences": 59, "validatedSentences": 7174, "unvalidatedSentences": 9, "clips": 43303, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.11, "thirties": 0.39, "teens": 0.01, "fourties": 0.23, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.24, "male_masculine": 0.24, "female_feminine": 0.52, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 43302, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 1, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 391, "size": 1409787542, "checksum": "36caf9ae414318e1d96bea52d361be70fc43fc8b665a9e031a50ecf88e8a0661", "avgDurationSecs": 5.12, "validDurationSecs": 135842.317, "totalHrs": 61.58, "validHrs": 37.73 }, "dyu": { "buckets": { "dev": 50, "test": 63, "train": 90, "validated": 211, "invalidated": 15, "other": 69 }, "duration": 1861236, "reportedSentences": 0, "validatedSentences": 5028, "unvalidatedSentences": 41, "clips": 295, "splits": { "accent": {}, "age": { "": 0.31, "twenties": 0.66, "thirties": 0, "teens": 0.02, "fourties": 0.02, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, 
"gender": { "": 0.31, "male_masculine": 0.39, "female_feminine": 0.3, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 295, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 35, "size": 10793551, "checksum": "1b182f55e456aa1c16e57ce9a72387b6c9002956df20cdac02fff3927e873724", "avgDurationSecs": 6.309, "validDurationSecs": 1331.257, "totalHrs": 0.51, "validHrs": 0.36 }, "ebr": { "buckets": { "dev": 0, "test": 228, "train": 644, "validated": 1544, "invalidated": 1, "other": 506 }, "duration": 11458872, "reportedSentences": 5, "validatedSentences": 872, "unvalidatedSentences": 3, "clips": 2051, "splits": { "accent": {}, "age": { "": 0.03, "twenties": 0.23, "thirties": 0.32, "teens": 0, "fourties": 0.42, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.68, "male_masculine": 0, "female_feminine": 0.32, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2051, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 6, "size": 64929063, "checksum": "1e81c7c1c519f6400318d58e2adb54647e471b4d7006c7f584ac347e79bbe87a", "avgDurationSecs": 5.587, "validDurationSecs": 8626.279, "totalHrs": 3.18, "validHrs": 2.39 }, "eko": { "buckets": { "dev": 353, "test": 414, "train": 519, "validated": 3950, "invalidated": 113, "other": 2 }, "duration": 37965708, "reportedSentences": 0, "validatedSentences": 1288, "unvalidatedSentences": 16, "clips": 4065, "splits": { "accent": {}, "age": { "": 0.09, "twenties": 
0.32, "thirties": 0.55, "teens": 0, "fourties": 0.05, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.47, "male_masculine": 0, "female_feminine": 0.53, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1790, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 2251, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 24, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 6, "size": 221398178, "checksum": "d172d430ccde1ed46de4d4ed8110d02013ff3ab5add5ef4b64d3e86814e75488", "avgDurationSecs": 9.34, "validDurationSecs": 36891.647, "totalHrs": 10.54, "validHrs": 10.24 }, "el": { "buckets": { "dev": 1708, "test": 1713, "train": 1921, "validated": 16818, "invalidated": 868, "other": 10340 }, "duration": 116361718, "reportedSentences": 72, "validatedSentences": 6884, "unvalidatedSentences": 4474, "clips": 28026, "splits": { "accent": {}, "age": { "": 0.31, "twenties": 0.12, "thirties": 0.36, "teens": 0.01, "fourties": 0.17, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0.63, "female_feminine": 0.03, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 28026, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 446, "size": 773054947, "checksum": "dd1a96f3ad2e17a1fdec55cb6d458977842df381a571837856f90b7ef7efc048", "avgDurationSecs": 4.152, "validDurationSecs": 69826.995, "totalHrs": 32.32, "validHrs": 19.39 }, "en": { "buckets": { "dev": 16402, "test": 16402, "train": 1143276, "validated": 1862170, "invalidated": 307995, "other": 
371586 }, "duration": 13438681531, "reportedSentences": 9705, "validatedSentences": 1680665, "unvalidatedSentences": 35345, "clips": 2541751, "splits": { "accent": {}, "age": { "": 0.36, "twenties": 0.25, "thirties": 0.14, "teens": 0.06, "fourties": 0.09, "fifties": 0.05, "sixties": 0.04, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.39, "male_masculine": 0.44, "female_feminine": 0.18, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2540653, "agriculture_food": 148, "automotive_transport": 8, "finance": 43, "service_retail": 28, "general": 583, "healthcare": 24, "history_law_government": 102, "language_fundamentals": 9, "media_entertainment": 106, "nature_environment": 41, "news_current_affairs": 13, "technology_robotics": 88 } }, "users": 98436, "size": 93235510373, "checksum": "85a16b871613c38fccaf76532018c94942fe8435f9bcc11d76d77508daf52961", "avgDurationSecs": 5.287, "validDurationSecs": 9845618.075, "totalHrs": 3732.96, "validHrs": 2734.89 }, "eo": { "buckets": { "dev": 14913, "test": 14897, "train": 144397, "validated": 852287, "invalidated": 127512, "other": 159354 }, "duration": 6917789167, "reportedSentences": 2394, "validatedSentences": 181100, "unvalidatedSentences": 21924, "clips": 1139153, "splits": { "accent": {}, "age": { "": 0.2, "twenties": 0.56, "thirties": 0.12, "teens": 0.05, "fourties": 0.05, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0.01, "nineties": 0 }, "gender": { "": 0.2, "male_masculine": 0.68, "female_feminine": 0.11, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1139150, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 3, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1873, "size": 41344147696, "checksum": 
"d1627f27610f7505bc5dd195422eafc516828bca2bad484a9156bdbae2c5dee9", "avgDurationSecs": 6.073, "validDurationSecs": 5175724.223, "totalHrs": 1921.6, "validHrs": 1437.7 }, "es": { "buckets": { "dev": 15897, "test": 15897, "train": 355918, "validated": 435083, "invalidated": 94834, "other": 1142844 }, "duration": 8161568658, "reportedSentences": 2643, "validatedSentences": 1082136, "unvalidatedSentences": 5064, "clips": 1672761, "splits": { "accent": {}, "age": { "": 0.13, "twenties": 0.53, "thirties": 0.09, "teens": 0.08, "fourties": 0.03, "fifties": 0.04, "sixties": 0.1, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.13, "male_masculine": 0.55, "female_feminine": 0.31, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1672639, "agriculture_food": 1, "automotive_transport": 4, "finance": 4, "service_retail": 3, "general": 38, "healthcare": 4, "history_law_government": 29, "language_fundamentals": 7, "media_entertainment": 7, "nature_environment": 11, "news_current_affairs": 18, "technology_robotics": 21 } }, "users": 26780, "size": 51443450722, "checksum": "4396c00fba2cf1830a8744b9f1d33fc2191f41d6e36371e0c4eff78a1d7ba69f", "avgDurationSecs": 4.879, "validDurationSecs": 2122813.586, "totalHrs": 2267.1, "validHrs": 589.67 }, "esu": { "buckets": { "dev": 0, "test": 501, "train": 6125, "validated": 7219, "invalidated": 0, "other": 0 }, "duration": 27443484, "reportedSentences": 17, "validatedSentences": 6929, "unvalidatedSentences": 5, "clips": 7219, "splits": { "accent": {}, "age": { "": 0, "twenties": 0, "thirties": 1, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7219, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, 
"language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 3, "size": 142869568, "checksum": "367742b8a4ad2384992c92a9f093bddecf2c97fc219de80b2c00e447f40c2988", "avgDurationSecs": 3.802, "validDurationSecs": 27443.484, "totalHrs": 7.62, "validHrs": 7.62 }, "et": { "buckets": { "dev": 2853, "test": 2853, "train": 3446, "validated": 26839, "invalidated": 7828, "other": 54 }, "duration": 233720570, "reportedSentences": 568, "validatedSentences": 10981, "unvalidatedSentences": 1288, "clips": 34721, "splits": { "accent": {}, "age": { "": 0.18, "twenties": 0.7, "thirties": 0.08, "teens": 0, "fourties": 0.03, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.23, "male_masculine": 0.51, "female_feminine": 0.26, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 33974, "agriculture_food": 41, "automotive_transport": 31, "finance": 34, "service_retail": 12, "general": 476, "healthcare": 23, "history_law_government": 33, "language_fundamentals": 12, "media_entertainment": 34, "nature_environment": 90, "news_current_affairs": 16, "technology_robotics": 33 } }, "users": 1029, "size": 1561248442, "checksum": "db6390a8173aceb0a34f4f6518340b23dd5c66b651f7b885f4a4a3381fded78c", "avgDurationSecs": 6.731, "validDurationSecs": 180663.759, "totalHrs": 64.92, "validHrs": 50.18 }, "eto": { "buckets": { "dev": 304, "test": 299, "train": 363, "validated": 10539, "invalidated": 346, "other": 87 }, "duration": 37933056, "reportedSentences": 0, "validatedSentences": 966, "unvalidatedSentences": -1, "clips": 10972, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": 
{ "": 10972, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 24, "size": 214564484, "checksum": "2070c586cb4cf87fab011e0f26e8c541d23199baef93b2b41f34a0e01a13bf20", "avgDurationSecs": 3.457, "validDurationSecs": 36436.062, "totalHrs": 10.53, "validHrs": 10.12 }, "eu": { "buckets": { "dev": 14801, "test": 14801, "train": 133943, "validated": 311322, "invalidated": 50844, "other": 101571 }, "duration": 2553044259, "reportedSentences": 1277, "validatedSentences": 1018120, "unvalidatedSentences": 12722, "clips": 463737, "splits": { "accent": {}, "age": { "": 0.45, "twenties": 0.17, "thirties": 0.06, "teens": 0.02, "fourties": 0.16, "fifties": 0.11, "sixties": 0.04, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.47, "male_masculine": 0.25, "female_feminine": 0.29, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0, "intersex": 0 }, "sentence_domain": { "": 443641, "agriculture_food": 6, "automotive_transport": 6, "finance": 4, "service_retail": 7, "general": 20164, "healthcare": 22, "history_law_government": 0, "language_fundamentals": 2, "media_entertainment": 6, "nature_environment": 6, "news_current_affairs": 0, "technology_robotics": 23 } }, "users": 11032, "size": 15654039546, "checksum": "4d28e418a096587fb0807ea03f12d4bdb0eb0c9e26b4d482af69795411e3c1d1", "avgDurationSecs": 5.505, "validDurationSecs": 1713943.129, "totalHrs": 709.17, "validHrs": 476.09 }, "ewo": { "buckets": { "dev": 283, "test": 292, "train": 308, "validated": 7569, "invalidated": 461, "other": 30 }, "duration": 70747596, "reportedSentences": 0, "validatedSentences": 883, "unvalidatedSentences": -1, "clips": 8060, "splits": { "accent": {}, "age": { "": 0.95, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, 
"sixties": 0.05, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8060, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 27, "size": 407606728, "checksum": "ae03700da2e06766a2023c351f043db3fb4d7ffed09241dafd4d5d626a8e0f56", "avgDurationSecs": 8.778, "validDurationSecs": 66437.786, "totalHrs": 19.65, "validHrs": 18.45 }, "fa": { "buckets": { "dev": 10683, "test": 10683, "train": 29839, "validated": 338410, "invalidated": 15265, "other": 34627 }, "duration": 1537520012, "reportedSentences": 2760, "validatedSentences": 57533, "unvalidatedSentences": 283458, "clips": 388302, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.33, "thirties": 0.35, "teens": 0.03, "fourties": 0.02, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.23, "male_masculine": 0.7, "female_feminine": 0.06, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 388277, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 25, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 4609, "size": 11103799210, "checksum": "68a65a5013d1860c120d8a8f0e1ffa899d61a982f5e77bb4d9d23800b41b63e2", "avgDurationSecs": 3.96, "validDurationSecs": 1339967.724, "totalHrs": 427.08, "validHrs": 372.21 }, "fan": { "buckets": { "dev": 327, "test": 328, "train": 349, "validated": 7706, "invalidated": 82, "other": 346 }, "duration": 42145596, "reportedSentences": 0, 
"validatedSentences": 1005, "unvalidatedSentences": -1, "clips": 8134, "splits": { "accent": {}, "age": { "": 0.89, "twenties": 0, "thirties": 0, "teens": 0.01, "fourties": 0.1, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8134, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 43, "size": 247415849, "checksum": "6073ac4713a001916babc6c918b6e1de43c2c678998d2e8d63c94410cd9b3ec8", "avgDurationSecs": 5.181, "validDurationSecs": 39927.952, "totalHrs": 11.7, "validHrs": 11.09 }, "fi": { "buckets": { "dev": 1776, "test": 1814, "train": 2081, "validated": 12003, "invalidated": 417, "other": 5179 }, "duration": 81415515, "reportedSentences": 78, "validatedSentences": 5848, "unvalidatedSentences": 49179, "clips": 17599, "splits": { "accent": {}, "age": { "": 0.32, "twenties": 0.16, "thirties": 0.17, "teens": 0.01, "fourties": 0.3, "fifties": 0.03, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.34, "male_masculine": 0.36, "female_feminine": 0.31, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 17599, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 245, "size": 486609282, "checksum": "1d0235ff7e95ecc7b24568db63d4acaccc2f8ce96cb12fb68ba449cba04b17bb", "avgDurationSecs": 4.626, "validDurationSecs": 55527.611, "totalHrs": 22.61, "validHrs": 15.42 }, "fmp": 
{ "buckets": { "dev": 326, "test": 327, "train": 351, "validated": 5206, "invalidated": 293, "other": 105 }, "duration": 44521524, "reportedSentences": 5, "validatedSentences": 1004, "unvalidatedSentences": -1, "clips": 5604, "splits": { "accent": {}, "age": { "": 0.74, "twenties": 0.2, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0.06, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5604, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 32, "size": 255869851, "checksum": "ea12c5d61f80ab2fcf480d3db6ca85779eb35e4a3856dc65f705b08544946a07", "avgDurationSecs": 7.945, "validDurationSecs": 41359.574, "totalHrs": 12.36, "validHrs": 11.48 }, "fr": { "buckets": { "dev": 16190, "test": 16190, "train": 598539, "validated": 768082, "invalidated": 67307, "other": 14437 }, "duration": 4278799162, "reportedSentences": 7569, "validatedSentences": 1648986, "unvalidatedSentences": 43795, "clips": 849826, "splits": { "accent": {}, "age": { "": 0.38, "twenties": 0.17, "thirties": 0.15, "teens": 0.03, "fourties": 0.13, "fifties": 0.09, "sixties": 0.03, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0.57, "female_feminine": 0.11, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 849722, "agriculture_food": 0, "automotive_transport": 1, "finance": 1, "service_retail": 0, "general": 67, "healthcare": 5, "history_law_government": 19, "language_fundamentals": 7, "media_entertainment": 16, "nature_environment": 8, "news_current_affairs": 2, "technology_robotics": 18 } }, "users": 20781, "size": 29926415973, 
"checksum": "88717f4acd399b3535525c447c2f8187ff4d782b25b008f3e3b405fd5f974d1c", "avgDurationSecs": 5.035, "validDurationSecs": 3867225.312, "totalHrs": 1188.55, "validHrs": 1074.22 }, "fub": { "buckets": { "dev": 277, "test": 345, "train": 445, "validated": 7686, "invalidated": 27, "other": 80 }, "duration": 47854224, "reportedSentences": 0, "validatedSentences": 1067, "unvalidatedSentences": -1, "clips": 7793, "splits": { "accent": {}, "age": { "": 0.99, "twenties": 0.01, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7793, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 16, "size": 261460974, "checksum": "c1fb1bce9d38594b71a428399391d265fcabcdd44f5049e1ee3fee80dbeab59d", "avgDurationSecs": 6.141, "validDurationSecs": 47197.173, "totalHrs": 13.29, "validHrs": 13.11 }, "fue": { "buckets": { "dev": 0, "test": 88, "train": 900, "validated": 7141, "invalidated": 11, "other": 78 }, "duration": 38649888, "reportedSentences": 0, "validatedSentences": 988, "unvalidatedSentences": -1, "clips": 7230, "splits": { "accent": {}, "age": { "": 0.86, "twenties": 0, "thirties": 0.14, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7230, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, 
"media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 9, "size": 220620218, "checksum": "bcc3ece7c57d22f500414af4b25384d6d58b3ebe1f4047679022a3831675db02", "avgDurationSecs": 5.346, "validDurationSecs": 38174.115, "totalHrs": 10.73, "validHrs": 10.6 }, "fy-NL": { "buckets": { "dev": 3172, "test": 3173, "train": 3924, "validated": 51722, "invalidated": 4025, "other": 102434 }, "duration": 767899251, "reportedSentences": 543, "validatedSentences": 10902, "unvalidatedSentences": 5, "clips": 158181, "splits": { "accent": {}, "age": { "": 0.56, "twenties": 0.04, "thirties": 0.1, "teens": 0, "fourties": 0.06, "fifties": 0.1, "sixties": 0.12, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.57, "male_masculine": 0.13, "female_feminine": 0.31, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 157885, "agriculture_food": 232, "automotive_transport": 0, "finance": 32, "service_retail": 0, "general": 64, "healthcare": 0, "history_law_government": 201, "language_fundamentals": 55, "media_entertainment": 257, "nature_environment": 1, "news_current_affairs": 36, "technology_robotics": 6 } }, "users": 2082, "size": 4604079419, "checksum": "167dd4d4415ecff2ee15a1047a888bd641b4d4aaccafccb11d30d96c047e50dd", "avgDurationSecs": 4.855, "validDurationSecs": 251087.584, "totalHrs": 213.3, "validHrs": 69.74 }, "ga-IE": { "buckets": { "dev": 532, "test": 532, "train": 550, "validated": 9692, "invalidated": 1141, "other": 2771 }, "duration": 48431475, "reportedSentences": 34, "validatedSentences": 4436, "unvalidatedSentences": 1339, "clips": 13604, "splits": { "accent": {}, "age": { "": 0.3, "twenties": 0.36, "thirties": 0.24, "teens": 0.01, "fourties": 0.05, "fifties": 0.04, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.31, "male_masculine": 0.38, "female_feminine": 0.32, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, 
"sentence_domain": { "": 13504, "agriculture_food": 0, "automotive_transport": 3, "finance": 0, "service_retail": 0, "general": 103, "healthcare": 1, "history_law_government": 2, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 2, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 216, "size": 318192023, "checksum": "18866850e9af0279d3c5c2a345be7a49e8675697a264d210d4e2bc291017b897", "avgDurationSecs": 3.56, "validDurationSecs": 34504.4, "totalHrs": 13.45, "validHrs": 9.58 }, "gej": { "buckets": { "dev": 954, "test": 952, "train": 1282, "validated": 16406, "invalidated": 332, "other": 36 }, "duration": 41018184, "reportedSentences": 0, "validatedSentences": 3188, "unvalidatedSentences": -1, "clips": 16774, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 16774, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 20, "size": 225939282, "checksum": "28c5775859654c5dbaa13fc016ffb9e688f4d43b3ef4978da00baed0dbb5e94c", "avgDurationSecs": 2.445, "validDurationSecs": 40118.298, "totalHrs": 11.39, "validHrs": 11.14 }, "ggg": { "buckets": { "dev": 0, "test": 318, "train": 1687, "validated": 6684, "invalidated": 3, "other": 4393 }, "duration": 44190504, "reportedSentences": 0, "validatedSentences": 2005, "unvalidatedSentences": -1, "clips": 11080, "splits": { "accent": {}, "age": { "": 0.3, "twenties": 0.19, "thirties": 0.33, "teens": 0, "fourties": 0.18, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, 
"gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 11080, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 24, "size": 258846838, "checksum": "d8073f7c5cd23ff88c14d9b6d1f93f8c376abb5a7e8081f9d360a2cde6aea2fc", "avgDurationSecs": 3.988, "validDurationSecs": 26657.882, "totalHrs": 12.27, "validHrs": 7.4 }, "gid": { "buckets": { "dev": 286, "test": 302, "train": 329, "validated": 5111, "invalidated": 208, "other": 0 }, "duration": 37833048, "reportedSentences": 0, "validatedSentences": 917, "unvalidatedSentences": -1, "clips": 5319, "splits": { "accent": {}, "age": { "": 0.99, "twenties": 0, "thirties": 0.01, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5319, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 16, "size": 216332230, "checksum": "e7e1ff9b60f568a43fc7d1569d01f24844f12ec6f33d82f2c5626cf45fd5fa76", "avgDurationSecs": 7.113, "validDurationSecs": 36353.583, "totalHrs": 10.5, "validHrs": 10.09 }, "gig": { "buckets": { "dev": 0, "test": 135, "train": 1869, "validated": 12130, "invalidated": 6, "other": 23 }, "duration": 36376452, "reportedSentences": 0, "validatedSentences": 2005, "unvalidatedSentences": -1, "clips": 12159, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.67, 
"thirties": 0.16, "teens": 0, "fourties": 0, "fifties": 0.16, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 12159, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 20, "size": 212058226, "checksum": "2cc0220cad472adb0a4cdabc481a64081c04a826da750fe4d1e34fd3e43f3978", "avgDurationSecs": 2.992, "validDurationSecs": 36289.692, "totalHrs": 10.1, "validHrs": 10.08 }, "giz": { "buckets": { "dev": 308, "test": 277, "train": 406, "validated": 6494, "invalidated": 149, "other": 5 }, "duration": 37043316, "reportedSentences": 14, "validatedSentences": 991, "unvalidatedSentences": -1, "clips": 6648, "splits": { "accent": {}, "age": { "": 0.89, "twenties": 0.03, "thirties": 0.07, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6648, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 32, "size": 216339647, "checksum": "66ad29ef24decd69053503ea34073ad9e3f8527ed931d0c559836227a5bc2dca", "avgDurationSecs": 5.572, "validDurationSecs": 36185.213, "totalHrs": 10.28, "validHrs": 10.05 }, "gjk": { "buckets": { "dev": 570, "test": 623, "train": 808, "validated": 8548, "invalidated": 168, "other": 10 }, "duration": 39656844, "reportedSentences": 0, 
"validatedSentences": 2004, "unvalidatedSentences": -1, "clips": 8726, "splits": { "accent": {}, "age": { "": 0.02, "twenties": 0.12, "thirties": 0.59, "teens": 0.04, "fourties": 0.23, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.74, "male_masculine": 0, "female_feminine": 0.26, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8726, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 23, "size": 230595435, "checksum": "c6911fa52ebce671c64b35e755f7372fbb1a47e7d6a3effbac03211a1134be06", "avgDurationSecs": 4.545, "validDurationSecs": 38847.892, "totalHrs": 11.01, "validHrs": 10.79 }, "gju": { "buckets": { "dev": 0, "test": 628, "train": 3202, "validated": 11074, "invalidated": 155, "other": 512 }, "duration": 38540952, "reportedSentences": 0, "validatedSentences": 3852, "unvalidatedSentences": 2, "clips": 11741, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0, "thirties": 0.01, "teens": 0, "fourties": 0.77, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 11738, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 3, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 6, "size": 220149579, "checksum": "9fcd2228689e0e8247450123f969d43d53592ee26b95a6a8dd708a7b871667c3", "avgDurationSecs": 3.283, "validDurationSecs": 36351.461, "totalHrs": 10.7, "validHrs": 10.09 }, "gl": { "buckets": { 
"dev": 14551, "test": 14563, "train": 116944, "validated": 149077, "invalidated": 18243, "other": 99644 }, "duration": 1334349072, "reportedSentences": 780, "validatedSentences": 696745, "unvalidatedSentences": 4263, "clips": 266964, "splits": { "accent": {}, "age": { "": 0.55, "twenties": 0.16, "thirties": 0.07, "teens": 0.02, "fourties": 0.09, "fifties": 0.1, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.64, "male_masculine": 0.13, "female_feminine": 0.22, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.01 }, "sentence_domain": { "": 266525, "agriculture_food": 33, "automotive_transport": 4, "finance": 40, "service_retail": 17, "general": 258, "healthcare": 3, "history_law_government": 52, "language_fundamentals": 19, "media_entertainment": 41, "nature_environment": 15, "news_current_affairs": 14, "technology_robotics": 5 } }, "users": 6381, "size": 7726065207, "checksum": "ccab851fda33b47318d57ac9af8205ed60b894eff5a06b5c01b3bc9e0e58a2ef", "avgDurationSecs": 4.998, "validDurationSecs": 745122.026, "totalHrs": 370.65, "validHrs": 206.97 }, "gn": { "buckets": { "dev": 491, "test": 1012, "train": 1738, "validated": 3583, "invalidated": 211, "other": 21601 }, "duration": 116832060, "reportedSentences": 52, "validatedSentences": 5894, "unvalidatedSentences": 301, "clips": 25395, "splits": { "accent": {}, "age": { "": 0.17, "twenties": 0.55, "thirties": 0.17, "teens": 0.02, "fourties": 0.08, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.19, "male_masculine": 0.34, "female_feminine": 0.47, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 25395, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 298, 
"size": 669922176, "checksum": "55bdd9fa1fea2fd872e2fe20450b47ec269173785a1b6b88d9958977c2350ec3", "avgDurationSecs": 4.601, "validDurationSecs": 16483.925, "totalHrs": 32.45, "validHrs": 4.57 }, "gv": { "buckets": { "dev": 1166, "test": 525, "train": 2420, "validated": 6302, "invalidated": 151, "other": 112 }, "duration": 38012652, "reportedSentences": 267, "validatedSentences": 4219, "unvalidatedSentences": 21, "clips": 6565, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.54, "thirties": 0.04, "teens": 0, "fourties": 0.39, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.46, "male_masculine": 0.51, "female_feminine": 0.04, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6565, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 8, "size": 224332377, "checksum": "fad63fb15dd8d258ccb25b0615ac85b944f051cea7252c355db5cee67debf962", "avgDurationSecs": 5.79, "validDurationSecs": 36489.83, "totalHrs": 10.55, "validHrs": 10.13 }, "gwc": { "buckets": { "dev": 776, "test": 918, "train": 3268, "validated": 7421, "invalidated": 683, "other": 1699 }, "duration": 57059136, "reportedSentences": 1, "validatedSentences": 5574, "unvalidatedSentences": -1, "clips": 9803, "splits": { "accent": {}, "age": { "": 0.03, "twenties": 0.34, "thirties": 0, "teens": 0, "fourties": 0.48, "fifties": 0.14, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 9803, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, 
"language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 22, "size": 335361185, "checksum": "15b0dace01582f01eb5efe6018c48d0ced12a66d9810325422f916f492caf5bf", "avgDurationSecs": 5.821, "validDurationSecs": 43194.517, "totalHrs": 15.84, "validHrs": 11.99 }, "gwt": { "buckets": { "dev": 0, "test": 220, "train": 3136, "validated": 7681, "invalidated": 57, "other": 0 }, "duration": 44273880, "reportedSentences": 0, "validatedSentences": 3719, "unvalidatedSentences": 2087, "clips": 7738, "splits": { "accent": {}, "age": { "": 0.16, "twenties": 0, "thirties": 0.4, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0.43, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7738, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 5, "size": 261955624, "checksum": "b4c2c1b62cf56173755e557bff06d241b53c5eff7b7f6444a2c91695e2937221", "avgDurationSecs": 5.722, "validDurationSecs": 43947.748, "totalHrs": 12.29, "validHrs": 12.2 }, "gya": { "buckets": { "dev": 346, "test": 346, "train": 362, "validated": 6914, "invalidated": 491, "other": 0 }, "duration": 38369700, "reportedSentences": 1, "validatedSentences": 1054, "unvalidatedSentences": -1, "clips": 7405, "splits": { "accent": {}, "age": { "": 0.86, "twenties": 0, "thirties": 0.14, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7405, "agriculture_food": 0, 
"automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 31, "size": 221193381, "checksum": "d773cdeb9ad9c0075b5b6d04d56df31202377bddac9b26331b40735a1a538be8", "avgDurationSecs": 5.182, "validDurationSecs": 35825.538, "totalHrs": 10.65, "validHrs": 9.95 }, "ha": { "buckets": { "dev": 626, "test": 751, "train": 1908, "validated": 3448, "invalidated": 174, "other": 6666 }, "duration": 44817624, "reportedSentences": 34, "validatedSentences": 5448, "unvalidatedSentences": 242, "clips": 10288, "splits": { "accent": {}, "age": { "": 0.18, "twenties": 0.13, "thirties": 0.66, "teens": 0, "fourties": 0, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.18, "male_masculine": 0.58, "female_feminine": 0.24, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 10288, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 58, "size": 262350321, "checksum": "aec59743fa075c60a616d9b21d36a84dfdb1159f7ab32a34634dcfe1788d2be1", "avgDurationSecs": 4.356, "validDurationSecs": 15020.526, "totalHrs": 12.44, "validHrs": 4.17 }, "haz": { "buckets": { "dev": 86, "test": 446, "train": 823, "validated": 8319, "invalidated": 0, "other": 0 }, "duration": 37895580, "reportedSentences": 29, "validatedSentences": 1361, "unvalidatedSentences": -1, "clips": 8319, "splits": { "accent": {}, "age": { "": 0.35, "twenties": 0.32, "thirties": 0.33, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, 
"female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8319, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 7, "size": 183539889, "checksum": "28d9bf0f247e91481d446c28171fc5fd57fce8b0e723c309db878d4b6a1f770d", "avgDurationSecs": 4.555, "validDurationSecs": 37895.58, "totalHrs": 10.52, "validHrs": 10.52 }, "he": { "buckets": { "dev": 359, "test": 780, "train": 1396, "validated": 3166, "invalidated": 339, "other": 1986 }, "duration": 25219404, "reportedSentences": 43, "validatedSentences": 4172, "unvalidatedSentences": 6844, "clips": 5491, "splits": { "accent": {}, "age": { "": 0.07, "twenties": 0.72, "thirties": 0.16, "teens": 0, "fourties": 0.02, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0.04, "nineties": 0 }, "gender": { "": 0.07, "male_masculine": 0.91, "female_feminine": 0.02, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5490, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 1, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 55, "size": 146851454, "checksum": "8ed51adb92edcdb1fc2fe6f36533284c0b2abe851e88e8789427c191473cb476", "avgDurationSecs": 4.593, "validDurationSecs": 14541, "totalHrs": 7, "validHrs": 4.03 }, "hem": { "buckets": { "dev": 337, "test": 337, "train": 345, "validated": 5974, "invalidated": 470, "other": 0 }, "duration": 38994660, "reportedSentences": 2, "validatedSentences": 1020, "unvalidatedSentences": 2, "clips": 6444, "splits": { "accent": {}, "age": { "": 0.12, "twenties": 0.47, "thirties": 0.27, "teens": 0, 
"fourties": 0.13, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.77, "male_masculine": 0.03, "female_feminine": 0.2, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6444, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 41, "size": 211336388, "checksum": "986a01caeeeb23ef6ee6591f0600637a8aef9129512c5f7cdee9e73a728d51d4", "avgDurationSecs": 6.051, "validDurationSecs": 36150.543, "totalHrs": 10.83, "validHrs": 10.04 }, "hi": { "buckets": { "dev": 2700, "test": 3349, "train": 4872, "validated": 11000, "invalidated": 891, "other": 4466 }, "duration": 82082436, "reportedSentences": 146, "validatedSentences": 32204, "unvalidatedSentences": 9964, "clips": 16357, "splits": { "accent": {}, "age": { "": 0.33, "twenties": 0.32, "thirties": 0.24, "teens": 0.01, "fourties": 0.07, "fifties": 0.02, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male_masculine": 0.58, "female_feminine": 0.03, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 16357, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 467, "size": 487155780, "checksum": "7bbdd04b8e79fd11b702015a11fb01f672edd8ab4fb1828c51d037b7a7a4bfb0", "avgDurationSecs": 5.018, "validDurationSecs": 55200.024, "totalHrs": 22.8, "validHrs": 15.33 }, "hno": { "buckets": { "dev": 765, "test": 708, "train": 863, "validated": 9120, "invalidated": 282, "other": 5 }, "duration": 37708020, "reportedSentences": 
21, "validatedSentences": 2349, "unvalidatedSentences": -1, "clips": 9407, "splits": { "accent": {}, "age": { "": 0.09, "twenties": 0.66, "thirties": 0.23, "teens": 0.02, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.52, "male_masculine": 0, "female_feminine": 0.48, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6162, "agriculture_food": 5816, "automotive_transport": 0, "finance": 3245, "service_retail": 0, "general": 2574, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 5816, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 36, "size": 216314873, "checksum": "b5932fc740dc2f109f83d82f8430a057f68e12895fa79b82bbeb8e2a620e91fe", "avgDurationSecs": 4.009, "validDurationSecs": 36557.579, "totalHrs": 10.47, "validHrs": 10.15 }, "hsb": { "buckets": { "dev": 270, "test": 398, "train": 810, "validated": 1478, "invalidated": 247, "other": 263 }, "duration": 12533940, "reportedSentences": 130, "validatedSentences": 7180, "unvalidatedSentences": 27, "clips": 1988, "splits": { "accent": {}, "age": { "": 0.19, "twenties": 0.1, "thirties": 0.09, "teens": 0.07, "fourties": 0.47, "fifties": 0.03, "sixties": 0.02, "seventies": 0.03, "eighties": 0, "nineties": 0 }, "gender": { "": 0.24, "male_masculine": 0.75, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1988, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 30, "size": 94137580, "checksum": "5e34adbb89c2f7e96972b4b98fe3bbff2ac3fd1710d84d7443b9666ec0a723bc", "avgDurationSecs": 6.305, "validDurationSecs": 9318.493, "totalHrs": 3.48, 
"validHrs": 2.58 }, "ht": { "buckets": { "dev": 5, "test": 5, "train": 13, "validated": 24, "invalidated": 1, "other": 5 }, "duration": 166968, "reportedSentences": 0, "validatedSentences": 76, "unvalidatedSentences": 1629, "clips": 30, "splits": { "accent": {}, "age": { "": 0.67, "twenties": 0, "thirties": 0.33, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 30, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 3, "size": 1091475, "checksum": "8bd055f142e4bb4600c305033320aef3e6a26b32d8c23761b0a3ef841af18f78", "avgDurationSecs": 5.566, "validDurationSecs": 133.574, "totalHrs": 0.04, "validHrs": 0.03 }, "hu": { "buckets": { "dev": 11698, "test": 11688, "train": 39722, "validated": 63578, "invalidated": 3343, "other": 50290 }, "duration": 649243333, "reportedSentences": 554, "validatedSentences": 326333, "unvalidatedSentences": 289, "clips": 117211, "splits": { "accent": {}, "age": { "": 0.28, "twenties": 0.11, "thirties": 0.13, "teens": 0.02, "fourties": 0.11, "fifties": 0.32, "sixties": 0.01, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male_masculine": 0.23, "female_feminine": 0.46, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 117160, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 44, "healthcare": 0, "history_law_government": 12, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 6 } }, "users": 1672, "size": 3818287096, "checksum": 
"cc00c31d4fea341f0afdad1ed8ebb5a7c5e843351fe8a197e7ba521c9924eae4", "avgDurationSecs": 5.539, "validDurationSecs": 352164.836, "totalHrs": 180.34, "validHrs": 97.82 }, "hux": { "buckets": { "dev": 25, "test": 326, "train": 637, "validated": 9285, "invalidated": 1322, "other": 1 }, "duration": 41113224, "reportedSentences": 0, "validatedSentences": 995, "unvalidatedSentences": -1, "clips": 10608, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.19, "thirties": 0.09, "teens": 0, "fourties": 0.26, "fifties": 0.37, "sixties": 0.09, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male_masculine": 0, "female_feminine": 0.63, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 10608, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 13, "size": 240826603, "checksum": "8e3eecf5cf97faaf1cedc9b3d8a1bae5490a487af00761fec120bb70a89b02c4", "avgDurationSecs": 3.876, "validDurationSecs": 35985.698, "totalHrs": 11.42, "validHrs": 9.99 }, "hy-AM": { "buckets": { "dev": 5866, "test": 5830, "train": 9304, "validated": 21215, "invalidated": 1221, "other": 15145 }, "duration": 202638528, "reportedSentences": 183, "validatedSentences": 237419, "unvalidatedSentences": 1338, "clips": 37581, "splits": { "accent": {}, "age": { "": 0.14, "twenties": 0.63, "thirties": 0.08, "teens": 0.11, "fourties": 0.04, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2, "male_masculine": 0.25, "female_feminine": 0.55, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 37579, "agriculture_food": 0, "automotive_transport": 0, "finance": 1, "service_retail": 0, "general": 1, "healthcare": 0, "history_law_government": 0, 
"language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 439, "size": 1192094550, "checksum": "84c5be2ea4462323c0a40234fe32f680b467090b79b35603ba9cd86f892d1a73", "avgDurationSecs": 5.392, "validDurationSecs": 114392.283, "totalHrs": 56.28, "validHrs": 31.77 }, "ia": { "buckets": { "dev": 1866, "test": 1862, "train": 4908, "validated": 12064, "invalidated": 351, "other": 2345 }, "duration": 62016204, "reportedSentences": 279, "validatedSentences": 9069, "unvalidatedSentences": 96, "clips": 14760, "splits": { "accent": {}, "age": { "": 0.38, "twenties": 0.05, "thirties": 0.02, "teens": 0, "fourties": 0.29, "fifties": 0.03, "sixties": 0, "seventies": 0.22, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male_masculine": 0.61, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 14760, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 72, "size": 420126870, "checksum": "2e74cf02c0c8dd0f8e1358e1865bb3e8a0e9718b13de5deb2ffb3312b2becb2e", "avgDurationSecs": 4.202, "validDurationSecs": 50688.583, "totalHrs": 17.22, "validHrs": 14.08 }, "ibb": { "buckets": { "dev": 316, "test": 328, "train": 352, "validated": 3014, "invalidated": 115, "other": 4332 }, "duration": 96209352, "reportedSentences": 0, "validatedSentences": 1031, "unvalidatedSentences": -1, "clips": 7461, "splits": { "accent": {}, "age": { "": 0.99, "twenties": 0, "thirties": 0.01, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.97, "male_masculine": 0, "female_feminine": 0.03, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, 
"sentence_domain": { "": 7461, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 18, "size": 561138278, "checksum": "90e4cb961549ad311101119f263553647bf284ff039d13798d78eadd7170a0e0", "avgDurationSecs": 12.895, "validDurationSecs": 38865.432, "totalHrs": 26.72, "validHrs": 10.79 }, "id": { "buckets": { "dev": 3469, "test": 3691, "train": 4973, "validated": 30218, "invalidated": 2794, "other": 26488 }, "duration": 238191348, "reportedSentences": 362, "validatedSentences": 13318, "unvalidatedSentences": 11860, "clips": 59500, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.41, "thirties": 0.1, "teens": 0.22, "fourties": 0.02, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male_masculine": 0.44, "female_feminine": 0.26, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0, "intersex": 0 }, "sentence_domain": { "": 59500, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 645, "size": 1497459948, "checksum": "174b2fda9f7415a09ecb0489ef90ded0ad10eb11aed17c10ae3bebb3546e7c01", "avgDurationSecs": 4.003, "validDurationSecs": 120969.179, "totalHrs": 66.16, "validHrs": 33.6 }, "ig": { "buckets": { "dev": 3, "test": 5, "train": 9, "validated": 17, "invalidated": 3, "other": 5805 }, "duration": 31839696, "reportedSentences": 16, "validatedSentences": 14459, "unvalidatedSentences": 3858, "clips": 5825, "splits": { "accent": {}, "age": { "": 0.56, "twenties": 0.33, "thirties": 0.04, "teens": 0.04, "fourties": 0, "fifties": 0, "sixties": 0.02, 
"seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.56, "male_masculine": 0.14, "female_feminine": 0.3, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5825, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 122, "size": 187164329, "checksum": "345de1f833826b94f5cb5da053db3258b19ed76b892028f8525fe4ebeeb695a9", "avgDurationSecs": 5.466, "validDurationSecs": 92.923, "totalHrs": 8.84, "validHrs": 0.02 }, "ipk": { "buckets": { "dev": 0, "test": 160, "train": 3104, "validated": 3264, "invalidated": 0, "other": 0 }, "duration": 25908516, "reportedSentences": 0, "validatedSentences": 4386, "unvalidatedSentences": 27, "clips": 3264, "splits": { "accent": {}, "age": { "": 0.05, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0.95, "nineties": 0 }, "gender": { "": 0.05, "male_masculine": 0, "female_feminine": 0.95, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 3264, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 2, "size": 153448226, "checksum": "bc1bb8587ee91c26f32f517fd179825a721981746f778a0202ec4fb9d9ec8f05", "avgDurationSecs": 7.938, "validDurationSecs": 25908.516, "totalHrs": 7.19, "validHrs": 7.19 }, "is": { "buckets": { "dev": 17, "test": 18, "train": 68, "validated": 103, "invalidated": 15, "other": 296 }, "duration": 2660040, "reportedSentences": 1, "validatedSentences": 5083, "unvalidatedSentences": 113, "clips": 414, "splits": { 
"accent": {}, "age": { "": 0.37, "twenties": 0, "thirties": 0.61, "teens": 0, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.96, "male_masculine": 0.04, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 414, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 8, "size": 16009805, "checksum": "5dc65dc9e351740d70733459457aede0c450aecd1e571eac4ab8edecfd97c42b", "avgDurationSecs": 6.425, "validDurationSecs": 661.797, "totalHrs": 0.73, "validHrs": 0.18 }, "it": { "buckets": { "dev": 15179, "test": 15178, "train": 172989, "validated": 239972, "invalidated": 20694, "other": 18146 }, "duration": 1513211712, "reportedSentences": 5983, "validatedSentences": 924971, "unvalidatedSentences": 1042, "clips": 278812, "splits": { "accent": {}, "age": { "": 0.27, "twenties": 0.2, "thirties": 0.16, "teens": 0.01, "fourties": 0.12, "fifties": 0.14, "sixties": 0.09, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.28, "male_masculine": 0.62, "female_feminine": 0.1, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 278797, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 1, "general": 11, "healthcare": 1, "history_law_government": 5, "language_fundamentals": 3, "media_entertainment": 1, "nature_environment": 3, "news_current_affairs": 1, "technology_robotics": 0 } }, "users": 7301, "size": 10289501945, "checksum": "5116fbc5b6500272bb65428798c18bcfb3df01b2f8c0c5235ec7d442fd272043", "avgDurationSecs": 5.427, "validDurationSecs": 1302413.242, "totalHrs": 420.33, "validHrs": 361.78 }, "ja": { "buckets": { "dev": 8007, "test": 8007, "train": 15437, 
"validated": 178126, "invalidated": 31687, "other": 263832 }, "duration": 2151865761, "reportedSentences": 783, "validatedSentences": 37288, "unvalidatedSentences": 8788, "clips": 473645, "splits": { "accent": {}, "age": { "": 0.15, "twenties": 0.55, "thirties": 0.08, "teens": 0.08, "fourties": 0.09, "fifties": 0.04, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.28, "male_masculine": 0.33, "female_feminine": 0.39, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.01 }, "sentence_domain": { "": 437276, "agriculture_food": 5741, "automotive_transport": 1225, "finance": 1402, "service_retail": 385, "general": 16609, "healthcare": 1352, "history_law_government": 3306, "language_fundamentals": 2721, "media_entertainment": 4066, "nature_environment": 1750, "news_current_affairs": 724, "technology_robotics": 571 } }, "users": 5543, "size": 12667029619, "checksum": "5b4f10d4614f601fc62b064ec03f539a26e99bd271dffe7bb820e1c3f87cf404", "avgDurationSecs": 4.543, "validDurationSecs": 809262.719, "totalHrs": 597.74, "validHrs": 224.79 }, "jgo": { "buckets": { "dev": 336, "test": 336, "train": 362, "validated": 6080, "invalidated": 89, "other": 20 }, "duration": 41432724, "reportedSentences": 0, "validatedSentences": 1035, "unvalidatedSentences": 2, "clips": 6189, "splits": { "accent": {}, "age": { "": 0.44, "twenties": 0.11, "thirties": 0.15, "teens": 0, "fourties": 0.3, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.67, "male_masculine": 0, "female_feminine": 0.33, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6188, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 1, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 1, "nature_environment": 1, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 16, "size": 227452131, "checksum": 
"f3ef2655195e43ddfeb13de79b95d362b3361cb2216196860a8c21a9cf488ea8", "avgDurationSecs": 6.695, "validDurationSecs": 40703.015, "totalHrs": 11.5, "validHrs": 11.3 }, "jqr": { "buckets": { "dev": 323, "test": 273, "train": 403, "validated": 6060, "invalidated": 116, "other": 79 }, "duration": 36995796, "reportedSentences": 0, "validatedSentences": 999, "unvalidatedSentences": -1, "clips": 6255, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0.6, "fifties": 0.16, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.48, "male_masculine": 0, "female_feminine": 0.52, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6255, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 12, "size": 215097740, "checksum": "0e758eae3f351725442f8e2449f123b4988c7ca8d9fd3a5598820bc33e686cec", "avgDurationSecs": 5.915, "validDurationSecs": 35842.45, "totalHrs": 10.27, "validHrs": 9.95 }, "ka": { "buckets": { "dev": 12978, "test": 13126, "train": 62845, "validated": 117311, "invalidated": 6738, "other": 103162 }, "duration": 1181850444, "reportedSentences": 1101, "validatedSentences": 132577, "unvalidatedSentences": 1786, "clips": 227211, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0.22, "thirties": 0.33, "teens": 0.11, "fourties": 0.07, "fifties": 0.04, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.29, "male_masculine": 0.17, "female_feminine": 0.54, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 226661, "agriculture_food": 9, "automotive_transport": 10, "finance": 4, "service_retail": 9, "general": 253, "healthcare": 9, "history_law_government": 200, 
"language_fundamentals": 32, "media_entertainment": 22, "nature_environment": 17, "news_current_affairs": 10, "technology_robotics": 3 } }, "users": 2751, "size": 6911288857, "checksum": "f893f73a5478ae787037bc23e18f68d749e8632286266108bbefedbb982a26e3", "avgDurationSecs": 5.202, "validDurationSecs": 610199.583, "totalHrs": 328.29, "validHrs": 169.49 }, "kab": { "buckets": { "dev": 15003, "test": 15003, "train": 152502, "validated": 614703, "invalidated": 20068, "other": 112413 }, "duration": 2496848812, "reportedSentences": 9747, "validatedSentences": 182803, "unvalidatedSentences": 205835, "clips": 747184, "splits": { "accent": {}, "age": { "": 0.28, "twenties": 0.12, "thirties": 0.29, "teens": 0, "fourties": 0.09, "fifties": 0.18, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male_masculine": 0.53, "female_feminine": 0.2, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 747184, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1593, "size": 18643498627, "checksum": "86b4ca14130f8016f3ef679bb8353469cd0dc1f457c5a53ba8e30680e0023824", "avgDurationSecs": 3.342, "validDurationSecs": 2054139.884, "totalHrs": 693.56, "validHrs": 570.59 }, "kbd": { "buckets": { "dev": 5423, "test": 5416, "train": 8134, "validated": 64812, "invalidated": 1426, "other": 4284 }, "duration": 431194392, "reportedSentences": 2390, "validatedSentences": 20914, "unvalidatedSentences": 2376, "clips": 70522, "splits": { "accent": {}, "age": { "": 0.2, "twenties": 0.17, "thirties": 0.2, "teens": 0.02, "fourties": 0.13, "fifties": 0.28, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.49, "male_masculine": 0.01, "female_feminine": 0.51, "transgender": 
0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2597, "agriculture_food": 4, "automotive_transport": 4, "finance": 0, "service_retail": 0, "general": 77836, "healthcare": 64, "history_law_government": 212, "language_fundamentals": 16, "media_entertainment": 7, "nature_environment": 128, "news_current_affairs": 5, "technology_robotics": 0 } }, "users": 214, "size": 2423875122, "checksum": "b33dc85f14f43a91f7511cdad79b77d8ab3d453a36edc6125a332ae644e9a825", "avgDurationSecs": 6.114, "validDurationSecs": 396281.599, "totalHrs": 119.77, "validHrs": 110.07 }, "kdh": { "buckets": { "dev": 356, "test": 363, "train": 498, "validated": 13513, "invalidated": 279, "other": 496 }, "duration": 38163852, "reportedSentences": 0, "validatedSentences": 1219, "unvalidatedSentences": -1, "clips": 14288, "splits": { "accent": {}, "age": { "": 0.98, "twenties": 0, "thirties": 0.02, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 14288, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 20, "size": 201050475, "checksum": "9b401ffd5ab845bbc36813465d3debd48cbba61de999a62edb42bcd4ff508d96", "avgDurationSecs": 2.671, "validDurationSecs": 36093.794, "totalHrs": 10.6, "validHrs": 10.02 }, "khw": { "buckets": { "dev": 1607, "test": 1535, "train": 2631, "validated": 8486, "invalidated": 261, "other": 1141 }, "duration": 72821808, "reportedSentences": 4, "validatedSentences": 7046, "unvalidatedSentences": 201, "clips": 9888, "splits": { "accent": {}, "age": { "": 0.07, "twenties": 0.11, "thirties": 0.26, "teens": 0, "fourties": 0.55, 
"fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 9531, "agriculture_food": 18, "automotive_transport": 53, "finance": 2, "service_retail": 0, "general": 253, "healthcare": 4, "history_law_government": 9, "language_fundamentals": 30, "media_entertainment": 4, "nature_environment": 9, "news_current_affairs": 4, "technology_robotics": 1 } }, "users": 49, "size": 429333063, "checksum": "4e66d4dd1d3912d20e3fb9c9a9ecb8c58bc70d6d2cff2067582089b56ae40dc4", "avgDurationSecs": 7.365, "validDurationSecs": 62496.548, "totalHrs": 20.22, "validHrs": 17.36 }, "kk": { "buckets": { "dev": 536, "test": 560, "train": 614, "validated": 1718, "invalidated": 257, "other": 763 }, "duration": 13503780, "reportedSentences": 42, "validatedSentences": 6205, "unvalidatedSentences": 265, "clips": 2738, "splits": { "accent": {}, "age": { "": 0.55, "twenties": 0.23, "thirties": 0.05, "teens": 0.06, "fourties": 0.07, "fifties": 0.05, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.57, "male_masculine": 0.31, "female_feminine": 0.12, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2738, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 191, "size": 77345816, "checksum": "1f33391cf774bbc0c65311e1c5f3d84c18ac1e746e934cb05fc8131df32d060c", "avgDurationSecs": 4.932, "validDurationSecs": 8473.153, "totalHrs": 3.75, "validHrs": 2.35 }, "kln": { "buckets": { "dev": 6412, "test": 6075, "train": 11064, "validated": 32311, "invalidated": 537, "other": 37352 }, "duration": 330293556, "reportedSentences": 0, "validatedSentences": 
29900, "unvalidatedSentences": 61, "clips": 70200, "splits": { "accent": {}, "age": { "": 0.02, "twenties": 0.59, "thirties": 0.05, "teens": 0.05, "fourties": 0.29, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.27, "male_masculine": 0.03, "female_feminine": 0.7, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 4, "agriculture_food": 5, "automotive_transport": 16, "finance": 32, "service_retail": 626, "general": 68952, "healthcare": 54, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 2, "nature_environment": 23, "news_current_affairs": 2, "technology_robotics": 484 } }, "users": 41, "size": 1800947136, "checksum": "44e05d757e0a27337a6a9d5fc7b31dee6d010925567d44bfb383ce2e058a2c56", "avgDurationSecs": 4.705, "validDurationSecs": 152024.431, "totalHrs": 91.74, "validHrs": 42.22 }, "kls": { "buckets": { "dev": 1258, "test": 1233, "train": 1415, "validated": 9774, "invalidated": 163, "other": 11 }, "duration": 37741320, "reportedSentences": 2, "validatedSentences": 3912, "unvalidatedSentences": 4, "clips": 9948, "splits": { "accent": {}, "age": { "": 0.64, "twenties": 0.06, "thirties": 0, "teens": 0, "fourties": 0.3, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 9948, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 23, "size": 215523326, "checksum": "ab8ced7b6b9c3da26a4aa3ef04eca7997ec5a54dd70432098f3cc2ee15513b2f", "avgDurationSecs": 3.794, "validDurationSecs": 37081.188, "totalHrs": 10.48, "validHrs": 10.3 }, "kmr": { "buckets": { "dev": 
4082, "test": 4077, "train": 5419, "validated": 63388, "invalidated": 2607, "other": 24967 }, "duration": 376083612, "reportedSentences": 1062, "validatedSentences": 16988, "unvalidatedSentences": 1127, "clips": 90962, "splits": { "accent": {}, "age": { "": 0.48, "twenties": 0.35, "thirties": 0.09, "teens": 0.01, "fourties": 0.03, "fifties": 0.04, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.49, "male_masculine": 0.42, "female_feminine": 0.09, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 90948, "agriculture_food": 2, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 8, "healthcare": 0, "history_law_government": 2, "language_fundamentals": 2, "media_entertainment": 2, "nature_environment": 4, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 676, "size": 1998096742, "checksum": "af479a593a4e2d31436226c3c12221e6df2d24765b64378af7a3c8c375a906db", "avgDurationSecs": 4.135, "validDurationSecs": 262078.538, "totalHrs": 104.46, "validHrs": 72.79 }, "ko": { "buckets": { "dev": 478, "test": 486, "train": 520, "validated": 1485, "invalidated": 417, "other": 3811 }, "duration": 30565800, "reportedSentences": 29, "validatedSentences": 7355, "unvalidatedSentences": 2668, "clips": 5713, "splits": { "accent": {}, "age": { "": 0.13, "twenties": 0.51, "thirties": 0.24, "teens": 0.03, "fourties": 0.05, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.36, "male_masculine": 0.39, "female_feminine": 0.25, "transgender": 0, "non-binary": 0.01, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5710, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 1, "general": 0, "healthcare": 0, "history_law_government": 1, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 1 } }, "users": 160, "size": 178711022, "checksum": 
"18e96e0def09d688c9360111a0cd3bdf3cbae5eb34122ceebeae2c1883ba9e3f", "avgDurationSecs": 5.35, "validDurationSecs": 7945.075, "totalHrs": 8.49, "validHrs": 2.2 }, "ksf": { "buckets": { "dev": 308, "test": 320, "train": 376, "validated": 7414, "invalidated": 305, "other": 222 }, "duration": 73970280, "reportedSentences": 0, "validatedSentences": 1004, "unvalidatedSentences": -1, "clips": 7941, "splits": { "accent": {}, "age": { "": 0.88, "twenties": 0.07, "thirties": 0.05, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.95, "male_masculine": 0, "female_feminine": 0.05, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7941, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 24, "size": 407853419, "checksum": "eeb575e15e6f8e006c9e09a6d4331b27cb24afd2cdbc5322ff773a015d59779f", "avgDurationSecs": 9.315, "validDurationSecs": 69061.284, "totalHrs": 20.54, "validHrs": 19.18 }, "kvx": { "buckets": { "dev": 654, "test": 450, "train": 913, "validated": 7464, "invalidated": 36, "other": 57 }, "duration": 40217904, "reportedSentences": 2, "validatedSentences": 2025, "unvalidatedSentences": -1, "clips": 7557, "splits": { "accent": {}, "age": { "": 0.04, "twenties": 0.2, "thirties": 0.27, "teens": 0.28, "fourties": 0.21, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.79, "male_masculine": 0, "female_feminine": 0.21, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7557, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, 
"media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 22, "size": 230139068, "checksum": "910ea95de6ede3bd9b6d97040ba694b6803b9547b3b2d95e2871d110fd8f432f", "avgDurationSecs": 5.322, "validDurationSecs": 39722.964, "totalHrs": 11.17, "validHrs": 11.03 }, "kw": { "buckets": { "dev": 0, "test": 2401, "train": 6223, "validated": 10806, "invalidated": 402, "other": 60 }, "duration": 46633392, "reportedSentences": 43, "validatedSentences": 9609, "unvalidatedSentences": -1, "clips": 11268, "splits": { "accent": {}, "age": { "": 0.12, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0.34, "fifties": 0.47, "sixties": 0.02, "seventies": 0.05, "eighties": 0, "nineties": 0 }, "gender": { "": 0.66, "male_masculine": 0, "female_feminine": 0.34, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 0, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 12725, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 10, "size": 273362922, "checksum": "88ccd4f7be82449ec6643c533aa6c9beb1ce613134d68231857453a20b604eb5", "avgDurationSecs": 4.139, "validDurationSecs": 44721.373, "totalHrs": 12.95, "validHrs": 12.42 }, "kxp": { "buckets": { "dev": 0, "test": 197, "train": 1881, "validated": 11644, "invalidated": 95, "other": 20 }, "duration": 40016700, "reportedSentences": 0, "validatedSentences": 2079, "unvalidatedSentences": -1, "clips": 11759, "splits": { "accent": {}, "age": { "": 0.01, "twenties": 0.28, "thirties": 0.53, "teens": 0.01, "fourties": 0.18, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 11759, "agriculture_food": 0, 
"automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 22, "size": 231123953, "checksum": "da6466f7e3a795fb35b1f1b5c4019b2626b1cd931ab85d9c377deb2c0672cc4f", "avgDurationSecs": 3.403, "validDurationSecs": 39625.347, "totalHrs": 11.11, "validHrs": 11 }, "ky": { "buckets": { "dev": 1614, "test": 1615, "train": 1790, "validated": 30674, "invalidated": 5677, "other": 1490 }, "duration": 172292988, "reportedSentences": 39, "validatedSentences": 5045, "unvalidatedSentences": 242, "clips": 37841, "splits": { "accent": {}, "age": { "": 0.07, "twenties": 0.65, "thirties": 0.09, "teens": 0.17, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.12, "male_masculine": 0.54, "female_feminine": 0.34, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 37841, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 297, "size": 1108063669, "checksum": "3fb1badd4e9e99671a4c519710ad673c4a24ce53a5a94d7daca09a9f77042101", "avgDurationSecs": 4.553, "validDurationSecs": 139661.085, "totalHrs": 47.85, "validHrs": 38.79 }, "lg": { "buckets": { "dev": 13352, "test": 13388, "train": 71087, "validated": 271879, "invalidated": 39182, "other": 37697 }, "duration": 2017225395, "reportedSentences": 6485, "validatedSentences": 191409, "unvalidatedSentences": 174, "clips": 348758, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.38, "thirties": 0.22, "teens": 0.01, "fourties": 0.07, "fifties": 0.05, "sixties": 0.02, "seventies": 0, "eighties": 0, 
"nineties": 0 }, "gender": { "": 0.26, "male_masculine": 0.36, "female_feminine": 0.39, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 348758, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 664, "size": 11842082213, "checksum": "f0182a6bd48dabc469f149c58aa1ce1e57c3f613384dafcd6cf6569b40c87196", "avgDurationSecs": 5.784, "validDurationSecs": 1572555.248, "totalHrs": 560.34, "validHrs": 436.82 }, "lij": { "buckets": { "dev": 943, "test": 1358, "train": 2304, "validated": 4686, "invalidated": 585, "other": 0 }, "duration": 20676960, "reportedSentences": 378, "validatedSentences": 6079, "unvalidatedSentences": 50, "clips": 5271, "splits": { "accent": {}, "age": { "": 0.19, "twenties": 0.07, "thirties": 0.45, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0.17, "seventies": 0.11, "eighties": 0, "nineties": 0 }, "gender": { "": 0.27, "male_masculine": 0.45, "female_feminine": 0.28, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 4890, "agriculture_food": 14, "automotive_transport": 1, "finance": 0, "service_retail": 0, "general": 366, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 1, "nature_environment": 0, "news_current_affairs": 1, "technology_robotics": 0 } }, "users": 39, "size": 115143922, "checksum": "d2e89335cb9d9bdfa0a423835fb1e5471f2275e7d627e02480c73d8086838031", "avgDurationSecs": 3.923, "validDurationSecs": 18382.135, "totalHrs": 5.74, "validHrs": 5.1 }, "lo": { "buckets": { "dev": 28, "test": 26, "train": 102, "validated": 156, "invalidated": 11, "other": 56 }, "duration": 1456560, "reportedSentences": 0, "validatedSentences": 5145, "unvalidatedSentences": 5925, "clips": 223, "splits": { 
"accent": {}, "age": { "": 0.11, "twenties": 0.33, "thirties": 0.56, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.11, "male_masculine": 0.89, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 223, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 16, "size": 9346505, "checksum": "e17186c2d50931006209e7145265f1c663b2b71df246cbb69518afda773cfbef", "avgDurationSecs": 6.532, "validDurationSecs": 1018.939, "totalHrs": 0.4, "validHrs": 0.28 }, "lrk": { "buckets": { "dev": 0, "test": 347, "train": 1654, "validated": 11462, "invalidated": 206, "other": 123 }, "duration": 42216012, "reportedSentences": 0, "validatedSentences": 2006, "unvalidatedSentences": -1, "clips": 11791, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.28, "thirties": 0.01, "teens": 0.34, "fourties": 0.19, "fifties": 0.01, "sixties": 0.17, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.83, "male_masculine": 0, "female_feminine": 0.17, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 11791, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 20, "size": 246863702, "checksum": "dd66b0f79ae1783209f2b4cc45fd110978dc0d247a0c67e27e18e1dc201f163f", "avgDurationSecs": 3.58, "validDurationSecs": 41038.074, "totalHrs": 11.72, "validHrs": 11.39 }, "lss": { "buckets": { "dev": 474, "test": 645, "train": 903, "validated": 10326, "invalidated": 34, 
"other": 94 }, "duration": 36300708, "reportedSentences": 0, "validatedSentences": 2029, "unvalidatedSentences": -1, "clips": 10454, "splits": { "accent": {}, "age": { "": 0.04, "twenties": 0.45, "thirties": 0.33, "teens": 0, "fourties": 0.19, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.7, "male_masculine": 0, "female_feminine": 0.19, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.11 }, "sentence_domain": { "": 10454, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 15, "size": 183622008, "checksum": "ebc222fa1e1b6bbdc7db8aa5a1f302f0ddc10e9fd6505c44f0b241fdfa8b7c14", "avgDurationSecs": 3.472, "validDurationSecs": 35856.238, "totalHrs": 10.08, "validHrs": 9.96 }, "lt": { "buckets": { "dev": 5111, "test": 5508, "train": 8767, "validated": 19588, "invalidated": 903, "other": 4285 }, "duration": 127000376, "reportedSentences": 192, "validatedSentences": 129304, "unvalidatedSentences": 4814, "clips": 24776, "splits": { "accent": {}, "age": { "": 0.16, "twenties": 0.26, "thirties": 0.23, "teens": 0.03, "fourties": 0.29, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.42, "male_masculine": 0.41, "female_feminine": 0.16, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 24776, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 325, "size": 775975730, "checksum": "ebcedc73729600609da671de4d429ac1de34abe5559b0f1664c5ec2bf382feac", "avgDurationSecs": 5.126, 
"validDurationSecs": 100406.981, "totalHrs": 35.27, "validHrs": 27.89 }, "ltg": { "buckets": { "dev": 3537, "test": 3579, "train": 4684, "validated": 22387, "invalidated": 1861, "other": 761 }, "duration": 119531340, "reportedSentences": 78, "validatedSentences": 14996, "unvalidatedSentences": 1826, "clips": 25009, "splits": { "accent": {}, "age": { "": 0.34, "twenties": 0.09, "thirties": 0.23, "teens": 0, "fourties": 0.33, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.34, "male_masculine": 0.05, "female_feminine": 0.6, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 19189, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 5820, "healthcare": 1, "history_law_government": 691, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 377, "size": 689621077, "checksum": "f021bd4c9a14e3d924baa1aa37a5e609b1aea91c8d50be6dd0ed8ea70589fecd", "avgDurationSecs": 4.78, "validDurationSecs": 106999.405, "totalHrs": 33.2, "validHrs": 29.72 }, "lua": { "buckets": { "dev": 291, "test": 291, "train": 298, "validated": 4801, "invalidated": 894, "other": 0 }, "duration": 46917972, "reportedSentences": 0, "validatedSentences": 880, "unvalidatedSentences": -1, "clips": 5695, "splits": { "accent": {}, "age": { "": 0.13, "twenties": 0.67, "thirties": 0.19, "teens": 0, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.68, "male_masculine": 0, "female_feminine": 0.32, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5695, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, 
"technology_robotics": 0 } }, "users": 46, "size": 272341238, "checksum": "bd8b0d735b056c46ef3ab3693cb0e3fcdd95e4a6b036dee01f1de43804cf0a1f", "avgDurationSecs": 8.238, "validDurationSecs": 39552.798, "totalHrs": 13.03, "validHrs": 10.98 }, "luo": { "buckets": { "dev": 3060, "test": 3001, "train": 4498, "validated": 20343, "invalidated": 792, "other": 61593 }, "duration": 430582284, "reportedSentences": 8, "validatedSentences": 24678, "unvalidatedSentences": 13870, "clips": 82728, "splits": { "accent": {}, "age": { "": 0.36, "twenties": 0.33, "thirties": 0.28, "teens": 0, "fourties": 0, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.6, "male_masculine": 0, "female_feminine": 0.4, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 11, "agriculture_food": 1135, "automotive_transport": 341, "finance": 967, "service_retail": 2748, "general": 69480, "healthcare": 2236, "history_law_government": 864, "language_fundamentals": 555, "media_entertainment": 806, "nature_environment": 634, "news_current_affairs": 2710, "technology_robotics": 241 } }, "users": 46, "size": 2392662440, "checksum": "ce0ca491218e0dff5ce65c8be7a842f0a2ed836ec4e173955ada1be2e8ae42e4", "avgDurationSecs": 5.205, "validDurationSecs": 105881.145, "totalHrs": 119.6, "validHrs": 29.41 }, "lv": { "buckets": { "dev": 7728, "test": 7728, "train": 14432, "validated": 197038, "invalidated": 11490, "other": 21427 }, "duration": 1104608429, "reportedSentences": 3143, "validatedSentences": 34448, "unvalidatedSentences": 65078, "clips": 229955, "splits": { "accent": {}, "age": { "": 0.44, "twenties": 0.06, "thirties": 0.29, "teens": 0.01, "fourties": 0.09, "fifties": 0.08, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.45, "male_masculine": 0.25, "female_feminine": 0.3, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 229120, "agriculture_food": 8, 
"automotive_transport": 3, "finance": 4, "service_retail": 16, "general": 694, "healthcare": 36, "history_law_government": 8, "language_fundamentals": 9, "media_entertainment": 16, "nature_environment": 40, "news_current_affairs": 5, "technology_robotics": 13 } }, "users": 6464, "size": 6188175922, "checksum": "81fdd0d48539f1c39dc786ad9ab39c3b185fb896850ae65344d6d87ec7e66d18", "avgDurationSecs": 4.804, "validDurationSecs": 946488.816, "totalHrs": 306.83, "validHrs": 262.91 }, "mau": { "buckets": { "dev": 252, "test": 312, "train": 555, "validated": 6040, "invalidated": 13, "other": 216 }, "duration": 38969352, "reportedSentences": 0, "validatedSentences": 1119, "unvalidatedSentences": -1, "clips": 6269, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.21, "thirties": 0.43, "teens": 0, "fourties": 0.36, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.65, "male_masculine": 0, "female_feminine": 0.35, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6269, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 7, "size": 207478013, "checksum": "b38c8c5a794a2eeeb4dba7a5c41efb630cb83d89a72b381c60737a538b818083", "avgDurationSecs": 6.216, "validDurationSecs": 37545.842, "totalHrs": 10.82, "validHrs": 10.42 }, "mbo": { "buckets": { "dev": 158, "test": 235, "train": 601, "validated": 7030, "invalidated": 75, "other": 757 }, "duration": 43366860, "reportedSentences": 4, "validatedSentences": 998, "unvalidatedSentences": 4, "clips": 7862, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, 
"female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7862, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 12, "size": 253665900, "checksum": "9fa579645665e15070017c4ac4328c4d30607cb477dae8f7433f09512722fd46", "avgDurationSecs": 5.516, "validDurationSecs": 38777.541, "totalHrs": 12.04, "validHrs": 10.77 }, "mcf": { "buckets": { "dev": 0, "test": 320, "train": 677, "validated": 13884, "invalidated": 28, "other": 9 }, "duration": 36966816, "reportedSentences": 0, "validatedSentences": 997, "unvalidatedSentences": -1, "clips": 13921, "splits": { "accent": {}, "age": { "": 0.05, "twenties": 0.28, "thirties": 0.34, "teens": 0.26, "fourties": 0.07, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.74, "male_masculine": 0, "female_feminine": 0.26, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 13921, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 20, "size": 182169901, "checksum": "017e3697abebe88931595d869d3204e3aeb268bcd3caa3275ee53e2719182af8", "avgDurationSecs": 2.655, "validDurationSecs": 36868.564, "totalHrs": 10.26, "validHrs": 10.24 }, "mcn": { "buckets": { "dev": 316, "test": 316, "train": 325, "validated": 8620, "invalidated": 32, "other": 608 }, "duration": 39026016, "reportedSentences": 4, "validatedSentences": 959, "unvalidatedSentences": -1, "clips": 9260, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, 
"fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 9260, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 35, "size": 228282308, "checksum": "a0d07839ea1315f434d2fb7c8d1a4c43b2a1587278af8a57722171d07db854d3", "avgDurationSecs": 4.214, "validDurationSecs": 36328.754, "totalHrs": 10.84, "validHrs": 10.09 }, "mcx": { "buckets": { "dev": 111, "test": 249, "train": 543, "validated": 5491, "invalidated": 67, "other": 2 }, "duration": 36755964, "reportedSentences": 0, "validatedSentences": 903, "unvalidatedSentences": -1, "clips": 5560, "splits": { "accent": {}, "age": { "": 0.99, "twenties": 0, "thirties": 0.01, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5560, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 10, "size": 217222020, "checksum": "6f2b7efd7e8806cbb65efa2474801ed64b3c9a0c29df31e9fc8987c1092fd8e8", "avgDurationSecs": 6.611, "validDurationSecs": 36299.82, "totalHrs": 10.2, "validHrs": 10.08 }, "mdd": { "buckets": { "dev": 0, "test": 46, "train": 877, "validated": 5542, "invalidated": 31, "other": 0 }, "duration": 36437112, "reportedSentences": 0, "validatedSentences": 923, "unvalidatedSentences": 
-1, "clips": 5573, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5573, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 7, "size": 208717939, "checksum": "7a69f4040011a60724daa7dc0967186c30abce9de0224db9fbfb6610928833b4", "avgDurationSecs": 6.538, "validDurationSecs": 36234.429, "totalHrs": 10.12, "validHrs": 10.06 }, "mdf": { "buckets": { "dev": 54, "test": 107, "train": 175, "validated": 336, "invalidated": 9, "other": 0 }, "duration": 1811340, "reportedSentences": 15, "validatedSentences": 6039, "unvalidatedSentences": 4, "clips": 345, "splits": { "accent": {}, "age": { "": 0.58, "twenties": 0.02, "thirties": 0, "teens": 0, "fourties": 0.34, "fifties": 0, "sixties": 0.06, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.58, "male_masculine": 0.08, "female_feminine": 0.34, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 345, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 11, "size": 11057130, "checksum": "94cb754fce2fb2e832470f00ff31bd74ed9d171e723d3dc6866e06ccb24ece99", "avgDurationSecs": 5.25, "validDurationSecs": 1764.088, "totalHrs": 0.5, "validHrs": 0.49 }, "mgg": { "buckets": { "dev": 296, "test": 212, "train": 483, "validated": 4863, "invalidated": 290, 
"other": 0 }, "duration": 39065004, "reportedSentences": 0, "validatedSentences": 991, "unvalidatedSentences": -1, "clips": 5153, "splits": { "accent": {}, "age": { "": 0.99, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.99, "male_masculine": 0, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5153, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 8, "size": 231050557, "checksum": "76f105dca7cd2abec749d03a939ebf3e772a5e94f5113a2fc4a32ecb0479faf6", "avgDurationSecs": 7.581, "validDurationSecs": 36866.508, "totalHrs": 10.85, "validHrs": 10.24 }, "mhk": { "buckets": { "dev": 305, "test": 202, "train": 493, "validated": 6597, "invalidated": 52, "other": 46 }, "duration": 41164560, "reportedSentences": 0, "validatedSentences": 1000, "unvalidatedSentences": -1, "clips": 6695, "splits": { "accent": {}, "age": { "": 0.93, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0.07, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6695, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 9, "size": 230630891, "checksum": "d383f06aafa324959a3a702102ad8650c15cbed60d210be89b01dabfd195e87b", "avgDurationSecs": 6.149, "validDurationSecs": 40562.002, "totalHrs": 11.43, 
"validHrs": 11.26 }, "mhr": { "buckets": { "dev": 14663, "test": 15136, "train": 186565, "validated": 218728, "invalidated": 16025, "other": 18 }, "duration": 1085385744, "reportedSentences": 102, "validatedSentences": 328269, "unvalidatedSentences": 3, "clips": 234771, "splits": { "accent": {}, "age": { "": 0.12, "twenties": 0.33, "thirties": 0.21, "teens": 0.14, "fourties": 0.12, "fifties": 0.04, "sixties": 0.03, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.12, "male_masculine": 0.19, "female_feminine": 0.69, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 234771, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 499, "size": 6124213649, "checksum": "9c8ff932e077aee5381373db7c2302dcdb313cff7e91643c8ab5927d32f215c2", "avgDurationSecs": 4.623, "validDurationSecs": 1011216.262, "totalHrs": 301.49, "validHrs": 280.89 }, "mk": { "buckets": { "dev": 1713, "test": 1786, "train": 2095, "validated": 16364, "invalidated": 940, "other": 23632 }, "duration": 196867044, "reportedSentences": 161, "validatedSentences": 6032, "unvalidatedSentences": 661, "clips": 40936, "splits": { "accent": {}, "age": { "": 0.11, "twenties": 0.36, "thirties": 0.27, "teens": 0.03, "fourties": 0.2, "fifties": 0.02, "sixties": 0.03, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0.28, "female_feminine": 0.39, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 40671, "agriculture_food": 14, "automotive_transport": 0, "finance": 7, "service_retail": 0, "general": 210, "healthcare": 27, "history_law_government": 7, "language_fundamentals": 0, "media_entertainment": 7, "nature_environment": 21, "news_current_affairs": 0, 
"technology_robotics": 0 } }, "users": 464, "size": 1080400680, "checksum": "6c2a3f3104b868b21d6eeb00ef9c18752fd8a5d5ede8c1f7f955064fd7e56656", "avgDurationSecs": 4.809, "validDurationSecs": 78696.803, "totalHrs": 54.68, "validHrs": 21.86 }, "mki": { "buckets": { "dev": 0, "test": 23, "train": 2008, "validated": 11181, "invalidated": 134, "other": 12 }, "duration": 36287784, "reportedSentences": 0, "validatedSentences": 2055, "unvalidatedSentences": -1, "clips": 11327, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.28, "thirties": 0.72, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.82, "male_masculine": 0, "female_feminine": 0.18, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 11327, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 12, "size": 196597529, "checksum": "3c4d8b3290b002b90aa52a5a9547f920196b10cdacf67adb151c3f3d8f7d7dd6", "avgDurationSecs": 3.204, "validDurationSecs": 35820.051, "totalHrs": 10.07, "validHrs": 9.95 }, "ml": { "buckets": { "dev": 934, "test": 883, "train": 1232, "validated": 3467, "invalidated": 120, "other": 5948 }, "duration": 40249908, "reportedSentences": 243, "validatedSentences": 5487, "unvalidatedSentences": 4728, "clips": 9535, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0.59, "thirties": 0.08, "teens": 0, "fourties": 0.11, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0.48, "female_feminine": 0.19, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 9535, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, 
"healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 151, "size": 228144229, "checksum": "789169ac172baef4ccb24bbd00d5fdd41f423b15a47583004b3971af918b50ea", "avgDurationSecs": 4.221, "validDurationSecs": 14635.179, "totalHrs": 11.18, "validHrs": 4.06 }, "mn": { "buckets": { "dev": 1934, "test": 1933, "train": 2191, "validated": 33654, "invalidated": 3233, "other": 59408 }, "duration": 505961468, "reportedSentences": 46, "validatedSentences": 6095, "unvalidatedSentences": 17, "clips": 96295, "splits": { "accent": {}, "age": { "": 0.04, "twenties": 0.92, "thirties": 0.03, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0.01 }, "gender": { "": 0.46, "male_masculine": 0.14, "female_feminine": 0.4, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 96248, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 31, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 16, "technology_robotics": 0 } }, "users": 603, "size": 3077781199, "checksum": "f2631343a73e23233d0f4f642504fbd220eead41c51464701ec4a1e0133daa96", "avgDurationSecs": 5.254, "validDurationSecs": 176827.74, "totalHrs": 140.54, "validHrs": 49.11 }, "mr": { "buckets": { "dev": 1766, "test": 1796, "train": 2189, "validated": 10995, "invalidated": 2292, "other": 2798 }, "duration": 99727560, "reportedSentences": 59, "validatedSentences": 6715, "unvalidatedSentences": 12104, "clips": 16085, "splits": { "accent": {}, "age": { "": 0.06, "twenties": 0.28, "thirties": 0.13, "teens": 0.53, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.06, "male_masculine": 0.19, "female_feminine": 0.75, "transgender": 0, 
"non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 16085, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 104, "size": 586277943, "checksum": "afd61b4cc8140fb6f2095cd4a67c64082235b55e14c8e6c8f36eed5d38fe1e94", "avgDurationSecs": 6.2, "validDurationSecs": 68169.383, "totalHrs": 27.7, "validHrs": 18.93 }, "mrj": { "buckets": { "dev": 7314, "test": 7164, "train": 14325, "validated": 28973, "invalidated": 1261, "other": 1459 }, "duration": 132879852, "reportedSentences": 50, "validatedSentences": 54143, "unvalidatedSentences": 4, "clips": 31693, "splits": { "accent": {}, "age": { "": 0.1, "twenties": 0.35, "thirties": 0.22, "teens": 0.06, "fourties": 0.12, "fifties": 0.11, "sixties": 0.04, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.13, "male_masculine": 0.25, "female_feminine": 0.62, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 31693, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 60, "size": 747243458, "checksum": "0ef9c69d3ee8dc9b34818617a50acac4514ea7164aaf388f0831f327f446172d", "avgDurationSecs": 4.193, "validDurationSecs": 121475.656, "totalHrs": 36.91, "validHrs": 33.74 }, "ms": { "buckets": { "dev": 6, "test": 4, "train": 10, "validated": 22, "invalidated": 1, "other": 2069 }, "duration": 12876444, "reportedSentences": 0, "validatedSentences": 257, "unvalidatedSentences": 1, "clips": 2092, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, 
"sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2057, "agriculture_food": 0, "automotive_transport": 6, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 29, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 9 } }, "users": 27, "size": 76004371, "checksum": "bae536fe39ff65335544ad26beb15f155f034da718c366a4be28e7f0493954c0", "avgDurationSecs": 6.155, "validDurationSecs": 135.412, "totalHrs": 3.57, "validHrs": 0.03 }, "mse": { "buckets": { "dev": 327, "test": 284, "train": 391, "validated": 4338, "invalidated": 310, "other": 18 }, "duration": 38836476, "reportedSentences": 0, "validatedSentences": 1002, "unvalidatedSentences": -1, "clips": 4666, "splits": { "accent": {}, "age": { "": 0.79, "twenties": 0.21, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 4666, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 18, "size": 218226822, "checksum": "cc2aab381728bc2ee968ae3ebffd98daa707986805a6cf1bce5e5d0636a32bed", "avgDurationSecs": 8.323, "validDurationSecs": 36106.437, "totalHrs": 10.78, "validHrs": 10.02 }, "mt": { "buckets": { "dev": 1625, "test": 1660, "train": 1910, "validated": 6593, "invalidated": 326, "other": 6288 }, "duration": 62787780, "reportedSentences": 10, "validatedSentences": 5273, "unvalidatedSentences": 11773, "clips": 
13207, "splits": { "accent": {}, "age": { "": 0.27, "twenties": 0.19, "thirties": 0.09, "teens": 0.03, "fourties": 0.17, "fifties": 0.25, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.28, "male_masculine": 0.25, "female_feminine": 0.47, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 13207, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 222, "size": 466487981, "checksum": "a76ddf73b9c382a65f423ee3bb04c9ae7bdc9dc36c9bb49a36a6fcb0bda99a41", "avgDurationSecs": 4.754, "validDurationSecs": 31343.972, "totalHrs": 17.44, "validHrs": 8.7 }, "mua": { "buckets": { "dev": 286, "test": 281, "train": 428, "validated": 8677, "invalidated": 33, "other": 27 }, "duration": 38936952, "reportedSentences": 5, "validatedSentences": 995, "unvalidatedSentences": -1, "clips": 8737, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8737, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 17, "size": 217975633, "checksum": "2fddb0eddc3f4812dfebd99b3b3e31b0a9a979f449dc272921b9d905545f5059", "avgDurationSecs": 4.457, "validDurationSecs": 38669.558, "totalHrs": 10.81, "validHrs": 10.74 }, "mug": { "buckets": { "dev": 297, "test": 297, "train": 404, "validated": 2687, 
"invalidated": 263, "other": 0 }, "duration": 45123948, "reportedSentences": 0, "validatedSentences": 1002, "unvalidatedSentences": -1, "clips": 2950, "splits": { "accent": {}, "age": { "": 0.99, "twenties": 0, "thirties": 0.01, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2950, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 17, "size": 266764196, "checksum": "4e52297c2a8dbfdf86f818c20d630dd542ac2d1c9cca9d94602d80b91ba7258c", "avgDurationSecs": 15.296, "validDurationSecs": 41101.033, "totalHrs": 12.53, "validHrs": 11.41 }, "mve": { "buckets": { "dev": 555, "test": 146, "train": 1296, "validated": 7648, "invalidated": 47, "other": 11 }, "duration": 38372904, "reportedSentences": 0, "validatedSentences": 2003, "unvalidatedSentences": -1, "clips": 7706, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.26, "thirties": 0.33, "teens": 0, "fourties": 0.39, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7706, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 20, "size": 225594864, "checksum": "85c55fb01213c27424617b71036cf72cc2275b078cc53cd6c25bea287e2f3e9a", "avgDurationSecs": 4.98, "validDurationSecs": 38084.086, 
"totalHrs": 10.65, "validHrs": 10.57 }, "mvy": { "buckets": { "dev": 1946, "test": 2076, "train": 2558, "validated": 16600, "invalidated": 590, "other": 1285 }, "duration": 90013968, "reportedSentences": 90, "validatedSentences": 6634, "unvalidatedSentences": 222, "clips": 18475, "splits": { "accent": {}, "age": { "": 0.04, "twenties": 0.39, "thirties": 0.47, "teens": 0, "fourties": 0.06, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 18245, "agriculture_food": 33, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 108, "healthcare": 0, "history_law_government": 78, "language_fundamentals": 0, "media_entertainment": 11, "nature_environment": 7, "news_current_affairs": 0, "technology_robotics": 9 } }, "users": 56, "size": 477005232, "checksum": "cef63de5ab873db3507a29750cc1d423c828e32d9ce4e51433988fd6d8fb581b", "avgDurationSecs": 4.872, "validDurationSecs": 80878.586, "totalHrs": 25, "validHrs": 22.46 }, "mxu": { "buckets": { "dev": 309, "test": 317, "train": 333, "validated": 7749, "invalidated": 67, "other": 2 }, "duration": 50839092, "reportedSentences": 0, "validatedSentences": 959, "unvalidatedSentences": -1, "clips": 7818, "splits": { "accent": {}, "age": { "": 0.56, "twenties": 0.32, "thirties": 0.12, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7818, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 22, "size": 
298315184, "checksum": "b1a82fa6e55d8406738f384955a3a0c09f018b9e41b25de71fde38bec7e4d047", "avgDurationSecs": 6.503, "validDurationSecs": 50390.397, "totalHrs": 14.12, "validHrs": 13.99 }, "myv": { "buckets": { "dev": 239, "test": 482, "train": 1241, "validated": 1972, "invalidated": 23, "other": 15 }, "duration": 11624940, "reportedSentences": 31, "validatedSentences": 5066, "unvalidatedSentences": 7, "clips": 2010, "splits": { "accent": {}, "age": { "": 0.39, "twenties": 0.09, "thirties": 0.24, "teens": 0.01, "fourties": 0.02, "fifties": 0, "sixties": 0.26, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.39, "male_masculine": 0.54, "female_feminine": 0.07, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2010, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 13, "size": 68324712, "checksum": "17f0dd2ad1afa8e62972ed372f428b76b6615b024a04c0a4c1c0e277f09e9a91", "avgDurationSecs": 5.784, "validDurationSecs": 11405.165, "totalHrs": 3.22, "validHrs": 3.16 }, "nan-tw": { "buckets": { "dev": 5705, "test": 6523, "train": 11584, "validated": 29287, "invalidated": 565, "other": 2099 }, "duration": 84381048, "reportedSentences": 226, "validatedSentences": 26907, "unvalidatedSentences": 370, "clips": 31951, "splits": { "accent": {}, "age": { "": 0.13, "twenties": 0.5, "thirties": 0.15, "teens": 0.01, "fourties": 0.17, "fifties": 0.01, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male_masculine": 0.63, "female_feminine": 0.11, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 31563, "agriculture_food": 4, "automotive_transport": 0, "finance": 0, "service_retail": 2, "general": 327, "healthcare": 1, 
"history_law_government": 0, "language_fundamentals": 55, "media_entertainment": 0, "nature_environment": 55, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 290, "size": 480352640, "checksum": "4e98f2845fdcd1b201fde448f42297fecaaea21988244c007487eef36191696e", "avgDurationSecs": 2.641, "validDurationSecs": 77345.553, "totalHrs": 23.43, "validHrs": 21.48 }, "nb-NO": { "buckets": { "dev": 0, "test": 399, "train": 1259, "validated": 1658, "invalidated": 76, "other": 208 }, "duration": 8028000, "reportedSentences": 1, "validatedSentences": 4375, "unvalidatedSentences": 3692, "clips": 1942, "splits": { "accent": {}, "age": { "": 0.07, "twenties": 0, "thirties": 0.71, "teens": 0, "fourties": 0.22, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.31, "male_masculine": 0.69, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1172, "agriculture_food": 3, "automotive_transport": 0, "finance": 4, "service_retail": 29, "general": 695, "healthcare": 0, "history_law_government": 18, "language_fundamentals": 11, "media_entertainment": 11, "nature_environment": 34, "news_current_affairs": 29, "technology_robotics": 0 } }, "users": 20, "size": 46632788, "checksum": "33911867e0819f597c942cdd0bc215d8ec39664c2c534e7a50ea49944cceef20", "avgDurationSecs": 4.134, "validDurationSecs": 6853.977, "totalHrs": 2.23, "validHrs": 1.9 }, "ncx": { "buckets": { "dev": 339, "test": 345, "train": 360, "validated": 8640, "invalidated": 60, "other": 809 }, "duration": 42381792, "reportedSentences": 2, "validatedSentences": 1518, "unvalidatedSentences": 5, "clips": 9509, "splits": { "accent": {}, "age": { "": 0.04, "twenties": 0.15, "thirties": 0.63, "teens": 0, "fourties": 0.17, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.61, "male_masculine": 0.01, "female_feminine": 0.38, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 
}, "sentence_domain": { "": 2832, "agriculture_food": 0, "automotive_transport": 0, "finance": 212, "service_retail": 0, "general": 0, "healthcare": 212, "history_law_government": 0, "language_fundamentals": 2047, "media_entertainment": 4700, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 41, "size": 243144822, "checksum": "3929c3261146eb2473488734c2df68f80a9380976cb1890e8bc03d193bf0622c", "avgDurationSecs": 4.457, "validDurationSecs": 38508.643, "totalHrs": 11.77, "validHrs": 10.69 }, "ne-NP": { "buckets": { "dev": 314, "test": 289, "train": 355, "validated": 1179, "invalidated": 100, "other": 405 }, "duration": 6917940, "reportedSentences": 5, "validatedSentences": 1316, "unvalidatedSentences": 30, "clips": 1684, "splits": { "accent": {}, "age": { "": 0.31, "twenties": 0.57, "thirties": 0.12, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male_masculine": 0.61, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1683, "agriculture_food": 1, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 59, "size": 40598939, "checksum": "7978ccbfb6bacaf77db6d6551a33d41df00ef0aa00e0ac2e94735c681e98c68a", "avgDurationSecs": 4.108, "validDurationSecs": 4843.38, "totalHrs": 1.92, "validHrs": 1.34 }, "nhi": { "buckets": { "dev": 10, "test": 6, "train": 24, "validated": 40, "invalidated": 0, "other": 387 }, "duration": 2169576, "reportedSentences": 0, "validatedSentences": 758, "unvalidatedSentences": 11, "clips": 427, "splits": { "accent": {}, "age": { "": 0.19, "twenties": 0.73, "thirties": 0.08, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, 
"nineties": 0 }, "gender": { "": 0.19, "male_masculine": 0.08, "female_feminine": 0.73, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 427, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 6, "size": 12735711, "checksum": "ea94956581c357a3fdb07e992a602eab52fa2df539d296b6552262a08e33db64", "avgDurationSecs": 5.081, "validDurationSecs": 203.239, "totalHrs": 0.6, "validHrs": 0.05 }, "nl": { "buckets": { "dev": 12183, "test": 12201, "train": 45756, "validated": 103205, "invalidated": 6100, "other": 4368 }, "duration": 496675729, "reportedSentences": 400, "validatedSentences": 260504, "unvalidatedSentences": 9864, "clips": 113673, "splits": { "accent": {}, "age": { "": 0.37, "twenties": 0.18, "thirties": 0.12, "teens": 0.02, "fourties": 0.18, "fifties": 0.11, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.39, "male_masculine": 0.51, "female_feminine": 0.1, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 113118, "agriculture_food": 10, "automotive_transport": 21, "finance": 6, "service_retail": 0, "general": 218, "healthcare": 55, "history_law_government": 47, "language_fundamentals": 169, "media_entertainment": 2, "nature_environment": 53, "news_current_affairs": 2, "technology_robotics": 4 } }, "users": 1853, "size": 3327305604, "checksum": "28960ecc82a25ea90cd32da233772f5fe4ff791157d7ad62f97188d0271c00c4", "avgDurationSecs": 4.369, "validDurationSecs": 450937.502, "totalHrs": 137.96, "validHrs": 125.26 }, "nla": { "buckets": { "dev": 172, "test": 220, "train": 592, "validated": 4635, "invalidated": 176, "other": 5 }, "duration": 37346544, "reportedSentences": 0, "validatedSentences": 984, "unvalidatedSentences": -1, 
"clips": 4816, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 4816, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 15, "size": 212137228, "checksum": "cd9046d0fb3c8d1da9b52632dfccb448f8e7916733b322360844d6a5a6a86635", "avgDurationSecs": 7.755, "validDurationSecs": 35942.947, "totalHrs": 10.37, "validHrs": 9.98 }, "nlv": { "buckets": { "dev": 248, "test": 291, "train": 467, "validated": 6656, "invalidated": 128, "other": 60 }, "duration": 42922260, "reportedSentences": 1, "validatedSentences": 1006, "unvalidatedSentences": -1, "clips": 6844, "splits": { "accent": {}, "age": { "": 0.01, "twenties": 0.13, "thirties": 0.85, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.91, "male_masculine": 0, "female_feminine": 0.09, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5103, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 698, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 824, "media_entertainment": 944, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 16, "size": 249257515, "checksum": "7e8f159ac878082f7ac9fad76e5b72c017a072ab671e3070f015266f6e4bff74", "avgDurationSecs": 6.272, "validDurationSecs": 41743.215, "totalHrs": 11.92, "validHrs": 11.59 }, "nmg": { "buckets": { "dev": 272, "test": 270, "train": 483, "validated": 5865, 
"invalidated": 89, "other": 5 }, "duration": 38142936, "reportedSentences": 1, "validatedSentences": 1025, "unvalidatedSentences": -1, "clips": 5959, "splits": { "accent": {}, "age": { "": 0.86, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0.14, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5959, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 11, "size": 217287077, "checksum": "a3ca9ddb089bab8e730e293a8ff753e8e81a5fb234d6725bcfcaf02223b99c37", "avgDurationSecs": 6.401, "validDurationSecs": 37541.252, "totalHrs": 10.59, "validHrs": 10.42 }, "nmz": { "buckets": { "dev": 795, "test": 795, "train": 846, "validated": 13698, "invalidated": 109, "other": 313 }, "duration": 41779332, "reportedSentences": 2, "validatedSentences": 2446, "unvalidatedSentences": 1, "clips": 14120, "splits": { "accent": {}, "age": { "": 0.92, "twenties": 0, "thirties": 0.08, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.99, "male_masculine": 0, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 14120, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 35, "size": 215709638, "checksum": "6bf8a39040da050fc8b1c0dbcd0f0eec49bbd9bd4f61a00625fc87bc686c29ee", "avgDurationSecs": 2.959, "validDurationSecs": 
40530.686, "totalHrs": 11.6, "validHrs": 11.25 }, "nn-NO": { "buckets": { "dev": 405, "test": 423, "train": 464, "validated": 1297, "invalidated": 64, "other": 18 }, "duration": 6050232, "reportedSentences": 23, "validatedSentences": 5318, "unvalidatedSentences": 6, "clips": 1379, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.28, "thirties": 0.4, "teens": 0.02, "fourties": 0.03, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.29, "male_masculine": 0.58, "female_feminine": 0.13, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1379, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 40, "size": 35178572, "checksum": "07a505a003f6db258a60a9b5ce0ae41b97f57cd936987480f41a47555db3e29f", "avgDurationSecs": 4.387, "validDurationSecs": 5690.465, "totalHrs": 1.68, "validHrs": 1.58 }, "nnh": { "buckets": { "dev": 186, "test": 288, "train": 412, "validated": 7631, "invalidated": 156, "other": 1 }, "duration": 82297008, "reportedSentences": 0, "validatedSentences": 886, "unvalidatedSentences": -1, "clips": 7788, "splits": { "accent": {}, "age": { "": 0.89, "twenties": 0, "thirties": 0.11, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.89, "male_masculine": 0, "female_feminine": 0.11, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7788, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 17, "size": 
486895510, "checksum": "a9634c5a4da53187b1253993fe6b00f8a619d22c73f6a2e556ddb097046b2d11", "avgDurationSecs": 10.567, "validDurationSecs": 80637.965, "totalHrs": 22.86, "validHrs": 22.39 }, "nr": { "buckets": { "dev": 0, "test": 0, "train": 0, "validated": 0, "invalidated": 0, "other": 2 }, "duration": 19476, "reportedSentences": 0, "validatedSentences": 319, "unvalidatedSentences": 7468, "clips": 2, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1, "size": 637948, "checksum": "d3a5f272634eb6d4a0d75d2b5f2d0324b5f66b3d716859c79e439848f8bc92da", "avgDurationSecs": 9.738, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "nso": { "buckets": { "dev": 0, "test": 0, "train": 2, "validated": 2, "invalidated": 0, "other": 18 }, "duration": 92592, "reportedSentences": 0, "validatedSentences": 2244, "unvalidatedSentences": 14227, "clips": 20, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 20, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, 
"technology_robotics": 0 } }, "users": 1, "size": 1552769, "checksum": "97e30787ba685a7fbc30bfa63d5eb031179587380508900f4ca4338ad137d7fa", "avgDurationSecs": 4.63, "validDurationSecs": 9.259, "totalHrs": 0.02, "validHrs": 0 }, "nyu": { "buckets": { "dev": 0, "test": 248, "train": 1023, "validated": 3639, "invalidated": 76, "other": 1 }, "duration": 38390760, "reportedSentences": 0, "validatedSentences": 1271, "unvalidatedSentences": -1, "clips": 3716, "splits": { "accent": {}, "age": { "": 0.32, "twenties": 0.33, "thirties": 0.34, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.67, "male_masculine": 0, "female_feminine": 0.33, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2624, "agriculture_food": 4, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 1082, "healthcare": 2, "history_law_government": 2, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 4, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 5, "size": 219328596, "checksum": "d38b502e66d0b51327df1fff12c6217e32049c67360c5e8780c25a01663f29f9", "avgDurationSecs": 10.331, "validDurationSecs": 37595.257, "totalHrs": 10.66, "validHrs": 10.44 }, "oc": { "buckets": { "dev": 272, "test": 274, "train": 304, "validated": 1976, "invalidated": 203, "other": 7696 }, "duration": 48105396, "reportedSentences": 13, "validatedSentences": 4934, "unvalidatedSentences": 14, "clips": 9875, "splits": { "accent": {}, "age": { "": 0.5, "twenties": 0.02, "thirties": 0.04, "teens": 0.01, "fourties": 0.17, "fifties": 0.16, "sixties": 0.11, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.52, "male_masculine": 0.08, "female_feminine": 0.4, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 9875, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 
0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 147, "size": 273823094, "checksum": "ec00fbb148be60227559200557b4aa763a4ec3c1686b55c9fdd5078c8bf28f9c", "avgDurationSecs": 4.871, "validDurationSecs": 9625.951, "totalHrs": 13.36, "validHrs": 2.67 }, "odk": { "buckets": { "dev": 467, "test": 627, "train": 941, "validated": 6341, "invalidated": 46, "other": 0 }, "duration": 40676832, "reportedSentences": 0, "validatedSentences": 2047, "unvalidatedSentences": -1, "clips": 6387, "splits": { "accent": {}, "age": { "": 0.31, "twenties": 0.2, "thirties": 0.18, "teens": 0.15, "fourties": 0.08, "fifties": 0.08, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.7, "male_masculine": 0.14, "female_feminine": 0.16, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6387, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 20, "size": 236348344, "checksum": "f99eb2ec02890d5a422bae2e1ffbd245196b1acfc192a8fac5cd113b608bf7ac", "avgDurationSecs": 6.369, "validDurationSecs": 40383.872, "totalHrs": 11.29, "validHrs": 11.21 }, "or": { "buckets": { "dev": 640, "test": 479, "train": 2151, "validated": 4085, "invalidated": 437, "other": 19622 }, "duration": 134612700, "reportedSentences": 100, "validatedSentences": 6197, "unvalidatedSentences": 87, "clips": 24144, "splits": { "accent": {}, "age": { "": 0.06, "twenties": 0.65, "thirties": 0.27, "teens": 0, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.42, "male_masculine": 0.34, "female_feminine": 0.03, "transgender": 0, "non-binary": 0, 
"do_not_wish_to_say": 0.21 }, "sentence_domain": { "": 24137, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 7, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 163, "size": 832552939, "checksum": "f5172dfb69883decd6298ed0e35161ecd8a394382ac4ff8173d54fe901baa77f", "avgDurationSecs": 5.575, "validDurationSecs": 22775.55, "totalHrs": 37.39, "validHrs": 6.32 }, "oru": { "buckets": { "dev": 0, "test": 1018, "train": 6269, "validated": 8298, "invalidated": 86, "other": 238 }, "duration": 64119060, "reportedSentences": 0, "validatedSentences": 7355, "unvalidatedSentences": 902, "clips": 8622, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0.78, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.22, "male_masculine": 0.41, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.38 }, "sentence_domain": { "": 8516, "agriculture_food": 106, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 106, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 106, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 12, "size": 380501165, "checksum": "fb6a707a66d454f83270d10caa092de25654e809f60cff49570cd8e0f753da1e", "avgDurationSecs": 7.437, "validDurationSecs": 61709.575, "totalHrs": 17.81, "validHrs": 17.14 }, "os": { "buckets": { "dev": 247, "test": 219, "train": 414, "validated": 911, "invalidated": 34, "other": 0 }, "duration": 5217804, "reportedSentences": 4, "validatedSentences": 1888, "unvalidatedSentences": 34, "clips": 945, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0.09, "thirties": 0.47, "teens": 0.02, "fourties": 0.19, "fifties": 0, "sixties": 0, 
"seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.79, "male_masculine": 0.21, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 944, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 1, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 36, "size": 30545005, "checksum": "cc31226084bef93c50dce6c63387258d9b86c70682269b7b37193591ba505699", "avgDurationSecs": 5.521, "validDurationSecs": 5030.073, "totalHrs": 1.44, "validHrs": 1.39 }, "pa-IN": { "buckets": { "dev": 489, "test": 524, "train": 808, "validated": 1821, "invalidated": 104, "other": 1357 }, "duration": 16778510, "reportedSentences": 251, "validatedSentences": 14825, "unvalidatedSentences": 22, "clips": 3282, "splits": { "accent": {}, "age": { "": 0.28, "twenties": 0.21, "thirties": 0.39, "teens": 0, "fourties": 0.07, "fifties": 0.04, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.36, "male_masculine": 0.63, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 3281, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 1, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 75, "size": 116224696, "checksum": "485ad9d6519fc88cd877f4adeb9423d1dfa1a3a00bc2f6695880e9bd454aca38", "avgDurationSecs": 5.112, "validDurationSecs": 9309.466, "totalHrs": 4.66, "validHrs": 2.58 }, "pcm": { "buckets": { "dev": 326, "test": 326, "train": 335, "validated": 7752, "invalidated": 64, "other": 1108 }, "duration": 53886564, "reportedSentences": 0, "validatedSentences": 987, "unvalidatedSentences": -1, 
"clips": 8924, "splits": { "accent": {}, "age": { "": 0.73, "twenties": 0, "thirties": 0.11, "teens": 0, "fourties": 0.16, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.85, "male_masculine": 0, "female_feminine": 0.15, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8924, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 57, "size": 307383414, "checksum": "629d35f0557fe5afe657c7d9dc682d3931efd3b62d17965697fc7c8146cd38b6", "avgDurationSecs": 6.038, "validDurationSecs": 46809.575, "totalHrs": 14.96, "validHrs": 13 }, "phl": { "buckets": { "dev": 1412, "test": 1364, "train": 1956, "validated": 15524, "invalidated": 83, "other": 5551 }, "duration": 105392160, "reportedSentences": 0, "validatedSentences": 4745, "unvalidatedSentences": 1184, "clips": 21158, "splits": { "accent": {}, "age": { "": 0.06, "twenties": 0.48, "thirties": 0.23, "teens": 0.06, "fourties": 0.18, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 21120, "agriculture_food": 0, "automotive_transport": 4, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 10, "history_law_government": 24, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 20, "size": 619777006, "checksum": "3361da2de146d98f0e49693d6bb0701a79105236b4676a331c2ff392491d43ff", "avgDurationSecs": 4.981, "validDurationSecs": 77328.098, "totalHrs": 29.27, "validHrs": 21.48 }, "phr": { "buckets": { "dev": 673, "test": 656, "train": 743, 
"validated": 12695, "invalidated": 139, "other": 10 }, "duration": 52106364, "reportedSentences": 1, "validatedSentences": 2077, "unvalidatedSentences": -1, "clips": 12844, "splits": { "accent": {}, "age": { "": 0.29, "twenties": 0.52, "thirties": 0.17, "teens": 0, "fourties": 0.02, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.7, "male_masculine": 0, "female_feminine": 0.3, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 12844, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 63, "size": 296575348, "checksum": "33882ea4f5700dd2eb3b28fb76efb826a72f3e6b14de0be9011c34f9dd9ae673", "avgDurationSecs": 4.057, "validDurationSecs": 51501.891, "totalHrs": 14.47, "validHrs": 14.3 }, "pl": { "buckets": { "dev": 9945, "test": 9947, "train": 24735, "validated": 138109, "invalidated": 6939, "other": 2290 }, "duration": 673588838, "reportedSentences": 604, "validatedSentences": 230006, "unvalidatedSentences": 11112, "clips": 147338, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.27, "thirties": 0.34, "teens": 0.02, "fourties": 0.11, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0.01 }, "gender": { "": 0.28, "male_masculine": 0.58, "female_feminine": 0.14, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 147323, "agriculture_food": 0, "automotive_transport": 0, "finance": 12, "service_retail": 0, "general": 3, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 3 } }, "users": 3451, "size": 4766647722, "checksum": 
"5a4074abcbc9b704a43a5b9cb242ffa1c294125e1b24898e786c961ae8155d65", "avgDurationSecs": 4.572, "validDurationSecs": 631396.387, "totalHrs": 187.1, "validHrs": 175.38 }, "plk": { "buckets": { "dev": 1237, "test": 651, "train": 2528, "validated": 9004, "invalidated": 960, "other": 1971 }, "duration": 59980824, "reportedSentences": 0, "validatedSentences": 4657, "unvalidatedSentences": -1, "clips": 11935, "splits": { "accent": {}, "age": { "": 0.14, "twenties": 0, "thirties": 0.16, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0.71, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 11935, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 10, "size": 345848903, "checksum": "6737b908087fba1c0883acda8fe741d762993115da4a6e72a6879dbc1f1195ac", "avgDurationSecs": 5.026, "validDurationSecs": 45250.72, "totalHrs": 16.66, "validHrs": 12.56 }, "prq": { "buckets": { "dev": 282, "test": 287, "train": 417, "validated": 7422, "invalidated": 68, "other": 179 }, "duration": 37282968, "reportedSentences": 0, "validatedSentences": 986, "unvalidatedSentences": -1, "clips": 7669, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.17, "thirties": 0.26, "teens": 0, "fourties": 0.57, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.16, "male_masculine": 0, "female_feminine": 0.84, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7669, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, 
"media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 13, "size": 202948004, "checksum": "1f43c0248aa30fc673e3ee96f2b7b7ce3908f0134b1b3cc1298da0d8ae9a3888", "avgDurationSecs": 4.862, "validDurationSecs": 36082.173, "totalHrs": 10.35, "validHrs": 10.02 }, "ps": { "buckets": { "dev": 3660, "test": 3660, "train": 4693, "validated": 60337, "invalidated": 5179, "other": 42265 }, "duration": 529471260, "reportedSentences": 1140, "validatedSentences": 25109, "unvalidatedSentences": 34260, "clips": 107781, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0.37, "thirties": 0.15, "teens": 0.07, "fourties": 0.15, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.98, "male_masculine": 0.01, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 65012, "agriculture_food": 830, "automotive_transport": 310, "finance": 107, "service_retail": 439, "general": 36905, "healthcare": 718, "history_law_government": 2127, "language_fundamentals": 1960, "media_entertainment": 1244, "nature_environment": 911, "news_current_affairs": 758, "technology_robotics": 521 } }, "users": 1483, "size": 2927038681, "checksum": "75a24aab8d2735588e058c835ae107e3c8ff1e41ba4cd4d93a01c49fc9eef340", "avgDurationSecs": 4.912, "validDurationSecs": 296403.888, "totalHrs": 147.07, "validHrs": 82.33 }, "pt": { "buckets": { "dev": 9653, "test": 9653, "train": 22991, "validated": 159969, "invalidated": 7746, "other": 28174 }, "duration": 820130227, "reportedSentences": 2850, "validatedSentences": 43533, "unvalidatedSentences": 147, "clips": 195889, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.36, "thirties": 0.18, "teens": 0.02, "fourties": 0.13, "fifties": 0.03, "sixties": 0.04, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male_masculine": 0.68, "female_feminine": 0.06, "transgender": 0, "non-binary": 0, 
"do_not_wish_to_say": 0 }, "sentence_domain": { "": 194189, "agriculture_food": 121, "automotive_transport": 36, "finance": 26, "service_retail": 23, "general": 1300, "healthcare": 180, "history_law_government": 64, "language_fundamentals": 17, "media_entertainment": 73, "nature_environment": 47, "news_current_affairs": 4, "technology_robotics": 168 } }, "users": 3787, "size": 5139613362, "checksum": "3076e9686b49a833bf16225a82d136af838ac483987cd3a5d58b49af7fd99449", "avgDurationSecs": 4.187, "validDurationSecs": 669743.642, "totalHrs": 227.81, "validHrs": 186.03 }, "pua": { "buckets": { "dev": 495, "test": 547, "train": 1153, "validated": 7536, "invalidated": 199, "other": 445 }, "duration": 40235256, "reportedSentences": 1, "validatedSentences": 2201, "unvalidatedSentences": -1, "clips": 8180, "splits": { "accent": {}, "age": { "": 0.14, "twenties": 0.55, "thirties": 0.31, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.72, "male_masculine": 0, "female_feminine": 0.28, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8180, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 9, "size": 228614959, "checksum": "c75ffb7d039810b1ab08ddae721720222d61d275960d9d86e78bf0db51108a10", "avgDurationSecs": 4.919, "validDurationSecs": 37067.59, "totalHrs": 11.17, "validHrs": 10.29 }, "pwn": { "buckets": { "dev": 722, "test": 724, "train": 761, "validated": 10798, "invalidated": 138, "other": 2 }, "duration": 53298360, "reportedSentences": 0, "validatedSentences": 2207, "unvalidatedSentences": 241, "clips": 10938, "splits": { "accent": {}, "age": { "": 0.29, "twenties": 0.05, "thirties": 0.43, "teens": 0, "fourties": 0, "fifties": 0.22, 
"sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.49, "male_masculine": 0.04, "female_feminine": 0.23, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.24 }, "sentence_domain": { "": 0, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 13132, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 27, "size": 294332209, "checksum": "9dc0c59060adb70c865dd587ae38dd6c23fe05490d8e63e05983bcb35af11e12", "avgDurationSecs": 4.873, "validDurationSecs": 52616.172, "totalHrs": 14.8, "validHrs": 14.61 }, "qup": { "buckets": { "dev": 288, "test": 254, "train": 449, "validated": 7202, "invalidated": 2, "other": 2465 }, "duration": 57452508, "reportedSentences": 4, "validatedSentences": 991, "unvalidatedSentences": -1, "clips": 9669, "splits": { "accent": {}, "age": { "": 0.08, "twenties": 0.11, "thirties": 0.72, "teens": 0, "fourties": 0, "fifties": 0.1, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.79, "male_masculine": 0, "female_feminine": 0.21, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 9669, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 15, "size": 330095708, "checksum": "de1c827751b5149a73bd3abd370f234197f9d4a9a3b907cddcecc4b78ef79d30", "avgDurationSecs": 5.942, "validDurationSecs": 42793.77, "totalHrs": 15.95, "validHrs": 11.88 }, "qur": { "buckets": { "dev": 0, "test": 32, "train": 977, "validated": 10408, "invalidated": 27, "other": 41 }, "duration": 36351540, "reportedSentences": 0, "validatedSentences": 1010, "unvalidatedSentences": 
-1, "clips": 10476, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.76, "thirties": 0.24, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.19, "male_masculine": 0, "female_feminine": 0.81, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 10476, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 12, "size": 194596443, "checksum": "d237ddd03097ccc1a67b19c11427b41189ea837c42172207e9a10f96e80ab505", "avgDurationSecs": 3.47, "validDurationSecs": 36115.581, "totalHrs": 10.09, "validHrs": 10.03 }, "qus": { "buckets": { "dev": 314, "test": 330, "train": 354, "validated": 10323, "invalidated": 93, "other": 432 }, "duration": 41647572, "reportedSentences": 0, "validatedSentences": 998, "unvalidatedSentences": -1, "clips": 10848, "splits": { "accent": {}, "age": { "": 0.2, "twenties": 0.36, "thirties": 0.32, "teens": 0, "fourties": 0.03, "fifties": 0.09, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.54, "male_masculine": 0, "female_feminine": 0.46, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 10848, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 23, "size": 222351787, "checksum": "778e7cc4d7ec7406a4194aad70bb03d918d4db1e9e13003a6feb9b92b4dab380", "avgDurationSecs": 3.839, "validDurationSecs": 39631.995, "totalHrs": 11.56, "validHrs": 11 }, "qux": { "buckets": { "dev": 280, "test": 299, "train": 399, "validated": 
6156, "invalidated": 25, "other": 2008 }, "duration": 48318408, "reportedSentences": 0, "validatedSentences": 978, "unvalidatedSentences": -1, "clips": 8189, "splits": { "accent": {}, "age": { "": 0.02, "twenties": 0.24, "thirties": 0.39, "teens": 0, "fourties": 0, "fifties": 0.24, "sixties": 0.12, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male_masculine": 0, "female_feminine": 0.63, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8189, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 15, "size": 270628840, "checksum": "2976da9e3803cce29fe1e23d6d8f2687dfe0fef183a861897abdcbb4ff5de741", "avgDurationSecs": 5.9, "validDurationSecs": 36322.887, "totalHrs": 13.42, "validHrs": 10.08 }, "quy": { "buckets": { "dev": 0, "test": 10, "train": 27, "validated": 38, "invalidated": 7, "other": 1255 }, "duration": 11227752, "reportedSentences": 0, "validatedSentences": 759, "unvalidatedSentences": 282, "clips": 1300, "splits": { "accent": {}, "age": { "": 0.02, "twenties": 0.71, "thirties": 0, "teens": 0, "fourties": 0.02, "fifties": 0, "sixties": 0.25, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.82, "male_masculine": 0, "female_feminine": 0.18, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1300, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 8, "size": 65530863, "checksum": "a6ebc8e82f11faa7bb6425a0a897c9f1ebd3fec47a7fd5e26169649a981691ff", "avgDurationSecs": 8.637, 
"validDurationSecs": 328.196, "totalHrs": 3.11, "validHrs": 0.09 }, "qva": { "buckets": { "dev": 200, "test": 326, "train": 473, "validated": 8222, "invalidated": 21, "other": 23 }, "duration": 36399420, "reportedSentences": 0, "validatedSentences": 1000, "unvalidatedSentences": -1, "clips": 8266, "splits": { "accent": {}, "age": { "": 0.11, "twenties": 0.1, "thirties": 0.54, "teens": 0, "fourties": 0, "fifties": 0.12, "sixties": 0.12, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.12, "male_masculine": 0, "female_feminine": 0.88, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8266, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 13, "size": 199004489, "checksum": "3c60e2389d794a655ba4221d0870da6fb2d84475f3f57ea59d1719d54ffcf283", "avgDurationSecs": 4.404, "validDurationSecs": 36205.666, "totalHrs": 10.11, "validHrs": 10.05 }, "qvi": { "buckets": { "dev": 364, "test": 379, "train": 405, "validated": 9266, "invalidated": 237, "other": 693 }, "duration": 45140940, "reportedSentences": 0, "validatedSentences": 1149, "unvalidatedSentences": -1, "clips": 10196, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.26, "thirties": 0.64, "teens": 0.1, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.28, "male_masculine": 0, "female_feminine": 0.72, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 10196, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, 
"users": 15, "size": 262115856, "checksum": "c66207b7a055bb4a2fb3b5b32ef756ecc8b98700f16c48b10a007fa3c0963cf8", "avgDurationSecs": 4.427, "validDurationSecs": 41023.534, "totalHrs": 12.53, "validHrs": 11.39 }, "qvj": { "buckets": { "dev": 349, "test": 348, "train": 362, "validated": 6552, "invalidated": 8, "other": 186 }, "duration": 40215924, "reportedSentences": 0, "validatedSentences": 1059, "unvalidatedSentences": -1, "clips": 6746, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.58, "thirties": 0.02, "teens": 0, "fourties": 0.39, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2, "male_masculine": 0, "female_feminine": 0.8, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6746, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 16, "size": 232491915, "checksum": "298194e4a8a92ea9865ec07e650922d33bde58f9af97edb133e2003439d751c5", "avgDurationSecs": 5.961, "validDurationSecs": 39059.403, "totalHrs": 11.17, "validHrs": 10.84 }, "qvl": { "buckets": { "dev": 222, "test": 321, "train": 457, "validated": 9082, "invalidated": 5, "other": 13 }, "duration": 36219708, "reportedSentences": 0, "validatedSentences": 1000, "unvalidatedSentences": -1, "clips": 9100, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.83, "thirties": 0.11, "teens": 0.06, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.39, "male_masculine": 0, "female_feminine": 0.61, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 9100, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, 
"language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 19, "size": 201100562, "checksum": "19a9cf6a203d105afd934415fdbc2f1c994eba72e190e71816dc12af3117f803", "avgDurationSecs": 3.98, "validDurationSecs": 36148.065, "totalHrs": 10.06, "validHrs": 10.04 }, "qwa": { "buckets": { "dev": 106, "test": 309, "train": 585, "validated": 7215, "invalidated": 1, "other": 1149 }, "duration": 41537916, "reportedSentences": 0, "validatedSentences": 1000, "unvalidatedSentences": -1, "clips": 8365, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.12, "thirties": 0.32, "teens": 0, "fourties": 0.2, "fifties": 0.36, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.6, "male_masculine": 0, "female_feminine": 0.4, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8365, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 13, "size": 220854020, "checksum": "64e1c64eee3ee9bf11cac5bb50305ab89c46571f8267eac2dcdc6a39e47f8b00", "avgDurationSecs": 4.966, "validDurationSecs": 35827.384, "totalHrs": 11.53, "validHrs": 9.95 }, "qws": { "buckets": { "dev": 0, "test": 61, "train": 937, "validated": 8879, "invalidated": 2, "other": 96 }, "duration": 37376136, "reportedSentences": 0, "validatedSentences": 998, "unvalidatedSentences": -1, "clips": 8977, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0.55, "thirties": 0.11, "teens": 0, "fourties": 0.11, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.44, "male_masculine": 0, "female_feminine": 0.56, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8977, 
"agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 12, "size": 211204320, "checksum": "ad179d26a0fb2991f74c7ff467449299b930ab0c877d6dead8edc83c860bd7a5", "avgDurationSecs": 4.164, "validDurationSecs": 36968.109, "totalHrs": 10.38, "validHrs": 10.26 }, "qxa": { "buckets": { "dev": 233, "test": 315, "train": 451, "validated": 8353, "invalidated": 8, "other": 354 }, "duration": 37901772, "reportedSentences": 0, "validatedSentences": 999, "unvalidatedSentences": -1, "clips": 8715, "splits": { "accent": {}, "age": { "": 0.02, "twenties": 0.67, "thirties": 0.2, "teens": 0, "fourties": 0.11, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.45, "male_masculine": 0, "female_feminine": 0.55, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8715, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 17, "size": 207946463, "checksum": "6eee977819be26ccf49a7adc1bbad409f939bec7c0761458b50f9a87de8c9bb3", "avgDurationSecs": 4.349, "validDurationSecs": 36327.424, "totalHrs": 10.52, "validHrs": 10.09 }, "qxp": { "buckets": { "dev": 0, "test": 562, "train": 1502, "validated": 7678, "invalidated": 213, "other": 1142 }, "duration": 41874264, "reportedSentences": 53, "validatedSentences": 2065, "unvalidatedSentences": -1, "clips": 9033, "splits": { "accent": {}, "age": { "": 0.01, "twenties": 0.76, "thirties": 0.23, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.46, 
"male_masculine": 0, "female_feminine": 0.54, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 0, "agriculture_food": 8998, "automotive_transport": 0, "finance": 43, "service_retail": 0, "general": 7043, "healthcare": 7055, "history_law_government": 485, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 3488, "news_current_affairs": 58, "technology_robotics": 0 } }, "users": 14, "size": 241546720, "checksum": "5fb796bef146ced1c7b1469d9df6052d95788089395e23d1cfc8b085d58588c5", "avgDurationSecs": 4.636, "validDurationSecs": 35592.893, "totalHrs": 11.63, "validHrs": 9.88 }, "qxt": { "buckets": { "dev": 60, "test": 321, "train": 616, "validated": 8647, "invalidated": 12, "other": 2469 }, "duration": 47738988, "reportedSentences": 0, "validatedSentences": 998, "unvalidatedSentences": -1, "clips": 11128, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.19, "thirties": 0.18, "teens": 0, "fourties": 0.54, "fifties": 0.09, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.27, "male_masculine": 0, "female_feminine": 0.73, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 11128, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 16, "size": 273941249, "checksum": "d65d89ebe0f3196cc48b2ce63ca0406317c56025ff8e2cee0b62de2e6887c252", "avgDurationSecs": 4.29, "validDurationSecs": 37095.527, "totalHrs": 13.26, "validHrs": 10.3 }, "qxu": { "buckets": { "dev": 0, "test": 325, "train": 673, "validated": 8712, "invalidated": 298, "other": 1899 }, "duration": 47439360, "reportedSentences": 0, "validatedSentences": 998, "unvalidatedSentences": -1, "clips": 10909, "splits": { "accent": {}, "age": { "": 0, "twenties": 
0.47, "thirties": 0.34, "teens": 0.09, "fourties": 0.09, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.46, "male_masculine": 0, "female_feminine": 0.54, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 10909, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 16, "size": 272705534, "checksum": "85407a1826384945d203be826cec7fae126328e2065ac59b56972a7be5dba52d", "avgDurationSecs": 4.349, "validDurationSecs": 37885.389, "totalHrs": 13.17, "validHrs": 10.52 }, "qxw": { "buckets": { "dev": 167, "test": 227, "train": 608, "validated": 8021, "invalidated": 82, "other": 1150 }, "duration": 48633444, "reportedSentences": 0, "validatedSentences": 1002, "unvalidatedSentences": -1, "clips": 9253, "splits": { "accent": {}, "age": { "": 0.02, "twenties": 0.54, "thirties": 0.11, "teens": 0, "fourties": 0.22, "fifties": 0, "sixties": 0, "seventies": 0.11, "eighties": 0, "nineties": 0 }, "gender": { "": 0.46, "male_masculine": 0, "female_feminine": 0.54, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 9253, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 14, "size": 275528033, "checksum": "b71f2844a95506ca883ade6a9a428c3dae5d2bd5a157e702b3218ebcda61f6c5", "avgDurationSecs": 5.256, "validDurationSecs": 42158.095, "totalHrs": 13.5, "validHrs": 11.71 }, "rm-sursilv": { "buckets": { "dev": 1308, "test": 1358, "train": 1723, "validated": 4408, "invalidated": 687, "other": 2409 }, "duration": 
40806869, "reportedSentences": 17, "validatedSentences": 8780, "unvalidatedSentences": 7455, "clips": 7504, "splits": { "accent": {}, "age": { "": 0.61, "twenties": 0.09, "thirties": 0.07, "teens": 0.05, "fourties": 0.18, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.65, "male_masculine": 0.17, "female_feminine": 0.18, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7504, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 91, "size": 306284497, "checksum": "9e793e4d3bf60f5a46a79429a264a247c5d29634493c9b8841d7862c7174a661", "avgDurationSecs": 5.438, "validDurationSecs": 23970.773, "totalHrs": 11.33, "validHrs": 6.65 }, "rm-vallader": { "buckets": { "dev": 500, "test": 463, "train": 557, "validated": 1527, "invalidated": 394, "other": 740 }, "duration": 15519074, "reportedSentences": 38, "validatedSentences": 5913, "unvalidatedSentences": 1172, "clips": 2661, "splits": { "accent": {}, "age": { "": 0.37, "twenties": 0.14, "thirties": 0.06, "teens": 0, "fourties": 0.41, "fifties": 0, "sixties": 0.03, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male_masculine": 0.44, "female_feminine": 0.19, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2661, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 55, "size": 118484047, "checksum": "25f5913911b9a50d7ac146364674f8c4a6fd99bbb8723b0d49622ca112065524", "avgDurationSecs": 5.832, "validDurationSecs": 8905.534, 
"totalHrs": 4.31, "validHrs": 2.47 }, "ro": { "buckets": { "dev": 3932, "test": 3933, "train": 5163, "validated": 19504, "invalidated": 1089, "other": 22972 }, "duration": 176026243, "reportedSentences": 487, "validatedSentences": 13753, "unvalidatedSentences": 89202, "clips": 43565, "splits": { "accent": {}, "age": { "": 0.15, "twenties": 0.59, "thirties": 0.13, "teens": 0.02, "fourties": 0.09, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.17, "male_masculine": 0.7, "female_feminine": 0.13, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 43565, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 455, "size": 1096147542, "checksum": "735e3a1d4730332c05595a9fc1fd05b66c90d35009f6746508309c10402718fa", "avgDurationSecs": 4.041, "validDurationSecs": 78806.745, "totalHrs": 48.89, "validHrs": 21.89 }, "rof": { "buckets": { "dev": 461, "test": 461, "train": 477, "validated": 9557, "invalidated": 835, "other": 0 }, "duration": 41952420, "reportedSentences": 0, "validatedSentences": 1399, "unvalidatedSentences": -1, "clips": 10392, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.35, "thirties": 0.41, "teens": 0, "fourties": 0.23, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male_masculine": 0, "female_feminine": 0.63, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 10392, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, 
"users": 24, "size": 239911457, "checksum": "259834cde0d0ad26651c0b7cea069cf7a81a2b1312244c61d66bd465e439c930", "avgDurationSecs": 4.037, "validDurationSecs": 38581.532, "totalHrs": 11.65, "validHrs": 10.71 }, "ru": { "buckets": { "dev": 10253, "test": 10253, "train": 26721, "validated": 172808, "invalidated": 10535, "other": 17446 }, "duration": 1041474780, "reportedSentences": 573, "validatedSentences": 47830, "unvalidatedSentences": 227, "clips": 200789, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.33, "thirties": 0.18, "teens": 0.09, "fourties": 0.13, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male_masculine": 0.59, "female_feminine": 0.15, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 200672, "agriculture_food": 4, "automotive_transport": 4, "finance": 8, "service_retail": 12, "general": 38, "healthcare": 6, "history_law_government": 44, "language_fundamentals": 4, "media_entertainment": 16, "nature_environment": 5, "news_current_affairs": 1, "technology_robotics": 13 } }, "users": 3591, "size": 6989201054, "checksum": "0e63ddbaf8ee013d863a79df54621e420fd71fbce3ed9d6b78a9217fec76e7bb", "avgDurationSecs": 5.187, "validDurationSecs": 896339.808, "totalHrs": 289.29, "validHrs": 248.98 }, "rup": { "buckets": { "dev": 0, "test": 3, "train": 10, "validated": 13, "invalidated": 2, "other": 15 }, "duration": 235800, "reportedSentences": 0, "validatedSentences": 3507, "unvalidatedSentences": -1, "clips": 30, "splits": { "accent": {}, "age": { "": 0.47, "twenties": 0, "thirties": 0.53, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.47, "male_masculine": 0, "female_feminine": 0.53, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 30, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, 
"history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 6, "size": 1519018, "checksum": "f4edabc8cc45e33008fd7b1ce4cc61416135c3eb96c7f38a07a73bf67c15985b", "avgDurationSecs": 7.86, "validDurationSecs": 102.18, "totalHrs": 0.06, "validHrs": 0.02 }, "rw": { "buckets": { "dev": 15987, "test": 16213, "train": 1003033, "validated": 1439114, "invalidated": 227836, "other": 47256 }, "duration": 8583613673, "reportedSentences": 634, "validatedSentences": 1404857, "unvalidatedSentences": 7, "clips": 1714206, "splits": { "accent": {}, "age": { "": 0.05, "twenties": 0.61, "thirties": 0.12, "teens": 0.2, "fourties": 0.02, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.1, "male_masculine": 0.57, "female_feminine": 0.33, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1714206, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1176, "size": 61199631893, "checksum": "191669748fbd3a3d551b58e5cf24de78c808056b7f1767d60f3bc4ed51fc0b03", "avgDurationSecs": 5.007, "validDurationSecs": 7206134.273, "totalHrs": 2384.33, "validHrs": 2001.7 }, "sah": { "buckets": { "dev": 1747, "test": 1804, "train": 2315, "validated": 9269, "invalidated": 475, "other": 3346 }, "duration": 82929324, "reportedSentences": 26, "validatedSentences": 6076, "unvalidatedSentences": 385, "clips": 13090, "splits": { "accent": {}, "age": { "": 0.31, "twenties": 0.23, "thirties": 0.15, "teens": 0.03, "fourties": 0.1, "fifties": 0.18, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.32, "male_masculine": 0.24, "female_feminine": 0.44, "transgender": 0, 
"non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 13090, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 245, "size": 527127804, "checksum": "36c1525dc102a67bc9777236cabc82d7c6c415c1fd73dad444d6f71648e16ffe", "avgDurationSecs": 6.335, "validDurationSecs": 58722.071, "totalHrs": 23.03, "validHrs": 16.31 }, "sat": { "buckets": { "dev": 68, "test": 127, "train": 333, "validated": 528, "invalidated": 18, "other": 303 }, "duration": 3801708, "reportedSentences": 7, "validatedSentences": 5421, "unvalidatedSentences": 540, "clips": 849, "splits": { "accent": {}, "age": { "": 0.42, "twenties": 0.42, "thirties": 0.12, "teens": 0.01, "fourties": 0.01, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.4, "male_masculine": 0.59, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 849, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 13, "size": 21352529, "checksum": "dc8c771a273e472ee9cd1d0a2d9d3f3c94b8db44f67c39db1d67f5f4819e76e1", "avgDurationSecs": 4.478, "validDurationSecs": 2364.313, "totalHrs": 1.05, "validHrs": 0.65 }, "sbn": { "buckets": { "dev": 0, "test": 247, "train": 1754, "validated": 10957, "invalidated": 12, "other": 18 }, "duration": 38755044, "reportedSentences": 0, "validatedSentences": 2001, "unvalidatedSentences": -1, "clips": 10987, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.55, "thirties": 0, "teens": 0.19, "fourties": 0.26, "fifties": 0, 
"sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 10987, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 21, "size": 226541261, "checksum": "bd63ba1440b66040ad635b8ad4489a3dcfeec502d856f0d4d6428a51df1e8022", "avgDurationSecs": 3.527, "validDurationSecs": 38649.223, "totalHrs": 10.76, "validHrs": 10.73 }, "sc": { "buckets": { "dev": 547, "test": 620, "train": 925, "validated": 2266, "invalidated": 118, "other": 391 }, "duration": 13034952, "reportedSentences": 36, "validatedSentences": 5473, "unvalidatedSentences": 361, "clips": 2775, "splits": { "accent": {}, "age": { "": 0.44, "twenties": 0.09, "thirties": 0.29, "teens": 0, "fourties": 0.01, "fifties": 0.15, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.54, "male_masculine": 0.13, "female_feminine": 0.34, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2765, "agriculture_food": 2, "automotive_transport": 1, "finance": 1, "service_retail": 0, "general": 4, "healthcare": 1, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 1, "news_current_affairs": 2, "technology_robotics": 0 } }, "users": 35, "size": 75759776, "checksum": "625d478be019ae136593ec8a0ecf7b125c471736afded5fcc65f63e65a32ccf2", "avgDurationSecs": 4.697, "validDurationSecs": 10644.036, "totalHrs": 3.62, "validHrs": 2.95 }, "scl": { "buckets": { "dev": 862, "test": 984, "train": 1407, "validated": 8876, "invalidated": 144, "other": 2 }, "duration": 37134540, "reportedSentences": 0, "validatedSentences": 3300, 
"unvalidatedSentences": 4, "clips": 9022, "splits": { "accent": {}, "age": { "": 0.16, "twenties": 0.02, "thirties": 0.82, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.98, "male_masculine": 0, "female_feminine": 0.02, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 9018, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 4, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 39, "size": 217078538, "checksum": "b4102e75d6ba434b732ebde2293c6047c48bb28711dee43bc2a83c3a417f9959", "avgDurationSecs": 4.116, "validDurationSecs": 36533.604, "totalHrs": 10.31, "validHrs": 10.14 }, "sd": { "buckets": { "dev": 0, "test": 41, "train": 274, "validated": 315, "invalidated": 164, "other": 29684 }, "duration": 124617852, "reportedSentences": 6, "validatedSentences": 13346, "unvalidatedSentences": 73, "clips": 30163, "splits": { "accent": {}, "age": { "": 0.02, "twenties": 0, "thirties": 0.13, "teens": 0, "fourties": 0.85, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.87, "male_masculine": 0.13, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 30086, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 75, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 2, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 24, "size": 651447158, "checksum": "e968c076b37ebc0bd09570698f02a3060f480d928eb29a7988dfbdeeadb69ec6", "avgDurationSecs": 4.131, "validDurationSecs": 1301.416, "totalHrs": 34.61, "validHrs": 0.36 }, "sei": { "buckets": { "dev": 435, "test": 452, 
"train": 728, "validated": 8006, "invalidated": 254, "other": 0 }, "duration": 37573452, "reportedSentences": 0, "validatedSentences": 1615, "unvalidatedSentences": -1, "clips": 8260, "splits": { "accent": {}, "age": { "": 0.01, "twenties": 0.23, "thirties": 0.27, "teens": 0, "fourties": 0.49, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.18, "male_masculine": 0, "female_feminine": 0.81, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.01 }, "sentence_domain": { "": 8260, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 16, "size": 218636155, "checksum": "2fa022f3ab1a43a1713fd99faf5bed8d2c2fb7be0fade51549f13ea0cfb6ddd7", "avgDurationSecs": 4.549, "validDurationSecs": 36418.046, "totalHrs": 10.43, "validHrs": 10.11 }, "sk": { "buckets": { "dev": 5012, "test": 5060, "train": 7410, "validated": 43432, "invalidated": 1213, "other": 531 }, "duration": 192821076, "reportedSentences": 344, "validatedSentences": 20283, "unvalidatedSentences": 2703, "clips": 45176, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0.07, "thirties": 0.18, "teens": 0.05, "fourties": 0.13, "fifties": 0.34, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.6, "male_masculine": 0.36, "female_feminine": 0.04, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 33939, "agriculture_food": 2, "automotive_transport": 262, "finance": 2, "service_retail": 0, "general": 10948, "healthcare": 4, "history_law_government": 6808, "language_fundamentals": 90, "media_entertainment": 351, "nature_environment": 235, "news_current_affairs": 246, "technology_robotics": 840 } }, "users": 266, "size": 1112080665, "checksum": 
"49bf35f7a32b28f17c467469eae69345e7c95448607b036360022daa9ee41a2e", "avgDurationSecs": 4.268, "validDurationSecs": 185377.302, "totalHrs": 53.56, "validHrs": 51.49 }, "skr": { "buckets": { "dev": 1143, "test": 1037, "train": 1556, "validated": 3748, "invalidated": 467, "other": 1622 }, "duration": 24307020, "reportedSentences": 17, "validatedSentences": 6011, "unvalidatedSentences": 93, "clips": 5837, "splits": { "accent": {}, "age": { "": 0.35, "twenties": 0.24, "thirties": 0.01, "teens": 0.02, "fourties": 0, "fifties": 0.37, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male_masculine": 0.63, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5837, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 60, "size": 139958778, "checksum": "1e50951bac423fe3900b54da61ba3724a76ad41826cb61228dea0002ba00a82f", "avgDurationSecs": 4.164, "validDurationSecs": 15607.797, "totalHrs": 6.75, "validHrs": 4.33 }, "sl": { "buckets": { "dev": 1333, "test": 1340, "train": 1467, "validated": 15609, "invalidated": 509, "other": 3382 }, "duration": 78014314, "reportedSentences": 55, "validatedSentences": 4206, "unvalidatedSentences": 31, "clips": 19500, "splits": { "accent": {}, "age": { "": 0.37, "twenties": 0.31, "thirties": 0.03, "teens": 0.05, "fourties": 0.15, "fifties": 0.05, "sixties": 0.04, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.39, "male_masculine": 0.52, "female_feminine": 0.1, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 19498, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 1, "healthcare": 0, "history_law_government": 1, 
"language_fundamentals": 0, "media_entertainment": 1, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1000, "size": 511197400, "checksum": "2963df5019ecad40492de820f1f60a2a7842aceb51a233e757f0a3685b821443", "avgDurationSecs": 4.001, "validDurationSecs": 62447.458, "totalHrs": 21.67, "validHrs": 17.34 }, "sq": { "buckets": { "dev": 1645, "test": 1936, "train": 2658, "validated": 6249, "invalidated": 181, "other": 7 }, "duration": 32872032, "reportedSentences": 134, "validatedSentences": 50009, "unvalidatedSentences": 2634, "clips": 6437, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.08, "thirties": 0.15, "teens": 0, "fourties": 0.18, "fifties": 0.35, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.39, "male_masculine": 0.47, "female_feminine": 0.14, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1576, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 1789, "healthcare": 0, "history_law_government": 3050, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 22, "technology_robotics": 0 } }, "users": 145, "size": 195939392, "checksum": "f71f3f65a9aa06d72a588aeb0514be9f48233fab311a3c1839ab03343d769ba8", "avgDurationSecs": 5.107, "validDurationSecs": 31911.966, "totalHrs": 9.13, "validHrs": 8.86 }, "sr": { "buckets": { "dev": 1908, "test": 1983, "train": 2336, "validated": 8245, "invalidated": 424, "other": 4854 }, "duration": 43811676, "reportedSentences": 82, "validatedSentences": 7898, "unvalidatedSentences": 753, "clips": 13523, "splits": { "accent": {}, "age": { "": 0.15, "twenties": 0.25, "thirties": 0.11, "teens": 0, "fourties": 0.1, "fifties": 0.13, "sixties": 0.26, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.53, "male_masculine": 0.32, "female_feminine": 0.15, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 
}, "sentence_domain": { "": 13481, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 13, "healthcare": 0, "history_law_government": 1, "language_fundamentals": 1, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 28 } }, "users": 182, "size": 245821661, "checksum": "327767a6cf4570b3259a3cfead66657b83cc7b555e883a0a5d11d5fe56d057b8", "avgDurationSecs": 3.24, "validDurationSecs": 26712.066, "totalHrs": 12.16, "validHrs": 7.42 }, "ss": { "buckets": { "dev": 0, "test": 0, "train": 0, "validated": 0, "invalidated": 0, "other": 111 }, "duration": 825840, "reportedSentences": 0, "validatedSentences": 1614, "unvalidatedSentences": 6796, "clips": 111, "splits": { "accent": {}, "age": { "": 0.45, "twenties": 0.55, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 111, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 2, "size": 5075362, "checksum": "fc7b18540dc41b96c56d10d336912a2ba9e3ec862a6aea600b5cf414a48bbeb3", "avgDurationSecs": 7.44, "validDurationSecs": 0, "totalHrs": 0.22, "validHrs": 0 }, "ssi": { "buckets": { "dev": 0, "test": 161, "train": 1845, "validated": 10188, "invalidated": 12, "other": 0 }, "duration": 37951740, "reportedSentences": 0, "validatedSentences": 2007, "unvalidatedSentences": -1, "clips": 10200, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.6, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0.39, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.99, 
"male_masculine": 0, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 10200, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 21, "size": 221558780, "checksum": "9e1b223f58c57a34d5a151ba49cfc832649c6a43f43d2af681eb9163d7b04758", "avgDurationSecs": 3.721, "validDurationSecs": 37907.091, "totalHrs": 10.54, "validHrs": 10.52 }, "st": { "buckets": { "dev": 0, "test": 0, "train": 0, "validated": 0, "invalidated": 0, "other": 209 }, "duration": 673668, "reportedSentences": 0, "validatedSentences": 2339, "unvalidatedSentences": 14836, "clips": 209, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.29, "thirties": 0.71, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.29, "male_masculine": 0.71, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 209, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 2, "size": 4960974, "checksum": "fb6fa9e2553e0013441ed66ed552e5b97a85d9ad33bd40828d87e6b8f9f0a2d5", "avgDurationSecs": 3.223, "validDurationSecs": 0, "totalHrs": 0.18, "validHrs": 0 }, "sv-SE": { "buckets": { "dev": 5470, "test": 5476, "train": 8192, "validated": 42460, "invalidated": 1597, "other": 6031 }, "duration": 201521895, "reportedSentences": 597, "validatedSentences": 26232, "unvalidatedSentences": 7071, "clips": 50088, "splits": { "accent": {}, "age": { "": 0.19, "twenties": 0.12, "thirties": 0.25, 
"teens": 0.03, "fourties": 0.37, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.21, "male_masculine": 0.47, "female_feminine": 0.32, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 50085, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 2, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 1, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 883, "size": 1273393153, "checksum": "39ca02d3975a9fa782bb4694bea1ccd1b652c905a48534b56316067814cd5bdf", "avgDurationSecs": 4.023, "validDurationSecs": 170831.729, "totalHrs": 55.97, "validHrs": 47.45 }, "sva": { "buckets": { "dev": 467, "test": 503, "train": 567, "validated": 9567, "invalidated": 1376, "other": 16 }, "duration": 64961820, "reportedSentences": 9, "validatedSentences": 1538, "unvalidatedSentences": 11, "clips": 10959, "splits": { "accent": {}, "age": { "": 0.07, "twenties": 0.1, "thirties": 0.2, "teens": 0, "fourties": 0.24, "fifties": 0.21, "sixties": 0.13, "seventies": 0.04, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2, "male_masculine": 0, "female_feminine": 0.8, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 10593, "agriculture_food": 108, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 96, "healthcare": 162, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 55, "size": 369403813, "checksum": "91c4010917797b73658d93fe8bb8e01a17e05cc60e20e8f280ba98e931bd0757", "avgDurationSecs": 5.928, "validDurationSecs": 56710.442, "totalHrs": 18.04, "validHrs": 15.75 }, "sw": { "buckets": { "dev": 12258, "test": 12262, "train": 46618, "validated": 267770, "invalidated": 80832, "other": 376573 }, "duration": 
3906322308, "reportedSentences": 1902, "validatedSentences": 134725, "unvalidatedSentences": 5758, "clips": 725175, "splits": { "accent": {}, "age": { "": 0.31, "twenties": 0.5, "thirties": 0.11, "teens": 0.01, "fourties": 0.03, "fifties": 0.04, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.29, "male_masculine": 0.34, "female_feminine": 0.37, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 725175, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1481, "size": 22800067743, "checksum": "f9dd7e77ea5802983a18eae79ebefe1cccb9b0176f77aec8138ce604b3b62f31", "avgDurationSecs": 5.387, "validDurationSecs": 1442404.833, "totalHrs": 1085.08, "validHrs": 400.66 }, "szy": { "buckets": { "dev": 257, "test": 264, "train": 280, "validated": 9158, "invalidated": 485, "other": 0 }, "duration": 52040448, "reportedSentences": 0, "validatedSentences": 801, "unvalidatedSentences": 2, "clips": 9643, "splits": { "accent": {}, "age": { "": 0.49, "twenties": 0.13, "thirties": 0.08, "teens": 0, "fourties": 0, "fifties": 0.11, "sixties": 0.1, "seventies": 0.08, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male_masculine": 0, "female_feminine": 0.35, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.39 }, "sentence_domain": { "": 27, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 9825, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 9, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 26, "size": 294397447, "checksum": "bf9db953687f3b18344255c9e1f30f01f788b648ebd0367cd92f3fc10dd0f072", "avgDurationSecs": 5.397, 
"validDurationSecs": 49423.045, "totalHrs": 14.45, "validHrs": 13.72 }, "ta": { "buckets": { "dev": 12150, "test": 12240, "train": 46401, "validated": 136567, "invalidated": 5741, "other": 105245 }, "duration": 1534119900, "reportedSentences": 3438, "validatedSentences": 118260, "unvalidatedSentences": 577, "clips": 247553, "splits": { "accent": {}, "age": { "": 0.66, "twenties": 0.11, "thirties": 0.12, "teens": 0.04, "fourties": 0.03, "fifties": 0.02, "sixties": 0, "seventies": 0.02, "eighties": 0, "nineties": 0 }, "gender": { "": 0.66, "male_masculine": 0.19, "female_feminine": 0.15, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 247553, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 972, "size": 9195589375, "checksum": "1ab75879e1667ac89090edc71120d2c0e8dc3ccd21e9f0e14479cdb20db45803", "avgDurationSecs": 6.197, "validDurationSecs": 846324.433, "totalHrs": 426.14, "validHrs": 235.09 }, "tar": { "buckets": { "dev": 406, "test": 404, "train": 430, "validated": 7889, "invalidated": 23, "other": 10 }, "duration": 36039168, "reportedSentences": 32, "validatedSentences": 1258, "unvalidatedSentences": -1, "clips": 7922, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.11, "thirties": 0.47, "teens": 0, "fourties": 0.38, "fifties": 0.04, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.34, "male_masculine": 0, "female_feminine": 0.66, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7922, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, 
"news_current_affairs": 0, "technology_robotics": 0 } }, "users": 17, "size": 211008035, "checksum": "c763d9a62d0b8a9eb6ed657c131720d1657cdde0611571d1ac021668f4058486", "avgDurationSecs": 4.549, "validDurationSecs": 35889.043, "totalHrs": 10.01, "validHrs": 9.96 }, "tay": { "buckets": { "dev": 452, "test": 851, "train": 1299, "validated": 7446, "invalidated": 411, "other": 0 }, "duration": 44092728, "reportedSentences": 0, "validatedSentences": 3790, "unvalidatedSentences": 77, "clips": 7857, "splits": { "accent": {}, "age": { "": 0.33, "twenties": 0.22, "thirties": 0, "teens": 0, "fourties": 0.13, "fifties": 0.18, "sixties": 0.15, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.22, "male_masculine": 0, "female_feminine": 0.46, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.33 }, "sentence_domain": { "": 291, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 9021, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 18, "size": 260208056, "checksum": "0aebe9a0c3000b91b459ab9c4e15b4f045e8bbb202c70c00e17c5cd97e2a68e3", "avgDurationSecs": 5.612, "validDurationSecs": 41786.236, "totalHrs": 12.24, "validHrs": 11.6 }, "te": { "buckets": { "dev": 61, "test": 65, "train": 79, "validated": 353, "invalidated": 57, "other": 2032 }, "duration": 10221804, "reportedSentences": 15, "validatedSentences": 330, "unvalidatedSentences": 19295, "clips": 2442, "splits": { "accent": {}, "age": { "": 0.13, "twenties": 0.69, "thirties": 0.02, "teens": 0.07, "fourties": 0.01, "fifties": 0, "sixties": 0.07, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.18, "male_masculine": 0.65, "female_feminine": 0.17, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2442, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, 
"service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 59, "size": 61072694, "checksum": "de11e7cf726336dd6084153a36e10d3de2ebe6640495478c10f655085629e86d", "avgDurationSecs": 4.186, "validDurationSecs": 1477.599, "totalHrs": 2.83, "validHrs": 0.41 }, "tg": { "buckets": { "dev": 123, "test": 119, "train": 223, "validated": 465, "invalidated": 42, "other": 0 }, "duration": 2526804, "reportedSentences": 18, "validatedSentences": 3846, "unvalidatedSentences": 6925, "clips": 507, "splits": { "accent": {}, "age": { "": 0.03, "twenties": 0.97, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.79, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.21 }, "sentence_domain": { "": 36, "agriculture_food": 7, "automotive_transport": 9, "finance": 12, "service_retail": 14, "general": 77, "healthcare": 11, "history_law_government": 22, "language_fundamentals": 240, "media_entertainment": 135, "nature_environment": 5, "news_current_affairs": 43, "technology_robotics": 8 } }, "users": 7, "size": 15174834, "checksum": "2b981cab0e140404aa1f05e102d0decfdb9eaad16bd364c3bf79f513bb2977d7", "avgDurationSecs": 4.984, "validDurationSecs": 2317.483, "totalHrs": 0.7, "validHrs": 0.64 }, "th": { "buckets": { "dev": 11058, "test": 11058, "train": 32969, "validated": 148686, "invalidated": 9355, "other": 208326 }, "duration": 1536288528, "reportedSentences": 4318, "validatedSentences": 64490, "unvalidatedSentences": 121701, "clips": 366367, "splits": { "accent": {}, "age": { "": 0.4, "twenties": 0.24, "thirties": 0.07, "teens": 0.04, "fourties": 0.04, "fifties": 0.2, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.41, "male_masculine": 0.42, "female_feminine": 
0.17, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 366367, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 7977, "size": 8964045754, "checksum": "60141dc0b31a4b26c1b8829a61f2c5f7765f21ce721a8e771f92382878230e7a", "avgDurationSecs": 4.193, "validDurationSecs": 623485.729, "totalHrs": 426.74, "validHrs": 173.19 }, "ti": { "buckets": { "dev": 20, "test": 16, "train": 30, "validated": 66, "invalidated": 6, "other": 379 }, "duration": 2342484, "reportedSentences": 3, "validatedSentences": 4570, "unvalidatedSentences": 111, "clips": 451, "splits": { "accent": {}, "age": { "": 0.18, "twenties": 0.31, "thirties": 0.03, "teens": 0, "fourties": 0.49, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.96, "male_masculine": 0.04, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 451, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 16, "size": 14067673, "checksum": "19817f6ac29f509a95ee3f8d16d3bb8529857ae153832983d3d6b8cb476cb84f", "avgDurationSecs": 5.194, "validDurationSecs": 342.803, "totalHrs": 0.65, "validHrs": 0.09 }, "tig": { "buckets": { "dev": 1617, "test": 1607, "train": 1990, "validated": 6427, "invalidated": 245, "other": 82 }, "duration": 42874416, "reportedSentences": 37, "validatedSentences": 5417, "unvalidatedSentences": 88, "clips": 6754, "splits": { "accent": {}, "age": { "": 0.7, "twenties": 0.21, "thirties": 0.1, "teens": 0, "fourties": 0, 
"fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.99, "male_masculine": 0.01, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6741, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 6, "healthcare": 0, "history_law_government": 6, "language_fundamentals": 1, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 149, "size": 229943430, "checksum": "83740a7b6f4e209d0b547d45908225d5bec4ab71f10011e48e432545dd294c42", "avgDurationSecs": 6.348, "validDurationSecs": 40798.619, "totalHrs": 11.9, "validHrs": 11.33 }, "tk": { "buckets": { "dev": 447, "test": 570, "train": 741, "validated": 1972, "invalidated": 131, "other": 2628 }, "duration": 25926264, "reportedSentences": 27, "validatedSentences": 2494, "unvalidatedSentences": 238, "clips": 4731, "splits": { "accent": {}, "age": { "": 0.35, "twenties": 0.54, "thirties": 0.08, "teens": 0, "fourties": 0.02, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.39, "male_masculine": 0.43, "female_feminine": 0.17, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 4731, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 127, "size": 145880930, "checksum": "67ef64d068b9d90225d1d2f7c54b66fd3709d7b44e64d7e37ce52e9015e00ae1", "avgDurationSecs": 5.48, "validDurationSecs": 10806.72, "totalHrs": 7.2, "validHrs": 3 }, "tli": { "buckets": { "dev": 0, "test": 524, "train": 2316, "validated": 2840, "invalidated": 0, "other": 0 }, "duration": 36638964, "reportedSentences": 0, "validatedSentences": 7863, 
"unvalidatedSentences": 10, "clips": 2840, "splits": { "accent": {}, "age": { "": 0.82, "twenties": 0, "thirties": 0.18, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2840, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 3, "size": 217520347, "checksum": "e6ea4a3f3061cc723ee1e0c21d5b77b05c50c25dbdd0010eee350fac251c1ebe", "avgDurationSecs": 12.901, "validDurationSecs": 36638.964, "totalHrs": 10.17, "validHrs": 10.17 }, "tn": { "buckets": { "dev": 312, "test": 369, "train": 1078, "validated": 3503, "invalidated": 11, "other": 535 }, "duration": 17693028, "reportedSentences": 1, "validatedSentences": 1765, "unvalidatedSentences": 15342, "clips": 4049, "splits": { "accent": {}, "age": { "": 0.03, "twenties": 0.02, "thirties": 0.51, "teens": 0, "fourties": 0, "fifties": 0.43, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.48, "male_masculine": 0, "female_feminine": 0.48, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.04 }, "sentence_domain": { "": 4049, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 17, "size": 89813782, "checksum": "186bc8a1b1dd72c2cb5fd28f98b2d32bca8f56bbcfd3314b5a12cc9202d7d049", "avgDurationSecs": 4.37, "validDurationSecs": 15307.157, "totalHrs": 4.91, "validHrs": 4.25 }, "tok": { "buckets": { "dev": 2251, "test": 2248, 
"train": 2635, "validated": 12950, "invalidated": 399, "other": 5872 }, "duration": 82709352, "reportedSentences": 165, "validatedSentences": 7498, "unvalidatedSentences": 2246, "clips": 19221, "splits": { "accent": {}, "age": { "": 0.31, "twenties": 0.19, "thirties": 0.06, "teens": 0.21, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0.23, "nineties": 0 }, "gender": { "": 0.41, "male_masculine": 0.52, "female_feminine": 0.05, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.03 }, "sentence_domain": { "": 19219, "agriculture_food": 0, "automotive_transport": 0, "finance": 2, "service_retail": 0, "general": 2, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 171, "size": 483982678, "checksum": "141e2f3ec45fa0b81b98f2e98b90cddcd2c97e1b29e53a823453ee7020a6ae3a", "avgDurationSecs": 4.303, "validDurationSecs": 55724.786, "totalHrs": 22.97, "validHrs": 15.47 }, "tr": { "buckets": { "dev": 11786, "test": 11790, "train": 40452, "validated": 120429, "invalidated": 4912, "other": 146 }, "duration": 480786147, "reportedSentences": 479, "validatedSentences": 410402, "unvalidatedSentences": 3471, "clips": 125487, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.23, "thirties": 0.09, "teens": 0.02, "fourties": 0.03, "fifties": 0.07, "sixties": 0.2, "seventies": 0.03, "eighties": 0.1, "nineties": 0 }, "gender": { "": 0.26, "male_masculine": 0.42, "female_feminine": 0.32, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 125472, "agriculture_food": 0, "automotive_transport": 1, "finance": 0, "service_retail": 0, "general": 1, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 12, "media_entertainment": 1, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1781, "size": 2933767497, "checksum": 
"2c72f91639f400d2123206fbe76fdd6f3a8bfc6e44e69385466802eed7049afc", "avgDurationSecs": 3.831, "validDurationSecs": 461407.117, "totalHrs": 133.55, "validHrs": 128.16 }, "trv": { "buckets": { "dev": 558, "test": 529, "train": 885, "validated": 6386, "invalidated": 104, "other": 0 }, "duration": 36415152, "reportedSentences": 1, "validatedSentences": 1974, "unvalidatedSentences": 2, "clips": 6490, "splits": { "accent": {}, "age": { "": 0.93, "twenties": 0.07, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.68, "male_masculine": 0.07, "female_feminine": 0.13, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.12 }, "sentence_domain": { "": 465, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 6276, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 10, "size": 214898901, "checksum": "386f7c82350fcbb142a630d50a8a71cdd41b7982e22e4670a8d43a594b9eb221", "avgDurationSecs": 5.611, "validDurationSecs": 35831.612, "totalHrs": 10.11, "validHrs": 9.95 }, "trw": { "buckets": { "dev": 2083, "test": 1956, "train": 2859, "validated": 11054, "invalidated": 1372, "other": 14 }, "duration": 67715892, "reportedSentences": 578, "validatedSentences": 7770, "unvalidatedSentences": 27, "clips": 12440, "splits": { "accent": {}, "age": { "": 0.05, "twenties": 0.34, "thirties": 0.11, "teens": 0.04, "fourties": 0.36, "fifties": 0.09, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 12245, "agriculture_food": 57, "automotive_transport": 2, "finance": 2, "service_retail": 57, "general": 55, "healthcare": 49, "history_law_government": 49, 
"language_fundamentals": 17, "media_entertainment": 1, "nature_environment": 63, "news_current_affairs": 67, "technology_robotics": 0 } }, "users": 27, "size": 397063300, "checksum": "e4bb8538b55a436d582982a3f113dbd6823e1aab9b6350341677b6acc26b4d5c", "avgDurationSecs": 5.443, "validDurationSecs": 60171.34, "totalHrs": 18.8, "validHrs": 16.71 }, "ts": { "buckets": { "dev": 0, "test": 0, "train": 0, "validated": 0, "invalidated": 1, "other": 0 }, "duration": 8928, "reportedSentences": 0, "validatedSentences": 352, "unvalidatedSentences": 16012, "clips": 1, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1, "size": 1040821, "checksum": "09925723ee487a4276667e268c80a8569147b40de03fbbebf42836051129fec8", "avgDurationSecs": 8.928, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "tt": { "buckets": { "dev": 4009, "test": 5156, "train": 8813, "validated": 29815, "invalidated": 467, "other": 613 }, "duration": 118394054, "reportedSentences": 6, "validatedSentences": 18051, "unvalidatedSentences": 32, "clips": 30895, "splits": { "accent": {}, "age": { "": 0.21, "twenties": 0.05, "thirties": 0.71, "teens": 0, "fourties": 0, "fifties": 0.01, "sixties": 0.01, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.21, "male_masculine": 0.76, "female_feminine": 0.03, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 30883, "agriculture_food": 0, 
"automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 12, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 285, "size": 863180776, "checksum": "2198e6f30c0dd78063f229bbfd7d6ede37d55ac8c22ae1c2fee650300d10e8f4", "avgDurationSecs": 3.832, "validDurationSecs": 114255.34, "totalHrs": 32.88, "validHrs": 31.73 }, "tui": { "buckets": { "dev": 604, "test": 641, "train": 722, "validated": 7534, "invalidated": 126, "other": 911 }, "duration": 42207192, "reportedSentences": 6, "validatedSentences": 1971, "unvalidatedSentences": 5, "clips": 8571, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0.21, "thirties": 0.52, "teens": 0, "fourties": 0.04, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.93, "male_masculine": 0.01, "female_feminine": 0.06, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8571, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 48, "size": 247205474, "checksum": "9193ce4796c645909243f9c653fb5730de79489909bce85452dee4b7406c9bf0", "avgDurationSecs": 4.924, "validDurationSecs": 37100.57, "totalHrs": 11.72, "validHrs": 10.3 }, "tvu": { "buckets": { "dev": 495, "test": 438, "train": 725, "validated": 5247, "invalidated": 122, "other": 65 }, "duration": 38085048, "reportedSentences": 24, "validatedSentences": 1943, "unvalidatedSentences": 347, "clips": 5434, "splits": { "accent": {}, "age": { "": 0.8, "twenties": 0, "thirties": 0.2, "teens": 0, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.81, "male_masculine": 
0, "female_feminine": 0.19, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5434, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 13, "size": 204881413, "checksum": "80a386d618d48a592d0cbd5e736f66fb0097802713b9ad229760cc71f2c6dc9d", "avgDurationSecs": 7.009, "validDurationSecs": 36774.429, "totalHrs": 10.57, "validHrs": 10.21 }, "tw": { "buckets": { "dev": 0, "test": 21, "train": 208, "validated": 229, "invalidated": 1, "other": 57 }, "duration": 1249164, "reportedSentences": 3, "validatedSentences": 49159, "unvalidatedSentences": 19, "clips": 287, "splits": { "accent": {}, "age": { "": 0.2, "twenties": 0.05, "thirties": 0.75, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.23, "male_masculine": 0.77, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 287, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 13, "size": 10105461, "checksum": "00142f76c74a6aa2d5f48c42ad718db2f2a8a8a44e28b5d909ade3f94ce693f4", "avgDurationSecs": 4.352, "validDurationSecs": 996.72, "totalHrs": 0.34, "validHrs": 0.27 }, "udl": { "buckets": { "dev": 285, "test": 329, "train": 396, "validated": 6400, "invalidated": 112, "other": 32 }, "duration": 38688012, "reportedSentences": 1, "validatedSentences": 1011, "unvalidatedSentences": -1, "clips": 6544, "splits": { "accent": {}, "age": { "": 0.85, "twenties": 0.15, "thirties": 0, "teens": 0, "fourties": 
0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6544, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 20, "size": 227560219, "checksum": "fbf912d88a72754b8322c7bda475eab32d9567cf7a3f3c54e7dc202ec3ea51df", "avgDurationSecs": 5.912, "validDurationSecs": 37836.687, "totalHrs": 10.74, "validHrs": 10.51 }, "ug": { "buckets": { "dev": 14613, "test": 14613, "train": 119968, "validated": 266325, "invalidated": 20904, "other": 14331 }, "duration": 1788503436, "reportedSentences": 1510, "validatedSentences": 231285, "unvalidatedSentences": 8330, "clips": 301560, "splits": { "accent": {}, "age": { "": 0.21, "twenties": 0.34, "thirties": 0.27, "teens": 0.06, "fourties": 0.1, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.53, "male_masculine": 0.27, "female_feminine": 0.2, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.01, "intersex": 0 }, "sentence_domain": { "": 299063, "agriculture_food": 2, "automotive_transport": 2, "finance": 1, "service_retail": 0, "general": 2468, "healthcare": 27, "history_law_government": 3, "language_fundamentals": 9, "media_entertainment": 12, "nature_environment": 1, "news_current_affairs": 0, "technology_robotics": 10 } }, "users": 1814, "size": 10198361542, "checksum": "b56ba336fb9fe67332fead65ca25daf10cc05574b585ef11a8d20c7c0d19c5ba", "avgDurationSecs": 5.931, "validDurationSecs": 1579530.367, "totalHrs": 496.8, "validHrs": 438.75 }, "uk": { "buckets": { "dev": 10272, "test": 10270, "train": 26860, "validated": 77743, "invalidated": 3342, "other": 8537 }, "duration": 
416124600, "reportedSentences": 655, "validatedSentences": 212667, "unvalidatedSentences": 1611, "clips": 89622, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0.34, "thirties": 0.24, "teens": 0.09, "fourties": 0.09, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.24, "male_masculine": 0.59, "female_feminine": 0.17, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 89622, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1158, "size": 2732925042, "checksum": "41a1faa7f637b6269a439500b48c2d42cd3317e8caa12dba6f8b344016450f04", "avgDurationSecs": 4.643, "validDurationSecs": 360969.123, "totalHrs": 115.59, "validHrs": 100.26 }, "ur": { "buckets": { "dev": 5088, "test": 5088, "train": 7336, "validated": 68207, "invalidated": 10990, "other": 173690 }, "duration": 1087202412, "reportedSentences": 411, "validatedSentences": 21701, "unvalidatedSentences": 48262, "clips": 252887, "splits": { "accent": {}, "age": { "": 0.06, "twenties": 0.92, "thirties": 0, "teens": 0, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.23, "male_masculine": 0.52, "female_feminine": 0.25, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 252780, "agriculture_food": 12, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 48, "healthcare": 47, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 495, "size": 6347074260, "checksum": "f43b1c7e98dc77abdca6657d5223641a28fce8e97a8e6482a34ffa4088d06365", "avgDurationSecs": 4.299, 
"validDurationSecs": 293233.005, "totalHrs": 302, "validHrs": 81.45 }, "ush": { "buckets": { "dev": 195, "test": 357, "train": 586, "validated": 3861, "invalidated": 82, "other": 18 }, "duration": 24351012, "reportedSentences": 0, "validatedSentences": 1170, "unvalidatedSentences": -1, "clips": 3961, "splits": { "accent": {}, "age": { "": 0.15, "twenties": 0.23, "thirties": 0.01, "teens": 0.02, "fourties": 0.6, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 3958, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 3, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 23, "size": 135804231, "checksum": "ae70d51c220252106f91c88412be81ffc21de5356d4ee194cdc25b71d129e546", "avgDurationSecs": 6.148, "validDurationSecs": 23736.243, "totalHrs": 6.76, "validHrs": 6.59 }, "uz": { "buckets": { "dev": 12260, "test": 12371, "train": 48738, "validated": 87103, "invalidated": 14117, "other": 128485 }, "duration": 954887796, "reportedSentences": 1833, "validatedSentences": 148734, "unvalidatedSentences": 137880, "clips": 229705, "splits": { "accent": {}, "age": { "": 0.41, "twenties": 0.4, "thirties": 0.01, "teens": 0.18, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.41, "male_masculine": 0.44, "female_feminine": 0.15, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 229705, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, 
"technology_robotics": 0 } }, "users": 2272, "size": 5358745747, "checksum": "b362c5b3a2c1a4ccb59a07856f2512aad1f9775667b3d40655b2400bfc21ac38", "avgDurationSecs": 4.157, "validDurationSecs": 362088.73, "totalHrs": 265.24, "validHrs": 100.58 }, "var": { "buckets": { "dev": 546, "test": 611, "train": 752, "validated": 6851, "invalidated": 291, "other": 864 }, "duration": 42487380, "reportedSentences": 246, "validatedSentences": 1977, "unvalidatedSentences": -1, "clips": 8006, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.29, "thirties": 0.57, "teens": 0.14, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.06, "male_masculine": 0, "female_feminine": 0.94, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8006, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 12, "size": 247654870, "checksum": "8bfe440598c5573e685165b3a615dbb8678ae48f0efc96b3b3dc10754d11b1a3", "avgDurationSecs": 5.307, "validDurationSecs": 36357.862, "totalHrs": 11.8, "validHrs": 10.09 }, "ve": { "buckets": { "dev": 0, "test": 0, "train": 0, "validated": 0, "invalidated": 0, "other": 20 }, "duration": 113076, "reportedSentences": 0, "validatedSentences": 1735, "unvalidatedSentences": 7843, "clips": 20, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0.75, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.25, "male_masculine": 0, "female_feminine": 0.75, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 20, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, 
"history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 2, "size": 1206517, "checksum": "5f5b324ca9832de611ab1d06fff55c2cc19703ab753dd3baaa3c95bd5b8f3fda", "avgDurationSecs": 5.654, "validDurationSecs": 0, "totalHrs": 0.03, "validHrs": 0 }, "vi": { "buckets": { "dev": 933, "test": 1373, "train": 2099, "validated": 5771, "invalidated": 437, "other": 12569 }, "duration": 74502008, "reportedSentences": 201, "validatedSentences": 6481, "unvalidatedSentences": 5631, "clips": 18777, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.21, "thirties": 0.05, "teens": 0.21, "fourties": 0.02, "fifties": 0, "sixties": 0.27, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male_masculine": 0.5, "female_feminine": 0.19, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 18450, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 327, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 358, "size": 433817275, "checksum": "5cec4e5a6414fd9a9e113058dd82bf63894458056a1b237ff93e68d437d6e349", "avgDurationSecs": 3.968, "validDurationSecs": 22897.752, "totalHrs": 20.69, "validHrs": 6.36 }, "vot": { "buckets": { "dev": 0, "test": 7, "train": 96, "validated": 103, "invalidated": 324, "other": 0 }, "duration": 1029972, "reportedSentences": 0, "validatedSentences": 5437, "unvalidatedSentences": 1, "clips": 427, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.74, "thirties": 0, "teens": 0.01, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.25, "male_masculine": 0.75, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, 
"sentence_domain": { "": 427, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 6, "size": 8186642, "checksum": "7f1a39bf3303cd72edada530463e85327a64fe007d7c4200783705bce6e349fb", "avgDurationSecs": 2.412, "validDurationSecs": 248.448, "totalHrs": 0.28, "validHrs": 0.06 }, "wbl": { "buckets": { "dev": 1110, "test": 1128, "train": 2542, "validated": 6464, "invalidated": 89, "other": 1599 }, "duration": 55226772, "reportedSentences": 1, "validatedSentences": 5493, "unvalidatedSentences": 113, "clips": 8152, "splits": { "accent": {}, "age": { "": 0.21, "twenties": 0, "thirties": 0.01, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0.12, "seventies": 0.66, "eighties": 0, "nineties": 0 }, "gender": { "": 0.88, "male_masculine": 0.12, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8149, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 3, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 13, "size": 320440417, "checksum": "8287e2366690623e1fbf11f7fb08dbfba342670bcd27bdb4fc9124e0deedc6dc", "avgDurationSecs": 6.775, "validDurationSecs": 43791.199, "totalHrs": 15.34, "validHrs": 12.16 }, "wes": { "buckets": { "dev": 324, "test": 323, "train": 351, "validated": 9003, "invalidated": 51, "other": 363 }, "duration": 38779632, "reportedSentences": 0, "validatedSentences": 999, "unvalidatedSentences": -1, "clips": 9417, "splits": { "accent": {}, "age": { "": 0.94, "twenties": 0.01, "thirties": 0, "teens": 0, "fourties": 0.06, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 
0 }, "gender": { "": 0.94, "male_masculine": 0, "female_feminine": 0.06, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 9417, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 18, "size": 209721832, "checksum": "5a6b9da58fc02bc55099664d051b6705546aeb8c7a75a8e8cbb49d7d8b22c412", "avgDurationSecs": 4.118, "validDurationSecs": 37074.761, "totalHrs": 10.77, "validHrs": 10.29 }, "xh": { "buckets": { "dev": 0, "test": 0, "train": 7, "validated": 7, "invalidated": 0, "other": 23 }, "duration": 179712, "reportedSentences": 0, "validatedSentences": 1476, "unvalidatedSentences": 14922, "clips": 30, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 30, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 4, "size": 2197661, "checksum": "b89441a0ad5a6ce62e5946ba732046c73a28e0016ecd6f0fdffd8d8d084179a9", "avgDurationSecs": 5.99, "validDurationSecs": 41.933, "totalHrs": 0.04, "validHrs": 0.01 }, "xhe": { "buckets": { "dev": 0, "test": 1443, "train": 3542, "validated": 11429, "invalidated": 160, "other": 14 }, "duration": 37269792, "reportedSentences": 1, "validatedSentences": 5040, "unvalidatedSentences": 2, "clips": 11603, "splits": { "accent": {}, "age": { "": 0.15, "twenties": 0.85, "thirties": 0, 
"teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 11603, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 11, "size": 217963117, "checksum": "2bdafd7818390e63a26ce154b65b8ae86690032e9e7d3f31c549937a74ad480d", "avgDurationSecs": 3.212, "validDurationSecs": 36710.89, "totalHrs": 10.35, "validHrs": 10.19 }, "xka": { "buckets": { "dev": 0, "test": 399, "train": 1571, "validated": 11163, "invalidated": 66, "other": 583 }, "duration": 38274120, "reportedSentences": 4, "validatedSentences": 1980, "unvalidatedSentences": 46, "clips": 11853, "splits": { "accent": {}, "age": { "": 0.01, "twenties": 0.74, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0.25, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 11827, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 26, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 9, "size": 203464678, "checksum": "314b875c765cadc262ef9eff28afa5e059ea9f570321cfb7136eeba7dd87cdea", "avgDurationSecs": 3.229, "validDurationSecs": 36046.064, "totalHrs": 10.63, "validHrs": 10.01 }, "xmf": { "buckets": { "dev": 327, "test": 327, "train": 334, "validated": 6750, "invalidated": 582, "other": 2 }, "duration": 45345492, "reportedSentences": 20, "validatedSentences": 
1004, "unvalidatedSentences": 24, "clips": 7334, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0.32, "thirties": 0.31, "teens": 0.02, "fourties": 0.13, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.45, "male_masculine": 0.05, "female_feminine": 0.49, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7296, "agriculture_food": 9, "automotive_transport": 19, "finance": 0, "service_retail": 0, "general": 19, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 84, "size": 265905562, "checksum": "36b049902dbc2cbbc9c82fa337fb4cf01503796b79f02b24a8a1782d3a96c6bc", "avgDurationSecs": 6.183, "validDurationSecs": 41734.67, "totalHrs": 12.59, "validHrs": 11.59 }, "yaq": { "buckets": { "dev": 131, "test": 800, "train": 1897, "validated": 6925, "invalidated": 1, "other": 845 }, "duration": 41273208, "reportedSentences": 0, "validatedSentences": 2838, "unvalidatedSentences": -1, "clips": 7771, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.36, "thirties": 0.27, "teens": 0, "fourties": 0.36, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.36, "male_masculine": 0, "female_feminine": 0.64, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7771, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 5, "size": 243176052, "checksum": "d643869828ec0f52846775640a2bdea7d55f7925062358b6bb4cd049b4d508ab", "avgDurationSecs": 5.311, "validDurationSecs": 36779.947, "totalHrs": 11.46, "validHrs": 10.21 }, "yav": { "buckets": { "dev": 274, 
"test": 316, "train": 423, "validated": 4679, "invalidated": 413, "other": 506 }, "duration": 47985372, "reportedSentences": 0, "validatedSentences": 1013, "unvalidatedSentences": -1, "clips": 5598, "splits": { "accent": {}, "age": { "": 0.89, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0.11, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.89, "male_masculine": 0, "female_feminine": 0.11, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5598, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 10, "size": 279292537, "checksum": "1fe9d33639f6bff8f080d8f64c9f7f946778d2857e1887bbad76abcd7757a2ae", "avgDurationSecs": 8.572, "validDurationSecs": 40107.816, "totalHrs": 13.32, "validHrs": 11.14 }, "ydg": { "buckets": { "dev": 0, "test": 314, "train": 1480, "validated": 10526, "invalidated": 10, "other": 567 }, "duration": 39625020, "reportedSentences": 0, "validatedSentences": 1882, "unvalidatedSentences": 88, "clips": 11103, "splits": { "accent": {}, "age": { "": 0.88, "twenties": 0, "thirties": 0.12, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 11103, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 15, "size": 231016202, "checksum": "3c04be36bca9aefd09e4d584f386b14ddbf30e2ce813c9587f6b3e80657325b5", 
"avgDurationSecs": 3.569, "validDurationSecs": 37565.789, "totalHrs": 11, "validHrs": 10.43 }, "yi": { "buckets": { "dev": 381, "test": 444, "train": 521, "validated": 1713, "invalidated": 81, "other": 55 }, "duration": 7268184, "reportedSentences": 12, "validatedSentences": 1548, "unvalidatedSentences": 85, "clips": 1849, "splits": { "accent": {}, "age": { "": 0.76, "twenties": 0.06, "thirties": 0.07, "teens": 0.12, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1835, "agriculture_food": 0, "automotive_transport": 0, "finance": 2, "service_retail": 0, "general": 11, "healthcare": 0, "history_law_government": 1, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 1, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 114, "size": 41017948, "checksum": "fcb793668c70fdde3bcce1a2bcbc717fde39fe2364b15d1c2995f1436865a043", "avgDurationSecs": 3.931, "validDurationSecs": 6733.585, "totalHrs": 2.01, "validHrs": 1.87 }, "yo": { "buckets": { "dev": 914, "test": 1116, "train": 1404, "validated": 3436, "invalidated": 260, "other": 1156 }, "duration": 29313072, "reportedSentences": 21, "validatedSentences": 5419, "unvalidatedSentences": 17, "clips": 4852, "splits": { "accent": {}, "age": { "": 0.29, "twenties": 0.61, "thirties": 0.03, "teens": 0.04, "fourties": 0, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0.3, "female_feminine": 0.37, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 4852, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, 
"technology_robotics": 0 } }, "users": 132, "size": 170838350, "checksum": "f8f2a61935e0014c1a72bb4b3ccd83dcc587df7e498546ad0ef44ca2922f7db8", "avgDurationSecs": 6.041, "validDurationSecs": 20758.391, "totalHrs": 8.14, "validHrs": 5.76 }, "yue": { "buckets": { "dev": 5130, "test": 5130, "train": 7420, "validated": 191222, "invalidated": 8085, "other": 79893 }, "duration": 1107615420, "reportedSentences": 2257, "validatedSentences": 18551, "unvalidatedSentences": 10175, "clips": 279200, "splits": { "accent": {}, "age": { "": 0.19, "twenties": 0.29, "thirties": 0.47, "teens": 0.01, "fourties": 0.04, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2, "male_masculine": 0.17, "female_feminine": 0.62, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 276356, "agriculture_food": 18, "automotive_transport": 51, "finance": 128, "service_retail": 606, "general": 1676, "healthcare": 417, "history_law_government": 67, "language_fundamentals": 0, "media_entertainment": 192, "nature_environment": 188, "news_current_affairs": 114, "technology_robotics": 95 } }, "users": 1169, "size": 6419546560, "checksum": "341b2d6b9c5fd15034ae2d34fe054577420c63262680cae28fdd131d179f409b", "avgDurationSecs": 3.967, "validDurationSecs": 758597.55, "totalHrs": 307.67, "validHrs": 210.72 }, "zgh": { "buckets": { "dev": 297, "test": 230, "train": 855, "validated": 1382, "invalidated": 88, "other": 653 }, "duration": 7376616, "reportedSentences": 6, "validatedSentences": 2737, "unvalidatedSentences": 350, "clips": 2123, "splits": { "accent": {}, "age": { "": 0.1, "twenties": 0.37, "thirties": 0.53, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.13, "male_masculine": 0.84, "female_feminine": 0.03, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2082, "agriculture_food": 0, "automotive_transport": 0, 
"finance": 0, "service_retail": 0, "general": 33, "healthcare": 2, "history_law_government": 0, "language_fundamentals": 5, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 1 } }, "users": 33, "size": 40307982, "checksum": "4675e9da531f088ec983ea5465f85b36c750de424289935a04201bbb40a68423", "avgDurationSecs": 3.475, "validDurationSecs": 4801.923, "totalHrs": 2.04, "validHrs": 1.33 }, "zh-CN": { "buckets": { "dev": 10635, "test": 10635, "train": 29474, "validated": 188599, "invalidated": 59067, "other": 601324 }, "duration": 3855566688, "reportedSentences": 1074, "validatedSentences": 59072, "unvalidatedSentences": 914, "clips": 848990, "splits": { "accent": {}, "age": { "": 0.92, "twenties": 0.05, "thirties": 0.01, "teens": 0.01, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.93, "male_masculine": 0.06, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 847386, "agriculture_food": 18, "automotive_transport": 42, "finance": 24, "service_retail": 41, "general": 532, "healthcare": 65, "history_law_government": 203, "language_fundamentals": 46, "media_entertainment": 721, "nature_environment": 19, "news_current_affairs": 55, "technology_robotics": 100 } }, "users": 7459, "size": 22827110273, "checksum": "7590951b68db1eae81151fc9b53ebdf0ce44349511684056c0f51754ee5f0927", "avgDurationSecs": 4.541, "validDurationSecs": 856495.391, "totalHrs": 1070.99, "validHrs": 237.91 }, "zh-HK": { "buckets": { "dev": 5604, "test": 5604, "train": 8450, "validated": 93976, "invalidated": 4624, "other": 24620 }, "duration": 512256840, "reportedSentences": 824, "validatedSentences": 20144, "unvalidatedSentences": 25, "clips": 123220, "splits": { "accent": {}, "age": { "": 0.38, "twenties": 0.33, "thirties": 0.11, "teens": 0.02, "fourties": 0.12, "fifties": 0.03, "sixties": 0.01, "seventies": 0, "eighties": 0, 
"nineties": 0 }, "gender": { "": 0.37, "male_masculine": 0.41, "female_feminine": 0.22, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 122093, "agriculture_food": 0, "automotive_transport": 26, "finance": 41, "service_retail": 255, "general": 746, "healthcare": 82, "history_law_government": 56, "language_fundamentals": 79, "media_entertainment": 47, "nature_environment": 46, "news_current_affairs": 23, "technology_robotics": 35 } }, "users": 3091, "size": 3645525059, "checksum": "11c3eb93e92a60e6b0bcb513578b1f285dbc2f486317d5da17d0895c65897057", "avgDurationSecs": 4.157, "validDurationSecs": 390682.104, "totalHrs": 142.29, "validHrs": 108.52 }, "zh-TW": { "buckets": { "dev": 5100, "test": 5100, "train": 7356, "validated": 84780, "invalidated": 4898, "other": 50043 }, "duration": 470012134, "reportedSentences": 179, "validatedSentences": 20748, "unvalidatedSentences": 841, "clips": 139721, "splits": { "accent": {}, "age": { "": 0.27, "twenties": 0.3, "thirties": 0.19, "teens": 0.06, "fourties": 0.09, "fifties": 0.09, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.29, "male_masculine": 0.49, "female_feminine": 0.22, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 138740, "agriculture_food": 12, "automotive_transport": 271, "finance": 1, "service_retail": 150, "general": 605, "healthcare": 21, "history_law_government": 164, "language_fundamentals": 8, "media_entertainment": 165, "nature_environment": 11, "news_current_affairs": 42, "technology_robotics": 257 } }, "users": 2290, "size": 3137076161, "checksum": "1956384cc8850938e4fb6f0140409f6472d8a5412b5582184a4674b720b7a40f", "avgDurationSecs": 3.364, "validDurationSecs": 285194.271, "totalHrs": 130.55, "validHrs": 79.22 }, "zoc": { "buckets": { "dev": 483, "test": 474, "train": 524, "validated": 8886, "invalidated": 2, "other": 60 }, "duration": 36533628, "reportedSentences": 0, "validatedSentences": 1481, 
"unvalidatedSentences": 7, "clips": 8948, "splits": { "accent": {}, "age": { "": 0.09, "twenties": 0.38, "thirties": 0.22, "teens": 0, "fourties": 0.31, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.56, "male_masculine": 0, "female_feminine": 0.44, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8948, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 12, "size": 213973409, "checksum": "3c449c5f6a3205a2bc0ae4e225423026817ea3a6e2fcb59b57860cdb43fe8107", "avgDurationSecs": 4.083, "validDurationSecs": 36280.489, "totalHrs": 10.14, "validHrs": 10.07 }, "zu": { "buckets": { "dev": 0, "test": 1, "train": 15, "validated": 16, "invalidated": 9, "other": 178 }, "duration": 1123272, "reportedSentences": 4, "validatedSentences": 1320, "unvalidatedSentences": 19379, "clips": 203, "splits": { "accent": {}, "age": { "": 0.49, "twenties": 0.51, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.49, "male_masculine": 0, "female_feminine": 0.17, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.34 }, "sentence_domain": { "": 203, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 8, "size": 7938592, "checksum": "f7e4f25dffd6583533eaf051dd99985df6c85f66c771da2cef205d0ed86ff29a", "avgDurationSecs": 5.533, "validDurationSecs": 88.534, "totalHrs": 0.31, "validHrs": 0.02 }, "zza": { "buckets": { "dev": 463, "test": 411, "train": 738, 
"validated": 1612, "invalidated": 243, "other": 340 }, "duration": 9010836, "reportedSentences": 2321, "validatedSentences": 5655, "unvalidatedSentences": 972, "clips": 2195, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0, "thirties": 0.65, "teens": 0, "fourties": 0, "fifties": 0.01, "sixties": 0.12, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.71, "male_masculine": 0.27, "female_feminine": 0.02, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1458, "agriculture_food": 3, "automotive_transport": 1, "finance": 1, "service_retail": 9, "general": 673, "healthcare": 16, "history_law_government": 0, "language_fundamentals": 27, "media_entertainment": 2, "nature_environment": 4, "news_current_affairs": 0, "technology_robotics": 2 } }, "users": 20, "size": 52787914, "checksum": "95b64357aac0b9a26b2afd5f285371c9e77c234b7f1a0548fd2945d595460ab3", "avgDurationSecs": 4.105, "validDurationSecs": 6617.525, "totalHrs": 2.5, "validHrs": 1.83 } }, "totalDuration": 129319692655, "totalValidDurationSecs": 88566730, "totalHrs": 35921, "totalValidHrs": 24600 } ================================================ FILE: datasets/scripted-speech/cv-corpus-23.0-delta-2025-09-05.json ================================================ { "locale": { "ab": { "duration": 95292, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 27, "users": 2, "size": 519052, "checksum": "59d052e0dcec3c55f1684b24853b081e91787548326267717824170965fecfb4", "avgDurationSecs": 5.177, "validDurationSecs": 63.97, "totalHrs": 0.03, "validHrs": 0.02 }, "af": { "duration": 553392, "reportedSentences": 0, "validatedSentences": 15, "unvalidatedSentences": -15, "clips": 70, "users": 1, "size": 3155522, "checksum": "5f64c4111802e497891bfb099b084b9128f7ff3e8066fb87bb15c9a34de7fbcf", "avgDurationSecs": 6.073, "validDurationSecs": 232.48, "totalHrs": 0.15, "validHrs": 0.06 }, "am": { "duration": 0, "reportedSentences": 0, 
"validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -1733, "checksum": "05d9fe4f4451a961093e71178564c26dde4a111a026ebd70294e72d6b87f06e7", "avgDurationSecs": 6.297, "validDurationSecs": 75.57, "totalHrs": 0, "validHrs": 0.03 }, "ar": { "duration": 561420, "reportedSentences": 2, "validatedSentences": 0, "unvalidatedSentences": 4, "clips": 122, "users": 14, "size": 3082368, "checksum": "22d224c4f8e8d24f518eb7cb7143f34016e11f7d75384b19041e5c3fe66eaff4", "avgDurationSecs": 4.161, "validDurationSecs": 605.6, "totalHrs": 0.16, "validHrs": 0.17 }, "as": { "duration": 150012, "reportedSentences": 2, "validatedSentences": 7, "unvalidatedSentences": 7, "clips": 25, "users": 0, "size": 893244, "checksum": "5818719cd64da0151645ead75f7e31316136a365482f88f377c0cff4caadaa3c", "avgDurationSecs": 5.897, "validDurationSecs": 95.36, "totalHrs": 0.04, "validHrs": 0.03 }, "ast": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -980, "checksum": "44272c9c6275d7fc6cb9eb90e07a2d521b8c118e4c9e1683a2814a54ee55d7fe", "avgDurationSecs": 4.434, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "az": { "duration": 306468, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 50, "users": 4, "size": 1797631, "checksum": "4f0a9e3b1eef4aeccbf582477b78427a1d1561230a637b81ee57fb0c3ebf4582", "avgDurationSecs": 5.458, "validDurationSecs": 148.63, "totalHrs": 0.09, "validHrs": 0.04 }, "ba": { "duration": 261072, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 57, "users": 1, "size": 1611784, "checksum": "e7c87fe3df3b695bec6aa8dba3fd57c5e680baf50ce61d0d54ff08984130285f", "avgDurationSecs": 4.427, "validDurationSecs": 884.96, "totalHrs": 0.07, "validHrs": 0.24 }, "bas": { "duration": 77688, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 15, "users": 1, "size": 464049, "checksum": 
"b3cd82259bb616e37e69153f9730e5a3f01194098261b71492d1bb2d5ebad6de", "avgDurationSecs": 3.926, "validDurationSecs": 28.58, "totalHrs": 0.02, "validHrs": 0.01 }, "be": { "duration": 20302524, "reportedSentences": 3, "validatedSentences": 3, "unvalidatedSentences": -3, "clips": 3584, "users": 10, "size": 112115354, "checksum": "389eddb83229528b9d958b18453e5aad07bb09fbfee1243f278fb1d34a291ef3", "avgDurationSecs": 4.791, "validDurationSecs": 12373.21, "totalHrs": 5.64, "validHrs": 3.44 }, "bg": { "duration": 226332, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 42, "users": 3, "size": 1335089, "checksum": "4d6ebc09c9e977ff694286d3d7a41b0ae472afe76e37c71fa51170a2f5c16340", "avgDurationSecs": 5.472, "validDurationSecs": 172.26, "totalHrs": 0.07, "validHrs": 0.05 }, "bn": { "duration": 2142036, "reportedSentences": 0, "validatedSentences": 1, "unvalidatedSentences": 0, "clips": 339, "users": 18, "size": 12734230, "checksum": "050cc3d7765c3b2b5021c22102c7d4579dc61257d44bb620b04f3fcae72e289b", "avgDurationSecs": 4.371, "validDurationSecs": 351.41, "totalHrs": 0.6, "validHrs": 0.09 }, "br": { "duration": 4039740, "reportedSentences": 2, "validatedSentences": 167, "unvalidatedSentences": -5, "clips": 1026, "users": 25, "size": 23163417, "checksum": "724240bd5d6d99dd60f7642ee0dc8c48bf066eec0f44c6d4711d73a97e88b3f1", "avgDurationSecs": 3.245, "validDurationSecs": 3912.29, "totalHrs": 1.12, "validHrs": 1.09 }, "ca": { "duration": 105524316, "reportedSentences": 1067, "validatedSentences": 144, "unvalidatedSentences": 1695, "clips": 23817, "users": 76, "size": 619003928, "checksum": "52e902e6d0fe22eb6bac3966683dc7fe9f650d3e35af2e0be25047fa264531d9", "avgDurationSecs": 5.205, "validDurationSecs": 43320.3, "totalHrs": 29.31, "validHrs": 12.03 }, "ckb": { "duration": 1222920, "reportedSentences": 4, "validatedSentences": 11, "unvalidatedSentences": 0, "clips": 297, "users": 23, "size": 6831103, "checksum": 
"9c8d043e11206cab5958f605ea7c175b283da58db2d510698278d088da10d54c", "avgDurationSecs": 4.094, "validDurationSecs": 537.18, "totalHrs": 0.34, "validHrs": 0.15 }, "cnh": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 1663, "checksum": "4bc9652d94b2d69dcb13cf7bd11088c9bc8c13ab8587684d6a72871ce7c2f6fd", "avgDurationSecs": 3.516, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "cs": { "duration": 1951740, "reportedSentences": 3, "validatedSentences": 46, "unvalidatedSentences": -46, "clips": 489, "users": 12, "size": 11207404, "checksum": "07b7b40ad5cd78f7b65f9596758776d76299d50ee6a5efa857bc7851fd7967fc", "avgDurationSecs": 4.461, "validDurationSecs": 783.45, "totalHrs": 0.54, "validHrs": 0.22 }, "cv": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 2282, "checksum": "ded895e904272a247cb6162e520873814924e91a4e251255e08ff4b01f785443", "avgDurationSecs": 5.042, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "cy": { "duration": 229680, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 36, "users": 6, "size": 1297246, "checksum": "8d12d0d874f126560d2877038dcdeff7961eb28b4b02cb1ab7570318005a7a63", "avgDurationSecs": 4.915, "validDurationSecs": 178.92, "totalHrs": 0.06, "validHrs": 0.05 }, "da": { "duration": 363564, "reportedSentences": 0, "validatedSentences": 28, "unvalidatedSentences": -28, "clips": 90, "users": 1, "size": 2130972, "checksum": "b81caa13e72bf4a417b53a594bf5042a8f10b0c5e499422359bbd9f31f22083e", "avgDurationSecs": 4.063, "validDurationSecs": 30.77, "totalHrs": 0.1, "validHrs": 0.01 }, "dav": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 7448, "checksum": "0ba35ecbe65a58b7b771ff0e4eabf694211472ce8474771522eebff3713aa862", "avgDurationSecs": 4.059, "validDurationSecs": 16.24, 
"totalHrs": 0, "validHrs": 0 }, "de": { "duration": 5963544, "reportedSentences": 23, "validatedSentences": 38, "unvalidatedSentences": 37, "clips": 980, "users": 57, "size": 30726352, "checksum": "2bfd93244cd180ddd71dbb64baa6e29dc8cd52280c70dafd11f9ff0e12f80a4b", "avgDurationSecs": 5.262, "validDurationSecs": 7972.77, "totalHrs": 1.66, "validHrs": 2.21 }, "dv": { "duration": 268524, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 61, "users": 3, "size": 1573063, "checksum": "36caf9ae414318e1d96bea52d361be70fc43fc8b665a9e031a50ecf88e8a0661", "avgDurationSecs": 5.12, "validDurationSecs": 213.81, "totalHrs": 0.08, "validHrs": 0.06 }, "dyu": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 181, "checksum": "1b182f55e456aa1c16e57ce9a72387b6c9002956df20cdac02fff3927e873724", "avgDurationSecs": 6.309, "validDurationSecs": 12.62, "totalHrs": 0, "validHrs": 0 }, "el": { "duration": 238500, "reportedSentences": 0, "validatedSentences": 28, "unvalidatedSentences": -28, "clips": 50, "users": 1, "size": 1406642, "checksum": "dd1a96f3ad2e17a1fdec55cb6d458977842df381a571837856f90b7ef7efc048", "avgDurationSecs": 4.152, "validDurationSecs": 255.17, "totalHrs": 0.07, "validHrs": 0.07 }, "en": { "duration": 56070144, "reportedSentences": 130, "validatedSentences": 651, "unvalidatedSentences": -58, "clips": 8541, "users": 511, "size": 327481639, "checksum": "85a16b871613c38fccaf76532018c94942fe8435f9bcc11d76d77508daf52961", "avgDurationSecs": 5.287, "validDurationSecs": 42629.79, "totalHrs": 15.57, "validHrs": 11.84 }, "eo": { "duration": 16441488, "reportedSentences": 32, "validatedSentences": 336, "unvalidatedSentences": -336, "clips": 2278, "users": 24, "size": 96557972, "checksum": "d1627f27610f7505bc5dd195422eafc516828bca2bad484a9156bdbae2c5dee9", "avgDurationSecs": 6.073, "validDurationSecs": 4377.09, "totalHrs": 4.56, "validHrs": 1.22 }, "es": { "duration": 
18091548, "reportedSentences": 11, "validatedSentences": 215, "unvalidatedSentences": -196, "clips": 3343, "users": 42, "size": 105691322, "checksum": "4396c00fba2cf1830a8744b9f1d33fc2191f41d6e36371e0c4eff78a1d7ba69f", "avgDurationSecs": 4.879, "validDurationSecs": 11727.47, "totalHrs": 5.03, "validHrs": 3.26 }, "et": { "duration": 3535452, "reportedSentences": 2, "validatedSentences": 71, "unvalidatedSentences": 433, "clips": 595, "users": 27, "size": 20550933, "checksum": "db6390a8173aceb0a34f4f6518340b23dd5c66b651f7b885f4a4a3381fded78c", "avgDurationSecs": 6.731, "validDurationSecs": 3583.24, "totalHrs": 0.98, "validHrs": 1 }, "eu": { "duration": 3698352, "reportedSentences": 5, "validatedSentences": 23, "unvalidatedSentences": 4796, "clips": 447, "users": 5, "size": 21895556, "checksum": "4d28e418a096587fb0807ea03f12d4bdb0eb0c9e26b4d482af69795411e3c1d1", "avgDurationSecs": 5.505, "validDurationSecs": 63210.15, "totalHrs": 1.02, "validHrs": 17.56 }, "fa": { "duration": 1819908, "reportedSentences": 24, "validatedSentences": 49, "unvalidatedSentences": 259, "clips": 511, "users": 16, "size": 10557512, "checksum": "68a65a5013d1860c120d8a8f0e1ffa899d61a982f5e77bb4d9d23800b41b63e2", "avgDurationSecs": 3.96, "validDurationSecs": 1224.34, "totalHrs": 0.5, "validHrs": 0.34 }, "fi": { "duration": 875448, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 13, "clips": 179, "users": 5, "size": 5154535, "checksum": "1d0235ff7e95ecc7b24568db63d4acaccc2f8ce96cb12fb68ba449cba04b17bb", "avgDurationSecs": 4.626, "validDurationSecs": 397.89, "totalHrs": 0.24, "validHrs": 0.11 }, "fr": { "duration": 8421084, "reportedSentences": 37, "validatedSentences": 164, "unvalidatedSentences": -103, "clips": 1506, "users": 72, "size": 43513557, "checksum": "88717f4acd399b3535525c447c2f8187ff4d782b25b008f3e3b405fd5f974d1c", "avgDurationSecs": 5.035, "validDurationSecs": 29271.34, "totalHrs": 2.34, "validHrs": 8.13 }, "fy-NL": { "duration": 1571040, "reportedSentences": 
0, "validatedSentences": 551, "unvalidatedSentences": -4, "clips": 302, "users": 22, "size": 9132808, "checksum": "167dd4d4415ecff2ee15a1047a888bd641b4d4aaccafccb11d30d96c047e50dd", "avgDurationSecs": 4.855, "validDurationSecs": 393.58, "totalHrs": 0.44, "validHrs": 0.11 }, "ga-IE": { "duration": 4221432, "reportedSentences": 1, "validatedSentences": 2761, "unvalidatedSentences": -6, "clips": 1240, "users": 5, "size": 24875619, "checksum": "18866850e9af0279d3c5c2a345be7a49e8675697a264d210d4e2bc291017b897", "avgDurationSecs": 3.56, "validDurationSecs": 3989.32, "totalHrs": 1.17, "validHrs": 1.11 }, "gl": { "duration": 21982824, "reportedSentences": 12, "validatedSentences": 297, "unvalidatedSentences": -247, "clips": 4564, "users": 152, "size": 125876184, "checksum": "ccab851fda33b47318d57ac9af8205ed60b894eff5a06b5c01b3bc9e0e58a2ef", "avgDurationSecs": 4.998, "validDurationSecs": 252834.64, "totalHrs": 6.11, "validHrs": 70.23 }, "gn": { "duration": 9850140, "reportedSentences": 16, "validatedSentences": 0, "unvalidatedSentences": 3, "clips": 1987, "users": 43, "size": 56522489, "checksum": "55bdd9fa1fea2fd872e2fe20450b47ec269173785a1b6b88d9958977c2350ec3", "avgDurationSecs": 4.601, "validDurationSecs": 2041.73, "totalHrs": 2.74, "validHrs": 0.56 }, "ha": { "duration": 31356, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 1, "clips": 5, "users": 1, "size": 199310, "checksum": "aec59743fa075c60a616d9b21d36a84dfdb1159f7ab32a34634dcfe1788d2be1", "avgDurationSecs": 4.356, "validDurationSecs": 33.7, "totalHrs": 0, "validHrs": 0.01 }, "he": { "duration": 884736, "reportedSentences": 10, "validatedSentences": 68, "unvalidatedSentences": -67, "clips": 159, "users": 1, "size": 5220344, "checksum": "8ed51adb92edcdb1fc2fe6f36533284c0b2abe851e88e8789427c191473cb476", "avgDurationSecs": 4.593, "validDurationSecs": 2820.93, "totalHrs": 0.25, "validHrs": 0.78 }, "hi": { "duration": 185940, "reportedSentences": 0, "validatedSentences": 0, 
"unvalidatedSentences": 1, "clips": 28, "users": 6, "size": 1109438, "checksum": "7bbdd04b8e79fd11b702015a11fb01f672edd8ab4fb1828c51d037b7a7a4bfb0", "avgDurationSecs": 5.018, "validDurationSecs": 75.74, "totalHrs": 0.06, "validHrs": 0.02 }, "hsb": { "duration": 487368, "reportedSentences": 0, "validatedSentences": 3, "unvalidatedSentences": -3, "clips": 65, "users": 4, "size": 2875616, "checksum": "5e34adbb89c2f7e96972b4b98fe3bbff2ac3fd1710d84d7443b9666ec0a723bc", "avgDurationSecs": 6.305, "validDurationSecs": 172.37, "totalHrs": 0.14, "validHrs": 0.04 }, "ht": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 170, "checksum": "8bd055f142e4bb4600c305033320aef3e6a26b32d8c23761b0a3ef841af18f78", "avgDurationSecs": 5.566, "validDurationSecs": 38.96, "totalHrs": 0, "validHrs": 0.01 }, "hu": { "duration": 2256120, "reportedSentences": 0, "validatedSentences": 112, "unvalidatedSentences": -109, "clips": 413, "users": 9, "size": 13055516, "checksum": "cc00c31d4fea341f0afdad1ed8ebb5a7c5e843351fe8a197e7ba521c9924eae4", "avgDurationSecs": 5.539, "validDurationSecs": 3190.13, "totalHrs": 0.63, "validHrs": 0.89 }, "hy-AM": { "duration": 32292, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 5, "users": 1, "size": 215757, "checksum": "84c5be2ea4462323c0a40234fe32f680b467090b79b35603ba9cd86f892d1a73", "avgDurationSecs": 5.392, "validDurationSecs": 89.28, "totalHrs": 0.01, "validHrs": 0.02 }, "ia": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 21, "clips": 0, "users": 0, "size": -1063, "checksum": "2e74cf02c0c8dd0f8e1358e1865bb3e8a0e9718b13de5deb2ffb3312b2becb2e", "avgDurationSecs": 4.202, "validDurationSecs": 12.6, "totalHrs": 0, "validHrs": 0.01 }, "id": { "duration": 142704, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 1, "clips": 30, "users": 6, "size": 780713, "checksum": 
"174b2fda9f7415a09ecb0489ef90ded0ad10eb11aed17c10ae3bebb3546e7c01", "avgDurationSecs": 4.003, "validDurationSecs": 491.83, "totalHrs": 0.04, "validHrs": 0.14 }, "ig": { "duration": 127800, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 21, "users": 2, "size": 712815, "checksum": "345de1f833826b94f5cb5da053db3258b19ed76b892028f8525fe4ebeeb695a9", "avgDurationSecs": 5.466, "validDurationSecs": 0.04, "totalHrs": 0.04, "validHrs": 0 }, "is": { "duration": 2194056, "reportedSentences": 0, "validatedSentences": 90, "unvalidatedSentences": -90, "clips": 349, "users": 2, "size": 12949512, "checksum": "5dc65dc9e351740d70733459457aede0c450aecd1e571eac4ab8edecfd97c42b", "avgDurationSecs": 6.425, "validDurationSecs": 410.88, "totalHrs": 0.61, "validHrs": 0.12 }, "it": { "duration": 6078420, "reportedSentences": 0, "validatedSentences": 88, "unvalidatedSentences": -70, "clips": 986, "users": 14, "size": 35491841, "checksum": "5116fbc5b6500272bb65428798c18bcfb3df01b2f8c0c5235ec7d442fd272043", "avgDurationSecs": 5.427, "validDurationSecs": 1506.79, "totalHrs": 1.69, "validHrs": 0.42 }, "ja": { "duration": 2982636, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 4399, "clips": 544, "users": 13, "size": 17843021, "checksum": "5b4f10d4614f601fc62b064ec03f539a26e99bd271dffe7bb820e1c3f87cf404", "avgDurationSecs": 4.543, "validDurationSecs": 1382.48, "totalHrs": 0.83, "validHrs": 0.38 }, "ka": { "duration": 32878440, "reportedSentences": 4, "validatedSentences": 24, "unvalidatedSentences": -21, "clips": 6900, "users": 51, "size": 192045934, "checksum": "f893f73a5478ae787037bc23e18f68d749e8632286266108bbefedbb982a26e3", "avgDurationSecs": 5.202, "validDurationSecs": 2177.06, "totalHrs": 9.14, "validHrs": 0.6 }, "kab": { "duration": 786708, "reportedSentences": 7, "validatedSentences": 9, "unvalidatedSentences": 3, "clips": 194, "users": 5, "size": 4845518, "checksum": 
"86b4ca14130f8016f3ef679bb8353469cd0dc1f457c5a53ba8e30680e0023824", "avgDurationSecs": 3.342, "validDurationSecs": 785.55, "totalHrs": 0.21, "validHrs": 0.22 }, "kk": { "duration": 493668, "reportedSentences": 0, "validatedSentences": 1, "unvalidatedSentences": -1, "clips": 100, "users": 9, "size": 2899048, "checksum": "1f33391cf774bbc0c65311e1c5f3d84c18ac1e746e934cb05fc8131df32d060c", "avgDurationSecs": 4.932, "validDurationSecs": 281.42, "totalHrs": 0.14, "validHrs": 0.08 }, "kln": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 38825, "checksum": "44e05d757e0a27337a6a9d5fc7b31dee6d010925567d44bfb383ce2e058a2c56", "avgDurationSecs": 4.705, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "kmr": { "duration": 8987544, "reportedSentences": 4, "validatedSentences": 181, "unvalidatedSentences": -136, "clips": 1352, "users": 87, "size": 50084116, "checksum": "af479a593a4e2d31436226c3c12221e6df2d24765b64378af7a3c8c375a906db", "avgDurationSecs": 4.135, "validDurationSecs": 9679, "totalHrs": 2.49, "validHrs": 2.68 }, "ko": { "duration": 231372, "reportedSentences": 0, "validatedSentences": 33, "unvalidatedSentences": -33, "clips": 35, "users": 6, "size": 1355473, "checksum": "18e96e0def09d688c9360111a0cd3bdf3cbae5eb34122ceebeae2c1883ba9e3f", "avgDurationSecs": 5.35, "validDurationSecs": 113.04, "totalHrs": 0.07, "validHrs": 0.03 }, "ky": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -5751, "checksum": "3fb1badd4e9e99671a4c519710ad673c4a24ce53a5a94d7daca09a9f77042101", "avgDurationSecs": 4.553, "validDurationSecs": 18.21, "totalHrs": 0, "validHrs": 0.01 }, "lg": { "duration": 28332, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 5, "users": 0, "size": 162859, "checksum": "f0182a6bd48dabc469f149c58aa1ce1e57c3f613384dafcd6cf6569b40c87196", "avgDurationSecs": 5.784, 
"validDurationSecs": -0.46, "totalHrs": 0.01, "validHrs": 0 }, "lij": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -13516, "checksum": "d2e89335cb9d9bdfa0a423835fb1e5471f2275e7d627e02480c73d8086838031", "avgDurationSecs": 3.923, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "lo": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 148, "checksum": "e17186c2d50931006209e7145265f1c663b2b71df246cbb69518afda773cfbef", "avgDurationSecs": 6.532, "validDurationSecs": 26.13, "totalHrs": 0, "validHrs": 0.01 }, "lt": { "duration": 10654740, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 2157, "users": 3, "size": 62735634, "checksum": "ebcedc73729600609da671de4d429ac1de34abe5559b0f1664c5ec2bf382feac", "avgDurationSecs": 5.126, "validDurationSecs": 2697.02, "totalHrs": 2.96, "validHrs": 0.75 }, "ltg": { "duration": 2506284, "reportedSentences": 4, "validatedSentences": 0, "unvalidatedSentences": 4, "clips": 387, "users": 4, "size": 14819506, "checksum": "f021bd4c9a14e3d924baa1aa37a5e609b1aea91c8d50be6dd0ed8ea70589fecd", "avgDurationSecs": 4.78, "validDurationSecs": 1390.73, "totalHrs": 0.7, "validHrs": 0.39 }, "luo": { "duration": 0, "reportedSentences": 0, "validatedSentences": 9986, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 691748, "checksum": "ce0ca491218e0dff5ce65c8be7a842f0a2ed836ec4e173955ada1be2e8ae42e4", "avgDurationSecs": 5.205, "validDurationSecs": 5.2, "totalHrs": 0, "validHrs": 0.01 }, "lv": { "duration": 5650488, "reportedSentences": 0, "validatedSentences": 115, "unvalidatedSentences": -89, "clips": 767, "users": 34, "size": 31212930, "checksum": "81fdd0d48539f1c39dc786ad9ab39c3b185fb896850ae65344d6d87ec7e66d18", "avgDurationSecs": 4.804, "validDurationSecs": 3704.23, "totalHrs": 1.57, "validHrs": 1.03 }, "mdf": { "duration": 0, "reportedSentences": 
0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 157, "checksum": "94cb754fce2fb2e832470f00ff31bd74ed9d171e723d3dc6866e06ccb24ece99", "avgDurationSecs": 5.25, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "mhr": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 52037, "checksum": "9c8ff932e077aee5381373db7c2302dcdb313cff7e91643c8ab5927d32f215c2", "avgDurationSecs": 4.623, "validDurationSecs": 4.62, "totalHrs": 0, "validHrs": 0 }, "mk": { "duration": 196632, "reportedSentences": 0, "validatedSentences": 18, "unvalidatedSentences": -18, "clips": 40, "users": 2, "size": 1149913, "checksum": "6c2a3f3104b868b21d6eeb00ef9c18752fd8a5d5ede8c1f7f955064fd7e56656", "avgDurationSecs": 4.809, "validDurationSecs": 1218.39, "totalHrs": 0.05, "validHrs": 0.34 }, "ml": { "duration": 50976, "reportedSentences": 0, "validatedSentences": 2, "unvalidatedSentences": 0, "clips": 8, "users": 2, "size": 246325, "checksum": "789169ac172baef4ccb24bbd00d5fdd41f423b15a47583004b3971af918b50ea", "avgDurationSecs": 4.221, "validDurationSecs": 120.19, "totalHrs": 0.02, "validHrs": 0.03 }, "mn": { "duration": 12888, "reportedSentences": 0, "validatedSentences": 3, "unvalidatedSentences": 0, "clips": 2, "users": 2, "size": 98202, "checksum": "f2631343a73e23233d0f4f642504fbd220eead41c51464701ec4a1e0133daa96", "avgDurationSecs": 5.254, "validDurationSecs": 69.14, "totalHrs": 0, "validHrs": 0.02 }, "mr": { "duration": 53208, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 8, "users": 2, "size": 307386, "checksum": "afd61b4cc8140fb6f2095cd4a67c64082235b55e14c8e6c8f36eed5d38fe1e94", "avgDurationSecs": 6.2, "validDurationSecs": 21.07, "totalHrs": 0.02, "validHrs": 0 }, "mrj": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -4911, "checksum": 
"0ef9c69d3ee8dc9b34818617a50acac4514ea7164aaf388f0831f327f446172d", "avgDurationSecs": 4.193, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "mt": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 8771, "checksum": "a76ddf73b9c382a65f423ee3bb04c9ae7bdc9dc36c9bb49a36a6fcb0bda99a41", "avgDurationSecs": 4.754, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "myv": { "duration": 105552, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 15, "users": 0, "size": 624838, "checksum": "17f0dd2ad1afa8e62972ed372f428b76b6615b024a04c0a4c1c0e277f09e9a91", "avgDurationSecs": 5.784, "validDurationSecs": 24.36, "totalHrs": 0.03, "validHrs": 0 }, "nan-tw": { "duration": 299268, "reportedSentences": 0, "validatedSentences": 1286, "unvalidatedSentences": 294, "clips": 120, "users": 2, "size": 1837044, "checksum": "4e98f2845fdcd1b201fde448f42297fecaaea21988244c007487eef36191696e", "avgDurationSecs": 2.641, "validDurationSecs": 755.08, "totalHrs": 0.08, "validHrs": 0.21 }, "nb-NO": { "duration": 6114888, "reportedSentences": 1, "validatedSentences": 3, "unvalidatedSentences": -3, "clips": 1491, "users": 0, "size": 35013893, "checksum": "33911867e0819f597c942cdd0bc215d8ec39664c2c534e7a50ea49944cceef20", "avgDurationSecs": 4.134, "validDurationSecs": 5259.01, "totalHrs": 1.7, "validHrs": 1.46 }, "ne-NP": { "duration": 249948, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 50, "users": 2, "size": 1475634, "checksum": "7978ccbfb6bacaf77db6d6551a33d41df00ef0aa00e0ac2e94735c681e98c68a", "avgDurationSecs": 4.108, "validDurationSecs": 60.71, "totalHrs": 0.07, "validHrs": 0.02 }, "nhi": { "duration": 23796, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 5, "users": 1, "size": 140193, "checksum": "ea94956581c357a3fdb07e992a602eab52fa2df539d296b6552262a08e33db64", "avgDurationSecs": 5.081, 
"validDurationSecs": 10.02, "totalHrs": 0.01, "validHrs": 0 }, "nl": { "duration": 20970756, "reportedSentences": 11, "validatedSentences": 775, "unvalidatedSentences": 8944, "clips": 4798, "users": 13, "size": 123354019, "checksum": "28960ecc82a25ea90cd32da233772f5fe4ff791157d7ad62f97188d0271c00c4", "avgDurationSecs": 4.369, "validDurationSecs": 11475.68, "totalHrs": 5.82, "validHrs": 3.19 }, "nn-NO": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 16, "checksum": "07a505a003f6db258a60a9b5ce0ae41b97f57cd936987480f41a47555db3e29f", "avgDurationSecs": 4.387, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "nr": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -1, "checksum": "d3a5f272634eb6d4a0d75d2b5f2d0324b5f66b3d716859c79e439848f8bc92da", "avgDurationSecs": 9.738, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "nso": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -17, "checksum": "97e30787ba685a7fbc30bfa63d5eb031179587380508900f4ca4338ad137d7fa", "avgDurationSecs": 4.63, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "oc": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 12426, "checksum": "ec00fbb148be60227559200557b4aa763a4ec3c1686b55c9fdd5078c8bf28f9c", "avgDurationSecs": 4.871, "validDurationSecs": 43.84, "totalHrs": 0, "validHrs": 0.01 }, "or": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -19171, "checksum": "f5172dfb69883decd6298ed0e35161ecd8a394382ac4ff8173d54fe901baa77f", "avgDurationSecs": 5.575, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "os": { "duration": 108684, "reportedSentences": 1, "validatedSentences": 0, 
"unvalidatedSentences": 33, "clips": 19, "users": 0, "size": 644253, "checksum": "cc31226084bef93c50dce6c63387258d9b86c70682269b7b37193591ba505699", "avgDurationSecs": 5.521, "validDurationSecs": 108.55, "totalHrs": 0.03, "validHrs": 0.03 }, "pa-IN": { "duration": 1661004, "reportedSentences": 0, "validatedSentences": 2, "unvalidatedSentences": 0, "clips": 146, "users": 4, "size": 9788160, "checksum": "485ad9d6519fc88cd877f4adeb9423d1dfa1a3a00bc2f6695880e9bd454aca38", "avgDurationSecs": 5.112, "validDurationSecs": 666.07, "totalHrs": 0.47, "validHrs": 0.18 }, "pl": { "duration": 3505140, "reportedSentences": 4, "validatedSentences": 107, "unvalidatedSentences": -64, "clips": 619, "users": 17, "size": 20602118, "checksum": "5a4074abcbc9b704a43a5b9cb242ffa1c294125e1b24898e786c961ae8155d65", "avgDurationSecs": 4.572, "validDurationSecs": 4024.42, "totalHrs": 0.97, "validHrs": 1.12 }, "ps": { "duration": 8422776, "reportedSentences": 4, "validatedSentences": 789, "unvalidatedSentences": -748, "clips": 1617, "users": 20, "size": 46725466, "checksum": "75a24aab8d2735588e058c835ae107e3c8ff1e41ba4cd4d93a01c49fc9eef340", "avgDurationSecs": 4.912, "validDurationSecs": 3177.92, "totalHrs": 2.34, "validHrs": 0.88 }, "pt": { "duration": 7896888, "reportedSentences": 3, "validatedSentences": 7, "unvalidatedSentences": 50, "clips": 1534, "users": 28, "size": 46134088, "checksum": "3076e9686b49a833bf16225a82d136af838ac483987cd3a5d58b49af7fd99449", "avgDurationSecs": 4.187, "validDurationSecs": 3946.75, "totalHrs": 2.19, "validHrs": 1.09 }, "quy": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -1253, "checksum": "a6ebc8e82f11faa7bb6425a0a897c9f1ebd3fec47a7fd5e26169649a981691ff", "avgDurationSecs": 8.637, "validDurationSecs": 8.64, "totalHrs": 0, "validHrs": 0.01 }, "rm-sursilv": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 
8321, "checksum": "9e793e4d3bf60f5a46a79429a264a247c5d29634493c9b8841d7862c7174a661", "avgDurationSecs": 5.438, "validDurationSecs": 239.27, "totalHrs": 0, "validHrs": 0.06 }, "rm-vallader": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -2203, "checksum": "25f5913911b9a50d7ac146364674f8c4a6fd99bbb8723b0d49622ca112065524", "avgDurationSecs": 5.832, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ro": { "duration": 88128, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 20, "users": 3, "size": 497978, "checksum": "735e3a1d4730332c05595a9fc1fd05b66c90d35009f6746508309c10402718fa", "avgDurationSecs": 4.041, "validDurationSecs": 185.09, "totalHrs": 0.02, "validHrs": 0.06 }, "ru": { "duration": 6450588, "reportedSentences": 3, "validatedSentences": 11, "unvalidatedSentences": 0, "clips": 1113, "users": 40, "size": 37257983, "checksum": "0e63ddbaf8ee013d863a79df54621e420fd71fbce3ed9d6b78a9217fec76e7bb", "avgDurationSecs": 5.187, "validDurationSecs": 6816.97, "totalHrs": 1.79, "validHrs": 1.9 }, "rup": { "duration": 216864, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 28, "users": 5, "size": 1175958, "checksum": "f4edabc8cc45e33008fd7b1ce4cc61416135c3eb96c7f38a07a73bf67c15985b", "avgDurationSecs": 7.86, "validDurationSecs": 102.18, "totalHrs": 0.06, "validHrs": 0.02 }, "rw": { "duration": 66456, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 13, "users": 2, "size": 287996, "checksum": "191669748fbd3a3d551b58e5cf24de78c808056b7f1767d60f3bc4ed51fc0b03", "avgDurationSecs": 5.007, "validDurationSecs": 1.14, "totalHrs": 0.02, "validHrs": 0 }, "sah": { "duration": 1019628, "reportedSentences": 6, "validatedSentences": 12, "unvalidatedSentences": -12, "clips": 164, "users": 1, "size": 6003943, "checksum": "36c1525dc102a67bc9777236cabc82d7c6c415c1fd73dad444d6f71648e16ffe", 
"avgDurationSecs": 6.335, "validDurationSecs": 961.98, "totalHrs": 0.28, "validHrs": 0.27 }, "sat": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 220, "checksum": "dc8c771a273e472ee9cd1d0a2d9d3f3c94b8db44f67c39db1d67f5f4819e76e1", "avgDurationSecs": 4.478, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "sc": { "duration": 58932, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 10, "users": 2, "size": 294606, "checksum": "625d478be019ae136593ec8a0ecf7b125c471736afded5fcc65f63e65a32ccf2", "avgDurationSecs": 4.697, "validDurationSecs": 19.19, "totalHrs": 0.02, "validHrs": 0 }, "sd": { "duration": 19745712, "reportedSentences": 4, "validatedSentences": 1, "unvalidatedSentences": -1, "clips": 5152, "users": 3, "size": 111540727, "checksum": "e968c076b37ebc0bd09570698f02a3060f480d928eb29a7988dfbdeeadb69ec6", "avgDurationSecs": 4.131, "validDurationSecs": -2.62, "totalHrs": 5.48, "validHrs": 0 }, "sk": { "duration": 2523528, "reportedSentences": 1, "validatedSentences": 2407, "unvalidatedSentences": -2407, "clips": 541, "users": 1, "size": 14851082, "checksum": "49bf35f7a32b28f17c467469eae69345e7c95448607b036360022daa9ee41a2e", "avgDurationSecs": 4.268, "validDurationSecs": 1769.05, "totalHrs": 0.7, "validHrs": 0.49 }, "skr": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 214, "checksum": "1e50951bac423fe3900b54da61ba3724a76ad41826cb61228dea0002ba00a82f", "avgDurationSecs": 4.164, "validDurationSecs": 4.16, "totalHrs": 0, "validHrs": 0 }, "sl": { "duration": 39060, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 10, "users": 2, "size": 229506, "checksum": "2963df5019ecad40492de820f1f60a2a7842aceb51a233e757f0a3685b821443", "avgDurationSecs": 4.001, "validDurationSecs": 135.27, "totalHrs": 0.02, "validHrs": 0.04 }, "sq": { "duration": 
199404, "reportedSentences": 1, "validatedSentences": 1, "unvalidatedSentences": -1, "clips": 31, "users": 3, "size": 1163300, "checksum": "f71f3f65a9aa06d72a588aeb0514be9f48233fab311a3c1839ab03343d769ba8", "avgDurationSecs": 5.107, "validDurationSecs": 136.99, "totalHrs": 0.06, "validHrs": 0.04 }, "sr": { "duration": 73440, "reportedSentences": 4, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 15, "users": 2, "size": 431185, "checksum": "327767a6cf4570b3259a3cfead66657b83cc7b555e883a0a5d11d5fe56d057b8", "avgDurationSecs": 3.24, "validDurationSecs": 37.83, "totalHrs": 0.02, "validHrs": 0.02 }, "st": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -58, "checksum": "fb6fa9e2553e0013441ed66ed552e5b97a85d9ad33bd40828d87e6b8f9f0a2d5", "avgDurationSecs": 3.223, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "sv-SE": { "duration": 115632, "reportedSentences": 0, "validatedSentences": 30, "unvalidatedSentences": -30, "clips": 20, "users": 4, "size": 715058, "checksum": "39ca02d3975a9fa782bb4694bea1ccd1b652c905a48534b56316067814cd5bdf", "avgDurationSecs": 4.023, "validDurationSecs": 955.03, "totalHrs": 0.03, "validHrs": 0.27 }, "sw": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 115608, "checksum": "f9dd7e77ea5802983a18eae79ebefe1cccb9b0176f77aec8138ce604b3b62f31", "avgDurationSecs": 5.387, "validDurationSecs": 3086.6, "totalHrs": 0, "validHrs": 0.85 }, "ta": { "duration": 348588, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 2, "clips": 65, "users": 2, "size": 2019992, "checksum": "1ab75879e1667ac89090edc71120d2c0e8dc3ccd21e9f0e14479cdb20db45803", "avgDurationSecs": 6.197, "validDurationSecs": 75.43, "totalHrs": 0.1, "validHrs": 0.03 }, "te": { "duration": 70884, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 12, "users": 2, "size": 
340349, "checksum": "de11e7cf726336dd6084153a36e10d3de2ebe6640495478c10f655085629e86d", "avgDurationSecs": 4.186, "validDurationSecs": 48.95, "totalHrs": 0.02, "validHrs": 0.02 }, "tg": { "duration": 944928, "reportedSentences": 0, "validatedSentences": 151, "unvalidatedSentences": -151, "clips": 184, "users": 1, "size": 5478707, "checksum": "2b981cab0e140404aa1f05e102d0decfdb9eaad16bd364c3bf79f513bb2977d7", "avgDurationSecs": 4.984, "validDurationSecs": 936.4, "totalHrs": 0.27, "validHrs": 0.26 }, "th": { "duration": 1633104, "reportedSentences": 1, "validatedSentences": 0, "unvalidatedSentences": 3, "clips": 404, "users": 4, "size": 9311866, "checksum": "60141dc0b31a4b26c1b8829a61f2c5f7765f21ce721a8e771f92382878230e7a", "avgDurationSecs": 4.193, "validDurationSecs": 398.76, "totalHrs": 0.45, "validHrs": 0.12 }, "ti": { "duration": 759600, "reportedSentences": 0, "validatedSentences": 35, "unvalidatedSentences": -26, "clips": 138, "users": 2, "size": 4481565, "checksum": "19817f6ac29f509a95ee3f8d16d3bb8529857ae153832983d3d6b8cb476cb84f", "avgDurationSecs": 5.194, "validDurationSecs": 125.35, "totalHrs": 0.22, "validHrs": 0.03 }, "tig": { "duration": 97740, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 12, "users": 3, "size": 567980, "checksum": "83740a7b6f4e209d0b547d45908225d5bec4ab71f10011e48e432545dd294c42", "avgDurationSecs": 6.348, "validDurationSecs": 33.25, "totalHrs": 0.02, "validHrs": 0.01 }, "tk": { "duration": 216108, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 35, "users": 1, "size": 921696, "checksum": "67ef64d068b9d90225d1d2f7c54b66fd3709d7b44e64d7e37ce52e9015e00ae1", "avgDurationSecs": 5.48, "validDurationSecs": 10.21, "totalHrs": 0.06, "validHrs": 0.01 }, "tn": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -8532, "checksum": "186bc8a1b1dd72c2cb5fd28f98b2d32bca8f56bbcfd3314b5a12cc9202d7d049", 
"avgDurationSecs": 4.37, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "tok": { "duration": 8076096, "reportedSentences": 17, "validatedSentences": 33, "unvalidatedSentences": -32, "clips": 1476, "users": 5, "size": 47634443, "checksum": "141e2f3ec45fa0b81b98f2e98b90cddcd2c97e1b29e53a823453ee7020a6ae3a", "avgDurationSecs": 4.303, "validDurationSecs": 3613.99, "totalHrs": 2.24, "validHrs": 1 }, "tr": { "duration": 751860, "reportedSentences": 21, "validatedSentences": 4, "unvalidatedSentences": 25, "clips": 133, "users": 14, "size": 4377973, "checksum": "2c72f91639f400d2123206fbe76fdd6f3a8bfc6e44e69385466802eed7049afc", "avgDurationSecs": 3.831, "validDurationSecs": 554.44, "totalHrs": 0.21, "validHrs": 0.15 }, "ts": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 4, "checksum": "09925723ee487a4276667e268c80a8569147b40de03fbbebf42836051129fec8", "avgDurationSecs": 8.928, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "tt": { "duration": 120852, "reportedSentences": 0, "validatedSentences": 8, "unvalidatedSentences": -8, "clips": 21, "users": 4, "size": 706304, "checksum": "2198e6f30c0dd78063f229bbfd7d6ede37d55ac8c22ae1c2fee650300d10e8f4", "avgDurationSecs": 3.832, "validDurationSecs": 333.97, "totalHrs": 0.03, "validHrs": 0.09 }, "tw": { "duration": 76860, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 15, "users": 2, "size": 453893, "checksum": "00142f76c74a6aa2d5f48c42ad718db2f2a8a8a44e28b5d909ade3f94ce693f4", "avgDurationSecs": 4.352, "validDurationSecs": 26.98, "totalHrs": 0.02, "validHrs": 0.01 }, "ug": { "duration": 56250828, "reportedSentences": 20, "validatedSentences": 579, "unvalidatedSentences": -530, "clips": 7069, "users": 215, "size": 329059887, "checksum": "b56ba336fb9fe67332fead65ca25daf10cc05574b585ef11a8d20c7c0d19c5ba", "avgDurationSecs": 5.931, "validDurationSecs": 88812.33, "totalHrs": 15.62, "validHrs": 24.67 }, 
"uk": { "duration": 1815840, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 9, "clips": 374, "users": 12, "size": 10605118, "checksum": "41a1faa7f637b6269a439500b48c2d42cd3317e8caa12dba6f8b344016450f04", "avgDurationSecs": 4.643, "validDurationSecs": 612.23, "totalHrs": 0.51, "validHrs": 0.17 }, "ur": { "duration": 2265336, "reportedSentences": 11, "validatedSentences": 2, "unvalidatedSentences": -1, "clips": 392, "users": 9, "size": 13299519, "checksum": "f43b1c7e98dc77abdca6657d5223641a28fce8e97a8e6482a34ffa4088d06365", "avgDurationSecs": 4.299, "validDurationSecs": 607.86, "totalHrs": 0.63, "validHrs": 0.17 }, "uz": { "duration": 280080, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 2, "clips": 49, "users": 10, "size": 1466739, "checksum": "b362c5b3a2c1a4ccb59a07856f2512aad1f9775667b3d40655b2400bfc21ac38", "avgDurationSecs": 4.157, "validDurationSecs": 103.79, "totalHrs": 0.08, "validHrs": 0.03 }, "ve": { "duration": 85392, "reportedSentences": 0, "validatedSentences": 8, "unvalidatedSentences": -6, "clips": 15, "users": 1, "size": 441634, "checksum": "5f5b324ca9832de611ab1d06fff55c2cc19703ab753dd3baaa3c95bd5b8f3fda", "avgDurationSecs": 5.654, "validDurationSecs": 0, "totalHrs": 0.03, "validHrs": 0 }, "vi": { "duration": 71928, "reportedSentences": 0, "validatedSentences": 206, "unvalidatedSentences": 167, "clips": 16, "users": 4, "size": 424976, "checksum": "5cec4e5a6414fd9a9e113058dd82bf63894458056a1b237ff93e68d437d6e349", "avgDurationSecs": 3.968, "validDurationSecs": 46.24, "totalHrs": 0.02, "validHrs": 0.02 }, "vot": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 14, "checksum": "7f1a39bf3303cd72edada530463e85327a64fe007d7c4200783705bce6e349fb", "avgDurationSecs": 2.412, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "xh": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, 
"clips": 0, "users": 0, "size": -29, "checksum": "b89441a0ad5a6ce62e5946ba732046c73a28e0016ecd6f0fdffd8d8d084179a9", "avgDurationSecs": 5.99, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "yi": { "duration": 3940776, "reportedSentences": 8, "validatedSentences": 108, "unvalidatedSentences": -7, "clips": 1000, "users": 57, "size": 22235681, "checksum": "fcb793668c70fdde3bcce1a2bcbc717fde39fe2364b15d1c2995f1436865a043", "avgDurationSecs": 3.931, "validDurationSecs": 3802.02, "totalHrs": 1.09, "validHrs": 1.06 }, "yo": { "duration": 32472, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 5, "users": 1, "size": 191829, "checksum": "f8f2a61935e0014c1a72bb4b3ccd83dcc587df7e498546ad0ef44ca2922f7db8", "avgDurationSecs": 6.041, "validDurationSecs": 25.77, "totalHrs": 0.01, "validHrs": 0.01 }, "yue": { "duration": 245484, "reportedSentences": 0, "validatedSentences": 20, "unvalidatedSentences": -17, "clips": 65, "users": 6, "size": 1454375, "checksum": "341b2d6b9c5fd15034ae2d34fe054577420c63262680cae28fdd131d179f409b", "avgDurationSecs": 3.967, "validDurationSecs": 269.22, "totalHrs": 0.07, "validHrs": 0.08 }, "zgh": { "duration": 114732, "reportedSentences": 0, "validatedSentences": 6, "unvalidatedSentences": -5, "clips": 20, "users": 2, "size": 676247, "checksum": "4675e9da531f088ec983ea5465f85b36c750de424289935a04201bbb40a68423", "avgDurationSecs": 3.475, "validDurationSecs": 81.53, "totalHrs": 0.03, "validHrs": 0.02 }, "zh-CN": { "duration": 2188692, "reportedSentences": 27, "validatedSentences": 12, "unvalidatedSentences": 63, "clips": 573, "users": 44, "size": 12925295, "checksum": "7590951b68db1eae81151fc9b53ebdf0ce44349511684056c0f51754ee5f0927", "avgDurationSecs": 4.541, "validDurationSecs": 902.74, "totalHrs": 0.61, "validHrs": 0.25 }, "zh-HK": { "duration": 102348, "reportedSentences": 0, "validatedSentences": 12, "unvalidatedSentences": -10, "clips": 25, "users": 2, "size": 337173, "checksum": 
"11c3eb93e92a60e6b0bcb513578b1f285dbc2f486317d5da17d0895c65897057", "avgDurationSecs": 4.157, "validDurationSecs": 160.93, "totalHrs": 0.03, "validHrs": 0.05 }, "zh-TW": { "duration": 1392048, "reportedSentences": 0, "validatedSentences": 1122, "unvalidatedSentences": 798, "clips": 400, "users": 10, "size": 8000160, "checksum": "1956384cc8850938e4fb6f0140409f6472d8a5412b5582184a4674b720b7a40f", "avgDurationSecs": 3.364, "validDurationSecs": 919.64, "totalHrs": 0.38, "validHrs": 0.26 }, "zu": { "duration": 701136, "reportedSentences": 4, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 83, "users": 2, "size": 4032793, "checksum": "f7e4f25dffd6583533eaf051dd99985df6c85f66c771da2cef205d0ed86ff29a", "avgDurationSecs": 5.533, "validDurationSecs": 42.8, "totalHrs": 0.2, "validHrs": 0.01 }, "zza": { "duration": 28368, "reportedSentences": 0, "validatedSentences": 1, "unvalidatedSentences": 0, "clips": 6, "users": 2, "size": 168801, "checksum": "95b64357aac0b9a26b2afd5f285371c9e77c234b7f1a0548fd2945d595460ab3", "avgDurationSecs": 4.105, "validDurationSecs": 97.13, "totalHrs": 0.01, "validHrs": 0.02 } }, "totalDuration": 7582094676, "totalValidDurationSecs": 7059001, "totalHrs": 2106, "totalValidHrs": 1960 } ================================================ FILE: datasets/scripted-speech/cv-corpus-24.0-2025-12-05.json ================================================ { "locales": { "ab": { "buckets": { "dev": 9329, "test": 9230, "train": 21331, "validated": 42970, "invalidated": 5419, "other": 15860 }, "duration": 332682156, "reportedSentences": 225, "validatedSentences": 1046780, "unvalidatedSentences": 5, "clips": 64249, "splits": { "accent": {}, "age": { "": 0.2, "twenties": 0.17, "thirties": 0.17, "teens": 0.26, "fourties": 0.08, "fifties": 0.06, "sixties": 0.05, "seventies": 0.01, "eighties": 0.01, "nineties": 0 }, "gender": { "": 0.2, "male_masculine": 0.17, "female_feminine": 0.63, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, 
"sentence_domain": { "": 64249, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 427, "size": 1976516939, "checksum": "ecc75443c63048b97d1244b89b45b483ce31c151e035345928a0c9158fd16397", "avgDurationSecs": 5.178, "validDurationSecs": 222499.218, "totalHrs": 92.41, "validHrs": 61.8 }, "abb": { "buckets": { "dev": 293, "test": 313, "train": 390, "validated": 7887, "invalidated": 35, "other": 0 }, "duration": 40474692, "reportedSentences": 0, "validatedSentences": 996, "unvalidatedSentences": -1, "clips": 7922, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7922, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 22, "size": 238179520, "checksum": "7ef1fe7863a8ee8a5e36c3bcf68ecfbdc3a704bae91ceeda28191dcd51c49e4b", "avgDurationSecs": 5.109, "validDurationSecs": 40295.872, "totalHrs": 11.24, "validHrs": 11.19 }, "ady": { "buckets": { "dev": 2991, "test": 2989, "train": 3664, "validated": 40812, "invalidated": 1524, "other": 3124 }, "duration": 227644956, "reportedSentences": 633, "validatedSentences": 10309, "unvalidatedSentences": 2725, "clips": 45460, "splits": { "accent": {}, "age": { "": 0.09, "twenties": 0.31, "thirties": 0.19, "teens": 0.09, "fourties": 0.12, "fifties": 0.14, "sixties": 0.05, "seventies": 0, "eighties": 0, 
"nineties": 0 }, "gender": { "": 0.31, "male_masculine": 0.02, "female_feminine": 0.66, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 362, "agriculture_food": 25, "automotive_transport": 0, "finance": 0, "service_retail": 57, "general": 95250, "healthcare": 47, "history_law_government": 0, "language_fundamentals": 463, "media_entertainment": 87, "nature_environment": 7, "news_current_affairs": 94, "technology_robotics": 0 } }, "users": 137, "size": 1301563097, "checksum": "31759be8f8da07a3fd2ff2799457ce783fca6a88c6f4530364d63ec79fc16759", "avgDurationSecs": 5.008, "validDurationSecs": 204369.686, "totalHrs": 63.23, "validHrs": 56.76 }, "af": { "buckets": { "dev": 118, "test": 129, "train": 181, "validated": 428, "invalidated": 201, "other": 348 }, "duration": 5938740, "reportedSentences": 73, "validatedSentences": 4802, "unvalidatedSentences": 16565, "clips": 977, "splits": { "accent": {}, "age": { "": 0.32, "twenties": 0.31, "thirties": 0, "teens": 0.01, "fourties": 0.21, "fifties": 0.15, "sixties": 0, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.46, "male_masculine": 0.32, "female_feminine": 0.22, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 977, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 65, "size": 35967911, "checksum": "9c95833ed6d565d4872fa551d1dd0ca4d4fd75aabc3c6e9a7355e4d799c234c4", "avgDurationSecs": 6.079, "validDurationSecs": 2601.618, "totalHrs": 1.64, "validHrs": 0.72 }, "ajg": { "buckets": { "dev": 560, "test": 774, "train": 1873, "validated": 19155, "invalidated": 212, "other": 31 }, "duration": 46342044, "reportedSentences": 0, "validatedSentences": 3213, "unvalidatedSentences": -1, "clips": 19398, 
"splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 19398, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 23, "size": 263833585, "checksum": "75c41be24180fa08cdce591497fbeba663931358c56eebaa1c81b58d5a2d79e2", "avgDurationSecs": 2.389, "validDurationSecs": 45761.514, "totalHrs": 12.87, "validHrs": 12.71 }, "am": { "buckets": { "dev": 248, "test": 263, "train": 534, "validated": 1045, "invalidated": 30, "other": 557 }, "duration": 10279728, "reportedSentences": 11, "validatedSentences": 2208, "unvalidatedSentences": 12, "clips": 1632, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0.14, "thirties": 0.61, "teens": 0, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.25, "male_masculine": 0.75, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1632, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 46, "size": 60581563, "checksum": "f5d8274eb80826338f6f1d6d47fd80d4702207fb31276a2afe0dd815e205fc99", "avgDurationSecs": 6.299, "validDurationSecs": 6582.301, "totalHrs": 2.85, "validHrs": 1.82 }, "an": { "buckets": { "dev": 2872, "test": 2920, "train": 4190, "validated": 13277, "invalidated": 758, 
"other": 5 }, "duration": 64215432, "reportedSentences": 8, "validatedSentences": 10454, "unvalidatedSentences": 357, "clips": 14040, "splits": { "accent": {}, "age": { "": 0.19, "twenties": 0.02, "thirties": 0.2, "teens": 0, "fourties": 0.32, "fifties": 0.11, "sixties": 0.12, "seventies": 0.04, "eighties": 0, "nineties": 0 }, "gender": { "": 0.71, "male_masculine": 0.15, "female_feminine": 0.15, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 456, "agriculture_food": 0, "automotive_transport": 0, "finance": 2, "service_retail": 2, "general": 13638, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 53, "size": 374347878, "checksum": "780f87179dd488f46fcd9af2737cd2aed3e662cb678d686243f4146c78320573", "avgDurationSecs": 4.574, "validDurationSecs": 60725.662, "totalHrs": 17.83, "validHrs": 16.86 }, "ar": { "buckets": { "dev": 10219, "test": 10498, "train": 28870, "validated": 79354, "invalidated": 15249, "other": 41437 }, "duration": 566224506, "reportedSentences": 2137, "validatedSentences": 59434, "unvalidatedSentences": 1742112, "clips": 136040, "splits": { "accent": {}, "age": { "": 0.55, "twenties": 0.3, "thirties": 0.11, "teens": 0.03, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.55, "male_masculine": 0.28, "female_feminine": 0.17, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 136040, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1651, "size": 3484836841, "checksum": "5df32f94259c2cfc45854fb7cf7191e317c5a9b662335fb9cfc9e5c3ec8b4f25", 
"avgDurationSecs": 4.162, "validDurationSecs": 330286.529, "totalHrs": 157.28, "validHrs": 91.74 }, "as": { "buckets": { "dev": 485, "test": 394, "train": 953, "validated": 1832, "invalidated": 282, "other": 2567 }, "duration": 27604649, "reportedSentences": 249, "validatedSentences": 7171, "unvalidatedSentences": 3344, "clips": 4681, "splits": { "accent": {}, "age": { "": 0.28, "twenties": 0.16, "thirties": 0.02, "teens": 0, "fourties": 0.52, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.82, "male_masculine": 0.18, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 4670, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 11, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 11 } }, "users": 51, "size": 167861194, "checksum": "799c31374c95f64b4400f9c1989466f727a797bd874d96359580012f1271e4f1", "avgDurationSecs": 5.897, "validDurationSecs": 10803.614, "totalHrs": 7.66, "validHrs": 3 }, "ast": { "buckets": { "dev": 114, "test": 230, "train": 445, "validated": 804, "invalidated": 42, "other": 863 }, "duration": 7513344, "reportedSentences": 0, "validatedSentences": 1642, "unvalidatedSentences": 290, "clips": 1709, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.1, "thirties": 0.12, "teens": 0.06, "fourties": 0.27, "fifties": 0.2, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.25, "male_masculine": 0.59, "female_feminine": 0.14, "transgender": 0.02, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1709, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, 
"news_current_affairs": 0, "technology_robotics": 0 } }, "users": 31, "size": 42880232, "checksum": "85477bbd78bab637a35e0ed18562dcbd729d85f253a22b75ce5a42124954167b", "avgDurationSecs": 4.396, "validDurationSecs": 3534.657, "totalHrs": 2.08, "validHrs": 0.98 }, "az": { "buckets": { "dev": 93, "test": 121, "train": 197, "validated": 411, "invalidated": 53, "other": 545 }, "duration": 5501016, "reportedSentences": 5, "validatedSentences": 93161, "unvalidatedSentences": 1862, "clips": 1009, "splits": { "accent": {}, "age": { "": 0.2, "twenties": 0.17, "thirties": 0.61, "teens": 0.01, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.82, "male_masculine": 0.17, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1009, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 45, "size": 38453820, "checksum": "00f1b0d6ef904e62eafbcc8d92e8e764486ba3d2d3d37a6b321b63555ab26a59", "avgDurationSecs": 5.452, "validDurationSecs": 2240.751, "totalHrs": 1.52, "validHrs": 0.62 }, "ba": { "buckets": { "dev": 14525, "test": 14569, "train": 119138, "validated": 210457, "invalidated": 8011, "other": 50 }, "duration": 967387572, "reportedSentences": 868, "validatedSentences": 153968, "unvalidatedSentences": 5, "clips": 218518, "splits": { "accent": {}, "age": { "": 0.3, "twenties": 0.17, "thirties": 0.17, "teens": 0.04, "fourties": 0.06, "fifties": 0.05, "sixties": 0.19, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male_masculine": 0.3, "female_feminine": 0.39, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 218518, "agriculture_food": 0, "automotive_transport": 0, 
"finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 930, "size": 5451322660, "checksum": "fab12b73feb3d921f594f5e0fa3b648d6ff79169c0fbeec35dee302a957adedc", "avgDurationSecs": 4.427, "validDurationSecs": 931701.216, "totalHrs": 268.71, "validHrs": 258.8 }, "bag": { "buckets": { "dev": 261, "test": 324, "train": 421, "validated": 7028, "invalidated": 49, "other": 35 }, "duration": 40652964, "reportedSentences": 1, "validatedSentences": 1006, "unvalidatedSentences": 6, "clips": 7112, "splits": { "accent": {}, "age": { "": 0.84, "twenties": 0, "thirties": 0.02, "teens": 0, "fourties": 0.14, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.84, "male_masculine": 0, "female_feminine": 0.02, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.14 }, "sentence_domain": { "": 7112, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 14, "size": 228620074, "checksum": "a8f8539aeae94dfe59b4842698eeccbf9f8adec175ef11ae0bac25f3ca925683", "avgDurationSecs": 5.716, "validDurationSecs": 40172.811, "totalHrs": 11.29, "validHrs": 11.15 }, "bas": { "buckets": { "dev": 1324, "test": 1550, "train": 2112, "validated": 11138, "invalidated": 1217, "other": 144 }, "duration": 49084848, "reportedSentences": 8, "validatedSentences": 5226, "unvalidatedSentences": 105, "clips": 12499, "splits": { "accent": {}, "age": { "": 0.39, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0.61, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.99, "male_masculine": 0, "female_feminine": 0, 
"transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 12491, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 8, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 57, "size": 254567498, "checksum": "185c03ca819b5b2e8db5ee3dfa63929c364fe5f952e919dc0b71bf4a126e7d68", "avgDurationSecs": 3.927, "validDurationSecs": 43740.062, "totalHrs": 13.63, "validHrs": 12.15 }, "bax": { "buckets": { "dev": 319, "test": 338, "train": 373, "validated": 7777, "invalidated": 80, "other": 829 }, "duration": 44092836, "reportedSentences": 0, "validatedSentences": 1030, "unvalidatedSentences": -1, "clips": 8686, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8686, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 13, "size": 251893798, "checksum": "009c91f847a92d1dcd6061fc943a2fce3c94a0006bf340fff2ac6dea08e10a74", "avgDurationSecs": 5.076, "validDurationSecs": 39478.469, "totalHrs": 12.24, "validHrs": 10.96 }, "bba": { "buckets": { "dev": 251, "test": 236, "train": 292, "validated": 6273, "invalidated": 2279, "other": 0 }, "duration": 54094284, "reportedSentences": 94, "validatedSentences": 971, "unvalidatedSentences": -1, "clips": 8552, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, 
"seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8552, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 14, "size": 297924030, "checksum": "b4c88af1da2257fabbeb487809e14f1048ee453d2392348cd3a88fba780d6779", "avgDurationSecs": 6.325, "validDurationSecs": 39678.84, "totalHrs": 15.02, "validHrs": 11.02 }, "bbj": { "buckets": { "dev": 268, "test": 324, "train": 404, "validated": 7242, "invalidated": 17, "other": 265 }, "duration": 46645812, "reportedSentences": 1, "validatedSentences": 996, "unvalidatedSentences": 1, "clips": 7524, "splits": { "accent": {}, "age": { "": 0.85, "twenties": 0.02, "thirties": 0.13, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.86, "male_masculine": 0, "female_feminine": 0.14, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7524, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 31, "size": 272457246, "checksum": "e95e5c54a75a528e49cd6eb79ddcc6dec8823d382d2d22e3421fc6299ca0b2b5", "avgDurationSecs": 6.2, "validDurationSecs": 44897.524, "totalHrs": 12.95, "validHrs": 12.47 }, "bbl": { "buckets": { "dev": 377, "test": 377, "train": 407, "validated": 4580, "invalidated": 130, "other": 112 }, "duration": 42473448, "reportedSentences": 0, "validatedSentences": 1661, "unvalidatedSentences": 462, "clips": 4822, 
"splits": { "accent": {}, "age": { "": 0.02, "twenties": 0, "thirties": 0.15, "teens": 0, "fourties": 0, "fifties": 0.04, "sixties": 0.55, "seventies": 0.24, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0, "female_feminine": 0.67, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 248, "agriculture_food": 20, "automotive_transport": 28, "finance": 0, "service_retail": 0, "general": 4539, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 20, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 22, "size": 252071422, "checksum": "630e474f98551ae6be9c32014ccfcd9e2147773dd2f039f08a57f89754cfa76d", "avgDurationSecs": 8.808, "validDurationSecs": 40341.848, "totalHrs": 11.79, "validHrs": 11.2 }, "bce": { "buckets": { "dev": 320, "test": 322, "train": 336, "validated": 6117, "invalidated": 24, "other": 0 }, "duration": 36181656, "reportedSentences": 0, "validatedSentences": 986, "unvalidatedSentences": -1, "clips": 6141, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6141, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 14, "size": 146567369, "checksum": "c05a81d13e145b5e30d749a6c51bd9e36731b10b99cc8947e989131f94c0e068", "avgDurationSecs": 5.892, "validDurationSecs": 36040.252, "totalHrs": 10.05, "validHrs": 10.01 }, "bci": { "buckets": { "dev": 267, "test": 290, "train": 319, "validated": 5623, "invalidated": 141, 
"other": 1414 }, "duration": 53374860, "reportedSentences": 1, "validatedSentences": 876, "unvalidatedSentences": -1, "clips": 7178, "splits": { "accent": {}, "age": { "": 0.86, "twenties": 0.14, "thirties": 0.01, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.93, "male_masculine": 0, "female_feminine": 0.07, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7178, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 16, "size": 309014764, "checksum": "77cca7a20507da7a7fb01355f526b7591406cbb354cbcf22193dc16e6e5f8fcd", "avgDurationSecs": 7.436, "validDurationSecs": 41812.042, "totalHrs": 14.82, "validHrs": 11.61 }, "be": { "buckets": { "dev": 15877, "test": 15880, "train": 347680, "validated": 1362284, "invalidated": 36972, "other": 18514 }, "duration": 6795011700, "reportedSentences": 3200, "validatedSentences": 379548, "unvalidatedSentences": 1931, "clips": 1417770, "splits": { "accent": {}, "age": { "": 0.87, "twenties": 0.03, "thirties": 0.04, "teens": 0.01, "fourties": 0.04, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.87, "male_masculine": 0.06, "female_feminine": 0.07, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1417681, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 4, "general": 33, "healthcare": 0, "history_law_government": 34, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 4, "news_current_affairs": 7, "technology_robotics": 7 } }, "users": 8596, "size": 38690099984, "checksum": "680f17ef2a63ef2934a2cd2c311e2d5002d32465fcccddb3a111141e95ac6d76", 
"avgDurationSecs": 4.793, "validDurationSecs": 6529081.387, "totalHrs": 1887.5, "validHrs": 1813.63 }, "beb": { "buckets": { "dev": 324, "test": 331, "train": 350, "validated": 6742, "invalidated": 737, "other": 2 }, "duration": 40845996, "reportedSentences": 0, "validatedSentences": 1005, "unvalidatedSentences": -1, "clips": 7481, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7481, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 33, "size": 235854057, "checksum": "4fbd54e932f2c10d69de7d04340e3a9dd79fa3ed518b78a243685acc0ac2e410", "avgDurationSecs": 5.46, "validDurationSecs": 36811.082, "totalHrs": 11.34, "validHrs": 10.22 }, "bfd": { "buckets": { "dev": 252, "test": 254, "train": 260, "validated": 6454, "invalidated": 53, "other": 673 }, "duration": 40598964, "reportedSentences": 0, "validatedSentences": 766, "unvalidatedSentences": -1, "clips": 7180, "splits": { "accent": {}, "age": { "": 0.99, "twenties": 0, "thirties": 0.01, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.99, "male_masculine": 0, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7180, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, 
"technology_robotics": 0 } }, "users": 36, "size": 232041727, "checksum": "91ec8fb3b6aa03a9eefe8adffc781d1dd8e2afa97c87d76968392955b9d6ae89", "avgDurationSecs": 5.654, "validDurationSecs": 36493.832, "totalHrs": 11.27, "validHrs": 10.13 }, "bft": { "buckets": { "dev": 1833, "test": 1805, "train": 2211, "validated": 10059, "invalidated": 489, "other": 10 }, "duration": 65071008, "reportedSentences": 37, "validatedSentences": 7968, "unvalidatedSentences": 12, "clips": 10558, "splits": { "accent": {}, "age": { "": 0.27, "twenties": 0.37, "thirties": 0.35, "teens": 0.01, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.78, "male_masculine": 0, "female_feminine": 0.13, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.1 }, "sentence_domain": { "": 10558, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 154, "size": 377793372, "checksum": "84f039c5154daaa1087d779cc28ca89181526c65080cc95a741dc8f17775453f", "avgDurationSecs": 6.163, "validDurationSecs": 61995.574, "totalHrs": 18.07, "validHrs": 17.22 }, "bg": { "buckets": { "dev": 2949, "test": 3380, "train": 4967, "validated": 11313, "invalidated": 777, "other": 1772 }, "duration": 75850704, "reportedSentences": 191, "validatedSentences": 39572, "unvalidatedSentences": 3, "clips": 13862, "splits": { "accent": {}, "age": { "": 0.37, "twenties": 0.11, "thirties": 0.28, "teens": 0, "fourties": 0.23, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male_masculine": 0.58, "female_feminine": 0.04, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 13862, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, 
"general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 148, "size": 446977666, "checksum": "6189a89f3fb0b00fc4e915ce6038f84e4cf4ff20879ded0eb32bac2f83e4f302", "avgDurationSecs": 5.472, "validDurationSecs": 61902.973, "totalHrs": 21.06, "validHrs": 17.19 }, "bgp": { "buckets": { "dev": 1105, "test": 2139, "train": 3538, "validated": 7606, "invalidated": 268, "other": 29 }, "duration": 48108024, "reportedSentences": 0, "validatedSentences": 6997, "unvalidatedSentences": -1, "clips": 7903, "splits": { "accent": {}, "age": { "": 0.95, "twenties": 0.05, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.99, "male_masculine": 0, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7903, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 25, "size": 284064238, "checksum": "149863ab8d1b264cbd81116c92708f7dc630f5c65ba4a12319838c4e12858c1b", "avgDurationSecs": 6.087, "validDurationSecs": 46300.092, "totalHrs": 13.36, "validHrs": 12.86 }, "bkh": { "buckets": { "dev": 305, "test": 304, "train": 377, "validated": 5614, "invalidated": 1949, "other": 0 }, "duration": 52500744, "reportedSentences": 2, "validatedSentences": 986, "unvalidatedSentences": 1, "clips": 7563, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 
0 }, "sentence_domain": { "": 7563, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 13, "size": 288996956, "checksum": "a39171ec6092f4b8a502f8a3b5316342d0293192ef64594a838f4b3a7d487072", "avgDurationSecs": 6.942, "validDurationSecs": 38971.199, "totalHrs": 14.58, "validHrs": 10.82 }, "bkm": { "buckets": { "dev": 304, "test": 310, "train": 326, "validated": 7573, "invalidated": 260, "other": 0 }, "duration": 45229536, "reportedSentences": 0, "validatedSentences": 940, "unvalidatedSentences": -1, "clips": 7833, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7833, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 27, "size": 266191823, "checksum": "6bb316a0f85ab143d44817041c8fa6e25ad5e5ff53d745911f4e08cc4b16d889", "avgDurationSecs": 5.774, "validDurationSecs": 43728.236, "totalHrs": 12.56, "validHrs": 12.14 }, "bn": { "buckets": { "dev": 9394, "test": 9394, "train": 21575, "validated": 44655, "invalidated": 7928, "other": 999507 }, "duration": 4598510976, "reportedSentences": 1890, "validatedSentences": 562827, "unvalidatedSentences": 94116, "clips": 1052090, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0.67, "thirties": 0.03, "teens": 0.07, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, 
"eighties": 0, "nineties": 0 }, "gender": { "": 0.23, "male_masculine": 0.54, "female_feminine": 0.23, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1052090, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 23002, "size": 26574204428, "checksum": "3e0f6c6c6484df5f7b2c03af079962671d2d3debb0fcab38351db1d644b01b4e", "avgDurationSecs": 4.371, "validDurationSecs": 195179.602, "totalHrs": 1277.36, "validHrs": 54.21 }, "bnm": { "buckets": { "dev": 282, "test": 331, "train": 412, "validated": 7675, "invalidated": 83, "other": 216 }, "duration": 60507684, "reportedSentences": 2, "validatedSentences": 1025, "unvalidatedSentences": -1, "clips": 7974, "splits": { "accent": {}, "age": { "": 0.65, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0.06, "fifties": 0.12, "sixties": 0.05, "seventies": 0.12, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7974, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 21, "size": 357621762, "checksum": "1877f0433254855359cafe4dcc6ea8f7919d5d98de023929a4cc501b3e3257c4", "avgDurationSecs": 7.588, "validDurationSecs": 58238.836, "totalHrs": 16.8, "validHrs": 16.17 }, "bnn": { "buckets": { "dev": 958, "test": 960, "train": 1040, "validated": 7274, "invalidated": 946, "other": 10 }, "duration": 41889204, "reportedSentences": 3, "validatedSentences": 3167, "unvalidatedSentences": 8, "clips": 8230, 
"splits": { "accent": {}, "age": { "": 0.6, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0.06, "sixties": 0.34, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.18, "male_masculine": 0.06, "female_feminine": 0.41, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.35 }, "sentence_domain": { "": 273, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 10108, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 19, "size": 246952751, "checksum": "cb970d52a5f68caad735b54666dc96b8b063526072d6a6c80b96f0cf3002a813", "avgDurationSecs": 5.09, "validDurationSecs": 37023.338, "totalHrs": 11.63, "validHrs": 10.28 }, "br": { "buckets": { "dev": 3397, "test": 3399, "train": 4396, "validated": 33695, "invalidated": 2406, "other": 0 }, "duration": 117428587, "reportedSentences": 304, "validatedSentences": 22656, "unvalidatedSentences": 370, "clips": 36101, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.29, "thirties": 0.13, "teens": 0.01, "fourties": 0.06, "fifties": 0.08, "sixties": 0.12, "seventies": 0.06, "eighties": 0, "nineties": 0 }, "gender": { "": 0.35, "male_masculine": 0.59, "female_feminine": 0.06, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 31773, "agriculture_food": 3, "automotive_transport": 2, "finance": 1, "service_retail": 4, "general": 4276, "healthcare": 13, "history_law_government": 11, "language_fundamentals": 0, "media_entertainment": 13, "nature_environment": 8, "news_current_affairs": 0, "technology_robotics": 4 } }, "users": 272, "size": 796308893, "checksum": "4d824b0baf8291a9afbd9daab2ce651d5fb48974b4f33181fe8c0bdf7b57eea7", "avgDurationSecs": 3.253, "validDurationSecs": 109602.4, "totalHrs": 32.61, "validHrs": 30.44 }, "brh": { "buckets": { "dev": 504, "test": 882, 
"train": 1705, "validated": 5084, "invalidated": 12, "other": 12 }, "duration": 37155096, "reportedSentences": 0, "validatedSentences": 3095, "unvalidatedSentences": -1, "clips": 5108, "splits": { "accent": {}, "age": { "": 0.81, "twenties": 0, "thirties": 0.19, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5108, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 18, "size": 215870657, "checksum": "a7404392ac28d37861d0d5cf7d444bb1591332c1601d11c6b55d0df8c242e39a", "avgDurationSecs": 7.274, "validDurationSecs": 36980.522, "totalHrs": 10.32, "validHrs": 10.27 }, "bri": { "buckets": { "dev": 203, "test": 298, "train": 699, "validated": 8838, "invalidated": 234, "other": 122 }, "duration": 39939480, "reportedSentences": 12, "validatedSentences": 1200, "unvalidatedSentences": -1, "clips": 9194, "splits": { "accent": {}, "age": { "": 0.86, "twenties": 0.01, "thirties": 0.13, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.87, "male_masculine": 0, "female_feminine": 0.13, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 9194, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 15, "size": 197673444, "checksum": "385b9cae7f3fa2d70cfc54c00d191f28c435392d8309c33a666695146798903c", 
"avgDurationSecs": 4.344, "validDurationSecs": 38392.987, "totalHrs": 11.09, "validHrs": 10.66 }, "bsh": { "buckets": { "dev": 515, "test": 693, "train": 1420, "validated": 6765, "invalidated": 60, "other": 37 }, "duration": 37426032, "reportedSentences": 2, "validatedSentences": 2646, "unvalidatedSentences": 40, "clips": 6862, "splits": { "accent": {}, "age": { "": 0.15, "twenties": 0.57, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0.28, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6862, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 14, "size": 220715976, "checksum": "4111396cbb8b05515503fe3846ab5732d1209a8325d951e58e2212fedc8e0839", "avgDurationSecs": 5.454, "validDurationSecs": 36896.984, "totalHrs": 10.39, "validHrs": 10.24 }, "bsk": { "buckets": { "dev": 348, "test": 733, "train": 1187, "validated": 8615, "invalidated": 686, "other": 1 }, "duration": 39648312, "reportedSentences": 2, "validatedSentences": 2447, "unvalidatedSentences": 22, "clips": 9302, "splits": { "accent": {}, "age": { "": 0.3, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0.46, "fifties": 0.24, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 9248, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 54, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, 
"technology_robotics": 0 } }, "users": 27, "size": 229263635, "checksum": "468289bf74d387ddf1f708a23ef08b5f6d25a56bdabd7cf637e02d547df9301a", "avgDurationSecs": 4.262, "validDurationSecs": 36720.083, "totalHrs": 11.01, "validHrs": 10.2 }, "btv": { "buckets": { "dev": 226, "test": 329, "train": 452, "validated": 9109, "invalidated": 127, "other": 47 }, "duration": 37660860, "reportedSentences": 263, "validatedSentences": 1053, "unvalidatedSentences": 9, "clips": 9283, "splits": { "accent": {}, "age": { "": 0.47, "twenties": 0.32, "thirties": 0, "teens": 0, "fourties": 0.21, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.89, "male_masculine": 0.11, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 9274, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 8, "healthcare": 1, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 16, "size": 215819786, "checksum": "8e3da90b7112ef6be21aadd8ef9b34b5339e0d96dda50179864bdbdaebade716", "avgDurationSecs": 4.057, "validDurationSecs": 36954.947, "totalHrs": 10.46, "validHrs": 10.26 }, "bum": { "buckets": { "dev": 269, "test": 285, "train": 347, "validated": 7751, "invalidated": 177, "other": 29 }, "duration": 37052424, "reportedSentences": 0, "validatedSentences": 901, "unvalidatedSentences": -1, "clips": 7957, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7957, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, 
"history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 22, "size": 210559404, "checksum": "ea1e4b8532b0b153d715335f212bafabc80166cc4c4a92f0d7d2fc75380605f9", "avgDurationSecs": 4.657, "validDurationSecs": 36093.168, "totalHrs": 10.29, "validHrs": 10.02 }, "byv": { "buckets": { "dev": 327, "test": 327, "train": 336, "validated": 7719, "invalidated": 0, "other": 0 }, "duration": 47684376, "reportedSentences": 0, "validatedSentences": 990, "unvalidatedSentences": 1027, "clips": 7719, "splits": { "accent": {}, "age": { "": 0.81, "twenties": 0.06, "thirties": 0.12, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.94, "male_masculine": 0, "female_feminine": 0.06, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7719, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 96, "size": 276978955, "checksum": "669c0a2460b4badcf56f48d8b9008f342ba6f5967d0574b09b830c6e6ba705bb", "avgDurationSecs": 6.178, "validDurationSecs": 47684.376, "totalHrs": 13.24, "validHrs": 13.24 }, "ca": { "buckets": { "dev": 16416, "test": 16416, "train": 1215000, "validated": 2297291, "invalidated": 141235, "other": 238250 }, "duration": 13920572682, "reportedSentences": 9581, "validatedSentences": 1304058, "unvalidatedSentences": 3656, "clips": 2676776, "splits": { "accent": {}, "age": { "": 0.27, "twenties": 0.04, "thirties": 0.05, "teens": 0.01, "fourties": 0.12, "fifties": 0.19, "sixties": 0.29, "seventies": 0.04, "eighties": 0, "nineties": 0 }, "gender": { "": 0.27, "male_masculine": 0.52, "female_feminine": 0.21, "transgender": 0, 
"non-binary": 0, "do_not_wish_to_say": 0, "intersex": 0 }, "sentence_domain": { "": 2675630, "agriculture_food": 59, "automotive_transport": 31, "finance": 4, "service_retail": 63, "general": 657, "healthcare": 36, "history_law_government": 65, "language_fundamentals": 34, "media_entertainment": 37, "nature_environment": 135, "news_current_affairs": 36, "technology_robotics": 27 } }, "users": 36868, "size": 83674470758, "checksum": "74b7ee289aaf2a199661b45a7b99447e7cc0dc527c1276e5448cd20b6dacc100", "avgDurationSecs": 5.2, "validDurationSecs": 11947061.068, "totalHrs": 3866.82, "validHrs": 3318.62 }, "cjk": { "buckets": { "dev": 377, "test": 377, "train": 387, "validated": 7333, "invalidated": 1195, "other": 0 }, "duration": 50697864, "reportedSentences": 0, "validatedSentences": 1141, "unvalidatedSentences": 1, "clips": 8528, "splits": { "accent": {}, "age": { "": 0.11, "twenties": 0.12, "thirties": 0.26, "teens": 0, "fourties": 0.49, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.62, "male_masculine": 0.03, "female_feminine": 0.35, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8528, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 45, "size": 279634470, "checksum": "0da6fc97db29d26c393769cd2299305ede7e6a9e9ca5e827f95f04731665e69f", "avgDurationSecs": 5.945, "validDurationSecs": 43593.743, "totalHrs": 14.08, "validHrs": 12.1 }, "ckb": { "buckets": { "dev": 5348, "test": 5348, "train": 7883, "validated": 119869, "invalidated": 9278, "other": 39520 }, "duration": 690661692, "reportedSentences": 2531, "validatedSentences": 19230, "unvalidatedSentences": 15255, "clips": 168667, "splits": { "accent": {}, "age": { "": 0.32, "twenties": 0.43, 
"thirties": 0.1, "teens": 0.02, "fourties": 0.02, "fifties": 0.01, "sixties": 0.1, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.32, "male_masculine": 0.63, "female_feminine": 0.05, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 168421, "agriculture_food": 0, "automotive_transport": 10, "finance": 0, "service_retail": 0, "general": 121, "healthcare": 0, "history_law_government": 10, "language_fundamentals": 10, "media_entertainment": 95, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1976, "size": 3817557893, "checksum": "19fc00e7df6b164a6f77c7eca7c22530b52f032b9d00a12e99ee46ccb403553e", "avgDurationSecs": 4.095, "validDurationSecs": 490842.467, "totalHrs": 191.85, "validHrs": 136.34 }, "cnh": { "buckets": { "dev": 763, "test": 765, "train": 818, "validated": 2463, "invalidated": 437, "other": 3287 }, "duration": 21751656, "reportedSentences": 9, "validatedSentences": 5218, "unvalidatedSentences": -1, "clips": 6187, "splits": { "accent": {}, "age": { "": 0.48, "twenties": 0.4, "thirties": 0.07, "teens": 0.01, "fourties": 0.01, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.48, "male_masculine": 0.3, "female_feminine": 0.21, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6187, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 300, "size": 168181204, "checksum": "cb04539ec3670594b854f4010f0dbb8ce0464ed8bf0bcbbc726f22fd58bd91fc", "avgDurationSecs": 3.516, "validDurationSecs": 8659.177, "totalHrs": 6.04, "validHrs": 2.4 }, "cpy": { "buckets": { "dev": 188, "test": 294, "train": 518, "validated": 8162, "invalidated": 26, "other": 84 }, 
"duration": 36567612, "reportedSentences": 0, "validatedSentences": 1000, "unvalidatedSentences": 15, "clips": 8272, "splits": { "accent": {}, "age": { "": 0.01, "twenties": 0.12, "thirties": 0.12, "teens": 0.24, "fourties": 0.39, "fifties": 0, "sixties": 0.12, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.77, "male_masculine": 0, "female_feminine": 0.23, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8272, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 15, "size": 210662597, "checksum": "912429d002017db6e89bef944953a49cd8468c538e1d71f5071a401933549cf9", "avgDurationSecs": 4.421, "validDurationSecs": 36081.341, "totalHrs": 10.15, "validHrs": 10.02 }, "cs": { "buckets": { "dev": 9458, "test": 9437, "train": 21951, "validated": 65000, "invalidated": 2610, "other": 149239 }, "duration": 967201527, "reportedSentences": 973, "validatedSentences": 451217, "unvalidatedSentences": 141, "clips": 216849, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.19, "thirties": 0.47, "teens": 0.02, "fourties": 0.07, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.25, "male_masculine": 0.53, "female_feminine": 0.22, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 216826, "agriculture_food": 0, "automotive_transport": 1, "finance": 0, "service_retail": 0, "general": 15, "healthcare": 0, "history_law_government": 1, "language_fundamentals": 0, "media_entertainment": 4, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 8 } }, "users": 1129, "size": 5950686352, "checksum": "3ef013a0c17afa5e873694e292c928dfe20a26f83b6eae5ed89a2710c58f6181", "avgDurationSecs": 4.46, 
"validDurationSecs": 289916.482, "totalHrs": 268.66, "validHrs": 80.53 }, "cut": { "buckets": { "dev": 290, "test": 325, "train": 378, "validated": 5481, "invalidated": 5, "other": 130 }, "duration": 37239084, "reportedSentences": 0, "validatedSentences": 994, "unvalidatedSentences": 1, "clips": 5616, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0, "thirties": 0.42, "teens": 0, "fourties": 0.17, "fifties": 0.16, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.32, "male_masculine": 0, "female_feminine": 0.68, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5616, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 15, "size": 219697025, "checksum": "95481d4df662f9640074f60ebab50808f0c651abaf574b3ca694a647735ce0f0", "avgDurationSecs": 6.631, "validDurationSecs": 36343.914, "totalHrs": 10.34, "validHrs": 10.09 }, "cux": { "buckets": { "dev": 647, "test": 494, "train": 1107, "validated": 9016, "invalidated": 325, "other": 9 }, "duration": 38339748, "reportedSentences": 1, "validatedSentences": 2251, "unvalidatedSentences": 18, "clips": 9350, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.36, "thirties": 0.23, "teens": 0.41, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.35, "male_masculine": 0, "female_feminine": 0.65, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 9350, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, 
"users": 11, "size": 174502178, "checksum": "8a0023face6efa3ba1ed8b536d8a107b6586ccae6e1561a71f18c16ce6acfc9a", "avgDurationSecs": 4.101, "validDurationSecs": 36970.178, "totalHrs": 10.64, "validHrs": 10.26 }, "cv": { "buckets": { "dev": 1242, "test": 1288, "train": 1456, "validated": 17497, "invalidated": 2215, "other": 744 }, "duration": 103130544, "reportedSentences": 144, "validatedSentences": 4382, "unvalidatedSentences": 4, "clips": 20456, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.5, "thirties": 0.01, "teens": 0.18, "fourties": 0.06, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.22, "male_masculine": 0.51, "female_feminine": 0.27, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 20455, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 1 } }, "users": 122, "size": 697770986, "checksum": "4ffa14e3b8d428118c32e8a26c5b940d246eaececc85eecb572b5ca8dc5b7819", "avgDurationSecs": 5.042, "validDurationSecs": 88212.511, "totalHrs": 28.64, "validHrs": 24.5 }, "cy": { "buckets": { "dev": 5414, "test": 5414, "train": 8026, "validated": 90859, "invalidated": 4471, "other": 20672 }, "duration": 570264093, "reportedSentences": 176, "validatedSentences": 118044, "unvalidatedSentences": 15, "clips": 116002, "splits": { "accent": {}, "age": { "": 0.43, "twenties": 0.13, "thirties": 0.09, "teens": 0.02, "fourties": 0.16, "fifties": 0.09, "sixties": 0.07, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.43, "male_masculine": 0.33, "female_feminine": 0.25, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 116001, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, 
"general": 1, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1931, "size": 4158425297, "checksum": "c73af06c90775d7b9108c569ce6f41a4111a76897e6962e83ca64a1e99f82c8a", "avgDurationSecs": 4.916, "validDurationSecs": 446661.482, "totalHrs": 158.4, "validHrs": 124.07 }, "da": { "buckets": { "dev": 2629, "test": 2758, "train": 3602, "validated": 11341, "invalidated": 441, "other": 2308 }, "duration": 57211740, "reportedSentences": 685, "validatedSentences": 20807, "unvalidatedSentences": 274, "clips": 14090, "splits": { "accent": {}, "age": { "": 0.4, "twenties": 0.16, "thirties": 0.22, "teens": 0.01, "fourties": 0.16, "fifties": 0.04, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.46, "male_masculine": 0.48, "female_feminine": 0.06, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 14063, "agriculture_food": 1, "automotive_transport": 3, "finance": 0, "service_retail": 1, "general": 16, "healthcare": 1, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 9, "nature_environment": 2, "news_current_affairs": 0, "technology_robotics": 1 } }, "users": 296, "size": 331777774, "checksum": "3d60b07b341b736ae70ed8fc0c685663ae7304bd2840ade7d705f6bdada74d9b", "avgDurationSecs": 4.06, "validDurationSecs": 46049.563, "totalHrs": 15.89, "validHrs": 12.79 }, "dag": { "buckets": { "dev": 396, "test": 415, "train": 446, "validated": 8114, "invalidated": 114, "other": 13046 }, "duration": 91434492, "reportedSentences": 0, "validatedSentences": 5220, "unvalidatedSentences": 15347, "clips": 21274, "splits": { "accent": {}, "age": { "": 0.36, "twenties": 0.39, "thirties": 0.24, "teens": 0, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.78, "male_masculine": 0.02, "female_feminine": 
0.18, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.02 }, "sentence_domain": { "": 21261, "agriculture_food": 3, "automotive_transport": 3, "finance": 6, "service_retail": 0, "general": 7, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 57, "size": 508358002, "checksum": "6cd20551fbfb1352fea5f535c502f8d85c37ad70adeef52c7981cd8816c30a95", "avgDurationSecs": 4.298, "validDurationSecs": 34873.53, "totalHrs": 25.39, "validHrs": 9.68 }, "dar": { "buckets": { "dev": 611, "test": 689, "train": 859, "validated": 3113, "invalidated": 31, "other": 2013 }, "duration": 27594144, "reportedSentences": 4, "validatedSentences": 2819, "unvalidatedSentences": 472, "clips": 5157, "splits": { "accent": {}, "age": { "": 0.18, "twenties": 0.36, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0.45, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.7, "male_masculine": 0, "female_feminine": 0.3, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2, "agriculture_food": 18, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 3214, "healthcare": 59, "history_law_government": 121, "language_fundamentals": 166, "media_entertainment": 1125, "nature_environment": 0, "news_current_affairs": 456, "technology_robotics": 0 } }, "users": 13, "size": 160770318, "checksum": "4b6fadcda297c53d5ff319d2ee29f02698e9959cc46eb96128fd973bfe378b6f", "avgDurationSecs": 5.351, "validDurationSecs": 16657.082, "totalHrs": 7.66, "validHrs": 4.62 }, "dav": { "buckets": { "dev": 1276, "test": 1004, "train": 2098, "validated": 8264, "invalidated": 663, "other": 40703 }, "duration": 201431016, "reportedSentences": 11, "validatedSentences": 11773, "unvalidatedSentences": 20119, "clips": 49630, "splits": { "accent": {}, "age": { "": 0.04, "twenties": 0.61, "thirties": 0.1, 
"teens": 0, "fourties": 0.01, "fifties": 0.01, "sixties": 0.23, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.73, "male_masculine": 0, "female_feminine": 0.27, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 971, "agriculture_food": 499, "automotive_transport": 100, "finance": 304, "service_retail": 1218, "general": 45323, "healthcare": 967, "history_law_government": 334, "language_fundamentals": 150, "media_entertainment": 68, "nature_environment": 873, "news_current_affairs": 86, "technology_robotics": 75 } }, "users": 24, "size": 1084119611, "checksum": "88631830c7aa16e1e62508169dc6434f58d957828bac8a12b93f375eab805328", "avgDurationSecs": 4.059, "validDurationSecs": 33540.72, "totalHrs": 55.95, "validHrs": 9.31 }, "de": { "buckets": { "dev": 16203, "test": 16203, "train": 616461, "validated": 944924, "invalidated": 57712, "other": 11538 }, "duration": 5339754341, "reportedSentences": 11379, "validatedSentences": 2057983, "unvalidatedSentences": 1537, "clips": 1014174, "splits": { "accent": {}, "age": { "": 0.33, "twenties": 0.17, "thirties": 0.16, "teens": 0.03, "fourties": 0.17, "fifties": 0.11, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.34, "male_masculine": 0.58, "female_feminine": 0.08, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1013973, "agriculture_food": 0, "automotive_transport": 6, "finance": 0, "service_retail": 12, "general": 142, "healthcare": 16, "history_law_government": 14, "language_fundamentals": 3, "media_entertainment": 1, "nature_environment": 12, "news_current_affairs": 4, "technology_robotics": 9 } }, "users": 20413, "size": 37071377685, "checksum": "7b0b97f32ed8404b02c75251a99e3ea19317888fd3a51567275b971cb36c1f21", "avgDurationSecs": 5.265, "validDurationSecs": 4975144.335, "totalHrs": 1483.26, "validHrs": 1381.98 }, "dmk": { "buckets": { "dev": 0, "test": 938, "train": 3200, "validated": 10648, 
"invalidated": 123, "other": 21 }, "duration": 38273472, "reportedSentences": 732, "validatedSentences": 4139, "unvalidatedSentences": 194, "clips": 10792, "splits": { "accent": {}, "age": { "": 0.04, "twenties": 0.01, "thirties": 0.15, "teens": 0, "fourties": 0.81, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.95, "male_masculine": 0, "female_feminine": 0.05, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 10790, "agriculture_food": 2, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 2, "healthcare": 2, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 10, "size": 223080239, "checksum": "12bd601e002918086e4f6381da0c00820170a3fb111674c55fd3ae91257b35aa", "avgDurationSecs": 3.546, "validDurationSecs": 37762.781, "totalHrs": 10.63, "validHrs": 10.48 }, "dml": { "buckets": { "dev": 1101, "test": 626, "train": 3190, "validated": 6115, "invalidated": 73, "other": 299 }, "duration": 39238596, "reportedSentences": 0, "validatedSentences": 5670, "unvalidatedSentences": 676, "clips": 6487, "splits": { "accent": {}, "age": { "": 0.08, "twenties": 0.09, "thirties": 0, "teens": 0, "fourties": 0.79, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6473, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 7, "healthcare": 0, "history_law_government": 3, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 4, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 5, "size": 232053163, "checksum": "217262aaf8bc8133edd0088e2813d7b91e1a3b1970351356d877f76859b4d900", "avgDurationSecs": 6.049, 
"validDurationSecs": 36988.441, "totalHrs": 10.89, "validHrs": 10.27 }, "dru": { "buckets": { "dev": 933, "test": 957, "train": 1074, "validated": 6576, "invalidated": 116, "other": 1 }, "duration": 37990332, "reportedSentences": 2, "validatedSentences": 3867, "unvalidatedSentences": 477, "clips": 6693, "splits": { "accent": {}, "age": { "": 0.88, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0.12, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0, "male_masculine": 0, "female_feminine": 0.12, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.87 }, "sentence_domain": { "": 71, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 10031, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 1759, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 20, "size": 223976380, "checksum": "0a27a4418f5afc38f8960d83bd89deeb77d8a6f9a557158e525cac1edeaf33ff", "avgDurationSecs": 5.676, "validDurationSecs": 37326.225, "totalHrs": 10.55, "validHrs": 10.36 }, "dsb": { "buckets": { "dev": 0, "test": 0, "train": 0, "validated": 0, "invalidated": 0, "other": 5 }, "duration": 40284, "reportedSentences": 0, "validatedSentences": 796, "unvalidatedSentences": -1, "clips": 5, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 0, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 2, "healthcare": 0, "history_law_government": 2, "language_fundamentals": 0, "media_entertainment": 2, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1, "size": 293762, 
"checksum": "88c5e7f1c3a52573d7248bbf464b63cea37bec7a2690c7e2724a9007cbbe38a6", "avgDurationSecs": 8.057, "validDurationSecs": 0, "totalHrs": 0.01, "validHrs": 0 }, "dua": { "buckets": { "dev": 326, "test": 311, "train": 366, "validated": 7018, "invalidated": 1102, "other": 4 }, "duration": 55392156, "reportedSentences": 1, "validatedSentences": 1003, "unvalidatedSentences": 1, "clips": 8124, "splits": { "accent": {}, "age": { "": 0.77, "twenties": 0.11, "thirties": 0.1, "teens": 0, "fourties": 0.02, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.89, "male_masculine": 0, "female_feminine": 0.11, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8124, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 13, "size": 312378845, "checksum": "7510e1763517426565510abe5376ba0b5ea3eabdf1efbba198fe198416da2ac6", "avgDurationSecs": 6.818, "validDurationSecs": 47851.077, "totalHrs": 15.38, "validHrs": 13.29 }, "dv": { "buckets": { "dev": 2243, "test": 2228, "train": 2648, "validated": 26571, "invalidated": 1661, "other": 15071 }, "duration": 221700519, "reportedSentences": 59, "validatedSentences": 7174, "unvalidatedSentences": 9, "clips": 43303, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.11, "thirties": 0.39, "teens": 0.01, "fourties": 0.23, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.24, "male_masculine": 0.24, "female_feminine": 0.52, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 43302, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, 
"language_fundamentals": 0, "media_entertainment": 1, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 391, "size": 1409830097, "checksum": "4bc4c02d7baee9139efa04f9b7b06c7d2fb7bdfca62a3ea071c4df678e67822f", "avgDurationSecs": 5.12, "validDurationSecs": 136036.868, "totalHrs": 61.58, "validHrs": 37.78 }, "dyu": { "buckets": { "dev": 50, "test": 63, "train": 90, "validated": 211, "invalidated": 15, "other": 69 }, "duration": 1861236, "reportedSentences": 0, "validatedSentences": 5028, "unvalidatedSentences": 41, "clips": 295, "splits": { "accent": {}, "age": { "": 0.31, "twenties": 0.66, "thirties": 0, "teens": 0.02, "fourties": 0.02, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.31, "male_masculine": 0.39, "female_feminine": 0.3, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 295, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 35, "size": 10792930, "checksum": "5c73eef972e528e84cc618c3ba889e8f40656fa5d1aa2daba186fef1d7359c72", "avgDurationSecs": 6.309, "validDurationSecs": 1331.257, "totalHrs": 0.51, "validHrs": 0.36 }, "ebr": { "buckets": { "dev": 0, "test": 228, "train": 644, "validated": 1544, "invalidated": 1, "other": 506 }, "duration": 11458872, "reportedSentences": 5, "validatedSentences": 872, "unvalidatedSentences": 3, "clips": 2051, "splits": { "accent": {}, "age": { "": 0.03, "twenties": 0.23, "thirties": 0.32, "teens": 0, "fourties": 0.42, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.68, "male_masculine": 0, "female_feminine": 0.32, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2051, 
"agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 6, "size": 64924896, "checksum": "4d3c3313b8e1c20da0216c410510ed86d5c21c5ddaf97f959f464cd07e98d5e6", "avgDurationSecs": 5.587, "validDurationSecs": 8626.279, "totalHrs": 3.18, "validHrs": 2.39 }, "eko": { "buckets": { "dev": 353, "test": 414, "train": 519, "validated": 3950, "invalidated": 113, "other": 12 }, "duration": 37988928, "reportedSentences": 0, "validatedSentences": 1288, "unvalidatedSentences": 16, "clips": 4075, "splits": { "accent": {}, "age": { "": 0.08, "twenties": 0.31, "thirties": 0.55, "teens": 0, "fourties": 0.05, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.48, "male_masculine": 0, "female_feminine": 0.52, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1794, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 2257, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 24, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 7, "size": 221521094, "checksum": "a565d2a7e72ea4b0ed67ddc43e2adacf991c5b3f785f92bb7c7d1d5df7d01033", "avgDurationSecs": 9.322, "validDurationSecs": 36823.623, "totalHrs": 10.55, "validHrs": 10.22 }, "el": { "buckets": { "dev": 1711, "test": 1717, "train": 1921, "validated": 17020, "invalidated": 877, "other": 10324 }, "duration": 117179854, "reportedSentences": 72, "validatedSentences": 7018, "unvalidatedSentences": 4340, "clips": 28221, "splits": { "accent": {}, "age": { "": 0.31, "twenties": 0.12, "thirties": 0.36, "teens": 0.01, "fourties": 0.17, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, 
"gender": { "": 0.34, "male_masculine": 0.63, "female_feminine": 0.03, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 28221, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 448, "size": 777849676, "checksum": "61e377fda91bf4f7a0a45b735d0bede5d140573600a4e9356b381ba1dc5ee297", "avgDurationSecs": 4.152, "validDurationSecs": 70670.817, "totalHrs": 32.54, "validHrs": 19.63 }, "en": { "buckets": { "dev": 16403, "test": 16403, "train": 1147819, "validated": 1869726, "invalidated": 309878, "other": 389504 }, "duration": 13605229315, "reportedSentences": 9830, "validatedSentences": 1681441, "unvalidatedSentences": 37247, "clips": 2569108, "splits": { "accent": {}, "age": { "": 0.36, "twenties": 0.25, "thirties": 0.14, "teens": 0.06, "fourties": 0.09, "fifties": 0.05, "sixties": 0.04, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.39, "male_masculine": 0.43, "female_feminine": 0.18, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0, "intersex": 0 }, "sentence_domain": { "": 2567905, "agriculture_food": 161, "automotive_transport": 8, "finance": 44, "service_retail": 29, "general": 644, "healthcare": 25, "history_law_government": 120, "language_fundamentals": 10, "media_entertainment": 115, "nature_environment": 55, "news_current_affairs": 13, "technology_robotics": 98 } }, "users": 99289, "size": 94210106373, "checksum": "a477e793f831662c00fd526453abde3c3e4791b8eda19e1a723457751f8f3c61", "avgDurationSecs": 5.296, "validDurationSecs": 9901510.947, "totalHrs": 3779.23, "validHrs": 2750.41 }, "eo": { "buckets": { "dev": 14916, "test": 14900, "train": 144451, "validated": 852619, "invalidated": 127521, "other": 164426 }, "duration": 6951494923, "reportedSentences": 2404, 
"validatedSentences": 181129, "unvalidatedSentences": 21895, "clips": 1144566, "splits": { "accent": {}, "age": { "": 0.2, "twenties": 0.56, "thirties": 0.12, "teens": 0.05, "fourties": 0.04, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0.01, "nineties": 0 }, "gender": { "": 0.2, "male_masculine": 0.69, "female_feminine": 0.11, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1144561, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 5, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1878, "size": 41544000333, "checksum": "b35aa988b502d8c7da49f5f34525a5b7d55948e4c6c1cd62a6cd0cd6ffb3902c", "avgDurationSecs": 6.073, "validDurationSecs": 5178361.623, "totalHrs": 1930.97, "validHrs": 1438.43 }, "es": { "buckets": { "dev": 15900, "test": 15900, "train": 357397, "validated": 436590, "invalidated": 95033, "other": 1144151 }, "duration": 8177052330, "reportedSentences": 2666, "validatedSentences": 1082337, "unvalidatedSentences": 4887, "clips": 1675774, "splits": { "accent": {}, "age": { "": 0.13, "twenties": 0.53, "thirties": 0.09, "teens": 0.08, "fourties": 0.03, "fifties": 0.04, "sixties": 0.1, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.13, "male_masculine": 0.55, "female_feminine": 0.31, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1675639, "agriculture_food": 1, "automotive_transport": 4, "finance": 5, "service_retail": 3, "general": 43, "healthcare": 4, "history_law_government": 33, "language_fundamentals": 8, "media_entertainment": 8, "nature_environment": 11, "news_current_affairs": 18, "technology_robotics": 22 } }, "users": 26838, "size": 51528103656, "checksum": "2c2943fbc5c96d5c00bd98f6fe41f358a057ae451fd4c324bff98a1b41e4f0d8", "avgDurationSecs": 4.88, 
"validDurationSecs": 2130370.37, "totalHrs": 2271.4, "validHrs": 591.76 }, "esu": { "buckets": { "dev": 0, "test": 501, "train": 6125, "validated": 7219, "invalidated": 0, "other": 5 }, "duration": 27453780, "reportedSentences": 17, "validatedSentences": 6929, "unvalidatedSentences": 5, "clips": 7224, "splits": { "accent": {}, "age": { "": 0, "twenties": 0, "thirties": 1, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7224, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 4, "size": 142932823, "checksum": "3a7954a7aadc0752923b1ba0c4429ac08bd491ffdb2b8bc3eda1f067fb2eb276", "avgDurationSecs": 3.8, "validDurationSecs": 27434.778, "totalHrs": 7.62, "validHrs": 7.62 }, "et": { "buckets": { "dev": 2887, "test": 2887, "train": 3496, "validated": 27656, "invalidated": 7962, "other": 126 }, "duration": 240245642, "reportedSentences": 571, "validatedSentences": 11090, "unvalidatedSentences": 1876, "clips": 35744, "splits": { "accent": {}, "age": { "": 0.18, "twenties": 0.7, "thirties": 0.08, "teens": 0, "fourties": 0.03, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.24, "male_masculine": 0.49, "female_feminine": 0.26, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 34751, "agriculture_food": 61, "automotive_transport": 46, "finance": 42, "service_retail": 14, "general": 620, "healthcare": 36, "history_law_government": 40, "language_fundamentals": 17, "media_entertainment": 46, "nature_environment": 111, "news_current_affairs": 23, 
"technology_robotics": 46 } }, "users": 1067, "size": 1599629830, "checksum": "42e83ac54800354de55915a618b1499f87aab5228ba1e76a1e8bef80151fed21", "avgDurationSecs": 6.721, "validDurationSecs": 185883.882, "totalHrs": 66.73, "validHrs": 51.63 }, "eto": { "buckets": { "dev": 300, "test": 307, "train": 359, "validated": 10552, "invalidated": 346, "other": 204 }, "duration": 38369448, "reportedSentences": 0, "validatedSentences": 966, "unvalidatedSentences": -1, "clips": 11102, "splits": { "accent": {}, "age": { "": 0.99, "twenties": 0, "thirties": 0.01, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 11102, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 26, "size": 217054142, "checksum": "a5b737b4a4f050cc83ff2219a52ad54ebb9e47a0dda86172a12086928c2e5d5d", "avgDurationSecs": 3.456, "validDurationSecs": 36468.602, "totalHrs": 10.65, "validHrs": 10.13 }, "eu": { "buckets": { "dev": 14804, "test": 14804, "train": 134190, "validated": 311983, "invalidated": 50883, "other": 102015 }, "duration": 2559683811, "reportedSentences": 1279, "validatedSentences": 1018163, "unvalidatedSentences": 12680, "clips": 464881, "splits": { "accent": {}, "age": { "": 0.45, "twenties": 0.17, "thirties": 0.06, "teens": 0.02, "fourties": 0.16, "fifties": 0.11, "sixties": 0.04, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.47, "male_masculine": 0.25, "female_feminine": 0.29, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0, "intersex": 0 }, "sentence_domain": { "": 443641, "agriculture_food": 7, "automotive_transport": 6, 
"finance": 4, "service_retail": 7, "general": 21307, "healthcare": 22, "history_law_government": 0, "language_fundamentals": 2, "media_entertainment": 6, "nature_environment": 6, "news_current_affairs": 0, "technology_robotics": 23 } }, "users": 11045, "size": 15692713908, "checksum": "594a7b558c3d9c11899e8f13f07e28783ea7085fc39f3dbe6613c6a8b7715874", "avgDurationSecs": 5.506, "validDurationSecs": 1717811.299, "totalHrs": 711.02, "validHrs": 477.16 }, "ewo": { "buckets": { "dev": 288, "test": 288, "train": 307, "validated": 7571, "invalidated": 461, "other": 102 }, "duration": 71308908, "reportedSentences": 0, "validatedSentences": 883, "unvalidatedSentences": -1, "clips": 8134, "splits": { "accent": {}, "age": { "": 0.94, "twenties": 0.01, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0.05, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.99, "male_masculine": 0, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8134, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 31, "size": 410898056, "checksum": "cdad8bdbf48360d44a20754b4e4ba8aa7ae002f009fe5a8f466a61b7bdea08c4", "avgDurationSecs": 8.767, "validDurationSecs": 66373.216, "totalHrs": 19.8, "validHrs": 18.43 }, "fa": { "buckets": { "dev": 10712, "test": 10712, "train": 30066, "validated": 340117, "invalidated": 15412, "other": 34605 }, "duration": 1543007888, "reportedSentences": 2810, "validatedSentences": 57763, "unvalidatedSentences": 283344, "clips": 390134, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.33, "thirties": 0.35, "teens": 0.03, "fourties": 0.02, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.23, 
"male_masculine": 0.7, "female_feminine": 0.06, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 390103, "agriculture_food": 0, "automotive_transport": 3, "finance": 0, "service_retail": 0, "general": 27, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 1, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 4639, "size": 11134521257, "checksum": "543fe1231d5c72b76919906b452f77fdb2f15c8fd60779fe2254030dc44ce656", "avgDurationSecs": 3.955, "validDurationSecs": 1345187.074, "totalHrs": 428.61, "validHrs": 373.66 }, "fan": { "buckets": { "dev": 327, "test": 328, "train": 349, "validated": 7706, "invalidated": 82, "other": 346 }, "duration": 42145596, "reportedSentences": 0, "validatedSentences": 1005, "unvalidatedSentences": -1, "clips": 8134, "splits": { "accent": {}, "age": { "": 0.89, "twenties": 0, "thirties": 0, "teens": 0.01, "fourties": 0.1, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8134, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 43, "size": 247401451, "checksum": "65d9feb954bc8209ec5dd994ae7488f94e6ec8f732c0e6cd7a4ecee136edc7ff", "avgDurationSecs": 5.181, "validDurationSecs": 39927.952, "totalHrs": 11.7, "validHrs": 11.09 }, "fi": { "buckets": { "dev": 1812, "test": 1796, "train": 2069, "validated": 12196, "invalidated": 426, "other": 5248 }, "duration": 82683723, "reportedSentences": 78, "validatedSentences": 5865, "unvalidatedSentences": 49163, "clips": 17870, "splits": { "accent": {}, "age": { "": 0.33, 
"twenties": 0.16, "thirties": 0.17, "teens": 0.01, "fourties": 0.3, "fifties": 0.03, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.34, "male_masculine": 0.36, "female_feminine": 0.3, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 17870, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 248, "size": 493606591, "checksum": "77756bdc318c2daabdc0abde96cd47db55bd0b29029e0044c6cdf97de763df5d", "avgDurationSecs": 4.627, "validDurationSecs": 56430.369, "totalHrs": 22.96, "validHrs": 15.67 }, "fmp": { "buckets": { "dev": 326, "test": 327, "train": 351, "validated": 5206, "invalidated": 293, "other": 105 }, "duration": 44521524, "reportedSentences": 5, "validatedSentences": 1004, "unvalidatedSentences": -1, "clips": 5604, "splits": { "accent": {}, "age": { "": 0.74, "twenties": 0.2, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0.06, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5604, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 32, "size": 255854248, "checksum": "75f0d4b381a4adda3ccea5c8ff326454202b0f61cbc62ac92873aec398ab20c3", "avgDurationSecs": 7.945, "validDurationSecs": 41359.574, "totalHrs": 12.36, "validHrs": 11.48 }, "fr": { "buckets": { "dev": 16196, "test": 16196, "train": 605822, "validated": 775562, "invalidated": 67557, "other": 13071 }, 
"duration": 4311619534, "reportedSentences": 7619, "validatedSentences": 1649084, "unvalidatedSentences": 43764, "clips": 856190, "splits": { "accent": {}, "age": { "": 0.38, "twenties": 0.17, "thirties": 0.15, "teens": 0.03, "fourties": 0.14, "fifties": 0.09, "sixties": 0.03, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0.57, "female_feminine": 0.11, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 856083, "agriculture_food": 0, "automotive_transport": 1, "finance": 1, "service_retail": 0, "general": 70, "healthcare": 5, "history_law_government": 19, "language_fundamentals": 7, "media_entertainment": 16, "nature_environment": 8, "news_current_affairs": 2, "technology_robotics": 18 } }, "users": 20926, "size": 30119183507, "checksum": "ab1903e88eb7eab4c27804f34937e1d2f880cc1cb2a4bdebb06f77325042a2ee", "avgDurationSecs": 5.036, "validDurationSecs": 3905591.363, "totalHrs": 1197.67, "validHrs": 1084.88 }, "fub": { "buckets": { "dev": 277, "test": 345, "train": 445, "validated": 7686, "invalidated": 30, "other": 154 }, "duration": 48138012, "reportedSentences": 0, "validatedSentences": 1067, "unvalidatedSentences": -1, "clips": 7870, "splits": { "accent": {}, "age": { "": 0.98, "twenties": 0.01, "thirties": 0, "teens": 0.01, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.99, "male_masculine": 0, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7870, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 18, "size": 263109466, "checksum": "44fb5c99b7dc56bb390cffc93e94c9d92f4b7bb0e3164dcc07a1fbbe89737103", "avgDurationSecs": 6.117, 
"validDurationSecs": 47012.549, "totalHrs": 13.37, "validHrs": 13.05 }, "fue": { "buckets": { "dev": 0, "test": 88, "train": 900, "validated": 7146, "invalidated": 11, "other": 108 }, "duration": 38734056, "reportedSentences": 0, "validatedSentences": 988, "unvalidatedSentences": -1, "clips": 7265, "splits": { "accent": {}, "age": { "": 0.86, "twenties": 0, "thirties": 0.14, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7265, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 10, "size": 221067092, "checksum": "860c7d976b474b04331c7daade001693e9c1761ba19636b265771a91a36cfbf5", "avgDurationSecs": 5.332, "validDurationSecs": 38099.596, "totalHrs": 10.75, "validHrs": 10.58 }, "fy-NL": { "buckets": { "dev": 3173, "test": 3173, "train": 3927, "validated": 52227, "invalidated": 4054, "other": 103297 }, "duration": 774347499, "reportedSentences": 553, "validatedSentences": 10902, "unvalidatedSentences": 15, "clips": 159578, "splits": { "accent": {}, "age": { "": 0.56, "twenties": 0.04, "thirties": 0.1, "teens": 0, "fourties": 0.07, "fifties": 0.1, "sixties": 0.12, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.57, "male_masculine": 0.13, "female_feminine": 0.3, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 157895, "agriculture_food": 1216, "automotive_transport": 0, "finance": 207, "service_retail": 0, "general": 467, "healthcare": 0, "history_law_government": 1148, "language_fundamentals": 288, "media_entertainment": 1433, "nature_environment": 3, "news_current_affairs": 223, 
"technology_robotics": 25 } }, "users": 2095, "size": 4641549420, "checksum": "73a493e91ec5f4915064327c9e80463d3f2cfe138e563e9f8939c0f2db8a8784", "avgDurationSecs": 4.852, "validDurationSecs": 253429.964, "totalHrs": 215.09, "validHrs": 70.39 }, "ga-IE": { "buckets": { "dev": 849, "test": 851, "train": 899, "validated": 13303, "invalidated": 1300, "other": 647 }, "duration": 58143231, "reportedSentences": 42, "validatedSentences": 4883, "unvalidatedSentences": 1470, "clips": 15250, "splits": { "accent": {}, "age": { "": 0.3, "twenties": 0.35, "thirties": 0.24, "teens": 0.01, "fourties": 0.05, "fifties": 0.04, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0.34, "female_feminine": 0.33, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 14992, "agriculture_food": 4, "automotive_transport": 5, "finance": 1, "service_retail": 5, "general": 289, "healthcare": 2, "history_law_government": 8, "language_fundamentals": 0, "media_entertainment": 6, "nature_environment": 4, "news_current_affairs": 4, "technology_robotics": 3 } }, "users": 269, "size": 374564231, "checksum": "c168b4ac1d65c39d3ab6929fc367b4d538be48bc00a5b11a673d2f2c3cdebbb0", "avgDurationSecs": 3.813, "validDurationSecs": 50719.961, "totalHrs": 16.15, "validHrs": 14.08 }, "gej": { "buckets": { "dev": 949, "test": 950, "train": 1289, "validated": 16413, "invalidated": 334, "other": 27 }, "duration": 41018184, "reportedSentences": 0, "validatedSentences": 3188, "unvalidatedSentences": -1, "clips": 16774, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 16774, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 
0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 20, "size": 225917577, "checksum": "1f7870efd16f99741bcd905e464244ce381d3ce0eeaa6e7f490e0c67580d6515", "avgDurationSecs": 2.445, "validDurationSecs": 40135.415, "totalHrs": 11.39, "validHrs": 11.14 }, "ggg": { "buckets": { "dev": 0, "test": 318, "train": 1687, "validated": 6684, "invalidated": 3, "other": 4393 }, "duration": 44190504, "reportedSentences": 0, "validatedSentences": 2005, "unvalidatedSentences": -1, "clips": 11080, "splits": { "accent": {}, "age": { "": 0.3, "twenties": 0.19, "thirties": 0.33, "teens": 0, "fourties": 0.18, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 11080, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 24, "size": 258835300, "checksum": "bc9718ed0f4bea207db1bd5d310fb95b21a43120398ab6d78cf2b6800da7a769", "avgDurationSecs": 3.988, "validDurationSecs": 26657.882, "totalHrs": 12.27, "validHrs": 7.4 }, "gid": { "buckets": { "dev": 286, "test": 302, "train": 329, "validated": 5111, "invalidated": 208, "other": 0 }, "duration": 37833048, "reportedSentences": 0, "validatedSentences": 917, "unvalidatedSentences": -1, "clips": 5319, "splits": { "accent": {}, "age": { "": 0.99, "twenties": 0, "thirties": 0.01, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, 
"sentence_domain": { "": 5319, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 16, "size": 216320607, "checksum": "4462a0507beddd3b261cc280cf31eab8ad4f2923c78c0f14d155f25c3eed3186", "avgDurationSecs": 7.113, "validDurationSecs": 36353.583, "totalHrs": 10.5, "validHrs": 10.09 }, "gig": { "buckets": { "dev": 0, "test": 135, "train": 1869, "validated": 12130, "invalidated": 6, "other": 23 }, "duration": 36376452, "reportedSentences": 0, "validatedSentences": 2005, "unvalidatedSentences": -1, "clips": 12159, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.67, "thirties": 0.16, "teens": 0, "fourties": 0, "fifties": 0.16, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 12159, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 20, "size": 212046431, "checksum": "e1e3433e86c377a60c22f15c4c9829d681e462b17ecef43d8ad274b69f27851d", "avgDurationSecs": 2.992, "validDurationSecs": 36289.692, "totalHrs": 10.1, "validHrs": 10.08 }, "giz": { "buckets": { "dev": 308, "test": 277, "train": 406, "validated": 6494, "invalidated": 149, "other": 5 }, "duration": 37043316, "reportedSentences": 14, "validatedSentences": 991, "unvalidatedSentences": -1, "clips": 6648, "splits": { "accent": {}, "age": { "": 0.89, "twenties": 0.03, "thirties": 0.07, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, 
"gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6648, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 32, "size": 216330762, "checksum": "614987927289d3380aa526d5aa2643d891d914399155067d0a357e272de313ee", "avgDurationSecs": 5.572, "validDurationSecs": 36185.213, "totalHrs": 10.28, "validHrs": 10.05 }, "gjk": { "buckets": { "dev": 570, "test": 623, "train": 808, "validated": 8548, "invalidated": 168, "other": 10 }, "duration": 39656844, "reportedSentences": 0, "validatedSentences": 2004, "unvalidatedSentences": -1, "clips": 8726, "splits": { "accent": {}, "age": { "": 0.02, "twenties": 0.12, "thirties": 0.59, "teens": 0.04, "fourties": 0.23, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.74, "male_masculine": 0, "female_feminine": 0.26, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8726, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 23, "size": 230571438, "checksum": "e992d46d525e358225180418fa277c0f0f517ab5f8687d9974e522c4cfd3ab59", "avgDurationSecs": 4.545, "validDurationSecs": 38847.892, "totalHrs": 11.01, "validHrs": 10.79 }, "gju": { "buckets": { "dev": 0, "test": 626, "train": 3204, "validated": 11076, "invalidated": 155, "other": 510 }, "duration": 38540952, "reportedSentences": 0, "validatedSentences": 3852, "unvalidatedSentences": 2, "clips": 11741, "splits": { "accent": {}, "age": { "": 0.22, 
"twenties": 0, "thirties": 0.01, "teens": 0, "fourties": 0.77, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 11738, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 3, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 6, "size": 220134491, "checksum": "073b80d594a9faf861e0ca4d07abf885d01a933845f5fdd4df4065714f8304e1", "avgDurationSecs": 3.283, "validDurationSecs": 36358.026, "totalHrs": 10.7, "validHrs": 10.09 }, "gl": { "buckets": { "dev": 14970, "test": 14969, "train": 149131, "validated": 183801, "invalidated": 25176, "other": 60920 }, "duration": 1350675288, "reportedSentences": 872, "validatedSentences": 1001730, "unvalidatedSentences": 3827, "clips": 269897, "splits": { "accent": {}, "age": { "": 0.54, "twenties": 0.16, "thirties": 0.07, "teens": 0.02, "fourties": 0.09, "fifties": 0.1, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.64, "male_masculine": 0.13, "female_feminine": 0.22, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.01 }, "sentence_domain": { "": 269452, "agriculture_food": 33, "automotive_transport": 4, "finance": 40, "service_retail": 17, "general": 264, "healthcare": 3, "history_law_government": 52, "language_fundamentals": 19, "media_entertainment": 41, "nature_environment": 15, "news_current_affairs": 14, "technology_robotics": 5 } }, "users": 6397, "size": 7840048937, "checksum": "e33bf85d88497fa1ed5595210c9a251326c1b84df1f59f2de31ac655518c3093", "avgDurationSecs": 5.004, "validDurationSecs": 919815.591, "totalHrs": 375.18, "validHrs": 255.5 }, "gn": { "buckets": { "dev": 661, "test": 1105, "train": 1720, "validated": 4029, 
"invalidated": 220, "other": 21610 }, "duration": 119240820, "reportedSentences": 52, "validatedSentences": 5894, "unvalidatedSentences": 301, "clips": 25859, "splits": { "accent": {}, "age": { "": 0.19, "twenties": 0.54, "thirties": 0.17, "teens": 0.02, "fourties": 0.08, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2, "male_masculine": 0.33, "female_feminine": 0.46, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 25859, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 313, "size": 683967749, "checksum": "c97bc800ba00c3736e80658c681027d90edde530882544768fc9e4a4276e4249", "avgDurationSecs": 4.611, "validDurationSecs": 18578.494, "totalHrs": 33.12, "validHrs": 5.16 }, "gsw": { "buckets": { "dev": 17, "test": 18, "train": 24, "validated": 228, "invalidated": 36, "other": 437 }, "duration": 4008276, "reportedSentences": 2, "validatedSentences": 64, "unvalidatedSentences": 19, "clips": 701, "splits": { "accent": {}, "age": { "": 0.17, "twenties": 0.01, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0.2, "sixties": 0.52, "seventies": 0.1, "eighties": 0.01, "nineties": 0 }, "gender": { "": 0.51, "male_masculine": 0, "female_feminine": 0.49, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 0, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 5, "general": 27, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 674, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 24, "size": 22386915, "checksum": "1a5f38f432bf02d93bcb7d100c8cb82784229d404e92e985e42a0c81e49769c4", "avgDurationSecs": 5.718, 
"validDurationSecs": 1303.69, "totalHrs": 1.11, "validHrs": 0.36 }, "gv": { "buckets": { "dev": 1166, "test": 525, "train": 2420, "validated": 6302, "invalidated": 151, "other": 137 }, "duration": 38105820, "reportedSentences": 267, "validatedSentences": 4219, "unvalidatedSentences": 21, "clips": 6590, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.54, "thirties": 0.04, "teens": 0, "fourties": 0.39, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.46, "male_masculine": 0.5, "female_feminine": 0.04, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6590, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 9, "size": 224868436, "checksum": "a0e4c22483d334c9b745f0cf56dad8e4395a65a67b52c73a3a63ca5f0bb4e5b8", "avgDurationSecs": 5.782, "validDurationSecs": 36440.497, "totalHrs": 10.58, "validHrs": 10.12 }, "gwc": { "buckets": { "dev": 776, "test": 918, "train": 3268, "validated": 7421, "invalidated": 683, "other": 1699 }, "duration": 57059136, "reportedSentences": 1, "validatedSentences": 5574, "unvalidatedSentences": -1, "clips": 9803, "splits": { "accent": {}, "age": { "": 0.03, "twenties": 0.34, "thirties": 0, "teens": 0, "fourties": 0.48, "fifties": 0.14, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 9803, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } 
}, "users": 22, "size": 335348506, "checksum": "efd083b343f3f52d11b21a749e837b2d39afc35abf3f125ba59dfba024a1f14a", "avgDurationSecs": 5.821, "validDurationSecs": 43194.517, "totalHrs": 15.84, "validHrs": 11.99 }, "gwt": { "buckets": { "dev": 0, "test": 220, "train": 3136, "validated": 7681, "invalidated": 57, "other": 0 }, "duration": 44273880, "reportedSentences": 0, "validatedSentences": 3719, "unvalidatedSentences": 2087, "clips": 7738, "splits": { "accent": {}, "age": { "": 0.16, "twenties": 0, "thirties": 0.4, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0.43, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7738, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 5, "size": 261938785, "checksum": "fea63dc5528a2639f0d20c17fab85fd64e20a98c6cf7ce92cd0dcd56c269dbd8", "avgDurationSecs": 5.722, "validDurationSecs": 43947.748, "totalHrs": 12.29, "validHrs": 12.2 }, "gya": { "buckets": { "dev": 346, "test": 346, "train": 362, "validated": 6914, "invalidated": 491, "other": 0 }, "duration": 38369700, "reportedSentences": 1, "validatedSentences": 1054, "unvalidatedSentences": -1, "clips": 7405, "splits": { "accent": {}, "age": { "": 0.86, "twenties": 0, "thirties": 0.14, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7405, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, 
"language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 31, "size": 221180027, "checksum": "8676f1ddb77395ec0d4900e61dce291a8f3f28ed7b9909d3c37ad75022b751ce", "avgDurationSecs": 5.182, "validDurationSecs": 35825.538, "totalHrs": 10.65, "validHrs": 9.95 }, "ha": { "buckets": { "dev": 634, "test": 783, "train": 1901, "validated": 3496, "invalidated": 175, "other": 6776 }, "duration": 45455724, "reportedSentences": 34, "validatedSentences": 5450, "unvalidatedSentences": 240, "clips": 10447, "splits": { "accent": {}, "age": { "": 0.19, "twenties": 0.13, "thirties": 0.65, "teens": 0, "fourties": 0, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.19, "male_masculine": 0.57, "female_feminine": 0.24, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 10447, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 60, "size": 266037995, "checksum": "21b27ecfa4ed161da6007d1900acbdea2216a6553a33aa53431733dc2e9d07ff", "avgDurationSecs": 4.351, "validDurationSecs": 15211.373, "totalHrs": 12.62, "validHrs": 4.22 }, "haz": { "buckets": { "dev": 86, "test": 446, "train": 823, "validated": 8319, "invalidated": 0, "other": 1 }, "duration": 37900836, "reportedSentences": 29, "validatedSentences": 1361, "unvalidatedSentences": -1, "clips": 8320, "splits": { "accent": {}, "age": { "": 0.35, "twenties": 0.32, "thirties": 0.33, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8320, 
"agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 8, "size": 183552507, "checksum": "b5944f1622600c6474930fb651ba6cedb8ef1efd32948fe49d7a6b2710e8226b", "avgDurationSecs": 4.555, "validDurationSecs": 37896.281, "totalHrs": 10.52, "validHrs": 10.52 }, "he": { "buckets": { "dev": 394, "test": 910, "train": 1870, "validated": 4059, "invalidated": 346, "other": 1086 }, "duration": 25219404, "reportedSentences": 43, "validatedSentences": 4188, "unvalidatedSentences": 6836, "clips": 5491, "splits": { "accent": {}, "age": { "": 0.07, "twenties": 0.72, "thirties": 0.16, "teens": 0, "fourties": 0.02, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0.04, "nineties": 0 }, "gender": { "": 0.07, "male_masculine": 0.91, "female_feminine": 0.02, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5490, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 1, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 55, "size": 146911811, "checksum": "2a91a9c2d958efb9182c7077b343d5f049c99eb8cd9bc011487569e0455e922c", "avgDurationSecs": 4.593, "validDurationSecs": 18642.426, "totalHrs": 7, "validHrs": 5.17 }, "hem": { "buckets": { "dev": 337, "test": 337, "train": 345, "validated": 5974, "invalidated": 470, "other": 0 }, "duration": 38994660, "reportedSentences": 2, "validatedSentences": 1020, "unvalidatedSentences": 2, "clips": 6444, "splits": { "accent": {}, "age": { "": 0.12, "twenties": 0.47, "thirties": 0.27, "teens": 0, "fourties": 0.13, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 
0.77, "male_masculine": 0.03, "female_feminine": 0.2, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6444, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 41, "size": 211316608, "checksum": "04ea4c4c436e486e1023f492b172baa9839cf466e971f904289cef967fbc8eb9", "avgDurationSecs": 6.051, "validDurationSecs": 36150.543, "totalHrs": 10.83, "validHrs": 10.04 }, "hi": { "buckets": { "dev": 2702, "test": 3360, "train": 4876, "validated": 11017, "invalidated": 911, "other": 6553 }, "duration": 92714028, "reportedSentences": 146, "validatedSentences": 32204, "unvalidatedSentences": 9965, "clips": 18481, "splits": { "accent": {}, "age": { "": 0.29, "twenties": 0.28, "thirties": 0.33, "teens": 0.01, "fourties": 0.07, "fifties": 0.01, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.45, "male_masculine": 0.51, "female_feminine": 0.03, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 18481, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 471, "size": 549742692, "checksum": "2cfba22d28fec767bc8b786da9e22966cc9a703676854d47723c7c0c482518d7", "avgDurationSecs": 5.017, "validDurationSecs": 55269.22, "totalHrs": 25.75, "validHrs": 15.35 }, "hno": { "buckets": { "dev": 765, "test": 708, "train": 863, "validated": 9120, "invalidated": 282, "other": 5 }, "duration": 37708020, "reportedSentences": 21, "validatedSentences": 2349, "unvalidatedSentences": -1, "clips": 9407, "splits": { "accent": {}, "age": { 
"": 0.09, "twenties": 0.66, "thirties": 0.23, "teens": 0.02, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.52, "male_masculine": 0, "female_feminine": 0.48, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6162, "agriculture_food": 5816, "automotive_transport": 0, "finance": 3245, "service_retail": 0, "general": 2574, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 5816, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 36, "size": 216280482, "checksum": "999869fa27bf0aaa04c4ba1a44702e1f5c2105bf7644f1d3e912205b7de54861", "avgDurationSecs": 4.009, "validDurationSecs": 36557.579, "totalHrs": 10.47, "validHrs": 10.15 }, "hsb": { "buckets": { "dev": 270, "test": 405, "train": 811, "validated": 1486, "invalidated": 248, "other": 323 }, "duration": 13102884, "reportedSentences": 130, "validatedSentences": 7182, "unvalidatedSentences": 26, "clips": 2057, "splits": { "accent": {}, "age": { "": 0.21, "twenties": 0.1, "thirties": 0.09, "teens": 0.07, "fourties": 0.46, "fifties": 0.03, "sixties": 0.02, "seventies": 0.03, "eighties": 0, "nineties": 0 }, "gender": { "": 0.25, "male_masculine": 0.73, "female_feminine": 0.02, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2057, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 32, "size": 97504584, "checksum": "c11d0d6356bf0c886d1103e02627a66533de4069678e7b52950a1cb176e9199f", "avgDurationSecs": 6.37, "validDurationSecs": 9465.671, "totalHrs": 3.63, "validHrs": 2.62 }, "ht": { "buckets": { "dev": 6, "test": 5, "train": 13, "validated": 27, "invalidated": 1, "other": 2 
}, "duration": 166968, "reportedSentences": 0, "validatedSentences": 76, "unvalidatedSentences": 1629, "clips": 30, "splits": { "accent": {}, "age": { "": 0.67, "twenties": 0, "thirties": 0.33, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 30, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 3, "size": 1091474, "checksum": "0d10797fa8f3be5168535495be6b34ba4ee583800305d4f7430ee223ebb97f37", "avgDurationSecs": 5.566, "validDurationSecs": 150.271, "totalHrs": 0.04, "validHrs": 0.04 }, "hu": { "buckets": { "dev": 12027, "test": 12066, "train": 44352, "validated": 68975, "invalidated": 3473, "other": 45019 }, "duration": 650828773, "reportedSentences": 554, "validatedSentences": 326397, "unvalidatedSentences": 226, "clips": 117467, "splits": { "accent": {}, "age": { "": 0.28, "twenties": 0.11, "thirties": 0.13, "teens": 0.02, "fourties": 0.11, "fifties": 0.32, "sixties": 0.01, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male_masculine": 0.23, "female_feminine": 0.46, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 117394, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 64, "healthcare": 0, "history_law_government": 19, "language_fundamentals": 0, "media_entertainment": 2, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 9 } }, "users": 1675, "size": 3828056509, "checksum": "3efa8e3df2c1e27dbbddc7fc84ed4b4765d7a8b88d57080b45bf90fcf62e36ea", "avgDurationSecs": 5.541, "validDurationSecs": 
382157.667, "totalHrs": 180.78, "validHrs": 106.15 }, "hux": { "buckets": { "dev": 25, "test": 326, "train": 637, "validated": 9285, "invalidated": 1322, "other": 1 }, "duration": 41113224, "reportedSentences": 0, "validatedSentences": 995, "unvalidatedSentences": -1, "clips": 10608, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.19, "thirties": 0.09, "teens": 0, "fourties": 0.26, "fifties": 0.37, "sixties": 0.09, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male_masculine": 0, "female_feminine": 0.63, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 10608, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 13, "size": 240800853, "checksum": "0cdadf2e8fa73680e4b4afcffef020366a8c6cac4707bf41fdea230f2ae1d35b", "avgDurationSecs": 3.876, "validDurationSecs": 35985.698, "totalHrs": 11.42, "validHrs": 9.99 }, "hy-AM": { "buckets": { "dev": 5874, "test": 5841, "train": 9314, "validated": 21244, "invalidated": 1221, "other": 15123 }, "duration": 202675896, "reportedSentences": 183, "validatedSentences": 237419, "unvalidatedSentences": 1338, "clips": 37588, "splits": { "accent": {}, "age": { "": 0.14, "twenties": 0.63, "thirties": 0.08, "teens": 0.11, "fourties": 0.04, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2, "male_masculine": 0.25, "female_feminine": 0.55, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 37586, "agriculture_food": 0, "automotive_transport": 0, "finance": 1, "service_retail": 0, "general": 1, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, 
"technology_robotics": 0 } }, "users": 441, "size": 1192289812, "checksum": "3469b9deae30a757883b45812c7ed35b69c79187ece74fae9cf2cac3e7fa2010", "avgDurationSecs": 5.392, "validDurationSecs": 114548.439, "totalHrs": 56.29, "validHrs": 31.81 }, "ia": { "buckets": { "dev": 1869, "test": 1865, "train": 4903, "validated": 12077, "invalidated": 352, "other": 2341 }, "duration": 62070024, "reportedSentences": 279, "validatedSentences": 9069, "unvalidatedSentences": 134, "clips": 14770, "splits": { "accent": {}, "age": { "": 0.38, "twenties": 0.05, "thirties": 0.02, "teens": 0, "fourties": 0.29, "fifties": 0.03, "sixties": 0, "seventies": 0.22, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male_masculine": 0.61, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 14770, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 72, "size": 420459353, "checksum": "01fe681194c6779401319a9a312889e9d53d27f4f284fbb239286a5227b30951", "avgDurationSecs": 4.202, "validDurationSecs": 50752.856, "totalHrs": 17.24, "validHrs": 14.09 }, "ibb": { "buckets": { "dev": 319, "test": 307, "train": 372, "validated": 3106, "invalidated": 118, "other": 4238 }, "duration": 96213852, "reportedSentences": 0, "validatedSentences": 1031, "unvalidatedSentences": -1, "clips": 7462, "splits": { "accent": {}, "age": { "": 0.99, "twenties": 0, "thirties": 0.01, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.97, "male_masculine": 0, "female_feminine": 0.03, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7462, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, 
"general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 19, "size": 561149314, "checksum": "d14ec0f9ac2a6f5cbc9455a2d7130ebc0d0935030aa354c89ef2295c65b255e9", "avgDurationSecs": 12.894, "validDurationSecs": 40048.274, "totalHrs": 26.72, "validHrs": 11.12 }, "id": { "buckets": { "dev": 3469, "test": 3691, "train": 4973, "validated": 30256, "invalidated": 2795, "other": 26500 }, "duration": 238450224, "reportedSentences": 363, "validatedSentences": 13318, "unvalidatedSentences": 11864, "clips": 59551, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.41, "thirties": 0.1, "teens": 0.22, "fourties": 0.02, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male_masculine": 0.44, "female_feminine": 0.26, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0, "intersex": 0 }, "sentence_domain": { "": 59551, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 653, "size": 1498957544, "checksum": "abaa6f144cfd71d2144303b2019b4156672e39b10b8e4b0a9555471abd373e3e", "avgDurationSecs": 4.004, "validDurationSecs": 121149.099, "totalHrs": 66.23, "validHrs": 33.65 }, "ig": { "buckets": { "dev": 87, "test": 89, "train": 92, "validated": 269, "invalidated": 138, "other": 8295 }, "duration": 47624184, "reportedSentences": 16, "validatedSentences": 15225, "unvalidatedSentences": 3122, "clips": 8702, "splits": { "accent": {}, "age": { "": 0.39, "twenties": 0.33, "thirties": 0.17, "teens": 0.07, "fourties": 0.01, "fifties": 0, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.47, "male_masculine": 0.09, 
"female_feminine": 0.44, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8700, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 1, "language_fundamentals": 1, "media_entertainment": 1, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 159, "size": 279448506, "checksum": "cb99fab21b97ef36c43052b60adba30d81daba5ba44a0d15bd027af70249457c", "avgDurationSecs": 5.473, "validDurationSecs": 1472.179, "totalHrs": 13.22, "validHrs": 0.4 }, "ipk": { "buckets": { "dev": 0, "test": 160, "train": 3104, "validated": 3264, "invalidated": 0, "other": 0 }, "duration": 25908516, "reportedSentences": 0, "validatedSentences": 4386, "unvalidatedSentences": 27, "clips": 3264, "splits": { "accent": {}, "age": { "": 0.05, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0.95, "nineties": 0 }, "gender": { "": 0.05, "male_masculine": 0, "female_feminine": 0.95, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 3264, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 2, "size": 153440333, "checksum": "de70e5900a5aa4d1a8debeaa2ab8b81f41f2c0b496442b0b62eefa6ac1db3c40", "avgDurationSecs": 7.938, "validDurationSecs": 25908.516, "totalHrs": 7.19, "validHrs": 7.19 }, "is": { "buckets": { "dev": 0, "test": 35, "train": 73, "validated": 108, "invalidated": 15, "other": 291 }, "duration": 2660040, "reportedSentences": 1, "validatedSentences": 5083, "unvalidatedSentences": 113, "clips": 414, "splits": { "accent": {}, "age": { "": 0.37, "twenties": 0, "thirties": 0.61, "teens": 0, "fourties": 0.01, 
"fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.96, "male_masculine": 0.04, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 414, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 8, "size": 16009584, "checksum": "3e207b41d6eabafe87aae769e8a92e04eae4629727c2da45a1a4b29d1f71ff54", "avgDurationSecs": 6.425, "validDurationSecs": 693.923, "totalHrs": 0.73, "validHrs": 0.19 }, "it": { "buckets": { "dev": 15181, "test": 15181, "train": 173207, "validated": 240195, "invalidated": 20772, "other": 19030 }, "duration": 1520567196, "reportedSentences": 5983, "validatedSentences": 925062, "unvalidatedSentences": 971, "clips": 279997, "splits": { "accent": {}, "age": { "": 0.27, "twenties": 0.2, "thirties": 0.16, "teens": 0.01, "fourties": 0.12, "fifties": 0.14, "sixties": 0.1, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.28, "male_masculine": 0.62, "female_feminine": 0.1, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 279981, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 1, "general": 11, "healthcare": 1, "history_law_government": 5, "language_fundamentals": 3, "media_entertainment": 1, "nature_environment": 3, "news_current_affairs": 1, "technology_robotics": 1 } }, "users": 7319, "size": 10333034887, "checksum": "1371f22127b7ff142c10084ef36278631dd50c39564254729e568d9a19782ec5", "avgDurationSecs": 5.431, "validDurationSecs": 1304416.253, "totalHrs": 422.37, "validHrs": 362.33 }, "ja": { "buckets": { "dev": 9019, "test": 9019, "train": 19694, "validated": 280477, "invalidated": 52382, "other": 225140 }, "duration": 2546437569, 
"reportedSentences": 793, "validatedSentences": 42286, "unvalidatedSentences": 8231, "clips": 557999, "splits": { "accent": {}, "age": { "": 0.17, "twenties": 0.55, "thirties": 0.07, "teens": 0.08, "fourties": 0.08, "fifties": 0.04, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0.28, "female_feminine": 0.39, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.01, "intersex": 0 }, "sentence_domain": { "": 510070, "agriculture_food": 8157, "automotive_transport": 1910, "finance": 2005, "service_retail": 400, "general": 21766, "healthcare": 2268, "history_law_government": 4518, "language_fundamentals": 3554, "media_entertainment": 4770, "nature_environment": 2540, "news_current_affairs": 971, "technology_robotics": 643 } }, "users": 7498, "size": 14977274557, "checksum": "85c714c97705123d0abbc0bd4715ceefcd1388b9ce6428bc0aa1cfd635754dd1", "avgDurationSecs": 4.564, "validDurationSecs": 1279961.38, "totalHrs": 707.34, "validHrs": 355.54 }, "jgo": { "buckets": { "dev": 336, "test": 336, "train": 362, "validated": 6080, "invalidated": 89, "other": 20 }, "duration": 41432724, "reportedSentences": 0, "validatedSentences": 1035, "unvalidatedSentences": 2, "clips": 6189, "splits": { "accent": {}, "age": { "": 0.44, "twenties": 0.11, "thirties": 0.15, "teens": 0, "fourties": 0.3, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.67, "male_masculine": 0, "female_feminine": 0.33, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6188, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 1, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 1, "nature_environment": 1, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 16, "size": 227442147, "checksum": "af72614e0dc3c4cc3680a734f56267036c6bcd96142f7be1badbb4257a49627f", 
"avgDurationSecs": 6.695, "validDurationSecs": 40703.015, "totalHrs": 11.5, "validHrs": 11.3 }, "jqr": { "buckets": { "dev": 323, "test": 273, "train": 403, "validated": 6060, "invalidated": 116, "other": 79 }, "duration": 36995796, "reportedSentences": 0, "validatedSentences": 999, "unvalidatedSentences": -1, "clips": 6255, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0.6, "fifties": 0.16, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.48, "male_masculine": 0, "female_feminine": 0.52, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6255, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 12, "size": 215086389, "checksum": "196799a74d3b72510f615fd7ae28df31eb8d189ada4ed46e0e9bc768abd25b12", "avgDurationSecs": 5.915, "validDurationSecs": 35842.45, "totalHrs": 10.27, "validHrs": 9.95 }, "ka": { "buckets": { "dev": 13047, "test": 13116, "train": 63056, "validated": 117702, "invalidated": 6766, "other": 103078 }, "duration": 1184559984, "reportedSentences": 1101, "validatedSentences": 132626, "unvalidatedSentences": 1891, "clips": 227546, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0.22, "thirties": 0.33, "teens": 0.11, "fourties": 0.07, "fifties": 0.04, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.29, "male_masculine": 0.17, "female_feminine": 0.54, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 226990, "agriculture_food": 9, "automotive_transport": 10, "finance": 4, "service_retail": 9, "general": 256, "healthcare": 10, "history_law_government": 202, "language_fundamentals": 32, "media_entertainment": 23, 
"nature_environment": 17, "news_current_affairs": 10, "technology_robotics": 3 } }, "users": 2759, "size": 6926976886, "checksum": "c5af491be4210fdda49378bfe330bced7a26c0d4250a15826947b93610146a4c", "avgDurationSecs": 5.206, "validDurationSecs": 612733.598, "totalHrs": 329.04, "validHrs": 170.2 }, "kab": { "buckets": { "dev": 15003, "test": 15003, "train": 152504, "validated": 614748, "invalidated": 20076, "other": 112425 }, "duration": 2497072912, "reportedSentences": 9747, "validatedSentences": 182804, "unvalidatedSentences": 205837, "clips": 747249, "splits": { "accent": {}, "age": { "": 0.28, "twenties": 0.12, "thirties": 0.29, "teens": 0, "fourties": 0.09, "fifties": 0.18, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male_masculine": 0.53, "female_feminine": 0.2, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 747249, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1599, "size": 18644707031, "checksum": "e1103bdf8cc84a32f499e8d9a17d0b19f21ec2a7a9cb3d245678db2a9bc55149", "avgDurationSecs": 3.342, "validDurationSecs": 2054295.929, "totalHrs": 693.63, "validHrs": 570.63 }, "kbd": { "buckets": { "dev": 6954, "test": 6958, "train": 11996, "validated": 102361, "invalidated": 2499, "other": 7602 }, "duration": 701074224, "reportedSentences": 3359, "validatedSentences": 27620, "unvalidatedSentences": 2376, "clips": 112462, "splits": { "accent": {}, "age": { "": 0.15, "twenties": 0.19, "thirties": 0.28, "teens": 0.03, "fourties": 0.11, "fifties": 0.24, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.51, "male_masculine": 0, "female_feminine": 0.48, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, 
"sentence_domain": { "": 2728, "agriculture_food": 4, "automotive_transport": 5, "finance": 0, "service_retail": 0, "general": 121798, "healthcare": 64, "history_law_government": 245, "language_fundamentals": 16, "media_entertainment": 7, "nature_environment": 131, "news_current_affairs": 5, "technology_robotics": 0 } }, "users": 289, "size": 3912180592, "checksum": "9b163af0d093d3f5ca9b9b951e7070edb9b3390121392625ee8f58166ad31fe7", "avgDurationSecs": 6.234, "validDurationSecs": 638105.837, "totalHrs": 194.74, "validHrs": 177.25 }, "kdh": { "buckets": { "dev": 356, "test": 363, "train": 498, "validated": 13513, "invalidated": 279, "other": 496 }, "duration": 38163852, "reportedSentences": 0, "validatedSentences": 1219, "unvalidatedSentences": -1, "clips": 14288, "splits": { "accent": {}, "age": { "": 0.98, "twenties": 0, "thirties": 0.02, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 14288, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 20, "size": 201028275, "checksum": "5ea464a0bdbe326b2387f543219a0041c01542d14c73fc43f3aef755098aff3c", "avgDurationSecs": 2.671, "validDurationSecs": 36093.794, "totalHrs": 10.6, "validHrs": 10.02 }, "khw": { "buckets": { "dev": 1607, "test": 1535, "train": 2642, "validated": 8500, "invalidated": 261, "other": 1138 }, "duration": 72903024, "reportedSentences": 4, "validatedSentences": 7051, "unvalidatedSentences": 200, "clips": 9899, "splits": { "accent": {}, "age": { "": 0.07, "twenties": 0.11, "thirties": 0.26, "teens": 0, "fourties": 0.55, "fifties": 0, "sixties": 0, "seventies": 0, 
"eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 9542, "agriculture_food": 18, "automotive_transport": 53, "finance": 2, "service_retail": 0, "general": 253, "healthcare": 4, "history_law_government": 9, "language_fundamentals": 30, "media_entertainment": 4, "nature_environment": 9, "news_current_affairs": 4, "technology_robotics": 1 } }, "users": 50, "size": 429795765, "checksum": "70e00b56fb6b888e406d195247bad0983a437a5002f7d43d8eade7189d4b27cc", "avgDurationSecs": 7.365, "validDurationSecs": 62599.829, "totalHrs": 20.25, "validHrs": 17.38 }, "kk": { "buckets": { "dev": 551, "test": 573, "train": 616, "validated": 1748, "invalidated": 259, "other": 743 }, "duration": 13561848, "reportedSentences": 42, "validatedSentences": 6210, "unvalidatedSentences": 260, "clips": 2750, "splits": { "accent": {}, "age": { "": 0.55, "twenties": 0.23, "thirties": 0.05, "teens": 0.06, "fourties": 0.07, "fifties": 0.05, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.57, "male_masculine": 0.31, "female_feminine": 0.12, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2750, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 193, "size": 77693598, "checksum": "cdc3e608667382d61aaf3b729170d701c87aab9dfe830eb4d63209e0169e4afd", "avgDurationSecs": 4.932, "validDurationSecs": 8620.404, "totalHrs": 3.76, "validHrs": 2.39 }, "kln": { "buckets": { "dev": 6412, "test": 6076, "train": 11065, "validated": 32315, "invalidated": 537, "other": 37348 }, "duration": 330293556, "reportedSentences": 0, "validatedSentences": 29900, "unvalidatedSentences": 61, "clips": 
70200, "splits": { "accent": {}, "age": { "": 0.02, "twenties": 0.59, "thirties": 0.05, "teens": 0.05, "fourties": 0.29, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.27, "male_masculine": 0.03, "female_feminine": 0.7, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 4, "agriculture_food": 5, "automotive_transport": 16, "finance": 32, "service_retail": 626, "general": 68952, "healthcare": 54, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 2, "nature_environment": 23, "news_current_affairs": 2, "technology_robotics": 484 } }, "users": 41, "size": 1800892624, "checksum": "f60908bb36e29ad4f747b8875c2c85ba4e2935a197e27cd43c2dca1b71d266c9", "avgDurationSecs": 4.705, "validDurationSecs": 152043.252, "totalHrs": 91.74, "validHrs": 42.23 }, "kls": { "buckets": { "dev": 1258, "test": 1233, "train": 1415, "validated": 9774, "invalidated": 163, "other": 11 }, "duration": 37741320, "reportedSentences": 2, "validatedSentences": 3912, "unvalidatedSentences": 4, "clips": 9948, "splits": { "accent": {}, "age": { "": 0.64, "twenties": 0.06, "thirties": 0, "teens": 0, "fourties": 0.3, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 9948, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 23, "size": 215514534, "checksum": "0294f5272cd1f2757c79e4b71f25c659663f19d87840532241b86527250b7649", "avgDurationSecs": 3.794, "validDurationSecs": 37081.188, "totalHrs": 10.48, "validHrs": 10.3 }, "kmr": { "buckets": { "dev": 4125, "test": 4133, "train": 5509, 
"validated": 64397, "invalidated": 2616, "other": 24285 }, "duration": 378044424, "reportedSentences": 1078, "validatedSentences": 17001, "unvalidatedSentences": 1116, "clips": 91298, "splits": { "accent": {}, "age": { "": 0.48, "twenties": 0.35, "thirties": 0.09, "teens": 0.01, "fourties": 0.03, "fifties": 0.04, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.49, "male_masculine": 0.42, "female_feminine": 0.09, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 91284, "agriculture_food": 2, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 8, "healthcare": 0, "history_law_government": 2, "language_fundamentals": 2, "media_entertainment": 2, "nature_environment": 4, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 682, "size": 2009041404, "checksum": "de85490aed2ac9bd1059fe2f8e69c73aa1bc9f1121dca90a36052582da48a0e5", "avgDurationSecs": 4.141, "validDurationSecs": 266653.451, "totalHrs": 105.01, "validHrs": 74.07 }, "ko": { "buckets": { "dev": 504, "test": 523, "train": 658, "validated": 1686, "invalidated": 477, "other": 4857 }, "duration": 36998208, "reportedSentences": 29, "validatedSentences": 7401, "unvalidatedSentences": 2622, "clips": 7020, "splits": { "accent": {}, "age": { "": 0.18, "twenties": 0.53, "thirties": 0.2, "teens": 0.03, "fourties": 0.04, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.4, "male_masculine": 0.32, "female_feminine": 0.28, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7015, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 1, "general": 1, "healthcare": 0, "history_law_government": 1, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 2 } }, "users": 194, "size": 216225690, "checksum": 
"82ecf582beb0ea54ea81c75ef42a17ca1bc74ba4231764cadab87849e1c67f07", "avgDurationSecs": 5.27, "validDurationSecs": 8885.894, "totalHrs": 10.27, "validHrs": 2.46 }, "ksf": { "buckets": { "dev": 308, "test": 320, "train": 376, "validated": 7414, "invalidated": 305, "other": 222 }, "duration": 73970280, "reportedSentences": 0, "validatedSentences": 1004, "unvalidatedSentences": -1, "clips": 7941, "splits": { "accent": {}, "age": { "": 0.88, "twenties": 0.07, "thirties": 0.05, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.95, "male_masculine": 0, "female_feminine": 0.05, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7941, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 24, "size": 407838646, "checksum": "e5c2f4171648dd9e8a66e8023d5a9e56ea61fc49864fff82ca9516776d9f99d9", "avgDurationSecs": 9.315, "validDurationSecs": 69061.284, "totalHrs": 20.54, "validHrs": 19.18 }, "kvx": { "buckets": { "dev": 654, "test": 450, "train": 913, "validated": 7464, "invalidated": 36, "other": 57 }, "duration": 40217904, "reportedSentences": 2, "validatedSentences": 2025, "unvalidatedSentences": -1, "clips": 7557, "splits": { "accent": {}, "age": { "": 0.04, "twenties": 0.2, "thirties": 0.27, "teens": 0.28, "fourties": 0.21, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.79, "male_masculine": 0, "female_feminine": 0.21, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7557, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, 
"media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 22, "size": 230134556, "checksum": "9a96384afff97ae5082f1a0c87f9cf0b53a8330138262b0568d668872fc53679", "avgDurationSecs": 5.322, "validDurationSecs": 39722.964, "totalHrs": 11.17, "validHrs": 11.03 }, "kw": { "buckets": { "dev": 0, "test": 2401, "train": 6223, "validated": 10806, "invalidated": 402, "other": 60 }, "duration": 46633392, "reportedSentences": 43, "validatedSentences": 9609, "unvalidatedSentences": -1, "clips": 11268, "splits": { "accent": {}, "age": { "": 0.12, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0.34, "fifties": 0.47, "sixties": 0.02, "seventies": 0.05, "eighties": 0, "nineties": 0 }, "gender": { "": 0.66, "male_masculine": 0, "female_feminine": 0.34, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 0, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 12725, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 10, "size": 273350876, "checksum": "5aa9cc98cbcf49c06eb88af972ef563e727f06c8a975f0b49d93c63607e6ad6d", "avgDurationSecs": 4.139, "validDurationSecs": 44721.373, "totalHrs": 12.95, "validHrs": 12.42 }, "kxp": { "buckets": { "dev": 0, "test": 197, "train": 1881, "validated": 11644, "invalidated": 95, "other": 20 }, "duration": 40016700, "reportedSentences": 0, "validatedSentences": 2079, "unvalidatedSentences": -1, "clips": 11759, "splits": { "accent": {}, "age": { "": 0.01, "twenties": 0.28, "thirties": 0.53, "teens": 0.01, "fourties": 0.18, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 11759, "agriculture_food": 0, 
"automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 22, "size": 231110501, "checksum": "4f8988ad38bc6c0ef5dc05b8cb0b30e25ce8da6c306d3a2534718bcbeb516a7f", "avgDurationSecs": 3.403, "validDurationSecs": 39625.347, "totalHrs": 11.11, "validHrs": 11 }, "ky": { "buckets": { "dev": 1614, "test": 1615, "train": 1790, "validated": 30706, "invalidated": 5680, "other": 1460 }, "duration": 172310520, "reportedSentences": 39, "validatedSentences": 5045, "unvalidatedSentences": 242, "clips": 37846, "splits": { "accent": {}, "age": { "": 0.07, "twenties": 0.65, "thirties": 0.09, "teens": 0.17, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.12, "male_masculine": 0.54, "female_feminine": 0.34, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 37846, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 298, "size": 1108204541, "checksum": "a0f04ffe114d62699d006bf112c35d79ce7689d7b1b6683ca2fb0df7ac25e970", "avgDurationSecs": 4.553, "validDurationSecs": 139802.537, "totalHrs": 47.86, "validHrs": 38.83 }, "lg": { "buckets": { "dev": 13354, "test": 13390, "train": 71089, "validated": 271885, "invalidated": 39183, "other": 37695 }, "duration": 2017262871, "reportedSentences": 6485, "validatedSentences": 191409, "unvalidatedSentences": 179, "clips": 348763, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.38, "thirties": 0.22, "teens": 0.01, "fourties": 0.07, "fifties": 0.05, "sixties": 0.02, "seventies": 0, "eighties": 0, 
"nineties": 0 }, "gender": { "": 0.26, "male_masculine": 0.36, "female_feminine": 0.39, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 348763, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 665, "size": 11842344489, "checksum": "f60e779e1a0852718866540cd6cd09b5fd7b05cd5db2839b01708eb5c865ce7b", "avgDurationSecs": 5.784, "validDurationSecs": 1572596.622, "totalHrs": 560.35, "validHrs": 436.83 }, "lij": { "buckets": { "dev": 943, "test": 1358, "train": 2304, "validated": 4686, "invalidated": 585, "other": 0 }, "duration": 20676960, "reportedSentences": 378, "validatedSentences": 6079, "unvalidatedSentences": 50, "clips": 5271, "splits": { "accent": {}, "age": { "": 0.19, "twenties": 0.07, "thirties": 0.45, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0.17, "seventies": 0.11, "eighties": 0, "nineties": 0 }, "gender": { "": 0.27, "male_masculine": 0.45, "female_feminine": 0.28, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 4890, "agriculture_food": 14, "automotive_transport": 1, "finance": 0, "service_retail": 0, "general": 366, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 1, "nature_environment": 0, "news_current_affairs": 1, "technology_robotics": 0 } }, "users": 39, "size": 115145015, "checksum": "7c9213de879b7214f580c6421b855b5f731b7b757e96bc80160c78a9ec42ce67", "avgDurationSecs": 3.923, "validDurationSecs": 18382.135, "totalHrs": 5.74, "validHrs": 5.1 }, "lo": { "buckets": { "dev": 28, "test": 28, "train": 106, "validated": 162, "invalidated": 11, "other": 50 }, "duration": 1456560, "reportedSentences": 0, "validatedSentences": 5145, "unvalidatedSentences": 5925, "clips": 223, "splits": { 
"accent": {}, "age": { "": 0.11, "twenties": 0.33, "thirties": 0.56, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.11, "male_masculine": 0.89, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 223, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 16, "size": 9346586, "checksum": "a09fcebc44a72c0cca612a5394902a5c5dac06e190da915df06b3a0bf2cc9fa0", "avgDurationSecs": 6.532, "validDurationSecs": 1058.129, "totalHrs": 0.4, "validHrs": 0.29 }, "lrk": { "buckets": { "dev": 0, "test": 347, "train": 1654, "validated": 11462, "invalidated": 206, "other": 123 }, "duration": 42216012, "reportedSentences": 0, "validatedSentences": 2006, "unvalidatedSentences": -1, "clips": 11791, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.28, "thirties": 0.01, "teens": 0.34, "fourties": 0.19, "fifties": 0.01, "sixties": 0.17, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.83, "male_masculine": 0, "female_feminine": 0.17, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 11791, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 20, "size": 246865264, "checksum": "bdb71407d5327ccbf5ab12239dca33c77dd6d6a12d85a366494f89367fda0ca3", "avgDurationSecs": 3.58, "validDurationSecs": 41038.074, "totalHrs": 11.72, "validHrs": 11.39 }, "lss": { "buckets": { "dev": 474, "test": 645, "train": 903, "validated": 10326, "invalidated": 34, 
"other": 94 }, "duration": 36300708, "reportedSentences": 0, "validatedSentences": 2029, "unvalidatedSentences": -1, "clips": 10454, "splits": { "accent": {}, "age": { "": 0.04, "twenties": 0.45, "thirties": 0.33, "teens": 0, "fourties": 0.19, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.7, "male_masculine": 0, "female_feminine": 0.19, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.11 }, "sentence_domain": { "": 10454, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 15, "size": 183610168, "checksum": "f9aba4d8963a6aa930762bb41dc1a12b8278860044600ed9e136b2c9a156209e", "avgDurationSecs": 3.472, "validDurationSecs": 35856.238, "totalHrs": 10.08, "validHrs": 9.96 }, "lt": { "buckets": { "dev": 5111, "test": 5517, "train": 9061, "validated": 19891, "invalidated": 910, "other": 4040 }, "duration": 127317788, "reportedSentences": 192, "validatedSentences": 129304, "unvalidatedSentences": 4814, "clips": 24841, "splits": { "accent": {}, "age": { "": 0.16, "twenties": 0.26, "thirties": 0.23, "teens": 0.03, "fourties": 0.29, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.42, "male_masculine": 0.41, "female_feminine": 0.16, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 24841, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 328, "size": 777846668, "checksum": "6b14a22ba834499d6918a05b6f3af067f34f2d88a95890ad07569715aaae4947", "avgDurationSecs": 5.125, 
"validDurationSecs": 101947.511, "totalHrs": 35.36, "validHrs": 28.31 }, "ltg": { "buckets": { "dev": 3490, "test": 3639, "train": 4937, "validated": 22653, "invalidated": 1867, "other": 629 }, "duration": 120455748, "reportedSentences": 79, "validatedSentences": 15079, "unvalidatedSentences": 2269, "clips": 25149, "splits": { "accent": {}, "age": { "": 0.34, "twenties": 0.09, "thirties": 0.23, "teens": 0, "fourties": 0.32, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.34, "male_masculine": 0.05, "female_feminine": 0.61, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 19190, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 6297, "healthcare": 1, "history_law_government": 709, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 382, "size": 694917582, "checksum": "be380d8e082f5bf47659b57b62c5bd34b426633464aa22f91a2a30c30c707eee", "avgDurationSecs": 4.79, "validDurationSecs": 108500.698, "totalHrs": 33.45, "validHrs": 30.13 }, "lua": { "buckets": { "dev": 291, "test": 291, "train": 298, "validated": 4801, "invalidated": 894, "other": 0 }, "duration": 46917972, "reportedSentences": 0, "validatedSentences": 880, "unvalidatedSentences": -1, "clips": 5695, "splits": { "accent": {}, "age": { "": 0.13, "twenties": 0.67, "thirties": 0.19, "teens": 0, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.68, "male_masculine": 0, "female_feminine": 0.32, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5695, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, 
"technology_robotics": 0 } }, "users": 46, "size": 272331674, "checksum": "4f209c15a0ad7c21fcbb0840985b9439e98c6a074a74bfbcf774b992c2265c33", "avgDurationSecs": 8.238, "validDurationSecs": 39552.798, "totalHrs": 13.03, "validHrs": 10.98 }, "luo": { "buckets": { "dev": 3060, "test": 3001, "train": 4498, "validated": 20343, "invalidated": 792, "other": 61603 }, "duration": 430613136, "reportedSentences": 8, "validatedSentences": 14691, "unvalidatedSentences": 13870, "clips": 82738, "splits": { "accent": {}, "age": { "": 0.36, "twenties": 0.33, "thirties": 0.28, "teens": 0, "fourties": 0, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.6, "male_masculine": 0, "female_feminine": 0.4, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 11, "agriculture_food": 1135, "automotive_transport": 341, "finance": 967, "service_retail": 2748, "general": 69488, "healthcare": 2237, "history_law_government": 865, "language_fundamentals": 555, "media_entertainment": 806, "nature_environment": 634, "news_current_affairs": 2710, "technology_robotics": 241 } }, "users": 47, "size": 2392815146, "checksum": "88eb400a20c69e9f2be64afc31058da8cc5aa947759c50b1692551562b1b8c71", "avgDurationSecs": 5.205, "validDurationSecs": 105875.934, "totalHrs": 119.61, "validHrs": 29.4 }, "lv": { "buckets": { "dev": 7781, "test": 7781, "train": 14619, "validated": 197952, "invalidated": 11580, "other": 21328 }, "duration": 1110350141, "reportedSentences": 3144, "validatedSentences": 34599, "unvalidatedSentences": 64990, "clips": 230860, "splits": { "accent": {}, "age": { "": 0.44, "twenties": 0.06, "thirties": 0.28, "teens": 0.01, "fourties": 0.09, "fifties": 0.08, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.45, "male_masculine": 0.25, "female_feminine": 0.3, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 229992, "agriculture_food": 8, 
"automotive_transport": 3, "finance": 5, "service_retail": 16, "general": 722, "healthcare": 36, "history_law_government": 8, "language_fundamentals": 9, "media_entertainment": 16, "nature_environment": 44, "news_current_affairs": 5, "technology_robotics": 13 } }, "users": 6530, "size": 6221788585, "checksum": "4c3f8684153e45f57f249cec4c7509723d0a0d54791aec22f65faf13f0e3f2bb", "avgDurationSecs": 4.81, "validDurationSecs": 952074.985, "totalHrs": 308.43, "validHrs": 264.46 }, "lzz": { "buckets": { "dev": 815, "test": 772, "train": 972, "validated": 5779, "invalidated": 418, "other": 927 }, "duration": 27077688, "reportedSentences": 16, "validatedSentences": 4412, "unvalidatedSentences": 1203, "clips": 7124, "splits": { "accent": {}, "age": { "": 0.11, "twenties": 0.17, "thirties": 0.07, "teens": 0, "fourties": 0.25, "fifties": 0.09, "sixties": 0.3, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.58, "male_masculine": 0, "female_feminine": 0.42, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 33, "agriculture_food": 3, "automotive_transport": 8, "finance": 1, "service_retail": 0, "general": 7558, "healthcare": 6, "history_law_government": 2, "language_fundamentals": 3, "media_entertainment": 0, "nature_environment": 48, "news_current_affairs": 3, "technology_robotics": 0 } }, "users": 57, "size": 143891652, "checksum": "deac6c71e8f65378eb69fb49ada05679946bd726bc91517cfd089d6ecb137fb7", "avgDurationSecs": 3.801, "validDurationSecs": 21965.463, "totalHrs": 7.52, "validHrs": 6.1 }, "mau": { "buckets": { "dev": 252, "test": 312, "train": 555, "validated": 6040, "invalidated": 13, "other": 216 }, "duration": 38969352, "reportedSentences": 0, "validatedSentences": 1119, "unvalidatedSentences": -1, "clips": 6269, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.21, "thirties": 0.43, "teens": 0, "fourties": 0.36, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.65, 
"male_masculine": 0, "female_feminine": 0.35, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6269, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 7, "size": 207459818, "checksum": "9fad9bf14acb1702c1dad989373134a59527bc856cb646c14fab5ba68d8f5da5", "avgDurationSecs": 6.216, "validDurationSecs": 37545.842, "totalHrs": 10.82, "validHrs": 10.42 }, "mbo": { "buckets": { "dev": 321, "test": 263, "train": 410, "validated": 7106, "invalidated": 75, "other": 701 }, "duration": 43456968, "reportedSentences": 4, "validatedSentences": 998, "unvalidatedSentences": 4, "clips": 7882, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7882, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 13, "size": 254177118, "checksum": "a2dc5f7ffa82e73d1adddffc372eaac7352ad5b11da6c51ac53bc85dfd9ef728", "avgDurationSecs": 5.513, "validDurationSecs": 39178.535, "totalHrs": 12.07, "validHrs": 10.88 }, "mcf": { "buckets": { "dev": 0, "test": 320, "train": 677, "validated": 13884, "invalidated": 28, "other": 78 }, "duration": 37156428, "reportedSentences": 0, "validatedSentences": 997, "unvalidatedSentences": -1, "clips": 13990, "splits": { "accent": {}, "age": { "": 0.05, "twenties": 0.28, "thirties": 0.34, "teens": 
0.25, "fourties": 0.07, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.74, "male_masculine": 0, "female_feminine": 0.26, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 13990, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 21, "size": 183268786, "checksum": "2555d944cbce5510565439960845916a40b02de78d389ec58b271b98df975aec", "avgDurationSecs": 2.656, "validDurationSecs": 36874.9, "totalHrs": 10.32, "validHrs": 10.24 }, "mcn": { "buckets": { "dev": 316, "test": 316, "train": 325, "validated": 8621, "invalidated": 32, "other": 607 }, "duration": 39026016, "reportedSentences": 4, "validatedSentences": 959, "unvalidatedSentences": -1, "clips": 9260, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 9260, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 35, "size": 228253357, "checksum": "98814c19f4f6e35d188f95d96196847bd9e10d15283ce6de237f7d7945605deb", "avgDurationSecs": 4.214, "validDurationSecs": 36332.968, "totalHrs": 10.84, "validHrs": 10.09 }, "mcx": { "buckets": { "dev": 111, "test": 249, "train": 543, "validated": 5491, "invalidated": 67, "other": 2 }, "duration": 36755964, "reportedSentences": 0, "validatedSentences": 903, 
"unvalidatedSentences": -1, "clips": 5560, "splits": { "accent": {}, "age": { "": 0.99, "twenties": 0, "thirties": 0.01, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5560, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 10, "size": 217211497, "checksum": "6876d86e0d40bbabe5bc77b2f459d86ff19363646185be82b5d8d667db1fb66b", "avgDurationSecs": 6.611, "validDurationSecs": 36299.82, "totalHrs": 10.2, "validHrs": 10.08 }, "mdd": { "buckets": { "dev": 0, "test": 46, "train": 877, "validated": 5542, "invalidated": 31, "other": 0 }, "duration": 36437112, "reportedSentences": 0, "validatedSentences": 923, "unvalidatedSentences": -1, "clips": 5573, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5573, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 7, "size": 208708874, "checksum": "143398b834b4d612570753e124c3b8fef58d8343f6c4d05b463c6c5b14667bd8", "avgDurationSecs": 6.538, "validDurationSecs": 36234.429, "totalHrs": 10.12, "validHrs": 10.06 }, "mdf": { "buckets": { "dev": 54, "test": 107, "train": 175, "validated": 336, 
"invalidated": 9, "other": 0 }, "duration": 1811340, "reportedSentences": 15, "validatedSentences": 6039, "unvalidatedSentences": 4, "clips": 345, "splits": { "accent": {}, "age": { "": 0.58, "twenties": 0.02, "thirties": 0, "teens": 0, "fourties": 0.34, "fifties": 0, "sixties": 0.06, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.58, "male_masculine": 0.08, "female_feminine": 0.34, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 345, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 11, "size": 11056956, "checksum": "705c18c73d1217495c84bcdd6df8953d6d6fe0244665b8f807c4af7c416525ef", "avgDurationSecs": 5.25, "validDurationSecs": 1764.088, "totalHrs": 0.5, "validHrs": 0.49 }, "mgg": { "buckets": { "dev": 296, "test": 212, "train": 483, "validated": 4863, "invalidated": 290, "other": 0 }, "duration": 39065004, "reportedSentences": 0, "validatedSentences": 991, "unvalidatedSentences": -1, "clips": 5153, "splits": { "accent": {}, "age": { "": 0.99, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.99, "male_masculine": 0, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5153, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 8, "size": 231037660, "checksum": "017f84fba3b42e99cbcf1939b7f5264644b54749cc1b1b96d32afc7ba51b9fa6", "avgDurationSecs": 7.581, "validDurationSecs": 
36866.508, "totalHrs": 10.85, "validHrs": 10.24 }, "mhk": { "buckets": { "dev": 305, "test": 202, "train": 493, "validated": 6597, "invalidated": 52, "other": 46 }, "duration": 41164560, "reportedSentences": 0, "validatedSentences": 1000, "unvalidatedSentences": -1, "clips": 6695, "splits": { "accent": {}, "age": { "": 0.93, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0.07, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6695, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 9, "size": 230628274, "checksum": "5b79a388485e539bcb27d890f5b85a1177b4549eda7a11064624f596f4a53c03", "avgDurationSecs": 6.149, "validDurationSecs": 40562.002, "totalHrs": 11.43, "validHrs": 11.26 }, "mhr": { "buckets": { "dev": 14663, "test": 15152, "train": 186565, "validated": 218744, "invalidated": 16027, "other": 5 }, "duration": 1085410764, "reportedSentences": 102, "validatedSentences": 328269, "unvalidatedSentences": 3, "clips": 234776, "splits": { "accent": {}, "age": { "": 0.12, "twenties": 0.33, "thirties": 0.21, "teens": 0.14, "fourties": 0.12, "fifties": 0.04, "sixties": 0.03, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.12, "male_masculine": 0.19, "female_feminine": 0.69, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 234776, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, 
"users": 500, "size": 6124380934, "checksum": "21c80f3897057743d21e15f591d413f4348aa398905c066efeb00a4ae1861a6b", "avgDurationSecs": 4.623, "validDurationSecs": 1011292.007, "totalHrs": 301.5, "validHrs": 280.91 }, "mk": { "buckets": { "dev": 1736, "test": 1800, "train": 2111, "validated": 17439, "invalidated": 989, "other": 22573 }, "duration": 197182728, "reportedSentences": 161, "validatedSentences": 6046, "unvalidatedSentences": 648, "clips": 41001, "splits": { "accent": {}, "age": { "": 0.11, "twenties": 0.36, "thirties": 0.27, "teens": 0.03, "fourties": 0.2, "fifties": 0.02, "sixties": 0.03, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0.28, "female_feminine": 0.39, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 40735, "agriculture_food": 14, "automotive_transport": 0, "finance": 7, "service_retail": 0, "general": 211, "healthcare": 27, "history_law_government": 7, "language_fundamentals": 0, "media_entertainment": 7, "nature_environment": 21, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 466, "size": 1082253769, "checksum": "053b8c57582d47d876ac6ea4632704a6f0749279c2257ba46b864e42cfead76c", "avgDurationSecs": 4.809, "validDurationSecs": 83867.945, "totalHrs": 54.77, "validHrs": 23.29 }, "mki": { "buckets": { "dev": 0, "test": 23, "train": 2008, "validated": 11181, "invalidated": 134, "other": 12 }, "duration": 36287784, "reportedSentences": 0, "validatedSentences": 2055, "unvalidatedSentences": -1, "clips": 11327, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.28, "thirties": 0.72, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.82, "male_masculine": 0, "female_feminine": 0.18, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 11327, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, 
"healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 12, "size": 196588067, "checksum": "a9c2835deea7cad075681efd6f30fe65cb3139779ee119ba80bcdfb14c89de51", "avgDurationSecs": 3.204, "validDurationSecs": 35820.051, "totalHrs": 10.07, "validHrs": 9.95 }, "ml": { "buckets": { "dev": 934, "test": 882, "train": 1232, "validated": 3466, "invalidated": 121, "other": 6043 }, "duration": 40854852, "reportedSentences": 243, "validatedSentences": 5487, "unvalidatedSentences": 4728, "clips": 9630, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0.59, "thirties": 0.08, "teens": 0, "fourties": 0.11, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.34, "male_masculine": 0.47, "female_feminine": 0.19, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 9630, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 154, "size": 230927465, "checksum": "09bad714c75c67807062977871629e6aed94df29c196c3edd6e9230b28fb20a2", "avgDurationSecs": 4.242, "validDurationSecs": 14704.353, "totalHrs": 11.34, "validHrs": 4.08 }, "mn": { "buckets": { "dev": 1934, "test": 1934, "train": 2190, "validated": 33704, "invalidated": 3239, "other": 59365 }, "duration": 506019608, "reportedSentences": 46, "validatedSentences": 6098, "unvalidatedSentences": 16, "clips": 96308, "splits": { "accent": {}, "age": { "": 0.04, "twenties": 0.92, "thirties": 0.03, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0.01 }, "gender": { "": 0.46, "male_masculine": 0.14, "female_feminine": 0.4, "transgender": 0, 
"non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 96251, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 33, "healthcare": 2, "history_law_government": 2, "language_fundamentals": 2, "media_entertainment": 2, "nature_environment": 2, "news_current_affairs": 16, "technology_robotics": 0 } }, "users": 606, "size": 3078079672, "checksum": "384488bb777f7478ec8ad9467e402a8864b04a38aee2d8da728c887e24cd51a5", "avgDurationSecs": 5.254, "validDurationSecs": 177086.897, "totalHrs": 140.56, "validHrs": 49.19 }, "mr": { "buckets": { "dev": 1766, "test": 1796, "train": 2189, "validated": 10995, "invalidated": 2292, "other": 2804 }, "duration": 99768888, "reportedSentences": 59, "validatedSentences": 6717, "unvalidatedSentences": 12102, "clips": 16091, "splits": { "accent": {}, "age": { "": 0.06, "twenties": 0.28, "thirties": 0.13, "teens": 0.53, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.06, "male_masculine": 0.19, "female_feminine": 0.75, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 16091, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 105, "size": 586525379, "checksum": "3e4cdbb5d87e20a12bb888459baba7a8f41b2e635c2884da6cfad34c8a165b9b", "avgDurationSecs": 6.2, "validDurationSecs": 68172.203, "totalHrs": 27.71, "validHrs": 18.93 }, "mrj": { "buckets": { "dev": 7314, "test": 7164, "train": 14325, "validated": 28973, "invalidated": 1261, "other": 1459 }, "duration": 132879852, "reportedSentences": 50, "validatedSentences": 54143, "unvalidatedSentences": 4, "clips": 31693, "splits": { "accent": {}, "age": { "": 0.1, "twenties": 0.35, "thirties": 0.22, "teens": 0.06, 
"fourties": 0.12, "fifties": 0.11, "sixties": 0.04, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.13, "male_masculine": 0.25, "female_feminine": 0.62, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 31693, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 60, "size": 747239471, "checksum": "a01e3933ee8dcc0b3c211c7867e29f775164505ffd7b00bd0242dd29509819c3", "avgDurationSecs": 4.193, "validDurationSecs": 121475.656, "totalHrs": 36.91, "validHrs": 33.74 }, "ms": { "buckets": { "dev": 6, "test": 5, "train": 11, "validated": 27, "invalidated": 1, "other": 2064 }, "duration": 12876444, "reportedSentences": 0, "validatedSentences": 257, "unvalidatedSentences": 9, "clips": 2092, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2057, "agriculture_food": 0, "automotive_transport": 6, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 29, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 9 } }, "users": 27, "size": 76000823, "checksum": "20ff7d48d859b7bf3f2b51fed187c349efde41d9f03bc99049791df447ecebf4", "avgDurationSecs": 6.155, "validDurationSecs": 166.187, "totalHrs": 3.57, "validHrs": 0.04 }, "mse": { "buckets": { "dev": 327, "test": 284, "train": 391, "validated": 4338, "invalidated": 310, "other": 18 }, "duration": 38836476, "reportedSentences": 0, "validatedSentences": 1002, 
"unvalidatedSentences": -1, "clips": 4666, "splits": { "accent": {}, "age": { "": 0.79, "twenties": 0.21, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 4666, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 18, "size": 218219565, "checksum": "77b8d72d1483f1a82384abbde1971c2e7ddc5cf816e2230358b2010706185f9e", "avgDurationSecs": 8.323, "validDurationSecs": 36106.437, "totalHrs": 10.78, "validHrs": 10.02 }, "mt": { "buckets": { "dev": 1625, "test": 1660, "train": 1910, "validated": 6595, "invalidated": 326, "other": 6286 }, "duration": 62787780, "reportedSentences": 10, "validatedSentences": 5273, "unvalidatedSentences": 11773, "clips": 13207, "splits": { "accent": {}, "age": { "": 0.27, "twenties": 0.19, "thirties": 0.09, "teens": 0.03, "fourties": 0.17, "fifties": 0.25, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.28, "male_masculine": 0.25, "female_feminine": 0.47, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 13207, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 222, "size": 466475288, "checksum": "f6be17867ef174e2235020c7fb7f997fb8530dfbbde9d84a23bb3816ee0a0d85", "avgDurationSecs": 4.754, "validDurationSecs": 31353.48, "totalHrs": 17.44, "validHrs": 8.7 }, "mua": { "buckets": { "dev": 286, 
"test": 281, "train": 428, "validated": 8677, "invalidated": 33, "other": 27 }, "duration": 38936952, "reportedSentences": 5, "validatedSentences": 995, "unvalidatedSentences": -1, "clips": 8737, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8737, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 17, "size": 217979209, "checksum": "1f30809c317d59d91a3c7060140e5500e212a30d240aef8ebfcb802ef1a679e5", "avgDurationSecs": 4.457, "validDurationSecs": 38669.558, "totalHrs": 10.81, "validHrs": 10.74 }, "mug": { "buckets": { "dev": 297, "test": 297, "train": 404, "validated": 2687, "invalidated": 263, "other": 0 }, "duration": 45123948, "reportedSentences": 0, "validatedSentences": 1002, "unvalidatedSentences": -1, "clips": 2950, "splits": { "accent": {}, "age": { "": 0.99, "twenties": 0, "thirties": 0.01, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2950, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 17, "size": 266758088, "checksum": "61707c1c6eb4ad4d852e43a142e108600c37feb3d1ba3c565e07c8fb1473bd30", "avgDurationSecs": 
15.296, "validDurationSecs": 41101.033, "totalHrs": 12.53, "validHrs": 11.41 }, "mve": { "buckets": { "dev": 555, "test": 146, "train": 1296, "validated": 7648, "invalidated": 47, "other": 11 }, "duration": 38372904, "reportedSentences": 0, "validatedSentences": 2003, "unvalidatedSentences": -1, "clips": 7706, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.26, "thirties": 0.33, "teens": 0, "fourties": 0.39, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7706, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 20, "size": 225592528, "checksum": "0935f7b6789e38c6e935bff6a15922f7875d44a208f789b7e02dc6545e52bc6e", "avgDurationSecs": 4.98, "validDurationSecs": 38084.086, "totalHrs": 10.65, "validHrs": 10.57 }, "mvy": { "buckets": { "dev": 1946, "test": 2076, "train": 2558, "validated": 16600, "invalidated": 590, "other": 1285 }, "duration": 90013968, "reportedSentences": 90, "validatedSentences": 6634, "unvalidatedSentences": 222, "clips": 18475, "splits": { "accent": {}, "age": { "": 0.04, "twenties": 0.39, "thirties": 0.47, "teens": 0, "fourties": 0.06, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 18245, "agriculture_food": 33, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 108, "healthcare": 0, "history_law_government": 78, "language_fundamentals": 0, "media_entertainment": 11, "nature_environment": 7, "news_current_affairs": 0, 
"technology_robotics": 9 } }, "users": 56, "size": 476972904, "checksum": "298600a827aec8372df257483745fe3093c54509d1ae657995f5e382dad8b7cb", "avgDurationSecs": 4.872, "validDurationSecs": 80878.586, "totalHrs": 25, "validHrs": 22.46 }, "mxu": { "buckets": { "dev": 309, "test": 317, "train": 333, "validated": 7749, "invalidated": 67, "other": 2 }, "duration": 50839092, "reportedSentences": 0, "validatedSentences": 959, "unvalidatedSentences": -1, "clips": 7818, "splits": { "accent": {}, "age": { "": 0.56, "twenties": 0.32, "thirties": 0.12, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7818, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 22, "size": 298287692, "checksum": "bc7d32fbc9d4c35e91d2797ce5eb7a8b4d26ee805b4f3f4b67eec6f28ed8f3d0", "avgDurationSecs": 6.503, "validDurationSecs": 50390.397, "totalHrs": 14.12, "validHrs": 13.99 }, "myv": { "buckets": { "dev": 239, "test": 482, "train": 1241, "validated": 1972, "invalidated": 23, "other": 15 }, "duration": 11624940, "reportedSentences": 31, "validatedSentences": 5066, "unvalidatedSentences": 7, "clips": 2010, "splits": { "accent": {}, "age": { "": 0.39, "twenties": 0.09, "thirties": 0.24, "teens": 0.01, "fourties": 0.02, "fifties": 0, "sixties": 0.26, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.39, "male_masculine": 0.54, "female_feminine": 0.07, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2010, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 
0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 13, "size": 68321817, "checksum": "13aadf802fc4548ef59760746fbc04c9a1c9e9705db95169602e338745bb6064", "avgDurationSecs": 5.784, "validDurationSecs": 11405.165, "totalHrs": 3.22, "validHrs": 3.16 }, "nan-tw": { "buckets": { "dev": 5923, "test": 6383, "train": 11559, "validated": 29405, "invalidated": 576, "other": 2265 }, "duration": 85243932, "reportedSentences": 226, "validatedSentences": 26907, "unvalidatedSentences": 370, "clips": 32246, "splits": { "accent": {}, "age": { "": 0.13, "twenties": 0.51, "thirties": 0.15, "teens": 0.01, "fourties": 0.17, "fifties": 0.01, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.27, "male_masculine": 0.62, "female_feminine": 0.11, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 31563, "agriculture_food": 4, "automotive_transport": 0, "finance": 0, "service_retail": 2, "general": 622, "healthcare": 1, "history_law_government": 0, "language_fundamentals": 55, "media_entertainment": 0, "nature_environment": 55, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 297, "size": 485404206, "checksum": "fe5cd5420acf6d284ff9cd006c475844ef8f68874863d7c09c2cb6d75fd8d513", "avgDurationSecs": 2.644, "validDurationSecs": 77733.605, "totalHrs": 23.67, "validHrs": 21.59 }, "nb-NO": { "buckets": { "dev": 415, "test": 312, "train": 1273, "validated": 2001, "invalidated": 96, "other": 140 }, "duration": 9378972, "reportedSentences": 1, "validatedSentences": 4394, "unvalidatedSentences": 3674, "clips": 2237, "splits": { "accent": {}, "age": { "": 0.18, "twenties": 0, "thirties": 0.62, "teens": 0, "fourties": 0.19, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.4, "male_masculine": 0.6, "female_feminine": 0, "transgender": 0, "non-binary": 
0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1426, "agriculture_food": 3, "automotive_transport": 0, "finance": 4, "service_retail": 37, "general": 717, "healthcare": 0, "history_law_government": 21, "language_fundamentals": 13, "media_entertainment": 17, "nature_environment": 37, "news_current_affairs": 32, "technology_robotics": 0 } }, "users": 67, "size": 54581626, "checksum": "7e02d3de626e49f7fec04ffb61e7215a2c4c16c2c4c943abfa5864ed994cce44", "avgDurationSecs": 4.193, "validDurationSecs": 8389.505, "totalHrs": 2.6, "validHrs": 2.33 }, "ncx": { "buckets": { "dev": 339, "test": 345, "train": 360, "validated": 8641, "invalidated": 62, "other": 806 }, "duration": 42381792, "reportedSentences": 2, "validatedSentences": 1518, "unvalidatedSentences": 5, "clips": 9509, "splits": { "accent": {}, "age": { "": 0.04, "twenties": 0.15, "thirties": 0.63, "teens": 0, "fourties": 0.17, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.61, "male_masculine": 0.01, "female_feminine": 0.38, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2832, "agriculture_food": 0, "automotive_transport": 0, "finance": 212, "service_retail": 0, "general": 0, "healthcare": 212, "history_law_government": 0, "language_fundamentals": 2047, "media_entertainment": 4700, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 41, "size": 243130501, "checksum": "0e68bdaab632f8183451a1a44ad9790ecab34678fbe9ad22c9ffee0fb3ca76bf", "avgDurationSecs": 4.457, "validDurationSecs": 38513.1, "totalHrs": 11.77, "validHrs": 10.69 }, "ne-NP": { "buckets": { "dev": 314, "test": 289, "train": 355, "validated": 1179, "invalidated": 100, "other": 415 }, "duration": 6964560, "reportedSentences": 5, "validatedSentences": 1316, "unvalidatedSentences": 30, "clips": 1694, "splits": { "accent": {}, "age": { "": 0.31, "twenties": 0.57, "thirties": 0.12, "teens": 0, "fourties": 0, "fifties": 0, 
"sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male_masculine": 0.61, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1693, "agriculture_food": 1, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 61, "size": 40868769, "checksum": "d7860601d5d1fdcd2993e90efeac9730ca0f40e46be9516f52f0e17403b213d5", "avgDurationSecs": 4.111, "validDurationSecs": 4847.235, "totalHrs": 1.93, "validHrs": 1.34 }, "nhi": { "buckets": { "dev": 10, "test": 6, "train": 24, "validated": 40, "invalidated": 0, "other": 387 }, "duration": 2169576, "reportedSentences": 0, "validatedSentences": 758, "unvalidatedSentences": 11, "clips": 427, "splits": { "accent": {}, "age": { "": 0.19, "twenties": 0.73, "thirties": 0.08, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.19, "male_masculine": 0.08, "female_feminine": 0.73, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 427, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 6, "size": 12735162, "checksum": "244897ae41792379b265414bbaf44bffbccca8e24dd0487c8646862137dafd94", "avgDurationSecs": 5.081, "validDurationSecs": 203.239, "totalHrs": 0.6, "validHrs": 0.05 }, "nl": { "buckets": { "dev": 12225, "test": 12225, "train": 46082, "validated": 103597, "invalidated": 6133, "other": 5700 }, "duration": 504525061, "reportedSentences": 406, "validatedSentences": 260817, "unvalidatedSentences": 
10596, "clips": 115430, "splits": { "accent": {}, "age": { "": 0.36, "twenties": 0.18, "thirties": 0.12, "teens": 0.02, "fourties": 0.18, "fifties": 0.11, "sixties": 0.02, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.38, "male_masculine": 0.52, "female_feminine": 0.1, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 114703, "agriculture_food": 16, "automotive_transport": 21, "finance": 7, "service_retail": 0, "general": 368, "healthcare": 56, "history_law_government": 55, "language_fundamentals": 175, "media_entertainment": 2, "nature_environment": 55, "news_current_affairs": 3, "technology_robotics": 5 } }, "users": 1865, "size": 3373398122, "checksum": "21eb500aa204e5084516690f5d271e1502c1b3c5f15580ace2e114f75eb6dbf2", "avgDurationSecs": 4.371, "validDurationSecs": 452805.014, "totalHrs": 140.14, "validHrs": 125.77 }, "nla": { "buckets": { "dev": 172, "test": 220, "train": 592, "validated": 4635, "invalidated": 176, "other": 5 }, "duration": 37346544, "reportedSentences": 0, "validatedSentences": 984, "unvalidatedSentences": -1, "clips": 4816, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 4816, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 15, "size": 212123306, "checksum": "63ada318ffc0fbf732daed24b8fdd805fa05d7face5c1c5c1b8d66a18bc4e831", "avgDurationSecs": 7.755, "validDurationSecs": 35942.947, "totalHrs": 10.37, "validHrs": 9.98 }, "nlv": { "buckets": { "dev": 248, "test": 291, "train": 
467, "validated": 6656, "invalidated": 128, "other": 60 }, "duration": 42922260, "reportedSentences": 1, "validatedSentences": 1006, "unvalidatedSentences": -1, "clips": 6844, "splits": { "accent": {}, "age": { "": 0.01, "twenties": 0.13, "thirties": 0.85, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.91, "male_masculine": 0, "female_feminine": 0.09, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5103, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 698, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 824, "media_entertainment": 944, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 16, "size": 249248700, "checksum": "1d501ae8bc18eaec8ce2835947f36f0c6a9d3323057b7415ce2fa00e00ae0791", "avgDurationSecs": 6.272, "validDurationSecs": 41743.215, "totalHrs": 11.92, "validHrs": 11.59 }, "nmg": { "buckets": { "dev": 272, "test": 270, "train": 483, "validated": 5865, "invalidated": 89, "other": 5 }, "duration": 38142936, "reportedSentences": 1, "validatedSentences": 1025, "unvalidatedSentences": -1, "clips": 5959, "splits": { "accent": {}, "age": { "": 0.86, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0.14, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5959, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 11, "size": 217274870, "checksum": "42a73b308a7cc5bdbd0f62c73c9ad6c159ff34a87ae4a37ba67a38e5fd0dd41e", "avgDurationSecs": 
6.401, "validDurationSecs": 37541.252, "totalHrs": 10.59, "validHrs": 10.42 }, "nmz": { "buckets": { "dev": 795, "test": 795, "train": 846, "validated": 13698, "invalidated": 109, "other": 313 }, "duration": 41779332, "reportedSentences": 2, "validatedSentences": 2446, "unvalidatedSentences": 1, "clips": 14120, "splits": { "accent": {}, "age": { "": 0.92, "twenties": 0, "thirties": 0.08, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.99, "male_masculine": 0, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 14120, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 35, "size": 215687900, "checksum": "46566952dd93ab8bb6cd9563df118dc892b434554ecaec441218bddfc7830e0c", "avgDurationSecs": 2.959, "validDurationSecs": 40530.686, "totalHrs": 11.6, "validHrs": 11.25 }, "nn-NO": { "buckets": { "dev": 322, "test": 392, "train": 588, "validated": 1307, "invalidated": 61, "other": 11 }, "duration": 6050232, "reportedSentences": 23, "validatedSentences": 5318, "unvalidatedSentences": 6, "clips": 1379, "splits": { "accent": {}, "age": { "": 0.26, "twenties": 0.28, "thirties": 0.4, "teens": 0.02, "fourties": 0.03, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.29, "male_masculine": 0.58, "female_feminine": 0.13, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1379, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, 
"technology_robotics": 0 } }, "users": 40, "size": 35178699, "checksum": "7d63636ad50966fbed2f6d092060a4a0e95735df571726f2faed08c336b1ca8a", "avgDurationSecs": 4.387, "validDurationSecs": 5734.339, "totalHrs": 1.68, "validHrs": 1.59 }, "nnh": { "buckets": { "dev": 186, "test": 288, "train": 412, "validated": 7631, "invalidated": 156, "other": 1 }, "duration": 82297008, "reportedSentences": 0, "validatedSentences": 886, "unvalidatedSentences": -1, "clips": 7788, "splits": { "accent": {}, "age": { "": 0.89, "twenties": 0, "thirties": 0.11, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.89, "male_masculine": 0, "female_feminine": 0.11, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7788, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 17, "size": 486872826, "checksum": "babd44c2f72627c3a056b231c56db4202e847d9634009ed258b84f1d097bc1ba", "avgDurationSecs": 10.567, "validDurationSecs": 80637.965, "totalHrs": 22.86, "validHrs": 22.39 }, "nr": { "buckets": { "dev": 0, "test": 0, "train": 0, "validated": 0, "invalidated": 0, "other": 7 }, "duration": 60696, "reportedSentences": 0, "validatedSentences": 319, "unvalidatedSentences": 7468, "clips": 7, "splits": { "accent": {}, "age": { "": 0.29, "twenties": 0, "thirties": 0.71, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, 
"language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 2, "size": 881984, "checksum": "56bf8efe6e787631d2167e53ea0f75d5d8d23574400c1ded2dee82f52a1c513c", "avgDurationSecs": 8.671, "validDurationSecs": 0, "totalHrs": 0.01, "validHrs": 0 }, "nso": { "buckets": { "dev": 0, "test": 0, "train": 2, "validated": 2, "invalidated": 0, "other": 18 }, "duration": 92592, "reportedSentences": 0, "validatedSentences": 2244, "unvalidatedSentences": 14227, "clips": 20, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 20, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1, "size": 1552765, "checksum": "73e9b4a6e2b9eab02599ace56ee43a4dbe7acaeb4bf184f546d5462a60e73d65", "avgDurationSecs": 4.63, "validDurationSecs": 9.259, "totalHrs": 0.02, "validHrs": 0 }, "nyu": { "buckets": { "dev": 0, "test": 248, "train": 1023, "validated": 3639, "invalidated": 76, "other": 1 }, "duration": 38390760, "reportedSentences": 0, "validatedSentences": 1271, "unvalidatedSentences": -1, "clips": 3716, "splits": { "accent": {}, "age": { "": 0.32, "twenties": 0.33, "thirties": 0.34, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.67, "male_masculine": 0, "female_feminine": 0.33, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2624, "agriculture_food": 4, "automotive_transport": 0, "finance": 0, 
"service_retail": 0, "general": 1082, "healthcare": 2, "history_law_government": 2, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 4, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 5, "size": 219323882, "checksum": "3bb330e84c386c143ac89e39312aadbc3742b2e53b4f8411f22d18b535b73df2", "avgDurationSecs": 10.331, "validDurationSecs": 37595.257, "totalHrs": 10.66, "validHrs": 10.44 }, "oc": { "buckets": { "dev": 272, "test": 274, "train": 304, "validated": 1977, "invalidated": 203, "other": 7705 }, "duration": 48154680, "reportedSentences": 13, "validatedSentences": 4934, "unvalidatedSentences": 14, "clips": 9885, "splits": { "accent": {}, "age": { "": 0.5, "twenties": 0.02, "thirties": 0.04, "teens": 0.01, "fourties": 0.17, "fifties": 0.16, "sixties": 0.11, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.52, "male_masculine": 0.08, "female_feminine": 0.4, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 9885, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 149, "size": 274106967, "checksum": "e0331248b393bf9364ece8f85c0ecc71d33de74652847bcb4c143cdcccb186a2", "avgDurationSecs": 4.871, "validDurationSecs": 9630.936, "totalHrs": 13.37, "validHrs": 2.67 }, "odk": { "buckets": { "dev": 467, "test": 627, "train": 941, "validated": 6341, "invalidated": 46, "other": 0 }, "duration": 40676832, "reportedSentences": 0, "validatedSentences": 2047, "unvalidatedSentences": -1, "clips": 6387, "splits": { "accent": {}, "age": { "": 0.31, "twenties": 0.2, "thirties": 0.18, "teens": 0.15, "fourties": 0.08, "fifties": 0.08, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.7, "male_masculine": 0.14, "female_feminine": 
0.16, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6387, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 20, "size": 236342138, "checksum": "be42f35dacc8897e6d6adac8e837baa719bb06bbfe31c9fe43193174451b8ea6", "avgDurationSecs": 6.369, "validDurationSecs": 40383.872, "totalHrs": 11.29, "validHrs": 11.21 }, "or": { "buckets": { "dev": 640, "test": 479, "train": 2151, "validated": 4085, "invalidated": 437, "other": 19622 }, "duration": 134612700, "reportedSentences": 100, "validatedSentences": 6197, "unvalidatedSentences": 87, "clips": 24144, "splits": { "accent": {}, "age": { "": 0.06, "twenties": 0.65, "thirties": 0.27, "teens": 0, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.42, "male_masculine": 0.34, "female_feminine": 0.03, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.21 }, "sentence_domain": { "": 24137, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 7, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 163, "size": 832546925, "checksum": "0f23ae006a91ba4ef4a980488409c233cb8af7d9bbf397d69368182b569674b1", "avgDurationSecs": 5.575, "validDurationSecs": 22775.55, "totalHrs": 37.39, "validHrs": 6.32 }, "oru": { "buckets": { "dev": 0, "test": 1210, "train": 6079, "validated": 8490, "invalidated": 86, "other": 160 }, "duration": 65019456, "reportedSentences": 0, "validatedSentences": 7355, "unvalidatedSentences": 902, "clips": 8736, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0, "thirties": 0, "teens": 0, 
"fourties": 0, "fifties": 0, "sixties": 0.77, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.23, "male_masculine": 0.4, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.37 }, "sentence_domain": { "": 8630, "agriculture_food": 106, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 106, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 106, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 13, "size": 385828863, "checksum": "b2c9859c26e48cd41a5a54439c6d570676e2399015b6fd591646093d4e05617f", "avgDurationSecs": 7.443, "validDurationSecs": 63188.551, "totalHrs": 18.06, "validHrs": 17.55 }, "os": { "buckets": { "dev": 251, "test": 219, "train": 414, "validated": 915, "invalidated": 35, "other": 6 }, "duration": 5280084, "reportedSentences": 5, "validatedSentences": 1888, "unvalidatedSentences": 34, "clips": 956, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0.09, "thirties": 0.46, "teens": 0.02, "fourties": 0.2, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.78, "male_masculine": 0.22, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 955, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 1, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 37, "size": 30907997, "checksum": "ab156c9fbdce44f05645cbfeb8814d7169bbd6ac5f7f67c6620eace2ec4231ba", "avgDurationSecs": 5.523, "validDurationSecs": 5053.637, "totalHrs": 1.46, "validHrs": 1.4 }, "pa-IN": { "buckets": { "dev": 489, "test": 524, "train": 808, "validated": 1821, "invalidated": 104, "other": 1357 }, "duration": 16778510, "reportedSentences": 251, 
"validatedSentences": 14825, "unvalidatedSentences": 22, "clips": 3282, "splits": { "accent": {}, "age": { "": 0.28, "twenties": 0.21, "thirties": 0.39, "teens": 0, "fourties": 0.07, "fifties": 0.04, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.36, "male_masculine": 0.63, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 3281, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 1, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 75, "size": 116221215, "checksum": "51e1408f6a42373d70e5c58619df034be28766a7204b2734d55d4c82240ce543", "avgDurationSecs": 5.112, "validDurationSecs": 9309.466, "totalHrs": 4.66, "validHrs": 2.58 }, "pcm": { "buckets": { "dev": 326, "test": 326, "train": 335, "validated": 7754, "invalidated": 64, "other": 1146 }, "duration": 54050724, "reportedSentences": 0, "validatedSentences": 987, "unvalidatedSentences": -1, "clips": 8964, "splits": { "accent": {}, "age": { "": 0.73, "twenties": 0, "thirties": 0.11, "teens": 0, "fourties": 0.16, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.85, "male_masculine": 0, "female_feminine": 0.15, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8964, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 59, "size": 308334350, "checksum": "68d6dea59c9ee3f8eb160acc202796feefb2a580413055b740e562a2a11d85fc", "avgDurationSecs": 6.03, "validDurationSecs": 46754.72, "totalHrs": 15.01, "validHrs": 12.98 }, "phl": { "buckets": 
{ "dev": 1408, "test": 1377, "train": 1947, "validated": 15556, "invalidated": 83, "other": 5519 }, "duration": 105392160, "reportedSentences": 0, "validatedSentences": 4745, "unvalidatedSentences": 1184, "clips": 21158, "splits": { "accent": {}, "age": { "": 0.06, "twenties": 0.48, "thirties": 0.23, "teens": 0.06, "fourties": 0.18, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 21120, "agriculture_food": 0, "automotive_transport": 4, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 10, "history_law_government": 24, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 20, "size": 619742106, "checksum": "4aaf371feffca8920d87fccab54c5b30110428b624fbf5aa6df513b488798f3d", "avgDurationSecs": 4.981, "validDurationSecs": 77487.496, "totalHrs": 29.27, "validHrs": 21.52 }, "phr": { "buckets": { "dev": 673, "test": 656, "train": 743, "validated": 12695, "invalidated": 139, "other": 10 }, "duration": 52106364, "reportedSentences": 1, "validatedSentences": 2077, "unvalidatedSentences": -1, "clips": 12844, "splits": { "accent": {}, "age": { "": 0.29, "twenties": 0.52, "thirties": 0.17, "teens": 0, "fourties": 0.02, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.7, "male_masculine": 0, "female_feminine": 0.3, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 12844, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 63, "size": 296556266, "checksum": 
"f9105e948fa3db9fd4b6cb7bd31d7f11851197874fef8268e0b64f6571e7f16d", "avgDurationSecs": 4.057, "validDurationSecs": 51501.891, "totalHrs": 14.47, "validHrs": 14.3 }, "pl": { "buckets": { "dev": 9975, "test": 9975, "train": 24903, "validated": 138335, "invalidated": 6952, "other": 2689 }, "duration": 676710182, "reportedSentences": 604, "validatedSentences": 230081, "unvalidatedSentences": 11038, "clips": 147976, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.27, "thirties": 0.35, "teens": 0.02, "fourties": 0.11, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0.01 }, "gender": { "": 0.28, "male_masculine": 0.58, "female_feminine": 0.14, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 147959, "agriculture_food": 0, "automotive_transport": 0, "finance": 12, "service_retail": 0, "general": 3, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 2, "technology_robotics": 3 } }, "users": 3455, "size": 4784731518, "checksum": "1070367840751023da2a7b99c964ddfb1bd00df77f42b9c9e546341c1520cf04", "avgDurationSecs": 4.573, "validDurationSecs": 632620.851, "totalHrs": 187.97, "validHrs": 175.72 }, "plk": { "buckets": { "dev": 1237, "test": 662, "train": 2517, "validated": 9015, "invalidated": 960, "other": 1960 }, "duration": 59980824, "reportedSentences": 0, "validatedSentences": 4657, "unvalidatedSentences": -1, "clips": 11935, "splits": { "accent": {}, "age": { "": 0.14, "twenties": 0, "thirties": 0.16, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0.71, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 11935, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, 
"language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 10, "size": 345834596, "checksum": "54b9322f0140b9d249ea831a54d67c1a5287b7c32df06944c06cf830c8f3e7d4", "avgDurationSecs": 5.026, "validDurationSecs": 45306.002, "totalHrs": 16.66, "validHrs": 12.58 }, "prq": { "buckets": { "dev": 282, "test": 287, "train": 417, "validated": 7422, "invalidated": 68, "other": 179 }, "duration": 37282968, "reportedSentences": 0, "validatedSentences": 986, "unvalidatedSentences": -1, "clips": 7669, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.17, "thirties": 0.26, "teens": 0, "fourties": 0.57, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.16, "male_masculine": 0, "female_feminine": 0.84, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7669, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 13, "size": 202924151, "checksum": "ad6dcb53dbe2e7b600d7748b77db0238ebda964a1bdf225bdb95986ce3b79a1b", "avgDurationSecs": 4.862, "validDurationSecs": 36082.173, "totalHrs": 10.35, "validHrs": 10.02 }, "ps": { "buckets": { "dev": 13791, "test": 13791, "train": 80525, "validated": 848687, "invalidated": 51866, "other": 1507246 }, "duration": 9967341528, "reportedSentences": 4230, "validatedSentences": 246535, "unvalidatedSentences": 15712, "clips": 2407799, "splits": { "accent": {}, "age": { "": 0.06, "twenties": 0.6, "thirties": 0.2, "teens": 0.08, "fourties": 0.04, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.42, "male_masculine": 0, "female_feminine": 0.58, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, 
"sentence_domain": { "": 2053505, "agriculture_food": 24125, "automotive_transport": 3647, "finance": 6460, "service_retail": 14974, "general": 278899, "healthcare": 11001, "history_law_government": 69867, "language_fundamentals": 16045, "media_entertainment": 13696, "nature_environment": 79412, "news_current_affairs": 4381, "technology_robotics": 14030 } }, "users": 6654, "size": 53663559573, "checksum": "c2dc2c2b16e6ddfe831bd3c92c24761767fac439be809d26c0c7bbb55b3e18b4", "avgDurationSecs": 4.14, "validDurationSecs": 3513230.622, "totalHrs": 2768.7, "validHrs": 975.89 }, "pt": { "buckets": { "dev": 9659, "test": 9662, "train": 23046, "validated": 160548, "invalidated": 7825, "other": 27822 }, "duration": 821649031, "reportedSentences": 2853, "validatedSentences": 43602, "unvalidatedSentences": 96, "clips": 196195, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.36, "thirties": 0.18, "teens": 0.02, "fourties": 0.13, "fifties": 0.03, "sixties": 0.04, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.27, "male_masculine": 0.68, "female_feminine": 0.06, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 194404, "agriculture_food": 147, "automotive_transport": 37, "finance": 27, "service_retail": 24, "general": 1357, "healthcare": 191, "history_law_government": 66, "language_fundamentals": 17, "media_entertainment": 76, "nature_environment": 49, "news_current_affairs": 4, "technology_robotics": 181 } }, "users": 3804, "size": 5148282839, "checksum": "f326e333e79da755db71a0c66986c87729ad4a9d5b1630758e5fcc09dfa2d5af", "avgDurationSecs": 4.188, "validDurationSecs": 672362.235, "totalHrs": 228.23, "validHrs": 186.76 }, "pua": { "buckets": { "dev": 495, "test": 547, "train": 1153, "validated": 7536, "invalidated": 199, "other": 445 }, "duration": 40235256, "reportedSentences": 1, "validatedSentences": 2201, "unvalidatedSentences": -1, "clips": 8180, "splits": { "accent": {}, "age": { "": 0.14, "twenties": 0.55, 
"thirties": 0.31, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.72, "male_masculine": 0, "female_feminine": 0.28, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8180, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 9, "size": 228610574, "checksum": "de0c8d62ccc4b94c825efc9dbb360145a74d33b1ae7cef91ba064562a9d0c685", "avgDurationSecs": 4.919, "validDurationSecs": 37067.59, "totalHrs": 11.17, "validHrs": 10.29 }, "pwn": { "buckets": { "dev": 722, "test": 724, "train": 761, "validated": 10800, "invalidated": 138, "other": 0 }, "duration": 53298360, "reportedSentences": 0, "validatedSentences": 2207, "unvalidatedSentences": 241, "clips": 10938, "splits": { "accent": {}, "age": { "": 0.29, "twenties": 0.05, "thirties": 0.43, "teens": 0, "fourties": 0, "fifties": 0.22, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.49, "male_masculine": 0.04, "female_feminine": 0.23, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.24 }, "sentence_domain": { "": 0, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 13132, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 27, "size": 294314757, "checksum": "eefcdc85a1ef010bd40d4f03afa4ab837c54348584ecbd9fc389377d564154fb", "avgDurationSecs": 4.873, "validDurationSecs": 52625.918, "totalHrs": 14.8, "validHrs": 14.61 }, "qup": { "buckets": { "dev": 288, "test": 254, "train": 449, "validated": 7202, "invalidated": 2, "other": 2465 }, "duration": 57452508, 
"reportedSentences": 4, "validatedSentences": 991, "unvalidatedSentences": -1, "clips": 9669, "splits": { "accent": {}, "age": { "": 0.08, "twenties": 0.11, "thirties": 0.72, "teens": 0, "fourties": 0, "fifties": 0.1, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.79, "male_masculine": 0, "female_feminine": 0.21, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 9669, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 15, "size": 330081092, "checksum": "46e1197dc29db5ba3a884162c74dce7d54cfc3a893e5a06dd548443bac96cd22", "avgDurationSecs": 5.942, "validDurationSecs": 42793.77, "totalHrs": 15.95, "validHrs": 11.88 }, "qur": { "buckets": { "dev": 0, "test": 32, "train": 977, "validated": 10408, "invalidated": 27, "other": 41 }, "duration": 36351540, "reportedSentences": 0, "validatedSentences": 1010, "unvalidatedSentences": -1, "clips": 10476, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.76, "thirties": 0.24, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.19, "male_masculine": 0, "female_feminine": 0.81, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 10476, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 12, "size": 194563538, "checksum": "f10ccb583f4878442eb4a5e8360e9646c427bbc9086d7e05b5fda7926fc999ad", "avgDurationSecs": 3.47, "validDurationSecs": 36115.581, "totalHrs": 10.09, "validHrs": 10.03 }, 
"qus": { "buckets": { "dev": 314, "test": 330, "train": 354, "validated": 10323, "invalidated": 93, "other": 433 }, "duration": 41653512, "reportedSentences": 0, "validatedSentences": 998, "unvalidatedSentences": -1, "clips": 10849, "splits": { "accent": {}, "age": { "": 0.2, "twenties": 0.36, "thirties": 0.32, "teens": 0, "fourties": 0.03, "fifties": 0.09, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.54, "male_masculine": 0, "female_feminine": 0.46, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 10849, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 24, "size": 222384268, "checksum": "237121fcf9e890d51858bed67c6c5b586f1a2c6e46c171a3378a1bb18f6baf42", "avgDurationSecs": 3.839, "validDurationSecs": 39633.994, "totalHrs": 11.57, "validHrs": 11 }, "qux": { "buckets": { "dev": 280, "test": 299, "train": 399, "validated": 6156, "invalidated": 25, "other": 2008 }, "duration": 48318408, "reportedSentences": 0, "validatedSentences": 978, "unvalidatedSentences": -1, "clips": 8189, "splits": { "accent": {}, "age": { "": 0.02, "twenties": 0.24, "thirties": 0.39, "teens": 0, "fourties": 0, "fifties": 0.24, "sixties": 0.12, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male_masculine": 0, "female_feminine": 0.63, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8189, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 15, "size": 270613847, "checksum": 
"ca676f40d62b0b50d21aebda5a610c10147ebf9a5d773490d23df80ddacff2ce", "avgDurationSecs": 5.9, "validDurationSecs": 36322.887, "totalHrs": 13.42, "validHrs": 10.08 }, "quy": { "buckets": { "dev": 0, "test": 10, "train": 27, "validated": 38, "invalidated": 7, "other": 1255 }, "duration": 11227752, "reportedSentences": 0, "validatedSentences": 759, "unvalidatedSentences": 282, "clips": 1300, "splits": { "accent": {}, "age": { "": 0.02, "twenties": 0.71, "thirties": 0, "teens": 0, "fourties": 0.02, "fifties": 0, "sixties": 0.25, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.82, "male_masculine": 0, "female_feminine": 0.18, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1300, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 8, "size": 65531078, "checksum": "14cb3d8261136c2ace09b7d83a7fc1dcfd0da22dea239b40af1e9dc175bc7c7a", "avgDurationSecs": 8.637, "validDurationSecs": 328.196, "totalHrs": 3.11, "validHrs": 0.09 }, "qva": { "buckets": { "dev": 200, "test": 326, "train": 473, "validated": 8222, "invalidated": 21, "other": 23 }, "duration": 36399420, "reportedSentences": 0, "validatedSentences": 1000, "unvalidatedSentences": -1, "clips": 8266, "splits": { "accent": {}, "age": { "": 0.11, "twenties": 0.1, "thirties": 0.54, "teens": 0, "fourties": 0, "fifties": 0.12, "sixties": 0.12, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.12, "male_masculine": 0, "female_feminine": 0.88, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8266, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 
0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 13, "size": 199010373, "checksum": "1ecd84244857b93d4dda453a25651a68ee58ef415180043cce8b9949069a4b00", "avgDurationSecs": 4.404, "validDurationSecs": 36205.666, "totalHrs": 10.11, "validHrs": 10.05 }, "qvi": { "buckets": { "dev": 364, "test": 379, "train": 405, "validated": 9266, "invalidated": 237, "other": 693 }, "duration": 45140940, "reportedSentences": 0, "validatedSentences": 1149, "unvalidatedSentences": -1, "clips": 10196, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.26, "thirties": 0.64, "teens": 0.1, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.28, "male_masculine": 0, "female_feminine": 0.72, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 10196, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 15, "size": 262118236, "checksum": "34799bc58736d1665788583381e2e2547bc03b4b9df9a49a7c587dfec0356765", "avgDurationSecs": 4.427, "validDurationSecs": 41023.534, "totalHrs": 12.53, "validHrs": 11.39 }, "qvj": { "buckets": { "dev": 349, "test": 348, "train": 362, "validated": 6552, "invalidated": 8, "other": 186 }, "duration": 40215924, "reportedSentences": 0, "validatedSentences": 1059, "unvalidatedSentences": -1, "clips": 6746, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.58, "thirties": 0.02, "teens": 0, "fourties": 0.39, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2, "male_masculine": 0, "female_feminine": 0.8, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6746, "agriculture_food": 0, "automotive_transport": 0, "finance": 
0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 16, "size": 232485499, "checksum": "bfe433ba3421daf0a5804f8f518e655ea5fa2b85a09953d70512f13aaffbc329", "avgDurationSecs": 5.961, "validDurationSecs": 39059.403, "totalHrs": 11.17, "validHrs": 10.84 }, "qvl": { "buckets": { "dev": 222, "test": 321, "train": 457, "validated": 9082, "invalidated": 5, "other": 13 }, "duration": 36219708, "reportedSentences": 0, "validatedSentences": 1000, "unvalidatedSentences": -1, "clips": 9100, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.83, "thirties": 0.11, "teens": 0.06, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.39, "male_masculine": 0, "female_feminine": 0.61, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 9100, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 19, "size": 201087771, "checksum": "f4559bce00466942b0a8e55590927112e8cf351bc45d8419a1c571dc0e6e84a4", "avgDurationSecs": 3.98, "validDurationSecs": 36148.065, "totalHrs": 10.06, "validHrs": 10.04 }, "qwa": { "buckets": { "dev": 106, "test": 308, "train": 586, "validated": 7217, "invalidated": 1, "other": 1147 }, "duration": 41537916, "reportedSentences": 0, "validatedSentences": 1000, "unvalidatedSentences": -1, "clips": 8365, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.12, "thirties": 0.32, "teens": 0, "fourties": 0.2, "fifties": 0.36, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.6, "male_masculine": 0, "female_feminine": 0.4, "transgender": 0, 
"non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8365, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 13, "size": 220832456, "checksum": "88a62630ce0768c38b6a0f7470a3232266794c6ae5cf84ba7fa9d6b2221e34f6", "avgDurationSecs": 4.966, "validDurationSecs": 35837.315, "totalHrs": 11.53, "validHrs": 9.95 }, "qws": { "buckets": { "dev": 0, "test": 61, "train": 937, "validated": 8879, "invalidated": 2, "other": 96 }, "duration": 37376136, "reportedSentences": 0, "validatedSentences": 998, "unvalidatedSentences": -1, "clips": 8977, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0.55, "thirties": 0.11, "teens": 0, "fourties": 0.11, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.44, "male_masculine": 0, "female_feminine": 0.56, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8977, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 12, "size": 211202213, "checksum": "266632544eb4aa83b13d013c0c97c020365f5e7e4d239ba8263f5414ace24090", "avgDurationSecs": 4.164, "validDurationSecs": 36968.109, "totalHrs": 10.38, "validHrs": 10.26 }, "qxa": { "buckets": { "dev": 233, "test": 315, "train": 451, "validated": 8353, "invalidated": 8, "other": 359 }, "duration": 37918800, "reportedSentences": 0, "validatedSentences": 999, "unvalidatedSentences": -1, "clips": 8720, "splits": { "accent": {}, "age": { "": 0.02, "twenties": 0.67, "thirties": 0.2, "teens": 0, "fourties": 0.11, "fifties": 0, "sixties": 0, 
"seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.45, "male_masculine": 0, "female_feminine": 0.55, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8720, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 18, "size": 208033129, "checksum": "8b23b4d8ef2a7cddd582e412e7a8ba13c90058c890a6c34f2c7d314ffc322cbc", "avgDurationSecs": 4.348, "validDurationSecs": 36322.906, "totalHrs": 10.53, "validHrs": 10.08 }, "qxp": { "buckets": { "dev": 0, "test": 561, "train": 1503, "validated": 7679, "invalidated": 213, "other": 1141 }, "duration": 41874264, "reportedSentences": 53, "validatedSentences": 2065, "unvalidatedSentences": -1, "clips": 9033, "splits": { "accent": {}, "age": { "": 0.01, "twenties": 0.76, "thirties": 0.23, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.46, "male_masculine": 0, "female_feminine": 0.54, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 0, "agriculture_food": 8998, "automotive_transport": 0, "finance": 43, "service_retail": 0, "general": 7043, "healthcare": 7055, "history_law_government": 485, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 3488, "news_current_affairs": 58, "technology_robotics": 0 } }, "users": 14, "size": 241523276, "checksum": "31afe9f6c1aa3241a72436d14c9e0a5f4571b110f6d8ccd8a1d01052cfee99ab", "avgDurationSecs": 4.636, "validDurationSecs": 35597.528, "totalHrs": 11.63, "validHrs": 9.88 }, "qxt": { "buckets": { "dev": 60, "test": 321, "train": 616, "validated": 8647, "invalidated": 12, "other": 2469 }, "duration": 47738988, "reportedSentences": 0, "validatedSentences": 998, "unvalidatedSentences": -1, 
"clips": 11128, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.19, "thirties": 0.18, "teens": 0, "fourties": 0.54, "fifties": 0.09, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.27, "male_masculine": 0, "female_feminine": 0.73, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 11128, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 16, "size": 273938595, "checksum": "27dcfd0f8307753b66bfb50406b1b17885482fb9d03211d6654ac60c750613cf", "avgDurationSecs": 4.29, "validDurationSecs": 37095.527, "totalHrs": 13.26, "validHrs": 10.3 }, "qxu": { "buckets": { "dev": 0, "test": 325, "train": 673, "validated": 8712, "invalidated": 298, "other": 1899 }, "duration": 47439360, "reportedSentences": 0, "validatedSentences": 998, "unvalidatedSentences": -1, "clips": 10909, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.47, "thirties": 0.34, "teens": 0.09, "fourties": 0.09, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.46, "male_masculine": 0, "female_feminine": 0.54, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 10909, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 16, "size": 272676268, "checksum": "057cbb265953728900cfab48882a24391502e5bff4f9c33580d0238cef5d5959", "avgDurationSecs": 4.349, "validDurationSecs": 37885.389, "totalHrs": 13.17, "validHrs": 10.52 }, "qxw": { "buckets": { "dev": 167, "test": 227, "train": 608, "validated": 
8021, "invalidated": 82, "other": 1150 }, "duration": 48633444, "reportedSentences": 0, "validatedSentences": 1002, "unvalidatedSentences": -1, "clips": 9253, "splits": { "accent": {}, "age": { "": 0.02, "twenties": 0.54, "thirties": 0.11, "teens": 0, "fourties": 0.22, "fifties": 0, "sixties": 0, "seventies": 0.11, "eighties": 0, "nineties": 0 }, "gender": { "": 0.46, "male_masculine": 0, "female_feminine": 0.54, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 9253, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 14, "size": 275514194, "checksum": "e2105521891aad5524d884b3768e5e584819e1df9035f3bb096de930ff64849b", "avgDurationSecs": 5.256, "validDurationSecs": 42158.095, "totalHrs": 13.5, "validHrs": 11.71 }, "rm-sursilv": { "buckets": { "dev": 1369, "test": 1419, "train": 1604, "validated": 4411, "invalidated": 687, "other": 2406 }, "duration": 40806869, "reportedSentences": 17, "validatedSentences": 8780, "unvalidatedSentences": 7455, "clips": 7504, "splits": { "accent": {}, "age": { "": 0.61, "twenties": 0.09, "thirties": 0.07, "teens": 0.05, "fourties": 0.18, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.65, "male_masculine": 0.17, "female_feminine": 0.18, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7504, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 91, "size": 306286617, "checksum": "fe04ea27af47298e1ec01b64d7e36f5f87c496d4b707909a6066f0a671ca3426", 
"avgDurationSecs": 5.438, "validDurationSecs": 23987.087, "totalHrs": 11.33, "validHrs": 6.66 }, "rm-vallader": { "buckets": { "dev": 500, "test": 463, "train": 557, "validated": 1527, "invalidated": 394, "other": 740 }, "duration": 15519074, "reportedSentences": 38, "validatedSentences": 5913, "unvalidatedSentences": 1172, "clips": 2661, "splits": { "accent": {}, "age": { "": 0.37, "twenties": 0.14, "thirties": 0.06, "teens": 0, "fourties": 0.41, "fifties": 0, "sixties": 0.03, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male_masculine": 0.44, "female_feminine": 0.19, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2661, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 55, "size": 118485233, "checksum": "8b1c3bcd86f0dddd121218222fd4d9e8b88a9f38181f601671effc328db77b67", "avgDurationSecs": 5.832, "validDurationSecs": 8905.534, "totalHrs": 4.31, "validHrs": 2.47 }, "ro": { "buckets": { "dev": 3927, "test": 3912, "train": 5192, "validated": 19569, "invalidated": 1093, "other": 22919 }, "duration": 176104975, "reportedSentences": 487, "validatedSentences": 13753, "unvalidatedSentences": 89202, "clips": 43581, "splits": { "accent": {}, "age": { "": 0.15, "twenties": 0.59, "thirties": 0.13, "teens": 0.02, "fourties": 0.09, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.17, "male_masculine": 0.7, "female_feminine": 0.13, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 43581, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, 
"nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 459, "size": 1096596634, "checksum": "485f929bb94328403e43bfd9e930e7815b5fc7b1cdf776b3cdac27ba5c425a3c", "avgDurationSecs": 4.041, "validDurationSecs": 79075.704, "totalHrs": 48.91, "validHrs": 21.96 }, "rof": { "buckets": { "dev": 461, "test": 461, "train": 477, "validated": 9557, "invalidated": 835, "other": 0 }, "duration": 41952420, "reportedSentences": 0, "validatedSentences": 1399, "unvalidatedSentences": -1, "clips": 10392, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.35, "thirties": 0.41, "teens": 0, "fourties": 0.23, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male_masculine": 0, "female_feminine": 0.63, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 10392, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 24, "size": 239899208, "checksum": "42454b644bd2374c5ec0424f0b255391090b9cd7ae157a14377cd352cdea3644", "avgDurationSecs": 4.037, "validDurationSecs": 38581.532, "totalHrs": 11.65, "validHrs": 10.71 }, "ru": { "buckets": { "dev": 10260, "test": 10261, "train": 26772, "validated": 173750, "invalidated": 10594, "other": 16982 }, "duration": 1044851472, "reportedSentences": 577, "validatedSentences": 47849, "unvalidatedSentences": 230, "clips": 201326, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.33, "thirties": 0.18, "teens": 0.09, "fourties": 0.13, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male_masculine": 0.59, "female_feminine": 0.15, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 201199, "agriculture_food": 4, 
"automotive_transport": 4, "finance": 8, "service_retail": 12, "general": 41, "healthcare": 6, "history_law_government": 44, "language_fundamentals": 4, "media_entertainment": 18, "nature_environment": 5, "news_current_affairs": 3, "technology_robotics": 16 } }, "users": 3636, "size": 7008716262, "checksum": "9a2ed32a0574f74f505cd7740a599f0b9edc9f52ba1e7d6624b66f258db4c0ea", "avgDurationSecs": 5.19, "validDurationSecs": 901736.205, "totalHrs": 290.23, "validHrs": 250.48 }, "rup": { "buckets": { "dev": 5, "test": 8, "train": 16, "validated": 29, "invalidated": 17, "other": 16 }, "duration": 370260, "reportedSentences": 0, "validatedSentences": 3507, "unvalidatedSentences": -1, "clips": 62, "splits": { "accent": {}, "age": { "": 0.32, "twenties": 0.02, "thirties": 0.26, "teens": 0.24, "fourties": 0, "fifties": 0.16, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.58, "male_masculine": 0, "female_feminine": 0.42, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 62, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 9, "size": 2309540, "checksum": "31b2483416f226ff7dc6b52ce3a70c781e6627cab96e5f7090c0331a4316fd8c", "avgDurationSecs": 5.972, "validDurationSecs": 173.186, "totalHrs": 0.1, "validHrs": 0.04 }, "rw": { "buckets": { "dev": 15987, "test": 16213, "train": 1003033, "validated": 1439127, "invalidated": 227838, "other": 47253 }, "duration": 8583672569, "reportedSentences": 634, "validatedSentences": 1404857, "unvalidatedSentences": 8, "clips": 1714218, "splits": { "accent": {}, "age": { "": 0.05, "twenties": 0.61, "thirties": 0.12, "teens": 0.2, "fourties": 0.02, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.1, 
"male_masculine": 0.57, "female_feminine": 0.33, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1714218, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1179, "size": 61200840427, "checksum": "b61528d19c78687020b28328c916f0a517ff57fbb17749e30ce6f5f43bdee834", "avgDurationSecs": 5.007, "validDurationSecs": 7206198.368, "totalHrs": 2384.35, "validHrs": 2001.72 }, "sah": { "buckets": { "dev": 1746, "test": 1808, "train": 2313, "validated": 9282, "invalidated": 477, "other": 3341 }, "duration": 82989732, "reportedSentences": 26, "validatedSentences": 6076, "unvalidatedSentences": 385, "clips": 13100, "splits": { "accent": {}, "age": { "": 0.31, "twenties": 0.23, "thirties": 0.15, "teens": 0.03, "fourties": 0.1, "fifties": 0.18, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.32, "male_masculine": 0.24, "female_feminine": 0.44, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 13100, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 247, "size": 527460914, "checksum": "96e747ace1cbfd7d97775f8ecd454ea53683a6c521ccdbee2bfe8952714cfd0b", "avgDurationSecs": 6.335, "validDurationSecs": 58802.343, "totalHrs": 23.05, "validHrs": 16.33 }, "sat": { "buckets": { "dev": 68, "test": 127, "train": 333, "validated": 528, "invalidated": 18, "other": 303 }, "duration": 3801708, "reportedSentences": 7, "validatedSentences": 5421, "unvalidatedSentences": 540, "clips": 849, "splits": { "accent": {}, "age": { 
"": 0.42, "twenties": 0.42, "thirties": 0.12, "teens": 0.01, "fourties": 0.01, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.4, "male_masculine": 0.59, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 849, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 13, "size": 21352261, "checksum": "016b9d7b4488b688147e480956d81ff41584cb61e31b14904130fca4505e95cf", "avgDurationSecs": 4.478, "validDurationSecs": 2364.313, "totalHrs": 1.05, "validHrs": 0.65 }, "sbn": { "buckets": { "dev": 0, "test": 247, "train": 1754, "validated": 10957, "invalidated": 12, "other": 18 }, "duration": 38755044, "reportedSentences": 0, "validatedSentences": 2001, "unvalidatedSentences": -1, "clips": 10987, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.55, "thirties": 0, "teens": 0.19, "fourties": 0.26, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 10987, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 21, "size": 226528570, "checksum": "5184a7f673b0020c469e718b17213d18f6e40f363334a3c27fe823436debce0f", "avgDurationSecs": 3.527, "validDurationSecs": 38649.223, "totalHrs": 10.76, "validHrs": 10.73 }, "sc": { "buckets": { "dev": 551, "test": 637, "train": 926, "validated": 2288, "invalidated": 120, "other": 407 }, "duration": 
13224744, "reportedSentences": 36, "validatedSentences": 5473, "unvalidatedSentences": 361, "clips": 2815, "splits": { "accent": {}, "age": { "": 0.45, "twenties": 0.09, "thirties": 0.28, "teens": 0, "fourties": 0.01, "fifties": 0.15, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.55, "male_masculine": 0.12, "female_feminine": 0.33, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2800, "agriculture_food": 2, "automotive_transport": 1, "finance": 1, "service_retail": 0, "general": 9, "healthcare": 2, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 1, "nature_environment": 1, "news_current_affairs": 3, "technology_robotics": 0 } }, "users": 41, "size": 76855546, "checksum": "1c16219cc05fc6e18c1d77709ab43c8b0e37723332c573ce12ddfe772d848e79", "avgDurationSecs": 4.698, "validDurationSecs": 10748.922, "totalHrs": 3.67, "validHrs": 2.98 }, "scl": { "buckets": { "dev": 862, "test": 984, "train": 1407, "validated": 8876, "invalidated": 146, "other": 3 }, "duration": 37147104, "reportedSentences": 0, "validatedSentences": 3300, "unvalidatedSentences": 4, "clips": 9025, "splits": { "accent": {}, "age": { "": 0.16, "twenties": 0.02, "thirties": 0.82, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.98, "male_masculine": 0, "female_feminine": 0.02, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 9021, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 4, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 40, "size": 217100806, "checksum": "bab306d96c7d7d0ac7eaf676270e92194df0f33da5c23bcf35835e73f6364531", "avgDurationSecs": 4.116, "validDurationSecs": 36533.817, "totalHrs": 10.31, 
"validHrs": 10.14 }, "sd": { "buckets": { "dev": 0, "test": 42, "train": 277, "validated": 319, "invalidated": 197, "other": 34384 }, "duration": 143157636, "reportedSentences": 6, "validatedSentences": 13348, "unvalidatedSentences": 71, "clips": 34900, "splits": { "accent": {}, "age": { "": 0.02, "twenties": 0, "thirties": 0.11, "teens": 0, "fourties": 0.87, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.89, "male_masculine": 0.11, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 34808, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 89, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 3, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 26, "size": 755876846, "checksum": "b65a12ef79e5458d9432027cbfa5472b070bb1325bc177343b3f761aac8fda94", "avgDurationSecs": 4.102, "validDurationSecs": 1308.518, "totalHrs": 39.76, "validHrs": 0.36 }, "sei": { "buckets": { "dev": 435, "test": 452, "train": 728, "validated": 8006, "invalidated": 254, "other": 0 }, "duration": 37573452, "reportedSentences": 0, "validatedSentences": 1615, "unvalidatedSentences": -1, "clips": 8260, "splits": { "accent": {}, "age": { "": 0.01, "twenties": 0.23, "thirties": 0.27, "teens": 0, "fourties": 0.49, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.18, "male_masculine": 0, "female_feminine": 0.81, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.01 }, "sentence_domain": { "": 8260, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 16, "size": 218623843, "checksum": 
"7f56cbcb9f0c95d371b2cb863488d914d11bc03f01eae8ef2c45dd77056d4cf9", "avgDurationSecs": 4.549, "validDurationSecs": 36418.046, "totalHrs": 10.43, "validHrs": 10.11 }, "sk": { "buckets": { "dev": 5090, "test": 5239, "train": 7846, "validated": 44126, "invalidated": 1219, "other": 1373 }, "duration": 200036880, "reportedSentences": 347, "validatedSentences": 20639, "unvalidatedSentences": 2347, "clips": 46718, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0.07, "thirties": 0.18, "teens": 0.05, "fourties": 0.12, "fifties": 0.36, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.62, "male_masculine": 0.35, "female_feminine": 0.04, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 33942, "agriculture_food": 3, "automotive_transport": 263, "finance": 2, "service_retail": 0, "general": 12454, "healthcare": 4, "history_law_government": 7803, "language_fundamentals": 101, "media_entertainment": 405, "nature_environment": 260, "news_current_affairs": 278, "technology_robotics": 895 } }, "users": 269, "size": 1154587658, "checksum": "9228980a35d828225c66215ac342840b633d62927afd27de46b2678eacd361b9", "avgDurationSecs": 4.282, "validDurationSecs": 188938.468, "totalHrs": 55.56, "validHrs": 52.48 }, "skr": { "buckets": { "dev": 1144, "test": 1037, "train": 1557, "validated": 3750, "invalidated": 467, "other": 1620 }, "duration": 24307020, "reportedSentences": 17, "validatedSentences": 6011, "unvalidatedSentences": 93, "clips": 5837, "splits": { "accent": {}, "age": { "": 0.35, "twenties": 0.24, "thirties": 0.01, "teens": 0.02, "fourties": 0, "fifties": 0.37, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male_masculine": 0.63, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5837, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, 
"history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 60, "size": 139959301, "checksum": "f8cb9841fc74b8f954a189d24492bbecdb5a26610c0db700bd8abe3898373576", "avgDurationSecs": 4.164, "validDurationSecs": 15616.126, "totalHrs": 6.75, "validHrs": 4.33 }, "sl": { "buckets": { "dev": 1337, "test": 1340, "train": 1463, "validated": 15650, "invalidated": 511, "other": 3364 }, "duration": 78119074, "reportedSentences": 55, "validatedSentences": 4209, "unvalidatedSentences": 29, "clips": 19525, "splits": { "accent": {}, "age": { "": 0.37, "twenties": 0.31, "thirties": 0.03, "teens": 0.05, "fourties": 0.15, "fifties": 0.05, "sixties": 0.04, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.39, "male_masculine": 0.52, "female_feminine": 0.1, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 19523, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 1, "healthcare": 0, "history_law_government": 1, "language_fundamentals": 0, "media_entertainment": 1, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1002, "size": 511824211, "checksum": "044c8a2e25f1c3f08c203c9f57caa1cad227ba31ec46cdf615af7631e6253d6f", "avgDurationSecs": 4.001, "validDurationSecs": 62615.289, "totalHrs": 21.69, "validHrs": 17.39 }, "sq": { "buckets": { "dev": 1795, "test": 1874, "train": 2658, "validated": 6337, "invalidated": 180, "other": 21 }, "duration": 33475068, "reportedSentences": 144, "validatedSentences": 50009, "unvalidatedSentences": 2635, "clips": 6538, "splits": { "accent": {}, "age": { "": 0.24, "twenties": 0.09, "thirties": 0.16, "teens": 0, "fourties": 0.18, "fifties": 0.34, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.39, "male_masculine": 0.47, "female_feminine": 0.14, "transgender": 0, "non-binary": 0, 
"do_not_wish_to_say": 0 }, "sentence_domain": { "": 1576, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 1801, "healthcare": 0, "history_law_government": 3136, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 25, "technology_robotics": 0 } }, "users": 150, "size": 199485140, "checksum": "d1598491e686237a7e9c8b688bfd1d1203e3c08f71395e623fa030bf2810b256", "avgDurationSecs": 5.12, "validDurationSecs": 32445.932, "totalHrs": 9.29, "validHrs": 9.01 }, "sr": { "buckets": { "dev": 1816, "test": 1915, "train": 2522, "validated": 8300, "invalidated": 425, "other": 5029 }, "duration": 44748936, "reportedSentences": 185, "validatedSentences": 8066, "unvalidatedSentences": 661, "clips": 13754, "splits": { "accent": {}, "age": { "": 0.14, "twenties": 0.25, "thirties": 0.12, "teens": 0, "fourties": 0.1, "fifties": 0.13, "sixties": 0.26, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.54, "male_masculine": 0.31, "female_feminine": 0.15, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 13705, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 13, "healthcare": 0, "history_law_government": 1, "language_fundamentals": 3, "media_entertainment": 0, "nature_environment": 2, "news_current_affairs": 0, "technology_robotics": 31 } }, "users": 183, "size": 250988892, "checksum": "6243b57a8ce83b37f2d2d2fc17f66289d37b0eb80d296a117c0ebc143521a3e8", "avgDurationSecs": 3.254, "validDurationSecs": 27004.229, "totalHrs": 12.43, "validHrs": 7.5 }, "ss": { "buckets": { "dev": 0, "test": 0, "train": 0, "validated": 0, "invalidated": 0, "other": 111 }, "duration": 825840, "reportedSentences": 0, "validatedSentences": 1614, "unvalidatedSentences": 6796, "clips": 111, "splits": { "accent": {}, "age": { "": 0.45, "twenties": 0.55, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, 
"seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 111, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 2, "size": 5075058, "checksum": "205e5a5d055ccd73e332c5d27ebcda25b074496a7019065d1f4d59375d0dcd42", "avgDurationSecs": 7.44, "validDurationSecs": 0, "totalHrs": 0.22, "validHrs": 0 }, "ssi": { "buckets": { "dev": 0, "test": 161, "train": 1845, "validated": 10188, "invalidated": 12, "other": 0 }, "duration": 37951740, "reportedSentences": 0, "validatedSentences": 2007, "unvalidatedSentences": -1, "clips": 10200, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.6, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0.39, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.99, "male_masculine": 0, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 10200, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 21, "size": 221549575, "checksum": "a2085ac91e1eb686756187de472a1b4305f9b448ed6cc80a2e928017eaf6ae60", "avgDurationSecs": 3.721, "validDurationSecs": 37907.091, "totalHrs": 10.54, "validHrs": 10.52 }, "st": { "buckets": { "dev": 0, "test": 0, "train": 1, "validated": 1, "invalidated": 0, "other": 208 }, "duration": 673668, "reportedSentences": 0, "validatedSentences": 2339, "unvalidatedSentences": 14837, "clips": 209, "splits": { "accent": {}, "age": { "": 
0, "twenties": 0.29, "thirties": 0.71, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.29, "male_masculine": 0.71, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 209, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 2, "size": 4961419, "checksum": "0410f551552a71c2d4889273236202907ec723bb99fda4331c0ea44c4a640750", "avgDurationSecs": 3.223, "validDurationSecs": 3.223, "totalHrs": 0.18, "validHrs": 0 }, "sv-SE": { "buckets": { "dev": 5455, "test": 5489, "train": 8232, "validated": 42543, "invalidated": 1598, "other": 6046 }, "duration": 201935247, "reportedSentences": 597, "validatedSentences": 26257, "unvalidatedSentences": 7125, "clips": 50187, "splits": { "accent": {}, "age": { "": 0.19, "twenties": 0.12, "thirties": 0.25, "teens": 0.03, "fourties": 0.37, "fifties": 0.03, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.21, "male_masculine": 0.47, "female_feminine": 0.32, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 50184, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 2, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 1, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 885, "size": 1275732190, "checksum": "cd66ab8d6cba060cac10a005b5f0369a5ac9371cd5d13cec7c24958e576c0610", "avgDurationSecs": 4.024, "validDurationSecs": 171178.417, "totalHrs": 56.09, "validHrs": 47.54 }, "sva": { "buckets": { "dev": 467, "test": 503, "train": 567, "validated": 9567, "invalidated": 1376, "other": 16 }, 
"duration": 64961820, "reportedSentences": 9, "validatedSentences": 1538, "unvalidatedSentences": 11, "clips": 10959, "splits": { "accent": {}, "age": { "": 0.07, "twenties": 0.1, "thirties": 0.2, "teens": 0, "fourties": 0.24, "fifties": 0.21, "sixties": 0.13, "seventies": 0.04, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2, "male_masculine": 0, "female_feminine": 0.8, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 10593, "agriculture_food": 108, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 96, "healthcare": 162, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 55, "size": 369385417, "checksum": "f524918b5aaa9974d92b794a59471d9a43966c7c9dd9fbce460434ad523ba998", "avgDurationSecs": 5.928, "validDurationSecs": 56710.442, "totalHrs": 18.04, "validHrs": 15.75 }, "sw": { "buckets": { "dev": 12269, "test": 12269, "train": 46704, "validated": 268255, "invalidated": 80837, "other": 379914 }, "duration": 3934927116, "reportedSentences": 1902, "validatedSentences": 134725, "unvalidatedSentences": 5758, "clips": 729006, "splits": { "accent": {}, "age": { "": 0.3, "twenties": 0.5, "thirties": 0.11, "teens": 0.01, "fourties": 0.03, "fifties": 0.04, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.29, "male_masculine": 0.34, "female_feminine": 0.37, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 729006, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1497, "size": 22967482338, "checksum": "709aa10aadbc920e36e51bb238185dde5400f26e24f212f5d50a637a67d8e4e6", "avgDurationSecs": 
5.398, "validDurationSecs": 1447949.5, "totalHrs": 1093.03, "validHrs": 402.2 }, "szy": { "buckets": { "dev": 257, "test": 264, "train": 280, "validated": 9158, "invalidated": 485, "other": 0 }, "duration": 52040448, "reportedSentences": 0, "validatedSentences": 801, "unvalidatedSentences": 2, "clips": 9643, "splits": { "accent": {}, "age": { "": 0.49, "twenties": 0.13, "thirties": 0.08, "teens": 0, "fourties": 0, "fifties": 0.11, "sixties": 0.1, "seventies": 0.08, "eighties": 0, "nineties": 0 }, "gender": { "": 0.26, "male_masculine": 0, "female_feminine": 0.35, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.39 }, "sentence_domain": { "": 27, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 9825, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 9, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 26, "size": 294380934, "checksum": "212dcc3f01dc91273007f706ff98ff6677d25dd9e7914071efb75cd50e612169", "avgDurationSecs": 5.397, "validDurationSecs": 49423.045, "totalHrs": 14.45, "validHrs": 13.72 }, "ta": { "buckets": { "dev": 12163, "test": 12230, "train": 46411, "validated": 136580, "invalidated": 5745, "other": 105244 }, "duration": 1534227252, "reportedSentences": 3438, "validatedSentences": 118260, "unvalidatedSentences": 577, "clips": 247569, "splits": { "accent": {}, "age": { "": 0.66, "twenties": 0.11, "thirties": 0.12, "teens": 0.04, "fourties": 0.03, "fifties": 0.02, "sixties": 0, "seventies": 0.02, "eighties": 0, "nineties": 0 }, "gender": { "": 0.66, "male_masculine": 0.19, "female_feminine": 0.15, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 247569, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, 
"news_current_affairs": 0, "technology_robotics": 0 } }, "users": 975, "size": 9196249656, "checksum": "f62fbf8c6b44ba63c2ee58da74ce12d08147851b844a00218915b4817750a4af", "avgDurationSecs": 6.197, "validDurationSecs": 846409.518, "totalHrs": 426.17, "validHrs": 235.11 }, "tar": { "buckets": { "dev": 406, "test": 404, "train": 430, "validated": 7889, "invalidated": 23, "other": 10 }, "duration": 36039168, "reportedSentences": 32, "validatedSentences": 1258, "unvalidatedSentences": -1, "clips": 7922, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.11, "thirties": 0.47, "teens": 0, "fourties": 0.38, "fifties": 0.04, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.34, "male_masculine": 0, "female_feminine": 0.66, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7922, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 17, "size": 211005134, "checksum": "d126a9e6662bfb16bbe9b26aa09652d0dd2b730f03f693fad37e3621051b6df3", "avgDurationSecs": 4.549, "validDurationSecs": 35889.043, "totalHrs": 10.01, "validHrs": 9.96 }, "tay": { "buckets": { "dev": 452, "test": 851, "train": 1299, "validated": 7446, "invalidated": 411, "other": 0 }, "duration": 44092728, "reportedSentences": 0, "validatedSentences": 3790, "unvalidatedSentences": 77, "clips": 7857, "splits": { "accent": {}, "age": { "": 0.33, "twenties": 0.22, "thirties": 0, "teens": 0, "fourties": 0.13, "fifties": 0.18, "sixties": 0.15, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.22, "male_masculine": 0, "female_feminine": 0.46, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.33 }, "sentence_domain": { "": 291, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, 
"service_retail": 0, "general": 9021, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 18, "size": 260194128, "checksum": "fc18314c67752531deee56c5fa549687c5588003e6bd2df6bcec3743312d3a2f", "avgDurationSecs": 5.612, "validDurationSecs": 41786.236, "totalHrs": 12.24, "validHrs": 11.6 }, "te": { "buckets": { "dev": 66, "test": 69, "train": 74, "validated": 363, "invalidated": 59, "other": 2031 }, "duration": 10282608, "reportedSentences": 15, "validatedSentences": 336, "unvalidatedSentences": 19289, "clips": 2453, "splits": { "accent": {}, "age": { "": 0.13, "twenties": 0.69, "thirties": 0.02, "teens": 0.07, "fourties": 0.01, "fifties": 0, "sixties": 0.07, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.19, "male_masculine": 0.64, "female_feminine": 0.17, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2453, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 61, "size": 61298335, "checksum": "18b9c95fd488185388c56952a3ebf3ff449d9586183c767a5d189e6ce5dd3bd0", "avgDurationSecs": 4.192, "validDurationSecs": 1521.642, "totalHrs": 2.85, "validHrs": 0.42 }, "tg": { "buckets": { "dev": 123, "test": 119, "train": 252, "validated": 494, "invalidated": 42, "other": 76 }, "duration": 3037968, "reportedSentences": 18, "validatedSentences": 3991, "unvalidatedSentences": 6780, "clips": 612, "splits": { "accent": {}, "age": { "": 0.02, "twenties": 0.98, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.83, "male_masculine": 0, "female_feminine": 0, "transgender": 0, 
"non-binary": 0, "do_not_wish_to_say": 0.17 }, "sentence_domain": { "": 39, "agriculture_food": 7, "automotive_transport": 11, "finance": 15, "service_retail": 16, "general": 108, "healthcare": 12, "history_law_government": 29, "language_fundamentals": 299, "media_entertainment": 143, "nature_environment": 5, "news_current_affairs": 52, "technology_robotics": 9 } }, "users": 7, "size": 18178867, "checksum": "8cdb1433f285efc9ac413ec879034588bc2fcc8d2835cb57d047cd5c8c9dec3a", "avgDurationSecs": 4.964, "validDurationSecs": 2452.216, "totalHrs": 0.84, "validHrs": 0.68 }, "th": { "buckets": { "dev": 11059, "test": 11059, "train": 32971, "validated": 148714, "invalidated": 9358, "other": 208329 }, "duration": 1536467772, "reportedSentences": 4318, "validatedSentences": 64491, "unvalidatedSentences": 121701, "clips": 366401, "splits": { "accent": {}, "age": { "": 0.4, "twenties": 0.24, "thirties": 0.07, "teens": 0.04, "fourties": 0.04, "fifties": 0.2, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.41, "male_masculine": 0.42, "female_feminine": 0.17, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 366401, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 7982, "size": 8964895041, "checksum": "f3ff5f97625cb56ecdfafe5d0a37bfb70646aca7f50eab1074bcffceeddacb3a", "avgDurationSecs": 4.193, "validDurationSecs": 623618.026, "totalHrs": 426.79, "validHrs": 173.22 }, "ti": { "buckets": { "dev": 21, "test": 17, "train": 32, "validated": 70, "invalidated": 6, "other": 375 }, "duration": 2342484, "reportedSentences": 3, "validatedSentences": 4570, "unvalidatedSentences": 111, "clips": 451, "splits": { "accent": {}, "age": { "": 0.18, "twenties": 0.31, "thirties": 0.03, "teens": 
0, "fourties": 0.49, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.96, "male_masculine": 0.04, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 451, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 16, "size": 14067846, "checksum": "115e0cd6ba7886a2eb1d5b9306d9123af5a7165a6587089da1fde95f4ea0a021", "avgDurationSecs": 5.194, "validDurationSecs": 363.578, "totalHrs": 0.65, "validHrs": 0.1 }, "tig": { "buckets": { "dev": 1615, "test": 1609, "train": 1990, "validated": 6433, "invalidated": 245, "other": 92 }, "duration": 42956856, "reportedSentences": 39, "validatedSentences": 5417, "unvalidatedSentences": 90, "clips": 6770, "splits": { "accent": {}, "age": { "": 0.7, "twenties": 0.21, "thirties": 0.1, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.99, "male_masculine": 0.01, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6757, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 6, "healthcare": 0, "history_law_government": 6, "language_fundamentals": 1, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 151, "size": 230415362, "checksum": "5d68f963754089ec2085a170fbb1d2530a46884259c85043fb283e041d260889", "avgDurationSecs": 6.345, "validDurationSecs": 40818.531, "totalHrs": 11.93, "validHrs": 11.33 }, "tk": { "buckets": { "dev": 522, "test": 516, "train": 734, "validated": 1999, "invalidated": 135, "other": 2813 }, "duration": 27210240, "reportedSentences": 27, "validatedSentences": 
2494, "unvalidatedSentences": 238, "clips": 4947, "splits": { "accent": {}, "age": { "": 0.36, "twenties": 0.53, "thirties": 0.08, "teens": 0, "fourties": 0.02, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.42, "male_masculine": 0.41, "female_feminine": 0.17, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 4947, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 141, "size": 152311909, "checksum": "3a6b65006014e9e6e2a8060f0c338c5e60ac38b39c1013522d7d3038ba09e342", "avgDurationSecs": 5.5, "validDurationSecs": 10995.203, "totalHrs": 7.55, "validHrs": 3.05 }, "tli": { "buckets": { "dev": 0, "test": 524, "train": 2316, "validated": 2840, "invalidated": 0, "other": 0 }, "duration": 36638964, "reportedSentences": 0, "validatedSentences": 7863, "unvalidatedSentences": 10, "clips": 2840, "splits": { "accent": {}, "age": { "": 0.82, "twenties": 0, "thirties": 0.18, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2840, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 3, "size": 217513417, "checksum": "60deb24a69131763cccdff92492c2ede732407b944dfb6b32c9c9eabf68cd7b8", "avgDurationSecs": 12.901, "validDurationSecs": 36638.964, "totalHrs": 10.17, "validHrs": 10.17 }, "tn": { "buckets": { "dev": 311, "test": 369, "train": 
1079, "validated": 3506, "invalidated": 11, "other": 532 }, "duration": 17693028, "reportedSentences": 1, "validatedSentences": 1765, "unvalidatedSentences": 15342, "clips": 4049, "splits": { "accent": {}, "age": { "": 0.03, "twenties": 0.02, "thirties": 0.51, "teens": 0, "fourties": 0, "fifties": 0.43, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.48, "male_masculine": 0, "female_feminine": 0.48, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.04 }, "sentence_domain": { "": 4049, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 17, "size": 89812531, "checksum": "5eab9c278d45f0002d82343e0b2e138ec6c98a1e0ec44d48c1469903be956f51", "avgDurationSecs": 4.37, "validDurationSecs": 15320.266, "totalHrs": 4.91, "validHrs": 4.25 }, "tok": { "buckets": { "dev": 2250, "test": 2251, "train": 2635, "validated": 12954, "invalidated": 399, "other": 5973 }, "duration": 83304972, "reportedSentences": 165, "validatedSentences": 7502, "unvalidatedSentences": 2257, "clips": 19326, "splits": { "accent": {}, "age": { "": 0.31, "twenties": 0.19, "thirties": 0.06, "teens": 0.21, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0.23, "nineties": 0 }, "gender": { "": 0.4, "male_masculine": 0.52, "female_feminine": 0.05, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.02 }, "sentence_domain": { "": 19324, "agriculture_food": 0, "automotive_transport": 0, "finance": 2, "service_retail": 0, "general": 2, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 171, "size": 487505279, "checksum": 
"193da8657cab1eacfe7fb5cc78a7392f95c99ec86d026df0551afc866ecad17c", "avgDurationSecs": 4.311, "validDurationSecs": 55838.384, "totalHrs": 23.14, "validHrs": 15.51 }, "tr": { "buckets": { "dev": 11756, "test": 11803, "train": 40650, "validated": 120610, "invalidated": 4942, "other": 317 }, "duration": 483090831, "reportedSentences": 484, "validatedSentences": 410403, "unvalidatedSentences": 3475, "clips": 125869, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.23, "thirties": 0.09, "teens": 0.02, "fourties": 0.03, "fifties": 0.07, "sixties": 0.2, "seventies": 0.03, "eighties": 0.1, "nineties": 0 }, "gender": { "": 0.26, "male_masculine": 0.42, "female_feminine": 0.32, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 125854, "agriculture_food": 0, "automotive_transport": 1, "finance": 0, "service_retail": 0, "general": 1, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 12, "media_entertainment": 1, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1795, "size": 2947404719, "checksum": "e37186670c9d71c6bb4904f349f3d640aa1288ca3a458aa884ab56ce83bf486a", "avgDurationSecs": 3.838, "validDurationSecs": 462906.555, "totalHrs": 134.19, "validHrs": 128.58 }, "trv": { "buckets": { "dev": 558, "test": 529, "train": 885, "validated": 6386, "invalidated": 104, "other": 0 }, "duration": 36415152, "reportedSentences": 1, "validatedSentences": 1974, "unvalidatedSentences": 2, "clips": 6490, "splits": { "accent": {}, "age": { "": 0.93, "twenties": 0.07, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.68, "male_masculine": 0.07, "female_feminine": 0.13, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.12 }, "sentence_domain": { "": 465, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 6276, "healthcare": 0, 
"history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 10, "size": 214885747, "checksum": "5620ad2d3d39f3dfbb26bcd3ea9ac5a07c402f1138c5dff7224a6e9ebf770886", "avgDurationSecs": 5.611, "validDurationSecs": 35831.612, "totalHrs": 10.11, "validHrs": 9.95 }, "trw": { "buckets": { "dev": 2083, "test": 1956, "train": 2859, "validated": 11054, "invalidated": 1372, "other": 14 }, "duration": 67715892, "reportedSentences": 578, "validatedSentences": 7770, "unvalidatedSentences": 27, "clips": 12440, "splits": { "accent": {}, "age": { "": 0.05, "twenties": 0.34, "thirties": 0.11, "teens": 0.04, "fourties": 0.36, "fifties": 0.09, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 12245, "agriculture_food": 57, "automotive_transport": 2, "finance": 2, "service_retail": 57, "general": 55, "healthcare": 49, "history_law_government": 49, "language_fundamentals": 17, "media_entertainment": 1, "nature_environment": 63, "news_current_affairs": 67, "technology_robotics": 0 } }, "users": 27, "size": 397034706, "checksum": "06ea0c07f054817a3141d1b0e0d95da505b85062fde465f202bef6fc80e79eaf", "avgDurationSecs": 5.443, "validDurationSecs": 60171.34, "totalHrs": 18.8, "validHrs": 16.71 }, "ts": { "buckets": { "dev": 0, "test": 0, "train": 0, "validated": 0, "invalidated": 1, "other": 0 }, "duration": 8928, "reportedSentences": 0, "validatedSentences": 352, "unvalidatedSentences": 16012, "clips": 1, "splits": { "accent": {}, "age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 
1, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1, "size": 1040823, "checksum": "3abc4849fb791ef192ec46d21cc07b8cfb15791e2ffa3a0425a174e5193deaa0", "avgDurationSecs": 8.928, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "tt": { "buckets": { "dev": 4402, "test": 5062, "train": 8522, "validated": 30199, "invalidated": 539, "other": 262 }, "duration": 118759526, "reportedSentences": 6, "validatedSentences": 18051, "unvalidatedSentences": 32, "clips": 31000, "splits": { "accent": {}, "age": { "": 0.21, "twenties": 0.05, "thirties": 0.71, "teens": 0.01, "fourties": 0, "fifties": 0.01, "sixties": 0.01, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.21, "male_masculine": 0.76, "female_feminine": 0.03, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 30988, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 12, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 288, "size": 865339305, "checksum": "dfa2a8760b49bb5120b78a9514f1055d6c358d5594c6f1c55ca15e37564438f6", "avgDurationSecs": 3.831, "validDurationSecs": 115690.933, "totalHrs": 32.98, "validHrs": 32.13 }, "tui": { "buckets": { "dev": 602, "test": 644, "train": 722, "validated": 7570, "invalidated": 129, "other": 904 }, "duration": 42402492, "reportedSentences": 6, "validatedSentences": 1971, "unvalidatedSentences": 5, "clips": 8603, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0.21, "thirties": 0.51, "teens": 0, "fourties": 0.04, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { 
"": 0.93, "male_masculine": 0.01, "female_feminine": 0.06, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8603, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 50, "size": 248346597, "checksum": "904887b509115378a44bc51046b1638985aeec56f1fd80cda980e84393bffb94", "avgDurationSecs": 4.929, "validDurationSecs": 37311.039, "totalHrs": 11.77, "validHrs": 10.36 }, "tvu": { "buckets": { "dev": 495, "test": 438, "train": 725, "validated": 5247, "invalidated": 122, "other": 65 }, "duration": 38085048, "reportedSentences": 24, "validatedSentences": 1943, "unvalidatedSentences": 347, "clips": 5434, "splits": { "accent": {}, "age": { "": 0.8, "twenties": 0, "thirties": 0.2, "teens": 0, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.81, "male_masculine": 0, "female_feminine": 0.19, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5434, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 13, "size": 204873124, "checksum": "2dcd5291c53124cc7d603d5cce483bd9dc8d286831ee284f0b0d25f03148c635", "avgDurationSecs": 7.009, "validDurationSecs": 36774.429, "totalHrs": 10.57, "validHrs": 10.21 }, "tw": { "buckets": { "dev": 0, "test": 27, "train": 213, "validated": 240, "invalidated": 5, "other": 96 }, "duration": 1495440, "reportedSentences": 3, "validatedSentences": 49159, "unvalidatedSentences": 22, "clips": 341, "splits": { "accent": {}, "age": { "": 0.27, "twenties": 0.09, 
"thirties": 0.65, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.35, "male_masculine": 0.65, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 341, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 19, "size": 11552065, "checksum": "90fe28ee202d593067b5a04a0b25c991f892089f1448f2ae4a707f74e1e54e96", "avgDurationSecs": 4.385, "validDurationSecs": 1052.509, "totalHrs": 0.41, "validHrs": 0.29 }, "udl": { "buckets": { "dev": 285, "test": 329, "train": 396, "validated": 6400, "invalidated": 112, "other": 32 }, "duration": 38688012, "reportedSentences": 1, "validatedSentences": 1011, "unvalidatedSentences": -1, "clips": 6544, "splits": { "accent": {}, "age": { "": 0.85, "twenties": 0.15, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 6544, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 20, "size": 227536919, "checksum": "64b6b0594df78415759d30d41bd45cebc484383ffbe453b0e36f0c41f2fc27c0", "avgDurationSecs": 5.912, "validDurationSecs": 37836.687, "totalHrs": 10.74, "validHrs": 10.51 }, "ug": { "buckets": { "dev": 14656, "test": 14656, "train": 122911, "validated": 269390, "invalidated": 21500, "other": 14191 }, "duration": 1820354616, "reportedSentences": 
1539, "validatedSentences": 231426, "unvalidatedSentences": 11032, "clips": 305081, "splits": { "accent": {}, "age": { "": 0.21, "twenties": 0.34, "thirties": 0.27, "teens": 0.06, "fourties": 0.1, "fifties": 0.01, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.53, "male_masculine": 0.26, "female_feminine": 0.2, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.01, "intersex": 0 }, "sentence_domain": { "": 302526, "agriculture_food": 2, "automotive_transport": 2, "finance": 1, "service_retail": 1, "general": 2522, "healthcare": 27, "history_law_government": 3, "language_fundamentals": 10, "media_entertainment": 15, "nature_environment": 1, "news_current_affairs": 0, "technology_robotics": 10 } }, "users": 1836, "size": 10384973335, "checksum": "7b709a2dc3e624791eebc0f4202e8b59bd5a50b3cd051c1d65e0b697b54f98d6", "avgDurationSecs": 5.967, "validDurationSecs": 1607393.872, "totalHrs": 505.65, "validHrs": 446.49 }, "uk": { "buckets": { "dev": 10297, "test": 10295, "train": 27058, "validated": 77993, "invalidated": 3340, "other": 8811 }, "duration": 418806240, "reportedSentences": 660, "validatedSentences": 212668, "unvalidatedSentences": 1625, "clips": 90144, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0.33, "thirties": 0.25, "teens": 0.09, "fourties": 0.09, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.24, "male_masculine": 0.58, "female_feminine": 0.17, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 90144, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 1164, "size": 2748721118, "checksum": "5fe1c9be7116d14cf6928f326b25f4910875fcc770500e1315137c9ac52931e5", "avgDurationSecs": 4.646, 
"validDurationSecs": 362353.069, "totalHrs": 116.33, "validHrs": 100.65 }, "ur": { "buckets": { "dev": 5089, "test": 5091, "train": 7339, "validated": 68229, "invalidated": 11008, "other": 173662 }, "duration": 1087276644, "reportedSentences": 411, "validatedSentences": 21705, "unvalidatedSentences": 48258, "clips": 252899, "splits": { "accent": {}, "age": { "": 0.06, "twenties": 0.92, "thirties": 0, "teens": 0, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.23, "male_masculine": 0.52, "female_feminine": 0.25, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 252792, "agriculture_food": 12, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 48, "healthcare": 47, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 498, "size": 6347527720, "checksum": "391a79277ca6855ef6da612366584cf47a8dfb5b4eeed1f5902f496413faee64", "avgDurationSecs": 4.299, "validDurationSecs": 293333.695, "totalHrs": 302.02, "validHrs": 81.48 }, "ush": { "buckets": { "dev": 195, "test": 357, "train": 586, "validated": 3861, "invalidated": 82, "other": 18 }, "duration": 24351012, "reportedSentences": 0, "validatedSentences": 1170, "unvalidatedSentences": -1, "clips": 3961, "splits": { "accent": {}, "age": { "": 0.15, "twenties": 0.23, "thirties": 0.01, "teens": 0.02, "fourties": 0.6, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 3958, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 3, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, 
"technology_robotics": 0 } }, "users": 23, "size": 135793370, "checksum": "f41fd2137282ea84e7fc35a6775016b710a2d9a37110128993660817f50dad77", "avgDurationSecs": 6.148, "validDurationSecs": 23736.243, "totalHrs": 6.76, "validHrs": 6.59 }, "uz": { "buckets": { "dev": 12264, "test": 12388, "train": 48762, "validated": 87181, "invalidated": 14141, "other": 128515 }, "duration": 955631808, "reportedSentences": 1833, "validatedSentences": 148734, "unvalidatedSentences": 137880, "clips": 229837, "splits": { "accent": {}, "age": { "": 0.41, "twenties": 0.4, "thirties": 0.01, "teens": 0.18, "fourties": 0.01, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.41, "male_masculine": 0.44, "female_feminine": 0.15, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 229837, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 2281, "size": 5363124452, "checksum": "71593db33b13a634292eb7e2a3b471373c5ce2682a163994140460e32933d9be", "avgDurationSecs": 4.158, "validDurationSecs": 362487.052, "totalHrs": 265.45, "validHrs": 100.69 }, "var": { "buckets": { "dev": 546, "test": 611, "train": 752, "validated": 6851, "invalidated": 291, "other": 864 }, "duration": 42487380, "reportedSentences": 246, "validatedSentences": 1977, "unvalidatedSentences": -1, "clips": 8006, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.29, "thirties": 0.57, "teens": 0.14, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.06, "male_masculine": 0, "female_feminine": 0.94, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8006, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, 
"service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 12, "size": 247635388, "checksum": "ba7c24ada264068bafc39298c431215a1f52d8b01d516b7d8d211bcd190d8409", "avgDurationSecs": 5.307, "validDurationSecs": 36357.862, "totalHrs": 11.8, "validHrs": 10.09 }, "ve": { "buckets": { "dev": 0, "test": 0, "train": 0, "validated": 0, "invalidated": 0, "other": 20 }, "duration": 113076, "reportedSentences": 0, "validatedSentences": 1735, "unvalidatedSentences": 7843, "clips": 20, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0.75, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.25, "male_masculine": 0, "female_feminine": 0.75, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 20, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 2, "size": 1206489, "checksum": "f8a215f053ce4f46dd6e73e56aa7dd2339d5c13770903ed6a0c3a035672937ef", "avgDurationSecs": 5.654, "validDurationSecs": 0, "totalHrs": 0.03, "validHrs": 0 }, "vi": { "buckets": { "dev": 1094, "test": 1383, "train": 2009, "validated": 6036, "invalidated": 457, "other": 12721 }, "duration": 76849892, "reportedSentences": 201, "validatedSentences": 8159, "unvalidatedSentences": 5398, "clips": 19214, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.2, "thirties": 0.05, "teens": 0.2, "fourties": 0.02, "fifties": 0, "sixties": 0.26, "seventies": 0.02, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3, "male_masculine": 0.49, "female_feminine": 0.21, "transgender": 0, "non-binary": 0, 
"do_not_wish_to_say": 0 }, "sentence_domain": { "": 18475, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 739, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 370, "size": 447769009, "checksum": "93520f4e269fda549cd22d716085a76b5096554d43836d70c7798e218ac9b208", "avgDurationSecs": 4, "validDurationSecs": 24142.081, "totalHrs": 21.34, "validHrs": 6.7 }, "vot": { "buckets": { "dev": 0, "test": 7, "train": 96, "validated": 103, "invalidated": 324, "other": 0 }, "duration": 1029972, "reportedSentences": 0, "validatedSentences": 5437, "unvalidatedSentences": 1, "clips": 427, "splits": { "accent": {}, "age": { "": 0.25, "twenties": 0.74, "thirties": 0, "teens": 0.01, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.25, "male_masculine": 0.75, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 427, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 6, "size": 8186608, "checksum": "cd4765febc933489d5ffd3c43c4bd0815a0e1f42aa1141824645fd2434bf8eeb", "avgDurationSecs": 2.412, "validDurationSecs": 248.448, "totalHrs": 0.28, "validHrs": 0.06 }, "wbl": { "buckets": { "dev": 1110, "test": 1128, "train": 2542, "validated": 6464, "invalidated": 89, "other": 1639 }, "duration": 55320552, "reportedSentences": 1, "validatedSentences": 5493, "unvalidatedSentences": 114, "clips": 8192, "splits": { "accent": {}, "age": { "": 0.21, "twenties": 0, "thirties": 0.01, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0.12, "seventies": 0.65, 
"eighties": 0, "nineties": 0 }, "gender": { "": 0.88, "male_masculine": 0.12, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8189, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 3, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 14, "size": 320971415, "checksum": "4b3d8ccd1a9477f1e2c037386cdacd855e3186fe48a19242eac174d8bfc5c05b", "avgDurationSecs": 6.753, "validDurationSecs": 43651.373, "totalHrs": 15.36, "validHrs": 12.12 }, "wes": { "buckets": { "dev": 324, "test": 323, "train": 351, "validated": 9003, "invalidated": 51, "other": 363 }, "duration": 38779632, "reportedSentences": 0, "validatedSentences": 999, "unvalidatedSentences": -1, "clips": 9417, "splits": { "accent": {}, "age": { "": 0.94, "twenties": 0.01, "thirties": 0, "teens": 0, "fourties": 0.06, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.94, "male_masculine": 0, "female_feminine": 0.06, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 9417, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 18, "size": 209712696, "checksum": "8955146c6bd3521a103b24e1c4542c937bfdac93dab69c21037d993d0d6bf83e", "avgDurationSecs": 4.118, "validDurationSecs": 37074.761, "totalHrs": 10.77, "validHrs": 10.29 }, "xh": { "buckets": { "dev": 0, "test": 2, "train": 7, "validated": 9, "invalidated": 2, "other": 19 }, "duration": 179712, "reportedSentences": 0, "validatedSentences": 1476, "unvalidatedSentences": 14922, "clips": 30, "splits": { "accent": {}, 
"age": { "": 1, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 30, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 4, "size": 2197868, "checksum": "084813607f5d5a858df95a199a9930074f238efc2891d7f0c28a80dbc91f30b7", "avgDurationSecs": 5.99, "validDurationSecs": 53.914, "totalHrs": 0.04, "validHrs": 0.01 }, "xhe": { "buckets": { "dev": 0, "test": 1443, "train": 3542, "validated": 11429, "invalidated": 160, "other": 14 }, "duration": 37269792, "reportedSentences": 1, "validatedSentences": 5040, "unvalidatedSentences": 2, "clips": 11603, "splits": { "accent": {}, "age": { "": 0.15, "twenties": 0.85, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 11603, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 11, "size": 217956696, "checksum": "801266f1540c987cfa379241f4c76a1da6f558f14ded2becd2f9ea405bdf6565", "avgDurationSecs": 3.212, "validDurationSecs": 36710.89, "totalHrs": 10.35, "validHrs": 10.19 }, "xka": { "buckets": { "dev": 0, "test": 399, "train": 1571, "validated": 11163, "invalidated": 66, "other": 583 }, "duration": 38274120, 
"reportedSentences": 4, "validatedSentences": 1980, "unvalidatedSentences": 46, "clips": 11853, "splits": { "accent": {}, "age": { "": 0.01, "twenties": 0.74, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0.25, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 11827, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 26, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 9, "size": 203456160, "checksum": "9b23e454211b544e89586a5b5ba1bdbf3bf8f9e79593e48e7e63fec6bfbc0442", "avgDurationSecs": 3.229, "validDurationSecs": 36046.064, "totalHrs": 10.63, "validHrs": 10.01 }, "xmf": { "buckets": { "dev": 327, "test": 327, "train": 334, "validated": 6753, "invalidated": 587, "other": 31 }, "duration": 45608148, "reportedSentences": 20, "validatedSentences": 1004, "unvalidatedSentences": 35, "clips": 7371, "splits": { "accent": {}, "age": { "": 0.23, "twenties": 0.32, "thirties": 0.3, "teens": 0.02, "fourties": 0.13, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.45, "male_masculine": 0.05, "female_feminine": 0.49, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7333, "agriculture_food": 9, "automotive_transport": 19, "finance": 0, "service_retail": 0, "general": 19, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 87, "size": 267430804, "checksum": "95f40af15d55c3cf065fae9257bb2e076a4517e9af57f10e84e9657d6c45df33", "avgDurationSecs": 6.188, "validDurationSecs": 41784.266, "totalHrs": 12.66, "validHrs": 
11.6 }, "yaq": { "buckets": { "dev": 131, "test": 800, "train": 1897, "validated": 6925, "invalidated": 1, "other": 845 }, "duration": 41273208, "reportedSentences": 0, "validatedSentences": 2838, "unvalidatedSentences": -1, "clips": 7771, "splits": { "accent": {}, "age": { "": 0, "twenties": 0.36, "thirties": 0.27, "teens": 0, "fourties": 0.36, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.36, "male_masculine": 0, "female_feminine": 0.64, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 7771, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 5, "size": 243158559, "checksum": "579c5858fe2779f7659bf56c6772b8d95bdafe8995ec69e2846fc1f485df2ca1", "avgDurationSecs": 5.311, "validDurationSecs": 36779.947, "totalHrs": 11.46, "validHrs": 10.21 }, "yav": { "buckets": { "dev": 274, "test": 316, "train": 423, "validated": 4679, "invalidated": 413, "other": 506 }, "duration": 47985372, "reportedSentences": 0, "validatedSentences": 1013, "unvalidatedSentences": -1, "clips": 5598, "splits": { "accent": {}, "age": { "": 0.89, "twenties": 0, "thirties": 0, "teens": 0, "fourties": 0.11, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.89, "male_masculine": 0, "female_feminine": 0.11, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 5598, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 10, "size": 279281015, "checksum": 
"7a07ce50e5b8a9d33b5d4780d29f851d56199f22ba69c315281de787d82d2343", "avgDurationSecs": 8.572, "validDurationSecs": 40107.816, "totalHrs": 13.32, "validHrs": 11.14 }, "ydg": { "buckets": { "dev": 0, "test": 321, "train": 1474, "validated": 10556, "invalidated": 10, "other": 636 }, "duration": 40096152, "reportedSentences": 0, "validatedSentences": 1882, "unvalidatedSentences": 109, "clips": 11202, "splits": { "accent": {}, "age": { "": 0.88, "twenties": 0, "thirties": 0.12, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 11202, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 15, "size": 233778099, "checksum": "733fdddd0a1f594c3606862c30560225c4e33ac33664ce0ebd468b45e4383766", "avgDurationSecs": 3.579, "validDurationSecs": 37783.876, "totalHrs": 11.13, "validHrs": 10.49 }, "yi": { "buckets": { "dev": 440, "test": 456, "train": 488, "validated": 1826, "invalidated": 86, "other": 87 }, "duration": 7946640, "reportedSentences": 17, "validatedSentences": 1550, "unvalidatedSentences": 84, "clips": 1999, "splits": { "accent": {}, "age": { "": 0.77, "twenties": 0.06, "thirties": 0.06, "teens": 0.11, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1981, "agriculture_food": 0, "automotive_transport": 0, "finance": 2, "service_retail": 0, "general": 13, "healthcare": 0, "history_law_government": 1, "language_fundamentals": 0, "media_entertainment": 0, 
"nature_environment": 3, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 124, "size": 44764597, "checksum": "4674aa3e92e5b8e5620004b5c473d8cd4245aa973a642abf781899e88179b458", "avgDurationSecs": 3.975, "validDurationSecs": 7258.912, "totalHrs": 2.2, "validHrs": 2.01 }, "yo": { "buckets": { "dev": 915, "test": 1122, "train": 1412, "validated": 3451, "invalidated": 261, "other": 1140 }, "duration": 29313072, "reportedSentences": 21, "validatedSentences": 5419, "unvalidatedSentences": 17, "clips": 4852, "splits": { "accent": {}, "age": { "": 0.29, "twenties": 0.61, "thirties": 0.03, "teens": 0.04, "fourties": 0, "fifties": 0.02, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.33, "male_masculine": 0.3, "female_feminine": 0.37, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 4852, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 132, "size": 170839274, "checksum": "6afad07dac589bd86a1794c2d918c6d3da232590bc79c4d19085abbf836cf368", "avgDurationSecs": 6.041, "validDurationSecs": 20849.013, "totalHrs": 8.14, "validHrs": 5.79 }, "yue": { "buckets": { "dev": 5130, "test": 5130, "train": 7420, "validated": 191396, "invalidated": 8103, "other": 79741 }, "duration": 1107782964, "reportedSentences": 2257, "validatedSentences": 18556, "unvalidatedSentences": 10171, "clips": 279240, "splits": { "accent": {}, "age": { "": 0.19, "twenties": 0.29, "thirties": 0.47, "teens": 0.01, "fourties": 0.04, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2, "male_masculine": 0.17, "female_feminine": 0.62, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 276394, "agriculture_food": 18, 
"automotive_transport": 51, "finance": 128, "service_retail": 606, "general": 1678, "healthcare": 417, "history_law_government": 67, "language_fundamentals": 0, "media_entertainment": 192, "nature_environment": 188, "news_current_affairs": 114, "technology_robotics": 95 } }, "users": 1174, "size": 6420496257, "checksum": "755100899533b01d42d1ca5d4a607afa72c519fab5fae7b0750de62d82db1893", "avgDurationSecs": 3.967, "validDurationSecs": 759293.898, "totalHrs": 307.71, "validHrs": 210.91 }, "zgh": { "buckets": { "dev": 302, "test": 233, "train": 871, "validated": 1406, "invalidated": 91, "other": 672 }, "duration": 7560684, "reportedSentences": 6, "validatedSentences": 2738, "unvalidatedSentences": 355, "clips": 2169, "splits": { "accent": {}, "age": { "": 0.1, "twenties": 0.38, "thirties": 0.52, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.15, "male_masculine": 0.82, "female_feminine": 0.03, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 2126, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 34, "healthcare": 3, "history_law_government": 0, "language_fundamentals": 5, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 1 } }, "users": 38, "size": 41386121, "checksum": "679955e7c53e8c0ce7b890df2f5a7e914bd0abeea91edeb8ca1199ee790aeaca", "avgDurationSecs": 3.486, "validDurationSecs": 4901.024, "totalHrs": 2.1, "validHrs": 1.36 }, "zh-CN": { "buckets": { "dev": 10647, "test": 10647, "train": 29560, "validated": 189406, "invalidated": 59164, "other": 602833 }, "duration": 3865791552, "reportedSentences": 1145, "validatedSentences": 59126, "unvalidatedSentences": 909, "clips": 851403, "splits": { "accent": {}, "age": { "": 0.92, "twenties": 0.05, "thirties": 0.01, "teens": 0.01, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 
}, "gender": { "": 0.93, "male_masculine": 0.06, "female_feminine": 0.01, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 848200, "agriculture_food": 51, "automotive_transport": 71, "finance": 101, "service_retail": 58, "general": 829, "healthcare": 132, "history_law_government": 369, "language_fundamentals": 86, "media_entertainment": 1591, "nature_environment": 53, "news_current_affairs": 178, "technology_robotics": 236 } }, "users": 7525, "size": 22886754844, "checksum": "858c5ab1dc55d7a8e9802e5fa896683f17be84ca25edbeb79b9abd25a99b2a91", "avgDurationSecs": 4.54, "validDurationSecs": 859997.104, "totalHrs": 1073.83, "validHrs": 238.88 }, "zh-HK": { "buckets": { "dev": 5604, "test": 5604, "train": 8450, "validated": 94010, "invalidated": 4632, "other": 25129 }, "duration": 513685212, "reportedSentences": 824, "validatedSentences": 20146, "unvalidatedSentences": 29, "clips": 123771, "splits": { "accent": {}, "age": { "": 0.38, "twenties": 0.33, "thirties": 0.11, "teens": 0.02, "fourties": 0.12, "fifties": 0.03, "sixties": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male_masculine": 0.4, "female_feminine": 0.22, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 122401, "agriculture_food": 0, "automotive_transport": 29, "finance": 44, "service_retail": 295, "general": 925, "healthcare": 90, "history_law_government": 72, "language_fundamentals": 85, "media_entertainment": 53, "nature_environment": 54, "news_current_affairs": 24, "technology_robotics": 45 } }, "users": 3108, "size": 3653748292, "checksum": "213c03ce1bd553168a1f964b5be8a5cab3dc626a68fb1a569fa4ee955ed812c0", "avgDurationSecs": 4.15, "validDurationSecs": 390168.511, "totalHrs": 142.69, "validHrs": 108.38 }, "zh-TW": { "buckets": { "dev": 5107, "test": 5107, "train": 7369, "validated": 84883, "invalidated": 4905, "other": 50436 }, "duration": 471662698, "reportedSentences": 179, 
"validatedSentences": 20751, "unvalidatedSentences": 849, "clips": 140224, "splits": { "accent": {}, "age": { "": 0.27, "twenties": 0.3, "thirties": 0.19, "teens": 0.06, "fourties": 0.09, "fifties": 0.09, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.29, "male_masculine": 0.49, "female_feminine": 0.22, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 138740, "agriculture_food": 12, "automotive_transport": 274, "finance": 1, "service_retail": 150, "general": 1103, "healthcare": 22, "history_law_government": 167, "language_fundamentals": 8, "media_entertainment": 167, "nature_environment": 12, "news_current_affairs": 43, "technology_robotics": 543 } }, "users": 2304, "size": 3146644633, "checksum": "478f55259cf02eed454ef9bba979e879d008c7b98c4aaa4d57c16f8371ff6af3", "avgDurationSecs": 3.364, "validDurationSecs": 285515.638, "totalHrs": 131.01, "validHrs": 79.3 }, "zoc": { "buckets": { "dev": 483, "test": 474, "train": 524, "validated": 8886, "invalidated": 2, "other": 60 }, "duration": 36533628, "reportedSentences": 0, "validatedSentences": 1481, "unvalidatedSentences": 7, "clips": 8948, "splits": { "accent": {}, "age": { "": 0.09, "twenties": 0.38, "thirties": 0.22, "teens": 0, "fourties": 0.31, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.56, "male_masculine": 0, "female_feminine": 0.44, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 8948, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 12, "size": 213960228, "checksum": "54186fba91b69e7c71ba70046387d4d385eea9e7e2ab21eeeac8157684de13e3", "avgDurationSecs": 4.083, "validDurationSecs": 36280.489, "totalHrs": 10.14, 
"validHrs": 10.07 }, "zu": { "buckets": { "dev": 0, "test": 1, "train": 15, "validated": 16, "invalidated": 12, "other": 175 }, "duration": 1123272, "reportedSentences": 4, "validatedSentences": 1320, "unvalidatedSentences": 19379, "clips": 203, "splits": { "accent": {}, "age": { "": 0.49, "twenties": 0.51, "thirties": 0, "teens": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.49, "male_masculine": 0, "female_feminine": 0.17, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.34 }, "sentence_domain": { "": 203, "agriculture_food": 0, "automotive_transport": 0, "finance": 0, "service_retail": 0, "general": 0, "healthcare": 0, "history_law_government": 0, "language_fundamentals": 0, "media_entertainment": 0, "nature_environment": 0, "news_current_affairs": 0, "technology_robotics": 0 } }, "users": 8, "size": 7938516, "checksum": "dff28383d0ba714cb7e49d597f2c21579e5e14c0aa8e06534b57a77a08ed690e", "avgDurationSecs": 5.533, "validDurationSecs": 88.534, "totalHrs": 0.31, "validHrs": 0.02 }, "zza": { "buckets": { "dev": 463, "test": 424, "train": 790, "validated": 1677, "invalidated": 245, "other": 298 }, "duration": 9090864, "reportedSentences": 2338, "validatedSentences": 5655, "unvalidatedSentences": 974, "clips": 2220, "splits": { "accent": {}, "age": { "": 0.22, "twenties": 0, "thirties": 0.65, "teens": 0, "fourties": 0, "fifties": 0.01, "sixties": 0.12, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.7, "male_masculine": 0.28, "female_feminine": 0.02, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "sentence_domain": { "": 1481, "agriculture_food": 3, "automotive_transport": 1, "finance": 1, "service_retail": 9, "general": 675, "healthcare": 16, "history_law_government": 0, "language_fundamentals": 27, "media_entertainment": 2, "nature_environment": 4, "news_current_affairs": 0, "technology_robotics": 2 } }, "users": 20, "size": 53259616, "checksum": 
"d4d2407866c14b3fdab77c42fde3557e4f20c45e2a53c0323f7bc5380950263b", "avgDurationSecs": 4.095, "validDurationSecs": 6867.288, "totalHrs": 2.52, "validHrs": 1.9 } }, "totalDuration": 140161062895, "totalValidDurationSecs": 93193530, "totalHrs": 38932, "totalValidHrs": 25886 }

================================================
FILE: datasets/scripted-speech/cv-corpus-24.0-delta-2025-12-05.json
================================================
{ "locale": { "ab": { "duration": 239328, "reportedSentences": 0, "validatedSentences": 1, "unvalidatedSentences": 0, "clips": 35, "users": 4, "size": 1440427, "checksum": "ecc75443c63048b97d1244b89b45b483ce31c151e035345928a0c9158fd16397", "avgDurationSecs": 5.178, "validDurationSecs": 4144.32, "totalHrs": 0.07, "validHrs": 1.15 }, "abb": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -6095, "checksum": "7ef1fe7863a8ee8a5e36c3bcf68ecfbdc3a704bae91ceeda28191dcd51c49e4b", "avgDurationSecs": 5.109, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ady": { "duration": 55580436, "reportedSentences": 53, "validatedSentences": 939, "unvalidatedSentences": 507, "clips": 11219, "users": 28, "size": 321830596, "checksum": "31759be8f8da07a3fd2ff2799457ce783fca6a88c6f4530364d63ec79fc16759", "avgDurationSecs": 5.008, "validDurationSecs": 78973.29, "totalHrs": 15.44, "validHrs": 21.93 }, "af": { "duration": 53856, "reportedSentences": 0, "validatedSentences": 8, "unvalidatedSentences": -7, "clips": 8, "users": 2, "size": 318729, "checksum": "9c95833ed6d565d4872fa551d1dd0ca4d4fd75aabc3c6e9a7355e4d799c234c4", "avgDurationSecs": 6.079, "validDurationSecs": 26.6, "totalHrs": 0.01, "validHrs": 0.01 }, "ajg": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -22524, "checksum": "75c41be24180fa08cdce591497fbeba663931358c56eebaa1c81b58d5a2d79e2", "avgDurationSecs": 2.389, "validDurationSecs":
4.78, "totalHrs": 0, "validHrs": 0 }, "am": { "duration": 8856, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 1, "users": 1, "size": 52855, "checksum": "f5d8274eb80826338f6f1d6d47fd80d4702207fb31276a2afe0dd815e205fc99", "avgDurationSecs": 6.299, "validDurationSecs": 64.61, "totalHrs": 0, "validHrs": 0.01 }, "an": { "duration": 40788, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 5, "users": 1, "size": 231705, "checksum": "780f87179dd488f46fcd9af2737cd2aed3e662cb678d686243f4146c78320573", "avgDurationSecs": 4.574, "validDurationSecs": 16.95, "totalHrs": 0.01, "validHrs": 0 }, "ar": { "duration": 1085976, "reportedSentences": 1, "validatedSentences": 0, "unvalidatedSentences": 2, "clips": 235, "users": 5, "size": 5985082, "checksum": "5df32f94259c2cfc45854fb7cf7191e317c5a9b662335fb9cfc9e5c3ec8b4f25", "avgDurationSecs": 4.162, "validDurationSecs": 475, "totalHrs": 0.3, "validHrs": 0.13 }, "as": { "duration": 0, "reportedSentences": 0, "validatedSentences": 18, "unvalidatedSentences": -18, "clips": 0, "users": 0, "size": -1879, "checksum": "799c31374c95f64b4400f9c1989466f727a797bd874d96359580012f1271e4f1", "avgDurationSecs": 5.897, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ast": { "duration": 68400, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 30, "users": 1, "size": 400376, "checksum": "85477bbd78bab637a35e0ed18562dcbd729d85f253a22b75ce5a42124954167b", "avgDurationSecs": 4.396, "validDurationSecs": 173.57, "totalHrs": 0.02, "validHrs": 0.05 }, "az": { "duration": 267120, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 12, "clips": 50, "users": 2, "size": 1581920, "checksum": "00f1b0d6ef904e62eafbcc8d92e8e764486ba3d2d3d37a6b321b63555ab26a59", "avgDurationSecs": 5.452, "validDurationSecs": 303.28, "totalHrs": 0.07, "validHrs": 0.09 }, "ba": { "duration": 17640, "reportedSentences": 0, "validatedSentences": 0, 
"unvalidatedSentences": 0, "clips": 5, "users": 1, "size": 55371, "checksum": "fab12b73feb3d921f594f5e0fa3b648d6ff79169c0fbeec35dee302a957adedc", "avgDurationSecs": 4.427, "validDurationSecs": -4.33, "totalHrs": 0, "validHrs": 0 }, "bag": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 1, "clips": 0, "users": 0, "size": -8826, "checksum": "a8f8539aeae94dfe59b4842698eeccbf9f8adec175ef11ae0bac25f3ca925683", "avgDurationSecs": 5.716, "validDurationSecs": 5.72, "totalHrs": 0, "validHrs": 0 }, "bas": { "duration": 112608, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 25, "users": 3, "size": 656429, "checksum": "185c03ca819b5b2e8db5ee3dfa63929c364fe5f952e919dc0b71bf4a126e7d68", "avgDurationSecs": 3.927, "validDurationSecs": 16.81, "totalHrs": 0.03, "validHrs": 0.01 }, "bax": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -15301, "checksum": "009c91f847a92d1dcd6061fc943a2fce3c94a0006bf340fff2ac6dea08e10a74", "avgDurationSecs": 5.076, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "bba": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -6284, "checksum": "b4c88af1da2257fabbeb487809e14f1048ee453d2392348cd3a88fba780d6779", "avgDurationSecs": 6.325, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "bbj": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -15817, "checksum": "e95e5c54a75a528e49cd6eb79ddcc6dec8823d382d2d22e3421fc6299ca0b2b5", "avgDurationSecs": 6.2, "validDurationSecs": 43.4, "totalHrs": 0, "validHrs": 0.02 }, "bbl": { "duration": 240588, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 8, "clips": 30, "users": 0, "size": 1416497, "checksum": "630e474f98551ae6be9c32014ccfcd9e2147773dd2f039f08a57f89754cfa76d", 
"avgDurationSecs": 8.808, "validDurationSecs": -22.61, "totalHrs": 0.06, "validHrs": -0.01 }, "bce": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -17897, "checksum": "c05a81d13e145b5e30d749a6c51bd9e36731b10b99cc8947e989131f94c0e068", "avgDurationSecs": 5.892, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "bci": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -18766, "checksum": "77cca7a20507da7a7fb01355f526b7591406cbb354cbcf22193dc16e6e5f8fcd", "avgDurationSecs": 7.436, "validDurationSecs": 22.31, "totalHrs": 0, "validHrs": 0.01 }, "be": { "duration": 15498252, "reportedSentences": 0, "validatedSentences": 4, "unvalidatedSentences": -4, "clips": 2820, "users": 10, "size": 83083149, "checksum": "680f17ef2a63ef2934a2cd2c311e2d5002d32465fcccddb3a111141e95ac6d76", "avgDurationSecs": 4.793, "validDurationSecs": 15363, "totalHrs": 4.31, "validHrs": 4.27 }, "beb": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -9, "checksum": "4fbd54e932f2c10d69de7d04340e3a9dd79fa3ed518b78a243685acc0ac2e410", "avgDurationSecs": 5.46, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "bfd": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 1857, "checksum": "91ec8fb3b6aa03a9eefe8adffc781d1dd8e2afa97c87d76968392955b9d6ae89", "avgDurationSecs": 5.654, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "bft": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -8424, "checksum": "84f039c5154daaa1087d779cc28ca89181526c65080cc95a741dc8f17775453f", "avgDurationSecs": 6.163, "validDurationSecs": 36.98, "totalHrs": 0, "validHrs": 0.01 }, "bg": { "duration": 30888, "reportedSentences": 0, 
"validatedSentences": 0, "unvalidatedSentences": 0, "clips": 5, "users": 1, "size": 176856, "checksum": "6189a89f3fb0b00fc4e915ce6038f84e4cf4ff20879ded0eb32bac2f83e4f302", "avgDurationSecs": 5.472, "validDurationSecs": 145.14, "totalHrs": 0, "validHrs": 0.04 }, "bgp": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -14594, "checksum": "149863ab8d1b264cbd81116c92708f7dc630f5c65ba4a12319838c4e12858c1b", "avgDurationSecs": 6.087, "validDurationSecs": 48.7, "totalHrs": 0, "validHrs": 0.02 }, "bkh": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -15051, "checksum": "a39171ec6092f4b8a502f8a3b5316342d0293192ef64594a838f4b3a7d487072", "avgDurationSecs": 6.942, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "bkm": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -8176, "checksum": "6bb316a0f85ab143d44817041c8fa6e25ad5e5ff53d745911f4e08cc4b16d889", "avgDurationSecs": 5.774, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "bn": { "duration": 302760, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 42, "users": 8, "size": 1673240, "checksum": "3e0f6c6c6484df5f7b2c03af079962671d2d3debb0fcab38351db1d644b01b4e", "avgDurationSecs": 4.371, "validDurationSecs": 96.84, "totalHrs": 0.08, "validHrs": 0.03 }, "bnm": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -18619, "checksum": "1877f0433254855359cafe4dcc6ea8f7919d5d98de023929a4cc501b3e3257c4", "avgDurationSecs": 7.588, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "bnn": { "duration": 46188, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 10, "users": 1, "size": 250373, "checksum": 
"cb970d52a5f68caad735b54666dc96b8b063526072d6a6c80b96f0cf3002a813", "avgDurationSecs": 5.09, "validDurationSecs": -4.17, "totalHrs": 0.01, "validHrs": 0 }, "br": { "duration": 4373784, "reportedSentences": 1, "validatedSentences": 307, "unvalidatedSentences": 283, "clips": 1264, "users": 14, "size": 25563373, "checksum": "4d824b0baf8291a9afbd9daab2ce651d5fb48974b4f33181fe8c0bdf7b57eea7", "avgDurationSecs": 3.253, "validDurationSecs": 4284.27, "totalHrs": 1.21, "validHrs": 1.19 }, "brh": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -9580, "checksum": "a7404392ac28d37861d0d5cf7d444bb1591332c1601d11c6b55d0df8c242e39a", "avgDurationSecs": 7.274, "validDurationSecs": 14.55, "totalHrs": 0, "validHrs": 0.01 }, "bri": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -27806, "checksum": "385b9cae7f3fa2d70cfc54c00d191f28c435392d8309c33a666695146798903c", "avgDurationSecs": 4.344, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "bsh": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -14816, "checksum": "4111396cbb8b05515503fe3846ab5732d1209a8325d951e58e2212fedc8e0839", "avgDurationSecs": 5.454, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "bsk": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -13921, "checksum": "468289bf74d387ddf1f708a23ef08b5f6d25a56bdabd7cf637e02d547df9301a", "avgDurationSecs": 4.262, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "btv": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 737, "checksum": "8e3da90b7112ef6be21aadd8ef9b34b5339e0d96dda50179864bdbdaebade716", "avgDurationSecs": 4.057, "validDurationSecs": 16.23, "totalHrs": 0, "validHrs": 0 
}, "bum": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -22353, "checksum": "ea1e4b8532b0b153d715335f212bafabc80166cc4c4a92f0d7d2fc75380605f9", "avgDurationSecs": 4.657, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "byv": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -11068, "checksum": "669c0a2460b4badcf56f48d8b9008f342ba6f5967d0574b09b830c6e6ba705bb", "avgDurationSecs": 6.178, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ca": { "duration": 85187808, "reportedSentences": 717, "validatedSentences": 1553, "unvalidatedSentences": -1179, "clips": 18682, "users": 52, "size": 499668581, "checksum": "74b7ee289aaf2a199661b45a7b99447e7cc0dc527c1276e5448cd20b6dacc100", "avgDurationSecs": 5.2, "validDurationSecs": 69381.59, "totalHrs": 23.66, "validHrs": 19.27 }, "cjk": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -16431, "checksum": "0da6fc97db29d26c393769cd2299305ede7e6a9e9ca5e827f95f04731665e69f", "avgDurationSecs": 5.945, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ckb": { "duration": 1179864, "reportedSentences": 0, "validatedSentences": 163, "unvalidatedSentences": -163, "clips": 262, "users": 15, "size": 6934126, "checksum": "19fc00e7df6b164a6f77c7eca7c22530b52f032b9d00a12e99ee46ccb403553e", "avgDurationSecs": 4.095, "validDurationSecs": 772.19, "totalHrs": 0.33, "validHrs": 0.21 }, "cnh": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -2391, "checksum": "cb04539ec3670594b854f4010f0dbb8ce0464ed8bf0bcbbc726f22fd58bd91fc", "avgDurationSecs": 3.516, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "cpy": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, 
"size": -10603, "checksum": "912429d002017db6e89bef944953a49cd8468c538e1d71f5071a401933549cf9", "avgDurationSecs": 4.421, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "cs": { "duration": 507060, "reportedSentences": 1, "validatedSentences": 71, "unvalidatedSentences": -71, "clips": 126, "users": 12, "size": 2814525, "checksum": "3ef013a0c17afa5e873694e292c928dfe20a26f83b6eae5ed89a2710c58f6181", "avgDurationSecs": 4.46, "validDurationSecs": 1071.89, "totalHrs": 0.14, "validHrs": 0.3 }, "cut": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -9785, "checksum": "95481d4df662f9640074f60ebab50808f0c651abaf574b3ca694a647735ce0f0", "avgDurationSecs": 6.631, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "cux": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -9928, "checksum": "8a0023face6efa3ba1ed8b536d8a107b6586ccae6e1561a71f18c16ce6acfc9a", "avgDurationSecs": 4.101, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "cv": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -1494, "checksum": "4ffa14e3b8d428118c32e8a26c5b940d246eaececc85eecb572b5ca8dc5b7819", "avgDurationSecs": 5.042, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "cy": { "duration": 312192, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 1, "clips": 51, "users": 9, "size": 1835661, "checksum": "c73af06c90775d7b9108c569ce6f41a4111a76897e6962e83ca64a1e99f82c8a", "avgDurationSecs": 4.916, "validDurationSecs": 318.52, "totalHrs": 0.09, "validHrs": 0.09 }, "da": { "duration": 99468, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 35, "users": 2, "size": 553211, "checksum": "3d60b07b341b736ae70ed8fc0c685663ae7304bd2840ade7d705f6bdada74d9b", "avgDurationSecs": 4.06, "validDurationSecs": 50.92, 
"totalHrs": 0.03, "validHrs": 0.02 }, "dag": { "duration": 2269044, "reportedSentences": 0, "validatedSentences": 217, "unvalidatedSentences": -127, "clips": 540, "users": 1, "size": 13264050, "checksum": "6cd20551fbfb1352fea5f535c502f8d85c37ad70adeef52c7981cd8816c30a95", "avgDurationSecs": 4.298, "validDurationSecs": 250.64, "totalHrs": 0.63, "validHrs": 0.07 }, "dar": { "duration": 20862936, "reportedSentences": 4, "validatedSentences": 794, "unvalidatedSentences": 132, "clips": 3770, "users": 1, "size": 121030874, "checksum": "4b6fadcda297c53d5ff319d2ee29f02698e9959cc46eb96128fd973bfe378b6f", "avgDurationSecs": 5.351, "validDurationSecs": 11740.92, "totalHrs": 5.8, "validHrs": 3.26 }, "dav": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -3174, "checksum": "88631830c7aa16e1e62508169dc6434f58d957828bac8a12b93f375eab805328", "avgDurationSecs": 4.059, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "de": { "duration": 21868956, "reportedSentences": 51, "validatedSentences": 28, "unvalidatedSentences": 169, "clips": 3551, "users": 90, "size": 127116148, "checksum": "7b0b97f32ed8404b02c75251a99e3ea19317888fd3a51567275b971cb36c1f21", "avgDurationSecs": 5.265, "validDurationSecs": 41931.27, "totalHrs": 6.07, "validHrs": 11.65 }, "dmk": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -17484, "checksum": "12bd601e002918086e4f6381da0c00820170a3fb111674c55fd3ae91257b35aa", "avgDurationSecs": 3.546, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "dml": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -21500, "checksum": "217262aaf8bc8133edd0088e2813d7b91e1a3b1970351356d877f76859b4d900", "avgDurationSecs": 6.049, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "dru": { "duration": 5940, "reportedSentences": 0, 
"validatedSentences": 0, "unvalidatedSentences": 0, "clips": 1, "users": 0, "size": 18442, "checksum": "0a27a4418f5afc38f8960d83bd89deeb77d8a6f9a557158e525cac1edeaf33ff", "avgDurationSecs": 5.676, "validDurationSecs": 0.26, "totalHrs": 0, "validHrs": 0 }, "dua": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -15205, "checksum": "7510e1763517426565510abe5376ba0b5ea3eabdf1efbba198fe198416da2ac6", "avgDurationSecs": 6.818, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "dv": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 42555, "checksum": "4bc4c02d7baee9139efa04f9b7b06c7d2fb7bdfca62a3ea071c4df678e67822f", "avgDurationSecs": 5.12, "validDurationSecs": 194.55, "totalHrs": 0, "validHrs": 0.05 }, "dyu": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -621, "checksum": "5c73eef972e528e84cc618c3ba889e8f40656fa5d1aa2daba186fef1d7359c72", "avgDurationSecs": 6.309, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ebr": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -4167, "checksum": "4d3c3313b8e1c20da0216c410510ed86d5c21c5ddaf97f959f464cd07e98d5e6", "avgDurationSecs": 5.587, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "eko": { "duration": 23220, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 10, "users": 1, "size": 122916, "checksum": "a565d2a7e72ea4b0ed67ddc43e2adacf991c5b3f785f92bb7c7d1d5df7d01033", "avgDurationSecs": 9.322, "validDurationSecs": -68.02, "totalHrs": 0.01, "validHrs": -0.02 }, "el": { "duration": 818136, "reportedSentences": 0, "validatedSentences": 134, "unvalidatedSentences": -134, "clips": 195, "users": 2, "size": 4794729, "checksum": 
"61e377fda91bf4f7a0a45b735d0bede5d140573600a4e9356b381ba1dc5ee297", "avgDurationSecs": 4.152, "validDurationSecs": 843.82, "totalHrs": 0.22, "validHrs": 0.24 }, "en": { "duration": 166547784, "reportedSentences": 125, "validatedSentences": 776, "unvalidatedSentences": 1902, "clips": 27357, "users": 853, "size": 974596000, "checksum": "a477e793f831662c00fd526453abde3c3e4791b8eda19e1a723457751f8f3c61", "avgDurationSecs": 5.296, "validDurationSecs": 55892.87, "totalHrs": 46.27, "validHrs": 15.52 }, "eo": { "duration": 33705756, "reportedSentences": 10, "validatedSentences": 29, "unvalidatedSentences": -29, "clips": 5413, "users": 5, "size": 199852637, "checksum": "b35aa988b502d8c7da49f5f34525a5b7d55948e4c6c1cd62a6cd0cd6ffb3902c", "avgDurationSecs": 6.073, "validDurationSecs": 2637.4, "totalHrs": 9.37, "validHrs": 0.73 }, "es": { "duration": 15483672, "reportedSentences": 23, "validatedSentences": 201, "unvalidatedSentences": -177, "clips": 3013, "users": 58, "size": 84652934, "checksum": "2c2943fbc5c96d5c00bd98f6fe41f358a057ae451fd4c324bff98a1b41e4f0d8", "avgDurationSecs": 4.88, "validDurationSecs": 7556.78, "totalHrs": 4.3, "validHrs": 2.09 }, "esu": { "duration": 10296, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 5, "users": 1, "size": 63255, "checksum": "3a7954a7aadc0752923b1ba0c4429ac08bd491ffdb2b8bc3eda1f067fb2eb276", "avgDurationSecs": 3.8, "validDurationSecs": -8.71, "totalHrs": 0, "validHrs": 0 }, "et": { "duration": 6525072, "reportedSentences": 3, "validatedSentences": 109, "unvalidatedSentences": 588, "clips": 1023, "users": 38, "size": 38381388, "checksum": "42e83ac54800354de55915a618b1499f87aab5228ba1e76a1e8bef80151fed21", "avgDurationSecs": 6.721, "validDurationSecs": 5220.12, "totalHrs": 1.81, "validHrs": 1.45 }, "eto": { "duration": 436392, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 130, "users": 2, "size": 2489658, "checksum": 
"a5b737b4a4f050cc83ff2219a52ad54ebb9e47a0dda86172a12086928c2e5d5d", "avgDurationSecs": 3.456, "validDurationSecs": 32.54, "totalHrs": 0.12, "validHrs": 0.01 }, "eu": { "duration": 6639552, "reportedSentences": 2, "validatedSentences": 43, "unvalidatedSentences": -42, "clips": 1144, "users": 13, "size": 38674362, "checksum": "594a7b558c3d9c11899e8f13f07e28783ea7085fc39f3dbe6613c6a8b7715874", "avgDurationSecs": 5.506, "validDurationSecs": 3868.17, "totalHrs": 1.85, "validHrs": 1.07 }, "ewo": { "duration": 561312, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 74, "users": 4, "size": 3291328, "checksum": "cdad8bdbf48360d44a20754b4e4ba8aa7ae002f009fe5a8f466a61b7bdea08c4", "avgDurationSecs": 8.767, "validDurationSecs": -64.57, "totalHrs": 0.15, "validHrs": -0.02 }, "fa": { "duration": 5487876, "reportedSentences": 50, "validatedSentences": 230, "unvalidatedSentences": -114, "clips": 1832, "users": 30, "size": 30722047, "checksum": "543fe1231d5c72b76919906b452f77fdb2f15c8fd60779fe2254030dc44ce656", "avgDurationSecs": 3.955, "validDurationSecs": 5219.35, "totalHrs": 1.53, "validHrs": 1.45 }, "fan": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -14398, "checksum": "65d9feb954bc8209ec5dd994ae7488f94e6ec8f732c0e6cd7a4ecee136edc7ff", "avgDurationSecs": 5.181, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "fi": { "duration": 1268208, "reportedSentences": 0, "validatedSentences": 17, "unvalidatedSentences": -16, "clips": 271, "users": 3, "size": 6997309, "checksum": "77756bdc318c2daabdc0abde96cd47db55bd0b29029e0044c6cdf97de763df5d", "avgDurationSecs": 4.627, "validDurationSecs": 902.76, "totalHrs": 0.35, "validHrs": 0.25 }, "fmp": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -15603, "checksum": "75f0d4b381a4adda3ccea5c8ff326454202b0f61cbc62ac92873aec398ab20c3", 
"avgDurationSecs": 7.945, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "fr": { "duration": 32820372, "reportedSentences": 50, "validatedSentences": 98, "unvalidatedSentences": -31, "clips": 6364, "users": 145, "size": 192767534, "checksum": "ab1903e88eb7eab4c27804f34937e1d2f880cc1cb2a4bdebb06f77325042a2ee", "avgDurationSecs": 5.036, "validDurationSecs": 38366.05, "totalHrs": 9.12, "validHrs": 10.66 }, "fub": { "duration": 283788, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 77, "users": 2, "size": 1648492, "checksum": "44fb5c99b7dc56bb390cffc93e94c9d92f4b7bb0e3164dcc07a1fbbe89737103", "avgDurationSecs": 6.117, "validDurationSecs": -184.62, "totalHrs": 0.08, "validHrs": -0.06 }, "fue": { "duration": 84168, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 35, "users": 1, "size": 446874, "checksum": "860c7d976b474b04331c7daade001693e9c1761ba19636b265771a91a36cfbf5", "avgDurationSecs": 5.332, "validDurationSecs": -74.52, "totalHrs": 0.02, "validHrs": -0.02 }, "fy-NL": { "duration": 6448248, "reportedSentences": 10, "validatedSentences": 0, "unvalidatedSentences": 10, "clips": 1397, "users": 13, "size": 37470001, "checksum": "73a493e91ec5f4915064327c9e80463d3f2cfe138e563e9f8939c0f2db8a8784", "avgDurationSecs": 4.852, "validDurationSecs": 2342.38, "totalHrs": 1.79, "validHrs": 0.65 }, "ga-IE": { "duration": 9711756, "reportedSentences": 8, "validatedSentences": 447, "unvalidatedSentences": 131, "clips": 1646, "users": 53, "size": 56372208, "checksum": "c168b4ac1d65c39d3ab6929fc367b4d538be48bc00a5b11a673d2f2c3cdebbb0", "avgDurationSecs": 3.813, "validDurationSecs": 16215.56, "totalHrs": 2.7, "validHrs": 4.5 }, "gej": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -21705, "checksum": "1f7870efd16f99741bcd905e464244ce381d3ce0eeaa6e7f490e0c67580d6515", "avgDurationSecs": 2.445, "validDurationSecs": 17.12, 
"totalHrs": 0, "validHrs": 0 }, "ggg": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -11538, "checksum": "bc9718ed0f4bea207db1bd5d310fb95b21a43120398ab6d78cf2b6800da7a769", "avgDurationSecs": 3.988, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "gid": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -11623, "checksum": "4462a0507beddd3b261cc280cf31eab8ad4f2923c78c0f14d155f25c3eed3186", "avgDurationSecs": 7.113, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "gig": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -11795, "checksum": "e1e3433e86c377a60c22f15c4c9829d681e462b17ecef43d8ad274b69f27851d", "avgDurationSecs": 2.992, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "giz": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -8885, "checksum": "614987927289d3380aa526d5aa2643d891d914399155067d0a357e272de313ee", "avgDurationSecs": 5.572, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "gjk": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -23997, "checksum": "e992d46d525e358225180418fa277c0f0f517ab5f8687d9974e522c4cfd3ab59", "avgDurationSecs": 4.545, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "gju": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -15088, "checksum": "073b80d594a9faf861e0ca4d07abf885d01a933845f5fdd4df4065714f8304e1", "avgDurationSecs": 3.283, "validDurationSecs": 6.56, "totalHrs": 0, "validHrs": 0 }, "gl": { "duration": 16326216, "reportedSentences": 92, "validatedSentences": 304985, "unvalidatedSentences": -436, "clips": 2933, "users": 16, "size": 
113983730, "checksum": "e33bf85d88497fa1ed5595210c9a251326c1b84df1f59f2de31ac655518c3093", "avgDurationSecs": 5.004, "validDurationSecs": 174693.57, "totalHrs": 4.53, "validHrs": 48.53 }, "gn": { "duration": 2408760, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 464, "users": 15, "size": 14045573, "checksum": "c97bc800ba00c3736e80658c681027d90edde530882544768fc9e4a4276e4249", "avgDurationSecs": 4.611, "validDurationSecs": 2094.57, "totalHrs": 0.67, "validHrs": 0.59 }, "gv": { "duration": 93168, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 25, "users": 1, "size": 536059, "checksum": "a0e4c22483d334c9b745f0cf56dad8e4395a65a67b52c73a3a63ca5f0bb4e5b8", "avgDurationSecs": 5.782, "validDurationSecs": -49.33, "totalHrs": 0.03, "validHrs": -0.01 }, "gwc": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -12679, "checksum": "efd083b343f3f52d11b21a749e837b2d39afc35abf3f125ba59dfba024a1f14a", "avgDurationSecs": 5.821, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "gwt": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -16839, "checksum": "fea63dc5528a2639f0d20c17fab85fd64e20a98c6cf7ce92cd0dcd56c269dbd8", "avgDurationSecs": 5.722, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "gya": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -13354, "checksum": "8676f1ddb77395ec0d4900e61dce291a8f3f28ed7b9909d3c37ad75022b751ce", "avgDurationSecs": 5.182, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ha": { "duration": 638100, "reportedSentences": 0, "validatedSentences": 2, "unvalidatedSentences": -2, "clips": 159, "users": 2, "size": 3687674, "checksum": "21b27ecfa4ed161da6007d1900acbdea2216a6553a33aa53431733dc2e9d07ff", "avgDurationSecs": 4.351, 
"validDurationSecs": 190.85, "totalHrs": 0.18, "validHrs": 0.05 }, "haz": { "duration": 5256, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 1, "users": 1, "size": 12618, "checksum": "b5944f1622600c6474930fb651ba6cedb8ef1efd32948fe49d7a6b2710e8226b", "avgDurationSecs": 4.555, "validDurationSecs": 0.7, "totalHrs": 0, "validHrs": 0 }, "he": { "duration": 0, "reportedSentences": 0, "validatedSentences": 16, "unvalidatedSentences": -8, "clips": 0, "users": 0, "size": 60357, "checksum": "2a91a9c2d958efb9182c7077b343d5f049c99eb8cd9bc011487569e0455e922c", "avgDurationSecs": 4.593, "validDurationSecs": 4101.43, "totalHrs": 0, "validHrs": 1.14 }, "hem": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -19780, "checksum": "04ea4c4c436e486e1023f492b172baa9839cf466e971f904289cef967fbc8eb9", "avgDurationSecs": 6.051, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "hi": { "duration": 10631592, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 1, "clips": 2124, "users": 4, "size": 62586912, "checksum": "2cfba22d28fec767bc8b786da9e22966cc9a703676854d47723c7c0c482518d7", "avgDurationSecs": 5.017, "validDurationSecs": 69.2, "totalHrs": 2.95, "validHrs": 0.02 }, "hno": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -34391, "checksum": "999869fa27bf0aaa04c4ba1a44702e1f5c2105bf7644f1d3e912205b7de54861", "avgDurationSecs": 4.009, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "hsb": { "duration": 568944, "reportedSentences": 0, "validatedSentences": 2, "unvalidatedSentences": -1, "clips": 69, "users": 2, "size": 3367004, "checksum": "c11d0d6356bf0c886d1103e02627a66533de4069678e7b52950a1cb176e9199f", "avgDurationSecs": 6.37, "validDurationSecs": 147.18, "totalHrs": 0.15, "validHrs": 0.04 }, "ht": { "duration": 0, "reportedSentences": 0, "validatedSentences": 
0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -1, "checksum": "0d10797fa8f3be5168535495be6b34ba4ee583800305d4f7430ee223ebb97f37", "avgDurationSecs": 5.566, "validDurationSecs": 16.7, "totalHrs": 0, "validHrs": 0.01 }, "hu": { "duration": 1585440, "reportedSentences": 0, "validatedSentences": 64, "unvalidatedSentences": -63, "clips": 256, "users": 3, "size": 9769413, "checksum": "3efa8e3df2c1e27dbbddc7fc84ed4b4765d7a8b88d57080b45bf90fcf62e36ea", "avgDurationSecs": 5.541, "validDurationSecs": 29992.83, "totalHrs": 0.44, "validHrs": 8.33 }, "hux": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -25750, "checksum": "0cdadf2e8fa73680e4b4afcffef020366a8c6cac4707bf41fdea230f2ae1d35b", "avgDurationSecs": 3.876, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "hy-AM": { "duration": 37368, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 7, "users": 2, "size": 195262, "checksum": "3469b9deae30a757883b45812c7ed35b69c79187ece74fae9cf2cac3e7fa2010", "avgDurationSecs": 5.392, "validDurationSecs": 156.16, "totalHrs": 0.01, "validHrs": 0.04 }, "ia": { "duration": 53820, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 38, "clips": 10, "users": 0, "size": 332483, "checksum": "01fe681194c6779401319a9a312889e9d53d27f4f284fbb239286a5227b30951", "avgDurationSecs": 4.202, "validDurationSecs": 64.27, "totalHrs": 0.02, "validHrs": 0.01 }, "ibb": { "duration": 4500, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 1, "users": 1, "size": 11036, "checksum": "d14ec0f9ac2a6f5cbc9455a2d7130ebc0d0935030aa354c89ef2295c65b255e9", "avgDurationSecs": 12.894, "validDurationSecs": 1182.84, "totalHrs": 0, "validHrs": 0.33 }, "id": { "duration": 258876, "reportedSentences": 1, "validatedSentences": 0, "unvalidatedSentences": 4, "clips": 51, "users": 8, "size": 1497596, "checksum": 
"abaa6f144cfd71d2144303b2019b4156672e39b10b8e4b0a9555471abd373e3e", "avgDurationSecs": 4.004, "validDurationSecs": 179.92, "totalHrs": 0.07, "validHrs": 0.05 }, "ig": { "duration": 15784488, "reportedSentences": 0, "validatedSentences": 766, "unvalidatedSentences": -736, "clips": 2877, "users": 37, "size": 92284177, "checksum": "cb99fab21b97ef36c43052b60adba30d81daba5ba44a0d15bd027af70249457c", "avgDurationSecs": 5.473, "validDurationSecs": 1379.26, "totalHrs": 4.38, "validHrs": 0.38 }, "ipk": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -7893, "checksum": "de70e5900a5aa4d1a8debeaa2ab8b81f41f2c0b496442b0b62eefa6ac1db3c40", "avgDurationSecs": 7.938, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "is": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -221, "checksum": "3e207b41d6eabafe87aae769e8a92e04eae4629727c2da45a1a4b29d1f71ff54", "avgDurationSecs": 6.425, "validDurationSecs": 32.13, "totalHrs": 0, "validHrs": 0.01 }, "it": { "duration": 7355484, "reportedSentences": 0, "validatedSentences": 91, "unvalidatedSentences": -71, "clips": 1185, "users": 18, "size": 43532942, "checksum": "1371f22127b7ff142c10084ef36278631dd50c39564254729e568d9a19782ec5", "avgDurationSecs": 5.431, "validDurationSecs": 2003.01, "totalHrs": 2.04, "validHrs": 0.55 }, "ja": { "duration": 394571808, "reportedSentences": 10, "validatedSentences": 4998, "unvalidatedSentences": -557, "clips": 84354, "users": 1955, "size": 2310244938, "checksum": "85c714c97705123d0abbc0bd4715ceefcd1388b9ce6428bc0aa1cfd635754dd1", "avgDurationSecs": 4.564, "validDurationSecs": 470698.66, "totalHrs": 109.6, "validHrs": 130.75 }, "jgo": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -9984, "checksum": "af72614e0dc3c4cc3680a734f56267036c6bcd96142f7be1badbb4257a49627f", 
"avgDurationSecs": 6.695, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "jqr": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -11351, "checksum": "196799a74d3b72510f615fd7ae28df31eb8d189ada4ed46e0e9bc768abd25b12", "avgDurationSecs": 5.915, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ka": { "duration": 2709540, "reportedSentences": 0, "validatedSentences": 49, "unvalidatedSentences": 105, "clips": 335, "users": 8, "size": 15688029, "checksum": "c5af491be4210fdda49378bfe330bced7a26c0d4250a15826947b93610146a4c", "avgDurationSecs": 5.206, "validDurationSecs": 2534.02, "totalHrs": 0.75, "validHrs": 0.71 }, "kab": { "duration": 224100, "reportedSentences": 0, "validatedSentences": 1, "unvalidatedSentences": 2, "clips": 65, "users": 6, "size": 1208404, "checksum": "e1103bdf8cc84a32f499e8d9a17d0b19f21ec2a7a9cb3d245678db2a9bc55149", "avgDurationSecs": 3.342, "validDurationSecs": 156.04, "totalHrs": 0.07, "validHrs": 0.04 }, "kbd": { "duration": 269879832, "reportedSentences": 969, "validatedSentences": 6706, "unvalidatedSentences": 0, "clips": 41940, "users": 75, "size": 1488305470, "checksum": "9b163af0d093d3f5ca9b9b951e7070edb9b3390121392625ee8f58166ad31fe7", "avgDurationSecs": 6.234, "validDurationSecs": 241824.24, "totalHrs": 74.97, "validHrs": 67.18 }, "kdh": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -22200, "checksum": "5ea464a0bdbe326b2387f543219a0041c01542d14c73fc43f3aef755098aff3c", "avgDurationSecs": 2.671, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "khw": { "duration": 81216, "reportedSentences": 0, "validatedSentences": 5, "unvalidatedSentences": -1, "clips": 11, "users": 1, "size": 462702, "checksum": "70e00b56fb6b888e406d195247bad0983a437a5002f7d43d8eade7189d4b27cc", "avgDurationSecs": 7.365, "validDurationSecs": 103.28, "totalHrs": 0.03, "validHrs": 0.02 }, "kk": { 
"duration": 58068, "reportedSentences": 0, "validatedSentences": 5, "unvalidatedSentences": -5, "clips": 12, "users": 2, "size": 347782, "checksum": "cdc3e608667382d61aaf3b729170d701c87aab9dfe830eb4d63209e0169e4afd", "avgDurationSecs": 4.932, "validDurationSecs": 147.25, "totalHrs": 0.01, "validHrs": 0.04 }, "kln": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -54512, "checksum": "f60908bb36e29ad4f747b8875c2c85ba4e2935a197e27cd43c2dca1b71d266c9", "avgDurationSecs": 4.705, "validDurationSecs": 18.82, "totalHrs": 0, "validHrs": 0.01 }, "kls": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -8792, "checksum": "0294f5272cd1f2757c79e4b71f25c659663f19d87840532241b86527250b7649", "avgDurationSecs": 3.794, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "kmr": { "duration": 1960812, "reportedSentences": 16, "validatedSentences": 13, "unvalidatedSentences": -11, "clips": 336, "users": 6, "size": 10944662, "checksum": "de85490aed2ac9bd1059fe2f8e69c73aa1bc9f1121dca90a36052582da48a0e5", "avgDurationSecs": 4.141, "validDurationSecs": 4574.91, "totalHrs": 0.55, "validHrs": 1.28 }, "ko": { "duration": 6432408, "reportedSentences": 0, "validatedSentences": 46, "unvalidatedSentences": -46, "clips": 1307, "users": 34, "size": 37514668, "checksum": "82ecf582beb0ea54ea81c75ef42a17ca1bc74ba4231764cadab87849e1c67f07", "avgDurationSecs": 5.27, "validDurationSecs": 940.82, "totalHrs": 1.78, "validHrs": 0.26 }, "ksf": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -14773, "checksum": "e5c2f4171648dd9e8a66e8023d5a9e56ea61fc49864fff82ca9516776d9f99d9", "avgDurationSecs": 9.315, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "kvx": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, 
"size": -4512, "checksum": "9a96384afff97ae5082f1a0c87f9cf0b53a8330138262b0568d668872fc53679", "avgDurationSecs": 5.322, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "kw": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -12046, "checksum": "5aa9cc98cbcf49c06eb88af972ef563e727f06c8a975f0b49d93c63607e6ad6d", "avgDurationSecs": 4.139, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "kxp": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -13452, "checksum": "4f8988ad38bc6c0ef5dc05b8cb0b30e25ce8da6c306d3a2534718bcbeb516a7f", "avgDurationSecs": 3.403, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ky": { "duration": 17532, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 5, "users": 1, "size": 140872, "checksum": "a0f04ffe114d62699d006bf112c35d79ce7689d7b1b6683ca2fb0df7ac25e970", "avgDurationSecs": 4.553, "validDurationSecs": 141.45, "totalHrs": 0.01, "validHrs": 0.04 }, "lg": { "duration": 37476, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 5, "clips": 5, "users": 1, "size": 262276, "checksum": "f60e779e1a0852718866540cd6cd09b5fd7b05cd5db2839b01708eb5c865ce7b", "avgDurationSecs": 5.784, "validDurationSecs": 41.37, "totalHrs": 0.01, "validHrs": 0.01 }, "lij": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 1093, "checksum": "7c9213de879b7214f580c6421b855b5f731b7b757e96bc80160c78a9ec42ce67", "avgDurationSecs": 3.923, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "lo": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 81, "checksum": "a09fcebc44a72c0cca612a5394902a5c5dac06e190da915df06b3a0bf2cc9fa0", "avgDurationSecs": 6.532, "validDurationSecs": 39.19, "totalHrs": 0, 
"validHrs": 0.01 }, "lrk": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 1562, "checksum": "bdb71407d5327ccbf5ab12239dca33c77dd6d6a12d85a366494f89367fda0ca3", "avgDurationSecs": 3.58, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "lss": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -11840, "checksum": "f9aba4d8963a6aa930762bb41dc1a12b8278860044600ed9e136b2c9a156209e", "avgDurationSecs": 3.472, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "lt": { "duration": 317412, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 65, "users": 3, "size": 1870938, "checksum": "6b14a22ba834499d6918a05b6f3af067f34f2d88a95890ad07569715aaae4947", "avgDurationSecs": 5.125, "validDurationSecs": 1540.53, "totalHrs": 0.09, "validHrs": 0.42 }, "ltg": { "duration": 924408, "reportedSentences": 1, "validatedSentences": 83, "unvalidatedSentences": 443, "clips": 140, "users": 5, "size": 5296505, "checksum": "be380d8e082f5bf47659b57b62c5bd34b426633464aa22f91a2a30c30c707eee", "avgDurationSecs": 4.79, "validDurationSecs": 1501.29, "totalHrs": 0.25, "validHrs": 0.41 }, "lua": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -9564, "checksum": "4f209c15a0ad7c21fcbb0840985b9439e98c6a074a74bfbcf774b992c2265c33", "avgDurationSecs": 8.238, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "luo": { "duration": 30852, "reportedSentences": 0, "validatedSentences": -9987, "unvalidatedSentences": 0, "clips": 10, "users": 1, "size": 152706, "checksum": "88eb400a20c69e9f2be64afc31058da8cc5aa947759c50b1692551562b1b8c71", "avgDurationSecs": 5.205, "validDurationSecs": -5.21, "totalHrs": 0.01, "validHrs": -0.01 }, "lv": { "duration": 5741712, "reportedSentences": 1, "validatedSentences": 151, "unvalidatedSentences": -88, 
"clips": 905, "users": 66, "size": 33612663, "checksum": "4c3f8684153e45f57f249cec4c7509723d0a0d54791aec22f65faf13f0e3f2bb", "avgDurationSecs": 4.81, "validDurationSecs": 5586.17, "totalHrs": 1.6, "validHrs": 1.55 }, "mau": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -18195, "checksum": "9fad9bf14acb1702c1dad989373134a59527bc856cb646c14fab5ba68d8f5da5", "avgDurationSecs": 6.216, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "mbo": { "duration": 90108, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 20, "users": 1, "size": 511218, "checksum": "a2dc5f7ffa82e73d1adddffc372eaac7352ad5b11da6c51ac53bc85dfd9ef728", "avgDurationSecs": 5.513, "validDurationSecs": 400.99, "totalHrs": 0.03, "validHrs": 0.11 }, "mcf": { "duration": 189612, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 69, "users": 1, "size": 1098885, "checksum": "2555d944cbce5510565439960845916a40b02de78d389ec58b271b98df975aec", "avgDurationSecs": 2.656, "validDurationSecs": 6.34, "totalHrs": 0.06, "validHrs": 0 }, "mcn": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -28951, "checksum": "98814c19f4f6e35d188f95d96196847bd9e10d15283ce6de237f7d7945605deb", "avgDurationSecs": 4.214, "validDurationSecs": 4.21, "totalHrs": 0, "validHrs": 0 }, "mcx": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -10523, "checksum": "6876d86e0d40bbabe5bc77b2f459d86ff19363646185be82b5d8d667db1fb66b", "avgDurationSecs": 6.611, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "mdd": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -9065, "checksum": "143398b834b4d612570753e124c3b8fef58d8343f6c4d05b463c6c5b14667bd8", "avgDurationSecs": 
6.538, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "mdf": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -174, "checksum": "705c18c73d1217495c84bcdd6df8953d6d6fe0244665b8f807c4af7c416525ef", "avgDurationSecs": 5.25, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "mgg": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -12897, "checksum": "017f84fba3b42e99cbcf1939b7f5264644b54749cc1b1b96d32afc7ba51b9fa6", "avgDurationSecs": 7.581, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "mhk": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -2617, "checksum": "5b79a388485e539bcb27d890f5b85a1177b4549eda7a11064624f596f4a53c03", "avgDurationSecs": 6.149, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "mhr": { "duration": 25020, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 5, "users": 1, "size": 167285, "checksum": "21c80f3897057743d21e15f591d413f4348aa398905c066efeb00a4ae1861a6b", "avgDurationSecs": 4.623, "validDurationSecs": 75.74, "totalHrs": 0.01, "validHrs": 0.02 }, "mk": { "duration": 315684, "reportedSentences": 0, "validatedSentences": 14, "unvalidatedSentences": -13, "clips": 65, "users": 2, "size": 1853089, "checksum": "053b8c57582d47d876ac6ea4632704a6f0749279c2257ba46b864e42cfead76c", "avgDurationSecs": 4.809, "validDurationSecs": 5171.14, "totalHrs": 0.09, "validHrs": 1.43 }, "mki": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -9462, "checksum": "a9c2835deea7cad075681efd6f30fe65cb3139779ee119ba80bcdfb14c89de51", "avgDurationSecs": 3.204, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ml": { "duration": 604944, "reportedSentences": 0, "validatedSentences": 0, 
"unvalidatedSentences": 0, "clips": 95, "users": 3, "size": 2783236, "checksum": "09bad714c75c67807062977871629e6aed94df29c196c3edd6e9230b28fb20a2", "avgDurationSecs": 4.242, "validDurationSecs": 69.17, "totalHrs": 0.16, "validHrs": 0.02 }, "mn": { "duration": 58140, "reportedSentences": 0, "validatedSentences": 3, "unvalidatedSentences": -1, "clips": 13, "users": 3, "size": 298473, "checksum": "384488bb777f7478ec8ad9467e402a8864b04a38aee2d8da728c887e24cd51a5", "avgDurationSecs": 5.254, "validDurationSecs": 259.16, "totalHrs": 0.02, "validHrs": 0.08 }, "mr": { "duration": 41328, "reportedSentences": 0, "validatedSentences": 2, "unvalidatedSentences": -2, "clips": 6, "users": 1, "size": 247436, "checksum": "3e4cdbb5d87e20a12bb888459baba7a8f41b2e635c2884da6cfad34c8a165b9b", "avgDurationSecs": 6.2, "validDurationSecs": 2.82, "totalHrs": 0.01, "validHrs": 0 }, "mrj": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -3987, "checksum": "a01e3933ee8dcc0b3c211c7867e29f775164505ffd7b00bd0242dd29509819c3", "avgDurationSecs": 4.193, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ms": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 8, "clips": 0, "users": 0, "size": -3548, "checksum": "20ff7d48d859b7bf3f2b51fed187c349efde41d9f03bc99049791df447ecebf4", "avgDurationSecs": 6.155, "validDurationSecs": 30.78, "totalHrs": 0, "validHrs": 0.01 }, "mse": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -7257, "checksum": "77b8d72d1483f1a82384abbde1971c2e7ddc5cf816e2230358b2010706185f9e", "avgDurationSecs": 8.323, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "mt": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -12693, "checksum": "f6be17867ef174e2235020c7fb7f997fb8530dfbbde9d84a23bb3816ee0a0d85", 
"avgDurationSecs": 4.754, "validDurationSecs": 9.51, "totalHrs": 0, "validHrs": 0 }, "mua": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 3576, "checksum": "1f30809c317d59d91a3c7060140e5500e212a30d240aef8ebfcb802ef1a679e5", "avgDurationSecs": 4.457, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "mug": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -6108, "checksum": "61707c1c6eb4ad4d852e43a142e108600c37feb3d1ba3c565e07c8fb1473bd30", "avgDurationSecs": 15.296, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "mve": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -2336, "checksum": "0935f7b6789e38c6e935bff6a15922f7875d44a208f789b7e02dc6545e52bc6e", "avgDurationSecs": 4.98, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "mvy": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -32328, "checksum": "298600a827aec8372df257483745fe3093c54509d1ae657995f5e382dad8b7cb", "avgDurationSecs": 4.872, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "mxu": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -27492, "checksum": "bc7d32fbc9d4c35e91d2797ce5eb7a8b4d26ee805b4f3f4b67eec6f28ed8f3d0", "avgDurationSecs": 6.503, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "myv": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -2895, "checksum": "13aadf802fc4548ef59760746fbc04c9a1c9e9705db95169602e338745bb6064", "avgDurationSecs": 5.784, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "nan-tw": { "duration": 862884, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, 
"clips": 295, "users": 7, "size": 5051566, "checksum": "fe5cd5420acf6d284ff9cd006c475844ef8f68874863d7c09c2cb6d75fd8d513", "avgDurationSecs": 2.644, "validDurationSecs": 388.05, "totalHrs": 0.24, "validHrs": 0.11 }, "nb-NO": { "duration": 1350972, "reportedSentences": 0, "validatedSentences": 19, "unvalidatedSentences": -18, "clips": 295, "users": 47, "size": 7948838, "checksum": "7e02d3de626e49f7fec04ffb61e7215a2c4c16c2c4c943abfa5864ed994cce44", "avgDurationSecs": 4.193, "validDurationSecs": 1535.53, "totalHrs": 0.37, "validHrs": 0.43 }, "ncx": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -14321, "checksum": "0e68bdaab632f8183451a1a44ad9790ecab34678fbe9ad22c9ffee0fb3ca76bf", "avgDurationSecs": 4.457, "validDurationSecs": 4.46, "totalHrs": 0, "validHrs": 0 }, "ne-NP": { "duration": 46620, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 10, "users": 2, "size": 269830, "checksum": "d7860601d5d1fdcd2993e90efeac9730ca0f40e46be9516f52f0e17403b213d5", "avgDurationSecs": 4.111, "validDurationSecs": 3.85, "totalHrs": 0.01, "validHrs": 0 }, "nhi": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -549, "checksum": "244897ae41792379b265414bbaf44bffbccca8e24dd0487c8646862137dafd94", "avgDurationSecs": 5.081, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "nl": { "duration": 7849332, "reportedSentences": 6, "validatedSentences": 313, "unvalidatedSentences": 732, "clips": 1757, "users": 12, "size": 46092518, "checksum": "21eb500aa204e5084516690f5d271e1502c1b3c5f15580ace2e114f75eb6dbf2", "avgDurationSecs": 4.371, "validDurationSecs": 1867.51, "totalHrs": 2.18, "validHrs": 0.51 }, "nla": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -13922, "checksum": 
"63ada318ffc0fbf732daed24b8fdd805fa05d7face5c1c5c1b8d66a18bc4e831", "avgDurationSecs": 7.755, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "nlv": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -8815, "checksum": "1d501ae8bc18eaec8ce2835947f36f0c6a9d3323057b7415ce2fa00e00ae0791", "avgDurationSecs": 6.272, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "nmg": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -12207, "checksum": "42a73b308a7cc5bdbd0f62c73c9ad6c159ff34a87ae4a37ba67a38e5fd0dd41e", "avgDurationSecs": 6.401, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "nmz": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -21738, "checksum": "46566952dd93ab8bb6cd9563df118dc892b434554ecaec441218bddfc7830e0c", "avgDurationSecs": 2.959, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "nn-NO": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 127, "checksum": "7d63636ad50966fbed2f6d092060a4a0e95735df571726f2faed08c336b1ca8a", "avgDurationSecs": 4.387, "validDurationSecs": 43.87, "totalHrs": 0, "validHrs": 0.01 }, "nnh": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -22684, "checksum": "babd44c2f72627c3a056b231c56db4202e847d9634009ed258b84f1d097bc1ba", "avgDurationSecs": 10.567, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "nr": { "duration": 41220, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 5, "users": 1, "size": 244036, "checksum": "56bf8efe6e787631d2167e53ea0f75d5d8d23574400c1ded2dee82f52a1c513c", "avgDurationSecs": 8.671, "validDurationSecs": 0, "totalHrs": 0.01, "validHrs": 0 }, "nso": { "duration": 0, 
"reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -4, "checksum": "73e9b4a6e2b9eab02599ace56ee43a4dbe7acaeb4bf184f546d5462a60e73d65", "avgDurationSecs": 4.63, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "nyu": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -4714, "checksum": "3bb330e84c386c143ac89e39312aadbc3742b2e53b4f8411f22d18b535b73df2", "avgDurationSecs": 10.331, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "oc": { "duration": 49284, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 10, "users": 2, "size": 283873, "checksum": "e0331248b393bf9364ece8f85c0ecc71d33de74652847bcb4c143cdcccb186a2", "avgDurationSecs": 4.871, "validDurationSecs": 4.99, "totalHrs": 0.01, "validHrs": 0 }, "odk": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -6206, "checksum": "be42f35dacc8897e6d6adac8e837baa719bb06bbfe31c9fe43193174451b8ea6", "avgDurationSecs": 6.369, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "or": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -6014, "checksum": "0f23ae006a91ba4ef4a980488409c233cb8af7d9bbf397d69368182b569674b1", "avgDurationSecs": 5.575, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "oru": { "duration": 900396, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 114, "users": 1, "size": 5327698, "checksum": "b2c9859c26e48cd41a5a54439c6d570676e2399015b6fd591646093d4e05617f", "avgDurationSecs": 7.443, "validDurationSecs": 1478.98, "totalHrs": 0.25, "validHrs": 0.41 }, "os": { "duration": 62280, "reportedSentences": 1, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 11, "users": 1, "size": 362992, "checksum": 
"ab156c9fbdce44f05645cbfeb8814d7169bbd6ac5f7f67c6620eace2ec4231ba", "avgDurationSecs": 5.523, "validDurationSecs": 23.56, "totalHrs": 0.02, "validHrs": 0.01 }, "pa-IN": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -3481, "checksum": "51e1408f6a42373d70e5c58619df034be28766a7204b2734d55d4c82240ce543", "avgDurationSecs": 5.112, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "pcm": { "duration": 164160, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 40, "users": 2, "size": 950936, "checksum": "68d6dea59c9ee3f8eb160acc202796feefb2a580413055b740e562a2a11d85fc", "avgDurationSecs": 6.03, "validDurationSecs": -54.85, "totalHrs": 0.05, "validHrs": -0.02 }, "phl": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -34900, "checksum": "4aaf371feffca8920d87fccab54c5b30110428b624fbf5aa6df513b488798f3d", "avgDurationSecs": 4.981, "validDurationSecs": 159.4, "totalHrs": 0, "validHrs": 0.04 }, "phr": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -19082, "checksum": "f9105e948fa3db9fd4b6cb7bd31d7f11851197874fef8268e0b64f6571e7f16d", "avgDurationSecs": 4.057, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "pl": { "duration": 3121344, "reportedSentences": 0, "validatedSentences": 75, "unvalidatedSentences": -74, "clips": 638, "users": 4, "size": 18083796, "checksum": "1070367840751023da2a7b99c964ddfb1bd00df77f42b9c9e546341c1520cf04", "avgDurationSecs": 4.573, "validDurationSecs": 1224.46, "totalHrs": 0.87, "validHrs": 0.34 }, "plk": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -14307, "checksum": "54b9322f0140b9d249ea831a54d67c1a5287b7c32df06944c06cf830c8f3e7d4", "avgDurationSecs": 5.026, "validDurationSecs": 55.28, 
"totalHrs": 0, "validHrs": 0.02 }, "prq": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -23853, "checksum": "ad6dcb53dbe2e7b600d7748b77db0238ebda964a1bdf225bdb95986ce3b79a1b", "avgDurationSecs": 4.862, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ps": { "duration": 9437870268, "reportedSentences": 3090, "validatedSentences": 221426, "unvalidatedSentences": -18548, "clips": 2300018, "users": 5171, "size": 50736520892, "checksum": "c2dc2c2b16e6ddfe831bd3c92c24761767fac439be809d26c0c7bbb55b3e18b4", "avgDurationSecs": 4.14, "validDurationSecs": 3216826.73, "totalHrs": 2621.63, "validHrs": 893.56 }, "pt": { "duration": 1518804, "reportedSentences": 3, "validatedSentences": 69, "unvalidatedSentences": -51, "clips": 306, "users": 17, "size": 8669477, "checksum": "f326e333e79da755db71a0c66986c87729ad4a9d5b1630758e5fcc09dfa2d5af", "avgDurationSecs": 4.188, "validDurationSecs": 2618.59, "totalHrs": 0.42, "validHrs": 0.73 }, "pua": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -4385, "checksum": "de0c8d62ccc4b94c825efc9dbb360145a74d33b1ae7cef91ba064562a9d0c685", "avgDurationSecs": 4.919, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "pwn": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -17452, "checksum": "eefcdc85a1ef010bd40d4f03afa4ab837c54348584ecbd9fc389377d564154fb", "avgDurationSecs": 4.873, "validDurationSecs": 9.75, "totalHrs": 0, "validHrs": 0 }, "qup": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -14616, "checksum": "46e1197dc29db5ba3a884162c74dce7d54cfc3a893e5a06dd548443bac96cd22", "avgDurationSecs": 5.942, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "qur": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, 
"unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -32905, "checksum": "f10ccb583f4878442eb4a5e8360e9646c427bbc9086d7e05b5fda7926fc999ad", "avgDurationSecs": 3.47, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "qus": { "duration": 5940, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 1, "users": 1, "size": 32481, "checksum": "237121fcf9e890d51858bed67c6c5b586f1a2c6e46c171a3378a1bb18f6baf42", "avgDurationSecs": 3.839, "validDurationSecs": 2, "totalHrs": 0.01, "validHrs": 0 }, "qux": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -14993, "checksum": "ca676f40d62b0b50d21aebda5a610c10147ebf9a5d773490d23df80ddacff2ce", "avgDurationSecs": 5.9, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "quy": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 215, "checksum": "14cb3d8261136c2ace09b7d83a7fc1dcfd0da22dea239b40af1e9dc175bc7c7a", "avgDurationSecs": 8.637, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "qva": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 5884, "checksum": "1ecd84244857b93d4dda453a25651a68ee58ef415180043cce8b9949069a4b00", "avgDurationSecs": 4.404, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "qvi": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 2380, "checksum": "34799bc58736d1665788583381e2e2547bc03b4b9df9a49a7c587dfec0356765", "avgDurationSecs": 4.427, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "qvj": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -6416, "checksum": "bfe433ba3421daf0a5804f8f518e655ea5fa2b85a09953d70512f13aaffbc329", "avgDurationSecs": 5.961, "validDurationSecs": 
0, "totalHrs": 0, "validHrs": 0 }, "qvl": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -12791, "checksum": "f4559bce00466942b0a8e55590927112e8cf351bc45d8419a1c571dc0e6e84a4", "avgDurationSecs": 3.98, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "qwa": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -21564, "checksum": "88a62630ce0768c38b6a0f7470a3232266794c6ae5cf84ba7fa9d6b2221e34f6", "avgDurationSecs": 4.966, "validDurationSecs": 9.93, "totalHrs": 0, "validHrs": 0 }, "qws": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -2107, "checksum": "266632544eb4aa83b13d013c0c97c020365f5e7e4d239ba8263f5414ace24090", "avgDurationSecs": 4.164, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "qxa": { "duration": 17028, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 5, "users": 1, "size": 86666, "checksum": "8b23b4d8ef2a7cddd582e412e7a8ba13c90058c890a6c34f2c7d314ffc322cbc", "avgDurationSecs": 4.348, "validDurationSecs": -4.52, "totalHrs": 0.01, "validHrs": -0.01 }, "qxp": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -23444, "checksum": "31afe9f6c1aa3241a72436d14c9e0a5f4571b110f6d8ccd8a1d01052cfee99ab", "avgDurationSecs": 4.636, "validDurationSecs": 4.64, "totalHrs": 0, "validHrs": 0 }, "qxt": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -2654, "checksum": "27dcfd0f8307753b66bfb50406b1b17885482fb9d03211d6654ac60c750613cf", "avgDurationSecs": 4.29, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "qxu": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 
-29266, "checksum": "057cbb265953728900cfab48882a24391502e5bff4f9c33580d0238cef5d5959", "avgDurationSecs": 4.349, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "qxw": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -13839, "checksum": "e2105521891aad5524d884b3768e5e584819e1df9035f3bb096de930ff64849b", "avgDurationSecs": 5.256, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "rm-sursilv": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 2120, "checksum": "fe04ea27af47298e1ec01b64d7e36f5f87c496d4b707909a6066f0a671ca3426", "avgDurationSecs": 5.438, "validDurationSecs": 16.31, "totalHrs": 0, "validHrs": 0.01 }, "rm-vallader": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 1186, "checksum": "8b1c3bcd86f0dddd121218222fd4d9e8b88a9f38181f601671effc328db77b67", "avgDurationSecs": 5.832, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ro": { "duration": 78732, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 16, "users": 4, "size": 449092, "checksum": "485f929bb94328403e43bfd9e930e7815b5fc7b1cdf776b3cdac27ba5c425a3c", "avgDurationSecs": 4.041, "validDurationSecs": 268.96, "totalHrs": 0.02, "validHrs": 0.07 }, "rof": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -12249, "checksum": "42454b644bd2374c5ec0424f0b255391090b9cd7ae157a14377cd352cdea3644", "avgDurationSecs": 4.037, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ru": { "duration": 3376692, "reportedSentences": 4, "validatedSentences": 19, "unvalidatedSentences": 3, "clips": 537, "users": 45, "size": 19515208, "checksum": "9a2ed32a0574f74f505cd7740a599f0b9edc9f52ba1e7d6624b66f258db4c0ea", "avgDurationSecs": 5.19, "validDurationSecs": 5396.4, 
"totalHrs": 0.94, "validHrs": 1.5 }, "rup": { "duration": 134460, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 32, "users": 3, "size": 790522, "checksum": "31b2483416f226ff7dc6b52ce3a70c781e6627cab96e5f7090c0331a4316fd8c", "avgDurationSecs": 5.972, "validDurationSecs": 71.01, "totalHrs": 0.04, "validHrs": 0.02 }, "rw": { "duration": 58896, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 1, "clips": 12, "users": 3, "size": 1208534, "checksum": "b61528d19c78687020b28328c916f0a517ff57fbb17749e30ce6f5f43bdee834", "avgDurationSecs": 5.007, "validDurationSecs": 64.09, "totalHrs": 0.02, "validHrs": 0.02 }, "sah": { "duration": 60408, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 10, "users": 2, "size": 333110, "checksum": "96e747ace1cbfd7d97775f8ecd454ea53683a6c521ccdbee2bfe8952714cfd0b", "avgDurationSecs": 6.335, "validDurationSecs": 80.27, "totalHrs": 0.02, "validHrs": 0.02 }, "sat": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -268, "checksum": "016b9d7b4488b688147e480956d81ff41584cb61e31b14904130fca4505e95cf", "avgDurationSecs": 4.478, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "sbn": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -12691, "checksum": "5184a7f673b0020c469e718b17213d18f6e40f363334a3c27fe823436debce0f", "avgDurationSecs": 3.527, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "sc": { "duration": 189792, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 40, "users": 6, "size": 1095770, "checksum": "1c16219cc05fc6e18c1d77709ab43c8b0e37723332c573ce12ddfe772d848e79", "avgDurationSecs": 4.698, "validDurationSecs": 104.89, "totalHrs": 0.05, "validHrs": 0.03 }, "scl": { "duration": 12564, "reportedSentences": 0, "validatedSentences": 0, 
"unvalidatedSentences": 0, "clips": 3, "users": 1, "size": 22268, "checksum": "bab306d96c7d7d0ac7eaf676270e92194df0f33da5c23bcf35835e73f6364531", "avgDurationSecs": 4.116, "validDurationSecs": 0.21, "totalHrs": 0, "validHrs": 0 }, "sd": { "duration": 18539784, "reportedSentences": 0, "validatedSentences": 2, "unvalidatedSentences": -2, "clips": 4737, "users": 2, "size": 104429688, "checksum": "b65a12ef79e5458d9432027cbfa5472b070bb1325bc177343b3f761aac8fda94", "avgDurationSecs": 4.102, "validDurationSecs": 7.1, "totalHrs": 5.15, "validHrs": 0 }, "sei": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -12312, "checksum": "7f56cbcb9f0c95d371b2cb863488d914d11bc03f01eae8ef2c45dd77056d4cf9", "avgDurationSecs": 4.549, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "sk": { "duration": 7215804, "reportedSentences": 3, "validatedSentences": 356, "unvalidatedSentences": -356, "clips": 1542, "users": 3, "size": 42506993, "checksum": "9228980a35d828225c66215ac342840b633d62927afd27de46b2678eacd361b9", "avgDurationSecs": 4.282, "validDurationSecs": 3561.17, "totalHrs": 2, "validHrs": 0.99 }, "skr": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 523, "checksum": "f8cb9841fc74b8f954a189d24492bbecdb5a26610c0db700bd8abe3898373576", "avgDurationSecs": 4.164, "validDurationSecs": 8.33, "totalHrs": 0, "validHrs": 0 }, "sl": { "duration": 104760, "reportedSentences": 0, "validatedSentences": 3, "unvalidatedSentences": -2, "clips": 25, "users": 2, "size": 626811, "checksum": "044c8a2e25f1c3f08c203c9f57caa1cad227ba31ec46cdf615af7631e6253d6f", "avgDurationSecs": 4.001, "validDurationSecs": 167.83, "totalHrs": 0.02, "validHrs": 0.05 }, "sq": { "duration": 603036, "reportedSentences": 10, "validatedSentences": 0, "unvalidatedSentences": 1, "clips": 101, "users": 5, "size": 3545748, "checksum": 
"d1598491e686237a7e9c8b688bfd1d1203e3c08f71395e623fa030bf2810b256", "avgDurationSecs": 5.12, "validDurationSecs": 533.97, "totalHrs": 0.16, "validHrs": 0.15 }, "sr": { "duration": 937260, "reportedSentences": 103, "validatedSentences": 168, "unvalidatedSentences": -92, "clips": 231, "users": 1, "size": 5167231, "checksum": "6243b57a8ce83b37f2d2d2fc17f66289d37b0eb80d296a117c0ebc143521a3e8", "avgDurationSecs": 3.254, "validDurationSecs": 292.16, "totalHrs": 0.27, "validHrs": 0.08 }, "ss": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -304, "checksum": "205e5a5d055ccd73e332c5d27ebcda25b074496a7019065d1f4d59375d0dcd42", "avgDurationSecs": 7.44, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ssi": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -9205, "checksum": "a2085ac91e1eb686756187de472a1b4305f9b448ed6cc80a2e928017eaf6ae60", "avgDurationSecs": 3.721, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "st": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 1, "clips": 0, "users": 0, "size": 445, "checksum": "0410f551552a71c2d4889273236202907ec723bb99fda4331c0ea44c4a640750", "avgDurationSecs": 3.223, "validDurationSecs": 3.223, "totalHrs": 0, "validHrs": 0 }, "sv-SE": { "duration": 413352, "reportedSentences": 0, "validatedSentences": 25, "unvalidatedSentences": 54, "clips": 99, "users": 2, "size": 2339037, "checksum": "cd66ab8d6cba060cac10a005b5f0369a5ac9371cd5d13cec7c24958e576c0610", "avgDurationSecs": 4.024, "validDurationSecs": 346.69, "totalHrs": 0.12, "validHrs": 0.09 }, "sva": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -18396, "checksum": "f524918b5aaa9974d92b794a59471d9a43966c7c9dd9fbce460434ad523ba998", "avgDurationSecs": 5.928, "validDurationSecs": 0, "totalHrs": 0, 
"validHrs": 0 }, "sw": { "duration": 28604808, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 3831, "users": 16, "size": 167414595, "checksum": "709aa10aadbc920e36e51bb238185dde5400f26e24f212f5d50a637a67d8e4e6", "avgDurationSecs": 5.398, "validDurationSecs": 5544.67, "totalHrs": 7.95, "validHrs": 1.54 }, "szy": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -16513, "checksum": "212dcc3f01dc91273007f706ff98ff6677d25dd9e7914071efb75cd50e612169", "avgDurationSecs": 5.397, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ta": { "duration": 107352, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 16, "users": 3, "size": 660281, "checksum": "f62fbf8c6b44ba63c2ee58da74ce12d08147851b844a00218915b4817750a4af", "avgDurationSecs": 6.197, "validDurationSecs": 85.09, "totalHrs": 0.03, "validHrs": 0.02 }, "tar": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -2901, "checksum": "d126a9e6662bfb16bbe9b26aa09652d0dd2b730f03f693fad37e3621051b6df3", "avgDurationSecs": 4.549, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "tay": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -13928, "checksum": "fc18314c67752531deee56c5fa549687c5588003e6bd2df6bcec3743312d3a2f", "avgDurationSecs": 5.612, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "te": { "duration": 60804, "reportedSentences": 0, "validatedSentences": 6, "unvalidatedSentences": -6, "clips": 11, "users": 2, "size": 225641, "checksum": "18b9c95fd488185388c56952a3ebf3ff449d9586183c767a5d189e6ce5dd3bd0", "avgDurationSecs": 4.192, "validDurationSecs": 44.04, "totalHrs": 0.02, "validHrs": 0.01 }, "tg": { "duration": 511164, "reportedSentences": 0, "validatedSentences": 145, "unvalidatedSentences": -145, 
"clips": 105, "users": 0, "size": 3004033, "checksum": "8cdb1433f285efc9ac413ec879034588bc2fcc8d2835cb57d047cd5c8c9dec3a", "avgDurationSecs": 4.964, "validDurationSecs": 134.73, "totalHrs": 0.14, "validHrs": 0.04 }, "th": { "duration": 179244, "reportedSentences": 0, "validatedSentences": 1, "unvalidatedSentences": 0, "clips": 34, "users": 5, "size": 849287, "checksum": "f3ff5f97625cb56ecdfafe5d0a37bfb70646aca7f50eab1074bcffceeddacb3a", "avgDurationSecs": 4.193, "validDurationSecs": 132.3, "totalHrs": 0.05, "validHrs": 0.03 }, "ti": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 173, "checksum": "115e0cd6ba7886a2eb1d5b9306d9123af5a7165a6587089da1fde95f4ea0a021", "avgDurationSecs": 5.194, "validDurationSecs": 20.77, "totalHrs": 0, "validHrs": 0.01 }, "tig": { "duration": 82440, "reportedSentences": 2, "validatedSentences": 0, "unvalidatedSentences": 2, "clips": 16, "users": 2, "size": 471932, "checksum": "5d68f963754089ec2085a170fbb1d2530a46884259c85043fb283e041d260889", "avgDurationSecs": 6.345, "validDurationSecs": 19.91, "totalHrs": 0.03, "validHrs": 0 }, "tk": { "duration": 1283976, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 216, "users": 14, "size": 6430979, "checksum": "3a6b65006014e9e6e2a8060f0c338c5e60ac38b39c1013522d7d3038ba09e342", "avgDurationSecs": 5.5, "validDurationSecs": 188.48, "totalHrs": 0.35, "validHrs": 0.05 }, "tli": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -6930, "checksum": "60deb24a69131763cccdff92492c2ede732407b944dfb6b32c9c9eabf68cd7b8", "avgDurationSecs": 12.901, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "tn": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -1251, "checksum": "5eab9c278d45f0002d82343e0b2e138ec6c98a1e0ec44d48c1469903be956f51", 
"avgDurationSecs": 4.37, "validDurationSecs": 13.11, "totalHrs": 0, "validHrs": 0 }, "tok": { "duration": 595620, "reportedSentences": 0, "validatedSentences": 4, "unvalidatedSentences": 11, "clips": 105, "users": 0, "size": 3522601, "checksum": "193da8657cab1eacfe7fb5cc78a7392f95c99ec86d026df0551afc866ecad17c", "avgDurationSecs": 4.311, "validDurationSecs": 113.6, "totalHrs": 0.17, "validHrs": 0.04 }, "tr": { "duration": 2304684, "reportedSentences": 5, "validatedSentences": 1, "unvalidatedSentences": 4, "clips": 382, "users": 14, "size": 13637222, "checksum": "e37186670c9d71c6bb4904f349f3d640aa1288ca3a458aa884ab56ce83bf486a", "avgDurationSecs": 3.838, "validDurationSecs": 1499.44, "totalHrs": 0.64, "validHrs": 0.42 }, "trv": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -13154, "checksum": "5620ad2d3d39f3dfbb26bcd3ea9ac5a07c402f1138c5dff7224a6e9ebf770886", "avgDurationSecs": 5.611, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "trw": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -28594, "checksum": "06ea0c07f054817a3141d1b0e0d95da505b85062fde465f202bef6fc80e79eaf", "avgDurationSecs": 5.443, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ts": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 2, "checksum": "3abc4849fb791ef192ec46d21cc07b8cfb15791e2ffa3a0425a174e5193deaa0", "avgDurationSecs": 8.928, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "tt": { "duration": 365472, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 105, "users": 3, "size": 2158529, "checksum": "dfa2a8760b49bb5120b78a9514f1055d6c358d5594c6f1c55ca15e37564438f6", "avgDurationSecs": 3.831, "validDurationSecs": 1435.59, "totalHrs": 0.1, "validHrs": 0.4 }, "tui": { "duration": 195300, "reportedSentences": 
0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 32, "users": 2, "size": 1141123, "checksum": "904887b509115378a44bc51046b1638985aeec56f1fd80cda980e84393bffb94", "avgDurationSecs": 4.929, "validDurationSecs": 210.47, "totalHrs": 0.05, "validHrs": 0.06 }, "tvu": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -8289, "checksum": "2dcd5291c53124cc7d603d5cce483bd9dc8d286831ee284f0b0d25f03148c635", "avgDurationSecs": 7.009, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "tw": { "duration": 246276, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 3, "clips": 54, "users": 6, "size": 1446604, "checksum": "90fe28ee202d593067b5a04a0b25c991f892089f1448f2ae4a707f74e1e54e96", "avgDurationSecs": 4.385, "validDurationSecs": 55.79, "totalHrs": 0.07, "validHrs": 0.02 }, "udl": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -23300, "checksum": "64b6b0594df78415759d30d41bd45cebc484383ffbe453b0e36f0c41f2fc27c0", "avgDurationSecs": 5.912, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ug": { "duration": 31851180, "reportedSentences": 29, "validatedSentences": 141, "unvalidatedSentences": 2702, "clips": 3521, "users": 22, "size": 186611793, "checksum": "7b709a2dc3e624791eebc0f4202e8b59bd5a50b3cd051c1d65e0b697b54f98d6", "avgDurationSecs": 5.967, "validDurationSecs": 27863.5, "totalHrs": 8.85, "validHrs": 7.74 }, "uk": { "duration": 2681640, "reportedSentences": 5, "validatedSentences": 1, "unvalidatedSentences": 14, "clips": 522, "users": 6, "size": 15796076, "checksum": "5fe1c9be7116d14cf6928f326b25f4910875fcc770500e1315137c9ac52931e5", "avgDurationSecs": 4.646, "validDurationSecs": 1383.95, "totalHrs": 0.74, "validHrs": 0.39 }, "ur": { "duration": 74232, "reportedSentences": 0, "validatedSentences": 4, "unvalidatedSentences": -4, "clips": 12, "users": 3, "size": 453460, 
"checksum": "391a79277ca6855ef6da612366584cf47a8dfb5b4eeed1f5902f496413faee64", "avgDurationSecs": 4.299, "validDurationSecs": 100.69, "totalHrs": 0.02, "validHrs": 0.03 }, "ush": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -10861, "checksum": "f41fd2137282ea84e7fc35a6775016b710a2d9a37110128993660817f50dad77", "avgDurationSecs": 6.148, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "uz": { "duration": 744012, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 132, "users": 9, "size": 4378705, "checksum": "71593db33b13a634292eb7e2a3b471373c5ce2682a163994140460e32933d9be", "avgDurationSecs": 4.158, "validDurationSecs": 398.32, "totalHrs": 0.21, "validHrs": 0.11 }, "var": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -19482, "checksum": "ba7c24ada264068bafc39298c431215a1f52d8b01d516b7d8d211bcd190d8409", "avgDurationSecs": 5.307, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ve": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -28, "checksum": "f8a215f053ce4f46dd6e73e56aa7dd2339d5c13770903ed6a0c3a035672937ef", "avgDurationSecs": 5.654, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "vi": { "duration": 2347884, "reportedSentences": 0, "validatedSentences": 1678, "unvalidatedSentences": -233, "clips": 437, "users": 12, "size": 13951734, "checksum": "93520f4e269fda549cd22d716085a76b5096554d43836d70c7798e218ac9b208", "avgDurationSecs": 4, "validDurationSecs": 1244.33, "totalHrs": 0.65, "validHrs": 0.34 }, "vot": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -34, "checksum": "cd4765febc933489d5ffd3c43c4bd0815a0e1f42aa1141824645fd2434bf8eeb", "avgDurationSecs": 2.412, "validDurationSecs": 0, "totalHrs": 
0, "validHrs": 0 }, "wbl": { "duration": 93780, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 1, "clips": 40, "users": 1, "size": 530998, "checksum": "4b3d8ccd1a9477f1e2c037386cdacd855e3186fe48a19242eac174d8bfc5c05b", "avgDurationSecs": 6.753, "validDurationSecs": -139.83, "totalHrs": 0.02, "validHrs": -0.04 }, "wes": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -9136, "checksum": "8955146c6bd3521a103b24e1c4542c937bfdac93dab69c21037d993d0d6bf83e", "avgDurationSecs": 4.118, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "xh": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 207, "checksum": "084813607f5d5a858df95a199a9930074f238efc2891d7f0c28a80dbc91f30b7", "avgDurationSecs": 5.99, "validDurationSecs": 11.98, "totalHrs": 0, "validHrs": 0 }, "xhe": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -6421, "checksum": "801266f1540c987cfa379241f4c76a1da6f558f14ded2becd2f9ea405bdf6565", "avgDurationSecs": 3.212, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "xka": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -8518, "checksum": "9b23e454211b544e89586a5b5ba1bdbf3bf8f9e79593e48e7e63fec6bfbc0442", "avgDurationSecs": 3.229, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "xmf": { "duration": 262656, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 11, "clips": 37, "users": 3, "size": 1525242, "checksum": "95f40af15d55c3cf065fae9257bb2e076a4517e9af57f10e84e9657d6c45df33", "avgDurationSecs": 6.188, "validDurationSecs": 49.6, "totalHrs": 0.07, "validHrs": 0.01 }, "yaq": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 
-17493, "checksum": "579c5858fe2779f7659bf56c6772b8d95bdafe8995ec69e2846fc1f485df2ca1", "avgDurationSecs": 5.311, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "yav": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -11522, "checksum": "7a07ce50e5b8a9d33b5d4780d29f851d56199f22ba69c315281de787d82d2343", "avgDurationSecs": 8.572, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ydg": { "duration": 471132, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 21, "clips": 99, "users": 0, "size": 2761897, "checksum": "733fdddd0a1f594c3606862c30560225c4e33ac33664ce0ebd468b45e4383766", "avgDurationSecs": 3.579, "validDurationSecs": 218.09, "totalHrs": 0.13, "validHrs": 0.06 }, "yi": { "duration": 678456, "reportedSentences": 5, "validatedSentences": 2, "unvalidatedSentences": -1, "clips": 150, "users": 10, "size": 3746649, "checksum": "4674aa3e92e5b8e5620004b5c473d8cd4245aa973a642abf781899e88179b458", "avgDurationSecs": 3.975, "validDurationSecs": 525.33, "totalHrs": 0.19, "validHrs": 0.14 }, "yo": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": 924, "checksum": "6afad07dac589bd86a1794c2d918c6d3da232590bc79c4d19085abbf836cf368", "avgDurationSecs": 6.041, "validDurationSecs": 90.62, "totalHrs": 0, "validHrs": 0.03 }, "yue": { "duration": 167544, "reportedSentences": 0, "validatedSentences": 5, "unvalidatedSentences": -4, "clips": 40, "users": 5, "size": 949697, "checksum": "755100899533b01d42d1ca5d4a607afa72c519fab5fae7b0750de62d82db1893", "avgDurationSecs": 3.967, "validDurationSecs": 696.35, "totalHrs": 0.04, "validHrs": 0.19 }, "zgh": { "duration": 184068, "reportedSentences": 0, "validatedSentences": 1, "unvalidatedSentences": 5, "clips": 46, "users": 5, "size": 1078139, "checksum": "679955e7c53e8c0ce7b890df2f5a7e914bd0abeea91edeb8ca1199ee790aeaca", "avgDurationSecs": 3.486, 
"validDurationSecs": 99.1, "totalHrs": 0.06, "validHrs": 0.03 }, "zh-CN": { "duration": 10224864, "reportedSentences": 71, "validatedSentences": 54, "unvalidatedSentences": -5, "clips": 2413, "users": 66, "size": 59644571, "checksum": "858c5ab1dc55d7a8e9802e5fa896683f17be84ca25edbeb79b9abd25a99b2a91", "avgDurationSecs": 4.54, "validDurationSecs": 3501.71, "totalHrs": 2.84, "validHrs": 0.97 }, "zh-HK": { "duration": 1428372, "reportedSentences": 0, "validatedSentences": 2, "unvalidatedSentences": 4, "clips": 551, "users": 17, "size": 8223233, "checksum": "213c03ce1bd553168a1f964b5be8a5cab3dc626a68fb1a569fa4ee955ed812c0", "avgDurationSecs": 4.15, "validDurationSecs": -513.59, "totalHrs": 0.4, "validHrs": -0.14 }, "zh-TW": { "duration": 1650564, "reportedSentences": 0, "validatedSentences": 3, "unvalidatedSentences": 8, "clips": 503, "users": 14, "size": 9568472, "checksum": "478f55259cf02eed454ef9bba979e879d008c7b98c4aaa4d57c16f8371ff6af3", "avgDurationSecs": 3.364, "validDurationSecs": 321.37, "totalHrs": 0.46, "validHrs": 0.08 }, "zoc": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -13181, "checksum": "54186fba91b69e7c71ba70046387d4d385eea9e7e2ab21eeeac8157684de13e3", "avgDurationSecs": 4.083, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "zu": { "duration": 0, "reportedSentences": 0, "validatedSentences": 0, "unvalidatedSentences": 0, "clips": 0, "users": 0, "size": -76, "checksum": "dff28383d0ba714cb7e49d597f2c21579e5e14c0aa8e06534b57a77a08ed690e", "avgDurationSecs": 5.533, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "zza": { "duration": 80028, "reportedSentences": 17, "validatedSentences": 0, "unvalidatedSentences": 2, "clips": 25, "users": 0, "size": 471702, "checksum": "d4d2407866c14b3fdab77c42fde3557e4f20c45e2a53c0323f7bc5380950263b", "avgDurationSecs": 4.095, "validDurationSecs": 249.76, "totalHrs": 0.02, "validHrs": 0.07 } }, "totalDuration": 10841370240, 
"totalValidDurationSecs": 4626800, "totalHrs": 3011, "totalValidHrs": 1286 } ================================================ FILE: datasets/scripted-speech/cv-corpus-25.0-2026-03-09.json ================================================ { "locales": { "ab": { "buckets": { "dev": 14152, "invalidated": 22429, "other": 330, "test": 14208, "train": 97329, "validated": 135652 }, "duration": 872117532, "reportedSentences": 346, "validatedSentences": 1046833, "unvalidatedSentences": 14, "clips": 158411, "splits": { "accent": { "": 75 }, "variant": {}, "age": { "seventies": 0.0036, "thirties": 0.1051, "": 0.1553, "teens": 0.4836, "twenties": 0.143, "fifties": 0.0355, "sixties": 0.0199, "fourties": 0.0514, "eighties": 0.0025, "nineties": 0.0001 }, "gender": { "male_masculine": 0.0698, "female_feminine": 0.6798, "": 0.2437, "transgender": 0.0001, "do_not_wish_to_say": 0.0066 }, "sentence_domain": { "nature_environment": 1, "automotive_transport": 2, "agriculture_food": 1, "history_law_government": 2 } }, "users": 1300, "size": 5113659167, "checksum": "3a88afb4bbd050f78e1b24d76d8feef91f2c4cd3b52aa689cfcd64a35dbb598a", "avgDurationSecs": 5.505, "validDurationSecs": 746819.902, "totalHrs": 242.25, "validHrs": 207.44 }, "abb": { "buckets": { "dev": 293, "invalidated": 35, "other": 0, "test": 313, "train": 390, "validated": 7887 }, "duration": 40474692, "reportedSentences": 0, "validatedSentences": 996, "unvalidatedSentences": 0, "clips": 7922, "splits": { "accent": {}, "variant": {}, "age": { "": 0.9975, "fourties": 0.0025 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 22, "size": 238216372, "checksum": "fa9e488fbc13a177c84d1354e400538884f37a2ca19e0bbb21de5ea284dfa3b1", "avgDurationSecs": 5.109, "validDurationSecs": 40295.872, "totalHrs": 11.24, "validHrs": 11.19 }, "ady": { "buckets": { "dev": 3381, "invalidated": 2585, "other": 9069, "test": 3388, "train": 4267, "validated": 48689 }, "duration": 314805456, "reportedSentences": 1425, "validatedSentences": 11364, 
"unvalidatedSentences": 2734, "clips": 60343, "splits": { "accent": { "shapsug": 7085, "temirgoy": 10213, "bzhedug": 6350, "hatikuay": 205, "abzakh": 2313, "": 550 }, "variant": { "ady-Cyrl": 40584, "ady-Cyrl-TR": 3624, "ady-Cyrl-JOR": 48, "ady-RU": 5202, "ady-Latn-TR-t-ady-cyrl": 671 }, "age": { "fifties": 0.1319, "thirties": 0.2608, "twenties": 0.3335, "sixties": 0.0339, "fourties": 0.0957, "": 0.068, "teens": 0.0751, "seventies": 0.0012 }, "gender": { "female_feminine": 0.7146, "": 0.2639, "do_not_wish_to_say": 0.0035, "male_masculine": 0.0179 }, "sentence_domain": { "general": 118654, "language_fundamentals": 534, "news_current_affairs": 111, "agriculture_food": 34, "healthcare": 55, "service_retail": 69, "media_entertainment": 101, "nature_environment": 10, "technology_robotics": 54 } }, "users": 163, "size": 1805000755, "checksum": "66f874bef7d426717af5a2338961f56c0451907009d4528ec24170b70f3e37d3", "avgDurationSecs": 5.217, "validDurationSecs": 254007.306, "totalHrs": 87.44, "validHrs": 70.55 }, "af": { "buckets": { "dev": 120, "invalidated": 202, "other": 326, "test": 142, "train": 188, "validated": 450 }, "duration": 5946660, "reportedSentences": 73, "validatedSentences": 4805, "unvalidatedSentences": 16562, "clips": 978, "splits": { "accent": { "": 318 }, "variant": {}, "age": { "fourties": 0.2096, "": 0.32, "twenties": 0.3119, "teens": 0.0051, "fifties": 0.1452, "seventies": 0.0051, "thirties": 0.0031 }, "gender": { "male_masculine": 0.319, "": 0.4581, "female_feminine": 0.2229 }, "sentence_domain": {} }, "users": 66, "size": 36039823, "checksum": "7976ee567d9e26aea4cadecad6f7c3cabf8efd598dea18869621ddb4585d299a", "avgDurationSecs": 6.08, "validDurationSecs": 2736.193, "totalHrs": 1.65, "validHrs": 0.76 }, "ajg": { "buckets": { "dev": 560, "invalidated": 218, "other": 22, "test": 774, "train": 1873, "validated": 19158 }, "duration": 46342044, "reportedSentences": 0, "validatedSentences": 3213, "unvalidatedSentences": 0, "clips": 19398, "splits": { 
"accent": {}, "variant": {}, "age": { "twenties": 0.0002, "": 0.9998 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 23, "size": 264920483, "checksum": "c0f6db24d06708d121a38ebf039fc176249304a68ac666de5b265c67cedb18e4", "avgDurationSecs": 2.389, "validDurationSecs": 45768.681, "totalHrs": 12.87, "validHrs": 12.71 }, "am": { "buckets": { "dev": 248, "invalidated": 30, "other": 552, "test": 269, "train": 553, "validated": 1070 }, "duration": 10481256, "reportedSentences": 3, "validatedSentences": 2208, "unvalidatedSentences": 15, "clips": 1652, "splits": { "accent": { "": 50 }, "variant": {}, "age": { "thirties": 0.6132, "": 0.2324, "twenties": 0.1423, "fourties": 0.0121 }, "gender": { "male_masculine": 0.7373, "": 0.2627 }, "sentence_domain": { "history_law_government": 1 } }, "users": 49, "size": 61719846, "checksum": "4ae6148ece4641e7eaeb0231a7356c734332a4e22a6ed25fa8294661f17ddf91", "avgDurationSecs": 6.345, "validDurationSecs": 6788.707, "totalHrs": 2.91, "validHrs": 1.88 }, "an": { "buckets": { "dev": 2872, "invalidated": 758, "other": 5, "test": 2920, "train": 4190, "validated": 13277 }, "duration": 64215432, "reportedSentences": 8, "validatedSentences": 10454, "unvalidatedSentences": 357, "clips": 14040, "splits": { "accent": { "": 7456 }, "variant": {}, "age": { "twenties": 0.0209, "fourties": 0.3167, "thirties": 0.2033, "": 0.1853, "sixties": 0.1226, "fifties": 0.1125, "seventies": 0.0384, "teens": 0.0004 }, "gender": { "": 0.7093, "male_masculine": 0.1453, "female_feminine": 0.145, "do_not_wish_to_say": 0.0004 }, "sentence_domain": { "general": 13638, "finance": 2, "service_retail": 2 } }, "users": 53, "size": 375485423, "checksum": "96e642705c63c35f75b4fd432d34a9e22d7fc85305a16ec2d4d7384ee94ac2eb", "avgDurationSecs": 4.574, "validDurationSecs": 60725.662, "totalHrs": 17.83, "validHrs": 16.86 }, "ar": { "buckets": { "dev": 10229, "invalidated": 15255, "other": 41400, "test": 10508, "train": 28865, "validated": 79485 }, "duration": 566673318, 
"reportedSentences": 2128, "validatedSentences": 59434, "unvalidatedSentences": 1742117, "clips": 136140, "splits": { "accent": { "": 1783 }, "variant": {}, "age": { "thirties": 0.1094, "": 0.5473, "twenties": 0.2954, "fourties": 0.013, "teens": 0.0334, "fifties": 0.0008, "sixties": 0.0007, "nineties": 0 }, "gender": { "female_feminine": 0.168, "": 0.5497, "male_masculine": 0.2823 }, "sentence_domain": {} }, "users": 1664, "size": 3524109316, "checksum": "f47fc303117170fa8a8d356650f39c54c9731ef79f9a8ae17c2671a724315eeb", "avgDurationSecs": 4.162, "validDurationSecs": 330850.806, "totalHrs": 157.4, "validHrs": 91.9 }, "as": { "buckets": { "dev": 485, "invalidated": 283, "other": 2552, "test": 408, "train": 956, "validated": 1849 }, "duration": 27434153, "reportedSentences": 248, "validatedSentences": 7171, "unvalidatedSentences": 3344, "clips": 4684, "splits": { "accent": { "": 1447 }, "variant": {}, "age": { "twenties": 0.1616, "": 0.2801, "thirties": 0.0169, "teens": 0.0015, "fifties": 0.0164, "fourties": 0.5235 }, "gender": { "male_masculine": 0.1825, "": 0.8168, "female_feminine": 0.0006 }, "sentence_domain": { "general": 13, "technology_robotics": 13 } }, "users": 51, "size": 166888627, "checksum": "19c21426dd862743910774898e0c2a945693f715cd22110b6c0e0c986dd6ccdd", "avgDurationSecs": 5.857, "validDurationSecs": 10829.579, "totalHrs": 7.62, "validHrs": 3 }, "ast": { "buckets": { "dev": 114, "invalidated": 48, "other": 847, "test": 244, "train": 450, "validated": 824 }, "duration": 7558344, "reportedSentences": 0, "validatedSentences": 1645, "unvalidatedSentences": 287, "clips": 1719, "splits": { "accent": { "": 340 }, "variant": {}, "age": { "": 0.2571, "fourties": 0.2723, "fifties": 0.2007, "teens": 0.0553, "thirties": 0.1158, "twenties": 0.0989 }, "gender": { "": 0.2571, "male_masculine": 0.5835, "female_feminine": 0.1419, "transgender": 0.0175 }, "sentence_domain": {} }, "users": 33, "size": 43146073, "checksum": 
"9bb5a42462b1e29f82f12b8b6080d605774fb5fde5be13847b802efc10c051c6", "avgDurationSecs": 4.397, "validDurationSecs": 3623.081, "totalHrs": 2.09, "validHrs": 1 }, "az": { "buckets": { "dev": 93, "invalidated": 54, "other": 521, "test": 126, "train": 215, "validated": 434 }, "duration": 5501016, "reportedSentences": 5, "validatedSentences": 93161, "unvalidatedSentences": 1862, "clips": 1009, "splits": { "accent": { "": 45 }, "variant": {}, "age": { "": 0.1952, "twenties": 0.1744, "fourties": 0.0079, "thirties": 0.6075, "teens": 0.0149 }, "gender": { "": 0.8236, "male_masculine": 0.1665, "female_feminine": 0.0099 }, "sentence_domain": {} }, "users": 45, "size": 38470282, "checksum": "a091621f2509f68507fa902258c10bcefc2b04e3d8b8395e874482eac60364e1", "avgDurationSecs": 5.452, "validDurationSecs": 2366.146, "totalHrs": 1.52, "validHrs": 0.65 }, "ba": { "buckets": { "dev": 14527, "invalidated": 8012, "other": 43, "test": 14572, "train": 119133, "validated": 210463 }, "duration": 967387572, "reportedSentences": 864, "validatedSentences": 153968, "unvalidatedSentences": 5, "clips": 218518, "splits": { "accent": { "": 613 }, "variant": {}, "age": { "thirties": 0.1734, "": 0.3029, "fourties": 0.0597, "fifties": 0.0531, "twenties": 0.1743, "sixties": 0.1939, "seventies": 0.0002, "teens": 0.0425 }, "gender": { "male_masculine": 0.3032, "": 0.3028, "female_feminine": 0.394 }, "sentence_domain": {} }, "users": 930, "size": 5471729194, "checksum": "b6ec4fc2573b9538422994ea420920041e408af92f0e15bd8d2bff5fbb535561", "avgDurationSecs": 4.427, "validDurationSecs": 931727.778, "totalHrs": 268.71, "validHrs": 258.81 }, "bag": { "buckets": { "dev": 261, "invalidated": 43, "other": 35, "test": 324, "train": 421, "validated": 7028 }, "duration": 40049388, "reportedSentences": 1, "validatedSentences": 1006, "unvalidatedSentences": 6, "clips": 7106, "splits": { "accent": {}, "variant": {}, "age": { "": 0.8373, "fourties": 0.1416, "thirties": 0.0211 }, "gender": { "": 0.8373, 
"do_not_wish_to_say": 0.1416, "female_feminine": 0.0211 }, "sentence_domain": {} }, "users": 14, "size": 225080690, "checksum": "66c6d21b8f5bf5f76ca9673ff392f42d60060e4b1a61d248a100a1ae715493a5", "avgDurationSecs": 5.636, "validDurationSecs": 39609.78, "totalHrs": 11.12, "validHrs": 11 }, "bas": { "buckets": { "dev": 1324, "invalidated": 1214, "other": 143, "test": 1551, "train": 2112, "validated": 11139 }, "duration": 48849192, "reportedSentences": 8, "validatedSentences": 5226, "unvalidatedSentences": 105, "clips": 12496, "splits": { "accent": { "": 5 }, "variant": {}, "age": { "": 0.3861, "fourties": 0.6099, "teens": 0.0012, "twenties": 0.0016, "thirties": 0.0012 }, "gender": { "": 0.9944, "female_feminine": 0.0044, "male_masculine": 0.0012 }, "sentence_domain": { "general": 8 } }, "users": 57, "size": 254569391, "checksum": "bbe542459f2eeb732759bd96efce4669084f754563672a0edc0b02828359ee57", "avgDurationSecs": 3.909, "validDurationSecs": 43544.426, "totalHrs": 13.56, "validHrs": 12.09 }, "bax": { "buckets": { "dev": 319, "invalidated": 62, "other": 829, "test": 338, "train": 373, "validated": 7777 }, "duration": 42618204, "reportedSentences": 0, "validatedSentences": 1030, "unvalidatedSentences": 0, "clips": 8668, "splits": { "accent": {}, "variant": {}, "age": { "": 1 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 13, "size": 243212469, "checksum": "863a5809c6b3c862b89cb3daad4d0c82240b8d696713eb8f9e304b6e514c1d06", "avgDurationSecs": 4.917, "validDurationSecs": 38237.399, "totalHrs": 11.83, "validHrs": 10.62 }, "bba": { "buckets": { "dev": 251, "invalidated": 2258, "other": 0, "test": 236, "train": 292, "validated": 6273 }, "duration": 51768792, "reportedSentences": 94, "validatedSentences": 971, "unvalidatedSentences": 0, "clips": 8531, "splits": { "accent": {}, "variant": {}, "age": { "": 1 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 14, "size": 284227558, "checksum": "38c4a7e78f1b2d1a8023e2e5d8922541023315b51e14bca3865e0d39e1e27b4e", 
"avgDurationSecs": 6.068, "validDurationSecs": 38066.538, "totalHrs": 14.38, "validHrs": 10.57 }, "bbj": { "buckets": { "dev": 268, "invalidated": 7, "other": 265, "test": 324, "train": 404, "validated": 7242 }, "duration": 45907812, "reportedSentences": 1, "validatedSentences": 996, "unvalidatedSentences": 1, "clips": 7514, "splits": { "accent": {}, "variant": {}, "age": { "": 0.8462, "twenties": 0.0222, "thirties": 0.1316 }, "gender": { "": 0.8571, "female_feminine": 0.1429 }, "sentence_domain": {} }, "users": 31, "size": 268126660, "checksum": "620cc55d07c11a8f0e5aef492d5ab25917862fdc7a4c66a339d60198dc59d8c5", "avgDurationSecs": 6.11, "validDurationSecs": 44245.991, "totalHrs": 12.75, "validHrs": 12.29 }, "bbl": { "buckets": { "dev": 377, "invalidated": 130, "other": 272, "test": 377, "train": 407, "validated": 4580 }, "duration": 43779024, "reportedSentences": 0, "validatedSentences": 1661, "unvalidatedSentences": 580, "clips": 4982, "splits": { "accent": {}, "variant": {}, "age": { "thirties": 0.1415, "": 0.0554, "sixties": 0.5333, "seventies": 0.231, "fifties": 0.0387 }, "gender": { "female_feminine": 0.6509, "": 0.3491 }, "sentence_domain": { "general": 4670, "language_fundamentals": 20, "automotive_transport": 28, "agriculture_food": 37 } }, "users": 23, "size": 259880872, "checksum": "39b9ded5aee059bd5c6c0d169886184471fd87f7268d6e20173cb1d299768ed2", "avgDurationSecs": 8.787, "validDurationSecs": 40246.473, "totalHrs": 12.16, "validHrs": 11.17 }, "bce": { "buckets": { "dev": 320, "invalidated": 24, "other": 0, "test": 322, "train": 336, "validated": 6117 }, "duration": 36181656, "reportedSentences": 0, "validatedSentences": 986, "unvalidatedSentences": 0, "clips": 6141, "splits": { "accent": {}, "variant": {}, "age": { "": 1 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 14, "size": 146607090, "checksum": "9fb9fb0157190079d5df6e7d72df15939c3c15fee739a12b51af2824de28ae9d", "avgDurationSecs": 5.892, "validDurationSecs": 36040.252, "totalHrs": 
10.05, "validHrs": 10.01 }, "bci": { "buckets": { "dev": 267, "invalidated": 128, "other": 1415, "test": 290, "train": 319, "validated": 5623 }, "duration": 51668280, "reportedSentences": 1, "validatedSentences": 876, "unvalidatedSentences": 0, "clips": 7166, "splits": { "accent": {}, "variant": {}, "age": { "": 0.857, "twenties": 0.1373, "thirties": 0.0056, "fourties": 0.0001 }, "gender": { "": 0.9301, "female_feminine": 0.0699 }, "sentence_domain": {} }, "users": 17, "size": 298992812, "checksum": "3c0f5fcd4a84e4cb4d67f6646effa8bc45840d21eae5cb7a504db1b28cb1a0b9", "avgDurationSecs": 7.21, "validDurationSecs": 40542.944, "totalHrs": 14.35, "validHrs": 11.26 }, "be": { "buckets": { "dev": 15879, "invalidated": 36985, "other": 18732, "test": 15875, "train": 347710, "validated": 1363996 }, "duration": 6804717948, "reportedSentences": 3217, "validatedSentences": 379562, "unvalidatedSentences": 1917, "clips": 1419713, "splits": { "accent": { "": 827 }, "variant": {}, "age": { "": 0.8743, "fourties": 0.0384, "thirties": 0.0436, "twenties": 0.0343, "teens": 0.0075, "fifties": 0.0009, "sixties": 0.0009, "seventies": 0.0001 }, "gender": { "": 0.8707, "male_masculine": 0.0574, "female_feminine": 0.0718, "do_not_wish_to_say": 0 }, "sentence_domain": { "service_retail": 4, "general": 33, "news_current_affairs": 8, "nature_environment": 4, "technology_robotics": 8, "history_law_government": 48 } }, "users": 8604, "size": 38875606311, "checksum": "4539a19634b191011e376da84b5bfd43060989eb514e49e91774084177be8f84", "avgDurationSecs": 4.793, "validDurationSecs": 6537665.051, "totalHrs": 1890.19, "validHrs": 1816.01 }, "beb": { "buckets": { "dev": 326, "invalidated": 727, "other": 2, "test": 318, "train": 361, "validated": 6742 }, "duration": 39950352, "reportedSentences": 0, "validatedSentences": 1005, "unvalidatedSentences": 0, "clips": 7471, "splits": { "accent": {}, "variant": {}, "age": { "": 1 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 33, "size": 230589104, 
"checksum": "e4cd86b224170bef551d0911839aef0d76ed60fc47d55fb95e1487791c35946e", "avgDurationSecs": 5.347, "validDurationSecs": 36052.105, "totalHrs": 11.09, "validHrs": 10.01 }, "bfd": { "buckets": { "dev": 252, "invalidated": 51, "other": 673, "test": 254, "train": 260, "validated": 6454 }, "duration": 40260276, "reportedSentences": 0, "validatedSentences": 766, "unvalidatedSentences": 0, "clips": 7178, "splits": { "accent": {}, "variant": {}, "age": { "": 0.987, "thirties": 0.0121, "twenties": 0.0001, "teens": 0.0007 }, "gender": { "": 0.9872, "female_feminine": 0.0128 }, "sentence_domain": {} }, "users": 36, "size": 230058925, "checksum": "c60864c6b2a16acb4882a8b8b1870cdf026738c2ffb7542f44b8d090f5fd0a83", "avgDurationSecs": 5.609, "validDurationSecs": 36199.474, "totalHrs": 11.18, "validHrs": 10.05 }, "bft": { "buckets": { "dev": 1833, "invalidated": 464, "other": 23, "test": 1806, "train": 2211, "validated": 10060 }, "duration": 62524008, "reportedSentences": 37, "validatedSentences": 7968, "unvalidatedSentences": 12, "clips": 10547, "splits": { "accent": { "": 45 }, "variant": {}, "age": { "": 0.273, "thirties": 0.3511, "twenties": 0.3663, "fourties": 0.0001, "teens": 0.0096 }, "gender": { "": 0.7763, "female_feminine": 0.1258, "do_not_wish_to_say": 0.0978 }, "sentence_domain": {} }, "users": 155, "size": 363878525, "checksum": "3b57141ebadd3065fa0128e83a67a7fa1da82312edee6a370a75b53837515101", "avgDurationSecs": 5.928, "validDurationSecs": 59637.008, "totalHrs": 17.36, "validHrs": 16.56 }, "bg": { "buckets": { "dev": 2964, "invalidated": 775, "other": 1783, "test": 3398, "train": 4980, "validated": 11359 }, "duration": 76196124, "reportedSentences": 194, "validatedSentences": 39572, "unvalidatedSentences": 3, "clips": 13917, "splits": { "accent": { "": 5171 }, "variant": {}, "age": { "fourties": 0.2303, "thirties": 0.2788, "": 0.3705, "twenties": 0.11, "teens": 0.0047, "sixties": 0.0004, "fifties": 0.0054 }, "gender": { "male_masculine": 0.5803, 
"female_feminine": 0.0463, "": 0.3734 }, "sentence_domain": {} }, "users": 150, "size": 450725864, "checksum": "5e79f6982b42dfeece019c8e4522385a3152788bf7103e89ec653743500d59ee", "avgDurationSecs": 5.475, "validDurationSecs": 62190.973, "totalHrs": 21.16, "validHrs": 17.27 }, "bgp": { "buckets": { "dev": 1410, "invalidated": 214, "other": 648, "test": 1854, "train": 3522, "validated": 7645 }, "duration": 46089972, "reportedSentences": 0, "validatedSentences": 6997, "unvalidatedSentences": 0, "clips": 8507, "splits": { "accent": { "": 45 }, "variant": {}, "age": { "twenties": 0.0424, "": 0.8828, "thirties": 0.0748 }, "gender": { "female_feminine": 0.0065, "": 0.9935 }, "sentence_domain": {} }, "users": 28, "size": 272132495, "checksum": "c78dabb8f7354020399210fa92cc0ba92a33a2942b11eb32ddad601b0b4248b7", "avgDurationSecs": 5.418, "validDurationSecs": 41419.753, "totalHrs": 12.8, "validHrs": 11.5 }, "bkh": { "buckets": { "dev": 305, "invalidated": 1866, "other": 0, "test": 304, "train": 377, "validated": 5614 }, "duration": 47871756, "reportedSentences": 2, "validatedSentences": 986, "unvalidatedSentences": 1, "clips": 7480, "splits": { "accent": {}, "variant": {}, "age": { "": 1 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 13, "size": 261694804, "checksum": "e39b4a4b6b90b1b741031f7291f23110f32dad64ac9c80c6844f5ff8f2d7f564", "avgDurationSecs": 6.4, "validDurationSecs": 35929.417, "totalHrs": 13.29, "validHrs": 9.98 }, "bkm": { "buckets": { "dev": 304, "invalidated": 223, "other": 0, "test": 310, "train": 326, "validated": 7573 }, "duration": 42296436, "reportedSentences": 0, "validatedSentences": 940, "unvalidatedSentences": 0, "clips": 7796, "splits": { "accent": {}, "variant": {}, "age": { "": 1 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 27, "size": 248891938, "checksum": "34489d652cd130d2c3faeb4ad535ac7dc94ef420109a914674f33e551710f2a9", "avgDurationSecs": 5.425, "validDurationSecs": 41086.571, "totalHrs": 11.74, "validHrs": 11.41 }, 
"bn": { "buckets": { "dev": 9401, "invalidated": 7937, "other": 999456, "test": 9401, "train": 21614, "validated": 44733 }, "duration": 4598652276, "reportedSentences": 1883, "validatedSentences": 562831, "unvalidatedSentences": 94115, "clips": 1052126, "splits": { "accent": { "": 107276 }, "variant": {}, "age": { "thirties": 0.0308, "twenties": 0.6658, "": 0.2242, "teens": 0.0706, "fourties": 0.0066, "fifties": 0.002 }, "gender": { "male_masculine": 0.541, "": 0.2254, "female_feminine": 0.2336 }, "sentence_domain": {} }, "users": 23007, "size": 26671663045, "checksum": "de9efc688c172ef81c9f47771dd8dd673111ae3c43ac362999d0d517603d055d", "avgDurationSecs": 4.371, "validDurationSecs": 195519.845, "totalHrs": 1277.4, "validHrs": 54.31 }, "bnm": { "buckets": { "dev": 282, "invalidated": 54, "other": 214, "test": 331, "train": 412, "validated": 7675 }, "duration": 56975544, "reportedSentences": 1, "validatedSentences": 1025, "unvalidatedSentences": 0, "clips": 7943, "splits": { "accent": {}, "variant": {}, "age": { "": 0.6437, "fifties": 0.1253, "sixties": 0.0457, "seventies": 0.1251, "fourties": 0.0602 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 21, "size": 336822349, "checksum": "72bfeaf12e2528d3a03c92ecc43c42ac364c8befc4d1441c62de72ed0b96671e", "avgDurationSecs": 7.173, "validDurationSecs": 55053.166, "totalHrs": 15.82, "validHrs": 15.29 }, "bnn": { "buckets": { "dev": 958, "invalidated": 946, "other": 10, "test": 960, "train": 1040, "validated": 7274 }, "duration": 41889204, "reportedSentences": 3, "validatedSentences": 3167, "unvalidatedSentences": 8, "clips": 8230, "splits": { "accent": {}, "variant": { "bnn-vatan": 2241, "bnn-bubukun": 3045, "bnn-bakha": 1894, "bnn-takitudu": 571 }, "age": { "sixties": 0.3403, "": 0.6, "fifties": 0.0597 }, "gender": { "female_feminine": 0.4068, "do_not_wish_to_say": 0.3539, "": 0.1808, "male_masculine": 0.0584 }, "sentence_domain": { "general": 10108 } }, "users": 19, "size": 246986349, "checksum": 
"d469fdf29db55236b5f25d1013999d7c3613acd1317c9a92309f076e32947221", "avgDurationSecs": 5.09, "validDurationSecs": 37023.338, "totalHrs": 11.63, "validHrs": 10.28 }, "br": { "buckets": { "dev": 3489, "invalidated": 2406, "other": 31, "test": 3492, "train": 4428, "validated": 33940 }, "duration": 118430035, "reportedSentences": 303, "validatedSentences": 23286, "unvalidatedSentences": 18484, "clips": 36377, "splits": { "accent": { "leoneg": 2118, "kerneveg": 9438, "gwenedeg": 2651, "tregerieg": 255, "": 99 }, "variant": {}, "age": { "twenties": 0.2842, "": 0.2612, "fifties": 0.0772, "fourties": 0.0593, "thirties": 0.1307, "sixties": 0.1203, "seventies": 0.0594, "teens": 0.0077 }, "gender": { "male_masculine": 0.5828, "": 0.3546, "female_feminine": 0.0625 }, "sentence_domain": { "general": 4496, "history_law_government": 11, "nature_environment": 8, "service_retail": 4, "healthcare": 13, "agriculture_food": 3, "technology_robotics": 4, "media_entertainment": 13, "finance": 4, "automotive_transport": 2 } }, "users": 283, "size": 806464425, "checksum": "a8c510f0062f499a9772ab4fbc31725f85b0166057f749e33369447605f757d2", "avgDurationSecs": 3.256, "validDurationSecs": 110496.066, "totalHrs": 32.89, "validHrs": 30.69 }, "brh": { "buckets": { "dev": 506, "invalidated": 3, "other": 6, "test": 882, "train": 1703, "validated": 5086 }, "duration": 35762760, "reportedSentences": 0, "validatedSentences": 3095, "unvalidatedSentences": 0, "clips": 5095, "splits": { "accent": {}, "variant": {}, "age": { "": 0.8057, "thirties": 0.1943 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 18, "size": 207687899, "checksum": "77445add06f31b8cc368061316c8638f5a4f5641e77cbd3fc58e5dd19a0ee846", "avgDurationSecs": 7.019, "validDurationSecs": 35699.587, "totalHrs": 9.93, "validHrs": 9.91 }, "bri": { "buckets": { "dev": 203, "invalidated": 215, "other": 122, "test": 298, "train": 699, "validated": 8838 }, "duration": 38915640, "reportedSentences": 12, "validatedSentences": 1200, 
"unvalidatedSentences": 0, "clips": 9175, "splits": { "accent": {}, "variant": {}, "age": { "": 0.8646, "thirties": 0.128, "twenties": 0.0074 }, "gender": { "": 0.872, "female_feminine": 0.128 }, "sentence_domain": {} }, "users": 15, "size": 191642019, "checksum": "d6c38764065be3775822e671b05c3c5ecbba4079ddefb57dfbd1d6d5a269b1c0", "avgDurationSecs": 4.241, "validDurationSecs": 37486.259, "totalHrs": 10.8, "validHrs": 10.41 }, "bsh": { "buckets": { "dev": 515, "invalidated": 45, "other": 34, "test": 693, "train": 1420, "validated": 6765 }, "duration": 36023616, "reportedSentences": 2, "validatedSentences": 2646, "unvalidatedSentences": 40, "clips": 6844, "splits": { "accent": {}, "variant": {}, "age": { "fifties": 0.2764, "": 0.155, "twenties": 0.5685 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 14, "size": 212465644, "checksum": "2304a62d54497fd2c7685452ba2da5554f0efca1606af24a204dcfe86a5d2cee", "avgDurationSecs": 5.264, "validDurationSecs": 35607.797, "totalHrs": 10, "validHrs": 9.89 }, "bsk": { "buckets": { "dev": 348, "invalidated": 686, "other": 1, "test": 733, "train": 1187, "validated": 8615 }, "duration": 39648312, "reportedSentences": 2, "validatedSentences": 2447, "unvalidatedSentences": 22, "clips": 9302, "splits": { "accent": { "": 5 }, "variant": {}, "age": { "": 0.3028, "fifties": 0.2401, "fourties": 0.4571 }, "gender": { "": 1 }, "sentence_domain": { "general": 54 } }, "users": 27, "size": 229279161, "checksum": "2ab4aa98b375e7368995a8b2342973e962410a5e0de78fa8946d296c17dfe8b5", "avgDurationSecs": 4.262, "validDurationSecs": 36720.083, "totalHrs": 11.01, "validHrs": 10.2 }, "btv": { "buckets": { "dev": 226, "invalidated": 127, "other": 43, "test": 329, "train": 452, "validated": 9113 }, "duration": 37660860, "reportedSentences": 263, "validatedSentences": 1053, "unvalidatedSentences": 9, "clips": 9283, "splits": { "accent": {}, "variant": {}, "age": { "": 0.4733, "fourties": 0.2103, "twenties": 0.3164 }, "gender": { "": 0.8934, 
"male_masculine": 0.1066 }, "sentence_domain": { "general": 8, "healthcare": 1 } }, "users": 16, "size": 215828535, "checksum": "491b29a0b49df7b67a9f71faa5a89c1961d4f458e67227f1d5971c60fe4f7568", "avgDurationSecs": 4.057, "validDurationSecs": 36971.175, "totalHrs": 10.46, "validHrs": 10.26 }, "bum": { "buckets": { "dev": 269, "invalidated": 177, "other": 29, "test": 285, "train": 347, "validated": 7751 }, "duration": 37052424, "reportedSentences": 0, "validatedSentences": 901, "unvalidatedSentences": 0, "clips": 7957, "splits": { "accent": {}, "variant": {}, "age": { "": 1 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 22, "size": 210579351, "checksum": "0eae56badd31300352fc7da7a7b3bc08e56cdb7c040b450b65581fe496b1fc16", "avgDurationSecs": 4.657, "validDurationSecs": 36093.168, "totalHrs": 10.29, "validHrs": 10.02 }, "byv": { "buckets": { "dev": 327, "invalidated": 0, "other": 0, "test": 327, "train": 336, "validated": 7719 }, "duration": 47684376, "reportedSentences": 0, "validatedSentences": 990, "unvalidatedSentences": 1027, "clips": 7719, "splits": { "accent": {}, "variant": {}, "age": { "": 0.8128, "thirties": 0.1228, "twenties": 0.0644 }, "gender": { "": 0.9356, "female_feminine": 0.0644 }, "sentence_domain": {} }, "users": 96, "size": 277007229, "checksum": "9bcbec245c636337e88dcc268b524c8737e77e32ca30155f685eff182a99752c", "avgDurationSecs": 6.178, "validDurationSecs": 47684.376, "totalHrs": 13.24, "validHrs": 13.24 }, "ca": { "buckets": { "dev": 16416, "invalidated": 143118, "other": 224906, "test": 16416, "train": 1218703, "validated": 2328479 }, "duration": 14006942694, "reportedSentences": 9924, "validatedSentences": 1306364, "unvalidatedSentences": 3204, "clips": 2696503, "splits": { "accent": { "valencian": 111374, "central": 100539, "": 179707, "learner_es": 1317, "northwestern": 92962, "balearic": 3645, "northern": 1677, "learner_other": 5941 }, "variant": { "ca-central": 1407216, "ca-valencia-southern": 14860, "ca-balear": 29103, 
"ca-nwestern": 101555, "ca-northern": 33176, "ca-valencia-northern": 2840, "ca-valencia-central": 973, "ca-valencia-tortosi": 44444, "ca-valencia-alacant": 7311 }, "age": { "thirties": 0.046, "fifties": 0.1925, "fourties": 0.1184, "twenties": 0.0394, "": 0.2657, "sixties": 0.2881, "teens": 0.0085, "seventies": 0.0391, "eighties": 0.0022, "nineties": 0 }, "gender": { "male_masculine": 0.5204, "": 0.2714, "female_feminine": 0.208, "do_not_wish_to_say": 0.0001, "intersex": 0 }, "sentence_domain": { "service_retail": 64, "media_entertainment": 48, "history_law_government": 70, "general": 1936, "nature_environment": 150, "language_fundamentals": 39, "agriculture_food": 72, "healthcare": 41, "automotive_transport": 39, "news_current_affairs": 41, "technology_robotics": 34, "finance": 4 } }, "users": 36911, "size": 84474450726, "checksum": "3226cd27a1a93962af6717c3801f942adb1df588eb03aea96b6aec37b3a40508", "avgDurationSecs": 5.194, "validDurationSecs": 12095247.777, "totalHrs": 3890.81, "validHrs": 3359.79 }, "cjk": { "buckets": { "dev": 377, "invalidated": 1186, "other": 0, "test": 377, "train": 387, "validated": 7333 }, "duration": 50038200, "reportedSentences": 0, "validatedSentences": 1141, "unvalidatedSentences": 1, "clips": 8519, "splits": { "accent": {}, "variant": {}, "age": { "": 0.1087, "thirties": 0.2652, "twenties": 0.1225, "fourties": 0.486, "fifties": 0.0176 }, "gender": { "": 0.6157, "male_masculine": 0.034, "female_feminine": 0.3503 }, "sentence_domain": {} }, "users": 45, "size": 275785495, "checksum": "a355fb55d25a4af2485825d091f6a222b7fa621f2e74940e328f76ed26373b7a", "avgDurationSecs": 5.874, "validDurationSecs": 43071.971, "totalHrs": 13.89, "validHrs": 11.96 }, "ckb": { "buckets": { "dev": 5353, "invalidated": 9314, "other": 40215, "test": 5353, "train": 7893, "validated": 120424 }, "duration": 695919708, "reportedSentences": 2558, "validatedSentences": 19446, "unvalidatedSentences": 15087, "clips": 169953, "splits": { "accent": { "": 38643 }, 
"variant": {}, "age": { "": 0.3199, "thirties": 0.1034, "twenties": 0.4236, "fourties": 0.0206, "teens": 0.0194, "fifties": 0.0113, "seventies": 0.0014, "sixties": 0.1004 }, "gender": { "": 0.3271, "male_masculine": 0.6219, "female_feminine": 0.0508, "do_not_wish_to_say": 0.0001, "non-binary": 0 }, "sentence_domain": { "automotive_transport": 10, "general": 134, "language_fundamentals": 10, "history_law_government": 10, "media_entertainment": 103 } }, "users": 2005, "size": 3859763014, "checksum": "76432036cb19f61072c4d6e72a0ba1f68c9694a42d12c5b3ec21ea25cad01d82", "avgDurationSecs": 4.095, "validDurationSecs": 493109.477, "totalHrs": 193.31, "validHrs": 136.97 }, "cnh": { "buckets": { "dev": 762, "invalidated": 437, "other": 3286, "test": 767, "train": 818, "validated": 2465 }, "duration": 21761232, "reportedSentences": 9, "validatedSentences": 5218, "unvalidatedSentences": 0, "clips": 6188, "splits": { "accent": {}, "variant": {}, "age": { "": 0.4803, "twenties": 0.3993, "fourties": 0.0113, "teens": 0.0149, "thirties": 0.075, "fifties": 0.0192 }, "gender": { "": 0.4809, "male_masculine": 0.3048, "female_feminine": 0.2143 }, "sentence_domain": {} }, "users": 301, "size": 168262909, "checksum": "77e6faad541194db207ee092721f445af245c523183fa3b8da0f712869532a3f", "avgDurationSecs": 3.517, "validDurationSecs": 8668.623, "totalHrs": 6.04, "validHrs": 2.4 }, "cpy": { "buckets": { "dev": 188, "invalidated": 26, "other": 84, "test": 294, "train": 518, "validated": 8162 }, "duration": 36567612, "reportedSentences": 0, "validatedSentences": 1000, "unvalidatedSentences": 15, "clips": 8272, "splits": { "accent": { "": 2087 }, "variant": {}, "age": { "twenties": 0.1206, "fourties": 0.3855, "teens": 0.2421, "thirties": 0.1215, "": 0.01, "sixties": 0.1202 }, "gender": { "": 0.7668, "female_feminine": 0.2332 }, "sentence_domain": {} }, "users": 15, "size": 210669797, "checksum": "90ec3fc32dd489980eb2f4d1b3bf6959968164449e5a7cae2e61120858e2a56b", "avgDurationSecs": 4.421, 
"validDurationSecs": 36081.341, "totalHrs": 10.15, "validHrs": 10.02 }, "cs": { "buckets": { "dev": 9473, "invalidated": 2665, "other": 148758, "test": 9463, "train": 22269, "validated": 65452 }, "duration": 967331019, "reportedSentences": 966, "validatedSentences": 451253, "unvalidatedSentences": 105, "clips": 216875, "splits": { "accent": { "": 24609 }, "variant": {}, "age": { "fourties": 0.0705, "": 0.238, "thirties": 0.467, "teens": 0.0246, "twenties": 0.1861, "fifties": 0.0123, "sixties": 0.0014, "seventies": 0.0001 }, "gender": { "male_masculine": 0.5343, "": 0.2469, "female_feminine": 0.2189 }, "sentence_domain": { "media_entertainment": 4, "general": 15, "technology_robotics": 8, "history_law_government": 1, "automotive_transport": 1 } }, "users": 1134, "size": 5970163541, "checksum": "99366f369ee223d9727ccef4545c2ea3438cb8ff6ebcfcf8e6970db3d28968e9", "avgDurationSecs": 4.46, "validDurationSecs": 291936.599, "totalHrs": 268.7, "validHrs": 81.09 }, "cut": { "buckets": { "dev": 290, "invalidated": 5, "other": 130, "test": 325, "train": 378, "validated": 5481 }, "duration": 37239084, "reportedSentences": 0, "validatedSentences": 994, "unvalidatedSentences": 1, "clips": 5616, "splits": { "accent": {}, "variant": {}, "age": { "": 0.2507, "thirties": 0.4183, "fifties": 0.1599, "fourties": 0.1708, "twenties": 0.0004 }, "gender": { "": 0.3218, "female_feminine": 0.6782 }, "sentence_domain": {} }, "users": 15, "size": 219737376, "checksum": "9b9fcbdfbc6bc044ad4c486f5bf9afd4e0a68dcfd4d13869ed8b4b0d6519a909", "avgDurationSecs": 6.631, "validDurationSecs": 36343.914, "totalHrs": 10.34, "validHrs": 10.09 }, "cux": { "buckets": { "dev": 647, "invalidated": 325, "other": 9, "test": 494, "train": 1107, "validated": 9016 }, "duration": 38339748, "reportedSentences": 1, "validatedSentences": 2251, "unvalidatedSentences": 18, "clips": 9350, "splits": { "accent": {}, "variant": {}, "age": { "teens": 0.4105, "": 0.0011, "thirties": 0.2329, "twenties": 0.3555 }, "gender": { 
"female_feminine": 0.646, "": 0.354 }, "sentence_domain": {} }, "users": 11, "size": 174532161, "checksum": "e6e1b66805f88ccfe995eb464e8500befcf1bf2b423bd2bf806432d3b0e262f9", "avgDurationSecs": 4.101, "validDurationSecs": 36970.178, "totalHrs": 10.64, "validHrs": 10.26 }, "cv": { "buckets": { "dev": 1242, "invalidated": 2215, "other": 741, "test": 1288, "train": 1456, "validated": 17500 }, "duration": 103130544, "reportedSentences": 139, "validatedSentences": 4382, "unvalidatedSentences": 4, "clips": 20456, "splits": { "accent": { "": 51 }, "variant": {}, "age": { "twenties": 0.4997, "": 0.238, "fourties": 0.0614, "thirties": 0.0079, "teens": 0.1825, "fifties": 0.0105 }, "gender": { "male_masculine": 0.5141, "": 0.2199, "female_feminine": 0.2659 }, "sentence_domain": { "technology_robotics": 1 } }, "users": 122, "size": 700495298, "checksum": "28d2fbd488799a6bb7ee6faa6ef001925fb85358096cefe3442b7069c70138dc", "avgDurationSecs": 5.042, "validDurationSecs": 88227.636, "totalHrs": 28.64, "validHrs": 24.5 }, "cy": { "buckets": { "dev": 5413, "invalidated": 4472, "other": 20746, "test": 5415, "train": 8030, "validated": 90884 }, "duration": 570789009, "reportedSentences": 176, "validatedSentences": 118044, "unvalidatedSentences": 15, "clips": 116102, "splits": { "accent": { "": 3651, "united_kingdom": 59782 }, "variant": { "cy-northwes": 1091, "cy-southwes": 726, "cy-midwales": 1288, "cy-northeas": 3463, "cy-southeas": 1526 }, "age": { "fourties": 0.1569, "twenties": 0.1305, "sixties": 0.0682, "fifties": 0.0906, "": 0.4253, "thirties": 0.0933, "seventies": 0.0087, "eighties": 0.0028, "teens": 0.0236 }, "gender": { "male_masculine": 0.3261, "female_feminine": 0.2452, "": 0.4279, "non-binary": 0.0009 }, "sentence_domain": { "general": 1 } }, "users": 1933, "size": 4175137538, "checksum": "3624342e755771f7255e029170915e90a15eef0b42bc6d6ea32eb2c490b390e0", "avgDurationSecs": 4.916, "validDurationSecs": 446810.462, "totalHrs": 158.55, "validHrs": 124.11 }, "da": { 
"buckets": { "dev": 2743, "invalidated": 442, "other": 2382, "test": 2756, "train": 3604, "validated": 11493 }, "duration": 58295412, "reportedSentences": 688, "validatedSentences": 20986, "unvalidatedSentences": 173, "clips": 14317, "splits": { "accent": { "": 4495 }, "variant": {}, "age": { "": 0.3914, "thirties": 0.2237, "twenties": 0.1552, "sixties": 0.008, "fourties": 0.1723, "fifties": 0.0425, "teens": 0.0068 }, "gender": { "": 0.4711, "female_feminine": 0.059, "male_masculine": 0.4699 }, "sentence_domain": { "general": 27, "media_entertainment": 17, "agriculture_food": 1, "nature_environment": 2, "healthcare": 1, "service_retail": 1, "technology_robotics": 3, "automotive_transport": 10, "language_fundamentals": 3 } }, "users": 300, "size": 339266849, "checksum": "d51753540bbbb215aa449b162bf7f07abd0c8bb0d36cbae9262119df4c4169e3", "avgDurationSecs": 4.072, "validDurationSecs": 46796.757, "totalHrs": 16.19, "validHrs": 12.99 }, "dag": { "buckets": { "dev": 1465, "invalidated": 256, "other": 9234, "test": 1464, "train": 1610, "validated": 13596 }, "duration": 99136260, "reportedSentences": 23, "validatedSentences": 5603, "unvalidatedSentences": 15134, "clips": 23086, "splits": { "accent": { "": 136 }, "variant": {}, "age": { "": 0.3333, "thirties": 0.2531, "twenties": 0.4032, "fourties": 0.0104 }, "gender": { "": 0.7664, "male_masculine": 0.0185, "female_feminine": 0.197, "do_not_wish_to_say": 0.018 }, "sentence_domain": { "general": 23, "agriculture_food": 4, "automotive_transport": 3, "finance": 13, "nature_environment": 6, "news_current_affairs": 4, "language_fundamentals": 3 } }, "users": 69, "size": 552191590, "checksum": "f0b067e954219ffe56d339d063bbd0ceb74674231f6240e4a334cec6f84831c5", "avgDurationSecs": 4.294, "validDurationSecs": 58384.155, "totalHrs": 27.53, "validHrs": 16.21 }, "dar": { "buckets": { "dev": 1442, "invalidated": 178, "other": 2625, "test": 1387, "train": 1999, "validated": 9181 }, "duration": 68345172, "reportedSentences": 52, 
"validatedSentences": 5960, "unvalidatedSentences": 993, "clips": 11984, "splits": { "accent": { "": 911 }, "variant": {}, "age": { "twenties": 0.2427, "teens": 0.0132, "": 0.2137, "fifties": 0.3588, "fourties": 0.1657, "thirties": 0.0058 }, "gender": { "": 0.619, "female_feminine": 0.381 }, "sentence_domain": { "general": 9271, "media_entertainment": 4064, "news_current_affairs": 3245, "language_fundamentals": 211, "healthcare": 126, "history_law_government": 1597, "agriculture_food": 466 } }, "users": 39, "size": 399955795, "checksum": "230dc639d6dfbc369891082cd0270bbd20f725949d4c73b2759925cd67c11173", "avgDurationSecs": 5.703, "validDurationSecs": 52359.565, "totalHrs": 18.98, "validHrs": 14.54 }, "dav": { "buckets": { "dev": 1276, "invalidated": 663, "other": 40703, "test": 1004, "train": 2098, "validated": 8264 }, "duration": 201431016, "reportedSentences": 11, "validatedSentences": 11773, "unvalidatedSentences": 20119, "clips": 49630, "splits": { "accent": {}, "variant": {}, "age": { "fifties": 0.0102, "": 0.0402, "thirties": 0.0994, "fourties": 0.0056, "twenties": 0.6127, "sixties": 0.2319 }, "gender": { "female_feminine": 0.2675, "": 0.7325 }, "sentence_domain": { "healthcare": 967, "general": 45323, "agriculture_food": 499, "nature_environment": 873, "service_retail": 1218, "history_law_government": 334, "finance": 304, "news_current_affairs": 86, "technology_robotics": 75, "language_fundamentals": 150, "automotive_transport": 100, "media_entertainment": 68 } }, "users": 24, "size": 1089120745, "checksum": "99e932f4fddcb701014007547a747fc3eb0c0b852165e95f6c62c18e4869cd28", "avgDurationSecs": 4.059, "validDurationSecs": 33540.72, "totalHrs": 55.95, "validHrs": 9.31 }, "de": { "buckets": { "dev": 16206, "invalidated": 58471, "other": 7782, "test": 16206, "train": 621144, "validated": 949698 }, "duration": 5348699189, "reportedSentences": 11409, "validatedSentences": 2058021, "unvalidatedSentences": 1623, "clips": 1015951, "splits": { "accent": { "russia": 
1080, "germany": 627340, "austria": 26654, "liechtenstein": 92, "switzerland": 18414, "france": 1624, "poland": 170, "united_kingdom": 278, "hungary": 278, "netherlands": 99, "": 145303, "namibia": 1, "slovakia": 76, "united_states": 615, "italy": 1128, "czechia": 62, "brazil": 60, "canada": 129, "turkey": 55, "slovenia": 10, "luxembourg": 65, "belgium": 967, "denmark": 1, "greece": 140, "lithuania": 5, "finland": 35, "bulgaria": 5, "romania": 5, "paraguay": 5 }, "variant": {}, "age": { "twenties": 0.174, "fourties": 0.1707, "": 0.3302, "thirties": 0.1607, "teens": 0.0289, "sixties": 0.0235, "fifties": 0.1097, "seventies": 0.0022, "eighties": 0.0001, "nineties": 0.0001 }, "gender": { "male_masculine": 0.5788, "": 0.3392, "female_feminine": 0.0818, "transgender": 0.0001, "non-binary": 0.0001, "do_not_wish_to_say": 0 }, "sentence_domain": { "healthcare": 16, "nature_environment": 12, "general": 143, "history_law_government": 14, "language_fundamentals": 3, "service_retail": 12, "news_current_affairs": 4, "automotive_transport": 6, "technology_robotics": 9, "media_entertainment": 1 } }, "users": 20466, "size": 37251336418, "checksum": "92f8f29075f7dc35d78f31ba1d4f94f3493545b23007c7be73be9fd28807f99b", "avgDurationSecs": 5.265, "validDurationSecs": 4999895.588, "totalHrs": 1485.74, "validHrs": 1388.85 }, "dmk": { "buckets": { "dev": 0, "invalidated": 110, "other": 21, "test": 938, "train": 3200, "validated": 10648 }, "duration": 37129032, "reportedSentences": 732, "validatedSentences": 4139, "unvalidatedSentences": 194, "clips": 10779, "splits": { "accent": {}, "variant": {}, "age": { "": 0.0357, "fourties": 0.8069, "thirties": 0.1473, "twenties": 0.01 }, "gender": { "": 0.9549, "female_feminine": 0.0451 }, "sentence_domain": { "general": 2, "agriculture_food": 2, "healthcare": 2 } }, "users": 10, "size": 216969975, "checksum": "5203e03d5848906083df76cd6c954df29d662e2a47ed1718f98b5002ac171c9a", "avgDurationSecs": 3.445, "validDurationSecs": 36677.793, "totalHrs": 
10.31, "validHrs": 10.18 }, "dml": { "buckets": { "dev": 1101, "invalidated": 68, "other": 299, "test": 626, "train": 3190, "validated": 6115 }, "duration": 39013560, "reportedSentences": 0, "validatedSentences": 5670, "unvalidatedSentences": 676, "clips": 6482, "splits": { "accent": {}, "variant": {}, "age": { "": 0.0791, "fourties": 0.7913, "twenties": 0.0947, "fifties": 0.0349 }, "gender": { "": 1 }, "sentence_domain": { "general": 7, "nature_environment": 4, "history_law_government": 3 } }, "users": 5, "size": 230760481, "checksum": "eab336e5b6210b475e31c07bc8eef5f060d046131527d6e1d0c0ca876257bb5f", "avgDurationSecs": 6.019, "validDurationSecs": 36804.677, "totalHrs": 10.83, "validHrs": 10.22 }, "dru": { "buckets": { "dev": 933, "invalidated": 116, "other": 1, "test": 957, "train": 1074, "validated": 6576 }, "duration": 37990332, "reportedSentences": 2, "validatedSentences": 3867, "unvalidatedSentences": 477, "clips": 6693, "splits": { "accent": {}, "variant": { "dru-oponoho": 2096, "dru-teldreka": 256, "dru-labuane": 471, "dru-taromak": 1761, "dru-veday": 2108 }, "age": { "fourties": 0.123, "": 0.877 }, "gender": { "female_feminine": 0.123, "": 0.0024, "do_not_wish_to_say": 0.8746 }, "sentence_domain": { "general": 10031, "language_fundamentals": 1759 } }, "users": 20, "size": 224022937, "checksum": "bbe355a58f22bef1f8ce1ee60548076dba8346122876503091557eb6d7e017de", "avgDurationSecs": 5.676, "validDurationSecs": 37326.225, "totalHrs": 10.55, "validHrs": 10.36 }, "dsb": { "buckets": { "dev": 0, "invalidated": 0, "other": 5, "test": 0, "train": 0, "validated": 0 }, "duration": 40284, "reportedSentences": 0, "validatedSentences": 796, "unvalidatedSentences": 0, "clips": 5, "splits": { "accent": {}, "variant": {}, "age": { "": 1 }, "gender": { "": 1 }, "sentence_domain": { "media_entertainment": 2, "general": 2, "history_law_government": 2 } }, "users": 1, "size": 296827, "checksum": "32b028732135503d78cb87f2421b30d0d90341032bb599fdc6ea3da0eaef92db", 
"avgDurationSecs": 8.057, "validDurationSecs": 0, "totalHrs": 0.01, "validHrs": 0 }, "dua": { "buckets": { "dev": 326, "invalidated": 1073, "other": 4, "test": 311, "train": 366, "validated": 7018 }, "duration": 52001208, "reportedSentences": 1, "validatedSentences": 1003, "unvalidatedSentences": 1, "clips": 8095, "splits": { "accent": {}, "variant": {}, "age": { "": 0.7746, "thirties": 0.0967, "twenties": 0.1134, "fourties": 0.0153 }, "gender": { "": 0.8866, "female_feminine": 0.1134 }, "sentence_domain": {} }, "users": 13, "size": 292401751, "checksum": "d261c0e2bb7ad3b5185050e551943bd240f6070e1ef4409b7486c1093a1abc42", "avgDurationSecs": 6.424, "validDurationSecs": 45082.703, "totalHrs": 14.44, "validHrs": 12.52 }, "dv": { "buckets": { "dev": 2243, "invalidated": 1662, "other": 15081, "test": 2228, "train": 2648, "validated": 26583 }, "duration": 221817879, "reportedSentences": 59, "validatedSentences": 7174, "unvalidatedSentences": 9, "clips": 43326, "splits": { "accent": { "": 165 }, "variant": {}, "age": { "": 0.2375, "twenties": 0.1135, "thirties": 0.3889, "fourties": 0.2267, "teens": 0.0079, "nineties": 0.0001, "fifties": 0.0253, "sixties": 0.0001 }, "gender": { "": 0.2408, "male_masculine": 0.2353, "female_feminine": 0.522, "do_not_wish_to_say": 0.0019 }, "sentence_domain": { "media_entertainment": 1 } }, "users": 392, "size": 1415477900, "checksum": "5a284602ed26250450b761b1ebfebf590697a220a75a3226746e2d610601b5d5", "avgDurationSecs": 5.12, "validDurationSecs": 136098.063, "totalHrs": 61.61, "validHrs": 37.8 }, "dyu": { "buckets": { "dev": 50, "invalidated": 15, "other": 69, "test": 63, "train": 90, "validated": 211 }, "duration": 1861236, "reportedSentences": 0, "validatedSentences": 5028, "unvalidatedSentences": 41, "clips": 295, "splits": { "accent": {}, "variant": {}, "age": { "": 0.3051, "fourties": 0.0203, "twenties": 0.6576, "teens": 0.0169 }, "gender": { "": 0.3051, "male_masculine": 0.3932, "female_feminine": 0.3017 }, "sentence_domain": {} }, 
"users": 35, "size": 10797797, "checksum": "c8eb8616660253cdffaac95bb0802919d63ff4246308b999f59ba667553899f6", "avgDurationSecs": 6.309, "validDurationSecs": 1331.257, "totalHrs": 0.51, "validHrs": 0.36 }, "ebr": { "buckets": { "dev": 0, "invalidated": 1, "other": 474, "test": 228, "train": 644, "validated": 1544 }, "duration": 8412732, "reportedSentences": 5, "validatedSentences": 872, "unvalidatedSentences": 3, "clips": 2019, "splits": { "accent": {}, "variant": {}, "age": { "thirties": 0.3239, "fourties": 0.4314, "": 0.0243, "twenties": 0.2204 }, "gender": { "female_feminine": 0.3239, "": 0.6761 }, "sentence_domain": {} }, "users": 7, "size": 46940887, "checksum": "b77e193a2736aa7e53a838e3991de9ee32b9ec3fc27149a5d9c60cbb891e2c45", "avgDurationSecs": 4.167, "validDurationSecs": 6433.511, "totalHrs": 2.33, "validHrs": 1.78 }, "eko": { "buckets": { "dev": 353, "invalidated": 64, "other": 12, "test": 414, "train": 519, "validated": 3950 }, "duration": 30360960, "reportedSentences": 0, "validatedSentences": 1288, "unvalidatedSentences": 16, "clips": 4026, "splits": { "accent": {}, "variant": {}, "age": { "": 0.0738, "thirties": 0.5571, "twenties": 0.3182, "fourties": 0.0484, "teens": 0.0025 }, "gender": { "": 0.469, "female_feminine": 0.531 }, "sentence_domain": { "general": 2257, "language_fundamentals": 24 } }, "users": 7, "size": 176504080, "checksum": "2b93e69837ca6751336284784ef2ba09bc0f14fb8ea22eaf67c76609535d843f", "avgDurationSecs": 7.541, "validDurationSecs": 29787.827, "totalHrs": 8.43, "validHrs": 8.27 }, "el": { "buckets": { "dev": 1733, "invalidated": 893, "other": 11039, "test": 1735, "train": 1945, "validated": 17300 }, "duration": 121362154, "reportedSentences": 72, "validatedSentences": 8323, "unvalidatedSentences": 3395, "clips": 29232, "splits": { "accent": { "": 1395 }, "variant": {}, "age": { "thirties": 0.3821, "fourties": 0.1651, "": 0.3027, "twenties": 0.1134, "fifties": 0.0261, "teens": 0.0093, "sixties": 0.0014 }, "gender": { 
"male_masculine": 0.6394, "": 0.3283, "female_feminine": 0.0323 }, "sentence_domain": {} }, "users": 452, "size": 805024855, "checksum": "83d74dca0446da81c27b7805edba4765865dd0719c1b56fb5623b9a372e51708", "avgDurationSecs": 4.152, "validDurationSecs": 71824.209, "totalHrs": 33.71, "validHrs": 19.95 }, "en": { "buckets": { "dev": 16403, "invalidated": 310527, "other": 382810, "test": 16403, "train": 1147812, "validated": 1881067 }, "duration": 13556740351, "reportedSentences": 9638, "validatedSentences": 1681626, "unvalidatedSentences": 38862, "clips": 2574404, "splits": { "accent": { "canada": 98459, "england": 203338, "us": 572552, "indian": 152535, "australia": 69328, "malaysia": 4235, "newzealand": 20763, "african": 60057, "ireland": 11204, "philippines": 7492, "singapore": 4712, "scotland": 68091, "hongkong": 7004, "bermuda": 1161, "": 199478, "southatlandtic": 332, "wales": 3032 }, "variant": {}, "age": { "": 0.3566, "twenties": 0.2486, "sixties": 0.0443, "thirties": 0.1388, "teens": 0.0587, "seventies": 0.0068, "fourties": 0.0933, "fifties": 0.0519, "eighties": 0.0009, "nineties": 0.0001 }, "gender": { "": 0.3887, "male_masculine": 0.4327, "female_feminine": 0.1775, "transgender": 0.0001, "non-binary": 0.0002, "do_not_wish_to_say": 0.0008, "intersex": 0.0001 }, "sentence_domain": { "technology_robotics": 101, "general": 660, "media_entertainment": 117, "service_retail": 31, "nature_environment": 61, "news_current_affairs": 13, "healthcare": 26, "history_law_government": 123, "agriculture_food": 166, "language_fundamentals": 11, "finance": 44, "automotive_transport": 8 } }, "users": 99724, "size": 94318514032, "checksum": "5fc6abeb1832af175a72bd13b79bf90866d3eb0096463daea612fdae6d7cc692", "avgDurationSecs": 5.266, "validDurationSecs": 9905646.861, "totalHrs": 3765.76, "validHrs": 2751.56 }, "eo": { "buckets": { "dev": 14912, "invalidated": 127553, "other": 169134, "test": 14920, "train": 144895, "validated": 853567 }, "duration": 6991238635, 
"reportedSentences": 2411, "validatedSentences": 181254, "unvalidatedSentences": 21774, "clips": 1150254, "splits": { "accent": { "internacia": 673583, "": 26083 }, "variant": {}, "age": { "twenties": 0.5531, "thirties": 0.1212, "": 0.1955, "fourties": 0.0447, "fifties": 0.023, "seventies": 0.0021, "teens": 0.0458, "sixties": 0.0019, "eighties": 0.0128 }, "gender": { "male_masculine": 0.6873, "": 0.2005, "female_feminine": 0.1122 }, "sentence_domain": { "general": 5 } }, "users": 1890, "size": 41880993802, "checksum": "1fdcc230fdb5ac6c624b3a6896976df3bd81a4c198043df673841ec7e3b92d9d", "avgDurationSecs": 6.078, "validDurationSecs": 5187976.384, "totalHrs": 1942.01, "validHrs": 1441.1 }, "es": { "buckets": { "dev": 15902, "invalidated": 95120, "other": 1145592, "test": 15902, "train": 358330, "validated": 437718 }, "duration": 8190571722, "reportedSentences": 2644, "validatedSentences": 1082350, "unvalidatedSentences": 4883, "clips": 1678430, "splits": { "accent": { "nortepeninsular": 66633, "rioplatense": 23500, "centrosurpeninsular": 30573, "americacentral": 12689, "surpeninsular": 177373, "canario": 16193, "andino": 38723, "mexicano": 879470, "chileno": 12569, "caribe": 21830, "filipinas": 606, "": 6569 }, "variant": {}, "age": { "thirties": 0.0928, "": 0.1277, "fifties": 0.0422, "twenties": 0.5262, "teens": 0.0785, "fourties": 0.0277, "sixties": 0.1042, "eighties": 0.0001, "seventies": 0.0005, "nineties": 0.0001 }, "gender": { "male_masculine": 0.555, "": 0.1324, "female_feminine": 0.3126, "do_not_wish_to_say": 0 }, "sentence_domain": { "technology_robotics": 22, "healthcare": 4, "general": 46, "media_entertainment": 8, "news_current_affairs": 19, "automotive_transport": 4, "nature_environment": 12, "finance": 6, "language_fundamentals": 8, "service_retail": 3, "history_law_government": 37, "agriculture_food": 1 } }, "users": 26896, "size": 51784143268, "checksum": "633352ca8f5a65b22ed8391a6c9ea7e18988c1fe2e6ade031dfa3925b89c07c7", "avgDurationSecs": 4.88, 
"validDurationSecs": 2136020.372, "totalHrs": 2275.15, "validHrs": 593.33 }, "esu": { "buckets": { "dev": 0, "invalidated": 0, "other": 5, "test": 501, "train": 6125, "validated": 7219 }, "duration": 27453780, "reportedSentences": 17, "validatedSentences": 6929, "unvalidatedSentences": 5, "clips": 7224, "splits": { "accent": {}, "variant": {}, "age": { "thirties": 0.9993, "teens": 0.0007 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 4, "size": 142933775, "checksum": "1aa45ce99e45d2172a5c1ef0347d486a43546788dd5d0b5d5b18938245b9ef1e", "avgDurationSecs": 3.8, "validDurationSecs": 27434.778, "totalHrs": 7.62, "validHrs": 7.62 }, "et": { "buckets": { "dev": 2890, "invalidated": 7961, "other": 78, "test": 2890, "train": 3499, "validated": 27701 }, "duration": 240056138, "reportedSentences": 567, "validatedSentences": 11090, "unvalidatedSentences": 1876, "clips": 35740, "splits": { "accent": { "": 1366 }, "variant": {}, "age": { "": 0.1763, "thirties": 0.0794, "twenties": 0.7041, "fourties": 0.033, "fifties": 0.0055, "seventies": 0.0007, "teens": 0.0011 }, "gender": { "": 0.2401, "male_masculine": 0.4932, "female_feminine": 0.2647, "do_not_wish_to_say": 0.002 }, "sentence_domain": { "general": 620, "agriculture_food": 61, "finance": 42, "nature_environment": 111, "automotive_transport": 46, "history_law_government": 40, "technology_robotics": 46, "healthcare": 36, "service_retail": 14, "language_fundamentals": 17, "media_entertainment": 46, "news_current_affairs": 23 } }, "users": 1067, "size": 1603835772, "checksum": "d1f4a97ba736f7bc8cec6084b43d587af392a6c0b6a5a9680ad5afb159dad65a", "avgDurationSecs": 6.717, "validDurationSecs": 186060.299, "totalHrs": 66.68, "validHrs": 51.68 }, "eto": { "buckets": { "dev": 298, "invalidated": 308, "other": 200, "test": 311, "train": 357, "validated": 10555 }, "duration": 35675028, "reportedSentences": 0, "validatedSentences": 966, "unvalidatedSentences": 0, "clips": 11063, "splits": { "accent": {}, "variant": {}, "age": { 
"": 0.9882, "thirties": 0.0118 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 26, "size": 202113285, "checksum": "fa476adddbd8d01ae910f6f71872cda0c5517ba4f16f6c641e9baf6645fc3286", "avgDurationSecs": 3.225, "validDurationSecs": 34036.873, "totalHrs": 9.9, "validHrs": 9.45 }, "eu": { "buckets": { "dev": 14808, "invalidated": 50590, "other": 101397, "test": 14808, "train": 134472, "validated": 312746 }, "duration": 2527325355, "reportedSentences": 1257, "validatedSentences": 1018163, "unvalidatedSentences": 12815, "clips": 464733, "splits": { "accent": { "mendebalekoa": 134808, "erdialdekoa_nafarra": 105190, "nafarlapurtarra_zuberoatarra": 837, "": 9495 }, "variant": {}, "age": { "fourties": 0.1625, "thirties": 0.055, "fifties": 0.1058, "twenties": 0.1692, "": 0.4483, "teens": 0.016, "sixties": 0.0392, "seventies": 0.0036, "eighties": 0, "nineties": 0.0005 }, "gender": { "male_masculine": 0.2491, "female_feminine": 0.285, "": 0.4658, "intersex": 0, "do_not_wish_to_say": 0.0001 }, "sentence_domain": { "general": 21543, "healthcare": 22, "automotive_transport": 6, "service_retail": 7, "nature_environment": 6, "technology_robotics": 23, "media_entertainment": 6, "language_fundamentals": 2, "finance": 4, "agriculture_food": 7 } }, "users": 11048, "size": 15552486520, "checksum": "4cea1dc7b49e5f14d0ba1e24bb46177fcf90e9fe565569dea02ab055324929c2", "avgDurationSecs": 5.438, "validDurationSecs": 1700784.957, "totalHrs": 702.03, "validHrs": 472.44 }, "ewo": { "buckets": { "dev": 288, "invalidated": 318, "other": 102, "test": 288, "train": 307, "validated": 7571 }, "duration": 51720480, "reportedSentences": 0, "validatedSentences": 883, "unvalidatedSentences": 0, "clips": 7991, "splits": { "accent": {}, "variant": {}, "age": { "": 0.9433, "sixties": 0.0468, "fourties": 0.0006, "twenties": 0.0085, "thirties": 0.0008 }, "gender": { "": 0.9909, "female_feminine": 0.0091 }, "sentence_domain": {} }, "users": 31, "size": 295260851, "checksum": 
"6286bf0075fd841a8328cba31d52812b5f76bc90e0dafdd6d8b2da893b66907c", "avgDurationSecs": 6.472, "validDurationSecs": 49002.097, "totalHrs": 14.36, "validHrs": 13.61 }, "fa": { "buckets": { "dev": 10716, "invalidated": 15438, "other": 34907, "test": 10716, "train": 30105, "validated": 340608 }, "duration": 1541056616, "reportedSentences": 2807, "validatedSentences": 57812, "unvalidatedSentences": 283439, "clips": 390953, "splits": { "accent": { "": 89022 }, "variant": {}, "age": { "": 0.2482, "twenties": 0.329, "thirties": 0.353, "fifties": 0.0147, "fourties": 0.0248, "teens": 0.0297, "sixties": 0.0005, "seventies": 0.0001 }, "gender": { "": 0.2329, "male_masculine": 0.7028, "female_feminine": 0.0643 }, "sentence_domain": { "general": 28, "automotive_transport": 4, "nature_environment": 2 } }, "users": 4655, "size": 11166598367, "checksum": "8da423768ba09cd4c57076343e074f131ea93c417ea4e10805bd9024ba495ce6", "avgDurationSecs": 3.942, "validDurationSecs": 1342606.942, "totalHrs": 428.07, "validHrs": 372.94 }, "fan": { "buckets": { "dev": 327, "invalidated": 81, "other": 289, "test": 328, "train": 349, "validated": 7708 }, "duration": 35533116, "reportedSentences": 0, "validatedSentences": 1005, "unvalidatedSentences": 1, "clips": 8078, "splits": { "accent": {}, "variant": {}, "age": { "": 0.8892, "fourties": 0.0972, "teens": 0.0136 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 43, "size": 208521821, "checksum": "1ebc99c4722ad73fbf9f6a50e2ef1a732c6847311489b23a6687ffb2a6dac83d", "avgDurationSecs": 4.399, "validDurationSecs": 33905.578, "totalHrs": 9.87, "validHrs": 9.41 }, "fi": { "buckets": { "dev": 1817, "invalidated": 423, "other": 5231, "test": 1813, "train": 2068, "validated": 12251 }, "duration": 82883955, "reportedSentences": 78, "validatedSentences": 5865, "unvalidatedSentences": 49163, "clips": 17905, "splits": { "accent": { "": 3505 }, "variant": {}, "age": { "thirties": 0.1718, "": 0.3263, "twenties": 0.1599, "fourties": 0.2997, "teens": 0.0052, 
"fifties": 0.0309, "seventies": 0.0006, "sixties": 0.0055 }, "gender": { "male_masculine": 0.3572, "": 0.3429, "female_feminine": 0.2999 }, "sentence_domain": {} }, "users": 251, "size": 496618382, "checksum": "d942128c5ed902a2ff59c86806fe1c0507d705584addb59ff7ff29ac2c6383c0", "avgDurationSecs": 4.629, "validDurationSecs": 56711.049, "totalHrs": 23.02, "validHrs": 15.75 }, "fmp": { "buckets": { "dev": 326, "invalidated": 288, "other": 105, "test": 327, "train": 351, "validated": 5206 }, "duration": 44035524, "reportedSentences": 5, "validatedSentences": 1004, "unvalidatedSentences": 0, "clips": 5599, "splits": { "accent": { "": 5 }, "variant": {}, "age": { "": 0.7394, "twenties": 0.1979, "fifties": 0.0627 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 32, "size": 253042991, "checksum": "63d8dd9b65d9f04e75f129c02d4ec49465c4e4291aebefe7b9810818b8a90bb9", "avgDurationSecs": 7.865, "validDurationSecs": 40944.622, "totalHrs": 12.23, "validHrs": 11.37 }, "fr": { "buckets": { "dev": 16201, "invalidated": 68142, "other": 13229, "test": 16201, "train": 613431, "validated": 783357 }, "duration": 4354965250, "reportedSentences": 7562, "validatedSentences": 1649097, "unvalidatedSentences": 43765, "clips": 864728, "splits": { "accent": { "germany": 552, "belgium": 11381, "switzerland": 5916, "guadeloupe": 175, "reunion": 1307, "monaco": 111, "tunisia": 121, "canada": 12869, "mayotte": 12, "algeria": 1070, "netherlands": 101, "senegal": 197, "martinique": 100, "united_states": 1610, "fr-metro-east": 209, "cote_d_ivoire": 201, "st_pierre_et_miquelon": 15, "": 7511, "french_guiana": 188, "benin": 1073, "italy": 171, "united_kingdom": 502, "madagascar": 283, "cameroon": 163, "new_caledonia": 159, "fr-metro-south": 229, "haiti": 498, "morocco": 211, "romania": 150, "congo_kinshasa": 45, "fr-metro-west": 166, "luxembourg": 20, "mali": 39, "mauritius": 10, "fr-metro-north": 535 }, "variant": { "fr-metro": 542845, "fr-europe": 26543, "fr-droum": 1936, "fr-nafrica": 1342, 
"fr-namerica": 14424, "fr-safrica": 2066, "fr-samerica": 90 }, "age": { "twenties": 0.1711, "thirties": 0.1451, "": 0.3747, "teens": 0.0283, "fourties": 0.1422, "fifties": 0.0942, "sixties": 0.0335, "seventies": 0.0107, "eighties": 0.0002, "nineties": 0 }, "gender": { "male_masculine": 0.5683, "": 0.3242, "female_feminine": 0.1068, "do_not_wish_to_say": 0.0003, "non-binary": 0.0003, "transgender": 0 }, "sentence_domain": { "technology_robotics": 18, "general": 70, "media_entertainment": 17, "nature_environment": 8, "news_current_affairs": 2, "history_law_government": 19, "healthcare": 5, "language_fundamentals": 7, "automotive_transport": 1, "finance": 1 } }, "users": 21003, "size": 30483243491, "checksum": "e7ea3e1413a59207bcac9275efdafe8307caea1a67d56082198c1515892ec2ac", "avgDurationSecs": 5.036, "validDurationSecs": 3945162.541, "totalHrs": 1209.71, "validHrs": 1095.87 }, "fub": { "buckets": { "dev": 277, "invalidated": 29, "other": 209, "test": 345, "train": 445, "validated": 7686 }, "duration": 48081816, "reportedSentences": 0, "validatedSentences": 1067, "unvalidatedSentences": 0, "clips": 7924, "splits": { "accent": {}, "variant": {}, "age": { "": 0.9739, "twenties": 0.017, "teens": 0.0091 }, "gender": { "": 0.9903, "female_feminine": 0.0097 }, "sentence_domain": {} }, "users": 19, "size": 262798335, "checksum": "ae44594dc694a4df2567b7629e88b52709ca836853fa5e20f392d7bd46579d29", "avgDurationSecs": 6.068, "validDurationSecs": 46637.663, "totalHrs": 13.35, "validHrs": 12.95 }, "fue": { "buckets": { "dev": 0, "invalidated": 11, "other": 108, "test": 88, "train": 900, "validated": 7146 }, "duration": 38734056, "reportedSentences": 0, "validatedSentences": 988, "unvalidatedSentences": 0, "clips": 7265, "splits": { "accent": {}, "variant": {}, "age": { "": 0.8599, "thirties": 0.1353, "teens": 0.0048 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 10, "size": 221111740, "checksum": "54d64c6cf1e7d1db9fd5efbcfe80647a3db3d71204c049c28776708ff2e69a49", 
"avgDurationSecs": 5.332, "validDurationSecs": 38099.596, "totalHrs": 10.75, "validHrs": 10.58 }, "fy-NL": { "buckets": { "dev": 3181, "invalidated": 4058, "other": 103270, "test": 3181, "train": 3934, "validated": 52264 }, "duration": 774403839, "reportedSentences": 551, "validatedSentences": 10902, "unvalidatedSentences": 15, "clips": 159592, "splits": { "accent": { "": 35053 }, "variant": {}, "age": { "": 0.5608, "fifties": 0.1022, "thirties": 0.1023, "twenties": 0.0362, "fourties": 0.0669, "sixties": 0.123, "seventies": 0.0041, "teens": 0.0033, "eighties": 0.0012 }, "gender": { "": 0.5672, "male_masculine": 0.129, "female_feminine": 0.3038 }, "sentence_domain": { "general": 468, "agriculture_food": 1229, "nature_environment": 3, "media_entertainment": 1446, "language_fundamentals": 292, "news_current_affairs": 224, "finance": 208, "technology_robotics": 25, "history_law_government": 1157 } }, "users": 2096, "size": 4658098080, "checksum": "5a189aa27cd4d082748d08d992a4d4597a611c64f97900a7ca12e3d79f2b3b6a", "avgDurationSecs": 4.852, "validDurationSecs": 253605.709, "totalHrs": 215.11, "validHrs": 70.44 }, "ga-IE": { "buckets": { "dev": 873, "invalidated": 1302, "other": 674, "test": 874, "train": 925, "validated": 13379 }, "duration": 58823811, "reportedSentences": 42, "validatedSentences": 4883, "unvalidatedSentences": 1471, "clips": 15355, "splits": { "accent": { "": 1074, "cainteoir_líofa_ní_ó_dhúchas": 1307, "cainteoir_dúchais_gaeltacht": 901, "foghlaimeoir_fásta_ó_bhéarla": 10, "cainteoir_dúchais_lasmuigh_den_ghaeltacht": 10, "foghlaimeoir_i_ngaelscoil": 10 }, "variant": { "ga-IE-chonnact": 3978, "ga-IE-uladh": 1567, "ga-IE-mumhan": 3157, "ga-IE-caighd": 53 }, "age": { "twenties": 0.3475, "": 0.3032, "thirties": 0.2345, "fourties": 0.055, "sixties": 0.0036, "teens": 0.0119, "fifties": 0.0444 }, "gender": { "male_masculine": 0.3369, "": 0.3293, "female_feminine": 0.3314, "do_not_wish_to_say": 0.0024 }, "sentence_domain": { "general": 298, 
"nature_environment": 4, "history_law_government": 10, "automotive_transport": 6, "healthcare": 2, "service_retail": 7, "news_current_affairs": 6, "technology_robotics": 3, "media_entertainment": 8, "finance": 1, "agriculture_food": 5 } }, "users": 278, "size": 379570015, "checksum": "d49efb897f66ea61d53cdc5d7f7da3766584cc5a8b0126d9d0100f594d34eb15", "avgDurationSecs": 3.831, "validDurationSecs": 51253.909, "totalHrs": 16.33, "validHrs": 14.23 }, "gej": { "buckets": { "dev": 946, "invalidated": 333, "other": 23, "test": 949, "train": 1293, "validated": 16417 }, "duration": 40986360, "reportedSentences": 0, "validatedSentences": 3188, "unvalidatedSentences": 0, "clips": 16773, "splits": { "accent": {}, "variant": {}, "age": { "": 1 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 20, "size": 226789835, "checksum": "e6f3bfb7c360e28a66a7dbec22c4fe5932d053899d3a7f79345cc1358dca4a70", "avgDurationSecs": 2.444, "validDurationSecs": 40116.441, "totalHrs": 11.38, "validHrs": 11.14 }, "ggg": { "buckets": { "dev": 0, "invalidated": 3, "other": 4393, "test": 318, "train": 1687, "validated": 6684 }, "duration": 44190504, "reportedSentences": 0, "validatedSentences": 2005, "unvalidatedSentences": 0, "clips": 11080, "splits": { "accent": {}, "variant": {}, "age": { "": 0.304, "twenties": 0.1871, "thirties": 0.3263, "fourties": 0.1818, "teens": 0.0009 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 24, "size": 259647125, "checksum": "98c4afea95d5df6c1bc950d732dcca045f9aecfe706057af4ddc4b597c4d7f12", "avgDurationSecs": 3.988, "validDurationSecs": 26657.882, "totalHrs": 12.27, "validHrs": 7.4 }, "gid": { "buckets": { "dev": 286, "invalidated": 203, "other": 0, "test": 302, "train": 329, "validated": 5111 }, "duration": 37200456, "reportedSentences": 0, "validatedSentences": 917, "unvalidatedSentences": 0, "clips": 5314, "splits": { "accent": {}, "variant": {}, "age": { "": 0.9853, "thirties": 0.0147 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 16, 
"size": 212623164, "checksum": "0552aabb037cef242baa1f989403b5d996b4aa5bb96ef21c7252110d56b9305d", "avgDurationSecs": 7, "validDurationSecs": 35779.362, "totalHrs": 10.33, "validHrs": 9.93 }, "gig": { "buckets": { "dev": 0, "invalidated": 6, "other": 23, "test": 135, "train": 1869, "validated": 12130 }, "duration": 36376452, "reportedSentences": 0, "validatedSentences": 2005, "unvalidatedSentences": 0, "clips": 12159, "splits": { "accent": {}, "variant": {}, "age": { "twenties": 0.6703, "": 0.0012, "fifties": 0.1637, "thirties": 0.1648 }, "gender": { "": 0.9975, "female_feminine": 0.0025 }, "sentence_domain": {} }, "users": 20, "size": 212742257, "checksum": "426d1fe98057f338e22421c1570026d905d5b37c92ce7b6cef62a9b5fa47ff71", "avgDurationSecs": 2.992, "validDurationSecs": 36289.692, "totalHrs": 10.1, "validHrs": 10.08 }, "giz": { "buckets": { "dev": 308, "invalidated": 149, "other": 5, "test": 277, "train": 406, "validated": 6494 }, "duration": 37043316, "reportedSentences": 14, "validatedSentences": 991, "unvalidatedSentences": 0, "clips": 6648, "splits": { "accent": {}, "variant": {}, "age": { "": 0.8908, "thirties": 0.0734, "twenties": 0.0343, "fifties": 0.0015 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 32, "size": 216362560, "checksum": "01806e658f7518e905fd304c551916d31f240561e99706ba670f34202023d6f5", "avgDurationSecs": 5.572, "validDurationSecs": 36185.213, "totalHrs": 10.28, "validHrs": 10.05 }, "gjk": { "buckets": { "dev": 570, "invalidated": 166, "other": 10, "test": 623, "train": 808, "validated": 8548 }, "duration": 39500640, "reportedSentences": 0, "validatedSentences": 2004, "unvalidatedSentences": 0, "clips": 8724, "splits": { "accent": {}, "variant": {}, "age": { "thirties": 0.5888, "fourties": 0.2309, "": 0.0248, "twenties": 0.1206, "teens": 0.035 }, "gender": { "": 0.7353, "female_feminine": 0.2647 }, "sentence_domain": {} }, "users": 23, "size": 229669015, "checksum": 
"407fec72be1bc3e244ff8dd0983333629fe73f6d863b28dfd467c9d082cd2cff", "avgDurationSecs": 4.528, "validDurationSecs": 38703.745, "totalHrs": 10.97, "validHrs": 10.75 }, "gju": { "buckets": { "dev": 0, "invalidated": 154, "other": 506, "test": 623, "train": 3207, "validated": 11081 }, "duration": 38441268, "reportedSentences": 0, "validatedSentences": 3852, "unvalidatedSentences": 2, "clips": 11741, "splits": { "accent": {}, "variant": {}, "age": { "": 0.2177, "fourties": 0.7708, "thirties": 0.0114, "twenties": 0.0001 }, "gender": { "": 1 }, "sentence_domain": { "general": 3 } }, "users": 7, "size": 220417487, "checksum": "13d47fc4817b0a3fafa8465b8750209be390f4a69ec00ee87fdbce58be87dd5a", "avgDurationSecs": 3.274, "validDurationSecs": 36280.359, "totalHrs": 10.67, "validHrs": 10.07 }, "gl": { "buckets": { "dev": 15255, "invalidated": 34523, "other": 31646, "test": 15266, "train": 184932, "validated": 222281 }, "duration": 1438900740, "reportedSentences": 946, "validatedSentences": 1002019, "unvalidatedSentences": 3817, "clips": 288450, "splits": { "accent": { "normativo": 64917, "neofalante": 13038, "oriental": 11902, "atlantico": 24693, "central": 29691, "": 7954 }, "variant": {}, "age": { "": 0.514, "thirties": 0.0651, "fifties": 0.1096, "twenties": 0.1775, "fourties": 0.088, "teens": 0.0366, "sixties": 0.0087, "seventies": 0.0005 }, "gender": { "": 0.636, "male_masculine": 0.1208, "female_feminine": 0.2373, "non-binary": 0.0001, "do_not_wish_to_say": 0.0058 }, "sentence_domain": { "general": 269, "finance": 40, "nature_environment": 15, "agriculture_food": 34, "media_entertainment": 41, "history_law_government": 52, "healthcare": 3, "news_current_affairs": 14, "service_retail": 17, "automotive_transport": 4, "technology_robotics": 5, "language_fundamentals": 19 } }, "users": 6592, "size": 8389130210, "checksum": "6591e5d43e6d4761e3545fa60969949494efd269619a627129e46f41796a9adf", "avgDurationSecs": 4.988, "validDurationSecs": 1108824.044, "totalHrs": 399.69, 
"validHrs": 308 }, "gn": { "buckets": { "dev": 662, "invalidated": 220, "other": 21595, "test": 1106, "train": 1724, "validated": 4039 }, "duration": 118860552, "reportedSentences": 52, "validatedSentences": 5894, "unvalidatedSentences": 301, "clips": 25854, "splits": { "accent": { "": 272 }, "variant": {}, "age": { "": 0.1859, "twenties": 0.5388, "thirties": 0.1677, "sixties": 0.0017, "fourties": 0.0779, "teens": 0.0228, "fifties": 0.0048, "seventies": 0.0002 }, "gender": { "": 0.2031, "male_masculine": 0.3317, "female_feminine": 0.4646, "non-binary": 0.0006 }, "sentence_domain": {} }, "users": 313, "size": 683920826, "checksum": "7c74e2b17525579089fd51ef9f3a8d8902ebf01d062ae00bf8557e8fe76ebee4", "avgDurationSecs": 4.597, "validDurationSecs": 18568.801, "totalHrs": 33.01, "validHrs": 5.15 }, "gsw": { "buckets": { "dev": 28, "invalidated": 53, "other": 590, "test": 28, "train": 31, "validated": 400 }, "duration": 5961888, "reportedSentences": 2, "validatedSentences": 87, "unvalidatedSentences": 5, "clips": 1043, "splits": { "accent": { "": 115 }, "variant": { "gsw-FR-sudnalem": 62, "gsw-FR-nordalem": 434, "gsw-FR-rhinfran": 38 }, "age": { "seventies": 0.07, "twenties": 0.0058, "": 0.116, "fifties": 0.2013, "sixties": 0.5177, "eighties": 0.0048, "fourties": 0.0431, "thirties": 0.0412 }, "gender": { "": 0.5292, "female_feminine": 0.4708 }, "sentence_domain": { "media_entertainment": 855, "general": 181, "service_retail": 19, "news_current_affairs": 10, "healthcare": 13, "history_law_government": 2 } }, "users": 37, "size": 33852437, "checksum": "df415a73dfc3af13c07a58b9ed2616b66a58149a24667ef292eee0cadc6260af", "avgDurationSecs": 5.716, "validDurationSecs": 2286.438, "totalHrs": 1.65, "validHrs": 0.63 }, "gv": { "buckets": { "dev": 1166, "invalidated": 151, "other": 137, "test": 525, "train": 2420, "validated": 6302 }, "duration": 38105820, "reportedSentences": 265, "validatedSentences": 4219, "unvalidatedSentences": 21, "clips": 6590, "splits": { "accent": { "": 
2476 }, "variant": {}, "age": { "fifties": 0.0196, "thirties": 0.0439, "twenties": 0.5439, "fourties": 0.3927 }, "gender": { "": 0.4599, "male_masculine": 0.5047, "female_feminine": 0.0354 }, "sentence_domain": {} }, "users": 9, "size": 224910221, "checksum": "9ce1ca8a1469cd7377846c49c627a46af29b60435b66a866af23196ab9290bce", "avgDurationSecs": 5.782, "validDurationSecs": 36440.497, "totalHrs": 10.58, "validHrs": 10.12 }, "gwc": { "buckets": { "dev": 776, "invalidated": 669, "other": 1698, "test": 918, "train": 3268, "validated": 7421 }, "duration": 55315548, "reportedSentences": 1, "validatedSentences": 5574, "unvalidatedSentences": 0, "clips": 9788, "splits": { "accent": { "": 4659 }, "variant": {}, "age": { "": 0.0309, "twenties": 0.3448, "fourties": 0.4803, "teens": 0.0048, "fifties": 0.1393 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 22, "size": 325099435, "checksum": "f76663c0e344691385a1c21b43bf64c87de4fb10c0dc2d803f07106c8454732a", "avgDurationSecs": 5.651, "validDurationSecs": 41938.77, "totalHrs": 15.36, "validHrs": 11.64 }, "gwt": { "buckets": { "dev": 0, "invalidated": 57, "other": 0, "test": 220, "train": 3136, "validated": 7681 }, "duration": 44273880, "reportedSentences": 0, "validatedSentences": 3719, "unvalidatedSentences": 2087, "clips": 7738, "splits": { "accent": {}, "variant": {}, "age": { "thirties": 0.4048, "": 0.161, "sixties": 0.4342 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 5, "size": 261982481, "checksum": "a488312c40ef56f41de1553bc052091b7fb0b32a0c3da74595c6e0c8f2239820", "avgDurationSecs": 5.722, "validDurationSecs": 43947.748, "totalHrs": 12.29, "validHrs": 12.2 }, "gya": { "buckets": { "dev": 346, "invalidated": 480, "other": 0, "test": 346, "train": 362, "validated": 6914 }, "duration": 37549980, "reportedSentences": 1, "validatedSentences": 1054, "unvalidatedSentences": 0, "clips": 7394, "splits": { "accent": {}, "variant": {}, "age": { "": 0.8618, "thirties": 0.1381, "fourties": 0.0001 }, "gender": { 
"": 0.9999, "female_feminine": 0.0001 }, "sentence_domain": {} }, "users": 31, "size": 216406435, "checksum": "f3b918353a57d121c252db4602312704c7a92d72f9b85c43ef6fcf54a03f66b3", "avgDurationSecs": 5.078, "validDurationSecs": 35112.329, "totalHrs": 10.43, "validHrs": 9.75 }, "ha": { "buckets": { "dev": 634, "invalidated": 175, "other": 6776, "test": 783, "train": 1901, "validated": 3496 }, "duration": 45455724, "reportedSentences": 33, "validatedSentences": 5450, "unvalidatedSentences": 240, "clips": 10447, "splits": { "accent": { "": 82 }, "variant": {}, "age": { "": 0.1915, "thirties": 0.6539, "twenties": 0.1297, "fourties": 0.0011, "fifties": 0.0238 }, "gender": { "": 0.1877, "male_masculine": 0.5726, "female_feminine": 0.2396, "do_not_wish_to_say": 0.0001 }, "sentence_domain": {} }, "users": 60, "size": 266821775, "checksum": "538c91eeb1dad37dc79d9a8949a765e8a977479280dd526f30fb7701a3e26b9f", "avgDurationSecs": 4.351, "validDurationSecs": 15211.373, "totalHrs": 12.62, "validHrs": 4.22 }, "haz": { "buckets": { "dev": 86, "invalidated": 1, "other": 5, "test": 446, "train": 823, "validated": 8319 }, "duration": 37924992, "reportedSentences": 29, "validatedSentences": 1361, "unvalidatedSentences": 0, "clips": 8325, "splits": { "accent": {}, "variant": {}, "age": { "": 0.352, "twenties": 0.3229, "thirties": 0.3252 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 9, "size": 183716195, "checksum": "6897ca0735ad1a8db00ea7c40c56eb204443c0db45a29aafad917b740e551e06", "avgDurationSecs": 4.556, "validDurationSecs": 37897.659, "totalHrs": 10.53, "validHrs": 10.52 }, "he": { "buckets": { "dev": 401, "invalidated": 354, "other": 1002, "test": 930, "train": 1919, "validated": 4168 }, "duration": 25141464, "reportedSentences": 43, "validatedSentences": 4188, "unvalidatedSentences": 6836, "clips": 5524, "splits": { "accent": { "": 5370 }, "variant": {}, "age": { "eighties": 0.0353, "": 0.0684, "thirties": 0.1582, "fourties": 0.0235, "twenties": 0.7118, "teens": 0.0027 }, 
"gender": { "male_masculine": 0.9053, "": 0.0757, "female_feminine": 0.019 }, "sentence_domain": { "general": 1 } }, "users": 58, "size": 146428151, "checksum": "f6cdafd5db35d1672e37563f5038144e08ec24101d3aa08687300d482ff4bd3d", "avgDurationSecs": 4.551, "validDurationSecs": 18969.881, "totalHrs": 6.98, "validHrs": 5.26 }, "hem": { "buckets": { "dev": 337, "invalidated": 467, "other": 0, "test": 337, "train": 345, "validated": 5974 }, "duration": 38642724, "reportedSentences": 2, "validatedSentences": 1020, "unvalidatedSentences": 2, "clips": 6441, "splits": { "accent": {}, "variant": {}, "age": { "fourties": 0.1293, "thirties": 0.272, "": 0.1245, "twenties": 0.4741 }, "gender": { "": 0.7699, "female_feminine": 0.1981, "male_masculine": 0.032 }, "sentence_domain": {} }, "users": 41, "size": 209284179, "checksum": "286b5905a0214ecb4df88c42c7ebc375a709949d2a205ca06f74aa7bc68dd3c1", "avgDurationSecs": 5.999, "validDurationSecs": 35840.962, "totalHrs": 10.73, "validHrs": 9.95 }, "hi": { "buckets": { "dev": 2809, "invalidated": 946, "other": 6952, "test": 3326, "train": 4894, "validated": 11108 }, "duration": 95884368, "reportedSentences": 146, "validatedSentences": 32204, "unvalidatedSentences": 9965, "clips": 19006, "splits": { "accent": { "": 2952 }, "variant": {}, "age": { "twenties": 0.2747, "fourties": 0.0907, "": 0.2864, "thirties": 0.32, "teens": 0.0097, "fifties": 0.0134, "sixties": 0.0052 }, "gender": { "male_masculine": 0.5006, "female_feminine": 0.03, "": 0.4693 }, "sentence_domain": {} }, "users": 474, "size": 570821124, "checksum": "60cf88cca5326001810f90c1914fa3cc6c1a34f3533084fff2bb7bed84e5e025", "avgDurationSecs": 5.045, "validDurationSecs": 56039.333, "totalHrs": 26.63, "validHrs": 15.56 }, "hno": { "buckets": { "dev": 765, "invalidated": 282, "other": 5, "test": 708, "train": 863, "validated": 9120 }, "duration": 37708020, "reportedSentences": 21, "validatedSentences": 2349, "unvalidatedSentences": 0, "clips": 9407, "splits": { "accent": { "": 2144 }, 
"variant": {}, "age": { "twenties": 0.6638, "": 0.0927, "teens": 0.0156, "thirties": 0.2279 }, "gender": { "": 0.516, "female_feminine": 0.484 }, "sentence_domain": { "general": 2574, "agriculture_food": 5816, "finance": 3245, "media_entertainment": 5816 } }, "users": 36, "size": 216295492, "checksum": "3283df47236e23be1f59f67ee08a79c963d8c7d35c2bec27b2aa9d8f4911efca", "avgDurationSecs": 4.009, "validDurationSecs": 36557.579, "totalHrs": 10.47, "validHrs": 10.15 }, "hr": { "buckets": { "dev": 0, "invalidated": 0, "other": 25, "test": 0, "train": 0, "validated": 0 }, "duration": 107784, "reportedSentences": 2, "validatedSentences": 1733, "unvalidatedSentences": 959, "clips": 25, "splits": { "accent": {}, "variant": {}, "age": { "twenties": 1 }, "gender": { "do_not_wish_to_say": 1 }, "sentence_domain": { "news_current_affairs": 1, "automotive_transport": 1, "general": 1 } }, "users": 1, "size": 793494, "checksum": "64f2a7f1a9b7ec600d4f18c59139741dc12ed1adf1d3f534bc6bee2ff53c2263", "avgDurationSecs": 4.311, "validDurationSecs": 0, "totalHrs": 0.02, "validHrs": 0 }, "hsb": { "buckets": { "dev": 350, "invalidated": 252, "other": 2389, "test": 475, "train": 812, "validated": 1637 }, "duration": 32098284, "reportedSentences": 128, "validatedSentences": 7182, "unvalidatedSentences": 540, "clips": 4278, "splits": { "accent": { "": 10 }, "variant": {}, "age": { "fourties": 0.2228, "": 0.6052, "thirties": 0.0409, "sixties": 0.0094, "seventies": 0.0129, "twenties": 0.047, "fifties": 0.0295, "teens": 0.0325 }, "gender": { "male_masculine": 0.3495, "": 0.6251, "female_feminine": 0.0255 }, "sentence_domain": {} }, "users": 33, "size": 210098112, "checksum": "dfd6220faa04e813f60240c7bc21796008de61dee9e4cffd44d94696405de3b8", "avgDurationSecs": 7.503, "validDurationSecs": 12282.583, "totalHrs": 8.91, "validHrs": 3.41 }, "ht": { "buckets": { "dev": 8, "invalidated": 1, "other": 0, "test": 5, "train": 12, "validated": 29 }, "duration": 166968, "reportedSentences": 0, 
"validatedSentences": 76, "unvalidatedSentences": 1629, "clips": 30, "splits": { "accent": {}, "variant": {}, "age": { "": 0.6667, "thirties": 0.3333 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 3, "size": 1094970, "checksum": "57d2a374a7a75b44996f1ac120a0506eed1fb7ffba74198a8a498eb68464addb", "avgDurationSecs": 5.566, "validDurationSecs": 161.402, "totalHrs": 0.04, "validHrs": 0.04 }, "hu": { "buckets": { "dev": 12944, "invalidated": 4089, "other": 27286, "test": 12989, "train": 59395, "validated": 86128 }, "duration": 651056797, "reportedSentences": 554, "validatedSentences": 326531, "unvalidatedSentences": 232, "clips": 117503, "splits": { "accent": { "": 20354 }, "variant": {}, "age": { "teens": 0.0204, "": 0.2781, "thirties": 0.1255, "twenties": 0.1114, "fifties": 0.3229, "fourties": 0.113, "sixties": 0.0144, "seventies": 0.0139, "eighties": 0.0003 }, "gender": { "male_masculine": 0.2345, "": 0.3008, "female_feminine": 0.4648 }, "sentence_domain": { "technology_robotics": 9, "general": 68, "history_law_government": 20, "media_entertainment": 2 } }, "users": 1677, "size": 3842633394, "checksum": "5ab924dad958253ca51f4934da9dcd94f349b37db6ab147c41ea13442cef6d40", "avgDurationSecs": 5.541, "validDurationSecs": 477215.218, "totalHrs": 180.84, "validHrs": 132.55 }, "hux": { "buckets": { "dev": 25, "invalidated": 1322, "other": 1, "test": 326, "train": 637, "validated": 9285 }, "duration": 41113224, "reportedSentences": 0, "validatedSentences": 995, "unvalidatedSentences": 0, "clips": 10608, "splits": { "accent": {}, "variant": {}, "age": { "twenties": 0.1862, "fifties": 0.3714, "thirties": 0.0934, "fourties": 0.2557, "": 0.0009, "sixties": 0.0924 }, "gender": { "female_feminine": 0.6265, "": 0.3735 }, "sentence_domain": {} }, "users": 13, "size": 241551584, "checksum": "2c5563546c7d929eb63f00a6bf3405b06603290d0d2c3af8e5aed647da7dff7f", "avgDurationSecs": 3.876, "validDurationSecs": 35985.698, "totalHrs": 11.42, "validHrs": 9.99 }, "hy-AM": { "buckets": 
{ "dev": 5968, "invalidated": 1399, "other": 14046, "test": 6245, "train": 10501, "validated": 22953 }, "duration": 206689032, "reportedSentences": 184, "validatedSentences": 237419, "unvalidatedSentences": 1377, "clips": 38398, "splits": { "accent": { "": 8815 }, "variant": {}, "age": { "": 0.1613, "thirties": 0.074, "twenties": 0.6142, "fifties": 0.0069, "teens": 0.1064, "fourties": 0.0372 }, "gender": { "": 0.2164, "male_masculine": 0.2406, "female_feminine": 0.543 }, "sentence_domain": { "finance": 1, "general": 1 } }, "users": 586, "size": 1222233164, "checksum": "127f0254953d8cf6c083f169a04c4d7176df8cd383ba9e84b96c7cdb0c4f67fe", "avgDurationSecs": 5.383, "validDurationSecs": 123551.574, "totalHrs": 57.41, "validHrs": 34.31 }, "ia": { "buckets": { "dev": 1897, "invalidated": 361, "other": 2188, "test": 1896, "train": 4882, "validated": 12221 }, "duration": 62070024, "reportedSentences": 274, "validatedSentences": 9069, "unvalidatedSentences": 144, "clips": 14770, "splits": { "accent": { "": 15 }, "variant": {}, "age": { "seventies": 0.2239, "fourties": 0.2936, "": 0.3833, "twenties": 0.0475, "thirties": 0.0194, "teens": 0.0024, "fifties": 0.0284, "sixties": 0.0014 }, "gender": { "male_masculine": 0.6116, "": 0.3833, "female_feminine": 0.005, "do_not_wish_to_say": 0.0001 }, "sentence_domain": {} }, "users": 72, "size": 421849127, "checksum": "37769614c273586290dda0fd8617b22de8820f7dcd1d579e8234695cbe6e96c6", "avgDurationSecs": 4.202, "validDurationSecs": 51358.007, "totalHrs": 17.24, "validHrs": 14.26 }, "ibb": { "buckets": { "dev": 319, "invalidated": 25, "other": 4143, "test": 307, "train": 372, "validated": 3106 }, "duration": 65480940, "reportedSentences": 0, "validatedSentences": 1031, "unvalidatedSentences": 0, "clips": 7274, "splits": { "accent": { "": 1 }, "variant": {}, "age": { "": 0.9887, "thirties": 0.0113 }, "gender": { "": 0.972, "female_feminine": 0.028 }, "sentence_domain": {} }, "users": 19, "size": 379704530, "checksum": 
"09dd62ef9c4cb21d51d91455336b5250d480686a6717911b54fec3139c23b79d", "avgDurationSecs": 9.002, "validDurationSecs": 27960.379, "totalHrs": 18.18, "validHrs": 7.76 }, "id": { "buckets": { "dev": 3469, "invalidated": 2798, "other": 27322, "test": 3694, "train": 4975, "validated": 30331 }, "duration": 241276908, "reportedSentences": 363, "validatedSentences": 13321, "unvalidatedSentences": 11861, "clips": 60451, "splits": { "accent": { "": 4644 }, "variant": {}, "age": { "": 0.2533, "twenties": 0.413, "thirties": 0.1007, "teens": 0.2119, "fifties": 0.0039, "fourties": 0.0173 }, "gender": { "": 0.3046, "male_masculine": 0.437, "female_feminine": 0.2581, "intersex": 0.0001, "do_not_wish_to_say": 0.0002 }, "sentence_domain": {} }, "users": 666, "size": 1520453235, "checksum": "1023a14b3252b18a05a9ec86914e00f5338bdb00824d287bfdd80b3ef2d5a1ab", "avgDurationSecs": 3.991, "validDurationSecs": 121059.534, "totalHrs": 67.02, "validHrs": 33.62 }, "ig": { "buckets": { "dev": 587, "invalidated": 657, "other": 9928, "test": 573, "train": 635, "validated": 1805 }, "duration": 67250664, "reportedSentences": 41, "validatedSentences": 15294, "unvalidatedSentences": 3099, "clips": 12390, "splits": { "accent": { "": 1438 }, "variant": {}, "age": { "": 0.2783, "twenties": 0.3984, "teens": 0.1045, "eighties": 0.002, "thirties": 0.1958, "sixties": 0.0109, "fourties": 0.0097, "fifties": 0.0004 }, "gender": { "": 0.3697, "male_masculine": 0.0646, "female_feminine": 0.5656 }, "sentence_domain": { "language_fundamentals": 2, "history_law_government": 1, "media_entertainment": 2, "agriculture_food": 1 } }, "users": 182, "size": 391298720, "checksum": "c97cfad66b4e0cd750ce5abe437302bd9178fb6ebe4980beb4413cd7498a29c1", "avgDurationSecs": 5.428, "validDurationSecs": 9797.211, "totalHrs": 18.68, "validHrs": 2.72 }, "ipk": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 160, "train": 3104, "validated": 3264 }, "duration": 25908516, "reportedSentences": 0, "validatedSentences": 4386, 
"unvalidatedSentences": 27, "clips": 3264, "splits": { "accent": {}, "variant": {}, "age": { "eighties": 0.951, "": 0.049 }, "gender": { "female_feminine": 0.951, "": 0.049 }, "sentence_domain": {} }, "users": 2, "size": 153468347, "checksum": "ce774bec16071d4741f6bfc3943bdc473157dbc00cd7e1bf46cc21d3f22e35f3", "avgDurationSecs": 7.938, "validDurationSecs": 25908.516, "totalHrs": 7.19, "validHrs": 7.19 }, "is": { "buckets": { "dev": 0, "invalidated": 15, "other": 290, "test": 35, "train": 74, "validated": 109 }, "duration": 2660040, "reportedSentences": 1, "validatedSentences": 5083, "unvalidatedSentences": 113, "clips": 414, "splits": { "accent": { "": 344 }, "variant": {}, "age": { "": 0.3744, "fourties": 0.0121, "thirties": 0.6135 }, "gender": { "": 0.9638, "male_masculine": 0.0362 }, "sentence_domain": {} }, "users": 8, "size": 16015289, "checksum": "a5f021b2cef026d3a2201f359f08681df1d8b2fd2a1314d371aef675a55ac861", "avgDurationSecs": 6.425, "validDurationSecs": 700.349, "totalHrs": 0.73, "validHrs": 0.19 }, "it": { "buckets": { "dev": 15177, "invalidated": 20876, "other": 20242, "test": 15184, "train": 173515, "validated": 240502 }, "duration": 1529903652, "reportedSentences": 5978, "validatedSentences": 925138, "unvalidatedSentences": 908, "clips": 281620, "splits": { "accent": { "": 26093 }, "variant": {}, "age": { "thirties": 0.1574, "twenties": 0.198, "": 0.272, "fifties": 0.1378, "fourties": 0.1232, "seventies": 0.0038, "sixties": 0.0985, "teens": 0.0094, "eighties": 0, "nineties": 0 }, "gender": { "female_feminine": 0.1014, "male_masculine": 0.6199, "": 0.2784, "do_not_wish_to_say": 0.0003 }, "sentence_domain": { "general": 11, "service_retail": 1, "language_fundamentals": 3, "healthcare": 1, "news_current_affairs": 1, "nature_environment": 3, "history_law_government": 5, "media_entertainment": 1, "technology_robotics": 1 } }, "users": 7335, "size": 10424613036, "checksum": "486958feed03d52d2cccc7aa35807c8bf7b118011c1c672b762107c4bc59a80f", 
"avgDurationSecs": 5.433, "validDurationSecs": 1306529.679, "totalHrs": 424.97, "validHrs": 362.92 }, "ja": { "buckets": { "dev": 9019, "invalidated": 55101, "other": 229564, "test": 9019, "train": 19695, "validated": 299767 }, "duration": 2610422145, "reportedSentences": 791, "validatedSentences": 43942, "unvalidatedSentences": 7265, "clips": 584432, "splits": { "accent": { "": 53100 }, "variant": {}, "age": { "twenties": 0.5458, "": 0.1672, "teens": 0.0817, "fifties": 0.0431, "thirties": 0.0689, "fourties": 0.0828, "sixties": 0.0093, "seventies": 0.001, "nineties": 0.0002 }, "gender": { "male_masculine": 0.2681, "": 0.3321, "female_feminine": 0.3902, "non-binary": 0.0002, "do_not_wish_to_say": 0.0088, "transgender": 0.0003, "intersex": 0.0002 }, "sentence_domain": { "agriculture_food": 9173, "media_entertainment": 5068, "healthcare": 2647, "history_law_government": 4744, "finance": 2214, "general": 24569, "service_retail": 490, "language_fundamentals": 3849, "nature_environment": 2761, "news_current_affairs": 1068, "automotive_transport": 2152, "technology_robotics": 674 } }, "users": 7813, "size": 15393560105, "checksum": "983c8b9f02f68f3ad24ceb340446799f51dfc0dd0bcf249fccd811c7c4a24dc3", "avgDurationSecs": 4.467, "validDurationSecs": 1338938.346, "totalHrs": 725.11, "validHrs": 371.92 }, "jgo": { "buckets": { "dev": 336, "invalidated": 89, "other": 20, "test": 336, "train": 362, "validated": 6080 }, "duration": 41432724, "reportedSentences": 0, "validatedSentences": 1035, "unvalidatedSentences": 2, "clips": 6189, "splits": { "accent": {}, "variant": {}, "age": { "": 0.4392, "thirties": 0.149, "fourties": 0.3013, "twenties": 0.1105 }, "gender": { "": 0.6683, "female_feminine": 0.3317 }, "sentence_domain": { "general": 1, "media_entertainment": 1, "nature_environment": 1 } }, "users": 16, "size": 227487146, "checksum": "92565166dd1ae4047375b8e5f2efc15c327be45cf90f2660868fa34f040df28c", "avgDurationSecs": 6.695, "validDurationSecs": 40703.015, "totalHrs": 11.5, 
"validHrs": 11.3 }, "jqr": { "buckets": { "dev": 323, "invalidated": 115, "other": 79, "test": 273, "train": 403, "validated": 6060 }, "duration": 36884772, "reportedSentences": 0, "validatedSentences": 999, "unvalidatedSentences": 0, "clips": 6254, "splits": { "accent": {}, "variant": {}, "age": { "": 0.2283, "fifties": 0.1581, "sixties": 0.012, "fourties": 0.6015 }, "gender": { "": 0.4846, "female_feminine": 0.5154 }, "sentence_domain": {} }, "users": 12, "size": 214460084, "checksum": "c48c212770ecb154c48b47e2fb877230ae17d281103a99f6e53445ad26bc50c0", "avgDurationSecs": 5.898, "validDurationSecs": 35740.601, "totalHrs": 10.24, "validHrs": 9.92 }, "ka": { "buckets": { "dev": 13047, "invalidated": 6744, "other": 102888, "test": 13117, "train": 63077, "validated": 117767 }, "duration": 1164595428, "reportedSentences": 1097, "validatedSentences": 132630, "unvalidatedSentences": 1891, "clips": 227399, "splits": { "accent": { "": 17479 }, "variant": {}, "age": { "twenties": 0.2172, "thirties": 0.3259, "": 0.2319, "fourties": 0.0663, "fifties": 0.0358, "teens": 0.106, "sixties": 0.0159, "nineties": 0, "seventies": 0.0011 }, "gender": { "male_masculine": 0.1677, "female_feminine": 0.5417, "": 0.2884, "do_not_wish_to_say": 0.0021 }, "sentence_domain": { "general": 255, "media_entertainment": 23, "news_current_affairs": 10, "language_fundamentals": 32, "healthcare": 10, "nature_environment": 17, "agriculture_food": 9, "history_law_government": 202, "service_retail": 9, "finance": 4, "automotive_transport": 11, "technology_robotics": 3 } }, "users": 2764, "size": 6834487542, "checksum": "1b62c788937a162aea667807716ad27aa77c523ac84de0ce3a1cd466d91de1ea", "avgDurationSecs": 5.121, "validDurationSecs": 603128.905, "totalHrs": 323.49, "validHrs": 167.53 }, "kab": { "buckets": { "dev": 15004, "invalidated": 20080, "other": 112371, "test": 15004, "train": 152533, "validated": 614834 }, "duration": 2497211788, "reportedSentences": 9710, "validatedSentences": 182810, 
"unvalidatedSentences": 205847, "clips": 747285, "splits": { "accent": { "": 552 }, "variant": {}, "age": { "fourties": 0.0921, "thirties": 0.2912, "": 0.2776, "fifties": 0.1848, "twenties": 0.1242, "eighties": 0.0019, "teens": 0.0022, "sixties": 0.025, "seventies": 0.0011 }, "gender": { "male_masculine": 0.5323, "": 0.2644, "female_feminine": 0.2033 }, "sentence_domain": {} }, "users": 1603, "size": 18713729996, "checksum": "d7e7011c431a85610b5772058f2ad4e83aef55f5fb5269cfeec2ab373dcc9ba5", "avgDurationSecs": 3.342, "validDurationSecs": 2054598.597, "totalHrs": 693.66, "validHrs": 570.72 }, "kbd": { "buckets": { "dev": 7558, "invalidated": 2763, "other": 8367, "test": 7560, "train": 13878, "validated": 157434 }, "duration": 1049054796, "reportedSentences": 3995, "validatedSentences": 30253, "unvalidatedSentences": 2391, "clips": 168564, "splits": { "accent": { "": 1407, "kuban": 975, "kuban_zelenchuk": 2590, "bakhsan": 18292, "besleney": 1811, "terek": 16196, "malka": 1203, "mozdok": 442 }, "variant": { "kbd-Cyrl-TR": 2020, "kbd-RU": 15319, "kbd-Cyrl": 96773, "kbd-Cyrl-JOR": 240 }, "age": { "": 0.1389, "thirties": 0.2522, "twenties": 0.2414, "fifties": 0.2368, "fourties": 0.092, "sixties": 0.0072, "teens": 0.0314, "nineties": 0.0001 }, "gender": { "": 0.5714, "female_feminine": 0.4252, "male_masculine": 0.0032, "do_not_wish_to_say": 0.0002 }, "sentence_domain": { "general": 180080, "history_law_government": 289, "news_current_affairs": 6, "nature_environment": 133, "language_fundamentals": 16, "healthcare": 64, "media_entertainment": 9, "automotive_transport": 6, "agriculture_food": 7 } }, "users": 347, "size": 5926501620, "checksum": "c0a6baab168a8992a3f95f4b662e121dbfcfd7c928a3cca81e94595dccee39e9", "avgDurationSecs": 6.223, "validDurationSecs": 979787.456, "totalHrs": 291.4, "validHrs": 272.16 }, "kdh": { "buckets": { "dev": 356, "invalidated": 198, "other": 496, "test": 363, "train": 498, "validated": 13513 }, "duration": 34783992, "reportedSentences": 0, 
"validatedSentences": 1219, "unvalidatedSentences": 0, "clips": 14207, "splits": { "accent": {}, "variant": {}, "age": { "": 0.9768, "thirties": 0.0232 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 20, "size": 182185865, "checksum": "c234959528308eecc284316500d2ac9d6aee87d0870015d36b74c002de88e962", "avgDurationSecs": 2.448, "validDurationSecs": 33084.823, "totalHrs": 9.66, "validHrs": 9.19 }, "khw": { "buckets": { "dev": 1607, "invalidated": 207, "other": 1138, "test": 1535, "train": 2642, "validated": 8500 }, "duration": 66720564, "reportedSentences": 4, "validatedSentences": 7051, "unvalidatedSentences": 200, "clips": 9845, "splits": { "accent": { "": 89 }, "variant": {}, "age": { "fourties": 0.5509, "": 0.0702, "thirties": 0.2647, "fifties": 0.0005, "twenties": 0.1137 }, "gender": { "": 0.999, "female_feminine": 0.001 }, "sentence_domain": { "general": 252, "language_fundamentals": 30, "news_current_affairs": 4, "media_entertainment": 4, "agriculture_food": 18, "nature_environment": 9, "automotive_transport": 53, "history_law_government": 8, "technology_robotics": 1, "healthcare": 4, "finance": 2 } }, "users": 50, "size": 393343846, "checksum": "90e551eef1788418c61491d5178082abbcb6b47b65955384201060742293c4a6", "avgDurationSecs": 6.777, "validDurationSecs": 57605.363, "totalHrs": 18.53, "validHrs": 16 }, "kk": { "buckets": { "dev": 549, "invalidated": 275, "other": 732, "test": 598, "train": 671, "validated": 1827 }, "duration": 14016420, "reportedSentences": 43, "validatedSentences": 6229, "unvalidatedSentences": 241, "clips": 2834, "splits": { "accent": { "": 671 }, "variant": {}, "age": { "": 0.5614, "thirties": 0.0441, "twenties": 0.2184, "teens": 0.0572, "fifties": 0.0473, "fourties": 0.0716 }, "gender": { "": 0.5819, "male_masculine": 0.3024, "female_feminine": 0.1157 }, "sentence_domain": {} }, "users": 201, "size": 80393028, "checksum": "4ac3706ff7b41db98fba061a89143436028fbce12fcd9596e66d54f6b3e86bd8", "avgDurationSecs": 4.946, 
"validDurationSecs": 9035.991, "totalHrs": 3.89, "validHrs": 2.5 }, "kln": { "buckets": { "dev": 6412, "invalidated": 479, "other": 37248, "test": 6076, "train": 11065, "validated": 32315 }, "duration": 317256300, "reportedSentences": 0, "validatedSentences": 29900, "unvalidatedSentences": 61, "clips": 70042, "splits": { "accent": {}, "variant": {}, "age": { "twenties": 0.5924, "": 0.0229, "thirties": 0.0477, "fourties": 0.2913, "teens": 0.0457 }, "gender": { "": 0.27, "female_feminine": 0.7024, "male_masculine": 0.0276 }, "sentence_domain": { "general": 68796, "service_retail": 625, "agriculture_food": 5, "technology_robotics": 483, "nature_environment": 23, "finance": 32, "healthcare": 54, "automotive_transport": 16, "media_entertainment": 2, "news_current_affairs": 2 } }, "users": 41, "size": 1730787788, "checksum": "7262ef0a752c8b386180366a22eeff0d0ccc04a38add1c4faff1e0844bb4240d", "avgDurationSecs": 4.53, "validDurationSecs": 146371.282, "totalHrs": 88.12, "validHrs": 40.65 }, "kls": { "buckets": { "dev": 1258, "invalidated": 142, "other": 6, "test": 1233, "train": 1415, "validated": 9779 }, "duration": 36720936, "reportedSentences": 2, "validatedSentences": 3912, "unvalidatedSentences": 4, "clips": 9927, "splits": { "accent": { "": 5 }, "variant": {}, "age": { "fourties": 0.2982, "": 0.6404, "twenties": 0.0614 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 23, "size": 209489102, "checksum": "b6c2744f448af8cc3192aed9e2f7b8d2193a8b1f2569f9d071c448fa2ac3d3d5", "avgDurationSecs": 3.699, "validDurationSecs": 36173.47, "totalHrs": 10.2, "validHrs": 10.04 }, "kmr": { "buckets": { "dev": 4182, "invalidated": 2637, "other": 24233, "test": 4174, "train": 5603, "validated": 65501 }, "duration": 384361200, "reportedSentences": 1074, "validatedSentences": 17030, "unvalidatedSentences": 1132, "clips": 92371, "splits": { "accent": { "": 13851 }, "variant": {}, "age": { "": 0.4709, "twenties": 0.3514, "thirties": 0.0926, "fourties": 0.0309, "fifties": 0.0442, 
"teens": 0.0094, "sixties": 0.0006 }, "gender": { "": 0.4982, "male_masculine": 0.4107, "female_feminine": 0.0912 }, "sentence_domain": { "media_entertainment": 3, "language_fundamentals": 2, "nature_environment": 6, "general": 10, "agriculture_food": 3, "history_law_government": 2, "healthcare": 1 } }, "users": 690, "size": 2055073824, "checksum": "16426f8bd778bf159a5d2fb13d0988cdb800e0c661d62383c606123e805bb4a7", "avgDurationSecs": 4.161, "validDurationSecs": 272553.539, "totalHrs": 106.76, "validHrs": 75.7 }, "ko": { "buckets": { "dev": 506, "invalidated": 495, "other": 4888, "test": 554, "train": 678, "validated": 1739 }, "duration": 37113300, "reportedSentences": 29, "validatedSentences": 7401, "unvalidatedSentences": 2623, "clips": 7122, "splits": { "accent": { "": 1334 }, "variant": {}, "age": { "": 0.1834, "thirties": 0.1939, "twenties": 0.5319, "teens": 0.0268, "fourties": 0.0382, "sixties": 0.0001, "fifties": 0.0257 }, "gender": { "": 0.4031, "male_masculine": 0.3126, "female_feminine": 0.2755, "non-binary": 0.0042, "do_not_wish_to_say": 0.0046 }, "sentence_domain": { "history_law_government": 1, "service_retail": 1, "technology_robotics": 2, "general": 1 } }, "users": 200, "size": 216897230, "checksum": "1875b466d7433e04e27462b8f4c1ffb1cbfbb6bbc2b972daf5b881520acfbb24", "avgDurationSecs": 5.211, "validDurationSecs": 9062.065, "totalHrs": 10.3, "validHrs": 2.51 }, "ksf": { "buckets": { "dev": 308, "invalidated": 232, "other": 222, "test": 320, "train": 376, "validated": 7414 }, "duration": 65727108, "reportedSentences": 0, "validatedSentences": 1004, "unvalidatedSentences": 0, "clips": 7868, "splits": { "accent": {}, "variant": {}, "age": { "": 0.8795, "twenties": 0.071, "thirties": 0.0494 }, "gender": { "": 0.9506, "female_feminine": 0.0494 }, "sentence_domain": {} }, "users": 24, "size": 359207958, "checksum": "92e56c8dffb1f859e2edeedb82c720015dc31b3b202c5dda1e9334953fdc8aaf", "avgDurationSecs": 8.354, "validDurationSecs": 61934.517, "totalHrs": 18.25, 
"validHrs": 17.2 }, "kvx": { "buckets": { "dev": 654, "invalidated": 36, "other": 57, "test": 450, "train": 913, "validated": 7464 }, "duration": 40217904, "reportedSentences": 1, "validatedSentences": 2025, "unvalidatedSentences": 0, "clips": 7557, "splits": { "accent": {}, "variant": {}, "age": { "thirties": 0.2713, "": 0.0385, "fourties": 0.2051, "twenties": 0.2027, "teens": 0.2764, "seventies": 0.0026, "sixties": 0.002, "fifties": 0.0013 }, "gender": { "": 0.7949, "female_feminine": 0.2051 }, "sentence_domain": {} }, "users": 22, "size": 230163617, "checksum": "58ef9909ec4a33d6c0ecf91a5d21344ecf61215aa1c06031f274cd2733026797", "avgDurationSecs": 5.322, "validDurationSecs": 39722.964, "totalHrs": 11.17, "validHrs": 11.03 }, "kw": { "buckets": { "dev": 0, "invalidated": 398, "other": 60, "test": 2401, "train": 6223, "validated": 10806 }, "duration": 46422216, "reportedSentences": 43, "validatedSentences": 9609, "unvalidatedSentences": 0, "clips": 11264, "splits": { "accent": { "": 2828 }, "variant": {}, "age": { "fifties": 0.4706, "fourties": 0.3432, "": 0.1167, "seventies": 0.0467, "sixties": 0.0228 }, "gender": { "": 0.6559, "female_feminine": 0.3441 }, "sentence_domain": { "general": 12719 } }, "users": 10, "size": 272873692, "checksum": "25ff0f54a7e973eedf7e3375eb2d50989468eac2b16b88466d750c26f700dd7e", "avgDurationSecs": 4.121, "validDurationSecs": 44534.665, "totalHrs": 12.89, "validHrs": 12.37 }, "kxp": { "buckets": { "dev": 0, "invalidated": 95, "other": 20, "test": 197, "train": 1881, "validated": 11644 }, "duration": 40016700, "reportedSentences": 0, "validatedSentences": 2079, "unvalidatedSentences": 0, "clips": 11759, "splits": { "accent": {}, "variant": {}, "age": { "thirties": 0.5254, "twenties": 0.2834, "": 0.0077, "fourties": 0.1767, "teens": 0.006, "seventies": 0.0009 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 22, "size": 231913099, "checksum": "134293181e704b5a091de7c5c15f27b8fb1cc912bac8bd2cfad33c0593de77ec", "avgDurationSecs": 
3.403, "validDurationSecs": 39625.347, "totalHrs": 11.11, "validHrs": 11 }, "ky": { "buckets": { "dev": 1614, "invalidated": 5685, "other": 1447, "test": 1615, "train": 1790, "validated": 30719 }, "duration": 172338348, "reportedSentences": 39, "validatedSentences": 5047, "unvalidatedSentences": 240, "clips": 37851, "splits": { "accent": { "": 80 }, "variant": {}, "age": { "thirties": 0.0943, "": 0.071, "fourties": 0.012, "twenties": 0.6493, "teens": 0.1733, "fifties": 0.0001 }, "gender": { "male_masculine": 0.5364, "": 0.1203, "female_feminine": 0.3433 }, "sentence_domain": {} }, "users": 299, "size": 1112700534, "checksum": "47bb63343c432dfbed1ed136e748a00f5e1d1838b9d73af6e15ab799008363d0", "avgDurationSecs": 4.553, "validDurationSecs": 139865.835, "totalHrs": 47.87, "validHrs": 38.85 }, "lg": { "buckets": { "dev": 13356, "invalidated": 39183, "other": 37696, "test": 13391, "train": 71092, "validated": 271894 }, "duration": 2017334043, "reportedSentences": 6479, "validatedSentences": 191409, "unvalidatedSentences": 179, "clips": 348773, "splits": { "accent": { "": 32 }, "variant": {}, "age": { "": 0.256, "thirties": 0.2193, "twenties": 0.378, "fourties": 0.0722, "fifties": 0.0453, "teens": 0.0066, "nineties": 0.0032, "sixties": 0.0172, "seventies": 0.0019, "eighties": 0.0003 }, "gender": { "": 0.2564, "female_feminine": 0.3866, "male_masculine": 0.357 }, "sentence_domain": {} }, "users": 667, "size": 11873593497, "checksum": "892b65b7ce6f45c0ebbea47dea1a60f81a6456da6ebc0254b7c0f4d10e6a7f59", "avgDurationSecs": 5.784, "validDurationSecs": 1572659.071, "totalHrs": 560.37, "validHrs": 436.84 }, "lij": { "buckets": { "dev": 943, "invalidated": 580, "other": 0, "test": 1358, "train": 2304, "validated": 4686 }, "duration": 20379708, "reportedSentences": 378, "validatedSentences": 6079, "unvalidatedSentences": 50, "clips": 5266, "splits": { "accent": { "": 1920 }, "variant": {}, "age": { "thirties": 0.4497, "": 0.1941, "seventies": 0.1147, "twenties": 0.0697, "sixties": 
0.1719 }, "gender": { "male_masculine": 0.4478, "": 0.2693, "female_feminine": 0.2829 }, "sentence_domain": { "general": 364, "automotive_transport": 1, "media_entertainment": 1, "agriculture_food": 14, "news_current_affairs": 1 } }, "users": 39, "size": 113412405, "checksum": "4b1bacd2700a752e37bc850e357202911d422974b3e3bd040e8a8b696eaaebd9", "avgDurationSecs": 3.87, "validDurationSecs": 18135.076, "totalHrs": 5.66, "validHrs": 5.03 }, "lo": { "buckets": { "dev": 28, "invalidated": 16, "other": 14, "test": 39, "train": 126, "validated": 193 }, "duration": 1456560, "reportedSentences": 0, "validatedSentences": 5145, "unvalidatedSentences": 5926, "clips": 223, "splits": { "accent": { "": 48 }, "variant": {}, "age": { "": 0.1121, "twenties": 0.3274, "fourties": 0.0045, "thirties": 0.5561 }, "gender": { "": 0.1121, "male_masculine": 0.8879 }, "sentence_domain": {} }, "users": 16, "size": 9359455, "checksum": "8f7e2495cd61681d72174e991f575d35d041b790e930866d3f3c35fa7456f107", "avgDurationSecs": 6.532, "validDurationSecs": 1260.61, "totalHrs": 0.4, "validHrs": 0.35 }, "lrk": { "buckets": { "dev": 0, "invalidated": 185, "other": 123, "test": 347, "train": 1654, "validated": 11462 }, "duration": 40958568, "reportedSentences": 0, "validatedSentences": 2006, "unvalidatedSentences": 0, "clips": 11770, "splits": { "accent": {}, "variant": {}, "age": { "": 0.0008, "twenties": 0.2837, "fourties": 0.1939, "teens": 0.3417, "fifties": 0.0086, "sixties": 0.1662, "thirties": 0.0051 }, "gender": { "": 0.8296, "female_feminine": 0.1704 }, "sentence_domain": {} }, "users": 20, "size": 240314094, "checksum": "e7e5395eca7e9f62c2e38fead16f31fcae3cc1a2752de79065d7e57f00da0878", "avgDurationSecs": 3.48, "validDurationSecs": 39886.755, "totalHrs": 11.37, "validHrs": 11.07 }, "lss": { "buckets": { "dev": 474, "invalidated": 34, "other": 92, "test": 645, "train": 903, "validated": 10326 }, "duration": 36197136, "reportedSentences": 0, "validatedSentences": 2029, "unvalidatedSentences": 0, 
"clips": 10452, "splits": { "accent": {}, "variant": {}, "age": { "twenties": 0.4485, "thirties": 0.3266, "": 0.0372, "fourties": 0.1876 }, "gender": { "do_not_wish_to_say": 0.1091, "": 0.7031, "female_feminine": 0.1878 }, "sentence_domain": {} }, "users": 15, "size": 184088725, "checksum": "5f079a8290b6dfe4597e11841038ae65d4a61c7cfaaacc0fca1729977e4444ad", "avgDurationSecs": 3.463, "validDurationSecs": 35760.776, "totalHrs": 10.05, "validHrs": 9.93 }, "lt": { "buckets": { "dev": 5545, "invalidated": 923, "other": 3905, "test": 5644, "train": 8640, "validated": 20031 }, "duration": 126874556, "reportedSentences": 191, "validatedSentences": 129304, "unvalidatedSentences": 4821, "clips": 24859, "splits": { "accent": { "": 13173 }, "variant": {}, "age": { "twenties": 0.2607, "": 0.1577, "thirties": 0.2272, "fifties": 0.034, "sixties": 0.0046, "teens": 0.0254, "fourties": 0.2904 }, "gender": { "male_masculine": 0.4144, "": 0.4239, "female_feminine": 0.1603, "do_not_wish_to_say": 0.0014 }, "sentence_domain": {} }, "users": 331, "size": 777925724, "checksum": "f0b209ed17ece3f7912496c64c81de793509ebe4fb559111614652254e56cea3", "avgDurationSecs": 5.104, "validDurationSecs": 102233.567, "totalHrs": 35.24, "validHrs": 28.39 }, "ltg": { "buckets": { "dev": 3663, "invalidated": 1857, "other": 684, "test": 3685, "train": 4777, "validated": 22712 }, "duration": 120576888, "reportedSentences": 80, "validatedSentences": 15086, "unvalidatedSentences": 2268, "clips": 25253, "splits": { "accent": { "": 12789 }, "variant": {}, "age": { "thirties": 0.2366, "fifties": 0.0088, "fourties": 0.3235, "twenties": 0.0882, "": 0.3418, "sixties": 0.0012 }, "gender": { "female_feminine": 0.6063, "male_masculine": 0.0507, "": 0.343 }, "sentence_domain": { "general": 6435, "history_law_government": 732, "healthcare": 1, "agriculture_food": 1, "service_retail": 1 } }, "users": 387, "size": 697775641, "checksum": "4bcc7e3ea772af3372810dfcd22153197bb6a132df052a552e783d37c1492af8", "avgDurationSecs": 
4.775, "validDurationSecs": 108444.236, "totalHrs": 33.49, "validHrs": 30.12 }, "lua": { "buckets": { "dev": 291, "invalidated": 821, "other": 0, "test": 291, "train": 298, "validated": 4801 }, "duration": 37463076, "reportedSentences": 0, "validatedSentences": 880, "unvalidatedSentences": 0, "clips": 5622, "splits": { "accent": { "": 184 }, "variant": {}, "age": { "twenties": 0.6711, "thirties": 0.1882, "": 0.1256, "fourties": 0.0151 }, "gender": { "": 0.6804, "female_feminine": 0.3196 }, "sentence_domain": {} }, "users": 46, "size": 216550903, "checksum": "deabfef1395136661e6add041c3b4a042086aed284215c54e296ff057c09e643", "avgDurationSecs": 6.664, "validDurationSecs": 31992.214, "totalHrs": 10.4, "validHrs": 8.88 }, "luo": { "buckets": { "dev": 3060, "invalidated": 704, "other": 61373, "test": 3001, "train": 4498, "validated": 20343 }, "duration": 401670612, "reportedSentences": 7, "validatedSentences": 14692, "unvalidatedSentences": 13870, "clips": 82420, "splits": { "accent": { "": 20 }, "variant": {}, "age": { "": 0.3582, "fourties": 0.0042, "thirties": 0.2832, "twenties": 0.3352, "fifties": 0.0191, "teens": 0.0001 }, "gender": { "": 0.6053, "female_feminine": 0.3947 }, "sentence_domain": { "history_law_government": 864, "general": 69231, "language_fundamentals": 549, "healthcare": 2229, "news_current_affairs": 2691, "service_retail": 2737, "media_entertainment": 802, "finance": 965, "agriculture_food": 1134, "nature_environment": 628, "automotive_transport": 340, "technology_robotics": 239 } }, "users": 47, "size": 2229793887, "checksum": "a210be0b5c9f319f37874e7b24dd9dfa15001e37b49211fd6e7866d0bbfb1866", "avgDurationSecs": 4.873, "validDurationSecs": 99140.806, "totalHrs": 111.57, "validHrs": 27.53 }, "lv": { "buckets": { "dev": 7825, "invalidated": 11635, "other": 21508, "test": 7825, "train": 14772, "validated": 198641 }, "duration": 1114826885, "reportedSentences": 3107, "validatedSentences": 34635, "unvalidatedSentences": 64972, "clips": 231784, 
"splits": { "accent": { "": 55746 }, "variant": {}, "age": { "thirties": 0.2836, "fourties": 0.0935, "": 0.4436, "twenties": 0.0628, "teens": 0.0133, "fifties": 0.0827, "sixties": 0.019, "eighties": 0.0006, "seventies": 0.0009 }, "gender": { "male_masculine": 0.248, "female_feminine": 0.3009, "": 0.4505, "do_not_wish_to_say": 0.0001, "non-binary": 0.0005 }, "sentence_domain": { "news_current_affairs": 5, "general": 750, "technology_robotics": 15, "media_entertainment": 17, "nature_environment": 45, "healthcare": 36, "finance": 5, "language_fundamentals": 9, "service_retail": 16, "automotive_transport": 3, "agriculture_food": 8, "history_law_government": 9 } }, "users": 6548, "size": 6268439792, "checksum": "d160703c89d9becb93b2f41ab7801ed44e52427d5d43ce461b1a323ac33eb402", "avgDurationSecs": 4.81, "validDurationSecs": 955416.799, "totalHrs": 309.67, "validHrs": 265.39 }, "lzz": { "buckets": { "dev": 3551, "invalidated": 856, "other": 805, "test": 3495, "train": 5051, "validated": 21151 }, "duration": 101506392, "reportedSentences": 32, "validatedSentences": 30216, "unvalidatedSentences": 16402, "clips": 22812, "splits": { "accent": { "atinuri": 841, "arkaburi": 1003, "vitzuri": 4768, "artaşenuri": 2760, "xopuri": 5799, "": 5452, "çxaluri": 5 }, "variant": { "lzz-xopuri": 6330, "lzz-atinuri": 5056, "lzz-arkaburi": 6304 }, "age": { "fifties": 0.2133, "fourties": 0.156, "": 0.2739, "twenties": 0.0708, "thirties": 0.0472, "sixties": 0.2381, "teens": 0.0007, "seventies": 0.0001 }, "gender": { "female_feminine": 0.2964, "": 0.7036 }, "sentence_domain": { "general": 24074, "nature_environment": 959, "news_current_affairs": 256, "healthcare": 10, "automotive_transport": 12, "history_law_government": 2, "agriculture_food": 9, "finance": 5, "language_fundamentals": 40 } }, "users": 120, "size": 572019168, "checksum": "54c6f6c599a954ced1f60bd2fe9124bdd99d23bc96fdaf3fea1eb1bc5c9782bc", "avgDurationSecs": 4.45, "validDurationSecs": 94115.452, "totalHrs": 28.19, "validHrs": 
26.14 }, "mau": { "buckets": { "dev": 252, "invalidated": 13, "other": 216, "test": 312, "train": 555, "validated": 6040 }, "duration": 38969352, "reportedSentences": 0, "validatedSentences": 1119, "unvalidatedSentences": 0, "clips": 6269, "splits": { "accent": {}, "variant": {}, "age": { "thirties": 0.4344, "fourties": 0.3568, "twenties": 0.2088 }, "gender": { "": 0.6454, "female_feminine": 0.3546 }, "sentence_domain": {} }, "users": 7, "size": 207504483, "checksum": "06e76a6633a31de767281674be46d47831cebc0ea3c2daddb548facda09757fb", "avgDurationSecs": 6.216, "validDurationSecs": 37545.842, "totalHrs": 10.82, "validHrs": 10.42 }, "mbo": { "buckets": { "dev": 321, "invalidated": 75, "other": 701, "test": 263, "train": 410, "validated": 7106 }, "duration": 43456968, "reportedSentences": 4, "validatedSentences": 998, "unvalidatedSentences": 4, "clips": 7882, "splits": { "accent": {}, "variant": {}, "age": { "": 1 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 13, "size": 254215940, "checksum": "f736c84b143e69636d077db815aa72efdfd86bb60678b3c85e3feae1f549fe64", "avgDurationSecs": 5.513, "validDurationSecs": 39178.535, "totalHrs": 12.07, "validHrs": 10.88 }, "mcf": { "buckets": { "dev": 0, "invalidated": 28, "other": 78, "test": 320, "train": 677, "validated": 13884 }, "duration": 37156428, "reportedSentences": 0, "validatedSentences": 997, "unvalidatedSentences": 0, "clips": 13990, "splits": { "accent": {}, "variant": {}, "age": { "": 0.0534, "twenties": 0.2804, "fourties": 0.0716, "thirties": 0.3401, "teens": 0.2545 }, "gender": { "": 0.74, "female_feminine": 0.26 }, "sentence_domain": {} }, "users": 21, "size": 184570882, "checksum": "39a8785ca75b94bcd81048e699aba704a7c7e954f93a071a47cf4f386516e42c", "avgDurationSecs": 2.656, "validDurationSecs": 36874.9, "totalHrs": 10.32, "validHrs": 10.24 }, "mcn": { "buckets": { "dev": 316, "invalidated": 32, "other": 607, "test": 316, "train": 325, "validated": 8621 }, "duration": 39026016, "reportedSentences": 4, 
"validatedSentences": 959, "unvalidatedSentences": 0, "clips": 9260, "splits": { "accent": {}, "variant": {}, "age": { "": 1 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 35, "size": 228279077, "checksum": "40973b294e771ed7287f110bf84522194aa0ce59702208a870e5489b3c72c418", "avgDurationSecs": 4.214, "validDurationSecs": 36332.968, "totalHrs": 10.84, "validHrs": 10.09 }, "mcx": { "buckets": { "dev": 111, "invalidated": 67, "other": 2, "test": 249, "train": 543, "validated": 5491 }, "duration": 36755964, "reportedSentences": 0, "validatedSentences": 903, "unvalidatedSentences": 0, "clips": 5560, "splits": { "accent": {}, "variant": {}, "age": { "": 0.9903, "thirties": 0.0097 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 10, "size": 217255599, "checksum": "4316d5b58a4c1e1f70c9596fcbba6079d513a8051cec134df5d02aad6896e1c2", "avgDurationSecs": 6.611, "validDurationSecs": 36299.82, "totalHrs": 10.2, "validHrs": 10.08 }, "mdd": { "buckets": { "dev": 0, "invalidated": 25, "other": 0, "test": 46, "train": 877, "validated": 5542 }, "duration": 36172872, "reportedSentences": 0, "validatedSentences": 923, "unvalidatedSentences": 0, "clips": 5567, "splits": { "accent": {}, "variant": {}, "age": { "": 1 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 7, "size": 207183410, "checksum": "4c570dd0030eec3aeb8d3ad7fdf8dcace84e5299b1674ae842d602c4a8e50b76", "avgDurationSecs": 6.498, "validDurationSecs": 36010.429, "totalHrs": 10.04, "validHrs": 10 }, "mdf": { "buckets": { "dev": 54, "invalidated": 9, "other": 0, "test": 107, "train": 175, "validated": 336 }, "duration": 1811340, "reportedSentences": 15, "validatedSentences": 6039, "unvalidatedSentences": 4, "clips": 345, "splits": { "accent": {}, "variant": {}, "age": { "sixties": 0.058, "": 0.5826, "fourties": 0.342, "twenties": 0.0174 }, "gender": { "male_masculine": 0.0754, "": 0.5826, "female_feminine": 0.342 }, "sentence_domain": {} }, "users": 11, "size": 11062151, "checksum": 
"ed236b871c76627c161f5794bc9849cd2776c58cd4489e9cc5a2bdfb049ef840", "avgDurationSecs": 5.25, "validDurationSecs": 1764.088, "totalHrs": 0.5, "validHrs": 0.49 }, "mgg": { "buckets": { "dev": 296, "invalidated": 290, "other": 5, "test": 212, "train": 483, "validated": 4863 }, "duration": 39095352, "reportedSentences": 0, "validatedSentences": 991, "unvalidatedSentences": 0, "clips": 5158, "splits": { "accent": { "": 989 }, "variant": {}, "age": { "": 0.9917, "fourties": 0.0083 }, "gender": { "": 0.9917, "female_feminine": 0.0083 }, "sentence_domain": {} }, "users": 9, "size": 231220366, "checksum": "3955c26fafdcd1df6156623269bf627aafd396fa7b24a02da36218254cdab779", "avgDurationSecs": 7.58, "validDurationSecs": 36859.383, "totalHrs": 10.85, "validHrs": 10.23 }, "mhk": { "buckets": { "dev": 305, "invalidated": 52, "other": 46, "test": 202, "train": 493, "validated": 6597 }, "duration": 41164560, "reportedSentences": 0, "validatedSentences": 1000, "unvalidatedSentences": 0, "clips": 6695, "splits": { "accent": {}, "variant": {}, "age": { "": 0.9323, "fourties": 0.0677 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 9, "size": 230660158, "checksum": "3a7fe5fea23ffc49875c40e4c578078c9307da887ddd85bdd05b459855a00bd4", "avgDurationSecs": 6.149, "validDurationSecs": 40562.002, "totalHrs": 11.43, "validHrs": 11.26 }, "mhr": { "buckets": { "dev": 14663, "invalidated": 16021, "other": 5, "test": 15152, "train": 186565, "validated": 218744 }, "duration": 1085137524, "reportedSentences": 100, "validatedSentences": 328269, "unvalidatedSentences": 3, "clips": 234770, "splits": { "accent": { "": 8728 }, "variant": {}, "age": { "fifties": 0.0385, "": 0.1173, "sixties": 0.0322, "thirties": 0.2094, "fourties": 0.1199, "twenties": 0.3334, "teens": 0.1446, "seventies": 0.0047 }, "gender": { "male_masculine": 0.1944, "": 0.117, "female_feminine": 0.6886 }, "sentence_domain": {} }, "users": 500, "size": 6147326746, "checksum": 
"f86456ac96d42d72f2410e6320c7757004c5a4fadb533e5fe3e758532f4f519f", "avgDurationSecs": 4.622, "validDurationSecs": 1011063.264, "totalHrs": 301.42, "validHrs": 280.85 }, "mk": { "buckets": { "dev": 1815, "invalidated": 1055, "other": 21469, "test": 1828, "train": 2067, "validated": 18703 }, "duration": 198157644, "reportedSentences": 161, "validatedSentences": 6123, "unvalidatedSentences": 571, "clips": 41227, "splits": { "accent": { "": 11503 }, "variant": {}, "age": { "thirties": 0.265, "": 0.105, "twenties": 0.3577, "teens": 0.0255, "fourties": 0.1968, "fifties": 0.0196, "sixties": 0.0303 }, "gender": { "male_masculine": 0.2746, "": 0.3374, "female_feminine": 0.3871, "do_not_wish_to_say": 0.0008 }, "sentence_domain": { "media_entertainment": 7, "general": 219, "healthcare": 27, "finance": 7, "agriculture_food": 14, "nature_environment": 21, "history_law_government": 9, "technology_robotics": 1 } }, "users": 468, "size": 1092673629, "checksum": "907f7e36c97742772af84959a7c160bd5d3bae4d59e2123feaacecb5ab85b24b", "avgDurationSecs": 4.807, "validDurationSecs": 89896.001, "totalHrs": 55.04, "validHrs": 24.97 }, "mki": { "buckets": { "dev": 0, "invalidated": 135, "other": 6, "test": 26, "train": 2005, "validated": 11184 }, "duration": 36174528, "reportedSentences": 0, "validatedSentences": 2055, "unvalidatedSentences": 0, "clips": 11325, "splits": { "accent": {}, "variant": {}, "age": { "thirties": 0.7189, "twenties": 0.2787, "": 0.0024 }, "gender": { "": 0.8192, "female_feminine": 0.1808 }, "sentence_domain": {} }, "users": 12, "size": 196867286, "checksum": "dbd6a2939aed327f141970fe8b329813c143011f7df288ab516a7a9b035c761e", "avgDurationSecs": 3.194, "validDurationSecs": 35724.143, "totalHrs": 10.04, "validHrs": 9.92 }, "ml": { "buckets": { "dev": 941, "invalidated": 120, "other": 6025, "test": 888, "train": 1232, "validated": 3485 }, "duration": 40854852, "reportedSentences": 211, "validatedSentences": 5487, "unvalidatedSentences": 4728, "clips": 9630, "splits": { 
"accent": { "": 4290 }, "variant": {}, "age": { "": 0.2221, "twenties": 0.5871, "thirties": 0.0776, "fourties": 0.1079, "sixties": 0.0047, "teens": 0.0006 }, "gender": { "": 0.3396, "male_masculine": 0.4709, "female_feminine": 0.1895 }, "sentence_domain": {} }, "users": 154, "size": 230958780, "checksum": "d5ae24f10fc510d005e2c53663bfa4c792917636b1abe1c37ed832e693d15c66", "avgDurationSecs": 4.242, "validDurationSecs": 14784.959, "totalHrs": 11.34, "validHrs": 4.1 }, "mn": { "buckets": { "dev": 1935, "invalidated": 3028, "other": 59142, "test": 1936, "train": 2195, "validated": 33735 }, "duration": 469181852, "reportedSentences": 46, "validatedSentences": 6099, "unvalidatedSentences": 15, "clips": 95905, "splits": { "accent": { "": 12360 }, "variant": {}, "age": { "thirties": 0.0329, "": 0.0401, "twenties": 0.9146, "fourties": 0.0014, "teens": 0.0026, "nineties": 0.0075, "fifties": 0.0001, "eighties": 0.0008 }, "gender": { "male_masculine": 0.1437, "": 0.4541, "female_feminine": 0.398, "do_not_wish_to_say": 0.0042 }, "sentence_domain": { "general": 35, "news_current_affairs": 17, "history_law_government": 3, "healthcare": 4, "language_fundamentals": 3, "media_entertainment": 3, "nature_environment": 6 } }, "users": 609, "size": 2869479251, "checksum": "a0e9ad3d5eaf63a55f3ef115feaf0e10fa26d278a70671256a7a0976c91e486e", "avgDurationSecs": 4.892, "validDurationSecs": 165036.753, "totalHrs": 130.32, "validHrs": 45.84 }, "mr": { "buckets": { "dev": 1766, "invalidated": 2301, "other": 2793, "test": 1798, "train": 2189, "validated": 10997 }, "duration": 99768888, "reportedSentences": 59, "validatedSentences": 6717, "unvalidatedSentences": 12102, "clips": 16091, "splits": { "accent": { "": 2527 }, "variant": {}, "age": { "thirties": 0.1286, "sixties": 0.004, "twenties": 0.2845, "": 0.0556, "teens": 0.5263, "fourties": 0.0006, "fifties": 0.0003 }, "gender": { "male_masculine": 0.1912, "female_feminine": 0.7524, "": 0.0564 }, "sentence_domain": {} }, "users": 105, "size": 
588617134, "checksum": "b8bd6d46f2036df6c294a2706744cf48ccd7ee7a1896e65490f4229c2ebfd6d3", "avgDurationSecs": 6.2, "validDurationSecs": 68184.604, "totalHrs": 27.71, "validHrs": 18.94 }, "mrj": { "buckets": { "dev": 7314, "invalidated": 1261, "other": 1459, "test": 7164, "train": 14325, "validated": 28973 }, "duration": 132879852, "reportedSentences": 50, "validatedSentences": 54143, "unvalidatedSentences": 4, "clips": 31693, "splits": { "accent": {}, "variant": {}, "age": { "twenties": 0.3524, "": 0.0955, "thirties": 0.2234, "sixties": 0.0353, "fourties": 0.1211, "fifties": 0.1133, "teens": 0.059 }, "gender": { "male_masculine": 0.2516, "female_feminine": 0.6213, "": 0.1271 }, "sentence_domain": {} }, "users": 60, "size": 750451513, "checksum": "f6f1e13eed8b8e902c0f805b1732787c7546ecbcbf33d4061ee184ccf41a5020", "avgDurationSecs": 4.193, "validDurationSecs": 121475.656, "totalHrs": 36.91, "validHrs": 33.74 }, "ms": { "buckets": { "dev": 7, "invalidated": 1, "other": 2087, "test": 9, "train": 17, "validated": 39 }, "duration": 13135500, "reportedSentences": 0, "validatedSentences": 257, "unvalidatedSentences": 41, "clips": 2127, "splits": { "accent": { "": 20 }, "variant": {}, "age": { "": 0.9953, "twenties": 0.0024, "thirties": 0.0024 }, "gender": { "": 1 }, "sentence_domain": { "history_law_government": 29, "automotive_transport": 6, "technology_robotics": 9 } }, "users": 28, "size": 77188480, "checksum": "927ff0cb031289e7d71ca63f4273061cc9f4a7a510e4c22e00672d5dc8c591da", "avgDurationSecs": 6.176, "validDurationSecs": 240.848, "totalHrs": 3.64, "validHrs": 0.06 }, "mse": { "buckets": { "dev": 327, "invalidated": 228, "other": 9, "test": 284, "train": 391, "validated": 4338 }, "duration": 29334204, "reportedSentences": 0, "validatedSentences": 1002, "unvalidatedSentences": 0, "clips": 4575, "splits": { "accent": {}, "variant": {}, "age": { "twenties": 0.2085, "": 0.7915 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 18, "size": 162132698, "checksum": 
"04dcdaed19b868147942c2b80e4e16c8b9ad17d9fd68f7d6ee5d7b7c8f1b4a67", "avgDurationSecs": 6.412, "validDurationSecs": 27814.596, "totalHrs": 8.14, "validHrs": 7.72 }, "mt": { "buckets": { "dev": 1627, "invalidated": 326, "other": 6401, "test": 1669, "train": 1907, "validated": 6620 }, "duration": 63360432, "reportedSentences": 10, "validatedSentences": 5274, "unvalidatedSentences": 11772, "clips": 13347, "splits": { "accent": { "": 205 }, "variant": {}, "age": { "twenties": 0.197, "": 0.266, "fourties": 0.1651, "thirties": 0.0879, "teens": 0.0255, "fifties": 0.2476, "sixties": 0.0109 }, "gender": { "male_masculine": 0.2487, "": 0.2891, "female_feminine": 0.4622 }, "sentence_domain": {} }, "users": 224, "size": 471558326, "checksum": "08c1ca2125ce076dc4a5390e883b9754d1a8cad74a21fab3eabc242391ae00d1", "avgDurationSecs": 4.747, "validDurationSecs": 31426.243, "totalHrs": 17.6, "validHrs": 8.72 }, "mua": { "buckets": { "dev": 238, "invalidated": 22, "other": 0, "test": 328, "train": 429, "validated": 8672 }, "duration": 34829352, "reportedSentences": 5, "validatedSentences": 995, "unvalidatedSentences": 0, "clips": 8694, "splits": { "accent": {}, "variant": {}, "age": { "": 1 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 17, "size": 193740155, "checksum": "b2b4562a069393b397fbd06706c2be246b59f445f5644df96e4eaa63bd34f8c3", "avgDurationSecs": 4.006, "validDurationSecs": 34741.217, "totalHrs": 9.67, "validHrs": 9.65 }, "mug": { "buckets": { "dev": 297, "invalidated": 72, "other": 0, "test": 297, "train": 404, "validated": 2687 }, "duration": 20065428, "reportedSentences": 0, "validatedSentences": 1002, "unvalidatedSentences": 0, "clips": 2759, "splits": { "accent": {}, "variant": {}, "age": { "": 0.9946, "thirties": 0.0054 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 17, "size": 118779069, "checksum": "d24da0cc1e464916ec7adef4d82b4ee4ac79cca9483420e6bee4e29c2f4e7d92", "avgDurationSecs": 7.273, "validDurationSecs": 19541.792, "totalHrs": 5.57, 
"validHrs": 5.42 }, "mve": { "buckets": { "dev": 555, "invalidated": 31, "other": 11, "test": 146, "train": 1296, "validated": 7648 }, "duration": 36439344, "reportedSentences": 0, "validatedSentences": 2003, "unvalidatedSentences": 0, "clips": 7690, "splits": { "accent": {}, "variant": {}, "age": { "twenties": 0.2649, "thirties": 0.335, "fourties": 0.3908, "": 0.0016, "teens": 0.0039, "fifties": 0.0039 }, "gender": { "": 0.9987, "female_feminine": 0.0013 }, "sentence_domain": {} }, "users": 20, "size": 214193637, "checksum": "ca3b672147808bec58d377950c9dfa7aff60de4d3498e2919ddf2c42b22184a8", "avgDurationSecs": 4.739, "validDurationSecs": 36240.325, "totalHrs": 10.12, "validHrs": 10.06 }, "mvy": { "buckets": { "dev": 1946, "invalidated": 570, "other": 1284, "test": 2077, "train": 2558, "validated": 16601 }, "duration": 88423920, "reportedSentences": 90, "validatedSentences": 6635, "unvalidatedSentences": 221, "clips": 18455, "splits": { "accent": { "": 2910 }, "variant": {}, "age": { "twenties": 0.3943, "": 0.0391, "thirties": 0.4723, "fourties": 0.0591, "fifties": 0.0318, "teens": 0.0035 }, "gender": { "": 0.9989, "female_feminine": 0.0011 }, "sentence_domain": { "history_law_government": 78, "technology_robotics": 9, "agriculture_food": 33, "general": 108, "nature_environment": 7, "media_entertainment": 11 } }, "users": 56, "size": 469941412, "checksum": "a237f3a5cfd31d87046a8201ea63d813cfafcc13eae22cad6c8424d53f5cc38c", "avgDurationSecs": 4.791, "validDurationSecs": 79540.802, "totalHrs": 24.56, "validHrs": 22.09 }, "mxu": { "buckets": { "dev": 309, "invalidated": 7, "other": 2, "test": 317, "train": 333, "validated": 7749 }, "duration": 44476020, "reportedSentences": 0, "validatedSentences": 959, "unvalidatedSentences": 0, "clips": 7758, "splits": { "accent": {}, "variant": {}, "age": { "": 0.559, "twenties": 0.3229, "thirties": 0.1181 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 22, "size": 260747916, "checksum": 
"5b918757c12ee6b2232238088b13fae9cc49c03ae7c2f3ff58470553d46b7714", "avgDurationSecs": 5.733, "validDurationSecs": 44424.424, "totalHrs": 12.35, "validHrs": 12.34 }, "myv": { "buckets": { "dev": 239, "invalidated": 23, "other": 15, "test": 482, "train": 1241, "validated": 1972 }, "duration": 11624940, "reportedSentences": 31, "validatedSentences": 5066, "unvalidatedSentences": 7, "clips": 2010, "splits": { "accent": { "": 35 }, "variant": {}, "age": { "sixties": 0.2582, "": 0.3871, "thirties": 0.2403, "twenties": 0.0871, "teens": 0.01, "fourties": 0.0174 }, "gender": { "male_masculine": 0.5408, "": 0.3871, "female_feminine": 0.0721 }, "sentence_domain": {} }, "users": 13, "size": 68338555, "checksum": "65450d92435898b2cb0943cfc2d45a35f2b501efb3cbb5174ee77e50be0d881b", "avgDurationSecs": 5.784, "validDurationSecs": 11405.165, "totalHrs": 3.22, "validHrs": 3.16 }, "nan-tw": { "buckets": { "dev": 5999, "invalidated": 628, "other": 2211, "test": 6423, "train": 11507, "validated": 29587 }, "duration": 85950108, "reportedSentences": 223, "validatedSentences": 26907, "unvalidatedSentences": 370, "clips": 32426, "splits": { "accent": { "": 26293, "pingtung_county": 839, "changhua_county": 30, "yilan_county": 5, "yunlin_county": 10, "chiayi_county": 5, "other_county": 5, "taipei_city": 10, "taichung_city": 5, "new_taipei_city": 10 }, "variant": { "nan-TW-tailo": 233, "nan-TW-pehoeji": 45 }, "age": { "thirties": 0.1523, "": 0.1274, "twenties": 0.5031, "fourties": 0.1646, "teens": 0.0174, "fifties": 0.0117, "sixties": 0.0237 }, "gender": { "male_masculine": 0.6192, "": 0.2701, "female_feminine": 0.1104, "do_not_wish_to_say": 0.0003 }, "sentence_domain": { "nature_environment": 55, "language_fundamentals": 55, "service_retail": 2, "healthcare": 1, "general": 802, "agriculture_food": 4 } }, "users": 299, "size": 490372936, "checksum": "94bf37a7d3b369e3a6f5ac2f29b3085a3fb94a1ca167547644b65561af7e51e9", "avgDurationSecs": 2.651, "validDurationSecs": 78424.901, "totalHrs": 23.87, 
"validHrs": 21.78 }, "nb-NO": { "buckets": { "dev": 416, "invalidated": 95, "other": 139, "test": 313, "train": 1273, "validated": 2003 }, "duration": 9378972, "reportedSentences": 1, "validatedSentences": 4403, "unvalidatedSentences": 3665, "clips": 2237, "splits": { "accent": { "": 1895 }, "variant": {}, "age": { "fifties": 0.0161, "": 0.1793, "thirties": 0.6169, "fourties": 0.1878 }, "gender": { "male_masculine": 0.5995, "": 0.4005 }, "sentence_domain": { "general": 717, "nature_environment": 37, "history_law_government": 21, "news_current_affairs": 32, "language_fundamentals": 13, "service_retail": 37, "media_entertainment": 17, "finance": 4, "agriculture_food": 3 } }, "users": 67, "size": 54591548, "checksum": "15d24caf321da466361689b3b55ac85ae2a7da906a1d54d95a6d0094652c3a92", "avgDurationSecs": 4.193, "validDurationSecs": 8397.89, "totalHrs": 2.6, "validHrs": 2.33 }, "ncx": { "buckets": { "dev": 340, "invalidated": 62, "other": 803, "test": 345, "train": 359, "validated": 8644 }, "duration": 42381792, "reportedSentences": 2, "validatedSentences": 1518, "unvalidatedSentences": 5, "clips": 9509, "splits": { "accent": { "": 6254 }, "variant": {}, "age": { "thirties": 0.6339, "twenties": 0.1458, "fourties": 0.1724, "": 0.0432, "teens": 0.0042, "sixties": 0.0005 }, "gender": { "": 0.6101, "male_masculine": 0.0133, "female_feminine": 0.3767 }, "sentence_domain": { "media_entertainment": 4700, "finance": 212, "healthcare": 212, "language_fundamentals": 2047 } }, "users": 41, "size": 243165666, "checksum": "b2c35410070ba3b983926612806a85193a2b481499593acfc2360b1ee968d64d", "avgDurationSecs": 4.457, "validDurationSecs": 38526.471, "totalHrs": 11.77, "validHrs": 10.7 }, "ne-NP": { "buckets": { "dev": 314, "invalidated": 100, "other": 413, "test": 289, "train": 355, "validated": 1181 }, "duration": 6964560, "reportedSentences": 5, "validatedSentences": 1316, "unvalidatedSentences": 30, "clips": 1694, "splits": { "accent": { "": 406 }, "variant": {}, "age": { "thirties": 
0.1234, "": 0.3087, "twenties": 0.5679 }, "gender": { "male_masculine": 0.6057, "": 0.3849, "female_feminine": 0.0094 }, "sentence_domain": { "agriculture_food": 1 } }, "users": 61, "size": 40875666, "checksum": "7d820241c149f0845433f22337ee6bd26db8d195a9138dbc3c56d9eaf138b082", "avgDurationSecs": 4.111, "validDurationSecs": 4855.458, "totalHrs": 1.93, "validHrs": 1.34 }, "nhi": { "buckets": { "dev": 10, "invalidated": 0, "other": 387, "test": 6, "train": 24, "validated": 40 }, "duration": 2169576, "reportedSentences": 0, "validatedSentences": 758, "unvalidatedSentences": 11, "clips": 427, "splits": { "accent": {}, "variant": {}, "age": { "": 0.1874, "thirties": 0.082, "twenties": 0.7307 }, "gender": { "": 0.1874, "male_masculine": 0.082, "female_feminine": 0.7307 }, "sentence_domain": {} }, "users": 6, "size": 12741781, "checksum": "2802f977863e6fd6ede7f68e5596feb5f9becd0452a472c018818ae6a0d5b1c7", "avgDurationSecs": 5.081, "validDurationSecs": 203.239, "totalHrs": 0.6, "validHrs": 0.05 }, "nl": { "buckets": { "dev": 12247, "invalidated": 6139, "other": 5725, "test": 12247, "train": 46392, "validated": 103951 }, "duration": 506253349, "reportedSentences": 407, "validatedSentences": 260894, "unvalidatedSentences": 10519, "clips": 115815, "splits": { "accent": { "netherlands": 80548, "belgium": 8966, "germany": 60, "": 1576, "france": 109, "suriname": 737, "curacao": 195, "aruba": 51, "south_africa": 10 }, "variant": {}, "age": { "": 0.363, "twenties": 0.1816, "fourties": 0.1841, "thirties": 0.1196, "teens": 0.0185, "fifties": 0.1138, "sixties": 0.0183, "nineties": 0, "eighties": 0.0001, "seventies": 0.0008 }, "gender": { "": 0.3828, "male_masculine": 0.5172, "female_feminine": 0.1 }, "sentence_domain": { "general": 455, "technology_robotics": 5, "nature_environment": 56, "automotive_transport": 21, "language_fundamentals": 179, "agriculture_food": 18, "healthcare": 56, "history_law_government": 59, "finance": 8, "news_current_affairs": 4, "media_entertainment": 3 } 
}, "users": 1874, "size": 3395193380, "checksum": "d0d40534eb023bd2845f4dcd5f5586896d6fa9c8aee19016bad4ce2da1e507f2", "avgDurationSecs": 4.371, "validDurationSecs": 454393.143, "totalHrs": 140.62, "validHrs": 126.22 }, "nla": { "buckets": { "dev": 172, "invalidated": 135, "other": 5, "test": 220, "train": 592, "validated": 4635 }, "duration": 33091056, "reportedSentences": 0, "validatedSentences": 984, "unvalidatedSentences": 0, "clips": 4775, "splits": { "accent": {}, "variant": {}, "age": { "": 1 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 15, "size": 187041753, "checksum": "1ce3bcebac6d9994bf9cb465c06cc4d8a22dceae800c277b849ef1dad00a4d69", "avgDurationSecs": 6.93, "validDurationSecs": 32120.847, "totalHrs": 9.19, "validHrs": 8.92 }, "nlv": { "buckets": { "dev": 249, "invalidated": 128, "other": 59, "test": 290, "train": 467, "validated": 6657 }, "duration": 42922260, "reportedSentences": 1, "validatedSentences": 1006, "unvalidatedSentences": 0, "clips": 6844, "splits": { "accent": { "": 1076 }, "variant": {}, "age": { "thirties": 0.8549, "": 0.0102, "twenties": 0.1349 }, "gender": { "": 0.9079, "female_feminine": 0.0884, "male_masculine": 0.0037 }, "sentence_domain": { "media_entertainment": 944, "language_fundamentals": 824, "general": 698 } }, "users": 16, "size": 249293770, "checksum": "998872cf733607abfc538fd4a55d714acd28568b4aef6a7fdd6d6b4be8466997", "avgDurationSecs": 6.272, "validDurationSecs": 41749.486, "totalHrs": 11.92, "validHrs": 11.59 }, "nmg": { "buckets": { "dev": 272, "invalidated": 89, "other": 5, "test": 270, "train": 483, "validated": 5865 }, "duration": 38142936, "reportedSentences": 1, "validatedSentences": 1025, "unvalidatedSentences": 0, "clips": 5959, "splits": { "accent": {}, "variant": {}, "age": { "": 0.8605, "fifties": 0.1395 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 11, "size": 217315183, "checksum": "e6ab21386c4af350902ce206316082559803d1b307c97d4fa810982e6aa87e38", "avgDurationSecs": 6.401, 
"validDurationSecs": 37541.252, "totalHrs": 10.59, "validHrs": 10.42 }, "nmz": { "buckets": { "dev": 795, "invalidated": 106, "other": 313, "test": 795, "train": 846, "validated": 13698 }, "duration": 41604372, "reportedSentences": 2, "validatedSentences": 2446, "unvalidatedSentences": 1, "clips": 14117, "splits": { "accent": {}, "variant": {}, "age": { "": 0.918, "thirties": 0.082 }, "gender": { "": 0.9945, "female_feminine": 0.0055 }, "sentence_domain": {} }, "users": 35, "size": 215927106, "checksum": "d2909d771d5dce4442492808c9b7138e16489787286574acee67c92a27d60370", "avgDurationSecs": 2.947, "validDurationSecs": 40369.532, "totalHrs": 11.55, "validHrs": 11.21 }, "nn-NO": { "buckets": { "dev": 322, "invalidated": 61, "other": 190, "test": 392, "train": 588, "validated": 1307 }, "duration": 6919416, "reportedSentences": 41, "validatedSentences": 5318, "unvalidatedSentences": 6, "clips": 1558, "splits": { "accent": { "": 1795 }, "variant": {}, "age": { "": 0.346, "thirties": 0.3582, "twenties": 0.2452, "fourties": 0.0289, "fifties": 0.0064, "teens": 0.0154 }, "gender": { "": 0.3684, "female_feminine": 0.1175, "male_masculine": 0.5141 }, "sentence_domain": {} }, "users": 41, "size": 40289066, "checksum": "1a8d15804506173e1304af4df141fb2a042e034afef0da2978e0ddac77f2ac95", "avgDurationSecs": 4.441, "validDurationSecs": 5804.671, "totalHrs": 1.92, "validHrs": 1.61 }, "nnh": { "buckets": { "dev": 186, "invalidated": 14, "other": 1, "test": 288, "train": 412, "validated": 7628 }, "duration": 68146020, "reportedSentences": 0, "validatedSentences": 886, "unvalidatedSentences": 0, "clips": 7643, "splits": { "accent": {}, "variant": {}, "age": { "": 0.8905, "thirties": 0.1095 }, "gender": { "": 0.8905, "female_feminine": 0.1095 }, "sentence_domain": {} }, "users": 17, "size": 403364764, "checksum": "95c7f5bef12e5427ec4adb079555271b47199c7caf8cdf190fefa328c3a3252b", "avgDurationSecs": 8.916, "validDurationSecs": 68012.278, "totalHrs": 18.92, "validHrs": 18.89 }, "nr": { 
"buckets": { "dev": 0, "invalidated": 0, "other": 7, "test": 0, "train": 0, "validated": 0 }, "duration": 60696, "reportedSentences": 0, "validatedSentences": 319, "unvalidatedSentences": 7468, "clips": 7, "splits": { "accent": { "": 5 }, "variant": {}, "age": { "": 0.2857, "thirties": 0.7143 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 2, "size": 890459, "checksum": "4c155fd1a4f3d712b1d2a2b7000cbbb0187be9bfa82a7c5d9b541b93c12fdb90", "avgDurationSecs": 8.671, "validDurationSecs": 0, "totalHrs": 0.01, "validHrs": 0 }, "nso": { "buckets": { "dev": 0, "invalidated": 1, "other": 4, "test": 0, "train": 15, "validated": 15 }, "duration": 92592, "reportedSentences": 0, "validatedSentences": 2247, "unvalidatedSentences": 14224, "clips": 20, "splits": { "accent": {}, "variant": {}, "age": { "": 1 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 1, "size": 1568532, "checksum": "98d4d9072a60afa249e19ba7dd635cb56a6994e0e56ddcf2370a9b64947cf2d9", "avgDurationSecs": 4.63, "validDurationSecs": 69.444, "totalHrs": 0.02, "validHrs": 0.01 }, "nyu": { "buckets": { "dev": 0, "invalidated": 44, "other": 1, "test": 248, "train": 1023, "validated": 3639 }, "duration": 33372972, "reportedSentences": 0, "validatedSentences": 1271, "unvalidatedSentences": 0, "clips": 3684, "splits": { "accent": { "": 59 }, "variant": {}, "age": { "": 0.3241, "twenties": 0.336, "thirties": 0.3398 }, "gender": { "": 0.664, "female_feminine": 0.336 }, "sentence_domain": { "agriculture_food": 4, "history_law_government": 2, "healthcare": 2, "general": 1075, "nature_environment": 4 } }, "users": 5, "size": 189719934, "checksum": "a2718dd28f5360cc07c495b96720943bc7d70471815d54df1b645fecd17797ee", "avgDurationSecs": 9.059, "validDurationSecs": 32965.322, "totalHrs": 9.27, "validHrs": 9.15 }, "oc": { "buckets": { "dev": 273, "invalidated": 203, "other": 7698, "test": 275, "train": 303, "validated": 1984 }, "duration": 48154680, "reportedSentences": 13, "validatedSentences": 4934, 
"unvalidatedSentences": 14, "clips": 9885, "splits": { "accent": { "": 2328 }, "variant": {}, "age": { "": 0.502, "fifties": 0.1555, "twenties": 0.0157, "thirties": 0.0361, "seventies": 0.002, "fourties": 0.1702, "teens": 0.0051, "sixties": 0.1135 }, "gender": { "": 0.5171, "male_masculine": 0.084, "female_feminine": 0.3989 }, "sentence_domain": {} }, "users": 149, "size": 274949376, "checksum": "f2d20240846b8a9fed5f9a9433370d9779a26e53da8f637d907d0def335ed3b2", "avgDurationSecs": 4.871, "validDurationSecs": 9665.036, "totalHrs": 13.37, "validHrs": 2.68 }, "odk": { "buckets": { "dev": 467, "invalidated": 46, "other": 0, "test": 627, "train": 941, "validated": 6341 }, "duration": 40676832, "reportedSentences": 0, "validatedSentences": 2047, "unvalidatedSentences": 0, "clips": 6387, "splits": { "accent": {}, "variant": {}, "age": { "fifties": 0.0778, "": 0.3138, "thirties": 0.1843, "twenties": 0.1956, "teens": 0.15, "fourties": 0.0786 }, "gender": { "": 0.6975, "male_masculine": 0.1447, "female_feminine": 0.1578 }, "sentence_domain": {} }, "users": 20, "size": 236376686, "checksum": "cbec787af61b44f191b5da4ecaed33b6489624159e354bcbe427adafb3691819", "avgDurationSecs": 6.369, "validDurationSecs": 40383.872, "totalHrs": 11.29, "validHrs": 11.21 }, "or": { "buckets": { "dev": 640, "invalidated": 439, "other": 19710, "test": 479, "train": 2151, "validated": 4089 }, "duration": 134259324, "reportedSentences": 100, "validatedSentences": 6201, "unvalidatedSentences": 86, "clips": 24238, "splits": { "accent": { "": 14911 }, "variant": {}, "age": { "twenties": 0.6549, "": 0.0569, "thirties": 0.2712, "fourties": 0.0117, "teens": 0.0006, "fifties": 0.0047 }, "gender": { "male_masculine": 0.3414, "": 0.4213, "female_feminine": 0.0308, "do_not_wish_to_say": 0.2065 }, "sentence_domain": { "general": 7 } }, "users": 164, "size": 832859086, "checksum": "55f192055118f504109854b61848a6f9cefc65601f46f07dee25409f436dc532", "avgDurationSecs": 5.539, "validDurationSecs": 22649.822, 
"totalHrs": 37.29, "validHrs": 6.29 }, "oru": { "buckets": { "dev": 1700, "invalidated": 90, "other": 0, "test": 1459, "train": 4146, "validated": 10439 }, "duration": 76930020, "reportedSentences": 0, "validatedSentences": 7355, "unvalidatedSentences": 902, "clips": 10529, "splits": { "accent": {}, "variant": {}, "age": { "sixties": 0.6413, "": 0.3587 }, "gender": { "do_not_wish_to_say": 0.3081, "": 0.3587, "male_masculine": 0.3332 }, "sentence_domain": { "general": 117, "agriculture_food": 117, "language_fundamentals": 117 } }, "users": 15, "size": 457492822, "checksum": "e8c203081aadbbd9fe5e61daeb8e7e98348371b079dacdd6286181c115bc6e4c", "avgDurationSecs": 7.306, "validDurationSecs": 76272.436, "totalHrs": 21.36, "validHrs": 21.18 }, "os": { "buckets": { "dev": 253, "invalidated": 35, "other": 4, "test": 219, "train": 414, "validated": 917 }, "duration": 5280084, "reportedSentences": 5, "validatedSentences": 1888, "unvalidatedSentences": 34, "clips": 956, "splits": { "accent": { "": 660 }, "variant": {}, "age": { "": 0.2291, "fourties": 0.1998, "teens": 0.0209, "twenties": 0.0879, "thirties": 0.4623 }, "gender": { "": 0.7793, "male_masculine": 0.2207 }, "sentence_domain": { "history_law_government": 1 } }, "users": 37, "size": 30917348, "checksum": "e2b97e8158b84c368ba92f9d394158f8ede445304d3fcb04281cdf0598cc48e4", "avgDurationSecs": 5.523, "validDurationSecs": 5064.683, "totalHrs": 1.46, "validHrs": 1.4 }, "pa-IN": { "buckets": { "dev": 489, "invalidated": 104, "other": 1343, "test": 524, "train": 808, "validated": 1821 }, "duration": 15704738, "reportedSentences": 245, "validatedSentences": 14825, "unvalidatedSentences": 22, "clips": 3268, "splits": { "accent": { "": 657 }, "variant": {}, "age": { "": 0.2855, "fourties": 0.0627, "fifties": 0.0431, "thirties": 0.3898, "twenties": 0.213, "sixties": 0.0031, "teens": 0.0028 }, "gender": { "": 0.3608, "male_masculine": 0.6288, "female_feminine": 0.0104 }, "sentence_domain": { "language_fundamentals": 1 } }, "users": 
75, "size": 109901539, "checksum": "247bba66858358f7852b075d17d8c58896a06a31221c14fcf7d7070d3dcc487c", "avgDurationSecs": 4.806, "validDurationSecs": 8751.018, "totalHrs": 4.36, "validHrs": 2.43 }, "pcm": { "buckets": { "dev": 326, "invalidated": 52, "other": 1138, "test": 326, "train": 335, "validated": 7754 }, "duration": 51944472, "reportedSentences": 0, "validatedSentences": 987, "unvalidatedSentences": 0, "clips": 8944, "splits": { "accent": { "": 15 }, "variant": {}, "age": { "fourties": 0.1626, "": 0.7261, "thirties": 0.1114 }, "gender": { "": 0.8519, "female_feminine": 0.1481 }, "sentence_domain": {} }, "users": 59, "size": 295929937, "checksum": "4d9924419793d84279e1b7284c15c6e520aa43987db070b265d96e956d16620b", "avgDurationSecs": 5.808, "validDurationSecs": 45033.255, "totalHrs": 14.42, "validHrs": 12.5 }, "phl": { "buckets": { "dev": 1408, "invalidated": 69, "other": 5515, "test": 1378, "train": 1946, "validated": 15558 }, "duration": 104047416, "reportedSentences": 0, "validatedSentences": 4745, "unvalidatedSentences": 1184, "clips": 21142, "splits": { "accent": {}, "variant": {}, "age": { "fourties": 0.1787, "thirties": 0.2267, "twenties": 0.4756, "": 0.0552, "teens": 0.0639 }, "gender": { "": 0.9991, "female_feminine": 0.0009 }, "sentence_domain": { "history_law_government": 24, "healthcare": 10, "automotive_transport": 4 } }, "users": 20, "size": 613489119, "checksum": "84d70fc16dbcd1f3c921922f37d7121f761b5f888af3248c358c7d2d860a8182", "avgDurationSecs": 4.921, "validDurationSecs": 76566.536, "totalHrs": 28.9, "validHrs": 21.26 }, "phr": { "buckets": { "dev": 673, "invalidated": 120, "other": 10, "test": 656, "train": 743, "validated": 12695 }, "duration": 50821308, "reportedSentences": 1, "validatedSentences": 2077, "unvalidatedSentences": 0, "clips": 12825, "splits": { "accent": {}, "variant": {}, "age": { "": 0.2916, "fourties": 0.0152, "thirties": 0.1744, "twenties": 0.5188 }, "gender": { "": 0.7009, "female_feminine": 0.2991 }, 
"sentence_domain": {} }, "users": 63, "size": 290032754, "checksum": "fbf4e8b452200c9458c3ce45f03b25a2e338f897e1a09090553ff069c67f7854", "avgDurationSecs": 3.963, "validDurationSecs": 50306.16, "totalHrs": 14.11, "validHrs": 13.97 }, "pl": { "buckets": { "dev": 10063, "invalidated": 6991, "other": 2633, "test": 10063, "train": 25458, "validated": 139066 }, "duration": 679825730, "reportedSentences": 603, "validatedSentences": 230282, "unvalidatedSentences": 23597, "clips": 148690, "splits": { "accent": { "": 6606 }, "variant": {}, "age": { "twenties": 0.2701, "": 0.2426, "teens": 0.02, "thirties": 0.3437, "fourties": 0.1111, "fifties": 0.0062, "nineties": 0.0053, "sixties": 0.001, "seventies": 0 }, "gender": { "male_masculine": 0.5807, "": 0.2818, "female_feminine": 0.1364, "non-binary": 0.001, "transgender": 0.0001 }, "sentence_domain": { "general": 4, "finance": 13, "technology_robotics": 4, "news_current_affairs": 2, "agriculture_food": 3 } }, "users": 3465, "size": 4821699579, "checksum": "ed955ac66045e1404c35e933b696adc92cddd85b5a5e50e72810663f1cf2413c", "avgDurationSecs": 4.572, "validDurationSecs": 635823.828, "totalHrs": 188.84, "validHrs": 176.61 }, "plk": { "buckets": { "dev": 1237, "invalidated": 960, "other": 1960, "test": 662, "train": 2517, "validated": 9015 }, "duration": 59980824, "reportedSentences": 0, "validatedSentences": 4657, "unvalidatedSentences": 0, "clips": 11935, "splits": { "accent": {}, "variant": {}, "age": { "": 0.1378, "sixties": 0.7061, "thirties": 0.1561 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 10, "size": 346938797, "checksum": "7d11f12e012b784bcad84f75d484ae545ef00df9d040dd152cec95031b28225f", "avgDurationSecs": 5.026, "validDurationSecs": 45306.002, "totalHrs": 16.66, "validHrs": 12.58 }, "prq": { "buckets": { "dev": 282, "invalidated": 59, "other": 179, "test": 287, "train": 417, "validated": 7422 }, "duration": 36197676, "reportedSentences": 0, "validatedSentences": 986, "unvalidatedSentences": 0, "clips": 
7660, "splits": { "accent": { "": 1437 }, "variant": {}, "age": { "fourties": 0.5696, "twenties": 0.1735, "thirties": 0.2569 }, "gender": { "": 0.1601, "female_feminine": 0.8399 }, "sentence_domain": {} }, "users": 13, "size": 196537639, "checksum": "1a07a265660445cb24acc6d0cdf1980efce9987a4e062f73a18525f2a4a3eb1a", "avgDurationSecs": 4.726, "validDurationSecs": 35072.996, "totalHrs": 10.05, "validHrs": 9.74 }, "pt": { "buckets": { "dev": 9669, "invalidated": 7872, "other": 27779, "test": 9670, "train": 23092, "validated": 161047 }, "duration": 823679935, "reportedSentences": 2839, "validatedSentences": 43613, "unvalidatedSentences": 108, "clips": 196698, "splits": { "accent": { "": 87116 }, "variant": { "pt-BR": 85061, "pt-PT": 3407 }, "age": { "": 0.243, "twenties": 0.3584, "teens": 0.0229, "thirties": 0.1758, "fourties": 0.1293, "sixties": 0.0446, "fifties": 0.0258, "seventies": 0.0001 }, "gender": { "": 0.2669, "male_masculine": 0.675, "female_feminine": 0.058, "non-binary": 0.0001 }, "sentence_domain": { "general": 1426, "media_entertainment": 82, "history_law_government": 67, "language_fundamentals": 17, "healthcare": 206, "service_retail": 25, "technology_robotics": 193, "finance": 27, "automotive_transport": 37, "nature_environment": 53, "agriculture_food": 198, "news_current_affairs": 4 } }, "users": 3817, "size": 5177099911, "checksum": "8b416d6835abeeef22e31e79f73d5b5e132026a70b3a1b4bfe2f35b82d417d73", "avgDurationSecs": 4.188, "validDurationSecs": 674390.093, "totalHrs": 228.79, "validHrs": 187.33 }, "pua": { "buckets": { "dev": 495, "invalidated": 195, "other": 445, "test": 547, "train": 1153, "validated": 7536 }, "duration": 39943620, "reportedSentences": 1, "validatedSentences": 2201, "unvalidatedSentences": 0, "clips": 8176, "splits": { "accent": {}, "variant": {}, "age": { "": 0.1421, "thirties": 0.3097, "twenties": 0.5482 }, "gender": { "": 0.7194, "female_feminine": 0.2806 }, "sentence_domain": {} }, "users": 9, "size": 226913905, "checksum": 
"8231b06f77784c70f5f0a46fb63b9e5d7a9edbeda687113cbe93a7e8551d3e36", "avgDurationSecs": 4.885, "validDurationSecs": 36816.918, "totalHrs": 11.09, "validHrs": 10.22 }, "pwn": { "buckets": { "dev": 722, "invalidated": 138, "other": 0, "test": 724, "train": 761, "validated": 10800 }, "duration": 53298360, "reportedSentences": 0, "validatedSentences": 2207, "unvalidatedSentences": 241, "clips": 10938, "splits": { "accent": { "": 5 }, "variant": { "pwn-eastern": 7622, "pwn-southern": 1350, "pwn-northern": 452 }, "age": { "": 0.2885, "twenties": 0.0523, "thirties": 0.4344, "fifties": 0.2248 }, "gender": { "": 0.4867, "female_feminine": 0.2349, "do_not_wish_to_say": 0.238, "male_masculine": 0.0404 }, "sentence_domain": { "general": 13132 } }, "users": 27, "size": 295451635, "checksum": "e7539f4ebc41fee5d00fa9b506d30d120168d81a704d229c0d52db166f93eb98", "avgDurationSecs": 4.873, "validDurationSecs": 52625.918, "totalHrs": 14.8, "validHrs": 14.61 }, "qup": { "buckets": { "dev": 288, "invalidated": 2, "other": 2464, "test": 254, "train": 449, "validated": 7202 }, "duration": 57346848, "reportedSentences": 4, "validatedSentences": 991, "unvalidatedSentences": 0, "clips": 9668, "splits": { "accent": { "": 1989 }, "variant": {}, "age": { "": 0.0757, "thirties": 0.7163, "twenties": 0.1056, "fifties": 0.1024 }, "gender": { "": 0.792, "female_feminine": 0.208 }, "sentence_domain": {} }, "users": 15, "size": 329493066, "checksum": "f302b366a76a7916a189749ff3617e6ee27f2c67b395a89099052e8ef0b60b6e", "avgDurationSecs": 5.932, "validDurationSecs": 42719.487, "totalHrs": 15.92, "validHrs": 11.86 }, "qur": { "buckets": { "dev": 0, "invalidated": 27, "other": 41, "test": 32, "train": 977, "validated": 10408 }, "duration": 36351540, "reportedSentences": 0, "validatedSentences": 1010, "unvalidatedSentences": 0, "clips": 10476, "splits": { "accent": { "": 4464 }, "variant": {}, "age": { "": 0.0006, "thirties": 0.2364, "twenties": 0.763 }, "gender": { "": 0.1924, "female_feminine": 0.8076 }, 
"sentence_domain": {} }, "users": 12, "size": 195413918, "checksum": "df06a9cb1c276dc2a801c444fa787bc221583abb8e27db218179d5035f1efee0", "avgDurationSecs": 3.47, "validDurationSecs": 36115.581, "totalHrs": 10.09, "validHrs": 10.03 }, "qus": { "buckets": { "dev": 314, "invalidated": 77, "other": 432, "test": 330, "train": 354, "validated": 10323 }, "duration": 40610484, "reportedSentences": 0, "validatedSentences": 998, "unvalidatedSentences": 0, "clips": 10832, "splits": { "accent": {}, "variant": {}, "age": { "fifties": 0.0921, "": 0.2008, "twenties": 0.3589, "fourties": 0.0256, "thirties": 0.3226 }, "gender": { "female_feminine": 0.4569, "": 0.5431 }, "sentence_domain": {} }, "users": 24, "size": 217180420, "checksum": "bee1f786d68bc8a0adeaca78c99e7517316118844cb64717a2d877b98b33e036", "avgDurationSecs": 3.749, "validDurationSecs": 38702.181, "totalHrs": 11.28, "validHrs": 10.75 }, "qux": { "buckets": { "dev": 280, "invalidated": 23, "other": 1995, "test": 299, "train": 399, "validated": 6156 }, "duration": 46764684, "reportedSentences": 0, "validatedSentences": 978, "unvalidatedSentences": 0, "clips": 8174, "splits": { "accent": {}, "variant": {}, "age": { "": 0.0172, "thirties": 0.3852, "fourties": 0.0001, "twenties": 0.2393, "fifties": 0.2372, "sixties": 0.1209 }, "gender": { "": 0.3747, "female_feminine": 0.6253 }, "sentence_domain": {} }, "users": 15, "size": 261485674, "checksum": "30027421188de36a8f92ceb18bb44d3e82738db9b3cc1d3c63317191d176edb7", "avgDurationSecs": 5.721, "validDurationSecs": 35219.402, "totalHrs": 12.99, "validHrs": 9.78 }, "quy": { "buckets": { "dev": 0, "invalidated": 7, "other": 1184, "test": 10, "train": 27, "validated": 38 }, "duration": 6137136, "reportedSentences": 0, "validatedSentences": 759, "unvalidatedSentences": 282, "clips": 1229, "splits": { "accent": { "": 229 }, "variant": {}, "age": { "": 0.0171, "fourties": 0.0203, "twenties": 0.6981, "sixties": 0.2644 }, "gender": { "": 0.8137, "female_feminine": 0.1863 }, 
"sentence_domain": {} }, "users": 8, "size": 35489506, "checksum": "7d9cc8d0c13185f563f2e950bdcfb0eab90105fab8be79c5368d98957f4363a1", "avgDurationSecs": 4.994, "validDurationSecs": 189.757, "totalHrs": 1.7, "validHrs": 0.05 }, "qva": { "buckets": { "dev": 200, "invalidated": 15, "other": 17, "test": 326, "train": 473, "validated": 8222 }, "duration": 35388504, "reportedSentences": 0, "validatedSentences": 1000, "unvalidatedSentences": 0, "clips": 8254, "splits": { "accent": { "": 4494 }, "variant": {}, "age": { "thirties": 0.5454, "": 0.1089, "fourties": 0.0036, "sixties": 0.1212, "fifties": 0.1235, "twenties": 0.0974 }, "gender": { "female_feminine": 0.8839, "": 0.1161 }, "sentence_domain": {} }, "users": 13, "size": 193037943, "checksum": "9d9327bd3ad5d624827fe46e9438ee5fa7849fd76006d664dc3dbccece80b7ca", "avgDurationSecs": 4.287, "validDurationSecs": 35251.306, "totalHrs": 9.83, "validHrs": 9.79 }, "qvi": { "buckets": { "dev": 364, "invalidated": 237, "other": 692, "test": 379, "train": 405, "validated": 9267 }, "duration": 45140940, "reportedSentences": 0, "validatedSentences": 1149, "unvalidatedSentences": 0, "clips": 10196, "splits": { "accent": {}, "variant": {}, "age": { "twenties": 0.2577, "thirties": 0.6372, "": 0.001, "teens": 0.1041 }, "gender": { "female_feminine": 0.7154, "": 0.2846 }, "sentence_domain": {} }, "users": 15, "size": 262898457, "checksum": "306afd9969cbbf4e3a0821f6b50e84ec1d6476072f000090a8e7e9437de53151", "avgDurationSecs": 4.427, "validDurationSecs": 41027.961, "totalHrs": 12.53, "validHrs": 11.39 }, "qvj": { "buckets": { "dev": 349, "invalidated": 8, "other": 186, "test": 348, "train": 362, "validated": 6552 }, "duration": 40215924, "reportedSentences": 0, "validatedSentences": 1059, "unvalidatedSentences": 0, "clips": 6746, "splits": { "accent": {}, "variant": {}, "age": { "fourties": 0.3931, "": 0.0007, "twenties": 0.5836, "thirties": 0.0225 }, "gender": { "": 0.2029, "female_feminine": 0.7971 }, "sentence_domain": {} }, "users": 
16, "size": 232522339, "checksum": "db77f1a9cfce30b2fd5e12947b987025772083e32f1c8e9862b23583a7226dfa", "avgDurationSecs": 5.961, "validDurationSecs": 39059.403, "totalHrs": 11.17, "validHrs": 10.84 }, "qvl": { "buckets": { "dev": 222, "invalidated": 2, "other": 13, "test": 321, "train": 457, "validated": 9082 }, "duration": 36019620, "reportedSentences": 0, "validatedSentences": 1000, "unvalidatedSentences": 0, "clips": 9097, "splits": { "accent": { "": 6246 }, "variant": {}, "age": { "": 0.0018, "twenties": 0.8264, "teens": 0.0619, "thirties": 0.1099 }, "gender": { "": 0.3898, "female_feminine": 0.6102 }, "sentence_domain": {} }, "users": 19, "size": 199926070, "checksum": "834859dffd01f8e2cc31794e900acd2e57ecf6309b222b201dc873ac343cbd90", "avgDurationSecs": 3.96, "validDurationSecs": 35960.227, "totalHrs": 10, "validHrs": 9.98 }, "qwa": { "buckets": { "dev": 106, "invalidated": 0, "other": 1146, "test": 308, "train": 586, "validated": 7217 }, "duration": 41404320, "reportedSentences": 0, "validatedSentences": 1000, "unvalidatedSentences": 0, "clips": 8363, "splits": { "accent": { "": 5 }, "variant": {}, "age": { "": 0.0036, "fourties": 0.2012, "fifties": 0.3591, "thirties": 0.3165, "twenties": 0.1196 }, "gender": { "": 0.6005, "female_feminine": 0.3995 }, "sentence_domain": {} }, "users": 13, "size": 220087179, "checksum": "2f790c5a23876a6516c5d6498bc813d6fc4678529831ff01f2ed81979d6f59a3", "avgDurationSecs": 4.951, "validDurationSecs": 35730.596, "totalHrs": 11.5, "validHrs": 9.92 }, "qws": { "buckets": { "dev": 0, "invalidated": 2, "other": 96, "test": 61, "train": 937, "validated": 8879 }, "duration": 37376136, "reportedSentences": 0, "validatedSentences": 998, "unvalidatedSentences": 0, "clips": 8977, "splits": { "accent": { "": 4986 }, "variant": {}, "age": { "": 0.2239, "twenties": 0.5541, "thirties": 0.1112, "fourties": 0.1108 }, "gender": { "": 0.4448, "female_feminine": 0.5552 }, "sentence_domain": {} }, "users": 12, "size": 211220917, "checksum": 
"5ac17a9533e22ac00e5daf18a707f7a3ee9d3f907b4a63c9fe4052bb7e028221", "avgDurationSecs": 4.164, "validDurationSecs": 36968.109, "totalHrs": 10.38, "validHrs": 10.26 }, "qxa": { "buckets": { "dev": 233, "invalidated": 7, "other": 356, "test": 315, "train": 451, "validated": 8356 }, "duration": 37868472, "reportedSentences": 0, "validatedSentences": 999, "unvalidatedSentences": 0, "clips": 8719, "splits": { "accent": { "": 776 }, "variant": {}, "age": { "fourties": 0.1091, "": 0.019, "twenties": 0.6696, "thirties": 0.2023 }, "gender": { "": 0.448, "female_feminine": 0.552 }, "sentence_domain": {} }, "users": 18, "size": 207758475, "checksum": "02f90413398f5c0475fae5f6a3dbf1523ea8d846119a853d9b8e2e606967c33f", "avgDurationSecs": 4.343, "validDurationSecs": 36291.886, "totalHrs": 10.51, "validHrs": 10.08 }, "qxp": { "buckets": { "dev": 679, "invalidated": 2572, "other": 83, "test": 679, "train": 709, "validated": 22727 }, "duration": 125319060, "reportedSentences": 197, "validatedSentences": 2067, "unvalidatedSentences": 3, "clips": 25382, "splits": { "accent": { "": 8919 }, "variant": {}, "age": { "thirties": 0.2678, "twenties": 0.5355, "sixties": 0.0071, "": 0.0113, "fourties": 0.0919, "fifties": 0.0766, "teens": 0.0099 }, "gender": { "": 0.2822, "female_feminine": 0.7027, "male_masculine": 0.0151 }, "sentence_domain": { "general": 19798, "agriculture_food": 25257, "healthcare": 19834, "news_current_affairs": 161, "finance": 111, "nature_environment": 9752, "history_law_government": 1342 } }, "users": 81, "size": 721260508, "checksum": "26b4c9a27718c54392e8f835aa002c3905a5e9d6c65c605be35104d9b15ee255", "avgDurationSecs": 4.937, "validDurationSecs": 112210.475, "totalHrs": 34.81, "validHrs": 31.16 }, "qxt": { "buckets": { "dev": 60, "invalidated": 12, "other": 2469, "test": 321, "train": 616, "validated": 8647 }, "duration": 47738988, "reportedSentences": 0, "validatedSentences": 998, "unvalidatedSentences": 0, "clips": 11128, "splits": { "accent": { "": 998 }, 
"variant": {}, "age": { "": 0.0044, "fourties": 0.5366, "thirties": 0.1794, "twenties": 0.19, "fifties": 0.0897 }, "gender": { "": 0.2735, "female_feminine": 0.7265 }, "sentence_domain": {} }, "users": 16, "size": 274765415, "checksum": "7c6e9481f063df6b999caff75242391cbe28162322e51cf8891937a6a02c342b", "avgDurationSecs": 4.29, "validDurationSecs": 37095.527, "totalHrs": 13.26, "validHrs": 10.3 }, "qxu": { "buckets": { "dev": 0, "invalidated": 272, "other": 1898, "test": 325, "train": 673, "validated": 8712 }, "duration": 45547416, "reportedSentences": 0, "validatedSentences": 998, "unvalidatedSentences": 0, "clips": 10882, "splits": { "accent": { "": 2776 }, "variant": {}, "age": { "": 0.0032, "fifties": 0.0018, "thirties": 0.3456, "twenties": 0.4683, "teens": 0.0893, "fourties": 0.0917 }, "gender": { "": 0.4654, "female_feminine": 0.5346 }, "sentence_domain": {} }, "users": 16, "size": 262372455, "checksum": "a1fca9d7c71f919bcc807c5ed1c8f341ad06cd891e038923d03c793744667de1", "avgDurationSecs": 4.186, "validDurationSecs": 36464.72, "totalHrs": 12.65, "validHrs": 10.12 }, "qxw": { "buckets": { "dev": 167, "invalidated": 81, "other": 1150, "test": 227, "train": 608, "validated": 8021 }, "duration": 48569508, "reportedSentences": 0, "validatedSentences": 1002, "unvalidatedSentences": 0, "clips": 9252, "splits": { "accent": { "": 6009 }, "variant": {}, "age": { "twenties": 0.5445, "fourties": 0.2161, "thirties": 0.1125, "": 0.0182, "seventies": 0.1082, "fifties": 0.0005 }, "gender": { "": 0.459, "female_feminine": 0.541 }, "sentence_domain": {} }, "users": 14, "size": 275170036, "checksum": "f64f524283774f565dc6df099baf20e10aeac605d918faba18c1720629c9e2d4", "avgDurationSecs": 5.25, "validDurationSecs": 42107.223, "totalHrs": 13.49, "validHrs": 11.69 }, "rm-sursilv": { "buckets": { "dev": 1579, "invalidated": 758, "other": 1902, "test": 1648, "train": 1917, "validated": 5303 }, "duration": 42836729, "reportedSentences": 19, "validatedSentences": 9333, 
"unvalidatedSentences": 7170, "clips": 7963, "splits": { "accent": { "": 323 }, "variant": {}, "age": { "thirties": 0.0664, "twenties": 0.1449, "": 0.5726, "teens": 0.049, "fourties": 0.167 }, "gender": { "male_masculine": 0.1604, "female_feminine": 0.1675, "": 0.6721 }, "sentence_domain": {} }, "users": 93, "size": 318351247, "checksum": "97c56a9b2d4203bbd3cc52552b3b7e820a57dbe913aebbd729f21cc72f461dec", "avgDurationSecs": 5.379, "validDurationSecs": 28527.336, "totalHrs": 11.89, "validHrs": 7.92 }, "rm-vallader": { "buckets": { "dev": 500, "invalidated": 394, "other": 740, "test": 463, "train": 557, "validated": 1527 }, "duration": 15519074, "reportedSentences": 36, "validatedSentences": 5913, "unvalidatedSentences": 1172, "clips": 2661, "splits": { "accent": {}, "variant": {}, "age": { "": 0.3679, "fourties": 0.4059, "twenties": 0.1379, "thirties": 0.0564, "fifties": 0.0038, "sixties": 0.0282 }, "gender": { "": 0.3735, "male_masculine": 0.4363, "female_feminine": 0.1902 }, "sentence_domain": {} }, "users": 55, "size": 118511077, "checksum": "c03b161e2aa675779033de4e4a4616bc68f625626a48aed5f1ba9ec2a6334a08", "avgDurationSecs": 5.832, "validDurationSecs": 8905.534, "totalHrs": 4.31, "validHrs": 2.47 }, "ro": { "buckets": { "dev": 3932, "invalidated": 1107, "other": 23463, "test": 3946, "train": 5187, "validated": 19850 }, "duration": 179718439, "reportedSentences": 518, "validatedSentences": 13755, "unvalidatedSentences": 89202, "clips": 44420, "splits": { "accent": { "ro-muntenia": 4722, "": 332, "ro-transilv": 1509, "ro-moldova": 105, "ro-oltenia": 5, "ro-banat": 5 }, "variant": { "ro-RO": 6341 }, "age": { "thirties": 0.1259, "teens": 0.0152, "": 0.162, "fourties": 0.0898, "sixties": 0.0026, "twenties": 0.5828, "fifties": 0.021, "eighties": 0.0007 }, "gender": { "male_masculine": 0.6863, "": 0.1879, "female_feminine": 0.1257 }, "sentence_domain": {} }, "users": 461, "size": 1121563973, "checksum": 
"8597d080884320ec16efeaf3b1e9fc6b4150c31de25054c3175d1e32ed5ccb7e", "avgDurationSecs": 4.046, "validDurationSecs": 80310.919, "totalHrs": 49.92, "validHrs": 22.3 }, "rof": { "buckets": { "dev": 461, "invalidated": 808, "other": 0, "test": 461, "train": 477, "validated": 9557 }, "duration": 40479228, "reportedSentences": 0, "validatedSentences": 1399, "unvalidatedSentences": 0, "clips": 10365, "splits": { "accent": {}, "variant": {}, "age": { "thirties": 0.4079, "fourties": 0.2243, "twenties": 0.3524, "fifties": 0.0153 }, "gender": { "female_feminine": 0.6312, "": 0.3688 }, "sentence_domain": {} }, "users": 24, "size": 231892952, "checksum": "5a8d9f9c2d111fe74b05136f209772545a162a1c4c133f1c2aaff4da9d0630af", "avgDurationSecs": 3.905, "validDurationSecs": 37323.684, "totalHrs": 11.24, "validHrs": 10.36 }, "ru": { "buckets": { "dev": 10282, "invalidated": 10749, "other": 16061, "test": 10283, "train": 26920, "validated": 175137 }, "duration": 1045865844, "reportedSentences": 572, "validatedSentences": 47850, "unvalidatedSentences": 242, "clips": 201947, "splits": { "accent": { "": 88967 }, "variant": {}, "age": { "twenties": 0.3265, "teens": 0.0924, "": 0.2371, "fourties": 0.1337, "thirties": 0.1798, "fifties": 0.0292, "sixties": 0.0011, "seventies": 0 }, "gender": { "male_masculine": 0.5925, "": 0.2594, "female_feminine": 0.148, "non-binary": 0.0002 }, "sentence_domain": { "general": 43, "service_retail": 12, "healthcare": 7, "history_law_government": 48, "finance": 8, "automotive_transport": 4, "media_entertainment": 22, "nature_environment": 6, "language_fundamentals": 4, "technology_robotics": 23, "agriculture_food": 4, "news_current_affairs": 5 } }, "users": 3695, "size": 7037932554, "checksum": "7e0ababffab9794bfa5d87944652a9411fff54ea093a84a6f09442749be7403c", "avgDurationSecs": 5.179, "validDurationSecs": 907019.2, "totalHrs": 290.51, "validHrs": 251.94 }, "rup": { "buckets": { "dev": 5, "invalidated": 19, "other": 124, "test": 8, "train": 16, "validated": 29 
}, "duration": 1207620, "reportedSentences": 0, "validatedSentences": 3507, "unvalidatedSentences": 0, "clips": 172, "splits": { "accent": {}, "variant": {}, "age": { "": 0.1163, "thirties": 0.7326, "fifties": 0.0581, "twenties": 0.0058, "teens": 0.0872 }, "gender": { "": 0.2093, "female_feminine": 0.7907 }, "sentence_domain": {} }, "users": 9, "size": 7266270, "checksum": "a2db6e000cc222d7665bf32333fbe32b381eb957aa179a352fa0c072eb36483d", "avgDurationSecs": 7.021, "validDurationSecs": 203.61, "totalHrs": 0.33, "validHrs": 0.05 }, "rw": { "buckets": { "dev": 15987, "invalidated": 227823, "other": 47244, "test": 16213, "train": 1003033, "validated": 1439161 }, "duration": 8583067625, "reportedSentences": 634, "validatedSentences": 1404857, "unvalidatedSentences": 28, "clips": 1714228, "splits": { "accent": { "": 75 }, "variant": {}, "age": { "": 0.0539, "twenties": 0.6083, "thirties": 0.1155, "teens": 0.2027, "fourties": 0.0159, "fifties": 0.0037 }, "gender": { "": 0.1025, "male_masculine": 0.5699, "female_feminine": 0.3276 }, "sentence_domain": {} }, "users": 1181, "size": 61394599201, "checksum": "268c56f36d83fffc2ef0d6d1d8485688d13db8a9edd1553b06e1a8e4084c33fc", "avgDurationSecs": 5.007, "validDurationSecs": 7205818.705, "totalHrs": 2384.18, "validHrs": 2001.61 }, "sah": { "buckets": { "dev": 1746, "invalidated": 478, "other": 3409, "test": 1812, "train": 2314, "validated": 9287 }, "duration": 83468568, "reportedSentences": 24, "validatedSentences": 6079, "unvalidatedSentences": 382, "clips": 13174, "splits": { "accent": { "": 977 }, "variant": {}, "age": { "": 0.3096, "twenties": 0.2303, "fourties": 0.1026, "thirties": 0.1457, "teens": 0.0323, "fifties": 0.1794 }, "gender": { "": 0.3224, "male_masculine": 0.2358, "female_feminine": 0.4418 }, "sentence_domain": {} }, "users": 252, "size": 531573865, "checksum": "6d4ee03fae4228abc2e7a4aa975def15b2e4a447a7cc25b88543a2dcd2e29c34", "avgDurationSecs": 6.336, "validDurationSecs": 58841.095, "totalHrs": 23.18, 
"validHrs": 16.34 }, "sat": { "buckets": { "dev": 71, "invalidated": 18, "other": 303, "test": 127, "train": 335, "validated": 533 }, "duration": 3826872, "reportedSentences": 7, "validatedSentences": 5421, "unvalidatedSentences": 540, "clips": 854, "splits": { "accent": { "": 435 }, "variant": {}, "age": { "": 0.4227, "twenties": 0.4215, "fourties": 0.0117, "fifties": 0.0117, "teens": 0.0117, "thirties": 0.1206 }, "gender": { "": 0.4028, "male_masculine": 0.5855, "female_feminine": 0.0117 }, "sentence_domain": {} }, "users": 14, "size": 21506334, "checksum": "e6306189a9a2549a9d02f1e65ff5a25e74a8032e37ed33f992c7bb6bd941e07f", "avgDurationSecs": 4.481, "validDurationSecs": 2388.434, "totalHrs": 1.06, "validHrs": 0.66 }, "sbn": { "buckets": { "dev": 0, "invalidated": 11, "other": 18, "test": 247, "train": 1754, "validated": 10957 }, "duration": 38715228, "reportedSentences": 0, "validatedSentences": 2001, "unvalidatedSentences": 0, "clips": 10986, "splits": { "accent": {}, "variant": {}, "age": { "fourties": 0.2588, "twenties": 0.5527, "teens": 0.1881, "thirties": 0.0005 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 21, "size": 227018737, "checksum": "89ceaa5d2d963510736c3f14c3c79c583e756449e173d508514a9681f663c4d0", "avgDurationSecs": 3.524, "validDurationSecs": 38613.031, "totalHrs": 10.75, "validHrs": 10.72 }, "sc": { "buckets": { "dev": 554, "invalidated": 120, "other": 396, "test": 652, "train": 926, "validated": 2306 }, "duration": 13278780, "reportedSentences": 36, "validatedSentences": 5489, "unvalidatedSentences": 349, "clips": 2822, "splits": { "accent": { "": 1408 }, "variant": {}, "age": { "": 0.4518, "thirties": 0.2831, "twenties": 0.0904, "fourties": 0.0113, "sixties": 0.0167, "fifties": 0.1467 }, "gender": { "": 0.5464, "female_feminine": 0.3306, "male_masculine": 0.123 }, "sentence_domain": { "automotive_transport": 1, "agriculture_food": 2, "nature_environment": 1, "general": 9, "healthcare": 2, "news_current_affairs": 3, "finance": 1, 
"media_entertainment": 1 } }, "users": 43, "size": 77182490, "checksum": "35eae35044217d3047813ac8ad4ba627df9f37aad85f0d63c099bb98461340ab", "avgDurationSecs": 4.705, "validDurationSecs": 10850.768, "totalHrs": 3.68, "validHrs": 3.01 }, "scl": { "buckets": { "dev": 862, "invalidated": 143, "other": 3, "test": 984, "train": 1407, "validated": 8876 }, "duration": 36733392, "reportedSentences": 0, "validatedSentences": 3300, "unvalidatedSentences": 4, "clips": 9022, "splits": { "accent": {}, "variant": {}, "age": { "": 0.1624, "thirties": 0.8159, "fourties": 0.0006, "twenties": 0.0212 }, "gender": { "": 0.9834, "female_feminine": 0.0166 }, "sentence_domain": { "general": 4 } }, "users": 40, "size": 214681241, "checksum": "cb09650dccc52b7a925016b58028b755cb4708b0ed98e49cab3180b6991e9a2d", "avgDurationSecs": 4.072, "validDurationSecs": 36138.948, "totalHrs": 10.2, "validHrs": 10.03 }, "sd": { "buckets": { "dev": 0, "invalidated": 232, "other": 41582, "test": 42, "train": 279, "validated": 321 }, "duration": 171510444, "reportedSentences": 9, "validatedSentences": 13348, "unvalidatedSentences": 71, "clips": 42135, "splits": { "accent": { "": 10 }, "variant": {}, "age": { "": 0.0164, "thirties": 0.1387, "fourties": 0.8435, "twenties": 0.0007, "teens": 0.0006 }, "gender": { "": 0.861, "male_masculine": 0.139 }, "sentence_domain": { "general": 110, "language_fundamentals": 3 } }, "users": 31, "size": 909756280, "checksum": "380d556a1661a5c245467c12d4b89f08ba804ff95a31b2d8339700e771e50683", "avgDurationSecs": 4.07, "validDurationSecs": 1306.63, "totalHrs": 47.64, "validHrs": 0.36 }, "sei": { "buckets": { "dev": 435, "invalidated": 254, "other": 50, "test": 452, "train": 728, "validated": 8006 }, "duration": 37687932, "reportedSentences": 0, "validatedSentences": 1615, "unvalidatedSentences": 0, "clips": 8310, "splits": { "accent": { "": 5 }, "variant": {}, "age": { "": 0.0124, "fourties": 0.4848, "thirties": 0.268, "twenties": 0.2348 }, "gender": { "": 0.1819, 
"female_feminine": 0.8057, "do_not_wish_to_say": 0.0124 }, "sentence_domain": {} }, "users": 17, "size": 219326897, "checksum": "f6f6161ca5e50b6e8dff29ab1e62bd34537cecac4c95e346e9033e3326de561a", "avgDurationSecs": 4.535, "validDurationSecs": 36309.216, "totalHrs": 10.46, "validHrs": 10.08 }, "sk": { "buckets": { "dev": 5547, "invalidated": 1241, "other": 2, "test": 5965, "train": 9760, "validated": 47465 }, "duration": 209251224, "reportedSentences": 348, "validatedSentences": 21980, "unvalidatedSentences": 1006, "clips": 48708, "splits": { "accent": { "": 25053 }, "variant": {}, "age": { "": 0.2176, "thirties": 0.1708, "twenties": 0.0627, "fourties": 0.1221, "teens": 0.0459, "fifties": 0.3809 }, "gender": { "": 0.6278, "male_masculine": 0.3371, "female_feminine": 0.0351 }, "sentence_domain": { "nature_environment": 289, "general": 14398, "automotive_transport": 271, "technology_robotics": 945, "healthcare": 4, "finance": 3, "media_entertainment": 455, "language_fundamentals": 122, "agriculture_food": 3, "history_law_government": 9056, "news_current_affairs": 330 } }, "users": 272, "size": 1212617605, "checksum": "3d0e7613b3eebab14ebf3e02d0302fb2f6fa293ee1cb16ed4b2608fcaa25c60f", "avgDurationSecs": 4.296, "validDurationSecs": 203911.254, "totalHrs": 58.12, "validHrs": 56.64 }, "skr": { "buckets": { "dev": 1145, "invalidated": 467, "other": 1618, "test": 1037, "train": 1558, "validated": 3752 }, "duration": 24307020, "reportedSentences": 2, "validatedSentences": 6011, "unvalidatedSentences": 93, "clips": 5837, "splits": { "accent": { "": 10 }, "variant": {}, "age": { "": 0.3533, "twenties": 0.241, "fifties": 0.3728, "teens": 0.0207, "fourties": 0.0002, "thirties": 0.012 }, "gender": { "": 0.3704, "male_masculine": 0.6296 }, "sentence_domain": {} }, "users": 60, "size": 139981573, "checksum": "dbf7627507505105d83b222c9ec8e484dc0685361b161aa5c4f76cdd1002e157", "avgDurationSecs": 4.164, "validDurationSecs": 15624.454, "totalHrs": 6.75, "validHrs": 4.34 }, "sl": { 
"buckets": { "dev": 1337, "invalidated": 510, "other": 3351, "test": 1341, "train": 1463, "validated": 15660 }, "duration": 77802310, "reportedSentences": 55, "validatedSentences": 4209, "unvalidatedSentences": 29, "clips": 19521, "splits": { "accent": { "": 2287 }, "variant": {}, "age": { "twenties": 0.3068, "teens": 0.0502, "": 0.3715, "sixties": 0.043, "fifties": 0.0484, "fourties": 0.1476, "thirties": 0.0319, "seventies": 0.0008 }, "gender": { "female_feminine": 0.0959, "male_masculine": 0.5192, "": 0.3849 }, "sentence_domain": { "general": 1, "history_law_government": 1, "media_entertainment": 1 } }, "users": 1002, "size": 511622621, "checksum": "d0c14487102920acdef169cfd4ac0a21036fc16ce7247691e05c122e39663108", "avgDurationSecs": 3.986, "validDurationSecs": 62414.025, "totalHrs": 21.61, "validHrs": 17.33 }, "sq": { "buckets": { "dev": 1795, "invalidated": 177, "other": 0, "test": 1908, "train": 2658, "validated": 6371 }, "duration": 33316812, "reportedSentences": 122, "validatedSentences": 50009, "unvalidatedSentences": 2635, "clips": 6548, "splits": { "accent": { "": 3157 }, "variant": {}, "age": { "": 0.2382, "twenties": 0.084, "thirties": 0.1559, "fourties": 0.1765, "fifties": 0.343, "teens": 0.0023 }, "gender": { "": 0.3968, "male_masculine": 0.4643, "female_feminine": 0.139 }, "sentence_domain": { "history_law_government": 3151, "general": 1802, "news_current_affairs": 25 } }, "users": 154, "size": 198575142, "checksum": "7b4575fb73be1f2f5d0bc8afbb0a8a70cbc3e1c88c395a8728d1b895bc5ad3b1", "avgDurationSecs": 5.088, "validDurationSecs": 32416.22, "totalHrs": 9.25, "validHrs": 9 }, "sr": { "buckets": { "dev": 1866, "invalidated": 429, "other": 5235, "test": 1935, "train": 2516, "validated": 8395 }, "duration": 45840564, "reportedSentences": 190, "validatedSentences": 8155, "unvalidatedSentences": 572, "clips": 14059, "splits": { "accent": { "": 5180 }, "variant": {}, "age": { "twenties": 0.262, "": 0.1408, "fifties": 0.1265, "fourties": 0.0998, "thirties": 
0.1191, "teens": 0.0007, "sixties": 0.2511 }, "gender": { "male_masculine": 0.3194, "": 0.5336, "female_feminine": 0.147 }, "sentence_domain": { "general": 14, "technology_robotics": 33, "history_law_government": 2, "language_fundamentals": 5, "nature_environment": 5 } }, "users": 184, "size": 258414004, "checksum": "7aee8b1e8bec5f052c239dcd85c8d7e13ae9075ced3da268ce7185eecf5241eb", "avgDurationSecs": 3.261, "validDurationSecs": 27372.611, "totalHrs": 12.73, "validHrs": 7.6 }, "ss": { "buckets": { "dev": 0, "invalidated": 0, "other": 111, "test": 0, "train": 0, "validated": 0 }, "duration": 825840, "reportedSentences": 0, "validatedSentences": 1614, "unvalidatedSentences": 6796, "clips": 111, "splits": { "accent": {}, "variant": {}, "age": { "": 0.4505, "twenties": 0.5495 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 2, "size": 5086378, "checksum": "0097fae2c1d4817321a69972c7466c8b71c661a8b4981d9d170db9e75fb9dc47", "avgDurationSecs": 7.44, "validDurationSecs": 0, "totalHrs": 0.22, "validHrs": 0 }, "ssi": { "buckets": { "dev": 0, "invalidated": 12, "other": 0, "test": 161, "train": 1845, "validated": 10188 }, "duration": 37951740, "reportedSentences": 0, "validatedSentences": 2007, "unvalidatedSentences": 0, "clips": 10200, "splits": { "accent": { "": 4014 }, "variant": {}, "age": { "fifties": 0.3945, "twenties": 0.5981, "fourties": 0.0025, "thirties": 0.0039, "teens": 0.001 }, "gender": { "": 0.9868, "female_feminine": 0.0132 }, "sentence_domain": {} }, "users": 21, "size": 222141030, "checksum": "576cd0f2392c9edbcb6b51098be28dbfe9a969bcb2c8cfdbce917bb23ee38eb7", "avgDurationSecs": 3.721, "validDurationSecs": 37907.091, "totalHrs": 10.54, "validHrs": 10.52 }, "st": { "buckets": { "dev": 0, "invalidated": 1, "other": 204, "test": 0, "train": 4, "validated": 4 }, "duration": 673668, "reportedSentences": 0, "validatedSentences": 2339, "unvalidatedSentences": 14837, "clips": 209, "splits": { "accent": { "": 149 }, "variant": {}, "age": { "twenties": 0.2871, 
"thirties": 0.7129 }, "gender": { "": 0.2871, "male_masculine": 0.7129 }, "sentence_domain": {} }, "users": 2, "size": 4977639, "checksum": "1ed7f600835aebe250a2ee7322dbbb5deba8baff17e171fbb67bca9dbaae52bc", "avgDurationSecs": 3.223, "validDurationSecs": 12.893, "totalHrs": 0.18, "validHrs": 0 }, "sv-SE": { "buckets": { "dev": 5501, "invalidated": 1602, "other": 5946, "test": 5505, "train": 8254, "validated": 42664 }, "duration": 202076943, "reportedSentences": 597, "validatedSentences": 26876, "unvalidatedSentences": 6596, "clips": 50212, "splits": { "accent": { "": 4132 }, "variant": {}, "age": { "thirties": 0.2531, "": 0.1905, "teens": 0.0308, "fifties": 0.0325, "twenties": 0.1186, "fourties": 0.3724, "sixties": 0.001, "seventies": 0.0007, "nineties": 0.0004 }, "gender": { "male_masculine": 0.4677, "": 0.2129, "female_feminine": 0.3194 }, "sentence_domain": { "media_entertainment": 1, "general": 2 } }, "users": 887, "size": 1280582336, "checksum": "44dc5868c82525ff1aceb4165891f746d40e9443cb74e40e2ef57edc6068f220", "avgDurationSecs": 4.024, "validDurationSecs": 171700.205, "totalHrs": 56.13, "validHrs": 47.69 }, "sva": { "buckets": { "dev": 467, "invalidated": 1374, "other": 16, "test": 503, "train": 567, "validated": 9567 }, "duration": 64878732, "reportedSentences": 9, "validatedSentences": 1538, "unvalidatedSentences": 11, "clips": 10957, "splits": { "accent": { "kala": 2467, "lenjeri": 3693, "ushgul": 2340, "kheledi": 348, "mestia": 1492, "lentekhi": 308, "khopuri": 308, "ipari": 1487, "upperbaldali": 1487, "latali": 1487, "tskhumari": 5 }, "variant": { "sva-upperbal": 5905, "sva-lentekh": 348, "sva-lowerbal": 10 }, "age": { "fifties": 0.2114, "thirties": 0.2044, "": 0.0693, "sixties": 0.1298, "fourties": 0.2386, "twenties": 0.1049, "seventies": 0.0417 }, "gender": { "female_feminine": 0.7987, "male_masculine": 0.0005, "": 0.2008 }, "sentence_domain": { "agriculture_food": 108, "healthcare": 162, "general": 96 } }, "users": 55, "size": 370153365, "checksum": 
"e7fa34dd862b2faafafe3fb6756875cd5b5f4507ca6267abad4144e83222a590", "avgDurationSecs": 5.921, "validDurationSecs": 56648.246, "totalHrs": 18.02, "validHrs": 15.73 }, "sw": { "buckets": { "dev": 12269, "invalidated": 80838, "other": 380262, "test": 12270, "train": 46735, "validated": 269087 }, "duration": 3830474916, "reportedSentences": 1899, "validatedSentences": 134729, "unvalidatedSentences": 5757, "clips": 730187, "splits": { "accent": { "": 8022 }, "variant": { "sw-sanifu": 21833, "sw-baratz": 22486, "sw-barake": 6487, "sw-kingwana": 1756, "sw-kimvita": 15, "sw-katanga": 5 }, "age": { "": 0.305, "twenties": 0.4988, "thirties": 0.1089, "teens": 0.0053, "fifties": 0.0413, "fourties": 0.0347, "sixties": 0.0061 }, "gender": { "": 0.2921, "male_masculine": 0.3413, "female_feminine": 0.3666 }, "sentence_domain": {} }, "users": 1518, "size": 22410511055, "checksum": "1e3dfd2560ebd5b36aa6c4a1566ca2762b81d336ec9d308c709fb9ec99c53ae6", "avgDurationSecs": 5.246, "validDurationSecs": 1411598.678, "totalHrs": 1064.02, "validHrs": 392.11 }, "szy": { "buckets": { "dev": 257, "invalidated": 485, "other": 0, "test": 264, "train": 280, "validated": 9158 }, "duration": 52040448, "reportedSentences": 0, "validatedSentences": 801, "unvalidatedSentences": 2, "clips": 9643, "splits": { "accent": { "": 1067 }, "variant": {}, "age": { "": 0.4855, "seventies": 0.083, "fifties": 0.1145, "sixties": 0.1039, "twenties": 0.13, "thirties": 0.0831 }, "gender": { "": 0.2584, "female_feminine": 0.3545, "do_not_wish_to_say": 0.3871 }, "sentence_domain": { "general": 9825, "language_fundamentals": 9 } }, "users": 26, "size": 294423910, "checksum": "e1e9d5a13c7f80b78de43b77f15b1be19dbb4f880c93ccd876b3d53c1c9a3922", "avgDurationSecs": 5.397, "validDurationSecs": 49423.045, "totalHrs": 14.45, "validHrs": 13.72 }, "ta": { "buckets": { "dev": 12168, "invalidated": 5749, "other": 105198, "test": 12241, "train": 46517, "validated": 136717 }, "duration": 1531178484, "reportedSentences": 3437, 
"validatedSentences": 118292, "unvalidatedSentences": 550, "clips": 247664, "splits": { "accent": { "": 5134 }, "variant": {}, "age": { "twenties": 0.1083, "thirties": 0.119, "": 0.6603, "fourties": 0.027, "seventies": 0.0194, "fifties": 0.0246, "teens": 0.0381, "sixties": 0.003, "eighties": 0.0003 }, "gender": { "male_masculine": 0.1894, "": 0.6563, "female_feminine": 0.1543 }, "sentence_domain": {} }, "users": 979, "size": 9206987708, "checksum": "a3001772dd5cfc3d3fd4ce7d8ed1fc895d4019df9e54820fea9731fe417d2d04", "avgDurationSecs": 6.182, "validDurationSecs": 845250.536, "totalHrs": 425.32, "validHrs": 234.79 }, "tar": { "buckets": { "dev": 406, "invalidated": 23, "other": 7, "test": 404, "train": 430, "validated": 7892 }, "duration": 36039168, "reportedSentences": 32, "validatedSentences": 1258, "unvalidatedSentences": 0, "clips": 7922, "splits": { "accent": { "": 5941 }, "variant": {}, "age": { "thirties": 0.4683, "twenties": 0.1094, "fifties": 0.041, "fourties": 0.3812 }, "gender": { "": 0.3431, "female_feminine": 0.6569 }, "sentence_domain": {} }, "users": 17, "size": 211018236, "checksum": "9975f56d0c9b39a1e33c704451b77121d0cba9d5b8f0c9f4f43c154b893293e0", "avgDurationSecs": 4.549, "validDurationSecs": 35902.69, "totalHrs": 10.01, "validHrs": 9.97 }, "tay": { "buckets": { "dev": 452, "invalidated": 407, "other": 0, "test": 851, "train": 1299, "validated": 7446 }, "duration": 43758576, "reportedSentences": 0, "validatedSentences": 3790, "unvalidatedSentences": 77, "clips": 7853, "splits": { "accent": { "": 195 }, "variant": { "tay-squliq": 5338, "tay-klesan": 718, "tay-pingawan": 458, "tay-ciuli": 814 }, "age": { "sixties": 0.1498, "fifties": 0.1778, "fourties": 0.1273, "": 0.3266, "twenties": 0.216, "thirties": 0.0025 }, "gender": { "female_feminine": 0.4574, "": 0.2179, "do_not_wish_to_say": 0.3247 }, "sentence_domain": { "general": 9017 } }, "users": 18, "size": 258255513, "checksum": "749e47e3272981c6e56589e60dc9af409fb44e677fa1fd2aa24167778e705842", 
"avgDurationSecs": 5.572, "validDurationSecs": 41490.686, "totalHrs": 12.15, "validHrs": 11.52 }, "te": { "buckets": { "dev": 87, "invalidated": 199, "other": 1619, "test": 86, "train": 95, "validated": 688 }, "duration": 10511784, "reportedSentences": 29, "validatedSentences": 370, "unvalidatedSentences": 19699, "clips": 2506, "splits": { "accent": { "": 1460 }, "variant": {}, "age": { "twenties": 0.6959, "": 0.1285, "thirties": 0.0239, "sixties": 0.0722, "fourties": 0.012, "teens": 0.0674 }, "gender": { "male_masculine": 0.6496, "": 0.184, "female_feminine": 0.1664 }, "sentence_domain": { "language_fundamentals": 2, "healthcare": 1 } }, "users": 62, "size": 62701731, "checksum": "f6a853aa1c1313c62a147cd083649335c5db9e7be80ad08da92bb271a60a846c", "avgDurationSecs": 4.195, "validDurationSecs": 2885.917, "totalHrs": 2.91, "validHrs": 0.8 }, "tg": { "buckets": { "dev": 123, "invalidated": 42, "other": 36, "test": 121, "train": 328, "validated": 572 }, "duration": 3267540, "reportedSentences": 18, "validatedSentences": 4000, "unvalidatedSentences": 6771, "clips": 650, "splits": { "accent": {}, "variant": {}, "age": { "": 0.0231, "twenties": 0.9738, "seventies": 0.0031 }, "gender": { "": 0.8385, "do_not_wish_to_say": 0.1615 }, "sentence_domain": { "media_entertainment": 149, "general": 117, "technology_robotics": 9, "language_fundamentals": 321, "nature_environment": 5, "news_current_affairs": 55, "finance": 15, "service_retail": 17, "healthcare": 13, "history_law_government": 31, "automotive_transport": 12, "agriculture_food": 7 } }, "users": 9, "size": 19550438, "checksum": "75b1088b5315c2b60af4eebabe6250feb1e60d78c5e3ed6abecda90f124cc455", "avgDurationSecs": 5.027, "validDurationSecs": 2875.435, "totalHrs": 0.9, "validHrs": 0.79 }, "th": { "buckets": { "dev": 11059, "invalidated": 9360, "other": 208383, "test": 11059, "train": 32977, "validated": 148765 }, "duration": 1536844836, "reportedSentences": 4312, "validatedSentences": 64491, "unvalidatedSentences": 121701, 
"clips": 366508, "splits": { "accent": { "": 10720 }, "variant": {}, "age": { "twenties": 0.2447, "": 0.4008, "thirties": 0.0739, "fourties": 0.0388, "teens": 0.0418, "fifties": 0.1996, "eighties": 0.0002, "sixties": 0.0002 }, "gender": { "male_masculine": 0.4205, "": 0.4118, "female_feminine": 0.1676, "do_not_wish_to_say": 0, "transgender": 0.0001 }, "sentence_domain": {} }, "users": 7987, "size": 8998543506, "checksum": "60fcf3eb8a2a5578547c95868e2bbbbf2c3ffd39a209cd5c821079dbf9645bda", "avgDurationSecs": 4.193, "validDurationSecs": 623802.815, "totalHrs": 426.9, "validHrs": 173.27 }, "ti": { "buckets": { "dev": 21, "invalidated": 7, "other": 369, "test": 19, "train": 35, "validated": 75 }, "duration": 2342484, "reportedSentences": 3, "validatedSentences": 4571, "unvalidatedSentences": 382, "clips": 451, "splits": { "accent": { "": 224 }, "variant": {}, "age": { "": 0.1818, "fourties": 0.4856, "thirties": 0.0266, "twenties": 0.306 }, "gender": { "": 0.9623, "male_masculine": 0.0377 }, "sentence_domain": {} }, "users": 16, "size": 14092564, "checksum": "21bd257e38316cc3160ed8ee4fe31995eeaf3368a517e36a04c1eb190684a9a9", "avgDurationSecs": 5.194, "validDurationSecs": 389.548, "totalHrs": 0.65, "validHrs": 0.1 }, "tig": { "buckets": { "dev": 1611, "invalidated": 223, "other": 105, "test": 1614, "train": 1989, "validated": 6439 }, "duration": 40492980, "reportedSentences": 39, "validatedSentences": 5417, "unvalidatedSentences": 90, "clips": 6767, "splits": { "accent": { "": 120 }, "variant": {}, "age": { "": 0.6948, "twenties": 0.2027, "thirties": 0.1017, "teens": 0.0007 }, "gender": { "": 0.9934, "male_masculine": 0.0059, "female_feminine": 0.0007 }, "sentence_domain": { "general": 6, "history_law_government": 6, "language_fundamentals": 3 } }, "users": 150, "size": 215807967, "checksum": "71408341e52629f1e851b019aa5a83e4a29350e1ea18fcddd472dd56a5e30046", "avgDurationSecs": 5.984, "validDurationSecs": 38530.264, "totalHrs": 11.24, "validHrs": 10.7 }, "tk": { 
"buckets": { "dev": 524, "invalidated": 140, "other": 2872, "test": 522, "train": 732, "validated": 2010 }, "duration": 27668232, "reportedSentences": 27, "validatedSentences": 2494, "unvalidatedSentences": 238, "clips": 5022, "splits": { "accent": { "": 2854 }, "variant": {}, "age": { "": 0.367, "twenties": 0.5293, "thirties": 0.0785, "teens": 0.002, "fourties": 0.0233 }, "gender": { "": 0.4273, "male_masculine": 0.4066, "female_feminine": 0.1661 }, "sentence_domain": {} }, "users": 147, "size": 154907569, "checksum": "e5a50d61ffa4b38f25a42317a2ba61aeebee5723ba56ab552fb07375cdc0ed6f", "avgDurationSecs": 5.509, "validDurationSecs": 11073.904, "totalHrs": 7.68, "validHrs": 3.07 }, "tli": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 524, "train": 2312, "validated": 2836 }, "duration": 35744256, "reportedSentences": 0, "validatedSentences": 7863, "unvalidatedSentences": 10, "clips": 2836, "splits": { "accent": {}, "variant": {}, "age": { "": 0.817, "thirties": 0.183 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 3, "size": 212257755, "checksum": "b9b527934a04bca17f3b4e3bd631a344cddfb610393eb38088726183131784e8", "avgDurationSecs": 12.604, "validDurationSecs": 35744.256, "totalHrs": 9.92, "validHrs": 9.92 }, "tn": { "buckets": { "dev": 311, "invalidated": 11, "other": 532, "test": 369, "train": 1079, "validated": 3506 }, "duration": 17693028, "reportedSentences": 1, "validatedSentences": 1765, "unvalidatedSentences": 15342, "clips": 4049, "splits": { "accent": {}, "variant": { "tn-central": 30, "tn-northern": 6 }, "age": { "": 0.0336, "fifties": 0.4347, "thirties": 0.5105, "twenties": 0.0212 }, "gender": { "": 0.4846, "female_feminine": 0.4772, "do_not_wish_to_say": 0.0383 }, "sentence_domain": {} }, "users": 17, "size": 89835277, "checksum": "136fbe85bf19eefdfbb4f26acbe1c3c9bcb311c4f5cbd7841192c4872a2ae286", "avgDurationSecs": 4.37, "validDurationSecs": 15320.266, "totalHrs": 4.91, "validHrs": 4.25 }, "tok": { "buckets": { "dev": 2259, 
"invalidated": 402, "other": 5943, "test": 2265, "train": 2632, "validated": 12992 }, "duration": 83345940, "reportedSentences": 165, "validatedSentences": 7524, "unvalidatedSentences": 2239, "clips": 19337, "splits": { "accent": { "": 2735 }, "variant": {}, "age": { "": 0.3058, "twenties": 0.1901, "teens": 0.2137, "thirties": 0.0597, "fourties": 0.0005, "eighties": 0.2302 }, "gender": { "": 0.4032, "male_masculine": 0.5235, "female_feminine": 0.0485, "non-binary": 0.0001, "do_not_wish_to_say": 0.0249 }, "sentence_domain": { "general": 2, "finance": 2 } }, "users": 172, "size": 489260183, "checksum": "ebad46971fdd6436ebdd4596a22e20935c9266eb5a661aab787d5073ed4956cd", "avgDurationSecs": 4.31, "validDurationSecs": 55997.851, "totalHrs": 23.15, "validHrs": 15.55 }, "tr": { "buckets": { "dev": 11797, "invalidated": 5018, "other": 660, "test": 11819, "train": 40815, "validated": 120832 }, "duration": 487073763, "reportedSentences": 486, "validatedSentences": 410410, "unvalidatedSentences": 3490, "clips": 126510, "splits": { "accent": { "": 49235 }, "variant": {}, "age": { "": 0.2466, "thirties": 0.0857, "twenties": 0.2259, "teens": 0.0155, "fourties": 0.0254, "fifties": 0.0707, "sixties": 0.2, "eighties": 0.0972, "seventies": 0.0331 }, "gender": { "": 0.2574, "male_masculine": 0.4218, "female_feminine": 0.3209 }, "sentence_domain": { "automotive_transport": 1, "media_entertainment": 1, "language_fundamentals": 12, "general": 2 } }, "users": 1816, "size": 2982022981, "checksum": "7caad8ca1424d632f89180d23888d61a260835dfd78a20f076a4df385e854665", "avgDurationSecs": 3.85, "validDurationSecs": 465213.002, "totalHrs": 135.29, "validHrs": 129.22 }, "trv": { "buckets": { "dev": 558, "invalidated": 104, "other": 0, "test": 529, "train": 885, "validated": 6386 }, "duration": 36415152, "reportedSentences": 1, "validatedSentences": 1974, "unvalidatedSentences": 2, "clips": 6490, "splits": { "accent": {}, "variant": { "trv-tgdaya": 3540, "trv-truku": 1799, "trv-teuda": 1151 }, 
"age": { "": 0.9314, "twenties": 0.0686 }, "gender": { "": 0.6803, "female_feminine": 0.1307, "male_masculine": 0.0686, "do_not_wish_to_say": 0.1205 }, "sentence_domain": { "general": 6276 } }, "users": 10, "size": 214926791, "checksum": "d61ec8ee27548b5ea2ac1eba67fd3bae8f00876cc40e7c3068c878fc19e9745b", "avgDurationSecs": 5.611, "validDurationSecs": 35831.612, "totalHrs": 10.11, "validHrs": 9.95 }, "trw": { "buckets": { "dev": 2083, "invalidated": 1363, "other": 14, "test": 1956, "train": 2859, "validated": 11054 }, "duration": 66795264, "reportedSentences": 578, "validatedSentences": 7770, "unvalidatedSentences": 27, "clips": 12431, "splits": { "accent": { "": 6586 }, "variant": {}, "age": { "": 0.054, "fifties": 0.0949, "fourties": 0.3614, "twenties": 0.3406, "thirties": 0.1099, "teens": 0.0393 }, "gender": { "": 1 }, "sentence_domain": { "agriculture_food": 57, "service_retail": 57, "nature_environment": 63, "language_fundamentals": 17, "news_current_affairs": 67, "general": 55, "media_entertainment": 1, "finance": 2, "healthcare": 49, "history_law_government": 49, "automotive_transport": 2 } }, "users": 27, "size": 392742263, "checksum": "46dc35100fdca92152ce48a60846aaa7c781cb06a3b56946de8d7313edeacde5", "avgDurationSecs": 5.373, "validDurationSecs": 59396.255, "totalHrs": 18.55, "validHrs": 16.49 }, "ts": { "buckets": { "dev": 0, "invalidated": 1, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 8928, "reportedSentences": 0, "validatedSentences": 352, "unvalidatedSentences": 16012, "clips": 1, "splits": { "accent": {}, "variant": {}, "age": { "": 1 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 1, "size": 1058046, "checksum": "6ccc53dc58c3317c5ad59080b579997b3d4991f2c2d9264bfb34626166a582c0", "avgDurationSecs": 8.928, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "tt": { "buckets": { "dev": 4425, "invalidated": 540, "other": 333, "test": 5103, "train": 8460, "validated": 30290 }, "duration": 119505770, "reportedSentences": 
6, "validatedSentences": 18051, "unvalidatedSentences": 32, "clips": 31163, "splits": { "accent": { "": 1751 }, "variant": {}, "age": { "": 0.211, "thirties": 0.7053, "twenties": 0.0532, "sixties": 0.0101, "fifties": 0.0065, "teens": 0.0059, "fourties": 0.0005, "seventies": 0.0074 }, "gender": { "": 0.2137, "male_masculine": 0.7559, "female_feminine": 0.0305 }, "sentence_domain": { "general": 15 } }, "users": 292, "size": 871723720, "checksum": "b6a206d58195fc51542bf72257af47d713692f2b39d059637427635b4ab97f26", "avgDurationSecs": 3.835, "validDurationSecs": 116157.936, "totalHrs": 33.19, "validHrs": 32.26 }, "tui": { "buckets": { "dev": 602, "invalidated": 102, "other": 904, "test": 644, "train": 722, "validated": 7570 }, "duration": 39743928, "reportedSentences": 6, "validatedSentences": 1971, "unvalidatedSentences": 5, "clips": 8576, "splits": { "accent": {}, "variant": { "tui-bangwere": 825, "tui-banggo": 105 }, "age": { "thirties": 0.5129, "": 0.2264, "seventies": 0.0026, "twenties": 0.215, "fourties": 0.043 }, "gender": { "": 0.9331, "female_feminine": 0.0616, "male_masculine": 0.0054 }, "sentence_domain": {} }, "users": 51, "size": 232660196, "checksum": "41f745e06b8f5d247321a8cb62cbba5d797cec7696d9094f8844957b3c404756", "avgDurationSecs": 4.634, "validDurationSecs": 35081.802, "totalHrs": 11.03, "validHrs": 9.74 }, "tvu": { "buckets": { "dev": 495, "invalidated": 122, "other": 65, "test": 438, "train": 725, "validated": 5247 }, "duration": 38085048, "reportedSentences": 24, "validatedSentences": 1943, "unvalidatedSentences": 347, "clips": 5434, "splits": { "accent": {}, "variant": { "tvu-ndikini": 5 }, "age": { "": 0.7979, "thirties": 0.1965, "fourties": 0.0055 }, "gender": { "": 0.8081, "female_feminine": 0.1919 }, "sentence_domain": {} }, "users": 13, "size": 204912062, "checksum": "3b5d96eeb00a9384b84781823b89f3930ac743092c3f32efa2d2cea1471a39b1", "avgDurationSecs": 7.009, "validDurationSecs": 36774.429, "totalHrs": 10.57, "validHrs": 10.21 }, "tw": { 
"buckets": { "dev": 0, "invalidated": 5, "other": 93, "test": 30, "train": 213, "validated": 243 }, "duration": 1495440, "reportedSentences": 3, "validatedSentences": 49165, "unvalidatedSentences": 23, "clips": 341, "splits": { "accent": { "": 241 }, "variant": {}, "age": { "": 0.2669, "twenties": 0.088, "thirties": 0.6452 }, "gender": { "": 0.3519, "male_masculine": 0.6481 }, "sentence_domain": {} }, "users": 19, "size": 11559689, "checksum": "5e08839750690bf52da7b7db661183305a51234e0740b17cb1d30cfd1ef8d6a7", "avgDurationSecs": 4.385, "validDurationSecs": 1065.665, "totalHrs": 0.41, "validHrs": 0.29 }, "udl": { "buckets": { "dev": 285, "invalidated": 74, "other": 32, "test": 329, "train": 396, "validated": 6398 }, "duration": 34808904, "reportedSentences": 1, "validatedSentences": 1011, "unvalidatedSentences": 0, "clips": 6504, "splits": { "accent": {}, "variant": {}, "age": { "": 0.8539, "twenties": 0.1461 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 20, "size": 204668293, "checksum": "146addb942f0a93e913cf7f7831fe776554ff1bf33d884f23ef13ee98bf1d2e3", "avgDurationSecs": 5.352, "validDurationSecs": 34241.6, "totalHrs": 9.66, "validHrs": 9.51 }, "ug": { "buckets": { "dev": 14717, "invalidated": 21635, "other": 11341, "test": 14717, "train": 127296, "validated": 273945 }, "duration": 1817486748, "reportedSentences": 1509, "validatedSentences": 231441, "unvalidatedSentences": 11020, "clips": 306921, "splits": { "accent": { "": 37940 }, "variant": { "ug-Arab": 3441 }, "age": { "": 0.2107, "fifties": 0.0082, "twenties": 0.3478, "thirties": 0.2694, "fourties": 0.1007, "teens": 0.0618, "eighties": 0.0014 }, "gender": { "": 0.5316, "male_masculine": 0.2604, "female_feminine": 0.202, "do_not_wish_to_say": 0.0059, "intersex": 0.0001 }, "sentence_domain": { "general": 2546, "healthcare": 27, "media_entertainment": 16, "agriculture_food": 2, "technology_robotics": 10, "language_fundamentals": 10, "finance": 1, "automotive_transport": 2, "history_law_government": 
3, "nature_environment": 1, "service_retail": 1 } }, "users": 1845, "size": 10405085717, "checksum": "e79e0bf7b2c31e79336b6e463732c7a13fa36687f1631b1d5809820af1456f3d", "avgDurationSecs": 5.922, "validDurationSecs": 1622213.557, "totalHrs": 504.85, "validHrs": 450.61 }, "uk": { "buckets": { "dev": 10372, "invalidated": 3373, "other": 9412, "test": 10372, "train": 27562, "validated": 78654 }, "duration": 425066316, "reportedSentences": 660, "validatedSentences": 212668, "unvalidatedSentences": 1661, "clips": 91439, "splits": { "accent": { "": 24506 }, "variant": {}, "age": { "twenties": 0.3294, "teens": 0.1015, "": 0.2273, "fourties": 0.0923, "thirties": 0.2446, "fifties": 0.0045, "sixties": 0.0003 }, "gender": { "male_masculine": 0.576, "female_feminine": 0.171, "": 0.253 }, "sentence_domain": {} }, "users": 1173, "size": 2795234026, "checksum": "a89c2c4e9a148452f33576809a5f2c0fba31c0f93b513579ae28409ed16a4149", "avgDurationSecs": 4.649, "validDurationSecs": 365633.548, "totalHrs": 118.07, "validHrs": 101.56 }, "ur": { "buckets": { "dev": 5071, "invalidated": 10721, "other": 172213, "test": 5072, "train": 7300, "validated": 67845 }, "duration": 1060727076, "reportedSentences": 416, "validatedSentences": 21706, "unvalidatedSentences": 48257, "clips": 250779, "splits": { "accent": { "": 53169 }, "variant": {}, "age": { "twenties": 0.9173, "": 0.0646, "fourties": 0.0104, "thirties": 0.0043, "teens": 0.003, "fifties": 0.0003, "sixties": 0 }, "gender": { "male_masculine": 0.5279, "": 0.221, "female_feminine": 0.2511 }, "sentence_domain": { "general": 46, "healthcare": 45, "agriculture_food": 11, "media_entertainment": 1 } }, "users": 501, "size": 6211299240, "checksum": "521ba1d336630f6efa9df4a3432f41cf628efa6208f7cd77cdd6f53b8c82aebb", "avgDurationSecs": 4.23, "validDurationSecs": 286965.928, "totalHrs": 294.64, "validHrs": 79.71 }, "ush": { "buckets": { "dev": 195, "invalidated": 82, "other": 18, "test": 357, "train": 586, "validated": 3861 }, "duration": 24351012, 
"reportedSentences": 0, "validatedSentences": 1170, "unvalidatedSentences": 0, "clips": 3961, "splits": { "accent": { "": 40 }, "variant": {}, "age": { "fourties": 0.5951, "": 0.1452, "teens": 0.0172, "thirties": 0.0126, "twenties": 0.23 }, "gender": { "": 1 }, "sentence_domain": { "general": 3 } }, "users": 23, "size": 135821225, "checksum": "023b145a883af681fa58938c7e580bca015e88a87e7a97bd45fc007bfd3fa1d4", "avgDurationSecs": 6.148, "validDurationSecs": 23736.243, "totalHrs": 6.76, "validHrs": 6.59 }, "uz": { "buckets": { "dev": 12300, "invalidated": 14158, "other": 128440, "test": 12402, "train": 48941, "validated": 87442 }, "duration": 956721636, "reportedSentences": 1815, "validatedSentences": 148738, "unvalidatedSentences": 137888, "clips": 230040, "splits": { "accent": { "": 3719 }, "variant": {}, "age": { "twenties": 0.3973, "": 0.4111, "thirties": 0.0074, "teens": 0.1774, "fifties": 0.0003, "fourties": 0.0064, "nineties": 0 }, "gender": { "male_masculine": 0.4423, "": 0.4087, "female_feminine": 0.149 }, "sentence_domain": {} }, "users": 2299, "size": 5388982231, "checksum": "392f2e5145a61c8adc96b65a4bb4819fd78f33ec8f87466e6d4bd994d963fe95", "avgDurationSecs": 4.159, "validDurationSecs": 363665.681, "totalHrs": 265.75, "validHrs": 101.01 }, "var": { "buckets": { "dev": 546, "invalidated": 291, "other": 864, "test": 611, "train": 752, "validated": 6851 }, "duration": 42487380, "reportedSentences": 246, "validatedSentences": 1977, "unvalidatedSentences": 0, "clips": 8006, "splits": { "accent": { "": 2583 }, "variant": { "var-delrio": 7451, "var-sierra": 553 }, "age": { "teens": 0.136, "": 0.0006, "twenties": 0.2889, "thirties": 0.5741, "fourties": 0.0004 }, "gender": { "female_feminine": 0.9445, "": 0.0555 }, "sentence_domain": {} }, "users": 12, "size": 247667987, "checksum": "022963c067436c410f087b4ddabc83dde4b033bd8203b40ca164ab186fa181fd", "avgDurationSecs": 5.307, "validDurationSecs": 36357.862, "totalHrs": 11.8, "validHrs": 10.09 }, "ve": { "buckets": { 
"dev": 0, "invalidated": 0, "other": 25, "test": 0, "train": 0, "validated": 0 }, "duration": 143316, "reportedSentences": 0, "validatedSentences": 1735, "unvalidatedSentences": 7843, "clips": 25, "splits": { "accent": { "": 5 }, "variant": {}, "age": { "": 0.4, "fourties": 0.6 }, "gender": { "": 0.4, "female_feminine": 0.6 }, "sentence_domain": {} }, "users": 3, "size": 1395412, "checksum": "5ed1109cae87a0db58df7c8a4b13121d03ea296e5dcfca9522f7973f98259f7d", "avgDurationSecs": 5.733, "validDurationSecs": 0, "totalHrs": 0.03, "validHrs": 0 }, "vi": { "buckets": { "dev": 1333, "invalidated": 474, "other": 12773, "test": 1472, "train": 1879, "validated": 6549 }, "duration": 79130816, "reportedSentences": 199, "validatedSentences": 13398, "unvalidatedSentences": 5424, "clips": 19796, "splits": { "accent": { "": 1557 }, "variant": { "vi-hanoi": 75, "vi-saigon": 384, "vi-huett": 1157 }, "age": { "thirties": 0.0445, "twenties": 0.2039, "": 0.259, "teens": 0.1975, "seventies": 0.022, "fourties": 0.018, "sixties": 0.2548, "fifties": 0.0003 }, "gender": { "male_masculine": 0.4778, "": 0.318, "female_feminine": 0.204, "do_not_wish_to_say": 0.0003 }, "sentence_domain": { "general": 1321 } }, "users": 375, "size": 463292097, "checksum": "1af4d408cf65eb5599017fdc4055e9ec2aee48d49d5ad488163e8a42a8b7bc55", "avgDurationSecs": 3.997, "validDurationSecs": 26178.405, "totalHrs": 21.98, "validHrs": 7.27 }, "vot": { "buckets": { "dev": 0, "invalidated": 324, "other": 0, "test": 7, "train": 96, "validated": 103 }, "duration": 1029972, "reportedSentences": 0, "validatedSentences": 5437, "unvalidatedSentences": 1, "clips": 427, "splits": { "accent": {}, "variant": {}, "age": { "": 0.2529, "twenties": 0.7354, "teens": 0.0117 }, "gender": { "": 0.2529, "male_masculine": 0.7471 }, "sentence_domain": {} }, "users": 6, "size": 8190103, "checksum": "ba8d26da8d3bd70bb406505d5a924395109458868500460e8cb701a15d8fa74d", "avgDurationSecs": 2.412, "validDurationSecs": 248.448, "totalHrs": 0.28, 
"validHrs": 0.06 }, "wbl": { "buckets": { "dev": 1110, "invalidated": 89, "other": 1639, "test": 1128, "train": 2542, "validated": 6464 }, "duration": 55320552, "reportedSentences": 1, "validatedSentences": 5493, "unvalidatedSentences": 114, "clips": 8192, "splits": { "accent": {}, "variant": {}, "age": { "seventies": 0.6541, "": 0.2125, "thirties": 0.0055, "sixties": 0.12, "fifties": 0.0031, "teens": 0.0049 }, "gender": { "": 0.8818, "male_masculine": 0.1182 }, "sentence_domain": { "nature_environment": 3 } }, "users": 14, "size": 321023495, "checksum": "ecb77641a47d4c54483652a80fca550ab837a5578a5f8b81a2eb5bd0ad403d85", "avgDurationSecs": 6.753, "validDurationSecs": 43651.373, "totalHrs": 15.36, "validHrs": 12.12 }, "wes": { "buckets": { "dev": 324, "invalidated": 51, "other": 353, "test": 322, "train": 352, "validated": 9013 }, "duration": 38779632, "reportedSentences": 0, "validatedSentences": 999, "unvalidatedSentences": 0, "clips": 9417, "splits": { "accent": {}, "variant": {}, "age": { "": 0.9356, "fourties": 0.0589, "thirties": 0.0001, "twenties": 0.0053 }, "gender": { "": 0.9358, "female_feminine": 0.0642 }, "sentence_domain": {} }, "users": 18, "size": 209729941, "checksum": "46b49ef2899bf3d179f1b23f82e077849c1680054b04b989eca887d12e3c6bbd", "avgDurationSecs": 4.118, "validDurationSecs": 37115.942, "totalHrs": 10.77, "validHrs": 10.3 }, "xh": { "buckets": { "dev": 0, "invalidated": 2, "other": 19, "test": 2, "train": 7, "validated": 9 }, "duration": 179712, "reportedSentences": 0, "validatedSentences": 1476, "unvalidatedSentences": 14922, "clips": 30, "splits": { "accent": {}, "variant": {}, "age": { "": 1 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 4, "size": 2216145, "checksum": "e46e23cb93ef7a620e13dba1591b893893043edb495150ec49e4bf6ff20c916e", "avgDurationSecs": 5.99, "validDurationSecs": 53.914, "totalHrs": 0.04, "validHrs": 0.01 }, "xhe": { "buckets": { "dev": 0, "invalidated": 129, "other": 14, "test": 1443, "train": 3542, "validated": 
11429 }, "duration": 35080848, "reportedSentences": 1, "validatedSentences": 5040, "unvalidatedSentences": 2, "clips": 11572, "splits": { "accent": { "": 5040 }, "variant": {}, "age": { "thirties": 0.0004, "twenties": 0.8517, "": 0.1459, "teens": 0.002 }, "gender": { "male_masculine": 0.0004, "": 0.9996 }, "sentence_domain": {} }, "users": 11, "size": 205702908, "checksum": "b69cc8ead392448306bf73a24caa7f65e4e1b79c77d9b6c9a6b878b444ca8959", "avgDurationSecs": 3.032, "validDurationSecs": 34647.339, "totalHrs": 9.74, "validHrs": 9.62 }, "xka": { "buckets": { "dev": 0, "invalidated": 52, "other": 578, "test": 401, "train": 1576, "validated": 11203 }, "duration": 37380348, "reportedSentences": 4, "validatedSentences": 1980, "unvalidatedSentences": 46, "clips": 11833, "splits": { "accent": {}, "variant": {}, "age": { "twenties": 0.7369, "": 0.0086, "fifties": 0.2545 }, "gender": { "": 1 }, "sentence_domain": { "general": 26 } }, "users": 9, "size": 199216198, "checksum": "bdd00e082e4d5ab6e5a521503e3cf0575dfd3d5db6dbb3e52066a2dec553d4aa", "avgDurationSecs": 3.159, "validDurationSecs": 35390.183, "totalHrs": 10.38, "validHrs": 9.83 }, "xmf": { "buckets": { "dev": 327, "invalidated": 590, "other": 32, "test": 326, "train": 336, "validated": 6766 }, "duration": 45481536, "reportedSentences": 20, "validatedSentences": 1004, "unvalidatedSentences": 36, "clips": 7388, "splits": { "accent": { "chkhor": 372, "zugid": 964, "bandza": 55, "jvar": 5, "": 5, "senak": 15 }, "variant": { "xmf-senmar": 467, "xmf-samurz": 339, "xmf-zugdid": 413 }, "age": { "": 0.2293, "thirties": 0.3025, "twenties": 0.3143, "fourties": 0.1331, "fifties": 0.0007, "teens": 0.0202 }, "gender": { "": 0.4557, "female_feminine": 0.49, "male_masculine": 0.0543 }, "sentence_domain": { "general": 19, "automotive_transport": 19, "agriculture_food": 9 } }, "users": 89, "size": 266731489, "checksum": "bf0b460cd6d6ab75533ead50f57e2b304758ce50224fa6dd8a2e8af0324dcb95", "avgDurationSecs": 6.156, "validDurationSecs": 
41652.419, "totalHrs": 12.63, "validHrs": 11.57 }, "yaq": { "buckets": { "dev": 131, "invalidated": 1, "other": 845, "test": 800, "train": 1897, "validated": 6925 }, "duration": 41273208, "reportedSentences": 0, "validatedSentences": 2838, "unvalidatedSentences": 0, "clips": 7771, "splits": { "accent": {}, "variant": {}, "age": { "twenties": 0.3643, "fourties": 0.3634, "thirties": 0.2715, "": 0.0006, "fifties": 0.0001 }, "gender": { "": 0.3649, "female_feminine": 0.6351 }, "sentence_domain": {} }, "users": 5, "size": 243193492, "checksum": "3a918cfdd03d9b101149798cd6337a341163eb118217e7418a75ddecdbd28fd3", "avgDurationSecs": 5.311, "validDurationSecs": 36779.947, "totalHrs": 11.46, "validHrs": 10.21 }, "yav": { "buckets": { "dev": 274, "invalidated": 322, "other": 498, "test": 316, "train": 423, "validated": 4679 }, "duration": 36596088, "reportedSentences": 0, "validatedSentences": 1013, "unvalidatedSentences": 0, "clips": 5499, "splits": { "accent": {}, "variant": {}, "age": { "": 0.8894, "fourties": 0.1106 }, "gender": { "": 0.8894, "female_feminine": 0.1106 }, "sentence_domain": {} }, "users": 10, "size": 212055030, "checksum": "39f1cdedd810dc8f55185bed61e634abcaaa917c963ca31eb7231b29e6bbf15b", "avgDurationSecs": 6.655, "validDurationSecs": 31138.952, "totalHrs": 10.16, "validHrs": 8.64 }, "ydg": { "buckets": { "dev": 0, "invalidated": 9, "other": 812, "test": 353, "train": 1445, "validated": 10642 }, "duration": 41216364, "reportedSentences": 0, "validatedSentences": 1937, "unvalidatedSentences": 169, "clips": 11463, "splits": { "accent": {}, "variant": {}, "age": { "": 0.8861, "thirties": 0.1139 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 17, "size": 240859534, "checksum": "e48d7072f84b0f102ada4590a76675e30af854368b202358bde8d329425e233b", "avgDurationSecs": 3.596, "validDurationSecs": 38264.376, "totalHrs": 11.44, "validHrs": 10.62 }, "yi": { "buckets": { "dev": 460, "invalidated": 87, "other": 63, "test": 451, "train": 483, "validated": 1849 
}, "duration": 7946640, "reportedSentences": 17, "validatedSentences": 1550, "unvalidatedSentences": 85, "clips": 1999, "splits": { "accent": { "": 1004 }, "variant": {}, "age": { "": 0.7659, "twenties": 0.0615, "teens": 0.1086, "thirties": 0.064 }, "gender": { "": 0.9975, "male_masculine": 0.0025 }, "sentence_domain": { "general": 13, "finance": 2, "history_law_government": 1, "nature_environment": 3 } }, "users": 124, "size": 44771592, "checksum": "04b3a299e9537ee37da80f032ef41fa212cf59746b4994d1d50cea68a0854329", "avgDurationSecs": 3.975, "validDurationSecs": 7350.344, "totalHrs": 2.2, "validHrs": 2.04 }, "yo": { "buckets": { "dev": 975, "invalidated": 264, "other": 1120, "test": 1071, "train": 1422, "validated": 3470 }, "duration": 29319516, "reportedSentences": 21, "validatedSentences": 5419, "unvalidatedSentences": 17, "clips": 4854, "splits": { "accent": { "": 447 }, "variant": {}, "age": { "thirties": 0.0299, "twenties": 0.6096, "": 0.2919, "teens": 0.041, "sixties": 0.0031, "fourties": 0.001, "fifties": 0.0235 }, "gender": { "male_masculine": 0.2967, "": 0.335, "female_feminine": 0.3684 }, "sentence_domain": {} }, "users": 133, "size": 170912297, "checksum": "dca27f2ca5f4f3e343bec48ac6f2c984f6258fb7ad1f45991f7948c92358df23", "avgDurationSecs": 6.04, "validDurationSecs": 20959.769, "totalHrs": 8.14, "validHrs": 5.82 }, "yue": { "buckets": { "dev": 5130, "invalidated": 8092, "other": 79777, "test": 5130, "train": 7420, "validated": 191452 }, "duration": 1106513784, "reportedSentences": 2257, "validatedSentences": 18556, "unvalidatedSentences": 10171, "clips": 279321, "splits": { "accent": { "": 126764 }, "variant": {}, "age": { "thirties": 0.4699, "": 0.1875, "twenties": 0.2923, "fourties": 0.0366, "sixties": 0.0021, "fifties": 0.0008, "teens": 0.0072, "nineties": 0.0036 }, "gender": { "male_masculine": 0.1706, "": 0.201, "female_feminine": 0.6245, "do_not_wish_to_say": 0.0002, "non-binary": 0.0036 }, "sentence_domain": { "nature_environment": 188, 
"general": 1682, "automotive_transport": 52, "finance": 128, "news_current_affairs": 114, "history_law_government": 67, "service_retail": 612, "media_entertainment": 192, "agriculture_food": 18, "healthcare": 417, "technology_robotics": 95 } }, "users": 1179, "size": 6431854197, "checksum": "e3924bd720dee56c2849013320c62b73b35338d92c787a4eee3b074fc2e4638b", "avgDurationSecs": 3.961, "validDurationSecs": 758425.886, "totalHrs": 307.36, "validHrs": 210.67 }, "zgh": { "buckets": { "dev": 308, "invalidated": 95, "other": 665, "test": 244, "train": 894, "validated": 1446 }, "duration": 7851132, "reportedSentences": 7, "validatedSentences": 2740, "unvalidatedSentences": 356, "clips": 2206, "splits": { "accent": {}, "variant": { "zgh-shi": 1704, "zgh-tzm": 230 }, "age": { "": 0.117, "thirties": 0.5113, "twenties": 0.3717 }, "gender": { "": 0.1646, "male_masculine": 0.8092, "female_feminine": 0.0263 }, "sentence_domain": { "general": 35, "language_fundamentals": 5, "healthcare": 3, "technology_robotics": 1, "agriculture_food": 1 } }, "users": 47, "size": 43064698, "checksum": "0d4cd7786cffb23ba7cbcd7ad57e4dcb3e80a1744c99c68532c9e0d565ceb06e", "avgDurationSecs": 3.559, "validDurationSecs": 5146.3, "totalHrs": 2.18, "validHrs": 1.42 }, "zh-CN": { "buckets": { "dev": 10653, "invalidated": 59226, "other": 602741, "test": 10653, "train": 29608, "validated": 189674 }, "duration": 3865582860, "reportedSentences": 1145, "validatedSentences": 59143, "unvalidatedSentences": 909, "clips": 851641, "splits": { "accent": { "110000": 7174, "120000": 2889, "130000": 1930, "140000": 1391, "150000": 277, "210000": 2904, "220000": 534, "230000": 3971, "310000": 3526, "320000": 3913, "330000": 2917, "340000": 1411, "350000": 1810, "360000": 5638, "370000": 3701, "410000": 2275, "420000": 1792, "430000": 1053, "440000": 4621, "450000": 1685, "460000": 331, "500000": 1404, "510000": 2698, "520000": 164, "530000": 240, "540000": 5, "610000": 685, "620000": 182, "630000": 5, "640000": 414, 
"650000": 355, "710000": 85, "810000": 113, "": 3898 }, "variant": {}, "age": { "": 0.9209, "teens": 0.0136, "twenties": 0.0489, "thirties": 0.0138, "fourties": 0.0026, "nineties": 0, "fifties": 0.0002, "sixties": 0, "seventies": 0 }, "gender": { "": 0.9277, "male_masculine": 0.0569, "female_feminine": 0.0144, "do_not_wish_to_say": 0.0006, "non-binary": 0.0003 }, "sentence_domain": { "technology_robotics": 249, "general": 890, "history_law_government": 394, "media_entertainment": 1672, "news_current_affairs": 194, "automotive_transport": 75, "nature_environment": 63, "finance": 111, "service_retail": 58, "healthcare": 139, "language_fundamentals": 102, "agriculture_food": 55 } }, "users": 7546, "size": 22953106130, "checksum": "59ecd91ef0f2a23b5635c12c3dfab833721e4bb76518995811d5690893203d79", "avgDurationSecs": 4.539, "validDurationSecs": 860926.803, "totalHrs": 1073.77, "validHrs": 239.14 }, "zh-HK": { "buckets": { "dev": 5604, "invalidated": 4667, "other": 25383, "test": 5604, "train": 8450, "validated": 94118 }, "duration": 515372748, "reportedSentences": 812, "validatedSentences": 20148, "unvalidatedSentences": 60, "clips": 124168, "splits": { "accent": { "": 5289 }, "variant": {}, "age": { "fourties": 0.1235, "thirties": 0.1055, "": 0.3827, "teens": 0.0223, "fifties": 0.0286, "seventies": 0.002, "sixties": 0.0065, "twenties": 0.329 }, "gender": { "male_masculine": 0.4031, "": 0.3719, "female_feminine": 0.2241, "do_not_wish_to_say": 0.0008, "transgender": 0.0001 }, "sentence_domain": { "media_entertainment": 53, "general": 1032, "nature_environment": 58, "service_retail": 317, "automotive_transport": 31, "finance": 46, "history_law_government": 84, "healthcare": 94, "language_fundamentals": 88, "news_current_affairs": 25, "technology_robotics": 54 } }, "users": 3110, "size": 3674912896, "checksum": "c34935c4a75bb1a13f4fb9a4ff570291a55c53095a347c4e7b9b0a43ac7cb0d9", "avgDurationSecs": 4.151, "validDurationSecs": 390646.965, "totalHrs": 143.15, "validHrs": 
108.51 }, "zh-TW": { "buckets": { "dev": 5119, "invalidated": 4920, "other": 50386, "test": 5119, "train": 7394, "validated": 85324 }, "duration": 472795042, "reportedSentences": 179, "validatedSentences": 20786, "unvalidatedSentences": 977, "clips": 140630, "splits": { "accent": { "taipei_city": 19646, "chiayi_city": 1195, "kaohsiung_city": 3266, "new_taipei_city": 8850, "taoyuan_city": 3015, "taichung_city": 4411, "nantou_county": 2101, "tainan_city": 1708, "keelung_city": 141, "yilan_county": 765, "hsinchu_county": 343, "hong_kong": 690, "yunlin_county": 2560, "changhua_county": 2009, "pingtung_county": 913, "": 5017, "chiayi_county": 379, "hualien_county": 878, "hsinchu_city": 2866, "kinmen_county": 55, "taitung_county": 10, "miaoli_county": 15, "penghu_county": 20 }, "variant": {}, "age": { "thirties": 0.1921, "twenties": 0.2963, "teens": 0.06, "": 0.268, "fifties": 0.0895, "seventies": 0.0002, "fourties": 0.0908, "sixties": 0.0031 }, "gender": { "male_masculine": 0.4873, "": 0.291, "female_feminine": 0.2208, "do_not_wish_to_say": 0.0002, "transgender": 0.0007 }, "sentence_domain": { "general": 1502, "history_law_government": 170, "media_entertainment": 170, "service_retail": 151, "nature_environment": 14, "language_fundamentals": 8, "automotive_transport": 278, "healthcare": 25, "news_current_affairs": 44, "agriculture_food": 12, "technology_robotics": 777, "finance": 3 } }, "users": 2317, "size": 3165560607, "checksum": "9f88c8978b145ad13d502c2df5010f942830d58a21f05854eee41b20126b1028", "avgDurationSecs": 3.362, "validDurationSecs": 286857.457, "totalHrs": 131.33, "validHrs": 79.68 }, "zoc": { "buckets": { "dev": 483, "invalidated": 2, "other": 60, "test": 474, "train": 524, "validated": 8886 }, "duration": 36533628, "reportedSentences": 0, "validatedSentences": 1481, "unvalidatedSentences": 7, "clips": 8948, "splits": { "accent": {}, "variant": {}, "age": { "": 0.0897, "fourties": 0.3074, "twenties": 0.3814, "thirties": 0.2214 }, "gender": { "": 0.5561, 
"female_feminine": 0.4439 }, "sentence_domain": {} }, "users": 12, "size": 213980458, "checksum": "63cb37757d4210b08e53493df74c96ea79cf18fadb3898220363c6d26f2e8189", "avgDurationSecs": 4.083, "validDurationSecs": 36280.489, "totalHrs": 10.14, "validHrs": 10.07 }, "zu": { "buckets": { "dev": 0, "invalidated": 12, "other": 174, "test": 2, "train": 15, "validated": 17 }, "duration": 1123272, "reportedSentences": 4, "validatedSentences": 1320, "unvalidatedSentences": 19379, "clips": 203, "splits": { "accent": { "": 80 }, "variant": {}, "age": { "": 0.4877, "twenties": 0.5123 }, "gender": { "": 0.4877, "do_not_wish_to_say": 0.3399, "female_feminine": 0.1724 }, "sentence_domain": {} }, "users": 8, "size": 7959944, "checksum": "ebf033ef7b822745ffa11f61917a6cd0ac87a794504035da9ff6aee4d49768df", "avgDurationSecs": 5.533, "validDurationSecs": 94.067, "totalHrs": 0.31, "validHrs": 0.02 }, "zza": { "buckets": { "dev": 463, "invalidated": 246, "other": 457, "test": 436, "train": 813, "validated": 1712 }, "duration": 9737244, "reportedSentences": 2322, "validatedSentences": 5657, "unvalidatedSentences": 989, "clips": 2415, "splits": { "accent": { "": 175 }, "variant": {}, "age": { "thirties": 0.5971, "": 0.2149, "sixties": 0.1093, "fifties": 0.0104, "twenties": 0.0683 }, "gender": { "male_masculine": 0.2559, "": 0.6584, "female_feminine": 0.0174, "do_not_wish_to_say": 0.0683 }, "sentence_domain": { "general": 772, "healthcare": 19, "language_fundamentals": 29, "service_retail": 10, "finance": 1, "nature_environment": 4, "automotive_transport": 1, "agriculture_food": 3, "technology_robotics": 2, "media_entertainment": 2 } }, "users": 23, "size": 57047908, "checksum": "db49e383e9b7f68992da5a7b8d3a797592dbbc5794a567f36cc2b5db975bb445", "avgDurationSecs": 4.032, "validDurationSecs": 6902.758, "totalHrs": 2.7, "validHrs": 1.91 }, "ps": { "buckets": { "dev": 15462, "invalidated": 177351, "other": 1896582, "test": 15462, "train": 218250, "validated": 2776943 }, "duration": 19129134672, 
"reportedSentences": 6119, "validatedSentences": 334539, "unvalidatedSentences": 118912, "clips": 4850876, "splits": { "accent": { "": 482561 }, "variant": {}, "age": { "fourties": 0.0297, "twenties": 0.6098, "": 0.0391, "thirties": 0.1835, "teens": 0.1132, "fifties": 0.0242, "sixties": 0.0005, "seventies": 0.0, "eighties": 0, "nineties": 0 }, "gender": { "male_masculine": 0.0052, "": 0.3672, "female_feminine": 0.627, "do_not_wish_to_say": 0.0006, "transgender": 0, "non-binary": 0 }, "sentence_domain": { "general": 567588, "language_fundamentals": 49603, "agriculture_food": 41672, "automotive_transport": 6086, "media_entertainment": 31393, "history_law_government": 164529, "nature_environment": 146980, "healthcare": 88914, "news_current_affairs": 12379, "service_retail": 25312, "technology_robotics": 29662, "finance": 10454 } }, "users": 8239, "size": 105017656913, "checksum": "3442a9c32aaa7d6f23f82d2647b26402a326d9a3a5c13eb601eb064c763d4865", "avgDurationSecs": 3.943, "validDurationSecs": 10950705.939, "totalHrs": 5313.64, "validHrs": 3041.86 } }, "totalDuration": 150455392087, "totalValidDurationSecs": 102162829, "totalHrs": 41792, "totalValidHrs": 28377 } ================================================ FILE: datasets/scripted-speech/cv-corpus-25.0-delta-2026-03-09.json ================================================ { "locales": { "ab": { "buckets": { "dev": 12419, "invalidated": 14978, "other": 345, "test": 12448, "train": 50489, "validated": 78839 }, "duration": 539435376, "reportedSentences": 123, "validatedSentences": 1046833, "unvalidatedSentences": 14, "clips": 94162, "splits": { "accent": { "": 45 }, "variant": {}, "age": { "": 0.1247, "twenties": 0.1227, "thirties": 0.0618, "teens": 0.6391, "fourties": 0.0299, "nineties": 0.0001, "fifties": 0.0196, "sixties": 0.002, "seventies": 0.0001 }, "gender": { "": 0.2738, "female_feminine": 0.7107, "transgender": 0.0001, "male_masculine": 0.0042, "do_not_wish_to_say": 0.0111 }, "sentence_domain": { 
"nature_environment": 1, "automotive_transport": 2, "agriculture_food": 1, "history_law_government": 2 } }, "users": 878, "size": 3198369258, "checksum": "20e9b811a7edee42d5e1583df949b2418f968f454e5bbec272cdee05b43edbdf", "avgDurationSecs": 5.729, "validDurationSecs": 451652.956, "totalHrs": 149.84, "validHrs": 125.45 }, "ady": { "buckets": { "dev": 844, "invalidated": 1024, "other": 9054, "test": 968, "train": 1159, "validated": 4809 }, "duration": 87547320, "reportedSentences": 792, "validatedSentences": 11364, "unvalidatedSentences": 2734, "clips": 14887, "splits": { "accent": { "abzakh": 1318, "bzhedug": 358, "temirgoy": 1944, "shapsug": 2623, "": 105 }, "variant": { "ady-RU": 3608, "ady-Cyrl": 1706, "ady-Cyrl-TR": 1008, "ady-Latn-TR-t-ady-cyrl": 29 }, "age": { "teens": 0.0146, "fifties": 0.1021, "twenties": 0.3926, "thirties": 0.4628, "": 0.0109, "fourties": 0.0124, "seventies": 0.0046 }, "gender": { "": 0.1098, "female_feminine": 0.8902 }, "sentence_domain": { "general": 23413, "service_retail": 12, "language_fundamentals": 71, "healthcare": 8, "agriculture_food": 9, "news_current_affairs": 17, "media_entertainment": 14, "nature_environment": 3, "technology_robotics": 54 } }, "users": 45, "size": 502284350, "checksum": "1aa018119f30b8f658b8007f9893e7a7f705fca705c15a173f2f8a45d1a91ab8", "avgDurationSecs": 5.881, "validDurationSecs": 28280.719, "totalHrs": 24.31, "validHrs": 7.85 }, "af": { "buckets": { "dev": 0, "invalidated": 0, "other": 1, "test": 0, "train": 0, "validated": 0 }, "duration": 7920, "reportedSentences": 0, "validatedSentences": 4805, "unvalidatedSentences": 16562, "clips": 1, "splits": { "accent": {}, "variant": {}, "age": { "": 1 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 1, "size": 1630425, "checksum": "4b16c053301b4542741d53538f7998d692c521355a825aa7e2275f275e512670", "avgDurationSecs": 7.92, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ajg": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, 
"train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 3213, "unvalidatedSentences": 0, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 139906, "checksum": "6daaa6c3cbc5e8f39bf2eeb6b65b7ac8862e4946245582f0a0c8f423d9c72392", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "am": { "buckets": { "dev": 0, "invalidated": 0, "other": 20, "test": 0, "train": 0, "validated": 0 }, "duration": 201528, "reportedSentences": 0, "validatedSentences": 2208, "unvalidatedSentences": 15, "clips": 20, "splits": { "accent": {}, "variant": {}, "age": { "": 0.25, "fourties": 0.25, "thirties": 0.5 }, "gender": { "": 1 }, "sentence_domain": { "history_law_government": 1 } }, "users": 3, "size": 1311321, "checksum": "9c26ba8041e8a833d5739d2a5d66f3594b3c452409e1461dc54bffaaaa81918d", "avgDurationSecs": 10.076, "validDurationSecs": 0, "totalHrs": 0.05, "validHrs": 0 }, "an": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 10454, "unvalidatedSentences": 357, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 668828, "checksum": "e6897db054a7dfbf515a8156504a808ff69c802e9c9b7c0addd36cbb5e232395", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ar": { "buckets": { "dev": 0, "invalidated": 0, "other": 100, "test": 0, "train": 0, "validated": 0 }, "duration": 448812, "reportedSentences": 0, "validatedSentences": 59434, "unvalidatedSentences": 1742117, "clips": 100, "splits": { "accent": { "": 5 }, "variant": {}, "age": { "": 0.6, "thirties": 0.1, "twenties": 0.3 }, "gender": { "": 0.6, "female_feminine": 0.4 }, "sentence_domain": {} }, "users": 13, "size": 127489553, "checksum": "e253f74d125766ef095e2f53ae483b5ba1460a0479296f810159603606abd2e8", 
"avgDurationSecs": 4.488, "validDurationSecs": 0, "totalHrs": 0.12, "validHrs": 0 }, "as": { "buckets": { "dev": 0, "invalidated": 0, "other": 5, "test": 0, "train": 0, "validated": 0 }, "duration": 26172, "reportedSentences": 0, "validatedSentences": 7171, "unvalidatedSentences": 3344, "clips": 5, "splits": { "accent": {}, "variant": {}, "age": { "fourties": 1 }, "gender": { "": 1 }, "sentence_domain": { "general": 2, "technology_robotics": 2 } }, "users": 1, "size": 942430, "checksum": "c5515469291e03271c8cca8478908748fd27eb68b0ea0ce8c84743b22670064c", "avgDurationSecs": 5.234, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ast": { "buckets": { "dev": 0, "invalidated": 0, "other": 10, "test": 0, "train": 0, "validated": 0 }, "duration": 45000, "reportedSentences": 0, "validatedSentences": 1645, "unvalidatedSentences": 287, "clips": 10, "splits": { "accent": {}, "variant": {}, "age": { "": 1 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 2, "size": 382615, "checksum": "4a84afc9b564455650b4c94c759a5885bf522efecb38d8c2ee9eaa8511cbb570", "avgDurationSecs": 4.5, "validDurationSecs": 0, "totalHrs": 0.01, "validHrs": 0 }, "az": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 93161, "unvalidatedSentences": 1862, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 6316658, "checksum": "2a4f7ac49c9dedcb94faa4ecd293c75d3343e97857e34680f43fdcb8e72201d7", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ba": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 153968, "unvalidatedSentences": 5, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 10314872, "checksum": 
"61f56527592149b6661ef7536cf47cab28f29f9f95b7154d5b46de25669fb395", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "bbl": { "buckets": { "dev": 0, "invalidated": 0, "other": 160, "test": 0, "train": 0, "validated": 0 }, "duration": 1305576, "reportedSentences": 0, "validatedSentences": 1661, "unvalidatedSentences": 580, "clips": 160, "splits": { "accent": {}, "variant": {}, "age": { "": 1 }, "gender": { "": 1 }, "sentence_domain": { "general": 131, "agriculture_food": 17 } }, "users": 2, "size": 7960082, "checksum": "64aafac251715629f133e21a847702145ca86385f6829297698f9be1f363f1ef", "avgDurationSecs": 8.16, "validDurationSecs": 0, "totalHrs": 0.36, "validHrs": 0 }, "bci": { "buckets": { "dev": 0, "invalidated": 0, "other": 1, "test": 0, "train": 0, "validated": 0 }, "duration": 2448, "reportedSentences": 0, "validatedSentences": 876, "unvalidatedSentences": 0, "clips": 1, "splits": { "accent": {}, "variant": {}, "age": { "fourties": 1 }, "gender": { "female_feminine": 1 }, "sentence_domain": {} }, "users": 1, "size": 66788, "checksum": "951fa0873755af48e2f4875b70ac606cfeda20a425a290ae5c95f8edfa06a324", "avgDurationSecs": 2.448, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "be": { "buckets": { "dev": 5, "invalidated": 0, "other": 1922, "test": 10, "train": 21, "validated": 38 }, "duration": 10835856, "reportedSentences": 19, "validatedSentences": 379562, "unvalidatedSentences": 1917, "clips": 1960, "splits": { "accent": {}, "variant": {}, "age": { "": 0.9745, "teens": 0.0128, "twenties": 0.0128 }, "gender": { "": 0.9872, "male_masculine": 0.0128 }, "sentence_domain": { "history_law_government": 14, "news_current_affairs": 1, "technology_robotics": 1 } }, "users": 17, "size": 87449064, "checksum": "fb91ec50bb025a233cc6c1bb682e35a90b1e72fb4914a7044db7a4e20cb53201", "avgDurationSecs": 5.528, "validDurationSecs": 210.083, "totalHrs": 3, "validHrs": 0.05 }, "bft": { "buckets": { "dev": 0, "invalidated": 0, "other": 15, "test": 0, 
"train": 0, "validated": 0 }, "duration": 95904, "reportedSentences": 0, "validatedSentences": 7968, "unvalidatedSentences": 12, "clips": 15, "splits": { "accent": {}, "variant": {}, "age": { "": 1 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 1, "size": 1102389, "checksum": "7f33733b421b5aecf8c233723a73d1dfadb12f074d0d9c613e92ee924e55b551", "avgDurationSecs": 6.394, "validDurationSecs": 0, "totalHrs": 0.02, "validHrs": 0 }, "bg": { "buckets": { "dev": 0, "invalidated": 1, "other": 53, "test": 0, "train": 1, "validated": 1 }, "duration": 345420, "reportedSentences": 3, "validatedSentences": 39572, "unvalidatedSentences": 3, "clips": 55, "splits": { "accent": { "": 50 }, "variant": {}, "age": { "fifties": 0.9091, "": 0.0909 }, "gender": { "female_feminine": 0.9091, "": 0.0909 }, "sentence_domain": {} }, "users": 2, "size": 4997824, "checksum": "6bacbba2c3173f79f3886fa224067f529a50a493f587ec74613c1b7985bb3837", "avgDurationSecs": 6.28, "validDurationSecs": 6.28, "totalHrs": 0.09, "validHrs": 0 }, "bgp": { "buckets": { "dev": 0, "invalidated": 0, "other": 648, "test": 0, "train": 8, "validated": 8 }, "duration": 3963420, "reportedSentences": 0, "validatedSentences": 6997, "unvalidatedSentences": 0, "clips": 656, "splits": { "accent": { "": 10 }, "variant": {}, "age": { "": 0.0305, "thirties": 0.9695 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 3, "size": 23850375, "checksum": "993d68d7f13ccba167e1c07da4296175d0a8cc668fb23d3c0484a2ef2aae3a13", "avgDurationSecs": 6.042, "validDurationSecs": 48.334, "totalHrs": 1.1, "validHrs": 0.01 }, "bn": { "buckets": { "dev": 0, "invalidated": 0, "other": 38, "test": 0, "train": 0, "validated": 0 }, "duration": 221076, "reportedSentences": 0, "validatedSentences": 562831, "unvalidatedSentences": 94115, "clips": 38, "splits": { "accent": { "": 5 }, "variant": {}, "age": { "": 0.8684, "twenties": 0.1316 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 5, "size": 52831974, "checksum": 
"38192a8de3794c059a9220405fe200d42555bffd197956506f34b69a93f1db32", "avgDurationSecs": 5.818, "validDurationSecs": 0, "totalHrs": 0.06, "validHrs": 0 }, "br": { "buckets": { "dev": 70, "invalidated": 0, "other": 31, "test": 64, "train": 111, "validated": 245 }, "duration": 1001448, "reportedSentences": 0, "validatedSentences": 23284, "unvalidatedSentences": 18486, "clips": 276, "splits": { "accent": { "leoneg": 25, "kerneveg": 60, "gwenedeg": 111, "": 30 }, "variant": {}, "age": { "": 0.3804, "fifties": 0.0181, "thirties": 0.4022, "fourties": 0.163, "sixties": 0.0362 }, "gender": { "": 0.8188, "female_feminine": 0.1812 }, "sentence_domain": { "general": 212, "finance": 3 } }, "users": 15, "size": 8229886, "checksum": "7980a3be991c410305efed21a980c7e311a2139103516105494eb272c1e9aa53", "avgDurationSecs": 3.628, "validDurationSecs": 888.967, "totalHrs": 0.27, "validHrs": 0.24 }, "brh": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 3095, "unvalidatedSentences": 0, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 224071, "checksum": "2ab31742c0b634c24906320342cb68d6c78fae6e756c2dfbd48914cf1eaea3b2", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "bsk": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 2447, "unvalidatedSentences": 22, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 135165, "checksum": "24b7a09c5352a9f971ff43fc00ae60f3082041a5712faa6fce07be06222bdb51", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "btv": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, 
"reportedSentences": 0, "validatedSentences": 1053, "unvalidatedSentences": 9, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 60546, "checksum": "f54458e1f97900aa8397fbabb3fa743cd801f9b1d35e62b66e6619cd66a4c9dc", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ca": { "buckets": { "dev": 83, "invalidated": 21, "other": 19546, "test": 80, "train": 101, "validated": 264 }, "duration": 93465432, "reportedSentences": 389, "validatedSentences": 1306364, "unvalidatedSentences": 3204, "clips": 19831, "splits": { "accent": { "northwestern": 2702, "central": 1758, "valencian": 5, "": 50, "balearic": 30 }, "variant": { "ca-central": 13997, "ca-nwestern": 2702, "ca-valencia-southern": 5, "ca-valencia-tortosi": 7, "ca-balear": 30 }, "age": { "fifties": 0.4773, "sixties": 0.34, "fourties": 0.02, "seventies": 0.1289, "": 0.0316, "thirties": 0.0023 }, "gender": { "male_masculine": 0.6207, "female_feminine": 0.3477, "": 0.0316 }, "sentence_domain": { "agriculture_food": 13, "general": 1279, "language_fundamentals": 5, "technology_robotics": 7, "healthcare": 5, "nature_environment": 15, "automotive_transport": 9, "history_law_government": 5, "media_entertainment": 11, "news_current_affairs": 5, "service_retail": 1 } }, "users": 60, "size": 642389599, "checksum": "d1d5961a34880d6777425a74cfd5fe02f4a2e390a9a796a0e5517afd60eae530", "avgDurationSecs": 4.713, "validDurationSecs": 1244.258, "totalHrs": 25.96, "validHrs": 0.34 }, "ckb": { "buckets": { "dev": 0, "invalidated": 1, "other": 1288, "test": 0, "train": 10, "validated": 10 }, "duration": 6369804, "reportedSentences": 29, "validatedSentences": 19446, "unvalidatedSentences": 15087, "clips": 1299, "splits": { "accent": { "": 15 }, "variant": {}, "age": { "": 0.1093, "sixties": 0.1794, "fifties": 0.0038, "twenties": 0.107, "thirties": 0.5774, "fourties": 0.0231 }, "gender": { "": 0.7998, "male_masculine": 0.2002 }, 
"sentence_domain": { "general": 14, "media_entertainment": 8 } }, "users": 32, "size": 35737457, "checksum": "72da8efd6f4d0b3e92707cba8c5e65083363e980adcc0fe736f04f4b7f35d790", "avgDurationSecs": 4.904, "validDurationSecs": 49.036, "totalHrs": 1.76, "validHrs": 0.01 }, "cnh": { "buckets": { "dev": 0, "invalidated": 0, "other": 1, "test": 0, "train": 0, "validated": 0 }, "duration": 9576, "reportedSentences": 0, "validatedSentences": 5218, "unvalidatedSentences": 0, "clips": 1, "splits": { "accent": {}, "variant": {}, "age": { "": 1 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 1, "size": 335165, "checksum": "cea011c60ae655b889c341cfd0d087657ce7cef4edad7b1e8ccc82b8e75a7f22", "avgDurationSecs": 9.576, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "cs": { "buckets": { "dev": 0, "invalidated": 0, "other": 26, "test": 0, "train": 0, "validated": 0 }, "duration": 129492, "reportedSentences": 0, "validatedSentences": 451253, "unvalidatedSentences": 105, "clips": 26, "splits": { "accent": { "": 20 }, "variant": {}, "age": { "": 0.8077, "thirties": 0.1923 }, "gender": { "": 0.8077, "male_masculine": 0.1923 }, "sentence_domain": {} }, "users": 6, "size": 31433345, "checksum": "63ec6a63f257f90916be896dbb070efac4ed2c56fcb19e4e6a9d05b0eded7cdf", "avgDurationSecs": 4.98, "validDurationSecs": 0, "totalHrs": 0.03, "validHrs": 0 }, "cv": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 4382, "unvalidatedSentences": 4, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 320068, "checksum": "69ccf37af867d78a449762470c21a3ca2f52802a61466f0fc2f2097da6e4e3dc", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "cy": { "buckets": { "dev": 0, "invalidated": 0, "other": 100, "test": 0, "train": 0, "validated": 0 }, "duration": 524916, "reportedSentences": 0, 
"validatedSentences": 118044, "unvalidatedSentences": 15, "clips": 100, "splits": { "accent": { "united_kingdom": 99 }, "variant": { "cy-northwes": 1 }, "age": { "": 0.01, "fourties": 0.84, "twenties": 0.15 }, "gender": { "": 0.01, "male_masculine": 0.84, "female_feminine": 0.15 }, "sentence_domain": {} }, "users": 3, "size": 10934669, "checksum": "676b1f1eef56aa96a81db9edc174beb37ed065fa87504edb0e01afbcc72224c8", "avgDurationSecs": 5.249, "validDurationSecs": 0, "totalHrs": 0.14, "validHrs": 0 }, "da": { "buckets": { "dev": 0, "invalidated": 0, "other": 227, "test": 0, "train": 0, "validated": 0 }, "duration": 1083672, "reportedSentences": 6, "validatedSentences": 20986, "unvalidatedSentences": 173, "clips": 227, "splits": { "accent": { "": 60 }, "variant": {}, "age": { "": 0.1278, "thirties": 0.1586, "fourties": 0.7137 }, "gender": { "": 1 }, "sentence_domain": { "automotive_transport": 7, "general": 11, "media_entertainment": 8, "technology_robotics": 2, "language_fundamentals": 3 } }, "users": 4, "size": 7709652, "checksum": "02fe09d707157198dbe2bf6096bdeb5907714791157bfbcd613bab05e37ee530", "avgDurationSecs": 4.774, "validDurationSecs": 0, "totalHrs": 0.3, "validHrs": 0 }, "dag": { "buckets": { "dev": 75, "invalidated": 18, "other": 1504, "test": 70, "train": 107, "validated": 290 }, "duration": 7701768, "reportedSentences": 23, "validatedSentences": 5603, "unvalidatedSentences": 15134, "clips": 1812, "splits": { "accent": { "": 111 }, "variant": {}, "age": { "fourties": 0.0028, "thirties": 0.3791, "twenties": 0.6153, "": 0.0028 }, "gender": { "": 0.5971, "female_feminine": 0.4001, "male_masculine": 0.0028 }, "sentence_domain": { "general": 16, "agriculture_food": 1, "nature_environment": 6, "finance": 7, "news_current_affairs": 4, "language_fundamentals": 3 } }, "users": 19, "size": 42586487, "checksum": "ef46caa566e3ef30a35534046469b8543d193b69f56aaf7ea2c8abadbbdc9770", "avgDurationSecs": 4.25, "validDurationSecs": 1232.623, "totalHrs": 2.13, "validHrs": 
0.34 }, "dar": { "buckets": { "dev": 533, "invalidated": 96, "other": 2611, "test": 701, "train": 1220, "validated": 4120 }, "duration": 40751028, "reportedSentences": 48, "validatedSentences": 5960, "unvalidatedSentences": 993, "clips": 6827, "splits": { "accent": { "": 280 }, "variant": {}, "age": { "": 0.2369, "twenties": 0.1548, "teens": 0.0195, "fourties": 0.2909, "thirties": 0.0103, "fifties": 0.2877 }, "gender": { "female_feminine": 0.4415, "": 0.5585 }, "sentence_domain": { "general": 6057, "healthcare": 67, "media_entertainment": 2939, "news_current_affairs": 2789, "history_law_government": 1476, "language_fundamentals": 45, "agriculture_food": 448 } }, "users": 34, "size": 237985622, "checksum": "02e2b7ab4a33d043a3e2e1df057e67052af3181a7f63c2feb5ffc2c00eaf49d7", "avgDurationSecs": 5.969, "validDurationSecs": 24592.681, "totalHrs": 11.31, "validHrs": 6.83 }, "de": { "buckets": { "dev": 99, "invalidated": 109, "other": 81, "test": 491, "train": 1013, "validated": 1603 }, "duration": 10378764, "reportedSentences": 73, "validatedSentences": 2058021, "unvalidatedSentences": 1621, "clips": 1793, "splits": { "accent": { "germany": 1547, "switzerland": 49, "": 25 }, "variant": {}, "age": { "thirties": 0.0892, "": 0.1818, "fourties": 0.6146, "fifties": 0.0335, "twenties": 0.0502, "sixties": 0.0056, "teens": 0.0251 }, "gender": { "": 0.2432, "male_masculine": 0.696, "female_feminine": 0.0608 }, "sentence_domain": { "general": 1 } }, "users": 65, "size": 209135884, "checksum": "572f0feda2057ac9cec56d522dddb94a1c5e6f9c07c2f46c543e4564e91d5d97", "avgDurationSecs": 5.788, "validDurationSecs": 9278.951, "totalHrs": 2.88, "validHrs": 2.57 }, "dml": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 5670, "unvalidatedSentences": 676, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 407139, 
"checksum": "63ef03bb0a0ef1b9a92d7aa5fd014651f8bf98e89bdeff978b8a8568a700b4e5", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "dsb": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 796, "unvalidatedSentences": 0, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 57206, "checksum": "1b34be4786ea92c87de9096b2d8de589a242f4b7ed936d6ad33dd2fcd4afe3b2", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "dv": { "buckets": { "dev": 0, "invalidated": 0, "other": 23, "test": 0, "train": 0, "validated": 0 }, "duration": 117360, "reportedSentences": 0, "validatedSentences": 7174, "unvalidatedSentences": 9, "clips": 23, "splits": { "accent": {}, "variant": {}, "age": { "": 1 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 1, "size": 1207629, "checksum": "bafcd8e73bf0034f30be5f32cf40fe638bfbac55b0eced3de2faae936a8ade9e", "avgDurationSecs": 5.103, "validDurationSecs": 0, "totalHrs": 0.03, "validHrs": 0 }, "ebr": { "buckets": { "dev": 0, "invalidated": 0, "other": 1, "test": 0, "train": 0, "validated": 0 }, "duration": 4968, "reportedSentences": 0, "validatedSentences": 872, "unvalidatedSentences": 3, "clips": 1, "splits": { "accent": {}, "variant": {}, "age": { "": 1 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 1, "size": 76177, "checksum": "d7c5feb1e1fe24534db809e744910219cdde90f9f2a4c237566a118af41bbd92", "avgDurationSecs": 4.968, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "el": { "buckets": { "dev": 0, "invalidated": 1, "other": 1001, "test": 0, "train": 9, "validated": 9 }, "duration": 4182300, "reportedSentences": 0, "validatedSentences": 8323, "unvalidatedSentences": 3395, "clips": 1011, "splits": { "accent": { "": 961 }, "variant": {}, "age": { "thirties": 0.9505, "": 0.0247, "fourties": 0.0247 }, 
"gender": { "male_masculine": 0.9505, "": 0.0247, "female_feminine": 0.0247 }, "sentence_domain": {} }, "users": 5, "size": 25176917, "checksum": "5ada3f647934569af523faceb64598b83b01cc8f9fe256889dd4f978ba94ce7b", "avgDurationSecs": 4.137, "validDurationSecs": 37.231, "totalHrs": 1.16, "validHrs": 0.01 }, "en": { "buckets": { "dev": 0, "invalidated": 0, "other": 6283, "test": 0, "train": 0, "validated": 0 }, "duration": 39203820, "reportedSentences": 39, "validatedSentences": 1681626, "unvalidatedSentences": 38860, "clips": 6283, "splits": { "accent": { "us": 786, "": 138, "indian": 489, "canada": 883, "england": 120, "malaysia": 70, "african": 30, "ireland": 20, "australia": 65, "hongkong": 5, "scotland": 40, "bermuda": 5, "philippines": 15 }, "variant": {}, "age": { "": 0.2823, "fourties": 0.0845, "twenties": 0.1982, "sixties": 0.1359, "fifties": 0.1437, "thirties": 0.0493, "teens": 0.1046, "seventies": 0.0013, "eighties": 0.0002 }, "gender": { "": 0.4881, "female_feminine": 0.1265, "male_masculine": 0.3381, "do_not_wish_to_say": 0.0344, "non-binary": 0.007, "transgender": 0.0059 }, "sentence_domain": { "general": 16, "nature_environment": 6, "media_entertainment": 2, "history_law_government": 3, "agriculture_food": 5, "technology_robotics": 3, "healthcare": 1, "service_retail": 2, "language_fundamentals": 1 } }, "users": 458, "size": 348680812, "checksum": "f6e34487c05f184cc6353d8e3d01f89c466a4cf31ff2458d669d8ad576d84504", "avgDurationSecs": 6.24, "validDurationSecs": 0, "totalHrs": 10.88, "validHrs": 0 }, "eo": { "buckets": { "dev": 0, "invalidated": 0, "other": 5702, "test": 0, "train": 0, "validated": 0 }, "duration": 40955508, "reportedSentences": 11, "validatedSentences": 181254, "unvalidatedSentences": 21774, "clips": 5702, "splits": { "accent": { "internacia": 3698 }, "variant": {}, "age": { "": 0.0465, "sixties": 0.0044, "eighties": 0.6433, "fifties": 0.2997, "twenties": 0.0053, "fourties": 0.0009 }, "gender": { "": 0.0509, "male_masculine": 0.9474, 
"female_feminine": 0.0018 }, "sentence_domain": {} }, "users": 19, "size": 255116436, "checksum": "893bf242421c7ade5f4e7e8a94f465d04c19fca4697ba206c014e6f75bee7f4a", "avgDurationSecs": 7.183, "validDurationSecs": 0, "totalHrs": 11.37, "validHrs": 0 }, "es": { "buckets": { "dev": 0, "invalidated": 0, "other": 2682, "test": 0, "train": 0, "validated": 0 }, "duration": 15050268, "reportedSentences": 5, "validatedSentences": 1082350, "unvalidatedSentences": 4883, "clips": 2682, "splits": { "accent": { "canario": 349, "centrosurpeninsular": 167, "rioplatense": 49, "mexicano": 10, "andino": 5, "surpeninsular": 1482, "nortepeninsular": 394, "": 399, "chileno": 10 }, "variant": {}, "age": { "twenties": 0.1342, "teens": 0.1536, "": 0.1577, "fifties": 0.0261, "sixties": 0.5283 }, "gender": { "": 0.456, "female_feminine": 0.0101, "male_masculine": 0.5339 }, "sentence_domain": { "history_law_government": 4, "general": 3, "news_current_affairs": 1, "nature_environment": 1, "finance": 1 } }, "users": 62, "size": 162315844, "checksum": "9e509c007795ea1c567595c00e527d02b5781f0c7fd76b73599826a235d6b2b7", "avgDurationSecs": 5.612, "validDurationSecs": 0, "totalHrs": 4.18, "validHrs": 0 }, "esu": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 6929, "unvalidatedSentences": 5, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 315041, "checksum": "b68960248bb8370fe04c5d38d848a25d2a03d61bfeb7ae910f8a2f198ac2a68a", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "et": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 11090, "unvalidatedSentences": 1876, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 
1116287, "checksum": "9b4445b9069ec7f690142d89d931d27e2d11f4bff4150ad9e5647d41273535a6", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "eto": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 966, "unvalidatedSentences": 0, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 47123, "checksum": "000418278bbe31e3e93049d218f75d16dd83f49fe0b805a4739a099604064cc4", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "eu": { "buckets": { "dev": 0, "invalidated": 0, "other": 281, "test": 0, "train": 0, "validated": 0 }, "duration": 1625076, "reportedSentences": 0, "validatedSentences": 1018163, "unvalidatedSentences": 12815, "clips": 281, "splits": { "accent": { "mendebalekoa": 135, "erdialdekoa_nafarra": 26 }, "variant": {}, "age": { "": 0.4484, "fourties": 0.5516 }, "gender": { "": 0.4484, "male_masculine": 0.4804, "female_feminine": 0.0712 }, "sentence_domain": { "general": 284 } }, "users": 6, "size": 77168125, "checksum": "191344386999daf84d4435948599b10d7226a60be281919352ad6bcb44854b9d", "avgDurationSecs": 5.783, "validDurationSecs": 0, "totalHrs": 0.45, "validHrs": 0 }, "fa": { "buckets": { "dev": 0, "invalidated": 0, "other": 886, "test": 0, "train": 0, "validated": 0 }, "duration": 2428236, "reportedSentences": 7, "validatedSentences": 57812, "unvalidatedSentences": 283439, "clips": 886, "splits": { "accent": { "": 1168 }, "variant": {}, "age": { "twenties": 0.8307, "teens": 0.0023, "": 0.1445, "thirties": 0.0226 }, "gender": { "": 0.3409, "male_masculine": 0.6591 }, "sentence_domain": { "nature_environment": 1, "general": 1, "automotive_transport": 1 } }, "users": 17, "size": 31139265, "checksum": "95e09979cff6c52b2f131b38b4c428e07df0698c8f239a07cd28592d7aa8eef7", "avgDurationSecs": 2.741, "validDurationSecs": 0, "totalHrs": 0.67, 
"validHrs": 0 }, "fan": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 1005, "unvalidatedSentences": 1, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 49060, "checksum": "908cad9aa0b7350bfb7bb2254f7b3d43741b2c4cb5032d28ad12ad75a948f2a9", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "fi": { "buckets": { "dev": 0, "invalidated": 0, "other": 35, "test": 0, "train": 0, "validated": 0 }, "duration": 200232, "reportedSentences": 0, "validatedSentences": 5865, "unvalidatedSentences": 49163, "clips": 35, "splits": { "accent": {}, "variant": {}, "age": { "": 0.4286, "fourties": 0.5714 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 3, "size": 4999288, "checksum": "03dd288fa2fb305b57b502d5a2280a5dcd9df1a0935788d93ee3cbdd089a5b66", "avgDurationSecs": 5.721, "validDurationSecs": 0, "totalHrs": 0.05, "validHrs": 0 }, "fr": { "buckets": { "dev": 199, "invalidated": 86, "other": 7720, "test": 231, "train": 323, "validated": 753 }, "duration": 44964180, "reportedSentences": 24, "validatedSentences": 1649097, "unvalidatedSentences": 43765, "clips": 8559, "splits": { "accent": { "martinique": 10, "fr-metro-east": 5, "fr-metro-west": 34, "fr-metro-south": 55, "canada": 110, "belgium": 23, "": 285, "tunisia": 25, "fr-metro-north": 5, "algeria": 5, "morocco": 15, "switzerland": 5 }, "variant": { "fr-metro": 7740, "fr-europe": 137, "fr-namerica": 110, "fr-nafrica": 40 }, "age": { "fifties": 0.083, "": 0.0672, "fourties": 0.7781, "thirties": 0.0285, "teens": 0.0025, "twenties": 0.0383, "sixties": 0.0025 }, "gender": { "male_masculine": 0.8691, "": 0.1168, "female_feminine": 0.014 }, "sentence_domain": { "media_entertainment": 1 } }, "users": 87, "size": 380533321, "checksum": "b19cf28336e60c76ccea67140885d8ca898f59bd824552a6ab3df28f93c8cce0", "avgDurationSecs": 
5.253, "validDurationSecs": 3955.839, "totalHrs": 12.49, "validHrs": 1.09 }, "fub": { "buckets": { "dev": 0, "invalidated": 0, "other": 55, "test": 0, "train": 0, "validated": 0 }, "duration": 160704, "reportedSentences": 0, "validatedSentences": 1067, "unvalidatedSentences": 0, "clips": 55, "splits": { "accent": {}, "variant": {}, "age": { "twenties": 1 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 1, "size": 1005713, "checksum": "afceb52744d66111128b1bd3f73c4021ca8a2fb3e5aeb9efc44108eea6f6c06d", "avgDurationSecs": 2.922, "validDurationSecs": 0, "totalHrs": 0.04, "validHrs": 0 }, "fy-NL": { "buckets": { "dev": 0, "invalidated": 0, "other": 14, "test": 0, "train": 0, "validated": 0 }, "duration": 56340, "reportedSentences": 1, "validatedSentences": 10902, "unvalidatedSentences": 15, "clips": 14, "splits": { "accent": {}, "variant": {}, "age": { "": 1 }, "gender": { "": 1 }, "sentence_domain": { "agriculture_food": 13, "history_law_government": 9, "media_entertainment": 13, "language_fundamentals": 4, "news_current_affairs": 1, "finance": 1, "general": 1 } }, "users": 1, "size": 1021699, "checksum": "a42cc3bc49e99313a7ba015801e45502b30b811d56378177d358d8bfd94c7a1e", "avgDurationSecs": 4.024, "validDurationSecs": 0, "totalHrs": 0.01, "validHrs": 0 }, "ga-IE": { "buckets": { "dev": 0, "invalidated": 1, "other": 100, "test": 0, "train": 4, "validated": 4 }, "duration": 680580, "reportedSentences": 0, "validatedSentences": 4883, "unvalidatedSentences": 1471, "clips": 105, "splits": { "accent": { "cainteoir_líofa_ní_ó_dhúchas": 35, "cainteoir_dúchais_gaeltacht": 35 }, "variant": { "ga-IE-chonnact": 10, "ga-IE-mumhan": 60 }, "age": { "": 0.6667, "fourties": 0.3333 }, "gender": { "": 0.6667, "female_feminine": 0.3333 }, "sentence_domain": { "general": 9, "news_current_affairs": 2, "history_law_government": 2, "media_entertainment": 2, "automotive_transport": 1, "agriculture_food": 1, "service_retail": 2 } }, "users": 9, "size": 4143437, "checksum": 
"10aa15e6beb4296205741335b93a0b77c065d4443021b4e04b4424febdc89770", "avgDurationSecs": 6.482, "validDurationSecs": 25.927, "totalHrs": 0.18, "validHrs": 0 }, "gjk": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 2004, "unvalidatedSentences": 0, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 118386, "checksum": "ef2c0e7ad7a7bf178c13e118b378de3d0de9bd6fe6cdaa97374d85be3b691a0a", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "gju": { "buckets": { "dev": 0, "invalidated": 0, "other": 1, "test": 0, "train": 0, "validated": 0 }, "duration": 3888, "reportedSentences": 0, "validatedSentences": 3852, "unvalidatedSentences": 2, "clips": 1, "splits": { "accent": {}, "variant": {}, "age": { "twenties": 1 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 1, "size": 243368, "checksum": "224afb93352f334e2c814809e03f0aa1dde53e9baaaecbb32a7d035015775912", "avgDurationSecs": 3.888, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "gl": { "buckets": { "dev": 1237, "invalidated": 495, "other": 10941, "test": 1815, "train": 4159, "validated": 7211 }, "duration": 98078076, "reportedSentences": 83, "validatedSentences": 1002019, "unvalidatedSentences": 3817, "clips": 18647, "splits": { "accent": { "central": 5880, "normativo": 4265, "oriental": 160, "atlantico": 5363, "neofalante": 1367, "": 15 }, "variant": {}, "age": { "fifties": 0.2453, "": 0.0829, "teens": 0.2245, "twenties": 0.4209, "fourties": 0.0046, "thirties": 0.0048, "sixties": 0.0169 }, "gender": { "": 0.5542, "female_feminine": 0.4458 }, "sentence_domain": { "general": 5, "agriculture_food": 1 } }, "users": 203, "size": 647743404, "checksum": "ab3895555077841aa76659f7fb5a84c201498e8b57ef6ccf7986bb8ca55cfc3b", "avgDurationSecs": 5.26, "validDurationSecs": 37927.871, "totalHrs": 27.24, "validHrs": 
10.53 }, "gn": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 5894, "unvalidatedSentences": 301, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 345415, "checksum": "bff89885263971fa32424cc35188812a693c4b2266b3f55b942f407f7517da05", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "gsw": { "buckets": { "dev": 9, "invalidated": 2, "other": 277, "test": 9, "train": 14, "validated": 63 }, "duration": 1953612, "reportedSentences": 0, "validatedSentences": 87, "unvalidatedSentences": 5, "clips": 342, "splits": { "accent": {}, "variant": { "gsw-FR-nordalem": 158, "gsw-FR-sudnalem": 5, "gsw-FR-rhinfran": 38 }, "age": { "fourties": 0.1316, "sixties": 0.5175, "thirties": 0.1257, "fifties": 0.2105, "": 0.0146 }, "gender": { "": 0.5585, "female_feminine": 0.4415 }, "sentence_domain": { "media_entertainment": 181, "general": 154, "service_retail": 14, "news_current_affairs": 10, "healthcare": 13, "history_law_government": 2 } }, "users": 14, "size": 11460447, "checksum": "0c44c4976bc48bfbc403377fc6b17fd30e2f9f742f96ed6b1736928bd65cfed1", "avgDurationSecs": 5.712, "validDurationSecs": 359.876, "totalHrs": 0.54, "validHrs": 0.09 }, "gv": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 4219, "unvalidatedSentences": 21, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 296539, "checksum": "eeeaf5fe44ef8b97f3f57821852af43e28bcb7b27a9a04065aca37e789360769", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "haz": { "buckets": { "dev": 0, "invalidated": 0, "other": 5, "test": 0, "train": 0, "validated": 0 }, "duration": 24156, "reportedSentences": 0, 
"validatedSentences": 1361, "unvalidatedSentences": 0, "clips": 5, "splits": { "accent": {}, "variant": {}, "age": { "twenties": 1 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 1, "size": 225655, "checksum": "c2bcaef5d3dfc2e36b3bb5cbd36a1aafb4ee34de7c08df29889a6bacf719906c", "avgDurationSecs": 4.831, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "he": { "buckets": { "dev": 0, "invalidated": 0, "other": 34, "test": 0, "train": 2, "validated": 2 }, "duration": 170856, "reportedSentences": 0, "validatedSentences": 4188, "unvalidatedSentences": 6836, "clips": 36, "splits": { "accent": { "": 48 }, "variant": {}, "age": { "thirties": 0.4444, "twenties": 0.1389, "": 0.4167 }, "gender": { "male_masculine": 0.4444, "female_feminine": 0.1389, "": 0.4167 }, "sentence_domain": {} }, "users": 5, "size": 1810591, "checksum": "b6067a9e98c1496def4fabf18fbd6562bcb20782c1ee30328d8cc7d4cd4d5db5", "avgDurationSecs": 4.746, "validDurationSecs": 9.492, "totalHrs": 0.04, "validHrs": 0 }, "hi": { "buckets": { "dev": 0, "invalidated": 0, "other": 525, "test": 0, "train": 0, "validated": 0 }, "duration": 3170340, "reportedSentences": 0, "validatedSentences": 32204, "unvalidatedSentences": 9965, "clips": 525, "splits": { "accent": { "": 10 }, "variant": {}, "age": { "": 0.0286, "fourties": 0.9714 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 4, "size": 21636029, "checksum": "9786ea917b771acfb1f62c85a80b1c8cfbd0c25d0957b551e5247f4203cc8f8e", "avgDurationSecs": 6.039, "validDurationSecs": 0, "totalHrs": 0.88, "validHrs": 0 }, "hsb": { "buckets": { "dev": 0, "invalidated": 0, "other": 2221, "test": 0, "train": 0, "validated": 0 }, "duration": 18995400, "reportedSentences": 0, "validatedSentences": 7182, "unvalidatedSentences": 540, "clips": 2221, "splits": { "accent": {}, "variant": {}, "age": { "fifties": 0.0252, "": 0.9703, "fourties": 0.0045 }, "gender": { "female_feminine": 0.0297, "": 0.9703 }, "sentence_domain": {} }, "users": 3, "size": 113087808, 
"checksum": "3948668a6aee7ff3a07b9a1d2c8e94b216de746ef1c3fe8dade7d9cce1226fe9", "avgDurationSecs": 8.553, "validDurationSecs": 0, "totalHrs": 5.27, "validHrs": 0 }, "ht": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 76, "unvalidatedSentences": 1629, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 104563, "checksum": "7466b216c60b685ec58c658e0757865ff7d01b4dc1c42ad9c289c3b567a225fa", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "hu": { "buckets": { "dev": 0, "invalidated": 0, "other": 36, "test": 0, "train": 0, "validated": 0 }, "duration": 228024, "reportedSentences": 0, "validatedSentences": 326531, "unvalidatedSentences": 232, "clips": 36, "splits": { "accent": { "": 30 }, "variant": {}, "age": { "": 0.4167, "eighties": 0.0278, "fourties": 0.1389, "sixties": 0.1389, "fifties": 0.2778 }, "gender": { "": 0.5556, "male_masculine": 0.1667, "female_feminine": 0.2778 }, "sentence_domain": { "general": 4, "history_law_government": 1 } }, "users": 5, "size": 23390263, "checksum": "b5d15819541c59f46af285b246ccdd870cab4acc434104f8046e874d9472c74b", "avgDurationSecs": 6.334, "validDurationSecs": 0, "totalHrs": 0.06, "validHrs": 0 }, "hy-AM": { "buckets": { "dev": 1, "invalidated": 3, "other": 809, "test": 1, "train": 2, "validated": 4 }, "duration": 4512312, "reportedSentences": 1, "validatedSentences": 237419, "unvalidatedSentences": 1377, "clips": 816, "splits": { "accent": {}, "variant": {}, "age": { "": 0.9461, "teens": 0.0539 }, "gender": { "": 0.989, "male_masculine": 0.011 }, "sentence_domain": {} }, "users": 145, "size": 44492527, "checksum": "53f3b83f290e012be25d8c4677c285b8f241e9bc0244edae058b0cb48f2846c7", "avgDurationSecs": 5.53, "validDurationSecs": 22.119, "totalHrs": 1.25, "validHrs": 0 }, "ia": { "buckets": { "dev": 0, "invalidated": 0, 
"other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 9069, "unvalidatedSentences": 144, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 510707, "checksum": "6d26381acdb788717283097fcf646c1434bd1588cd50b635f9efb3b905394bf1", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "id": { "buckets": { "dev": 0, "invalidated": 0, "other": 900, "test": 0, "train": 0, "validated": 0 }, "duration": 2826684, "reportedSentences": 0, "validatedSentences": 13321, "unvalidatedSentences": 11861, "clips": 900, "splits": { "accent": { "": 50 }, "variant": {}, "age": { "twenties": 0.8278, "": 0.1111, "thirties": 0.0611 }, "gender": { "": 0.8622, "female_feminine": 0.1267, "do_not_wish_to_say": 0.0111 }, "sentence_domain": {} }, "users": 13, "size": 17991685, "checksum": "f6dd5e2c9d2814210061b321be1521f1f81be9293cb81adb814eaee9e5564e11", "avgDurationSecs": 3.141, "validDurationSecs": 0, "totalHrs": 0.78, "validHrs": 0 }, "ig": { "buckets": { "dev": 37, "invalidated": 12, "other": 3444, "test": 65, "train": 130, "validated": 232 }, "duration": 19626480, "reportedSentences": 26, "validatedSentences": 15294, "unvalidatedSentences": 3099, "clips": 3688, "splits": { "accent": { "": 996 }, "variant": {}, "age": { "": 0.0241, "teens": 0.1762, "twenties": 0.5488, "thirties": 0.2508 }, "gender": { "": 0.1329, "female_feminine": 0.8671 }, "sentence_domain": { "agriculture_food": 1, "media_entertainment": 1, "language_fundamentals": 1 } }, "users": 26, "size": 111667171, "checksum": "e0af5fcd2bcad7455d56149ad2e352f836ef440a68289572bbf624ed02a8fac6", "avgDurationSecs": 5.322, "validDurationSecs": 1234.638, "totalHrs": 5.45, "validHrs": 0.34 }, "is": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 5083, "unvalidatedSentences": 113, 
"clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 379284, "checksum": "efe4ae19b47fc5772f9ab254bda4265a736c2bd90deed6f90497af64d6b3f266", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "it": { "buckets": { "dev": 0, "invalidated": 0, "other": 1601, "test": 2, "train": 20, "validated": 22 }, "duration": 9336456, "reportedSentences": 0, "validatedSentences": 925138, "unvalidatedSentences": 908, "clips": 1623, "splits": { "accent": { "": 227 }, "variant": {}, "age": { "sixties": 0.5742, "": 0.2914, "fourties": 0.0123, "twenties": 0.1066, "fifties": 0.0062, "teens": 0.0092 }, "gender": { "male_masculine": 0.5835, "": 0.4165 }, "sentence_domain": {} }, "users": 19, "size": 119182588, "checksum": "56f8695881ba684a95df8457bd1150c86cbac718d6ff1f5f66ebc5887c403cce", "avgDurationSecs": 5.753, "validDurationSecs": 126.557, "totalHrs": 2.59, "validHrs": 0.03 }, "ja": { "buckets": { "dev": 0, "invalidated": 249, "other": 26657, "test": 0, "train": 0, "validated": 0 }, "duration": 113164416, "reportedSentences": 6, "validatedSentences": 43942, "unvalidatedSentences": 7265, "clips": 26906, "splits": { "accent": { "": 3197 }, "variant": {}, "age": { "fifties": 0.0827, "": 0.1658, "fourties": 0.1247, "twenties": 0.4505, "thirties": 0.1076, "teens": 0.0293, "sixties": 0.0394 }, "gender": { "female_feminine": 0.4866, "do_not_wish_to_say": 0.0234, "": 0.4706, "non-binary": 0.0044, "intersex": 0.0027, "male_masculine": 0.0123 }, "sentence_domain": { "general": 2817, "agriculture_food": 1021, "automotive_transport": 242, "healthcare": 379, "nature_environment": 222, "media_entertainment": 299, "language_fundamentals": 296, "finance": 209, "news_current_affairs": 97, "history_law_government": 229, "service_retail": 90, "technology_robotics": 32 } }, "users": 338, "size": 667850538, "checksum": "b76d4a72b8f65bdf4e36e3ad8cc28260fbead7094b775fb94f5b176af6576ca8", "avgDurationSecs": 4.206, 
"validDurationSecs": 0, "totalHrs": 31.43, "validHrs": 0 }, "ka": { "buckets": { "dev": 0, "invalidated": 0, "other": 59, "test": 0, "train": 0, "validated": 0 }, "duration": 365040, "reportedSentences": 0, "validatedSentences": 132630, "unvalidatedSentences": 1891, "clips": 59, "splits": { "accent": {}, "variant": {}, "age": { "thirties": 0.1695, "teens": 0.0169, "": 0.8136 }, "gender": { "female_feminine": 0.1695, "male_masculine": 0.0169, "": 0.8136 }, "sentence_domain": { "automotive_transport": 1 } }, "users": 9, "size": 13029189, "checksum": "c5a14653c6efc68373cf7f1dbe39be86d6a6620d08ff5561bcc0e62c79d86d97", "avgDurationSecs": 6.187, "validDurationSecs": 0, "totalHrs": 0.1, "validHrs": 0 }, "kab": { "buckets": { "dev": 0, "invalidated": 0, "other": 38, "test": 0, "train": 0, "validated": 0 }, "duration": 218916, "reportedSentences": 14, "validatedSentences": 182810, "unvalidatedSentences": 205847, "clips": 38, "splits": { "accent": { "": 5 }, "variant": {}, "age": { "": 0.6053, "twenties": 0.3947 }, "gender": { "": 0.6053, "male_masculine": 0.3947 }, "sentence_domain": {} }, "users": 5, "size": 24184598, "checksum": "fc6c2b714dc3f536faca87e4160c409b64c85fd8e3dc0b442760592b873f4e73", "avgDurationSecs": 5.761, "validDurationSecs": 0, "totalHrs": 0.06, "validHrs": 0 }, "kbd": { "buckets": { "dev": 6039, "invalidated": 74, "other": 8376, "test": 6749, "train": 13886, "validated": 47672 }, "duration": 350012088, "reportedSentences": 636, "validatedSentences": 30253, "unvalidatedSentences": 2390, "clips": 56122, "splits": { "accent": { "bakhsan": 9795, "terek": 8202, "": 5, "kuban_zelenchuk": 1206, "kuban": 150, "malka": 35, "besleney": 6 }, "variant": { "kbd-Cyrl": 34472, "kbd-RU": 1322 }, "age": { "fifties": 0.2327, "": 0.1179, "twenties": 0.3424, "thirties": 0.202, "fourties": 0.0637, "sixties": 0.0019, "teens": 0.0395 }, "gender": { "": 0.6869, "female_feminine": 0.313, "male_masculine": 0.0001 }, "sentence_domain": { "general": 58162, "history_law_government": 
44, "agriculture_food": 3, "nature_environment": 2, "media_entertainment": 2, "news_current_affairs": 1, "automotive_transport": 1 } }, "users": 97, "size": 2014820740, "checksum": "7b6c8bcec05f59338ac7cff86d333288e3405eb3209dcfa3450d4e6e802eff4d", "avgDurationSecs": 6.237, "validDurationSecs": 297312.574, "totalHrs": 97.22, "validHrs": 82.58 }, "khw": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 7051, "unvalidatedSentences": 200, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 547008, "checksum": "d0a5d99abd16419bea97c3174e2d33b991f5382cf3f9ca9fe76e05ef720d2960", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "kk": { "buckets": { "dev": 0, "invalidated": 4, "other": 74, "test": 0, "train": 6, "validated": 6 }, "duration": 454572, "reportedSentences": 1, "validatedSentences": 6229, "unvalidatedSentences": 241, "clips": 84, "splits": { "accent": { "": 59 }, "variant": {}, "age": { "": 1 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 8, "size": 3087610, "checksum": "10a2eca0d05077de517d3672e4e6cdb7608ff603959cdbbdb76d97371c1795ec", "avgDurationSecs": 5.412, "validDurationSecs": 32.469, "totalHrs": 0.12, "validHrs": 0 }, "kls": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 3912, "unvalidatedSentences": 4, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 182167, "checksum": "dfe6d0e5d9921b45666ea93838ebd30f10831ebf0cd528b403f30eca5d0686dc", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "kmr": { "buckets": { "dev": 0, "invalidated": 0, "other": 1073, "test": 0, "train": 0, "validated": 0 }, "duration": 6316776, "reportedSentences": 
2, "validatedSentences": 17030, "unvalidatedSentences": 1132, "clips": 1073, "splits": { "accent": {}, "variant": {}, "age": { "fourties": 0.0186, "": 0.0261, "fifties": 0.2283, "twenties": 0.7269 }, "gender": { "male_masculine": 0.0186, "": 0.9814 }, "sentence_domain": { "nature_environment": 2, "general": 2, "healthcare": 1, "agriculture_food": 1, "media_entertainment": 1 } }, "users": 9, "size": 38234707, "checksum": "0e524bcd1c5380f9067b727789455f3a327b2d94a5304ceefa22f4a949a5d82f", "avgDurationSecs": 5.887, "validDurationSecs": 0, "totalHrs": 1.75, "validHrs": 0 }, "ko": { "buckets": { "dev": 0, "invalidated": 1, "other": 105, "test": 0, "train": 0, "validated": 0 }, "duration": 363132, "reportedSentences": 0, "validatedSentences": 7401, "unvalidatedSentences": 2623, "clips": 106, "splits": { "accent": { "": 11 }, "variant": {}, "age": { "twenties": 0.3868, "": 0.6132 }, "gender": { "female_feminine": 0.0943, "": 0.9057 }, "sentence_domain": {} }, "users": 6, "size": 2759111, "checksum": "0697b919890c81e3376e2b3145ac3df99495977f4b93fd59997bbf90f2a6bc92", "avgDurationSecs": 3.426, "validDurationSecs": 0, "totalHrs": 0.1, "validHrs": 0 }, "ky": { "buckets": { "dev": 0, "invalidated": 0, "other": 5, "test": 0, "train": 0, "validated": 0 }, "duration": 27828, "reportedSentences": 0, "validatedSentences": 5047, "unvalidatedSentences": 240, "clips": 5, "splits": { "accent": {}, "variant": {}, "age": { "": 1 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 1, "size": 526683, "checksum": "7935c37242cff7ed11ea110ce6a1a844fc472bfa04e19cc9a4cff89bb3fb0c04", "avgDurationSecs": 5.566, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "lg": { "buckets": { "dev": 0, "invalidated": 0, "other": 10, "test": 0, "train": 0, "validated": 0 }, "duration": 71172, "reportedSentences": 0, "validatedSentences": 191409, "unvalidatedSentences": 179, "clips": 10, "splits": { "accent": {}, "variant": {}, "age": { "": 1 }, "gender": { "": 1 }, "sentence_domain": {} }, 
"users": 2, "size": 12708652, "checksum": "e9a1884c2881c9d31024b8b2fcac5b7a730efae6cc1e81544052fad68e5fe951", "avgDurationSecs": 7.117, "validDurationSecs": 0, "totalHrs": 0.01, "validHrs": 0 }, "lo": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 5145, "unvalidatedSentences": 5926, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 873643, "checksum": "b8fc68296422ca249667bb23ce5ec23ea3024233425db2082862ac8688d6ba24", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "lt": { "buckets": { "dev": 0, "invalidated": 0, "other": 24, "test": 0, "train": 0, "validated": 0 }, "duration": 149112, "reportedSentences": 0, "validatedSentences": 129304, "unvalidatedSentences": 4821, "clips": 24, "splits": { "accent": {}, "variant": {}, "age": { "": 0.625, "fourties": 0.375 }, "gender": { "": 0.625, "male_masculine": 0.375 }, "sentence_domain": {} }, "users": 3, "size": 9654441, "checksum": "118b4b717a2d52e4c5146962b10bc9839f9a94d2c218e4786cacf5d2907f68c0", "avgDurationSecs": 6.213, "validDurationSecs": 0, "totalHrs": 0.04, "validHrs": 0 }, "ltg": { "buckets": { "dev": 0, "invalidated": 0, "other": 113, "test": 0, "train": 3, "validated": 3 }, "duration": 833400, "reportedSentences": 1, "validatedSentences": 15086, "unvalidatedSentences": 2268, "clips": 116, "splits": { "accent": {}, "variant": {}, "age": { "thirties": 0.7845, "": 0.2155 }, "gender": { "female_feminine": 0.7845, "": 0.2155 }, "sentence_domain": { "general": 138, "agriculture_food": 1, "history_law_government": 23, "service_retail": 1 } }, "users": 6, "size": 6152790, "checksum": "a7fb80061ece7239b20047135af2e7bc27a75526b3103c0735301f6eceb1678d", "avgDurationSecs": 7.184, "validDurationSecs": 21.553, "totalHrs": 0.23, "validHrs": 0 }, "lv": { "buckets": { "dev": 0, "invalidated": 0, "other": 946, "test": 
0, "train": 0, "validated": 0 }, "duration": 6078600, "reportedSentences": 0, "validatedSentences": 34635, "unvalidatedSentences": 64972, "clips": 946, "splits": { "accent": { "": 155 }, "variant": {}, "age": { "": 0.1184, "thirties": 0.1057, "fifties": 0.0211, "fourties": 0.0899, "twenties": 0.6649 }, "gender": { "": 0.2188, "female_feminine": 0.7548, "male_masculine": 0.0264 }, "sentence_domain": { "general": 28, "technology_robotics": 2, "nature_environment": 1, "history_law_government": 1, "media_entertainment": 1 } }, "users": 21, "size": 40953596, "checksum": "536fe8e784704957cdfd81cfed38052faf6e8c0a783b3744ec7259e41a1a436f", "avgDurationSecs": 6.426, "validDurationSecs": 0, "totalHrs": 1.68, "validHrs": 0 }, "lzz": { "buckets": { "dev": 2700, "invalidated": 535, "other": 862, "test": 3187, "train": 4473, "validated": 14293 }, "duration": 74556324, "reportedSentences": 16, "validatedSentences": 30216, "unvalidatedSentences": 16402, "clips": 15690, "splits": { "accent": { "vitzuri": 2496, "xopuri": 4325, "": 3983, "atinuri": 593, "arkaburi": 162, "artaşenuri": 2667 }, "variant": { "lzz-arkaburi": 2904, "lzz-xopuri": 4812, "lzz-atinuri": 4274 }, "age": { "sixties": 0.2081, "fifties": 0.2693, "fourties": 0.1115, "": 0.3474, "thirties": 0.0369, "twenties": 0.0256, "teens": 0.001, "seventies": 0.0002 }, "gender": { "": 0.7602, "female_feminine": 0.2398 }, "sentence_domain": { "general": 16252, "nature_environment": 850, "news_current_affairs": 232, "language_fundamentals": 37, "finance": 4, "agriculture_food": 6, "healthcare": 4, "automotive_transport": 4 } }, "users": 84, "size": 427822859, "checksum": "46014a19d0ca2834a2bea0d180a6f914661f50f4d0e8899366181a58e631610b", "avgDurationSecs": 4.752, "validDurationSecs": 67918.008, "totalHrs": 20.71, "validHrs": 18.86 }, "mcf": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 997, "unvalidatedSentences": 0, 
"clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 47883, "checksum": "536d9a0199ffdc005fa9ed53220b5bcdcff5e9ff274102ea09ff626f9be69570", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "mcn": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 959, "unvalidatedSentences": 0, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 54119, "checksum": "a74bcebbe2f6b4e7827fc4ee23a2b510cd9fa88b234919a571725e91ae7d8c95", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "mgg": { "buckets": { "dev": 0, "invalidated": 0, "other": 5, "test": 0, "train": 0, "validated": 0 }, "duration": 30348, "reportedSentences": 0, "validatedSentences": 991, "unvalidatedSentences": 0, "clips": 5, "splits": { "accent": {}, "variant": {}, "age": { "": 1 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 1, "size": 209303, "checksum": "e47579c1f2e74652b21576309c50496062dbc40a8ae48acaadd132293d5e75af", "avgDurationSecs": 6.07, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "mk": { "buckets": { "dev": 0, "invalidated": 0, "other": 227, "test": 0, "train": 0, "validated": 0 }, "duration": 1101600, "reportedSentences": 0, "validatedSentences": 6123, "unvalidatedSentences": 571, "clips": 227, "splits": { "accent": { "": 66 }, "variant": {}, "age": { "thirties": 0.1013, "": 0.0661, "twenties": 0.8326 }, "gender": { "male_masculine": 0.1013, "": 0.8987 }, "sentence_domain": { "history_law_government": 2, "general": 8, "technology_robotics": 1 } }, "users": 3, "size": 6976707, "checksum": "9f219159c8c2a979c70eb0766ab3c1a620427b27206a1386c852da73b4d74936", "avgDurationSecs": 4.853, "validDurationSecs": 0, "totalHrs": 0.3, "validHrs": 0 }, "mki": { "buckets": { "dev": 0, "invalidated": 0, 
"other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 2055, "unvalidatedSentences": 0, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 102551, "checksum": "cc8449fe73e20ecdd3ccc00d2194859299fb2df7747ba8bd69b3f0760fac393a", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ml": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 5487, "unvalidatedSentences": 4728, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 797795, "checksum": "712a9391a15b5f4979fe84b03b39b0144bc76886657fccf3f76c52ce5b618781", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "mn": { "buckets": { "dev": 0, "invalidated": 0, "other": 14, "test": 0, "train": 0, "validated": 0 }, "duration": 72396, "reportedSentences": 0, "validatedSentences": 6099, "unvalidatedSentences": 15, "clips": 14, "splits": { "accent": {}, "variant": {}, "age": { "twenties": 0.5714, "": 0.4286 }, "gender": { "": 1 }, "sentence_domain": { "healthcare": 2, "history_law_government": 1, "general": 2, "nature_environment": 4, "media_entertainment": 1, "language_fundamentals": 1, "news_current_affairs": 1 } }, "users": 3, "size": 906606, "checksum": "23b3173b107929f514b666b6c5b12b056c3c9853f77745daf9474a8b20217453", "avgDurationSecs": 5.171, "validDurationSecs": 0, "totalHrs": 0.02, "validHrs": 0 }, "mr": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 6717, "unvalidatedSentences": 12102, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 1547961, "checksum": 
"d6a11c4600ee939e2cc1a1b2bcd14cac58719b1d1fa60aedb8f57945bc10063a", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ms": { "buckets": { "dev": 0, "invalidated": 0, "other": 35, "test": 0, "train": 0, "validated": 0 }, "duration": 259056, "reportedSentences": 0, "validatedSentences": 257, "unvalidatedSentences": 41, "clips": 35, "splits": { "accent": {}, "variant": {}, "age": { "": 1 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 1, "size": 1191423, "checksum": "2bfb74cfda180b1430c5c516baf4df1e5b2cf36f3466497b37fcef980734b4a5", "avgDurationSecs": 7.402, "validDurationSecs": 0, "totalHrs": 0.07, "validHrs": 0 }, "mt": { "buckets": { "dev": 0, "invalidated": 0, "other": 140, "test": 0, "train": 0, "validated": 0 }, "duration": 572652, "reportedSentences": 0, "validatedSentences": 5274, "unvalidatedSentences": 11772, "clips": 140, "splits": { "accent": { "": 140 }, "variant": {}, "age": { "": 0.0714, "twenties": 0.9286 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 2, "size": 4507545, "checksum": "a39d43501fda697e86d9f278afd6fe5eef9cda0d4020c8d9dca74448f3a24781", "avgDurationSecs": 4.09, "validDurationSecs": 0, "totalHrs": 0.15, "validHrs": 0 }, "mua": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 995, "unvalidatedSentences": 0, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 53699, "checksum": "3c74be9ea0e4250290031ddb7cf8ed06f0505eb0db7cff0c16bd53483cefc492", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "mvy": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 6634, "unvalidatedSentences": 222, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, 
"users": 0, "size": 459128, "checksum": "9e3791acbe36de80e0c6e16e54d103036ce0f209dcd72ed0265dc15cdbb04389", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "myv": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 5066, "unvalidatedSentences": 7, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 373629, "checksum": "2c61bf5a86d30065c617e771962cbf9d036a29004a82c5acb8414216161d42f9", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "nan-tw": { "buckets": { "dev": 0, "invalidated": 2, "other": 177, "test": 0, "train": 1, "validated": 1 }, "duration": 706176, "reportedSentences": 0, "validatedSentences": 26907, "unvalidatedSentences": 370, "clips": 180, "splits": { "accent": { "": 235 }, "variant": { "nan-TW-tailo": 145 }, "age": { "": 0.0278, "thirties": 0.1667, "teens": 0.8056 }, "gender": { "": 0.8333, "male_masculine": 0.1667 }, "sentence_domain": { "general": 180 } }, "users": 3, "size": 4612672, "checksum": "202daf592df3bf01d639aede8310a5619282d955fc7fcad254f3fad42a149b89", "avgDurationSecs": 3.923, "validDurationSecs": 3.923, "totalHrs": 0.19, "validHrs": 0 }, "nb-NO": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 4403, "unvalidatedSentences": 3665, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 513288, "checksum": "86f91e972daf3873fb80344f32c7bd284ba514fc114ff00742386e3d176d6947", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ncx": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 1518, 
"unvalidatedSentences": 5, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 82145, "checksum": "80bb0db439316bb459611dc75367543c636361827ef82bffd885e72544ade540", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ne-NP": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 1316, "unvalidatedSentences": 30, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 88910, "checksum": "2f542af55f15602c0f795f2bc2cb6a8d36d576bea16e81983a558f9fe5785c54", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "nl": { "buckets": { "dev": 0, "invalidated": 2, "other": 382, "test": 0, "train": 1, "validated": 1 }, "duration": 1728288, "reportedSentences": 2, "validatedSentences": 260894, "unvalidatedSentences": 10519, "clips": 385, "splits": { "accent": { "belgium": 10, "netherlands": 355 }, "variant": {}, "age": { "": 0.1299, "twenties": 0.1299, "fourties": 0.7403 }, "gender": { "": 0.2597, "male_masculine": 0.7403 }, "sentence_domain": { "general": 87, "language_fundamentals": 4, "news_current_affairs": 1, "agriculture_food": 2, "finance": 1, "nature_environment": 1, "history_law_government": 4, "media_entertainment": 1 } }, "users": 10, "size": 27545782, "checksum": "41cf57b90dad910664976313b02137c6e47835c06b0569aba5c12a5e498e65f6", "avgDurationSecs": 4.489, "validDurationSecs": 4.489, "totalHrs": 0.48, "validHrs": 0 }, "nn-NO": { "buckets": { "dev": 0, "invalidated": 0, "other": 179, "test": 0, "train": 0, "validated": 0 }, "duration": 869184, "reportedSentences": 18, "validatedSentences": 5318, "unvalidatedSentences": 6, "clips": 179, "splits": { "accent": {}, "variant": {}, "age": { "": 1 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 1, "size": 5423488, "checksum": 
"a3b858358c9c0a70e05bbac09812ac73b2d2b1481f0287ddcfb153ed34ace41d", "avgDurationSecs": 4.856, "validDurationSecs": 0, "totalHrs": 0.24, "validHrs": 0 }, "nso": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 2247, "unvalidatedSentences": 14224, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 1020016, "checksum": "d1a6bdf04509f618da67d2ee5fe243c217f3c26dd96272cb8320bbb000283921", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "oc": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 4934, "unvalidatedSentences": 14, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 303525, "checksum": "98b2b0b6359cc5afeef62853f660bd4af64fe1f0095e3c30d15c2733dda1095f", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "or": { "buckets": { "dev": 0, "invalidated": 1, "other": 102, "test": 0, "train": 0, "validated": 0 }, "duration": 605772, "reportedSentences": 0, "validatedSentences": 6201, "unvalidatedSentences": 86, "clips": 103, "splits": { "accent": { "": 118 }, "variant": {}, "age": { "twenties": 0.8544, "fourties": 0.1456 }, "gender": { "male_masculine": 0.8544, "": 0.1456 }, "sentence_domain": {} }, "users": 2, "size": 4030814, "checksum": "006ea36c10871e179f2fd944b29c3d2151c068bc0ba00e89ea1b3561748a4766", "avgDurationSecs": 5.881, "validDurationSecs": 0, "totalHrs": 0.16, "validHrs": 0 }, "oru": { "buckets": { "dev": 0, "invalidated": 10, "other": 0, "test": 90, "train": 1700, "validated": 1790 }, "duration": 12731148, "reportedSentences": 0, "validatedSentences": 7355, "unvalidatedSentences": 902, "clips": 1800, "splits": { "accent": {}, "variant": {}, "age": { 
"": 1 }, "gender": { "": 1 }, "sentence_domain": { "general": 11, "agriculture_food": 11, "language_fundamentals": 11 } }, "users": 2, "size": 75919389, "checksum": "07155e2d40c3f475d0d5433de7930844a3173246c21989d3f6a773f1ba03598d", "avgDurationSecs": 7.073, "validDurationSecs": 12660.419, "totalHrs": 3.53, "validHrs": 3.51 }, "os": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 1888, "unvalidatedSentences": 34, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 145530, "checksum": "d3334af4fa0502264dba9f9f0cfeedd46ecdcef6d27ae04f5b0069998c031194", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "phl": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 4745, "unvalidatedSentences": 1184, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 376745, "checksum": "93c3ebff294c4781041b4d3ed4c05594720d61e440e01bdb698ec019bf663ef4", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "pl": { "buckets": { "dev": 0, "invalidated": 14, "other": 499, "test": 8, "train": 194, "validated": 202 }, "duration": 3155616, "reportedSentences": 0, "validatedSentences": 230282, "unvalidatedSentences": 23597, "clips": 715, "splits": { "accent": { "": 620 }, "variant": {}, "age": { "": 0.1455, "fifties": 0.0629, "twenties": 0.7636, "thirties": 0.007, "teens": 0.021 }, "gender": { "": 0.9161, "female_feminine": 0.0629, "transgender": 0.021 }, "sentence_domain": { "finance": 1, "technology_robotics": 1, "agriculture_food": 3, "general": 1 } }, "users": 11, "size": 36402818, "checksum": "b6003d257715f8f915e73a1a79a6ed235955cff9ec4cf923d276d0962f91c34a", "avgDurationSecs": 4.413, 
"validDurationSecs": 891.517, "totalHrs": 0.87, "validHrs": 0.24 }, "ps": { "buckets": { "dev": 13514, "invalidated": 74228, "other": 1900301, "test": 13567, "train": 73714, "validated": 470713 }, "duration": 9308562948, "reportedSentences": 1931, "validatedSentences": 334539, "unvalidatedSentences": 116181, "clips": 2445242, "splits": { "accent": { "": 250584 }, "variant": {}, "age": { "": 0.0234, "thirties": 0.1708, "twenties": 0.6239, "teens": 0.1463, "fourties": 0.0218, "fifties": 0.0137, "sixties": 0.0002, "seventies": 0 }, "gender": { "": 0.3157, "female_feminine": 0.6756, "male_masculine": 0.0077, "do_not_wish_to_say": 0.001 }, "sentence_domain": { "general": 287888, "nature_environment": 67529, "service_retail": 10349, "automotive_transport": 2431, "history_law_government": 94535, "language_fundamentals": 33571, "technology_robotics": 15638, "agriculture_food": 17553, "media_entertainment": 17719, "healthcare": 77755, "finance": 4001, "news_current_affairs": 8001 } }, "users": 1905, "size": 51980581747, "checksum": "4744522fd2f5c345e6a982af40d9c181bcc92fc90b69030276dc1ff56203a13c", "avgDurationSecs": 3.807, "validDurationSecs": 1791913.271, "totalHrs": 2585.71, "validHrs": 497.75 }, "pt": { "buckets": { "dev": 0, "invalidated": 0, "other": 506, "test": 0, "train": 0, "validated": 0 }, "duration": 2276784, "reportedSentences": 0, "validatedSentences": 43613, "unvalidatedSentences": 108, "clips": 506, "splits": { "accent": { "": 105 }, "variant": { "pt-BR": 120, "pt-PT": 325 }, "age": { "": 0.1502, "thirties": 0.1087, "sixties": 0.0296, "fourties": 0.6423, "fifties": 0.0099, "twenties": 0.0593 }, "gender": { "": 1 }, "sentence_domain": { "technology_robotics": 12, "general": 69, "healthcare": 15, "agriculture_food": 51, "media_entertainment": 6, "nature_environment": 4, "service_retail": 1, "history_law_government": 1 } }, "users": 14, "size": 15877618, "checksum": "b21bbc1079ede6cfd73d2f14637eff991efab6be05672a2c95ef95094f82e57e", "avgDurationSecs": 4.5, 
"validDurationSecs": 0, "totalHrs": 0.63, "validHrs": 0 }, "qvi": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 1149, "unvalidatedSentences": 0, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 62961, "checksum": "708c13b0b9ef3df1cd0bcc10c2c001a4fc5c6d9b811a7561ce040fccd409c616", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "qwa": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 1000, "unvalidatedSentences": 0, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 53365, "checksum": "768a1ecf3b429540a70b2dbcc6658fb77df3ca10585626ddcefd0b22e2008d31", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "qxa": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 999, "unvalidatedSentences": 0, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 53301, "checksum": "04c1e9105a996809c9c0c141928d3eb574b5c3a6245464e1729e752298b5e8ba", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "qxp": { "buckets": { "dev": 679, "invalidated": 2319, "other": 83, "test": 679, "train": 709, "validated": 13947 }, "duration": 83444796, "reportedSentences": 144, "validatedSentences": 2067, "unvalidatedSentences": 3, "clips": 16349, "splits": { "accent": { "": 2079 }, "variant": {}, "age": { "twenties": 0.4099, "thirties": 0.2902, "fourties": 0.1427, "": 0.0121, "fifties": 0.1188, "sixties": 0.0109, "teens": 0.0154 }, "gender": { "female_feminine": 0.7924, "": 0.1841, 
"male_masculine": 0.0235 }, "sentence_domain": { "healthcare": 12779, "general": 12755, "agriculture_food": 16259, "nature_environment": 6264, "history_law_government": 857, "news_current_affairs": 103, "finance": 68 } }, "users": 68, "size": 479174917, "checksum": "cde3bc13bfe18692e9d9070a1534ad6efa9b0fb705aea9932a163806d3496fa1", "avgDurationSecs": 5.104, "validDurationSecs": 71185.061, "totalHrs": 23.17, "validHrs": 19.77 }, "rm-sursilv": { "buckets": { "dev": 0, "invalidated": 0, "other": 459, "test": 0, "train": 0, "validated": 0 }, "duration": 2029860, "reportedSentences": 2, "validatedSentences": 9333, "unvalidatedSentences": 7170, "clips": 459, "splits": { "accent": {}, "variant": {}, "age": { "": 0.0218, "twenties": 0.9782 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 2, "size": 12946628, "checksum": "be0cc726c10fd2725bf37b5c799fc4aafa8fb2d7f580d8d3cb6adb5a0704d8f4", "avgDurationSecs": 4.422, "validDurationSecs": 0, "totalHrs": 0.56, "validHrs": 0 }, "ro": { "buckets": { "dev": 0, "invalidated": 0, "other": 840, "test": 0, "train": 0, "validated": 0 }, "duration": 3704796, "reportedSentences": 31, "validatedSentences": 13755, "unvalidatedSentences": 89202, "clips": 840, "splits": { "accent": {}, "variant": {}, "age": { "": 1 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 3, "size": 27661146, "checksum": "0fc4d20c5e07078547ed72146658844273c41e5cadd73bd15cabaca2724ae438", "avgDurationSecs": 4.41, "validDurationSecs": 0, "totalHrs": 1.02, "validHrs": 0 }, "ru": { "buckets": { "dev": 1, "invalidated": 3, "other": 638, "test": 1, "train": 3, "validated": 5 }, "duration": 3716820, "reportedSentences": 2, "validatedSentences": 47850, "unvalidatedSentences": 242, "clips": 646, "splits": { "accent": { "": 46 }, "variant": {}, "age": { "": 0.5604, "twenties": 0.0542, "thirties": 0.291, "teens": 0.0851, "fourties": 0.0077, "sixties": 0.0015 }, "gender": { "": 0.6749, "male_masculine": 0.2755, "female_feminine": 0.0495 }, "sentence_domain": { 
"media_entertainment": 4, "general": 2, "healthcare": 1, "technology_robotics": 7, "news_current_affairs": 2, "history_law_government": 4, "nature_environment": 1 } }, "users": 62, "size": 25219746, "checksum": "4cf52f61e5e19263a88370f9c3259c2d7725a9c63f15587cc73a61839b1f97f9", "avgDurationSecs": 5.754, "validDurationSecs": 28.768, "totalHrs": 1.03, "validHrs": 0 }, "rup": { "buckets": { "dev": 0, "invalidated": 0, "other": 110, "test": 0, "train": 0, "validated": 0 }, "duration": 837360, "reportedSentences": 0, "validatedSentences": 3507, "unvalidatedSentences": 0, "clips": 110, "splits": { "accent": {}, "variant": {}, "age": { "thirties": 1 }, "gender": { "female_feminine": 1 }, "sentence_domain": {} }, "users": 1, "size": 5185101, "checksum": "497ccb803c4ce67becf511391646fa49240820c8bbc21b5ec8236fb8324ca303", "avgDurationSecs": 7.612, "validDurationSecs": 0, "totalHrs": 0.23, "validHrs": 0 }, "rw": { "buckets": { "dev": 0, "invalidated": 0, "other": 30, "test": 0, "train": 0, "validated": 0 }, "duration": 202212, "reportedSentences": 0, "validatedSentences": 1404857, "unvalidatedSentences": 28, "clips": 30, "splits": { "accent": { "": 25 }, "variant": {}, "age": { "": 0.3333, "twenties": 0.6667 }, "gender": { "": 0.3333, "female_feminine": 0.6667 }, "sentence_domain": {} }, "users": 2, "size": 89237875, "checksum": "11ca8c60bc25a55e47851734bbed2d38b25e00b6af760066c29d6df4b862381e", "avgDurationSecs": 6.74, "validDurationSecs": 0, "totalHrs": 0.05, "validHrs": 0 }, "sah": { "buckets": { "dev": 0, "invalidated": 0, "other": 75, "test": 0, "train": 0, "validated": 0 }, "duration": 595584, "reportedSentences": 0, "validatedSentences": 6079, "unvalidatedSentences": 382, "clips": 75, "splits": { "accent": { "": 50 }, "variant": {}, "age": { "": 0.3333, "fifties": 0.6, "fourties": 0.0667 }, "gender": { "": 0.4, "female_feminine": 0.6 }, "sentence_domain": {} }, "users": 5, "size": 3594686, "checksum": "85324cf3f8b667487f913861d147a518d21a2c9379d1536e420c2295091198da", 
"avgDurationSecs": 7.941, "validDurationSecs": 0, "totalHrs": 0.16, "validHrs": 0 }, "sat": { "buckets": { "dev": 0, "invalidated": 0, "other": 5, "test": 0, "train": 0, "validated": 0 }, "duration": 25164, "reportedSentences": 0, "validatedSentences": 5421, "unvalidatedSentences": 540, "clips": 5, "splits": { "accent": {}, "variant": {}, "age": { "": 1 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 1, "size": 426421, "checksum": "947e82fcca0d65fefd6aa4c6ace740114321c82c95bab8f39e4c5222d5ff611a", "avgDurationSecs": 5.033, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "sc": { "buckets": { "dev": 0, "invalidated": 0, "other": 7, "test": 0, "train": 0, "validated": 0 }, "duration": 54036, "reportedSentences": 0, "validatedSentences": 5489, "unvalidatedSentences": 349, "clips": 7, "splits": { "accent": {}, "variant": {}, "age": { "": 0.2857, "sixties": 0.7143 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 2, "size": 685386, "checksum": "eb5badb408865ae135f2d8afc12daa2bc380f6abc41ea59b3b6759b93c1bae53", "avgDurationSecs": 7.719, "validDurationSecs": 0, "totalHrs": 0.01, "validHrs": 0 }, "sd": { "buckets": { "dev": 0, "invalidated": 35, "other": 7200, "test": 0, "train": 0, "validated": 0 }, "duration": 28352808, "reportedSentences": 3, "validatedSentences": 13348, "unvalidatedSentences": 71, "clips": 7235, "splits": { "accent": {}, "variant": {}, "age": { "thirties": 0.2556, "fourties": 0.7385, "": 0.0059 }, "gender": { "male_masculine": 0.2556, "": 0.7444 }, "sentence_domain": { "general": 21 } }, "users": 7, "size": 149729813, "checksum": "bed7e64c7fac815cf5b9c81bc833234c3ca8dfff005dc7058ae4d7b8c74c4c60", "avgDurationSecs": 3.919, "validDurationSecs": 0, "totalHrs": 7.87, "validHrs": 0 }, "sei": { "buckets": { "dev": 0, "invalidated": 0, "other": 50, "test": 0, "train": 0, "validated": 0 }, "duration": 114480, "reportedSentences": 0, "validatedSentences": 1615, "unvalidatedSentences": 0, "clips": 50, "splits": { "accent": {}, "variant": 
{}, "age": { "twenties": 1 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 1, "size": 759555, "checksum": "f9d97644f74906e2ab9fd29863e018a07431fabcd12c5a4f7d4ae52c22dc6530", "avgDurationSecs": 2.29, "validDurationSecs": 0, "totalHrs": 0.03, "validHrs": 0 }, "sk": { "buckets": { "dev": 0, "invalidated": 10, "other": 0, "test": 272, "train": 1709, "validated": 1981 }, "duration": 9274392, "reportedSentences": 2, "validatedSentences": 21980, "unvalidatedSentences": 1006, "clips": 1991, "splits": { "accent": { "": 1976 }, "variant": {}, "age": { "": 0.0075, "fifties": 0.8624, "fourties": 0.1301 }, "gender": { "": 0.8699, "male_masculine": 0.1301 }, "sentence_domain": { "general": 1944, "history_law_government": 1253, "media_entertainment": 50, "technology_robotics": 50, "news_current_affairs": 52, "language_fundamentals": 21, "nature_environment": 29, "automotive_transport": 8, "finance": 1 } }, "users": 5, "size": 55931656, "checksum": "d2d3d57f518245c06c63a92a7896b6252db8a2527372137632f1e94a2c01f8e2", "avgDurationSecs": 4.658, "validDurationSecs": 9227.81, "totalHrs": 2.57, "validHrs": 2.56 }, "skr": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 6011, "unvalidatedSentences": 93, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 371413, "checksum": "7854291e98b8160c5b5599ca0f22ef72017ea28ad72c80084d9f8d0b71c56a54", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "sl": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 4209, "unvalidatedSentences": 29, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 245605, "checksum": 
"d07d67789ba26f84451ee1a83542285a5738d38ce5e1eea896696528e53794dc", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "sq": { "buckets": { "dev": 0, "invalidated": 1, "other": 0, "test": 0, "train": 15, "validated": 15 }, "duration": 89784, "reportedSentences": 0, "validatedSentences": 50009, "unvalidatedSentences": 2635, "clips": 16, "splits": { "accent": { "": 5 }, "variant": {}, "age": { "": 1 }, "gender": { "": 1 }, "sentence_domain": { "history_law_government": 15, "general": 1 } }, "users": 4, "size": 4083732, "checksum": "ea99684d4d7fad94a7e1c97d41463e512e7077906fb9f3b00a9281c34857721e", "avgDurationSecs": 5.612, "validDurationSecs": 84.173, "totalHrs": 0.02, "validHrs": 0.02 }, "sr": { "buckets": { "dev": 0, "invalidated": 1, "other": 296, "test": 0, "train": 8, "validated": 8 }, "duration": 1091628, "reportedSentences": 5, "validatedSentences": 8155, "unvalidatedSentences": 572, "clips": 305, "splits": { "accent": { "": 245 }, "variant": {}, "age": { "thirties": 0.0721, "twenties": 0.9279 }, "gender": { "": 0.2689, "male_masculine": 0.7311 }, "sentence_domain": { "history_law_government": 1, "language_fundamentals": 2, "technology_robotics": 2, "general": 1, "nature_environment": 3 } }, "users": 3, "size": 6867527, "checksum": "4af63356aeb9ea4b267a85de1ff7c19c3bbe30e68218eccd8655e95d1dc4c831", "avgDurationSecs": 3.579, "validDurationSecs": 28.633, "totalHrs": 0.3, "validHrs": 0 }, "sv-SE": { "buckets": { "dev": 0, "invalidated": 0, "other": 25, "test": 0, "train": 0, "validated": 0 }, "duration": 141696, "reportedSentences": 0, "validatedSentences": 26876, "unvalidatedSentences": 6596, "clips": 25, "splits": { "accent": { "": 95 }, "variant": {}, "age": { "": 0.2, "thirties": 0.8 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 2, "size": 2937627, "checksum": "6b460f52c99095f843bde66f637facf0614e475a8bb93379fa5c1513bd63451d", "avgDurationSecs": 5.668, "validDurationSecs": 0, "totalHrs": 0.03, "validHrs": 0 }, "sw": { 
"buckets": { "dev": 0, "invalidated": 0, "other": 2517, "test": 0, "train": 0, "validated": 0 }, "duration": 16300080, "reportedSentences": 3, "validatedSentences": 134729, "unvalidatedSentences": 5757, "clips": 2517, "splits": { "accent": { "": 250 }, "variant": { "sw-baratz": 373 }, "age": { "": 0.4128, "twenties": 0.557, "thirties": 0.0302 }, "gender": { "": 0.9702, "female_feminine": 0.0298 }, "sentence_domain": {} }, "users": 23, "size": 103332463, "checksum": "464cf878971bf620dc407808f2dfebb8058309a899fb97d723eacfda4be4f60b", "avgDurationSecs": 6.476, "validDurationSecs": 0, "totalHrs": 4.52, "validHrs": 0 }, "ta": { "buckets": { "dev": 0, "invalidated": 0, "other": 130, "test": 0, "train": 0, "validated": 0 }, "duration": 806508, "reportedSentences": 0, "validatedSentences": 118292, "unvalidatedSentences": 550, "clips": 130, "splits": { "accent": {}, "variant": {}, "age": { "thirties": 0.0385, "teens": 0.0769, "fourties": 0.7692, "": 0.1154 }, "gender": { "male_masculine": 0.6923, "": 0.3077 }, "sentence_domain": {} }, "users": 5, "size": 14085479, "checksum": "37b4474698ef4327a3f8e2298fade6e69b8837162e53be3b9c68caccc5334dd8", "avgDurationSecs": 6.204, "validDurationSecs": 0, "totalHrs": 0.22, "validHrs": 0 }, "tar": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 1258, "unvalidatedSentences": 0, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 72169, "checksum": "6407f6495f831c4df668f69c97f5d0d2960012f34c5ff28a7c897a87dc829e17", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "te": { "buckets": { "dev": 0, "invalidated": 0, "other": 52, "test": 0, "train": 1, "validated": 1 }, "duration": 229176, "reportedSentences": 14, "validatedSentences": 370, "unvalidatedSentences": 19699, "clips": 53, "splits": { "accent": { "": 144 }, "variant": {}, "age": { 
"twenties": 0.9057, "": 0.0943 }, "gender": { "male_masculine": 0.9057, "": 0.0943 }, "sentence_domain": { "language_fundamentals": 1, "healthcare": 1 } }, "users": 2, "size": 2654734, "checksum": "e40bc14b1e52856114c8686152a0d8ebb22a471b538efe9141382f8bb5f3dcdd", "avgDurationSecs": 4.324, "validDurationSecs": 4.324, "totalHrs": 0.06, "validHrs": 0 }, "tg": { "buckets": { "dev": 0, "invalidated": 0, "other": 36, "test": 0, "train": 2, "validated": 2 }, "duration": 229572, "reportedSentences": 0, "validatedSentences": 4000, "unvalidatedSentences": 6771, "clips": 38, "splits": { "accent": {}, "variant": {}, "age": { "seventies": 0.0526, "twenties": 0.9474 }, "gender": { "": 1 }, "sentence_domain": { "media_entertainment": 6, "language_fundamentals": 22, "news_current_affairs": 3, "general": 9, "service_retail": 1, "healthcare": 1, "history_law_government": 2, "automotive_transport": 1 } }, "users": 2, "size": 2156935, "checksum": "e1d01f2e8ffcb6186195796bb71b280a7f7e69069b27cbb0e25d17db4cb202c0", "avgDurationSecs": 6.041, "validDurationSecs": 12.083, "totalHrs": 0.06, "validHrs": 0 }, "th": { "buckets": { "dev": 0, "invalidated": 0, "other": 108, "test": 0, "train": 0, "validated": 0 }, "duration": 417060, "reportedSentences": 1, "validatedSentences": 64491, "unvalidatedSentences": 121701, "clips": 108, "splits": { "accent": { "": 55 }, "variant": {}, "age": { "": 0.2315, "twenties": 0.7685 }, "gender": { "": 0.2315, "transgender": 0.1852, "male_masculine": 0.5833 }, "sentence_domain": {} }, "users": 6, "size": 14610859, "checksum": "d5535f8e0ed1338ce9091b0b4b07b9fe0c494ab5fab34f93d04b963e6dfe0d8a", "avgDurationSecs": 3.862, "validDurationSecs": 0, "totalHrs": 0.11, "validHrs": 0 }, "ti": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 4571, "unvalidatedSentences": 382, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, 
"sentence_domain": {} }, "users": 0, "size": 346251, "checksum": "b4026cbbc8575e41d30666511290acbef0a751810090ee833d02dea0b219bbf7", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "tig": { "buckets": { "dev": 0, "invalidated": 0, "other": 20, "test": 0, "train": 0, "validated": 0 }, "duration": 112248, "reportedSentences": 0, "validatedSentences": 5417, "unvalidatedSentences": 90, "clips": 20, "splits": { "accent": {}, "variant": {}, "age": { "thirties": 1 }, "gender": { "": 1 }, "sentence_domain": { "language_fundamentals": 2 } }, "users": 1, "size": 935039, "checksum": "c015960a371b5bc33fc3b18efd4910b263e7b96583ae8ba91f1fab14d1bb3ecb", "avgDurationSecs": 5.612, "validDurationSecs": 0, "totalHrs": 0.03, "validHrs": 0 }, "tk": { "buckets": { "dev": 0, "invalidated": 0, "other": 75, "test": 0, "train": 0, "validated": 0 }, "duration": 457992, "reportedSentences": 0, "validatedSentences": 2494, "unvalidatedSentences": 238, "clips": 75, "splits": { "accent": { "": 20 }, "variant": {}, "age": { "": 0.6, "twenties": 0.4 }, "gender": { "": 0.8667, "female_feminine": 0.1333 }, "sentence_domain": {} }, "users": 6, "size": 2766851, "checksum": "1a0ee27a18a65c3cafe69fee50f873711b13d166ccdd97041c81b3eed337cd2a", "avgDurationSecs": 6.107, "validDurationSecs": 0, "totalHrs": 0.12, "validHrs": 0 }, "tok": { "buckets": { "dev": 0, "invalidated": 0, "other": 11, "test": 0, "train": 0, "validated": 0 }, "duration": 40968, "reportedSentences": 0, "validatedSentences": 7512, "unvalidatedSentences": 2248, "clips": 11, "splits": { "accent": { "": 11 }, "variant": {}, "age": { "teens": 1 }, "gender": { "male_masculine": 1 }, "sentence_domain": {} }, "users": 1, "size": 748259, "checksum": "f6a4fb95f08c1eee9478b2729a9a24a3973b0dcfbd1f98558c05726fcec038c9", "avgDurationSecs": 3.724, "validDurationSecs": 0, "totalHrs": 0.01, "validHrs": 0 }, "tr": { "buckets": { "dev": 0, "invalidated": 43, "other": 462, "test": 28, "train": 108, "validated": 136 }, 
"duration": 3982932, "reportedSentences": 2, "validatedSentences": 410410, "unvalidatedSentences": 3490, "clips": 641, "splits": { "accent": { "": 516 }, "variant": {}, "age": { "thirties": 0.064, "": 0.1248, "sixties": 0.78, "fifties": 0.0156, "twenties": 0.0156 }, "gender": { "": 0.1888, "male_masculine": 0.7722, "female_feminine": 0.039 }, "sentence_domain": { "general": 1 } }, "users": 24, "size": 52379416, "checksum": "04295ad8d8f068ec5823732f9e8457d6ea09504c6c319b7f0f1746e4fcb5f661", "avgDurationSecs": 6.214, "validDurationSecs": 845.053, "totalHrs": 1.1, "validHrs": 0.23 }, "trw": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 7770, "unvalidatedSentences": 27, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 556914, "checksum": "05449a413cd2e9757dcc81fc70b64e7852b6fe0013a6d90797d6abfba42edeb0", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "tt": { "buckets": { "dev": 0, "invalidated": 0, "other": 153, "test": 0, "train": 10, "validated": 10 }, "duration": 746244, "reportedSentences": 0, "validatedSentences": 18051, "unvalidatedSentences": 32, "clips": 163, "splits": { "accent": { "": 65 }, "variant": {}, "age": { "": 0.6626, "sixties": 0.3374 }, "gender": { "": 0.6626, "female_feminine": 0.3374 }, "sentence_domain": { "general": 3 } }, "users": 5, "size": 4289693, "checksum": "5fa402d5c1839571d9f5bfda685fdcd68ed02e4d1707cc2b144e136fe6bc6e81", "avgDurationSecs": 4.578, "validDurationSecs": 45.782, "totalHrs": 0.2, "validHrs": 0.01 }, "tui": { "buckets": { "dev": 0, "invalidated": 0, "other": 2, "test": 0, "train": 0, "validated": 0 }, "duration": 25236, "reportedSentences": 0, "validatedSentences": 1971, "unvalidatedSentences": 5, "clips": 2, "splits": { "accent": {}, "variant": {}, "age": { "": 1 }, "gender": { "": 1 }, "sentence_domain": {} }, 
"users": 1, "size": 268076, "checksum": "3b9b625538ff39e979765b50e164d81809f2e7b6b93baa355503f5fc4cd062c8", "avgDurationSecs": 12.618, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "tw": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 49165, "unvalidatedSentences": 23, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 2754629, "checksum": "656bcbded803e326c13fe93a48c0f1437294f70d4fdd8278264350ac6451320f", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "ug": { "buckets": { "dev": 0, "invalidated": 19, "other": 1433, "test": 24, "train": 698, "validated": 722 }, "duration": 24621444, "reportedSentences": 9, "validatedSentences": 231441, "unvalidatedSentences": 11020, "clips": 2174, "splits": { "accent": { "": 1730 }, "variant": { "ug-Arab": 595 }, "age": { "teens": 0.1532, "twenties": 0.7958, "": 0.0511 }, "gender": { "female_feminine": 0.1532, "": 0.8468 }, "sentence_domain": { "general": 33, "media_entertainment": 1 } }, "users": 12, "size": 164205784, "checksum": "a3cabc54faf51e192506cfe25d73fe9b7ac235cfeb7451556389fa02d813141c", "avgDurationSecs": 11.325, "validDurationSecs": 8176.947, "totalHrs": 6.83, "validHrs": 2.27 }, "uk": { "buckets": { "dev": 0, "invalidated": 0, "other": 1289, "test": 1, "train": 5, "validated": 6 }, "duration": 6260076, "reportedSentences": 0, "validatedSentences": 212668, "unvalidatedSentences": 1661, "clips": 1295, "splits": { "accent": { "": 50 }, "variant": {}, "age": { "": 0.0927, "teens": 0.8803, "thirties": 0.0193, "twenties": 0.0077 }, "gender": { "": 0.973, "male_masculine": 0.027 }, "sentence_domain": {} }, "users": 11, "size": 50524152, "checksum": "6704c972515cbfb42d9dfab1fb59155a354100b18bd5f0955606615d86fdef49", "avgDurationSecs": 4.834, "validDurationSecs": 29.004, "totalHrs": 1.73, "validHrs": 0 }, 
"ur": { "buckets": { "dev": 0, "invalidated": 0, "other": 21, "test": 0, "train": 0, "validated": 0 }, "duration": 131832, "reportedSentences": 7, "validatedSentences": 21706, "unvalidatedSentences": 48257, "clips": 21, "splits": { "accent": { "": 8 }, "variant": {}, "age": { "": 0.9524, "twenties": 0.0476 }, "gender": { "": 1 }, "sentence_domain": { "general": 1, "media_entertainment": 1 } }, "users": 4, "size": 5721080, "checksum": "4c964e48ff01fd05c169c6f9e794b9cf0617cfaf5587673c4ab6edd547c8366b", "avgDurationSecs": 6.278, "validDurationSecs": 0, "totalHrs": 0.03, "validHrs": 0 }, "uz": { "buckets": { "dev": 0, "invalidated": 0, "other": 203, "test": 0, "train": 0, "validated": 0 }, "duration": 1089828, "reportedSentences": 0, "validatedSentences": 148738, "unvalidatedSentences": 137888, "clips": 203, "splits": { "accent": { "": 77 }, "variant": {}, "age": { "": 0.3448, "twenties": 0.4828, "thirties": 0.1478, "teens": 0.0246 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 18, "size": 25480306, "checksum": "a46a53a1bf2b5369411a4e641204ee86e833d5679b4eff51e98263735703e369", "avgDurationSecs": 5.369, "validDurationSecs": 0, "totalHrs": 0.3, "validHrs": 0 }, "ve": { "buckets": { "dev": 0, "invalidated": 0, "other": 5, "test": 0, "train": 0, "validated": 0 }, "duration": 30240, "reportedSentences": 0, "validatedSentences": 1735, "unvalidatedSentences": 7843, "clips": 5, "splits": { "accent": { "": 5 }, "variant": {}, "age": { "": 1 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 1, "size": 788193, "checksum": "1deff463fddde59fae11f7bcbe38d039ad73c6acd2bb560b956e8030e39efe54", "avgDurationSecs": 6.048, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "vi": { "buckets": { "dev": 0, "invalidated": 0, "other": 586, "test": 0, "train": 0, "validated": 0 }, "duration": 2585988, "reportedSentences": 0, "validatedSentences": 13337, "unvalidatedSentences": 5485, "clips": 586, "splits": { "accent": {}, "variant": { "vi-huett": 469, "vi-hanoi": 5, 
"vi-saigon": 97 }, "age": { "": 0.6297, "twenties": 0.1826, "seventies": 0.1792, "fifties": 0.0085 }, "gender": { "": 0.8635, "female_feminine": 0.1365 }, "sentence_domain": { "general": 580 } }, "users": 7, "size": 16314780, "checksum": "5cb046af7a1a5121c2e89f3ee60c64d7af941aed0523d0e83cb6916310646847", "avgDurationSecs": 4.413, "validDurationSecs": 0, "totalHrs": 0.71, "validHrs": 0 }, "xmf": { "buckets": { "dev": 0, "invalidated": 3, "other": 7, "test": 1, "train": 9, "validated": 10 }, "duration": 155484, "reportedSentences": 0, "validatedSentences": 1004, "unvalidatedSentences": 36, "clips": 20, "splits": { "accent": { "senak": 10 }, "variant": { "xmf-senmar": 10 }, "age": { "": 1 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 2, "size": 1008299, "checksum": "d2d1ea146d09531ff5f7fcb833c7680f9b421c4abaedc5e6409d580e6f20a5e9", "avgDurationSecs": 7.774, "validDurationSecs": 77.742, "totalHrs": 0.04, "validHrs": 0.02 }, "ydg": { "buckets": { "dev": 0, "invalidated": 0, "other": 261, "test": 0, "train": 1, "validated": 1 }, "duration": 1183356, "reportedSentences": 0, "validatedSentences": 1937, "unvalidatedSentences": 169, "clips": 262, "splits": { "accent": {}, "variant": {}, "age": { "": 1 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 3, "size": 6801833, "checksum": "f7792059438df13ff826e347672bc1bdbf3f5bd6799504fac33222cbe5ad3fb1", "avgDurationSecs": 4.517, "validDurationSecs": 4.517, "totalHrs": 0.32, "validHrs": 0 }, "yi": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 1550, "unvalidatedSentences": 85, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 107237, "checksum": "3ffcafce0ac4e7103e0154140adff1765d530ee8712e0fda055eadce989590d3", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "yo": { "buckets": { "dev": 0, 
"invalidated": 0, "other": 2, "test": 0, "train": 0, "validated": 0 }, "duration": 6444, "reportedSentences": 0, "validatedSentences": 5419, "unvalidatedSentences": 17, "clips": 2, "splits": { "accent": {}, "variant": {}, "age": { "": 1 }, "gender": { "": 1 }, "sentence_domain": {} }, "users": 1, "size": 426576, "checksum": "69ea11bfa32a3d4ef2ee2630ea13e515a757885ab3054a7853799b1f2bdb6d96", "avgDurationSecs": 3.222, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "yue": { "buckets": { "dev": 0, "invalidated": 0, "other": 100, "test": 0, "train": 0, "validated": 0 }, "duration": 326880, "reportedSentences": 0, "validatedSentences": 18556, "unvalidatedSentences": 10171, "clips": 100, "splits": { "accent": {}, "variant": {}, "age": { "": 0.25, "twenties": 0.75 }, "gender": { "": 1 }, "sentence_domain": { "service_retail": 6, "general": 4, "automotive_transport": 1 } }, "users": 5, "size": 3594220, "checksum": "96250e6aee65b031989c92cb4ac8001ea3f911a6ff6977914f80decb5e08bc10", "avgDurationSecs": 3.269, "validDurationSecs": 0, "totalHrs": 0.09, "validHrs": 0 }, "zgh": { "buckets": { "dev": 0, "invalidated": 0, "other": 37, "test": 0, "train": 0, "validated": 0 }, "duration": 290448, "reportedSentences": 1, "validatedSentences": 2740, "unvalidatedSentences": 356, "clips": 37, "splits": { "accent": {}, "variant": { "zgh-tzm": 7 }, "age": { "thirties": 0.0541, "": 0.9459 }, "gender": { "male_masculine": 0.0541, "": 0.9459 }, "sentence_domain": { "agriculture_food": 1, "general": 1 } }, "users": 10, "size": 1857795, "checksum": "0dbdee7ca116598946bb815900daeaa3b2ed229bcc2628b83e03694f395a4a8d", "avgDurationSecs": 7.85, "validDurationSecs": 0, "totalHrs": 0.08, "validHrs": 0 }, "zh-CN": { "buckets": { "dev": 0, "invalidated": 0, "other": 255, "test": 0, "train": 0, "validated": 0 }, "duration": 1406088, "reportedSentences": 18, "validatedSentences": 59143, "unvalidatedSentences": 902, "clips": 255, "splits": { "accent": { "150000": 5, "210000": 5, "310000": 100, 
"350000": 5, "360000": 54, "430000": 5, "440000": 3 }, "variant": {}, "age": { "teens": 0.1922, "": 0.2314, "twenties": 0.5529, "thirties": 0.0235 }, "gender": { "": 0.5059, "female_feminine": 0.102, "male_masculine": 0.3922 }, "sentence_domain": { "language_fundamentals": 16, "news_current_affairs": 16, "general": 61, "nature_environment": 10, "media_entertainment": 81, "history_law_government": 25, "healthcare": 7, "technology_robotics": 13, "finance": 10, "agriculture_food": 4, "automotive_transport": 4 } }, "users": 24, "size": 12535318, "checksum": "49e7905b506d2dc160bf4b13a8b2d78b7fba7d3d0fc9f091c61213823b8cb0d8", "avgDurationSecs": 5.514, "validDurationSecs": 0, "totalHrs": 0.39, "validHrs": 0 }, "zh-HK": { "buckets": { "dev": 0, "invalidated": 0, "other": 399, "test": 0, "train": 0, "validated": 0 }, "duration": 1833372, "reportedSentences": 0, "validatedSentences": 20148, "unvalidatedSentences": 60, "clips": 399, "splits": { "accent": { "": 317 }, "variant": {}, "age": { "fourties": 0.7945, "twenties": 0.2005, "": 0.005 }, "gender": { "": 1 }, "sentence_domain": { "general": 107, "technology_robotics": 9, "service_retail": 22, "language_fundamentals": 3, "finance": 2, "healthcare": 4, "history_law_government": 12, "nature_environment": 4, "automotive_transport": 2, "news_current_affairs": 1 } }, "users": 3, "size": 11962513, "checksum": "1973a7c1ce3dff66543e0472ff0420f08194b0f0e8ac4c64944440e7d6b00c19", "avgDurationSecs": 4.595, "validDurationSecs": 0, "totalHrs": 0.5, "validHrs": 0 }, "zh-TW": { "buckets": { "dev": 0, "invalidated": 0, "other": 407, "test": 0, "train": 0, "validated": 0 }, "duration": 1220724, "reportedSentences": 0, "validatedSentences": 20786, "unvalidatedSentences": 977, "clips": 407, "splits": { "accent": { "tainan_city": 6, "penghu_county": 20, "": 1, "changhua_county": 133, "kaohsiung_city": 75, "taipei_city": 5 }, "variant": {}, "age": { "": 0.5111, "twenties": 0.3292, "teens": 0.1597 }, "gender": { "": 1 }, "sentence_domain": { 
"general": 399, "technology_robotics": 234, "automotive_transport": 4, "nature_environment": 2, "history_law_government": 3, "media_entertainment": 3, "service_retail": 1, "healthcare": 3, "news_current_affairs": 1, "finance": 2 } }, "users": 13, "size": 7771443, "checksum": "b516a5719a8cc27d6c5d9a51987305ef0b60e582d0a5b19617080e4af98b4c5a", "avgDurationSecs": 2.999, "validDurationSecs": 0, "totalHrs": 0.33, "validHrs": 0 }, "zu": { "buckets": { "dev": 0, "invalidated": 0, "other": 0, "test": 0, "train": 0, "validated": 0 }, "duration": 0, "reportedSentences": 0, "validatedSentences": 1320, "unvalidatedSentences": 19379, "clips": 0, "splits": { "accent": {}, "variant": {}, "age": {}, "gender": {}, "sentence_domain": {} }, "users": 0, "size": 1448647, "checksum": "e0f4f2720a9655fc6a62d6bf312ff998991630a5f9adb7f5891de52becee593e", "avgDurationSecs": 0, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 }, "zza": { "buckets": { "dev": 0, "invalidated": 0, "other": 195, "test": 0, "train": 0, "validated": 0 }, "duration": 646380, "reportedSentences": 0, "validatedSentences": 5657, "unvalidatedSentences": 989, "clips": 195, "splits": { "accent": { "": 175 }, "variant": {}, "age": { "": 0.1538, "twenties": 0.8462 }, "gender": { "": 0.1538, "do_not_wish_to_say": 0.8462 }, "sentence_domain": { "general": 97, "language_fundamentals": 2, "healthcare": 3, "service_retail": 1 } }, "users": 3, "size": 4130735, "checksum": "40a1ca8deca39438db068cbd40ef99baf0f94865ab336fa6f7ad116d5230bfeb", "avgDurationSecs": 3.315, "validDurationSecs": 0, "totalHrs": 0.17, "validHrs": 0 } }, "totalDuration": 11198548944, "totalValidDurationSecs": 2821693, "totalHrs": 3110, "totalValidHrs": 784 } ================================================ FILE: datasets/scripted-speech/cv-corpus-3.json ================================================ { "date": "2019-06-24", "locales": { "en": { "clips": 896823, "splits": { "accent": { "": 0.52, "canada": 0.03, "england": 0.09, "us": 0.23, "indian": 0.04, 
"australia": 0.03, "malaysia": 0, "newzealand": 0.01, "african": 0.01, "ireland": 0, "philippines": 0, "singapore": 0, "scotland": 0.01, "hongkong": 0, "bermuda": 0, "southatlandtic": 0, "wales": 0, "other": 0.02 }, "age": { "": 0.41, "twenties": 0.21, "sixties": 0.03, "thirties": 0.15, "teens": 0.05, "seventies": 0.01, "fourties": 0.08, "fifties": 0.04, "eighties": 0, "nineties": 0 }, "gender": { "": 0.41, "male": 0.47, "female": 0.11, "other": 0.02 } }, "users": 39577, "duration": 3914431152, "buckets": { "dev": 13178, "invalidated": 78904, "other": 173800, "test": 13178, "train": 63330, "validated": 644119 }, "size": 30159205077 }, "de": { "clips": 293932, "splits": { "accent": { "germany": 0.65, "russia": 0, "": 0.3, "austria": 0.02, "liechtenstein": 0, "switzerland": 0.01, "france": 0, "other": 0, "poland": 0, "united_kingdom": 0, "hungary": 0, "netherlands": 0, "namibia": 0, "slovakia": 0, "united_states": 0, "italy": 0 }, "age": { "twenties": 0.28, "fourties": 0.17, "": 0.22, "thirties": 0.18, "teens": 0.04, "sixties": 0.02, "fifties": 0.09, "seventies": 0 }, "gender": { "male": 0.68, "": 0.22, "female": 0.1, "other": 0 } }, "users": 5007, "duration": 1226382048, "buckets": { "dev": 5633, "invalidated": 12721, "other": 4, "test": 5633, "train": 8518, "validated": 281207 }, "size": 9533851782 }, "fr": { "clips": 162340, "splits": { "accent": { "": 0.29, "france": 0.64, "germany": 0, "belgium": 0.03, "switzerland": 0.01, "guadeloupe": 0, "reunion": 0, "monaco": 0, "tunisia": 0, "canada": 0.02, "other": 0, "mayotte": 0, "algeria": 0, "netherlands": 0, "senegal": 0, "martinique": 0, "portugal": 0, "united_states": 0, "cote_d_ivoire": 0, "st_pierre_et_miquelon": 0, "french_guiana": 0, "benin": 0, "italy": 0, "united_kingdom": 0, "madagascar": 0, "cameroon": 0 }, "age": { "twenties": 0.24, "thirties": 0.22, "": 0.22, "teens": 0.04, "fourties": 0.17, "fifties": 0.05, "sixties": 0.06, "seventies": 0 }, "gender": { "male": 0.7, "": 0.2, "female": 0.09, "other": 0.01 
} }, "users": 3005, "duration": 665133624, "buckets": { "dev": 10366, "invalidated": 9978, "other": 0, "test": 10366, "train": 27498, "validated": 152362 }, "size": 5176500440 }, "cy": { "clips": 38646, "splits": { "accent": { "united_kingdom": 0.43, "": 0.55, "other": 0.02 }, "age": { "fourties": 0.12, "twenties": 0.09, "sixties": 0.13, "fifties": 0.07, "": 0.49, "thirties": 0.07, "seventies": 0.01, "eighties": 0, "teens": 0.01 }, "gender": { "male": 0.33, "female": 0.18, "": 0.49, "other": 0.01 } }, "users": 748, "duration": 174368352, "buckets": { "dev": 484, "invalidated": 1696, "other": 2744, "test": 484, "train": 500, "validated": 34188 }, "size": 1348150916 }, "br": { "clips": 12155, "splits": { "accent": { "other": 0, "": 1 }, "age": { "twenties": 0.14, "": 0.5, "fifties": 0.08, "fourties": 0.11, "thirties": 0.13, "sixties": 0.04 }, "gender": { "male": 0.47, "": 0.5, "female": 0.03 } }, "users": 118, "duration": 36714648, "buckets": { "dev": 1142, "invalidated": 289, "other": 7480, "test": 1328, "train": 1630, "validated": 4386 }, "size": 279505456 }, "cv": { "clips": 2355, "splits": { "accent": { "other": 0.01, "": 0.99 }, "age": { "twenties": 0.04, "": 0.53, "fourties": 0.38, "thirties": 0.04, "teens": 0.01 }, "gender": { "male": 0.47, "": 0.53 } }, "users": 38, "duration": 10101114, "buckets": { "dev": 78, "invalidated": 108, "other": 1220, "test": 243, "train": 566, "validated": 1027 }, "size": 79035841 }, "tr": { "clips": 9700, "splits": { "accent": { "": 0.92, "other": 0.08 }, "age": { "": 0.19, "thirties": 0.33, "twenties": 0.42, "teens": 0.04, "fourties": 0.02, "fifties": 0 }, "gender": { "": 0.19, "male": 0.7, "female": 0.11 } }, "users": 344, "duration": 37589112, "buckets": { "dev": 1620, "invalidated": 829, "other": 0, "test": 1621, "train": 1802, "validated": 8871 }, "size": 288370679 }, "tt": { "clips": 25772, "splits": { "accent": { "": 1 }, "age": { "": 0.16, "thirties": 0.79, "twenties": 0.05, "sixties": 0, "fifties": 0.01, "teens": 0, 
"fourties": 0 }, "gender": { "": 0.16, "male": 0.83, "female": 0.02 } }, "users": 132, "duration": 94851408, "buckets": { "dev": 1949, "invalidated": 251, "other": 3438, "test": 4070, "train": 8264, "validated": 22083 }, "size": 710269716 }, "ky": { "clips": 15885, "splits": { "accent": { "": 0.91, "other": 0.09 }, "age": { "thirties": 0.15, "": 0.04, "fourties": 0.01, "twenties": 0.79, "teens": 0.01 }, "gender": { "male": 0.44, "": 0.09, "female": 0.47, "other": 0 } }, "users": 97, "duration": 72991440, "buckets": { "dev": 971, "invalidated": 779, "other": 8249, "test": 1501, "train": 2320, "validated": 6857 }, "size": 525926969 }, "ga-IE": { "clips": 3764, "splits": { "accent": { "": 0.29, "connachta": 0.4, "other": 0.1, "ulaidh": 0.09, "mumhain": 0.12 }, "age": { "twenties": 0.27, "": 0.23, "thirties": 0.38, "fourties": 0.08, "sixties": 0.01, "teens": 0.03 }, "gender": { "male": 0.62, "": 0.23, "female": 0.15 } }, "users": 63, "duration": 12982896, "buckets": { "dev": 462, "invalidated": 202, "other": 1033, "test": 482, "train": 522, "validated": 2529 }, "size": 98312781 }, "kab": { "clips": 192115, "splits": { "accent": { "": 0.79, "other": 0.21 }, "age": { "fourties": 0.13, "thirties": 0.32, "": 0.32, "twenties": 0.13, "fifties": 0.03, "eighties": 0, "teens": 0, "sixties": 0.05, "seventies": 0 }, "gender": { "male": 0.57, "": 0.28, "female": 0.15, "other": 0.01 } }, "users": 584, "duration": 693953592, "buckets": { "dev": 6646, "invalidated": 9717, "other": 1816, "test": 6648, "train": 11073, "validated": 180582 }, "size": 5395733580 }, "ca": { "clips": 95720, "splits": { "accent": { "valencian": 0.06, "central": 0.68, "": 0.21, "other": 0.01, "balearic": 0.01, "northwestern": 0.03, "northern": 0 }, "age": { "thirties": 0.12, "fifties": 0.27, "fourties": 0.25, "twenties": 0.1, "": 0.2, "sixties": 0.04, "teens": 0.02, "seventies": 0, "eighties": 0 }, "gender": { "male": 0.37, "": 0.2, "female": 0.43, "other": 0 } }, "users": 1834, "duration": 434964288, 
"buckets": { "dev": 8411, "invalidated": 4694, "other": 6208, "test": 8410, "train": 17011, "validated": 84818 }, "size": 3377582268 }, "zh-TW": { "clips": 52677, "splits": { "accent": { "": 0.65, "other": 0.14, "kaohsiung_city": 0.01, "new_taipei_city": 0.05, "yilan_county": 0.01, "taoyuan_city": 0, "hsinchu_city": 0.01, "taipei_city": 0.07, "chiayi_city": 0.01, "taichung_city": 0.01, "yunlin_county": 0, "nantou_county": 0, "tainan_city": 0.03, "changhua_county": 0, "pingtung_county": 0, "hsinchu_county": 0 }, "age": { "thirties": 0.32, "twenties": 0.38, "teens": 0.04, "": 0.23, "fifties": 0.01, "seventies": 0, "fourties": 0.02 }, "gender": { "male": 0.46, "": 0.23, "female": 0.29, "other": 0.03 } }, "users": 949, "duration": 154923336, "buckets": { "dev": 1540, "invalidated": 2667, "other": 9330, "test": 1540, "train": 1698, "validated": 40680 }, "size": 1198002255 }, "sl": { "clips": 4714, "splits": { "accent": { "other": 0.01, "": 0.99 }, "age": { "twenties": 0.75, "teens": 0, "": 0.05, "sixties": 0, "fifties": 0.17, "fourties": 0.02, "thirties": 0.01 }, "gender": { "female": 0.2, "male": 0.75, "": 0.05 } }, "users": 42, "duration": 18543912, "buckets": { "dev": 465, "invalidated": 146, "other": 2090, "test": 339, "train": 1178, "validated": 2478 }, "size": 145009295 }, "it": { "clips": 30274, "splits": { "accent": { "": 0.83, "other": 0.17 }, "age": { "thirties": 0.11, "twenties": 0.32, "": 0.2, "fifties": 0.19, "fourties": 0.1, "seventies": 0, "sixties": 0, "teens": 0.06 }, "gender": { "female": 0.18, "male": 0.62, "": 0.2, "other": 0 } }, "users": 602, "duration": 146844648, "buckets": { "dev": 4563, "invalidated": 3211, "other": 1, "test": 4569, "train": 6368, "validated": 27062 }, "size": 1143749798 }, "nl": { "clips": 22232, "splits": { "accent": { "": 0.25, "netherlands": 0.65, "belgium": 0.1, "other": 0 }, "age": { "": 0.22, "twenties": 0.28, "fourties": 0.14, "thirties": 0.11, "teens": 0.04, "fifties": 0.19, "sixties": 0.01 }, "gender": { "": 0.26, 
"male": 0.72, "female": 0.02, "other": 0 } }, "users": 502, "duration": 84629952, "buckets": { "dev": 1689, "invalidated": 954, "other": 4188, "test": 1689, "train": 1881, "validated": 17090 }, "size": 651528448 }, "cnh": { "clips": 4746, "splits": { "accent": { "": 0.82, "other": 0.18 }, "age": { "": 0.56, "twenties": 0.28, "fourties": 0.01, "teens": 0.02, "thirties": 0.1, "fifties": 0.03 }, "gender": { "": 0.56, "male": 0.24, "female": 0.2 } }, "users": 280, "duration": 17936232, "buckets": { "dev": 688, "invalidated": 463, "other": 2034, "test": 694, "train": 750, "validated": 2249 }, "size": 140129764 }, "eo": { "clips": 13234, "splits": { "accent": { "": 0.51, "internacia": 0.47, "other": 0.02 }, "age": { "twenties": 0.65, "thirties": 0.05, "": 0.11, "fourties": 0.16, "fifties": 0.01, "seventies": 0, "teens": 0.02, "sixties": 0 }, "gender": { "male": 0.51, "": 0.38, "female": 0.11, "other": 0 } }, "users": 129, "duration": 60410136, "buckets": { "dev": 1896, "invalidated": 744, "other": 1640, "test": 1887, "train": 2178, "validated": 10850 }, "size": 468605004 }, "et": { "clips": 6930, "splits": { "accent": { "": 1 }, "age": { "": 0.05, "thirties": 0.11, "twenties": 0.7, "fourties": 0.13, "fifties": 0, "seventies": 0, "teens": 0 }, "gender": { "": 0.05, "male": 0.57, "female": 0.37, "other": 0 } }, "users": 225, "duration": 46332792, "buckets": { "dev": 1769, "invalidated": 426, "other": 58, "test": 1783, "train": 2016, "validated": 6446 }, "size": 358541638 }, "fa": { "clips": 56394, "splits": { "accent": { "": 1 }, "age": { "": 0.39, "twenties": 0.31, "thirties": 0.16, "fifties": 0.07, "fourties": 0.03, "teens": 0.04 }, "gender": { "": 0.39, "male": 0.47, "female": 0.13, "other": 0.01 } }, "users": 1240, "duration": 253556208, "buckets": { "dev": 2103, "invalidated": 2344, "other": 0, "test": 2103, "train": 2409, "validated": 54050 }, "size": 1940329402 }, "eu": { "clips": 58920, "splits": { "accent": { "mendebalekoa": 0.23, "erdialdekoa_nafarra": 0.35, "": 
0.42, "nafarlapurtarra_zuberoatarra": 0.01 }, "age": { "fourties": 0.14, "thirties": 0.05, "fifties": 0.17, "twenties": 0.38, "": 0.23, "teens": 0.01, "sixties": 0.01, "seventies": 0 }, "gender": { "male": 0.53, "female": 0.22, "": 0.24, "other": 0.02 } }, "users": 508, "duration": 299308320, "buckets": { "dev": 1981, "invalidated": 2145, "other": 23813, "test": 1981, "train": 2250, "validated": 32962 }, "size": 2327699622 }, "es": { "clips": 27329, "splits": { "accent": { "nortepeninsular": 0.29, "": 0.29, "rioplatense": 0.07, "centrosurpeninsular": 0.04, "americacentral": 0.01, "surpeninsular": 0.03, "canario": 0.02, "andino": 0.11, "mexicano": 0.03, "chileno": 0.08, "caribe": 0.01, "filipinas": 0.01 }, "age": { "thirties": 0.12, "": 0.18, "fifties": 0.17, "twenties": 0.32, "teens": 0.02, "fourties": 0.16, "sixties": 0.02 }, "gender": { "male": 0.74, "": 0.16, "other": 0.01, "female": 0.09 } }, "users": 611, "duration": 113152080, "buckets": { "dev": 2723, "invalidated": 3566, "other": 0, "test": 2729, "train": 3275, "validated": 23763 }, "size": 868898656 }, "zh-CN": { "clips": 7013, "splits": { "accent": { "110000": 0.12, "130000": 0.01, "210000": 0.03, "220000": 0.01, "230000": 0.04, "310000": 0, "320000": 0.08, "330000": 0.07, "340000": 0.03, "350000": 0.03, "360000": 0.01, "370000": 0.03, "410000": 0.01, "420000": 0.01, "430000": 0.01, "440000": 0.03, "450000": 0.01, "500000": 0.01, "510000": 0.05, "520000": 0, "610000": 0.01, "620000": 0, "650000": 0.01, "710000": 0.02, "": 0.36 }, "age": { "": 0.23, "teens": 0.11, "twenties": 0.41, "thirties": 0.25, "fourties": 0 }, "gender": { "": 0.23, "male": 0.76, "female": 0 } }, "users": 288, "duration": 45829032, "buckets": { "dev": 1954, "invalidated": 779, "other": 0, "test": 1964, "train": 2301, "validated": 6234 }, "size": 359184757 }, "mn": { "clips": 6306, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.31, "": 0.31, "twenties": 0.22, "fourties": 0.02, "teens": 0.02, "nineties": 0.11 }, "gender": { 
"male": 0.35, "": 0.31, "female": 0.22, "other": 0.11 } }, "users": 230, "duration": 34457088, "buckets": { "dev": 1407, "invalidated": 479, "other": 273, "test": 1610, "train": 2049, "validated": 5554 }, "size": 269325609 }, "sah": { "clips": 3793, "splits": { "accent": { "": 1 }, "age": { "": 0.36, "twenties": 0.02, "fourties": 0.07, "thirties": 0.44, "teens": 0.11, "fifties": 0 }, "gender": { "": 0.36, "male": 0.54, "female": 0.1 } }, "users": 35, "duration": 22638720, "buckets": { "dev": 0, "invalidated": 51, "other": 1850, "test": 516, "train": 1374, "validated": 1892 }, "size": 173419827 }, "dv": { "clips": 5944, "splits": { "accent": { "": 1 }, "age": { "": 0.08, "twenties": 0.29, "thirties": 0.57, "fourties": 0.02, "teens": 0.04 }, "gender": { "": 0.08, "male": 0.27, "female": 0.65 } }, "users": 92, "duration": 32169912, "buckets": { "dev": 1103, "invalidated": 266, "other": 1841, "test": 1237, "train": 1487, "validated": 3837 }, "size": 247579967 }, "rw": { "clips": 1470, "splits": { "accent": { "": 1 }, "age": { "": 0.87, "twenties": 0.12, "thirties": 0.01 }, "gender": { "": 0.87, "male": 0.13 } }, "users": 32, "duration": 6806688, "buckets": { "dev": 35, "invalidated": 31, "other": 1325, "test": 34, "train": 45, "validated": 114 }, "size": 53219628 }, "sv-SE": { "clips": 4113, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.38, "": 0.21, "teens": 0.07, "fifties": 0.26, "twenties": 0.03, "fourties": 0.04 }, "gender": { "male": 0.75, "": 0.21, "female": 0.03 } }, "users": 44, "duration": 12524424, "buckets": { "dev": 1237, "invalidated": 118, "other": 0, "test": 1134, "train": 1602, "validated": 3995 }, "size": 89437571 }, "ru": { "clips": 21598, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.73, "teens": 0, "": 0.02, "fourties": 0.15, "thirties": 0.02, "fifties": 0.07 }, "gender": { "male": 0.55, "": 0.02, "other": 0, "female": 0.42 } }, "users": 64, "duration": 111829272, "buckets": { "dev": 1748, "invalidated": 784, "other": 1559, 
"test": 1676, "train": 2113, "validated": 19255 }, "size": 850506531 } }, "totalDuration": 8836356426, "totalValidDurationSecs": 7125701, "totalHrs": 2454, "totalValidHrs": 1979, "totalClips": 2076894 } ================================================ FILE: datasets/scripted-speech/cv-corpus-4-2019-12-10.json ================================================ { "date": "2019-12-10", "locales": { "en": { "clips": 1137300, "splits": { "accent": { "": 0.51, "canada": 0.03, "england": 0.09, "us": 0.23, "indian": 0.04, "australia": 0.03, "malaysia": 0, "newzealand": 0.01, "african": 0.01, "ireland": 0.01, "philippines": 0, "singapore": 0, "scotland": 0.02, "hongkong": 0, "bermuda": 0, "southatlandtic": 0, "wales": 0, "other": 0.02 }, "age": { "": 0.39, "twenties": 0.22, "sixties": 0.04, "thirties": 0.15, "teens": 0.05, "seventies": 0.01, "fourties": 0.09, "fifties": 0.05, "eighties": 0, "nineties": 0 }, "gender": { "": 0.39, "male": 0.46, "female": 0.13, "other": 0.02 } }, "users": 51072, "duration": 5358858312, "buckets": { "dev": 15531, "invalidated": 124820, "other": 155485, "test": 15531, "train": 232975, "validated": 854444 }, "size": 41448227462, "avgDurationSecs": 4.712, "validDurationSecs": 4026065.534, "totalHrs": 1488.57, "validHrs": 1118.35 }, "de": { "clips": 413561, "splits": { "accent": { "germany": 0.64, "russia": 0, "": 0.31, "austria": 0.02, "liechtenstein": 0, "switzerland": 0.01, "france": 0, "other": 0, "poland": 0, "united_kingdom": 0, "hungary": 0, "netherlands": 0, "namibia": 0, "slovakia": 0, "united_states": 0, "italy": 0, "czechia": 0, "brazil": 0, "canada": 0, "turkey": 0, "slovenia": 0, "luxembourg": 0, "belgium": 0, "denmark": 0, "greece": 0 }, "age": { "twenties": 0.23, "fourties": 0.19, "": 0.23, "thirties": 0.17, "teens": 0.03, "sixties": 0.02, "fifties": 0.11, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "male": 0.67, "": 0.23, "female": 0.09, "other": 0.01 } }, "users": 8460, "duration": 1939923648, "buckets": { "dev": 
13511, "invalidated": 19511, "other": 22014, "test": 13511, "train": 71841, "validated": 370976 }, "size": 15073085593, "avgDurationSecs": 4.691, "validDurationSecs": 1740166.784, "totalHrs": 538.86, "validHrs": 483.37 }, "fr": { "clips": 311742, "splits": { "accent": { "": 0.34, "france": 0.59, "germany": 0, "belgium": 0.02, "switzerland": 0.01, "guadeloupe": 0, "reunion": 0, "monaco": 0, "tunisia": 0, "canada": 0.03, "other": 0, "mayotte": 0, "algeria": 0, "netherlands": 0, "senegal": 0, "martinique": 0, "portugal": 0, "united_states": 0, "cote_d_ivoire": 0, "st_pierre_et_miquelon": 0, "french_guiana": 0, "benin": 0, "italy": 0, "united_kingdom": 0, "madagascar": 0, "cameroon": 0, "new_caledonia": 0, "haiti": 0, "morocco": 0 }, "age": { "twenties": 0.21, "thirties": 0.23, "": 0.26, "teens": 0.03, "fourties": 0.14, "fifties": 0.08, "sixties": 0.04, "seventies": 0.01 }, "gender": { "male": 0.65, "": 0.22, "female": 0.12, "other": 0.01 } }, "users": 8164, "duration": 1485039192, "buckets": { "dev": 14760, "invalidated": 23904, "other": 21466, "test": 14760, "train": 130604, "validated": 264897 }, "size": 11572099037, "avgDurationSecs": 4.764, "validDurationSecs": 1261884.593, "totalHrs": 412.51, "validHrs": 350.52 }, "cy": { "clips": 61272, "splits": { "accent": { "united_kingdom": 0.43, "": 0.56, "other": 0.01 }, "age": { "fourties": 0.12, "twenties": 0.08, "sixties": 0.08, "fifties": 0.07, "": 0.53, "thirties": 0.09, "seventies": 0.01, "eighties": 0, "teens": 0.01 }, "gender": { "male": 0.29, "female": 0.18, "": 0.51, "other": 0.02 } }, "users": 1149, "duration": 278683344, "buckets": { "dev": 690, "invalidated": 2598, "other": 11578, "test": 690, "train": 721, "validated": 47059 }, "size": 2145771717, "avgDurationSecs": 4.548, "validDurationSecs": 214038.378, "totalHrs": 77.41, "validHrs": 59.45 }, "br": { "clips": 15073, "splits": { "accent": { "other": 0, "": 1 }, "age": { "twenties": 0.14, "": 0.43, "fifties": 0.06, "fourties": 0.09, "thirties": 0.11, 
"sixties": 0.16 }, "gender": { "male": 0.55, "": 0.43, "female": 0.02 } }, "users": 133, "duration": 45595056, "buckets": { "dev": 1590, "invalidated": 553, "other": 8495, "test": 1685, "train": 2208, "validated": 6025 }, "size": 348309297, "avgDurationSecs": 3.025, "validDurationSecs": 18225.318, "totalHrs": 12.66, "validHrs": 5.06 }, "cv": { "clips": 2355, "splits": { "accent": { "other": 0.01, "": 0.99 }, "age": { "twenties": 0.04, "": 0.53, "fourties": 0.38, "thirties": 0.04, "teens": 0.01 }, "gender": { "male": 0.47, "": 0.53 } }, "users": 38, "duration": 10101114, "buckets": { "dev": 66, "invalidated": 649, "other": 900, "test": 198, "train": 429, "validated": 806 }, "size": 79033262, "avgDurationSecs": 4.289, "validDurationSecs": 3457.112, "totalHrs": 2.8, "validHrs": 0.96 }, "tr": { "clips": 13185, "splits": { "accent": { "": 0.94, "other": 0.06 }, "age": { "": 0.18, "thirties": 0.3, "twenties": 0.45, "teens": 0.05, "fourties": 0.02, "fifties": 0, "sixties": 0.01 }, "gender": { "": 0.18, "male": 0.74, "female": 0.08 } }, "users": 461, "duration": 53375304, "buckets": { "dev": 1624, "invalidated": 1359, "other": 9, "test": 1629, "train": 1823, "validated": 12364 }, "size": 408922099, "avgDurationSecs": 4.048, "validDurationSecs": 50051.745, "totalHrs": 14.82, "validHrs": 13.9 }, "tt": { "clips": 26790, "splits": { "accent": { "": 1 }, "age": { "": 0.17, "thirties": 0.77, "twenties": 0.05, "sixties": 0, "fifties": 0.01, "teens": 0, "fourties": 0 }, "gender": { "": 0.17, "male": 0.81, "female": 0.02 } }, "users": 142, "duration": 99079848, "buckets": { "dev": 4815, "invalidated": 256, "other": 1933, "test": 4855, "train": 7131, "validated": 24601 }, "size": 743123406, "avgDurationSecs": 3.698, "validDurationSecs": 90984.074, "totalHrs": 27.52, "validHrs": 25.27 }, "ky": { "clips": 16913, "splits": { "accent": { "": 0.91, "other": 0.09 }, "age": { "thirties": 0.14, "": 0.05, "fourties": 0.02, "twenties": 0.78, "teens": 0.01 }, "gender": { "male": 0.45, "": 
0.11, "female": 0.44, "other": 0 } }, "users": 119, "duration": 78514128, "buckets": { "dev": 1345, "invalidated": 1002, "other": 7248, "test": 1546, "train": 1989, "validated": 8663 }, "size": 560602458, "avgDurationSecs": 4.642, "validDurationSecs": 40215.686, "totalHrs": 21.8, "validHrs": 11.17 }, "ga-IE": { "clips": 4137, "splits": { "accent": { "": 0.31, "connachta": 0.38, "other": 0.09, "ulaidh": 0.08, "mumhain": 0.13 }, "age": { "twenties": 0.26, "": 0.25, "thirties": 0.36, "fourties": 0.08, "sixties": 0.01, "teens": 0.03, "fifties": 0.02 }, "gender": { "male": 0.59, "": 0.25, "female": 0.16 } }, "users": 80, "duration": 14632440, "buckets": { "dev": 444, "invalidated": 254, "other": 1166, "test": 486, "train": 555, "validated": 2717 }, "size": 110903924, "avgDurationSecs": 3.537, "validDurationSecs": 9609.944, "totalHrs": 4.06, "validHrs": 2.66 }, "kab": { "clips": 275021, "splits": { "accent": { "": 0.81, "other": 0.19 }, "age": { "fourties": 0.14, "thirties": 0.32, "": 0.27, "twenties": 0.19, "fifties": 0.03, "eighties": 0.01, "teens": 0, "sixties": 0.04, "seventies": 0 }, "gender": { "male": 0.55, "": 0.23, "female": 0.22, "other": 0 } }, "users": 693, "duration": 996003432, "buckets": { "dev": 11482, "invalidated": 12914, "other": 626, "test": 11483, "train": 37056, "validated": 261338 }, "size": 7723179003, "avgDurationSecs": 3.622, "validDurationSecs": 946449.707, "totalHrs": 276.66, "validHrs": 262.9 }, "ca": { "clips": 211172, "splits": { "accent": { "valencian": 0.07, "central": 0.58, "": 0.24, "other": 0, "balearic": 0.01, "northwestern": 0.06, "northern": 0.03 }, "age": { "thirties": 0.14, "fifties": 0.22, "fourties": 0.19, "twenties": 0.1, "": 0.22, "sixties": 0.11, "teens": 0.01, "seventies": 0, "eighties": 0 }, "gender": { "male": 0.43, "": 0.22, "female": 0.35, "other": 0 } }, "users": 3724, "duration": 1063996608, "buckets": { "dev": 12730, "invalidated": 8654, "other": 26257, "test": 12730, "train": 54173, "validated": 175625 }, "size": 
8180437356, "avgDurationSecs": 5.039, "validDurationSecs": 884891.957, "totalHrs": 295.55, "validHrs": 245.8 }, "zh-TW": { "clips": 70216, "splits": { "accent": { "": 0.57, "taipei_city": 0.12, "other": 0.11, "kaohsiung_city": 0.03, "tainan_city": 0.04, "new_taipei_city": 0.05, "yilan_county": 0, "taoyuan_city": 0.01, "hsinchu_city": 0.01, "chiayi_city": 0.01, "taichung_city": 0.01, "yunlin_county": 0.03, "nantou_county": 0, "changhua_county": 0.01, "pingtung_county": 0, "hsinchu_county": 0, "hong_kong": 0 }, "age": { "thirties": 0.28, "twenties": 0.39, "teens": 0.03, "": 0.23, "fifties": 0.01, "seventies": 0, "fourties": 0.05 }, "gender": { "male": 0.48, "": 0.23, "female": 0.26, "other": 0.03 } }, "users": 1108, "duration": 217510632, "buckets": { "dev": 2432, "invalidated": 3177, "other": 18071, "test": 2432, "train": 2849, "validated": 48968 }, "size": 1684558466, "avgDurationSecs": 3.098, "validDurationSecs": 151689.937, "totalHrs": 60.41, "validHrs": 42.13 }, "sl": { "clips": 6051, "splits": { "accent": { "other": 0.01, "": 0.99 }, "age": { "twenties": 0.67, "teens": 0.13, "": 0.04, "sixties": 0, "fifties": 0.13, "fourties": 0.01, "thirties": 0.01 }, "gender": { "female": 0.16, "male": 0.8, "": 0.04 } }, "users": 51, "duration": 23533392, "buckets": { "dev": 509, "invalidated": 142, "other": 2932, "test": 360, "train": 1566, "validated": 2972 }, "size": 183825568, "avgDurationSecs": 3.889, "validDurationSecs": 11558.625, "totalHrs": 6.53, "validHrs": 3.21 }, "it": { "clips": 80605, "splits": { "accent": { "": 0.92, "other": 0.08 }, "age": { "thirties": 0.14, "twenties": 0.23, "": 0.35, "fifties": 0.12, "fourties": 0.09, "seventies": 0.01, "sixties": 0.02, "teens": 0.04, "eighties": 0, "nineties": 0 }, "gender": { "female": 0.18, "male": 0.47, "": 0.35, "other": 0 } }, "users": 4292, "duration": 442751616, "buckets": { "dev": 8940, "invalidated": 8143, "other": 15314, "test": 8951, "train": 19387, "validated": 56009 }, "size": 3447602853, "avgDurationSecs": 
5.493, "validDurationSecs": 307649.343, "totalHrs": 122.98, "validHrs": 85.45 }, "nl": { "clips": 31209, "splits": { "accent": { "": 0.27, "netherlands": 0.62, "belgium": 0.11, "other": 0, "germany": 0, "france": 0 }, "age": { "": 0.21, "twenties": 0.29, "fourties": 0.12, "thirties": 0.15, "teens": 0.06, "fifties": 0.16, "sixties": 0.01 }, "gender": { "": 0.23, "male": 0.66, "female": 0.1, "other": 0 } }, "users": 701, "duration": 120456600, "buckets": { "dev": 1699, "invalidated": 1301, "other": 6513, "test": 1699, "train": 1893, "validated": 22954 }, "size": 927639565, "avgDurationSecs": 3.86, "validDurationSecs": 88594.982, "totalHrs": 33.46, "validHrs": 24.6 }, "cnh": { "clips": 4824, "splits": { "accent": { "": 0.82, "other": 0.18 }, "age": { "": 0.57, "twenties": 0.27, "fourties": 0.01, "teens": 0.02, "thirties": 0.1, "fifties": 0.02 }, "gender": { "": 0.57, "male": 0.23, "female": 0.2 } }, "users": 290, "duration": 18281064, "buckets": { "dev": 716, "invalidated": 477, "other": 2040, "test": 718, "train": 756, "validated": 2307 }, "size": 142825168, "avgDurationSecs": 3.79, "validDurationSecs": 8742.623, "totalHrs": 5.07, "validHrs": 2.42 }, "eo": { "clips": 29463, "splits": { "accent": { "": 0.33, "internacia": 0.66, "other": 0.01 }, "age": { "twenties": 0.46, "thirties": 0.06, "": 0.1, "fourties": 0.31, "fifties": 0.04, "seventies": 0, "teens": 0.02, "sixties": 0.01 }, "gender": { "male": 0.7, "": 0.23, "female": 0.07, "other": 0.01 } }, "users": 215, "duration": 148750728, "buckets": { "dev": 2965, "invalidated": 1527, "other": 2652, "test": 3030, "train": 3845, "validated": 25147 }, "size": 1159923722, "avgDurationSecs": 5.049, "validDurationSecs": 126960.41, "totalHrs": 41.31, "validHrs": 35.26 }, "et": { "clips": 7061, "splits": { "accent": { "": 1 }, "age": { "": 0.05, "thirties": 0.11, "twenties": 0.7, "fourties": 0.13, "fifties": 0, "seventies": 0, "teens": 0 }, "gender": { "": 0.05, "male": 0.57, "female": 0.38, "other": 0 } }, "users": 230, 
"duration": 47286360, "buckets": { "dev": 1576, "invalidated": 1377, "other": 5, "test": 1571, "train": 1750, "validated": 5679 }, "size": 365304398, "avgDurationSecs": 6.697, "validDurationSecs": 38031.332, "totalHrs": 13.13, "validHrs": 10.56 }, "fa": { "clips": 227458, "splits": { "accent": { "": 1 }, "age": { "": 0.2, "twenties": 0.31, "thirties": 0.43, "fifties": 0.02, "fourties": 0.02, "teens": 0.02, "sixties": 0 }, "gender": { "": 0.16, "male": 0.78, "female": 0.06, "other": 0 } }, "users": 2763, "duration": 919781952, "buckets": { "dev": 3445, "invalidated": 9019, "other": 29719, "test": 3445, "train": 4347, "validated": 188313 }, "size": 7090877468, "avgDurationSecs": 4.044, "validDurationSecs": 761489.588, "totalHrs": 255.49, "validHrs": 211.52 }, "eu": { "clips": 69746, "splits": { "accent": { "mendebalekoa": 0.26, "erdialdekoa_nafarra": 0.32, "": 0.42, "nafarlapurtarra_zuberoatarra": 0 }, "age": { "fourties": 0.15, "thirties": 0.07, "fifties": 0.15, "twenties": 0.36, "": 0.23, "teens": 0.01, "sixties": 0.01, "seventies": 0 }, "gender": { "male": 0.51, "female": 0.23, "": 0.24, "other": 0.02 } }, "users": 638, "duration": 357525936, "buckets": { "dev": 1981, "invalidated": 3384, "other": 20519, "test": 1981, "train": 2250, "validated": 45843 }, "size": 2779103409, "avgDurationSecs": 5.126, "validDurationSecs": 234996.437, "totalHrs": 99.31, "validHrs": 65.27 }, "es": { "clips": 148372, "splits": { "accent": { "nortepeninsular": 0.14, "": 0.43, "rioplatense": 0.06, "centrosurpeninsular": 0.03, "americacentral": 0.03, "surpeninsular": 0.13, "canario": 0, "andino": 0.06, "mexicano": 0.06, "chileno": 0.02, "caribe": 0.03, "filipinas": 0 }, "age": { "thirties": 0.12, "": 0.34, "fifties": 0.11, "twenties": 0.19, "teens": 0.03, "fourties": 0.08, "sixties": 0.13, "eighties": 0, "seventies": 0, "nineties": 0 }, "gender": { "male": 0.55, "": 0.34, "other": 0.01, "female": 0.1 } }, "users": 8252, "duration": 798568008, "buckets": { "dev": 13221, "invalidated": 
20340, "other": 14476, "test": 13221, "train": 64353, "validated": 112127 }, "size": 6226191576, "avgDurationSecs": 5.382, "validDurationSecs": 603490.113, "totalHrs": 221.82, "validHrs": 167.63 }, "zh-CN": { "clips": 19468, "splits": { "accent": { "110000": 0.05, "120000": 0, "130000": 0.04, "140000": 0, "150000": 0, "210000": 0.02, "220000": 0, "230000": 0.03, "310000": 0.11, "320000": 0.04, "330000": 0.07, "340000": 0.01, "350000": 0.02, "360000": 0.02, "370000": 0.03, "410000": 0.03, "420000": 0.01, "430000": 0.02, "440000": 0.03, "450000": 0.01, "500000": 0.03, "510000": 0.04, "520000": 0, "530000": 0, "610000": 0.01, "620000": 0, "640000": 0, "650000": 0.01, "710000": 0.01, "810000": 0, "": 0.35 }, "age": { "": 0.26, "teens": 0.11, "twenties": 0.4, "thirties": 0.19, "fourties": 0.04, "nineties": 0, "fifties": 0 }, "gender": { "": 0.26, "male": 0.64, "female": 0.1, "other": 0 } }, "users": 963, "duration": 111707304, "buckets": { "dev": 4843, "invalidated": 1948, "other": 612, "test": 4898, "train": 7079, "validated": 16898 }, "size": 873632587, "avgDurationSecs": 5.738, "validDurationSecs": 96960.655, "totalHrs": 31.02, "validHrs": 26.93 }, "mn": { "clips": 8262, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.28, "": 0.3, "twenties": 0.29, "fourties": 0.01, "teens": 0.03, "nineties": 0.09 }, "gender": { "male": 0.36, "": 0.3, "female": 0.25, "other": 0.09 } }, "users": 296, "duration": 44849040, "buckets": { "dev": 1761, "invalidated": 580, "other": 1311, "test": 1759, "train": 2018, "validated": 6371 }, "size": 350541861, "avgDurationSecs": 5.428, "validDurationSecs": 34584.027, "totalHrs": 12.45, "validHrs": 9.6 }, "sah": { "clips": 3860, "splits": { "accent": { "": 1 }, "age": { "": 0.35, "twenties": 0.02, "fourties": 0.07, "thirties": 0.44, "teens": 0.11, "fifties": 0 }, "gender": { "": 0.35, "male": 0.54, "female": 0.1 } }, "users": 37, "duration": 23069640, "buckets": { "dev": 246, "invalidated": 76, "other": 1448, "test": 668, "train": 1420, 
"validated": 2336 }, "size": 176687745, "avgDurationSecs": 5.977, "validDurationSecs": 13961.316, "totalHrs": 6.4, "validHrs": 3.87 }, "dv": { "clips": 6057, "splits": { "accent": { "": 1 }, "age": { "": 0.08, "twenties": 0.3, "thirties": 0.56, "fourties": 0.02, "teens": 0.04 }, "gender": { "": 0.08, "male": 0.28, "female": 0.64 } }, "users": 101, "duration": 32793336, "buckets": { "dev": 1278, "invalidated": 290, "other": 1753, "test": 1188, "train": 1538, "validated": 4014 }, "size": 252447683, "avgDurationSecs": 5.414, "validDurationSecs": 21732.285, "totalHrs": 9.1, "validHrs": 6.03 }, "rw": { "clips": 13402, "splits": { "accent": { "": 1 }, "age": { "": 0.51, "twenties": 0.36, "thirties": 0.01, "teens": 0.12, "fourties": 0 }, "gender": { "": 0.51, "male": 0.41, "female": 0.08 } }, "users": 129, "duration": 62166720, "buckets": { "dev": 62, "invalidated": 58, "other": 13141, "test": 59, "train": 78, "validated": 199 }, "size": 486618454, "avgDurationSecs": 4.639, "validDurationSecs": 923.084, "totalHrs": 17.26, "validHrs": 0.25 }, "sv-SE": { "clips": 7064, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.28, "": 0.18, "teens": 0.14, "fifties": 0.16, "twenties": 0.18, "fourties": 0.05 }, "gender": { "male": 0.74, "": 0.18, "female": 0.08 } }, "users": 99, "duration": 21861816, "buckets": { "dev": 1349, "invalidated": 175, "other": 991, "test": 1595, "train": 2152, "validated": 5898 }, "size": 161284616, "avgDurationSecs": 3.095, "validDurationSecs": 18253.255, "totalHrs": 6.07, "validHrs": 5.07 }, "ru": { "clips": 50631, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.54, "teens": 0.04, "": 0.06, "fourties": 0.2, "thirties": 0.09, "fifties": 0.07, "sixties": 0 }, "gender": { "male": 0.71, "": 0.06, "other": 0, "female": 0.23 } }, "users": 496, "duration": 275609496, "buckets": { "dev": 6110, "invalidated": 1825, "other": 710, "test": 6300, "train": 10352, "validated": 47987 }, "size": 2131930453, "avgDurationSecs": 5.443, "validDurationSecs": 
261216.9, "totalHrs": 76.55, "validHrs": 72.56 }, "id": { "clips": 3618, "splits": { "accent": { "": 1 }, "age": { "": 0.13, "twenties": 0.6, "thirties": 0.25, "teens": 0.02, "fifties": 0, "fourties": 0 }, "gender": { "": 0.13, "male": 0.82, "female": 0.04 } }, "users": 56, "duration": 13691112, "buckets": { "dev": 792, "invalidated": 110, "other": 627, "test": 844, "train": 1243, "validated": 2881 }, "size": 106786387, "avgDurationSecs": 3.784, "validDurationSecs": 10902.182, "totalHrs": 3.8, "validHrs": 3.02 }, "ar": { "clips": 12275, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.22, "": 0.28, "twenties": 0.43, "fourties": 0.01, "teens": 0.07 }, "gender": { "female": 0.24, "": 0.28, "male": 0.48, "other": 0 } }, "users": 228, "duration": 45769368, "buckets": { "dev": 1758, "invalidated": 851, "other": 4585, "test": 1695, "train": 2029, "validated": 6839 }, "size": 346519122, "avgDurationSecs": 3.729, "validDurationSecs": 25500.343, "totalHrs": 12.71, "validHrs": 7.08 }, "ta": { "clips": 3997, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.39, "thirties": 0.47, "": 0.11, "fourties": 0.03 }, "gender": { "male": 0.67, "": 0.2, "other": 0.03, "female": 0.1 } }, "users": 91, "duration": 17949792, "buckets": { "dev": 384, "invalidated": 184, "other": 1285, "test": 786, "train": 1358, "validated": 2528 }, "size": 139043206, "avgDurationSecs": 4.491, "validDurationSecs": 11352.783, "totalHrs": 4.98, "validHrs": 3.15 }, "ia": { "clips": 3466, "splits": { "accent": { "": 1 }, "age": { "seventies": 0.4, "fourties": 0.48, "": 0.04, "twenties": 0.04, "thirties": 0.02, "teens": 0.01 }, "gender": { "male": 0.94, "": 0.04, "female": 0.02 } }, "users": 12, "duration": 14255976, "buckets": { "dev": 111, "invalidated": 79, "other": 2409, "test": 236, "train": 631, "validated": 978 }, "size": 111363014, "avgDurationSecs": 4.113, "validDurationSecs": 4022.604, "totalHrs": 3.95, "validHrs": 1.11 }, "pt": { "clips": 24243, "splits": { "accent": { "": 1 }, "age": { 
"": 0.09, "twenties": 0.41, "teens": 0.02, "thirties": 0.46, "fourties": 0.01, "sixties": 0, "fifties": 0.01, "seventies": 0 }, "gender": { "": 0.09, "male": 0.89, "female": 0.02, "other": 0 } }, "users": 354, "duration": 106563624, "buckets": { "dev": 3318, "invalidated": 865, "other": 1192, "test": 4023, "train": 6014, "validated": 22131 }, "size": 774265004, "avgDurationSecs": 4.396, "validDurationSecs": 97280.022, "totalHrs": 29.6, "validHrs": 27.02 }, "lv": { "clips": 6488, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.6, "fourties": 0.04, "": 0.19, "twenties": 0.13, "teens": 0.04, "fifties": 0 }, "gender": { "male": 0.64, "female": 0.17, "": 0.19 } }, "users": 86, "duration": 22526328, "buckets": { "dev": 1125, "invalidated": 120, "other": 1270, "test": 1629, "train": 2336, "validated": 5098 }, "size": 168378556, "avgDurationSecs": 3.472, "validDurationSecs": 17700.25, "totalHrs": 6.25, "validHrs": 4.91 }, "ja": { "clips": 2512, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.67, "": 0.18, "teens": 0.07, "fifties": 0.06, "thirties": 0.01 }, "gender": { "male": 0.81, "": 0.18, "female": 0 } }, "users": 52, "duration": 11333208, "buckets": { "dev": 635, "invalidated": 73, "other": 1, "test": 684, "train": 1119, "validated": 2438 }, "size": 88830917, "avgDurationSecs": 4.512, "validDurationSecs": 10999.348, "totalHrs": 3.14, "validHrs": 3.05 }, "vot": { "clips": 108, "splits": { "accent": { "": 1 }, "age": { "": 1 }, "gender": { "": 1 } }, "users": 2, "duration": 440760, "buckets": { "invalidated": 0, "other": 108, "validated": 0 }, "size": 3446231, "avgDurationSecs": 4.081, "validDurationSecs": 0, "totalHrs": 0.12, "validHrs": 0 }, "ab": { "clips": 58, "splits": { "accent": { "": 1 }, "age": { "": 1 }, "gender": { "": 1 } }, "users": 3, "duration": 370272, "buckets": { "dev": 0, "invalidated": 0, "other": 56, "test": 0, "train": 2, "validated": 2 }, "size": 2902360, "avgDurationSecs": 6.384, "validDurationSecs": 12.768, "totalHrs": 0.1, 
"validHrs": 0 }, "zh-HK": { "clips": 462, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.22, "thirties": 0.12, "": 0.39, "teens": 0.01, "fifties": 0.02, "seventies": 0.15, "sixties": 0.09 }, "gender": { "male": 0.37, "": 0.39, "female": 0.24 } }, "users": 15, "duration": 2658216, "buckets": { "dev": 5, "invalidated": 9, "other": 407, "test": 14, "train": 27, "validated": 46 }, "size": 20859996, "avgDurationSecs": 5.754, "validDurationSecs": 264.671, "totalHrs": 0.73, "validHrs": 0.07 }, "rm-sursilv": { "clips": 20, "splits": { "accent": { "": 1 }, "age": { "": 1 }, "gender": { "": 1 } }, "users": 3, "duration": 108936, "buckets": { "dev": 4, "invalidated": 2, "other": 1, "test": 4, "train": 9, "validated": 17 }, "size": 857887, "avgDurationSecs": 5.447, "validDurationSecs": 92.596, "totalHrs": 0.03, "validHrs": 0.02 } }, "totalDuration": 15325973658, "totalValidDurationSecs": 12245003, "totalHrs": 4257, "totalValidHrs": 3401 } ================================================ FILE: datasets/scripted-speech/cv-corpus-5-2020-06-22.json ================================================ { "date": "2020-06-22", "locales": { "en": { "clips": 1429041, "splits": { "accent": { "": 0.52, "canada": 0.03, "england": 0.08, "us": 0.23, "indian": 0.05, "australia": 0.04, "malaysia": 0, "newzealand": 0.01, "african": 0.01, "ireland": 0.01, "philippines": 0, "singapore": 0, "scotland": 0.02, "hongkong": 0, "bermuda": 0, "southatlandtic": 0, "wales": 0, "other": 0.01 }, "age": { "": 0.37, "twenties": 0.23, "sixties": 0.04, "thirties": 0.14, "teens": 0.06, "seventies": 0.01, "fourties": 0.1, "fifties": 0.04, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male": 0.47, "female": 0.14, "other": 0.02 } }, "users": 61584, "duration": 6957991918, "reportedSentences": 1233, "size": 53752852490, "checksum": "d00f7dc59f890def0cc228ce6d9aa9b82553fdc9e0c7da05717776527c9aa809", "buckets": { "dev": 16014, "invalidated": 164254, "other": 189139, "reported": 424, "test": 16014, 
"train": 425685, "validated": 1074005 }, "avgDurationSecs": 4.869, "validDurationSecs": 5229323.798, "totalHrs": 1932.77, "validHrs": 1452.58 }, "de": { "clips": 553819, "splits": { "accent": { "russia": 0, "germany": 0.64, "": 0.32, "austria": 0.02, "liechtenstein": 0, "switzerland": 0.02, "france": 0, "other": 0, "poland": 0, "united_kingdom": 0, "hungary": 0, "netherlands": 0, "namibia": 0, "slovakia": 0, "united_states": 0, "italy": 0, "czechia": 0, "brazil": 0, "canada": 0, "turkey": 0, "slovenia": 0, "luxembourg": 0, "belgium": 0, "denmark": 0, "greece": 0 }, "age": { "twenties": 0.22, "fourties": 0.19, "": 0.23, "thirties": 0.17, "teens": 0.04, "sixties": 0.03, "fifties": 0.12, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "male": 0.66, "": 0.24, "female": 0.09, "other": 0.01 } }, "users": 11735, "duration": 2701658851, "reportedSentences": 2814, "size": 20939845241, "checksum": "2cb18869d8fcd94deac00e9605d129c322075c228c1bcffc6959856b245c1a5f", "buckets": { "dev": 15314, "invalidated": 29633, "other": 17418, "reported": 2823, "test": 15314, "train": 192047, "validated": 506608 }, "avgDurationSecs": 4.878, "validDurationSecs": 2471352.531, "totalHrs": 750.46, "validHrs": 686.48 }, "fr": { "clips": 450568, "splits": { "accent": { "": 0.33, "france": 0.61, "germany": 0, "belgium": 0.01, "switzerland": 0.01, "guadeloupe": 0, "reunion": 0, "monaco": 0, "tunisia": 0, "canada": 0.02, "other": 0, "mayotte": 0, "algeria": 0, "netherlands": 0, "senegal": 0, "martinique": 0, "portugal": 0, "united_states": 0, "cote_d_ivoire": 0, "st_pierre_et_miquelon": 0, "french_guiana": 0, "benin": 0, "italy": 0, "united_kingdom": 0, "madagascar": 0, "cameroon": 0, "new_caledonia": 0, "haiti": 0, "morocco": 0, "romania": 0, "congo_kinshasa": 0, "congo_brazzaville": 0 }, "age": { "twenties": 0.2, "thirties": 0.19, "": 0.3, "teens": 0.03, "fourties": 0.12, "fifties": 0.11, "sixties": 0.04, "seventies": 0.01, "eighties": 0 }, "gender": { "male": 0.65, "": 0.24, "female": 
0.11, "other": 0.01 } }, "users": 11962, "duration": 2175911452, "reportedSentences": 2629, "size": 16959245151, "checksum": "19198077adaee7ba9b5b961e24b5cff9e9c5a755189debe7069daab9370ee879", "buckets": { "dev": 15620, "invalidated": 34796, "other": 2928, "reported": 2653, "test": 15620, "train": 254707, "validated": 412353 }, "avgDurationSecs": 4.829, "validDurationSecs": 1991361.16, "totalHrs": 604.41, "validHrs": 553.15 }, "cy": { "clips": 74411, "splits": { "accent": { "united_kingdom": 0.47, "": 0.51, "other": 0.01 }, "age": { "fourties": 0.12, "twenties": 0.11, "sixties": 0.08, "fifties": 0.07, "": 0.48, "thirties": 0.11, "seventies": 0.01, "eighties": 0, "teens": 0.02 }, "gender": { "male": 0.3, "female": 0.22, "": 0.46, "other": 0.02 } }, "users": 1258, "duration": 347373624, "reportedSentences": 45, "size": 2679311148, "checksum": "2a186f59d4cab97152fdd156d9a05ea999a2832a139701b237b5a1405e30e89f", "buckets": { "dev": 2911, "invalidated": 3111, "other": 6891, "reported": 44, "test": 2914, "train": 3539, "validated": 64387 }, "avgDurationSecs": 4.668, "validDurationSecs": 300578.483, "totalHrs": 96.49, "validHrs": 83.49 }, "br": { "clips": 19112, "splits": { "accent": { "other": 0, "": 0.74, "kerneveg": 0.21, "gwenedeg": 0.04, "leoneg": 0 }, "age": { "twenties": 0.17, "": 0.38, "fifties": 0.05, "fourties": 0.07, "thirties": 0.08, "sixties": 0.24, "teens": 0 }, "gender": { "male": 0.6, "": 0.38, "female": 0.02 } }, "users": 146, "duration": 57218116, "reportedSentences": 84, "size": 436878482, "checksum": "0a825a520a5c9a6df0bf15a34452fe598b6f2001432a6ec0fc10c747be66dee5", "buckets": { "dev": 1907, "invalidated": 546, "other": 11180, "reported": 83, "test": 1927, "train": 2275, "validated": 7386 }, "avgDurationSecs": 2.994, "validDurationSecs": 22112.443, "totalHrs": 15.89, "validHrs": 6.14 }, "cv": { "clips": 5901, "splits": { "accent": { "other": 0, "": 1 }, "age": { "twenties": 0.41, "": 0.38, "fourties": 0.16, "thirties": 0.02, "teens": 0, "fifties": 0.03 
}, "gender": { "male": 0.59, "": 0.38, "female": 0.03 } }, "users": 53, "duration": 28727634, "reportedSentences": 34, "size": 214068114, "checksum": "fd494d0ac2611abe13147b84a42f40422c38593b8f978cacdd98f965ac3c839d", "buckets": { "dev": 524, "invalidated": 1028, "other": 2492, "reported": 33, "test": 574, "train": 787, "validated": 2381 }, "avgDurationSecs": 4.868, "validDurationSecs": 11591.34, "totalHrs": 7.97, "validHrs": 3.21 }, "tr": { "clips": 20513, "splits": { "accent": { "": 0.96, "other": 0.04 }, "age": { "": 0.22, "thirties": 0.23, "twenties": 0.47, "teens": 0.04, "fourties": 0.02, "fifties": 0.01, "sixties": 0 }, "gender": { "": 0.22, "male": 0.72, "female": 0.06 } }, "users": 642, "duration": 80107435, "reportedSentences": 73, "size": 614041312, "checksum": "214381a7bf432cf782dc6adb4f324eb6d3ecfe69b6dfb98ab248a61516135eae", "buckets": { "dev": 1638, "invalidated": 1694, "other": 284, "reported": 72, "test": 1638, "train": 1821, "validated": 18535 }, "avgDurationSecs": 3.905, "validDurationSecs": 72382.943, "totalHrs": 22.25, "validHrs": 20.1 }, "tt": { "clips": 27539, "splits": { "accent": { "": 1 }, "age": { "": 0.18, "thirties": 0.76, "twenties": 0.05, "sixties": 0, "fifties": 0.01, "teens": 0, "fourties": 0 }, "gender": { "": 0.18, "male": 0.8, "female": 0.02 } }, "users": 173, "duration": 102466992, "reportedSentences": 1, "size": 767997128, "checksum": "4cf4c008e3c22337f4c1b89e2f1899427e43cc562a5da29c7f176c18b05cb1d6", "buckets": { "dev": 2114, "invalidated": 274, "other": 1694, "reported": 0, "test": 4356, "train": 11174, "validated": 25571 }, "avgDurationSecs": 3.721, "validDurationSecs": 95144.466, "totalHrs": 28.46, "validHrs": 26.42 }, "ky": { "clips": 17302, "splits": { "accent": { "": 0.92, "other": 0.08 }, "age": { "thirties": 0.13, "": 0.06, "fourties": 0.02, "twenties": 0.78, "teens": 0.01 }, "gender": { "male": 0.46, "": 0.11, "female": 0.43, "other": 0 } }, "users": 126, "duration": 80463096, "reportedSentences": 1, "size": 575925131, 
"checksum": "d36730afb1bdf595e912d5c3905958f863d1d3eaf3e4ff4854bc75d4fadbc42e", "buckets": { "dev": 1511, "invalidated": 923, "other": 7249, "reported": 0, "test": 1453, "train": 1939, "validated": 9130 }, "avgDurationSecs": 4.651, "validDurationSecs": 42459.142, "totalHrs": 22.35, "validHrs": 11.79 }, "ga-IE": { "clips": 5397, "splits": { "accent": { "": 0.43, "connachta": 0.3, "mumhain": 0.14, "other": 0.06, "ulaidh": 0.08 }, "age": { "twenties": 0.39, "": 0.21, "thirties": 0.3, "fourties": 0.07, "sixties": 0, "teens": 0.02, "fifties": 0.01 }, "gender": { "male": 0.62, "": 0.21, "female": 0.18 } }, "users": 96, "duration": 18294074, "reportedSentences": 4, "size": 139330091, "checksum": "27fb97e66aa11e32ab9415cffc63bc149c279333a7b496ed7937c5495ddf0904", "buckets": { "dev": 488, "invalidated": 375, "other": 1783, "reported": 3, "test": 501, "train": 530, "validated": 3239 }, "avgDurationSecs": 3.39, "validDurationSecs": 10979.156, "totalHrs": 5.08, "validHrs": 3.04 }, "kab": { "clips": 553576, "splits": { "accent": { "": 0.9, "other": 0.1 }, "age": { "fourties": 0.18, "thirties": 0.26, "": 0.28, "twenties": 0.13, "fifties": 0.12, "eighties": 0, "teens": 0, "sixties": 0.03, "seventies": 0 }, "gender": { "male": 0.58, "": 0.26, "female": 0.16, "other": 0 } }, "users": 887, "duration": 1832133149, "reportedSentences": 575, "size": 14154660040, "checksum": "9fff3fb5ecdd5c443146a10bc14fca6707213024bc1eba5fe2455578122dd7f4", "buckets": { "dev": 13381, "invalidated": 16885, "other": 21061, "reported": 585, "test": 13384, "train": 68406, "validated": 515359 }, "avgDurationSecs": 3.31, "validDurationSecs": 1705648.922, "totalHrs": 508.92, "validHrs": 473.79 }, "ca": { "clips": 390461, "splits": { "accent": { "valencian": 0.06, "central": 0.63, "": 0.25, "other": 0, "balearic": 0.01, "northwestern": 0.04, "northern": 0.02 }, "age": { "thirties": 0.14, "fifties": 0.23, "fourties": 0.16, "twenties": 0.08, "": 0.19, "sixties": 0.19, "teens": 0.01, "seventies": 0, "eighties": 
0, "nineties": 0 }, "gender": { "male": 0.53, "": 0.19, "female": 0.28, "other": 0 } }, "users": 4872, "duration": 2086705172, "reportedSentences": 909, "size": 16094386076, "checksum": "bde3b841843a34affdbb3ba1fff5c66012d91baa453d5cd2c8220623aec5539f", "buckets": { "dev": 15291, "invalidated": 15457, "other": 57274, "reported": 911, "test": 15293, "train": 188896, "validated": 317693 }, "avgDurationSecs": 5.344, "validDurationSecs": 1697817.775, "totalHrs": 579.64, "validHrs": 471.61 }, "zh-TW": { "clips": 82757, "splits": { "accent": { "": 0.52, "taipei_city": 0.16, "other": 0.09, "kaohsiung_city": 0.03, "new_taipei_city": 0.05, "tainan_city": 0.04, "taichung_city": 0.03, "yilan_county": 0.01, "taoyuan_city": 0.01, "hsinchu_city": 0.01, "chiayi_city": 0, "yunlin_county": 0.03, "nantou_county": 0, "changhua_county": 0.01, "pingtung_county": 0.01, "hsinchu_county": 0, "hong_kong": 0, "chiayi_county": 0, "keelung_city": 0, "hualien_county": 0.01, "kinmen_county": 0 }, "age": { "thirties": 0.25, "twenties": 0.37, "teens": 0.03, "": 0.23, "fifties": 0.01, "seventies": 0, "fourties": 0.11, "sixties": 0 }, "gender": { "male": 0.48, "": 0.22, "female": 0.27, "other": 0.03 } }, "users": 1332, "duration": 265863297, "reportedSentences": 35, "size": 2055806145, "checksum": "5aeb2f5666d8ff35e6d79c8cb30c231496a27985877da7df81ecabc9e959d361", "buckets": { "dev": 2612, "invalidated": 3386, "other": 19564, "reported": 34, "test": 2612, "train": 3099, "validated": 59805 }, "avgDurationSecs": 3.213, "validDurationSecs": 192128.212, "totalHrs": 73.85, "validHrs": 53.36 }, "sl": { "clips": 7018, "splits": { "accent": { "other": 0.04, "": 0.96 }, "age": { "twenties": 0.63, "teens": 0.11, "": 0.05, "sixties": 0.03, "fifties": 0.12, "fourties": 0.01, "thirties": 0.05 }, "gender": { "female": 0.17, "male": 0.78, "": 0.05, "other": 0 } }, "users": 69, "duration": 27544553, "reportedSentences": 13, "size": 215257983, "checksum": 
"9b30b6ebbd87b6f0b8deda569cee6524a08fd9bda6afb5c188495b48f518f56f", "buckets": { "dev": 527, "invalidated": 85, "other": 2484, "reported": 12, "test": 777, "train": 1939, "validated": 4449 }, "avgDurationSecs": 3.925, "validDurationSecs": 17461.63, "totalHrs": 7.65, "validHrs": 4.85 }, "it": { "clips": 108951, "splits": { "accent": { "": 0.94, "other": 0.06 }, "age": { "thirties": 0.15, "twenties": 0.23, "": 0.34, "fifties": 0.1, "fourties": 0.12, "seventies": 0.01, "sixties": 0.03, "teens": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "female": 0.16, "male": 0.49, "": 0.34, "other": 0.01 } }, "users": 5447, "duration": 602618949, "reportedSentences": 559, "size": 4691723399, "checksum": "7189297db22597f5ea4a777b46788ceff8b94694a0ffb0f491ea552774d888ed", "buckets": { "dev": 11832, "invalidated": 10971, "other": 13808, "reported": 563, "test": 11835, "train": 41009, "validated": 83407 }, "avgDurationSecs": 5.531, "validDurationSecs": 461332.514, "totalHrs": 167.39, "validHrs": 128.14 }, "nl": { "clips": 41974, "splits": { "accent": { "": 0.31, "netherlands": 0.56, "belgium": 0.12, "other": 0, "germany": 0, "france": 0, "suriname": 0.01 }, "age": { "": 0.23, "twenties": 0.29, "fourties": 0.13, "thirties": 0.16, "teens": 0.05, "fifties": 0.13, "sixties": 0.01 }, "gender": { "": 0.24, "male": 0.62, "female": 0.13, "other": 0 } }, "users": 889, "duration": 163953358, "reportedSentences": 126, "size": 1260671455, "checksum": "7858f81c9f1db8313770fad83637e729bb6b31283bfa3b9aa78397fa3d73255d", "buckets": { "dev": 2198, "invalidated": 2746, "other": 178, "reported": 125, "test": 2198, "train": 2533, "validated": 38902 }, "avgDurationSecs": 3.906, "validDurationSecs": 151953.913, "totalHrs": 45.54, "validHrs": 42.2 }, "cnh": { "clips": 5799, "splits": { "accent": { "": 0.85, "other": 0.15 }, "age": { "": 0.51, "twenties": 0.36, "fourties": 0.01, "teens": 0.02, "thirties": 0.08, "fifties": 0.02 }, "gender": { "": 0.51, "male": 0.33, "female": 0.16 } }, "users": 297, 
"duration": 20663664, "reportedSentences": 9, "size": 161327068, "checksum": "1a066a880064eac517ccd0287ffc67cd43dd8a2662cd5564d45ec942df97de38", "buckets": { "dev": 755, "invalidated": 433, "other": 2936, "reported": 8, "test": 751, "train": 807, "validated": 2430 }, "avgDurationSecs": 3.563, "validDurationSecs": 8658.856, "totalHrs": 5.73, "validHrs": 2.4 }, "eo": { "clips": 58287, "splits": { "accent": { "": 0.31, "internacia": 0.69, "other": 0.01 }, "age": { "twenties": 0.35, "thirties": 0.1, "": 0.12, "fourties": 0.3, "fifties": 0.04, "seventies": 0.01, "teens": 0.03, "sixties": 0.02, "eighties": 0.04 }, "gender": { "male": 0.73, "": 0.18, "female": 0.09, "other": 0.01 } }, "users": 505, "duration": 321501358, "reportedSentences": 354, "size": 2504730609, "checksum": "7847f29bacb2181ae463fc42d89df27f0ed9177322ed4d3a7430ebd4d7c3ad0f", "buckets": { "dev": 8437, "invalidated": 3594, "other": 25, "reported": 355, "test": 8453, "train": 17287, "validated": 54654 }, "avgDurationSecs": 5.516, "validDurationSecs": 301462.337, "totalHrs": 89.3, "validHrs": 83.73 }, "et": { "clips": 14709, "splits": { "accent": { "": 1 }, "age": { "": 0.15, "thirties": 0.07, "twenties": 0.71, "fourties": 0.07, "fifties": 0.01, "seventies": 0, "teens": 0 }, "gender": { "": 0.15, "male": 0.6, "female": 0.25, "other": 0 } }, "users": 538, "duration": 98151487, "reportedSentences": 103, "size": 762100869, "checksum": "4d644e8f29980c865230a64f06a9115470cfe11c9e227a36a6f0ebd4039871c9", "buckets": { "dev": 2481, "invalidated": 3523, "other": 596, "reported": 102, "test": 2483, "train": 2925, "validated": 10590 }, "avgDurationSecs": 6.673, "validDurationSecs": 70665.868, "totalHrs": 27.26, "validHrs": 19.62 }, "fa": { "clips": 271732, "splits": { "accent": { "": 1 }, "age": { "": 0.21, "twenties": 0.33, "thirties": 0.4, "fifties": 0.02, "fourties": 0.03, "teens": 0.01, "sixties": 0 }, "gender": { "": 0.18, "male": 0.76, "female": 0.07, "other": 0 } }, "users": 3412, "duration": 1100290166, 
"reportedSentences": 1346, "size": 8470555472, "checksum": "c66040b84f1bb767f388a3feb059cbcf49f93fa2ff99c4db0e558acb695bf761", "buckets": { "dev": 3989, "invalidated": 11301, "other": 15568, "reported": 1349, "test": 3989, "train": 5247, "validated": 244736 }, "avgDurationSecs": 4.049, "validDurationSecs": 990978.663, "totalHrs": 305.63, "validHrs": 275.27 }, "eu": { "clips": 90857, "splits": { "accent": { "mendebalekoa": 0.31, "erdialdekoa_nafarra": 0.3, "": 0.39, "nafarlapurtarra_zuberoatarra": 0.01 }, "age": { "fourties": 0.13, "thirties": 0.07, "fifties": 0.14, "twenties": 0.39, "": 0.22, "teens": 0.03, "sixties": 0.01, "seventies": 0 }, "gender": { "male": 0.5, "female": 0.25, "": 0.23, "other": 0.02 } }, "users": 1003, "duration": 467076631, "reportedSentences": 38, "size": 3621478151, "checksum": "f07799913ea4426abeb8d527c30946f96ba71bb9438016daa0074fb425dcc00d", "buckets": { "dev": 4791, "invalidated": 5321, "other": 24099, "reported": 37, "test": 4791, "train": 6730, "validated": 61426 }, "avgDurationSecs": 5.141, "validDurationSecs": 315778.081, "totalHrs": 129.74, "validHrs": 87.71 }, "es": { "clips": 381613, "splits": { "accent": { "nortepeninsular": 0.1, "": 0.53, "rioplatense": 0.04, "centrosurpeninsular": 0.03, "americacentral": 0.02, "surpeninsular": 0.12, "canario": 0, "andino": 0.05, "mexicano": 0.06, "chileno": 0.02, "caribe": 0.03, "filipinas": 0 }, "age": { "thirties": 0.1, "": 0.34, "fifties": 0.09, "twenties": 0.28, "teens": 0.03, "fourties": 0.06, "sixties": 0.11, "eighties": 0, "seventies": 0, "nineties": 0 }, "gender": { "male": 0.46, "": 0.34, "other": 0, "female": 0.19 } }, "users": 18908, "duration": 1878616534, "reportedSentences": 982, "size": 14601483231, "checksum": "ee028e688ebd68f67c8c3933608c89063a2b44b41274b794b81f313771a8e755", "buckets": { "dev": 14838, "invalidated": 38052, "other": 133116, "reported": 989, "test": 14838, "train": 137034, "validated": 210339 }, "avgDurationSecs": 4.923, "validDurationSecs": 1035463.475, 
"totalHrs": 521.83, "validHrs": 287.62 }, "zh-CN": { "clips": 45427, "splits": { "accent": { "110000": 0.05, "120000": 0, "130000": 0.02, "140000": 0.01, "150000": 0, "210000": 0.01, "220000": 0, "230000": 0.04, "310000": 0.05, "320000": 0.04, "330000": 0.04, "340000": 0.01, "350000": 0.01, "360000": 0.02, "370000": 0.04, "410000": 0.03, "420000": 0.02, "430000": 0.01, "440000": 0.03, "450000": 0.01, "460000": 0.01, "500000": 0.02, "510000": 0.04, "520000": 0, "530000": 0, "610000": 0.01, "620000": 0, "640000": 0, "650000": 0, "710000": 0.01, "810000": 0, "": 0.46 }, "age": { "": 0.39, "teens": 0.08, "twenties": 0.36, "thirties": 0.13, "fourties": 0.03, "nineties": 0, "fifties": 0, "sixties": 0 }, "gender": { "": 0.39, "male": 0.51, "female": 0.1, "other": 0 } }, "users": 3333, "duration": 254593542, "reportedSentences": 239, "size": 1981965535, "checksum": "7649657d880594ff1b18662569f83d7028c80ada5960c9c73c726d0f2fe4cebd", "buckets": { "dev": 8223, "invalidated": 4879, "other": 7735, "reported": 239, "test": 8216, "train": 16266, "validated": 32783 }, "avgDurationSecs": 5.604, "validDurationSecs": 183730.823, "totalHrs": 70.72, "validHrs": 51.03 }, "mn": { "clips": 10868, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.21, "": 0.25, "twenties": 0.44, "fourties": 0.01, "teens": 0.02, "nineties": 0.07 }, "gender": { "male": 0.33, "": 0.25, "female": 0.35, "other": 0.07 } }, "users": 349, "duration": 59173272, "reportedSentences": 2, "size": 463039061, "checksum": "24f4167645422d6c4c919baeda026dd17b1c6d344ae326ff1fcb4d43c2e75a91", "buckets": { "dev": 1664, "invalidated": 632, "other": 2996, "reported": 1, "test": 1810, "train": 2180, "validated": 7239 }, "avgDurationSecs": 5.445, "validDurationSecs": 39414.365, "totalHrs": 16.43, "validHrs": 10.94 }, "sah": { "clips": 3916, "splits": { "accent": { "": 1 }, "age": { "": 0.36, "twenties": 0.02, "fourties": 0.07, "thirties": 0.44, "teens": 0.11, "fifties": 0 }, "gender": { "": 0.36, "male": 0.54, "female": 0.1 } 
}, "users": 40, "duration": 23414976, "size": 179490083, "checksum": "9de160b4b1dbbef46f082aac41c4bbf3a0a0bc6ffdc2d9714a6c1a3fbb673459", "buckets": { "dev": 395, "invalidated": 61, "other": 1314, "test": 705, "train": 1439, "validated": 2541 }, "avgDurationSecs": 5.979, "validDurationSecs": 15193.425, "totalHrs": 6.5, "validHrs": 4.22 }, "dv": { "clips": 7515, "splits": { "accent": { "": 1 }, "age": { "": 0.14, "twenties": 0.28, "thirties": 0.52, "fourties": 0.02, "teens": 0.04 }, "gender": { "": 0.14, "male": 0.25, "female": 0.61 } }, "users": 123, "duration": 41061552, "reportedSentences": 1, "size": 315850677, "checksum": "e3e1978ac2a6f16fe10490f6d81f3e906a7036be17c76ffe5cfcee881c080185", "buckets": { "dev": 1359, "invalidated": 389, "other": 1950, "reported": 0, "test": 1629, "train": 2178, "validated": 5176 }, "avgDurationSecs": 5.464, "validDurationSecs": 28281.383, "totalHrs": 11.4, "validHrs": 7.85 }, "rw": { "clips": 401076, "splits": { "accent": { "": 1 }, "age": { "": 0.05, "twenties": 0.54, "thirties": 0.03, "teens": 0.38, "fourties": 0 }, "gender": { "": 0.05, "male": 0.85, "female": 0.1, "other": 0 } }, "users": 340, "duration": 2108407752, "reportedSentences": 490, "size": 16543364566, "checksum": "24a1511952eef54c7d9e43f690eab1c1269aa5831e33a32fd921767d4225fc88", "buckets": { "dev": 14127, "invalidated": 177516, "other": 20604, "reported": 489, "test": 14048, "train": 101047, "validated": 202475 }, "avgDurationSecs": 5.257, "validDurationSecs": 1064386.449, "totalHrs": 585.66, "validHrs": 295.66 }, "sv-SE": { "clips": 12266, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.3, "": 0.18, "teens": 0.08, "fifties": 0.11, "twenties": 0.25, "fourties": 0.07, "sixties": 0 }, "gender": { "male": 0.72, "": 0.18, "female": 0.09, "other": 0.01 } }, "users": 178, "duration": 37547448, "reportedSentences": 27, "size": 281209017, "checksum": "5ea8d0213830e031d5217611747f81342c3ba528fce11ac6b666a033ffb894e3", "buckets": { "dev": 1776, "invalidated": 435, 
"other": 20, "reported": 26, "test": 1775, "train": 2110, "validated": 11515 }, "avgDurationSecs": 3.061, "validDurationSecs": 35248.562, "totalHrs": 10.42, "validHrs": 9.79 }, "ru": { "clips": 76582, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.48, "teens": 0.09, "": 0.08, "fourties": 0.17, "thirties": 0.12, "fifties": 0.05, "sixties": 0 }, "gender": { "male": 0.73, "": 0.09, "other": 0, "female": 0.18 } }, "users": 928, "duration": 419475986, "reportedSentences": 113, "size": 3256126691, "checksum": "6e4e172ef3e9bfe06158510f324edfcc3adeb52fb9e45a0a9e32a18cc0290091", "buckets": { "dev": 7244, "invalidated": 2748, "other": 4724, "reported": 119, "test": 7281, "train": 13009, "validated": 69094 }, "avgDurationSecs": 5.477, "validDurationSecs": 378460.654, "totalHrs": 116.52, "validHrs": 105.12 }, "id": { "clips": 14031, "splits": { "accent": { "": 1 }, "age": { "": 0.05, "twenties": 0.41, "thirties": 0.08, "teens": 0.42, "fifties": 0, "fourties": 0.04 }, "gender": { "": 0.05, "male": 0.61, "female": 0.21, "other": 0.13 } }, "users": 141, "duration": 54218808, "reportedSentences": 197, "size": 421995525, "checksum": "17e0d57aa405783c69efc0da791d3f6094f46835805616c076c78d2c526b242a", "buckets": { "dev": 1565, "invalidated": 384, "other": 6068, "reported": 196, "test": 1617, "train": 1912, "validated": 7578 }, "avgDurationSecs": 3.864, "validDurationSecs": 29283.025, "totalHrs": 15.06, "validHrs": 8.13 }, "ar": { "clips": 19098, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.24, "": 0.24, "twenties": 0.46, "fourties": 0.01, "teens": 0.05, "fifties": 0 }, "gender": { "female": 0.17, "": 0.23, "male": 0.6, "other": 0 } }, "users": 371, "duration": 70329391, "reportedSentences": 135, "size": 533528543, "checksum": "290fb2a8f1bd3c30c3bcfe1ad7d81f0f44d936e8c2b2c04ecfc90ea8ae68377d", "buckets": { "dev": 1833, "invalidated": 1460, "other": 7048, "reported": 134, "test": 1885, "train": 2186, "validated": 10590 }, "avgDurationSecs": 3.683, 
"validDurationSecs": 38998.233, "totalHrs": 19.53, "validHrs": 10.83 }, "ta": { "clips": 17942, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.48, "thirties": 0.42, "": 0.08, "fourties": 0.01, "seventies": 0, "fifties": 0.01, "teens": 0 }, "gender": { "male": 0.76, "": 0.09, "other": 0.01, "female": 0.14 } }, "users": 190, "duration": 74836349, "reportedSentences": 234, "size": 581764886, "checksum": "28352ead09c52ac2007c04e234ec25ae94c8afe68665075346aafd9a093589d7", "buckets": { "dev": 1662, "invalidated": 534, "other": 5272, "reported": 234, "test": 1635, "train": 1892, "validated": 12136 }, "avgDurationSecs": 4.171, "validDurationSecs": 50619.437, "totalHrs": 20.78, "validHrs": 14.06 }, "ia": { "clips": 6480, "splits": { "accent": { "": 1 }, "age": { "seventies": 0.26, "fourties": 0.6, "": 0.06, "twenties": 0.05, "thirties": 0.02, "teens": 0.01, "fifties": 0.01 }, "gender": { "male": 0.93, "": 0.06, "female": 0.01 } }, "users": 27, "duration": 26276745, "reportedSentences": 226, "size": 205193796, "checksum": "b7e859a1f6cb716fc03eec3f02abc36c36f42af2e80434eacb2ed5510065e643", "buckets": { "dev": 1512, "invalidated": 166, "other": 1438, "reported": 230, "test": 788, "train": 2574, "validated": 4875 }, "avgDurationSecs": 4.055, "validDurationSecs": 19768.385, "totalHrs": 7.29, "validHrs": 5.49 }, "pt": { "clips": 43921, "splits": { "accent": { "": 1 }, "age": { "": 0.1, "twenties": 0.37, "teens": 0.03, "thirties": 0.35, "fourties": 0.11, "sixties": 0, "fifties": 0.04, "seventies": 0 }, "gender": { "": 0.1, "male": 0.85, "female": 0.03, "other": 0.02 } }, "users": 744, "duration": 194178465, "reportedSentences": 1620, "size": 1454377422, "checksum": "144fca2c822b3ea49549cdf4779b0fb120984f3216142b792bf59861f9a3e80c", "buckets": { "dev": 4176, "invalidated": 1587, "other": 3245, "reported": 1623, "test": 4229, "train": 5760, "validated": 39072 }, "avgDurationSecs": 4.421, "validDurationSecs": 172740.625, "totalHrs": 53.93, "validHrs": 47.98 }, "lv": { 
"clips": 8057, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.49, "fourties": 0.03, "": 0.16, "twenties": 0.29, "teens": 0.03, "fifties": 0 }, "gender": { "male": 0.71, "female": 0.13, "": 0.16 } }, "users": 94, "duration": 27322937, "reportedSentences": 21, "size": 205701538, "checksum": "8a46e2059d23ce0d14e861db4f2f1e7fb2e8d68ef687111f9d65d198c8e47e78", "buckets": { "dev": 1901, "invalidated": 139, "other": 1622, "reported": 20, "test": 1837, "train": 2550, "validated": 6296 }, "avgDurationSecs": 3.391, "validDurationSecs": 21351.025, "totalHrs": 7.58, "validHrs": 5.93 }, "ja": { "clips": 8279, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.6, "": 0.21, "teens": 0.04, "fifties": 0.03, "thirties": 0.05, "fourties": 0.06, "sixties": 0 }, "gender": { "male": 0.76, "": 0.14, "female": 0.1, "other": 0 } }, "users": 194, "duration": 35700744, "reportedSentences": 54, "size": 277985639, "checksum": "303d267e2cbb04c5e6a23fa1f31037020877fe0e133ac213e6ca8d4b9a02be42", "buckets": { "dev": 1219, "invalidated": 1394, "other": 725, "reported": 54, "test": 1334, "train": 1642, "validated": 6158 }, "avgDurationSecs": 4.312, "validDurationSecs": 26554.558, "totalHrs": 9.91, "validHrs": 7.37 }, "vot": { "clips": 421, "splits": { "accent": { "": 1 }, "age": { "": 1 }, "gender": { "": 1 } }, "users": 3, "duration": 1008748, "size": 7792302, "checksum": "3a7c6088227e19996974c45e63f7b2415c37d24c3453de8298bd0b9c4ab6d326", "buckets": { "dev": 0, "invalidated": 0, "other": 418, "test": 0, "train": 3, "validated": 3 }, "avgDurationSecs": 2.396, "validDurationSecs": 7.188, "totalHrs": 0.28, "validHrs": 0 }, "ab": { "clips": 679, "splits": { "accent": { "": 1 }, "age": { "seventies": 0.66, "thirties": 0.33, "": 0 }, "gender": { "male": 0.66, "female": 0.33, "": 0 } }, "users": 7, "duration": 4511760, "reportedSentences": 38, "size": 35501298, "checksum": "0a6a6dbb04dd7e44326557cdd04a7a84d01066a260b4bae820a29cce1d908b3c", "buckets": { "dev": 0, "invalidated": 6, "other": 
647, "reported": 37, "test": 7, "train": 19, "validated": 26 }, "avgDurationSecs": 6.645, "validDurationSecs": 172.763, "totalHrs": 1.25, "validHrs": 0.04 }, "zh-HK": { "clips": 45485, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.13, "thirties": 0.1, "": 0.39, "teens": 0.02, "fifties": 0.04, "seventies": 0.01, "sixties": 0, "twenties": 0.3 }, "gender": { "male": 0.35, "": 0.39, "female": 0.24, "other": 0.02 } }, "users": 1588, "duration": 216898152, "reportedSentences": 183, "size": 1662950324, "checksum": "27c5b1ba46016462d2cc8bc6d44bd936923bc2e275a4ca1b139a0581fb995dee", "buckets": { "dev": 2805, "invalidated": 2535, "other": 14805, "reported": 187, "test": 2805, "train": 3379, "validated": 27643 }, "avgDurationSecs": 4.769, "validDurationSecs": 131817.426, "totalHrs": 60.24, "validHrs": 36.61 }, "rm-sursilv": { "clips": 6524, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.02, "twenties": 0.11, "": 0.64, "teens": 0.06, "fourties": 0.17 }, "gender": { "male": 0.15, "female": 0.2, "": 0.64, "other": 0 } }, "users": 78, "duration": 35885741, "reportedSentences": 10, "size": 275926141, "checksum": "15b766a09b1b2976221834f983a514459201719172efbfb9f8fda88a23055b37", "buckets": { "dev": 1141, "invalidated": 632, "other": 2180, "reported": 9, "test": 1199, "train": 1372, "validated": 3712 }, "avgDurationSecs": 5.501, "validDurationSecs": 20418.129, "totalHrs": 9.96, "validHrs": 5.67 }, "hsb": { "clips": 1601, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.57, "": 0.17, "thirties": 0.11, "sixties": 0, "seventies": 0.03, "twenties": 0.11 }, "gender": { "male": 0.83, "": 0.17, "other": 0 } }, "users": 18, "duration": 9760920, "reportedSentences": 57, "size": 76663497, "checksum": "f3ede992a0b3971b8b521fb32f70eaee09c5ec45369222249dd8d7c7ff121be4", "buckets": { "dev": 172, "invalidated": 227, "other": 15, "reported": 56, "test": 379, "train": 808, "validated": 1359 }, "avgDurationSecs": 6.097, "validDurationSecs": 8285.503, "totalHrs": 2.71, 
"validHrs": 2.3 }, "ro": { "clips": 6934, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.58, "teens": 0.04, "": 0.05, "fourties": 0.03, "sixties": 0.02, "twenties": 0.28, "fifties": 0 }, "gender": { "male": 0.72, "": 0.05, "female": 0.23 } }, "users": 79, "duration": 27449616, "reportedSentences": 55, "size": 212089647, "checksum": "aa77eccf349230a03077571f9c55c87eecd4a241b50eba2c631b5bdbfc4a7ceb", "buckets": { "dev": 396, "invalidated": 429, "other": 2317, "reported": 54, "test": 1303, "train": 2485, "validated": 4188 }, "avgDurationSecs": 3.959, "validDurationSecs": 16579.03, "totalHrs": 7.62, "validHrs": 4.6 }, "fy-NL": { "clips": 19904, "splits": { "accent": { "": 1 }, "age": { "": 0.54, "fifties": 0.1, "thirties": 0.07, "twenties": 0.04, "fourties": 0.08, "sixties": 0.18, "seventies": 0 }, "gender": { "": 0.54, "male": 0.11, "female": 0.35 } }, "users": 223, "duration": 99689179, "reportedSentences": 269, "size": 728057924, "checksum": "26a2bfc32e07541b5267e5f63ebb80eb24c3004c8aa537fa11651a24b638f7b2", "buckets": { "dev": 1954, "invalidated": 720, "other": 12474, "reported": 268, "test": 2090, "train": 2646, "validated": 6694 }, "avgDurationSecs": 5.008, "validDurationSecs": 33526.897, "totalHrs": 27.69, "validHrs": 9.31 }, "cs": { "clips": 26104, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.14, "": 0.38, "thirties": 0.13, "teens": 0.01, "twenties": 0.32, "fifties": 0.01, "sixties": 0.01 }, "gender": { "male": 0.6, "": 0.37, "female": 0.02 } }, "users": 252, "duration": 104759013, "reportedSentences": 416, "size": 811680292, "checksum": "2345c629a2ff036654a4c8de67c889e929c736accc84ddd19611a2a6dc9c98a9", "buckets": { "dev": 2460, "invalidated": 534, "other": 2039, "reported": 415, "test": 2473, "train": 2959, "validated": 23527 }, "avgDurationSecs": 4.013, "validDurationSecs": 94417.151, "totalHrs": 29.09, "validHrs": 26.22 }, "el": { "clips": 11396, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.66, "fourties": 0.19, "": 0.12, 
"twenties": 0.02, "fifties": 0.01, "teens": 0 }, "gender": { "male": 0.87, "": 0.12, "other": 0.01, "female": 0 } }, "users": 95, "duration": 47321468, "reportedSentences": 17, "size": 367892799, "checksum": "50ee2847e0db56c2655205e4f3b1af5f2753ca23437db1d8349999aa4d706386", "buckets": { "dev": 1172, "invalidated": 155, "other": 5605, "reported": 16, "test": 1572, "train": 2277, "validated": 5635 }, "avgDurationSecs": 4.152, "validDurationSecs": 23399.129, "totalHrs": 13.14, "validHrs": 6.49 }, "rm-vallader": { "clips": 2100, "splits": { "accent": { "": 1 }, "age": { "": 0.39, "fourties": 0.37, "twenties": 0.17, "thirties": 0.06, "fifties": 0 }, "gender": { "": 0.39, "male": 0.5, "female": 0.1, "other": 0.01 } }, "users": 34, "duration": 11824142, "reportedSentences": 20, "size": 92089215, "checksum": "e390c0af9c588fefdcc7ccf0e6115b8d4db79d662253cb141c7d4debf3083f86", "buckets": { "dev": 317, "invalidated": 347, "other": 637, "reported": 21, "test": 307, "train": 486, "validated": 1116 }, "avgDurationSecs": 5.631, "validDurationSecs": 6283.687, "totalHrs": 3.28, "validHrs": 1.74 }, "pl": { "clips": 100090, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.26, "": 0.23, "teens": 0.01, "thirties": 0.37, "fourties": 0.12, "fifties": 0, "nineties": 0.01, "sixties": 0 }, "gender": { "male": 0.61, "": 0.23, "female": 0.16, "other": 0 } }, "users": 2464, "duration": 431016122, "reportedSentences": 410, "size": 3262638775, "checksum": "c34df1bce89e1e9127292f6453674d8cc93a924217512cf75b6758fd529183bc", "buckets": { "dev": 4426, "invalidated": 4451, "other": 9080, "reported": 411, "test": 4426, "train": 6033, "validated": 86551 }, "avgDurationSecs": 4.306, "validDurationSecs": 372713.322, "totalHrs": 119.72, "validHrs": 103.53 }, "as": { "clips": 458, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.34, "": 0.64, "thirties": 0.02 }, "gender": { "male": 0.36, "": 0.64 } }, "users": 11, "duration": 2411664, "reportedSentences": 2, "size": 18951397, "checksum": 
"23cf2081bc20109042505d2b9268d37681f1a0d14f6b47ba3901120ba8d18ce0", "buckets": { "dev": 124, "invalidated": 23, "other": 0, "reported": 1, "test": 41, "train": 270, "validated": 435 }, "avgDurationSecs": 5.266, "validDurationSecs": 2290.554, "totalHrs": 0.66, "validHrs": 0.63 }, "uk": { "clips": 19535, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.22, "teens": 0.09, "": 0.13, "fourties": 0.26, "thirties": 0.3, "fifties": 0.01 }, "gender": { "male": 0.7, "female": 0.16, "": 0.14 } }, "users": 235, "duration": 93583387, "reportedSentences": 88, "size": 727416894, "checksum": "3a7a42beef9b681aaade452fe361debf1fa7bf4b6a7f86160a2282e451210505", "buckets": { "dev": 1669, "invalidated": 603, "other": 2373, "reported": 87, "test": 1671, "train": 1862, "validated": 16559 }, "avgDurationSecs": 4.791, "validDurationSecs": 79326.711, "totalHrs": 25.99, "validHrs": 22.03 }, "mt": { "clips": 11077, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.17, "": 0.22, "fourties": 0.18, "thirties": 0.09, "teens": 0.03, "fifties": 0.3, "sixties": 0.01 }, "gender": { "male": 0.26, "": 0.22, "female": 0.51, "other": 0.01 } }, "users": 155, "duration": 52279224, "reportedSentences": 4, "size": 397528588, "checksum": "b619420e757b0f0b63b43043d8d3164e982a224b0a7bf5d80643076e3ef7e73c", "buckets": { "dev": 1402, "invalidated": 308, "other": 5355, "reported": 3, "test": 1562, "train": 1937, "validated": 5414 }, "avgDurationSecs": 4.72, "validDurationSecs": 25552.019, "totalHrs": 14.52, "validHrs": 7.09 }, "ka": { "clips": 2176, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.61, "thirties": 0.11, "": 0.26, "fourties": 0.03 }, "gender": { "male": 0.37, "female": 0.38, "": 0.25 } }, "users": 35, "duration": 12460944, "reportedSentences": 3, "size": 92191608, "checksum": "90d2b9ebaef6a11c21b3a0c2230fc2385c3c8c85195a64ffea64cca4da6c6ac8", "buckets": { "dev": 621, "invalidated": 110, "other": 0, "reported": 2, "test": 653, "train": 775, "validated": 2066 }, "avgDurationSecs": 
5.727, "validDurationSecs": 11831.025, "totalHrs": 3.46, "validHrs": 3.28 }, "pa-IN": { "clips": 174, "splits": { "accent": { "": 1 }, "age": { "": 0.21, "fourties": 0.62, "fifties": 0.14, "thirties": 0.03 }, "gender": { "": 0.21, "male": 0.79 } }, "users": 8, "duration": 826512, "reportedSentences": 56, "size": 6413022, "checksum": "0e29fa7342c48f92e44080db25c313a5b190173798f322f3b31ea8777d1432e6", "buckets": { "dev": 0, "invalidated": 12, "other": 126, "reported": 55, "test": 9, "train": 26, "validated": 35 }, "avgDurationSecs": 4.75, "validDurationSecs": 166.252, "totalHrs": 0.22, "validHrs": 0.04 }, "or": { "clips": 4061, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.11, "": 0.04, "thirties": 0.85, "fourties": 0 }, "gender": { "male": 0.96, "": 0.04, "female": 0 } }, "users": 30, "duration": 20706672, "reportedSentences": 5, "size": 161413409, "checksum": "c921cc19110e907e61f8672e65f75a870fc06806e78d5f30b226f247f24fd1fc", "buckets": { "dev": 127, "invalidated": 60, "other": 3475, "reported": 4, "test": 84, "train": 310, "validated": 521 }, "avgDurationSecs": 5.099, "validDurationSecs": 2656.532, "totalHrs": 5.75, "validHrs": 0.73 }, "vi": { "clips": 220, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.09, "twenties": 0.74, "": 0.15, "teens": 0.02 }, "gender": { "male": 0.69, "": 0.15, "female": 0.16 } }, "users": 14, "duration": 919680, "reportedSentences": 1, "size": 7152005, "checksum": "302cde4244bc09367fcdb915e628ebcc9e08c6e0856440d24c38c1f9bab49b20", "buckets": { "dev": 34, "invalidated": 17, "other": 95, "reported": 0, "test": 35, "train": 39, "validated": 108 }, "avgDurationSecs": 4.18, "validDurationSecs": 451.479, "totalHrs": 0.25, "validHrs": 0.12 } }, "totalDuration": 26015182421, "totalValidDurationSecs": 20130561, "totalHrs": 7226, "totalValidHrs": 5591 } ================================================ FILE: datasets/scripted-speech/cv-corpus-5-singleword.json ================================================ { "date": 
"2020-06-22", "locales": { "es": { "clips": 69284, "splits": { "accent": { "surpeninsular": 0.01, "": 0.8, "americacentral": 0.02, "centrosurpeninsular": 0.02, "mexicano": 0.05, "caribe": 0.02, "nortepeninsular": 0.02, "andino": 0.04, "rioplatense": 0.02, "chileno": 0.01, "canario": 0, "filipinas": 0 }, "age": { "sixties": 0.01, "": 0.74, "thirties": 0.05, "fifties": 0.02, "fourties": 0.04, "twenties": 0.11, "teens": 0.04, "seventies": 0, "nineties": 0, "eighties": 0 }, "gender": { "male": 0.17, "": 0.74, "female": 0.09, "other": 0 } }, "users": 8970, "duration": 200607380, "buckets": { "dev": 4, "invalidated": 6433, "other": 40532, "reported": 989, "test": 4, "train": 6, "validated": 22290 }, "reportedSentences": 982, "avgDurationSecs": 2.895, "validDurationSecs": 64539.266, "totalHrs": 55.72, "validHrs": 17.92 }, "pl": { "clips": 929, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.23, "thirties": 0.15, "fourties": 0.03, "teens": 0.09, "": 0.47, "sixties": 0.03 }, "gender": { "female": 0.13, "male": 0.39, "": 0.47, "other": 0.01 } }, "users": 88, "duration": 2129666, "buckets": { "dev": 4, "invalidated": 30, "other": 0, "reported": 410, "test": 4, "train": 6, "validated": 898 }, "reportedSentences": 410, "avgDurationSecs": 2.292, "validDurationSecs": 2058.601, "totalHrs": 0.59, "validHrs": 0.57 }, "ca": { "clips": 2372, "splits": { "accent": { "valencian": 0.12, "": 0.44, "central": 0.39, "northern": 0.01, "northwestern": 0.03, "balearic": 0.01 }, "age": { "fifties": 0.1, "": 0.4, "sixties": 0.04, "thirties": 0.13, "fourties": 0.13, "twenties": 0.13, "teens": 0.05, "seventies": 0.02 }, "gender": { "male": 0.44, "": 0.4, "female": 0.16, "other": 0.01 } }, "users": 214, "duration": 5963630, "buckets": { "dev": 4, "invalidated": 120, "other": 21, "reported": 911, "test": 4, "train": 6, "validated": 2231 }, "reportedSentences": 909, "avgDurationSecs": 2.514, "validDurationSecs": 5609.131, "totalHrs": 1.65, "validHrs": 1.55 }, "tr": { "clips": 408, "splits": { 
"accent": { "": 1 }, "age": { "": 0.47, "fourties": 0.07, "twenties": 0.29, "teens": 0.03, "thirties": 0.13 }, "gender": { "": 0.47, "male": 0.46, "female": 0.07 } }, "users": 46, "duration": 962552, "buckets": { "dev": 4, "invalidated": 34, "other": 31, "reported": 72, "test": 4, "train": 6, "validated": 343 }, "reportedSentences": 73, "avgDurationSecs": 2.359, "validDurationSecs": 809.204, "totalHrs": 0.26, "validHrs": 0.22 }, "ru": { "clips": 1556, "splits": { "accent": { "": 1 }, "age": { "": 0.42, "teens": 0.14, "thirties": 0.15, "twenties": 0.26, "fourties": 0.04 }, "gender": { "": 0.44, "male": 0.5, "female": 0.05, "other": 0.01 } }, "users": 152, "duration": 3548113, "buckets": { "dev": 4, "invalidated": 80, "other": 0, "reported": 119, "test": 4, "train": 6, "validated": 1476 }, "reportedSentences": 113, "avgDurationSecs": 2.28, "validDurationSecs": 3365.691, "totalHrs": 0.98, "validHrs": 0.93 }, "pt": { "clips": 1079, "splits": { "accent": { "": 1 }, "age": { "": 0.39, "fourties": 0.12, "twenties": 0.29, "thirties": 0.18, "fifties": 0.02, "sixties": 0, "teens": 0.01 }, "gender": { "": 0.39, "male": 0.54, "female": 0.06, "other": 0.01 } }, "users": 116, "duration": 2792491, "buckets": { "dev": 4, "invalidated": 66, "other": 68, "reported": 1623, "test": 4, "train": 6, "validated": 945 }, "reportedSentences": 1620, "avgDurationSecs": 2.588, "validDurationSecs": 2445.694, "totalHrs": 0.77, "validHrs": 0.67 }, "nl": { "clips": 475, "splits": { "accent": { "suriname": 0.03, "netherlands": 0.37, "": 0.42, "belgium": 0.17 }, "age": { "": 0.4, "twenties": 0.36, "sixties": 0, "thirties": 0.08, "fifties": 0.06, "fourties": 0.1 }, "gender": { "male": 0.55, "female": 0.05, "": 0.4 } }, "users": 49, "duration": 1118014, "buckets": { "dev": 4, "invalidated": 28, "other": 0, "reported": 125, "test": 4, "train": 6, "validated": 447 }, "reportedSentences": 126, "avgDurationSecs": 2.354, "validDurationSecs": 1052.11, "totalHrs": 0.31, "validHrs": 0.29 }, "de": { "clips": 
13589, "splits": { "accent": { "germany": 0.25, "austria": 0.02, "": 0.71, "france": 0, "italy": 0, "switzerland": 0.01, "united_states": 0, "russia": 0, "united_kingdom": 0 }, "age": { "fourties": 0.04, "fifties": 0.06, "twenties": 0.12, "thirties": 0.07, "": 0.63, "seventies": 0.01, "teens": 0.06, "sixties": 0.02, "eighties": 0 }, "gender": { "male": 0.3, "female": 0.06, "": 0.63, "other": 0 } }, "users": 1465, "duration": 35864336, "buckets": { "dev": 4, "invalidated": 1322, "other": 69, "reported": 2823, "test": 4, "train": 6, "validated": 12191 }, "reportedSentences": 2814, "avgDurationSecs": 2.639, "validDurationSecs": 32174.709, "totalHrs": 9.96, "validHrs": 8.93 }, "ta": { "clips": 247, "splits": { "accent": { "": 1 }, "age": { "": 0.47, "fifties": 0.06, "fourties": 0, "twenties": 0.23, "thirties": 0.19, "teens": 0.04 }, "gender": { "": 0.52, "male": 0.31, "female": 0.17 } }, "users": 32, "duration": 738774, "buckets": { "dev": 4, "invalidated": 33, "other": 11, "reported": 234, "test": 4, "train": 6, "validated": 203 }, "reportedSentences": 234, "avgDurationSecs": 2.991, "validDurationSecs": 607.171, "totalHrs": 0.2, "validHrs": 0.16 }, "ar": { "clips": 340, "splits": { "accent": { "": 1 }, "age": { "": 0.41, "thirties": 0.1, "twenties": 0.33, "teens": 0.08, "fifties": 0.04, "fourties": 0.04 }, "gender": { "": 0.4, "male": 0.53, "female": 0.07 } }, "users": 45, "duration": 884372, "buckets": { "dev": 4, "invalidated": 24, "other": 0, "reported": 134, "test": 4, "train": 6, "validated": 316 }, "reportedSentences": 135, "avgDurationSecs": 2.601, "validDurationSecs": 821.946, "totalHrs": 0.24, "validHrs": 0.22 }, "en": { "clips": 43373, "splits": { "accent": { "": 0.76, "us": 0.11, "england": 0.04, "indian": 0.04, "australia": 0.01, "canada": 0.01, "newzealand": 0, "scotland": 0, "hongkong": 0.01, "ireland": 0, "singapore": 0, "philippines": 0, "malaysia": 0, "african": 0, "wales": 0 }, "age": { "": 0.62, "twenties": 0.18, "teens": 0.05, "fourties": 0.04, 
"thirties": 0.07, "sixties": 0.01, "fifties": 0.03, "seventies": 0.01, "nineties": 0, "eighties": 0 }, "gender": { "": 0.62, "female": 0.09, "male": 0.28, "other": 0.01 } }, "users": 5183, "duration": 117342212, "buckets": { "dev": 4, "invalidated": 3454, "other": 13838, "reported": 1268, "test": 4, "train": 6, "validated": 26070 }, "reportedSentences": 1233, "avgDurationSecs": 2.705, "validDurationSecs": 70530.318, "totalHrs": 32.59, "validHrs": 19.59 }, "fr": { "clips": 20673, "splits": { "accent": { "france": 0.21, "": 0.76, "germany": 0, "canada": 0.01, "united_kingdom": 0, "belgium": 0.01, "switzerland": 0.01, "reunion": 0, "senegal": 0, "martinique": 0, "congo_brazzaville": 0, "cameroon": 0, "united_states": 0, "madagascar": 0, "romania": 0, "benin": 0 }, "age": { "sixties": 0.01, "": 0.7, "twenties": 0.1, "thirties": 0.06, "fourties": 0.05, "teens": 0.05, "fifties": 0.03, "seventies": 0 }, "gender": { "male": 0.23, "": 0.7, "female": 0.06, "other": 0.01 } }, "users": 2382, "duration": 54892201, "buckets": { "dev": 4, "invalidated": 2181, "other": 2021, "reported": 2653, "test": 4, "train": 6, "validated": 16467 }, "reportedSentences": 2629, "avgDurationSecs": 2.655, "validDurationSecs": 43724.175, "totalHrs": 15.24, "validHrs": 12.14 }, "cy": { "clips": 263, "splits": { "accent": { "united_kingdom": 0.63, "": 0.37 }, "age": { "twenties": 0.25, "thirties": 0.11, "": 0.3, "teens": 0.08, "fifties": 0.11, "fourties": 0.08, "sixties": 0.04, "seventies": 0.04 }, "gender": { "female": 0.34, "male": 0.32, "": 0.33 } }, "users": 28, "duration": 699792, "buckets": { "dev": 3, "invalidated": 8, "other": 23, "reported": 44, "test": 3, "train": 4, "validated": 232 }, "reportedSentences": 45, "avgDurationSecs": 2.661, "validDurationSecs": 617.307, "totalHrs": 0.19, "validHrs": 0.17 }, "eo": { "clips": 802, "splits": { "accent": { "": 0.66, "internacia": 0.34 }, "age": { "": 0.45, "fourties": 0.04, "twenties": 0.27, "teens": 0.06, "eighties": 0.01, "sixties": 0.03, 
"thirties": 0.13, "fifties": 0.01 }, "gender": { "": 0.46, "male": 0.47, "other": 0.01, "female": 0.06 } }, "users": 76, "duration": 1935968, "buckets": { "dev": 3, "invalidated": 32, "other": 0, "reported": 354, "test": 3, "train": 6, "validated": 770 }, "reportedSentences": 354, "avgDurationSecs": 2.414, "validDurationSecs": 1858.722, "totalHrs": 0.53, "validHrs": 0.51 }, "ja": { "clips": 525, "splits": { "accent": { "": 1 }, "age": { "teens": 0.04, "twenties": 0.53, "": 0.29, "fifties": 0.02, "fourties": 0.05, "thirties": 0.07 }, "gender": { "male": 0.43, "female": 0.28, "": 0.27, "other": 0.02 } }, "users": 62, "duration": 1200960, "buckets": { "dev": 3, "invalidated": 22, "other": 0, "reported": 53, "test": 3, "train": 6, "validated": 503 }, "reportedSentences": 54, "avgDurationSecs": 2.288, "validDurationSecs": 1150.634, "totalHrs": 0.33, "validHrs": 0.31 }, "id": { "clips": 110, "splits": { "accent": { "": 1 }, "age": { "": 0.26, "twenties": 0.63, "teens": 0.11 }, "gender": { "": 0.26, "male": 0.54, "female": 0.11, "other": 0.09 } }, "users": 11, "duration": 243816, "buckets": { "dev": 3, "invalidated": 9, "other": 0, "reported": 196, "test": 3, "train": 6, "validated": 101 }, "reportedSentences": 197, "avgDurationSecs": 2.217, "validDurationSecs": 223.867, "totalHrs": 0.06, "validHrs": 0.06 }, "eu": { "clips": 467, "splits": { "accent": { "": 0.89, "mendebalekoa": 0.05, "erdialdekoa_nafarra": 0.06 }, "age": { "": 0.86, "teens": 0.02, "fourties": 0.05, "fifties": 0.05, "thirties": 0.03 }, "gender": { "": 0.86, "female": 0.06, "male": 0.08 } }, "users": 62, "duration": 1178544, "buckets": { "dev": 3, "invalidated": 12, "other": 284, "reported": 37, "test": 3, "train": 6, "validated": 170 }, "reportedSentences": 38, "avgDurationSecs": 2.524, "validDurationSecs": 429.02, "totalHrs": 0.32, "validHrs": 0.11 }, "tt": { "clips": 29, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.83, "": 0.17 }, "gender": { "male": 0.83, "": 0.17 } }, "users": 3, 
"duration": 69672, "buckets": { "dev": 0, "invalidated": 1, "other": 23, "reported": 0, "test": 1, "train": 4, "validated": 5 }, "reportedSentences": 1, "avgDurationSecs": 2.402, "validDurationSecs": 12.012, "totalHrs": 0.01, "validHrs": 0 }, "cv-corpus-5-singleword": null }, "overall": { "size": 3250877247, "checksum": "0c0cd77a1e412f40a507301f098bc192c4608edc7bd05e7e6973f8519a7a88d8" }, "totalDuration": 432172493, "totalValidDurationSecs": 232029, "totalHrs": 120, "totalValidHrs": 64 } ================================================ FILE: datasets/scripted-speech/cv-corpus-5.1-2020-06-22.json ================================================ { "date": "2020-06-22", "locales": { "en": { "size": 53753543765, "checksum": "cb5903dc0775f96de81cd2fbaae4198be90a566cf3e9a0b2501111893d1dc98e", "duration": 6957981334, "buckets": { "dev": 16029, "invalidated": 166816, "other": 175084, "reported": 1354, "test": 16029, "train": 435947, "validated": 1085494 }, "reportedSentences": 1318, "clips": 1427394, "splits": { "accent": { "": 0.51, "canada": 0.03, "england": 0.08, "us": 0.23, "indian": 0.05, "australia": 0.04, "malaysia": 0, "newzealand": 0.01, "african": 0.01, "ireland": 0.01, "philippines": 0, "singapore": 0, "scotland": 0.02, "hongkong": 0, "bermuda": 0, "southatlandtic": 0, "wales": 0, "other": 0.01 }, "age": { "": 0.37, "twenties": 0.23, "sixties": 0.04, "thirties": 0.14, "teens": 0.06, "seventies": 0.01, "fourties": 0.1, "fifties": 0.04, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male": 0.47, "female": 0.14, "other": 0.02 } }, "users": 61528, "avgDurationSecs": 4.875, "validDurationSecs": 5291354.027, "totalHrs": 1932.77, "validHrs": 1469.82 }, "de": { "size": 20940330454, "checksum": "be5f792b9a3916c030918f9ad655cc434ecd78038a8f1af3f5995fa78f10a778", "buckets": { "dev": 15341, "invalidated": 29792, "other": 12783, "reported": 2905, "test": 15341, "train": 196404, "validated": 511084 }, "reportedSentences": 2896, "clips": 553659, "splits": { "accent": { 
"russia": 0, "germany": 0.64, "": 0.31, "austria": 0.02, "liechtenstein": 0, "switzerland": 0.02, "france": 0, "other": 0, "poland": 0, "united_kingdom": 0, "hungary": 0, "netherlands": 0, "namibia": 0, "slovakia": 0, "united_states": 0, "italy": 0, "czechia": 0, "brazil": 0, "canada": 0, "turkey": 0, "slovenia": 0, "luxembourg": 0, "belgium": 0, "denmark": 0, "greece": 0 }, "age": { "twenties": 0.22, "fourties": 0.19, "": 0.23, "thirties": 0.17, "teens": 0.04, "sixties": 0.03, "fifties": 0.12, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "male": 0.66, "": 0.24, "female": 0.09, "other": 0.01 } }, "users": 11731, "duration": 2701658851, "avgDurationSecs": 4.88, "validDurationSecs": 2493908.005, "totalHrs": 750.46, "validHrs": 692.75 }, "fr": { "size": 16959458032, "checksum": "ffda45f2006fb6092fb435c786cde422e38183f7837e9faa65cb273439cf369e", "buckets": { "dev": 15621, "invalidated": 34853, "other": 2135, "reported": 2679, "test": 15621, "train": 254863, "validated": 413089 }, "reportedSentences": 2655, "clips": 450077, "splits": { "accent": { "": 0.33, "france": 0.61, "germany": 0, "belgium": 0.01, "switzerland": 0.01, "guadeloupe": 0, "reunion": 0, "monaco": 0, "tunisia": 0, "canada": 0.02, "other": 0, "mayotte": 0, "algeria": 0, "netherlands": 0, "senegal": 0, "martinique": 0, "portugal": 0, "united_states": 0, "cote_d_ivoire": 0, "st_pierre_et_miquelon": 0, "french_guiana": 0, "benin": 0, "italy": 0, "united_kingdom": 0, "madagascar": 0, "cameroon": 0, "new_caledonia": 0, "haiti": 0, "morocco": 0, "romania": 0, "congo_kinshasa": 0, "congo_brazzaville": 0 }, "age": { "twenties": 0.2, "thirties": 0.19, "": 0.3, "teens": 0.03, "fourties": 0.12, "fifties": 0.11, "sixties": 0.04, "seventies": 0.01, "eighties": 0 }, "gender": { "male": 0.65, "": 0.24, "female": 0.11, "other": 0.01 } }, "users": 11960, "duration": 2175911452, "avgDurationSecs": 4.835, "validDurationSecs": 1997091.799, "totalHrs": 604.41, "validHrs": 554.74 }, "cy": { "size": 2679332290, 
"checksum": "b32f8d86e2cc6e9619a417b9e7a11a27ec10d125e43f5307251bb8e06672567e", "buckets": { "dev": 2933, "invalidated": 3122, "other": 6724, "reported": 44, "test": 2937, "train": 3572, "validated": 64543 }, "reportedSentences": 45, "clips": 74389, "splits": { "accent": { "united_kingdom": 0.47, "": 0.51, "other": 0.01 }, "age": { "fourties": 0.12, "twenties": 0.11, "sixties": 0.08, "fifties": 0.07, "": 0.48, "thirties": 0.11, "seventies": 0.01, "eighties": 0, "teens": 0.02 }, "gender": { "male": 0.3, "female": 0.22, "": 0.46, "other": 0.02 } }, "users": 1257, "duration": 347373624, "avgDurationSecs": 4.67, "validDurationSecs": 301395.849, "totalHrs": 96.49, "validHrs": 83.72 }, "br": { "size": 436877699, "checksum": "619ca871b3d5ebbd1e1ee8e902417f2b190584aa881b0356295460ca76c64e73", "buckets": { "dev": 1922, "invalidated": 555, "other": 10998, "reported": 84, "test": 1953, "train": 2389, "validated": 7559 }, "reportedSentences": 85, "clips": 19112, "splits": { "accent": { "other": 0, "": 0.74, "kerneveg": 0.21, "gwenedeg": 0.04, "leoneg": 0 }, "age": { "twenties": 0.17, "": 0.38, "fifties": 0.05, "fourties": 0.07, "thirties": 0.08, "sixties": 0.24, "teens": 0 }, "gender": { "male": 0.6, "": 0.38, "female": 0.02 } }, "users": 146, "duration": 57218116, "avgDurationSecs": 2.994, "validDurationSecs": 22630.376, "totalHrs": 15.89, "validHrs": 6.28 }, "cv": { "size": 214074733, "checksum": "94dbfd1f7ee029ef9d8ec175df358cc7bb1332de291a7a9e05e8e06516c58c40", "buckets": { "dev": 529, "invalidated": 1028, "other": 2449, "reported": 33, "test": 584, "train": 805, "validated": 2424 }, "reportedSentences": 34, "clips": 5901, "splits": { "accent": { "other": 0, "": 1 }, "age": { "twenties": 0.41, "": 0.38, "fourties": 0.16, "thirties": 0.02, "teens": 0, "fifties": 0.03 }, "gender": { "male": 0.59, "": 0.38, "female": 0.03 } }, "users": 53, "duration": 28727634, "avgDurationSecs": 4.868, "validDurationSecs": 11800.675, "totalHrs": 7.97, "validHrs": 3.27 }, "tr": { "size": 
614056515, "checksum": "0937fee773a2a477e1936b4b4e4341ad18b1add3693cb052cb67ab7e3e4779f9", "buckets": { "dev": 1638, "invalidated": 1696, "other": 273, "reported": 72, "test": 1639, "train": 1821, "validated": 18544 }, "reportedSentences": 73, "clips": 20513, "splits": { "accent": { "": 0.96, "other": 0.04 }, "age": { "": 0.22, "thirties": 0.23, "twenties": 0.47, "teens": 0.04, "fourties": 0.02, "fifties": 0.01, "sixties": 0 }, "gender": { "": 0.22, "male": 0.72, "female": 0.06 } }, "users": 642, "duration": 80107435, "avgDurationSecs": 3.905, "validDurationSecs": 72418.09, "totalHrs": 22.25, "validHrs": 20.11 }, "tt": { "size": 768000901, "checksum": "c88ecbf35de2cc8b5440d8055f9b81e5df0a271924aca992ad3d2c3b59160a3c", "buckets": { "dev": 2116, "invalidated": 274, "other": 1679, "reported": 0, "test": 4365, "train": 11178, "validated": 25586 }, "reportedSentences": 1, "clips": 27539, "splits": { "accent": { "": 1 }, "age": { "": 0.18, "thirties": 0.76, "twenties": 0.05, "sixties": 0, "fifties": 0.01, "teens": 0, "fourties": 0 }, "gender": { "": 0.18, "male": 0.8, "female": 0.02 } }, "users": 173, "duration": 102466992, "avgDurationSecs": 3.721, "validDurationSecs": 95200.278, "totalHrs": 28.46, "validHrs": 26.44 }, "ky": { "size": 575939697, "checksum": "4839808c363a6913310c2af728a59e7cee6bc43007aceb123f2cfed2a82032bc", "buckets": { "dev": 1512, "invalidated": 924, "other": 7197, "reported": 0, "test": 1476, "train": 1951, "validated": 9181 }, "reportedSentences": 1, "clips": 17302, "splits": { "accent": { "": 0.92, "other": 0.08 }, "age": { "thirties": 0.13, "": 0.06, "fourties": 0.02, "twenties": 0.78, "teens": 0.01 }, "gender": { "male": 0.46, "": 0.11, "female": 0.43, "other": 0 } }, "users": 126, "duration": 80463096, "avgDurationSecs": 4.651, "validDurationSecs": 42696.317, "totalHrs": 22.35, "validHrs": 11.86 }, "ga-IE": { "size": 139327407, "checksum": "8a2927d9b0b748c22cbb4f728e6bba61bd57ef5c72a0e526f84937cc8a4c3e45", "buckets": { "dev": 495, "invalidated": 
407, "other": 1660, "reported": 4, "test": 497, "train": 541, "validated": 3330 }, "reportedSentences": 5, "clips": 5397, "splits": { "accent": { "": 0.43, "connachta": 0.3, "mumhain": 0.14, "other": 0.06, "ulaidh": 0.08 }, "age": { "twenties": 0.39, "": 0.21, "thirties": 0.3, "fourties": 0.07, "sixties": 0, "teens": 0.02, "fifties": 0.01 }, "gender": { "male": 0.62, "": 0.21, "female": 0.18 } }, "users": 96, "duration": 18294074, "avgDurationSecs": 3.39, "validDurationSecs": 11287.617, "totalHrs": 5.08, "validHrs": 3.13 }, "kab": { "size": 14154930305, "checksum": "1bde4ad1df759be9dc5ccc41cdc267f54cf70a5d75b58af34f385a5a0fb61ec5", "buckets": { "dev": 13384, "invalidated": 16885, "other": 21061, "reported": 719, "test": 13384, "train": 68403, "validated": 515359 }, "reportedSentences": 715, "clips": 553305, "splits": { "accent": { "": 0.9, "other": 0.1 }, "age": { "fourties": 0.08, "thirties": 0.26, "": 0.28, "fifties": 0.22, "twenties": 0.13, "eighties": 0, "teens": 0, "sixties": 0.03, "seventies": 0 }, "gender": { "male": 0.59, "": 0.25, "female": 0.16, "other": 0 } }, "users": 886, "duration": 1832133149, "avgDurationSecs": 3.311, "validDurationSecs": 1706484.322, "totalHrs": 508.92, "validHrs": 474.02 }, "ca": { "size": 16095259341, "checksum": "c989ee555cfc27ea46f6f94378659872fe35e7b73af734e188e731a716980865", "buckets": { "dev": 15364, "invalidated": 15681, "other": 45517, "reported": 932, "test": 15364, "train": 200265, "validated": 329226 }, "reportedSentences": 930, "clips": 390424, "splits": { "accent": { "valencian": 0.06, "central": 0.63, "": 0.25, "other": 0, "balearic": 0.01, "northwestern": 0.04, "northern": 0.02 }, "age": { "thirties": 0.14, "fifties": 0.23, "fourties": 0.16, "twenties": 0.08, "": 0.19, "sixties": 0.19, "teens": 0.01, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "male": 0.53, "": 0.19, "female": 0.28, "other": 0 } }, "users": 4871, "duration": 2086740956, "avgDurationSecs": 5.345, "validDurationSecs": 1759649.453, 
"totalHrs": 579.65, "validHrs": 488.79 }, "zh-TW": { "size": 2055790976, "checksum": "8199549605e9d5d211cb0c1e69c15d948d40985c5116dfe02456c92fe1ce309b", "buckets": { "dev": 2627, "invalidated": 3430, "other": 19383, "reported": 34, "test": 2627, "train": 3120, "validated": 59942 }, "reportedSentences": 35, "clips": 82755, "splits": { "accent": { "": 0.52, "taipei_city": 0.16, "other": 0.09, "kaohsiung_city": 0.03, "new_taipei_city": 0.05, "tainan_city": 0.04, "taichung_city": 0.03, "yilan_county": 0.01, "hong_kong": 0, "taoyuan_city": 0.01, "hsinchu_city": 0.01, "chiayi_city": 0, "yunlin_county": 0.03, "nantou_county": 0, "changhua_county": 0.01, "pingtung_county": 0.01, "hsinchu_county": 0, "chiayi_county": 0, "keelung_city": 0, "hualien_county": 0.01, "kinmen_county": 0 }, "age": { "thirties": 0.25, "twenties": 0.37, "teens": 0.03, "": 0.23, "fifties": 0.01, "seventies": 0, "fourties": 0.11, "sixties": 0 }, "gender": { "male": 0.48, "": 0.22, "female": 0.27, "other": 0.03 } }, "users": 1331, "duration": 265863297, "avgDurationSecs": 3.213, "validDurationSecs": 192572.99, "totalHrs": 73.85, "validHrs": 53.49 }, "sl": { "size": 215262574, "checksum": "6a6ad404c7c9c8b269e79a608648035dc58724c033419e9005385267fc38baec", "buckets": { "dev": 527, "invalidated": 85, "other": 2403, "reported": 12, "test": 825, "train": 1937, "validated": 4530 }, "reportedSentences": 13, "clips": 7018, "splits": { "accent": { "other": 0.04, "": 0.96 }, "age": { "twenties": 0.63, "teens": 0.11, "": 0.05, "sixties": 0.03, "fifties": 0.12, "fourties": 0.01, "thirties": 0.05 }, "gender": { "female": 0.17, "male": 0.78, "": 0.05, "other": 0 } }, "users": 69, "duration": 27544553, "avgDurationSecs": 3.925, "validDurationSecs": 17779.542, "totalHrs": 7.65, "validHrs": 4.93 }, "it": { "size": 4691758913, "checksum": "a304332d1dfdb772ae75859441f46a88438305d2f5420680c3c64fba62ea6830", "buckets": { "dev": 11904, "invalidated": 11033, "other": 12793, "reported": 572, "test": 11900, "train": 41823, 
"validated": 84358 }, "reportedSentences": 568, "clips": 108184, "splits": { "accent": { "": 0.94, "other": 0.06 }, "age": { "thirties": 0.15, "twenties": 0.23, "": 0.34, "fifties": 0.11, "fourties": 0.12, "seventies": 0.01, "sixties": 0.03, "teens": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "female": 0.16, "male": 0.49, "": 0.34, "other": 0.01 } }, "users": 5446, "duration": 602609572, "avgDurationSecs": 5.57, "validDurationSecs": 469893.314, "totalHrs": 167.39, "validHrs": 130.52 }, "nl": { "size": 1260682780, "checksum": "be233a9fb069b54678246102b7bef6bdea23203ccc78b85ee7981daa698d5fee", "buckets": { "dev": 2198, "invalidated": 2748, "other": 161, "reported": 130, "test": 2198, "train": 2533, "validated": 38917 }, "reportedSentences": 131, "clips": 41826, "splits": { "accent": { "": 0.31, "netherlands": 0.56, "belgium": 0.12, "other": 0, "germany": 0, "france": 0, "suriname": 0.01 }, "age": { "": 0.23, "twenties": 0.29, "fourties": 0.13, "thirties": 0.16, "teens": 0.05, "fifties": 0.13, "sixties": 0.01 }, "gender": { "": 0.24, "male": 0.62, "female": 0.13, "other": 0 } }, "users": 887, "duration": 163953358, "avgDurationSecs": 3.92, "validDurationSecs": 152550.395, "totalHrs": 45.54, "validHrs": 42.37 }, "cnh": { "size": 161328222, "checksum": "b872ffbe17c90771d855c934de0c00d4ab165d5d1cb495945173fdff06cf8a38", "buckets": { "dev": 755, "invalidated": 433, "other": 2936, "reported": 8, "test": 751, "train": 807, "validated": 2430 }, "reportedSentences": 9, "clips": 5799, "splits": { "accent": { "": 0.85, "other": 0.15 }, "age": { "": 0.51, "twenties": 0.36, "fourties": 0.01, "teens": 0.02, "thirties": 0.08, "fifties": 0.02 }, "gender": { "": 0.51, "male": 0.33, "female": 0.16 } }, "users": 297, "duration": 20663664, "avgDurationSecs": 3.563, "validDurationSecs": 8658.856, "totalHrs": 5.73, "validHrs": 2.4 }, "eo": { "size": 2504082848, "checksum": "ff74ca44382ee15bd5ef00bd62fee30bc4bbc02dd769ef93ea5f1ab32da57717", "buckets": { "dev": 8437, "invalidated": 
3593, "other": 25, "reported": 358, "test": 8453, "train": 17276, "validated": 54643 }, "reportedSentences": 358, "clips": 58261, "splits": { "accent": { "": 0.31, "internacia": 0.69, "other": 0.01 }, "age": { "twenties": 0.35, "thirties": 0.1, "": 0.12, "fourties": 0.3, "fifties": 0.04, "seventies": 0.01, "teens": 0.03, "sixties": 0.02, "eighties": 0.04 }, "gender": { "male": 0.73, "": 0.18, "female": 0.09, "other": 0.01 } }, "users": 505, "duration": 321414430, "avgDurationSecs": 5.517, "validDurationSecs": 301454.639, "totalHrs": 89.28, "validHrs": 83.73 }, "et": { "size": 762109792, "checksum": "bb08ee4b0c784ec33dd9f2acdf02ac0ad2cd427a13591f39f523ca6bef49a1a5", "buckets": { "dev": 2489, "invalidated": 3531, "other": 572, "reported": 102, "test": 2485, "train": 2931, "validated": 10606 }, "reportedSentences": 103, "clips": 14709, "splits": { "accent": { "": 1 }, "age": { "": 0.15, "thirties": 0.07, "twenties": 0.71, "fourties": 0.07, "fifties": 0.01, "seventies": 0, "teens": 0 }, "gender": { "": 0.15, "male": 0.6, "female": 0.25, "other": 0 } }, "users": 538, "duration": 98151487, "avgDurationSecs": 6.673, "validDurationSecs": 70772.634, "totalHrs": 27.26, "validHrs": 19.65 }, "fa": { "size": 8470694566, "checksum": "d45eb20e6aede80c64dddd29f84ef45b4879da3008a7cd1a22e98cbb52521ca3", "buckets": { "dev": 3989, "invalidated": 11301, "other": 15568, "reported": 1432, "test": 3989, "train": 5247, "validated": 244736 }, "reportedSentences": 1428, "clips": 271605, "splits": { "accent": { "": 1 }, "age": { "": 0.21, "twenties": 0.33, "thirties": 0.4, "fifties": 0.02, "fourties": 0.03, "teens": 0.01, "sixties": 0 }, "gender": { "": 0.18, "male": 0.76, "female": 0.07, "other": 0 } }, "users": 3412, "duration": 1100290166, "avgDurationSecs": 4.051, "validDurationSecs": 991442.036, "totalHrs": 305.63, "validHrs": 275.4 }, "eu": { "size": 3621558052, "checksum": "065ec28fb1214e489ff0d1050003c13d5871f3cf47fc4658cdd7a1181d0c76df", "buckets": { "dev": 4912, "invalidated": 5353, 
"other": 23559, "reported": 37, "test": 4912, "train": 6973, "validated": 61934 }, "reportedSentences": 38, "clips": 90846, "splits": { "accent": { "mendebalekoa": 0.31, "erdialdekoa_nafarra": 0.3, "": 0.39, "nafarlapurtarra_zuberoatarra": 0.01 }, "age": { "fourties": 0.13, "thirties": 0.07, "fifties": 0.14, "twenties": 0.39, "": 0.22, "teens": 0.03, "sixties": 0.01, "seventies": 0 }, "gender": { "male": 0.5, "female": 0.25, "": 0.23, "other": 0.02 } }, "users": 1002, "duration": 467076631, "avgDurationSecs": 5.141, "validDurationSecs": 318428.154, "totalHrs": 129.74, "validHrs": 88.45 }, "es": { "size": 14601842052, "checksum": "cce32c2c95090168d02053a798e8265e74c4f93211625bb3b30fe446472e0140", "buckets": { "dev": 14860, "invalidated": 38299, "other": 130816, "reported": 1021, "test": 14860, "train": 138878, "validated": 212392 }, "reportedSentences": 1013, "clips": 381507, "splits": { "accent": { "nortepeninsular": 0.1, "": 0.53, "rioplatense": 0.04, "centrosurpeninsular": 0.03, "americacentral": 0.02, "surpeninsular": 0.12, "canario": 0, "andino": 0.05, "mexicano": 0.06, "chileno": 0.02, "caribe": 0.03, "filipinas": 0 }, "age": { "thirties": 0.1, "": 0.34, "fifties": 0.09, "twenties": 0.28, "teens": 0.03, "fourties": 0.06, "sixties": 0.11, "eighties": 0, "seventies": 0, "nineties": 0 }, "gender": { "male": 0.46, "": 0.34, "other": 0, "female": 0.19 } }, "users": 18906, "duration": 1878616534, "avgDurationSecs": 4.924, "validDurationSecs": 1045860.555, "totalHrs": 521.83, "validHrs": 290.51 }, "zh-CN": { "size": 1982005342, "checksum": "a78852ea07c7b101db2589b0ac541b2530536d5f9ca57d6d7839eb038a9a23b9", "buckets": { "dev": 8249, "invalidated": 4930, "other": 7368, "reported": 245, "test": 8273, "train": 16499, "validated": 33099 }, "reportedSentences": 245, "clips": 45397, "splits": { "accent": { "110000": 0.05, "120000": 0, "130000": 0.02, "140000": 0.01, "150000": 0, "210000": 0.01, "220000": 0, "230000": 0.04, "310000": 0.05, "320000": 0.04, "330000": 0.04, 
"340000": 0.01, "350000": 0.01, "360000": 0.02, "370000": 0.04, "410000": 0.03, "420000": 0.02, "430000": 0.01, "440000": 0.03, "450000": 0.01, "460000": 0.01, "500000": 0.02, "510000": 0.04, "520000": 0, "530000": 0, "610000": 0.01, "620000": 0, "640000": 0, "650000": 0, "710000": 0.01, "810000": 0, "": 0.46 }, "age": { "": 0.39, "teens": 0.08, "twenties": 0.36, "thirties": 0.13, "fourties": 0.03, "nineties": 0, "fifties": 0, "sixties": 0 }, "gender": { "": 0.39, "male": 0.51, "female": 0.1, "other": 0 } }, "users": 3333, "duration": 254593542, "avgDurationSecs": 5.608, "validDurationSecs": 185624.417, "totalHrs": 70.72, "validHrs": 51.56 }, "mn": { "size": 463048695, "checksum": "d60286528876ebace1bd822211930e8af7c13f1078d6d16dfa6699add3945b93", "duration": 59173272, "buckets": { "dev": 1690, "invalidated": 632, "other": 2993, "reported": 1, "test": 1786, "train": 2179, "validated": 7242 }, "reportedSentences": 2, "clips": 10867, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.21, "": 0.25, "twenties": 0.44, "fourties": 0.01, "teens": 0.02, "nineties": 0.07 }, "gender": { "male": 0.33, "": 0.25, "female": 0.35, "other": 0.07 } }, "users": 349, "avgDurationSecs": 5.445, "validDurationSecs": 39434.327, "totalHrs": 16.43, "validHrs": 10.95 }, "sah": { "size": 179496518, "checksum": "4f74994f14676920fc1993ab90c3fe328acfdaf95e89453c92e06a90f3621493", "duration": 23414976, "buckets": { "dev": 398, "invalidated": 62, "other": 1305, "test": 709, "train": 1440, "validated": 2549 }, "clips": 3916, "splits": { "accent": { "": 1 }, "age": { "": 0.36, "twenties": 0.02, "fourties": 0.07, "thirties": 0.44, "teens": 0.11, "fifties": 0 }, "gender": { "": 0.36, "male": 0.54, "female": 0.1 } }, "users": 40, "avgDurationSecs": 5.979, "validDurationSecs": 15241.26, "totalHrs": 6.5, "validHrs": 4.23 }, "dv": { "size": 315891381, "checksum": "8c9e397545ee87a7c822f36b69e7b92f586f42225e00928da6bda81e1308d6fd", "duration": 41061552, "buckets": { "dev": 1678, "invalidated": 503, 
"other": 926, "reported": 1, "test": 1902, "train": 2483, "validated": 6086 }, "reportedSentences": 2, "clips": 7515, "splits": { "accent": { "": 1 }, "age": { "": 0.14, "twenties": 0.28, "thirties": 0.52, "fourties": 0.02, "teens": 0.04 }, "gender": { "": 0.14, "male": 0.25, "female": 0.61 } }, "users": 123, "avgDurationSecs": 5.464, "validDurationSecs": 33253.574, "totalHrs": 11.4, "validHrs": 9.23 }, "rw": { "size": 16543746497, "checksum": "cf3b4d77fa9902ebdaa6f3af8ed9c4b6ea945f1c766fb7b130b231e8cbd357cf", "duration": 2108407752, "buckets": { "dev": 14124, "invalidated": 182266, "other": 501, "reported": 490, "test": 14251, "train": 105792, "validated": 217828 }, "reportedSentences": 491, "clips": 400595, "splits": { "accent": { "": 1 }, "age": { "": 0.05, "twenties": 0.54, "thirties": 0.03, "teens": 0.38, "fourties": 0 }, "gender": { "": 0.05, "male": 0.85, "female": 0.1, "other": 0 } }, "users": 339, "avgDurationSecs": 5.263, "validDurationSecs": 1146470.235, "totalHrs": 585.66, "validHrs": 318.46 }, "sv-SE": { "size": 281207422, "checksum": "2afa036a727f005d81844129792933c61a8629368a52b5a8e74f5ba874780716", "buckets": { "dev": 1773, "invalidated": 436, "other": 5, "reported": 52, "test": 1784, "train": 2104, "validated": 11529 }, "reportedSentences": 53, "clips": 11970, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.31, "": 0.18, "teens": 0.09, "fifties": 0.12, "twenties": 0.23, "fourties": 0.07, "sixties": 0 }, "gender": { "male": 0.72, "": 0.18, "female": 0.09, "other": 0.01 } }, "users": 176, "duration": 37547448, "avgDurationSecs": 3.137, "validDurationSecs": 36164.121, "totalHrs": 10.42, "validHrs": 10.04 }, "ru": { "size": 3256217166, "checksum": "bc7ac96f8af48ba660171fc5db2ab8369b5349ba308fdbde67fd85d054c1c6ca", "duration": 419475986, "buckets": { "dev": 7242, "invalidated": 2778, "other": 4278, "reported": 120, "test": 7307, "train": 13189, "validated": 69510 }, "reportedSentences": 114, "clips": 76566, "splits": { "accent": { "": 1 }, 
"age": { "twenties": 0.48, "teens": 0.09, "": 0.08, "fourties": 0.17, "thirties": 0.12, "fifties": 0.05, "sixties": 0 }, "gender": { "male": 0.73, "": 0.09, "other": 0, "female": 0.18 } }, "users": 927, "avgDurationSecs": 5.479, "validDurationSecs": 380818.846, "totalHrs": 116.52, "validHrs": 105.78 }, "id": { "size": 422002066, "checksum": "f5368613182645f5164e0ddb5b504971bee6193173e63c9b9df9b377c5db9243", "duration": 54218808, "buckets": { "dev": 1572, "invalidated": 393, "other": 6005, "reported": 196, "test": 1643, "train": 1925, "validated": 7632 }, "reportedSentences": 197, "clips": 14030, "splits": { "accent": { "": 1 }, "age": { "": 0.05, "twenties": 0.41, "thirties": 0.08, "teens": 0.42, "fifties": 0, "fourties": 0.04 }, "gender": { "": 0.05, "male": 0.61, "female": 0.21, "other": 0.13 } }, "users": 141, "avgDurationSecs": 3.864, "validDurationSecs": 29493.795, "totalHrs": 15.06, "validHrs": 8.19 }, "ar": { "size": 533550548, "checksum": "490480195e444806882b10b62d1515b081dd79f15b02da3e5f037bf1effdf10e", "duration": 70329391, "buckets": { "dev": 1918, "invalidated": 1606, "other": 5714, "reported": 162, "test": 1911, "train": 2207, "validated": 11778 }, "reportedSentences": 163, "clips": 19098, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.24, "": 0.24, "twenties": 0.46, "fourties": 0.01, "teens": 0.05, "fifties": 0 }, "gender": { "female": 0.17, "": 0.23, "male": 0.6, "other": 0 } }, "users": 371, "avgDurationSecs": 3.683, "validDurationSecs": 43373.105, "totalHrs": 19.53, "validHrs": 12.04 }, "ta": { "size": 581773571, "checksum": "d87eefb469be2bbd06ab592b858ad48e78bb3af103467246dc4ba7021c3e4d24", "buckets": { "dev": 1668, "invalidated": 538, "other": 5241, "reported": 234, "test": 1638, "train": 1896, "validated": 12163 }, "reportedSentences": 234, "clips": 17942, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.48, "thirties": 0.42, "": 0.08, "fourties": 0.01, "seventies": 0, "fifties": 0.01, "teens": 0 }, "gender": { "male": 0.76, "": 
0.09, "other": 0.01, "female": 0.14 } }, "users": 190, "duration": 74836349, "avgDurationSecs": 4.171, "validDurationSecs": 50732.054, "totalHrs": 20.78, "validHrs": 14.09 }, "ia": { "size": 205201540, "checksum": "8636c959e471c8a28f18617da82798ff4ed63a2ab64072e30bdcc642c729d976", "duration": 26276745, "buckets": { "dev": 1520, "invalidated": 166, "other": 1260, "reported": 231, "test": 791, "train": 2741, "validated": 5053 }, "reportedSentences": 227, "clips": 6479, "splits": { "accent": { "": 1 }, "age": { "seventies": 0.26, "fourties": 0.6, "": 0.06, "twenties": 0.05, "thirties": 0.02, "teens": 0.01, "fifties": 0.01 }, "gender": { "male": 0.93, "": 0.06, "female": 0.01 } }, "users": 27, "avgDurationSecs": 4.056, "validDurationSecs": 20493.347, "totalHrs": 7.29, "validHrs": 5.69 }, "pt": { "size": 1454367644, "checksum": "c78b540c4b82acda9288a6c0bed664cb02ad5af4ebf4c837eaf8da3a22f700be", "duration": 194172081, "buckets": { "dev": 4329, "invalidated": 1622, "other": 2699, "reported": 1641, "test": 4334, "train": 5923, "validated": 39581 }, "reportedSentences": 1638, "clips": 43902, "splits": { "accent": { "": 1 }, "age": { "": 0.1, "twenties": 0.37, "teens": 0.03, "thirties": 0.35, "fourties": 0.11, "sixties": 0, "fifties": 0.04, "seventies": 0 }, "gender": { "": 0.1, "male": 0.85, "female": 0.03, "other": 0.02 } }, "users": 744, "avgDurationSecs": 4.423, "validDurationSecs": 175060.934, "totalHrs": 53.93, "validHrs": 48.62 }, "lv": { "size": 205706961, "checksum": "209842b1863bdca8c4a7aa3f7eaf0aac3bcb85bf3945e63fd81d3c2c163d5cea", "buckets": { "dev": 1931, "invalidated": 141, "other": 1567, "reported": 20, "test": 1859, "train": 2551, "validated": 6349 }, "reportedSentences": 21, "clips": 8057, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.49, "fourties": 0.03, "": 0.16, "twenties": 0.29, "teens": 0.03, "fifties": 0 }, "gender": { "male": 0.71, "female": 0.13, "": 0.16 } }, "users": 94, "duration": 27322937, "avgDurationSecs": 3.391, 
"validDurationSecs": 21530.759, "totalHrs": 7.58, "validHrs": 5.98 }, "ja": { "size": 277993266, "checksum": "aea95bc7d74c30e694b20815753920af4939539b79a1bd219f439b7da0a1f617", "duration": 35700744, "buckets": { "dev": 1219, "invalidated": 1394, "other": 725, "reported": 88, "test": 1334, "train": 1642, "validated": 6158 }, "reportedSentences": 89, "clips": 8277, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.6, "": 0.21, "teens": 0.04, "fifties": 0.03, "thirties": 0.05, "fourties": 0.06, "sixties": 0 }, "gender": { "male": 0.76, "": 0.14, "female": 0.1, "other": 0 } }, "users": 194, "avgDurationSecs": 4.313, "validDurationSecs": 26560.974, "totalHrs": 9.91, "validHrs": 7.37 }, "vot": { "size": 7792479, "checksum": "da5ab7020eae6057d8b4f36c64118d4a495777fe28a7258e5ae1448233c28493", "duration": 1008748, "buckets": { "dev": 0, "invalidated": 0, "other": 418, "test": 0, "train": 3, "validated": 3 }, "clips": 421, "splits": { "accent": { "": 1 }, "age": { "": 1 }, "gender": { "": 1 } }, "users": 3, "avgDurationSecs": 2.396, "validDurationSecs": 7.188, "totalHrs": 0.28, "validHrs": 0 }, "ab": { "size": 35501884, "checksum": "c0020d04b06da0135bd69ae29870312f822df76712686b88dd0b951909ce0213", "duration": 4511760, "buckets": { "dev": 0, "invalidated": 6, "other": 647, "reported": 37, "test": 7, "train": 19, "validated": 26 }, "reportedSentences": 38, "clips": 679, "splits": { "accent": { "": 1 }, "age": { "seventies": 0.66, "thirties": 0.33, "": 0 }, "gender": { "male": 0.66, "female": 0.33, "": 0 } }, "users": 7, "avgDurationSecs": 6.645, "validDurationSecs": 172.763, "totalHrs": 1.25, "validHrs": 0.04 }, "zh-HK": { "size": 1662923573, "checksum": "14359d47b37b8ae18e07492bdd0a723067f0d15379e1631ad3d4e9900abc112c", "duration": 216898152, "buckets": { "dev": 2805, "invalidated": 2535, "other": 14805, "reported": 197, "test": 2805, "train": 3379, "validated": 27643 }, "reportedSentences": 189, "clips": 44983, "splits": { "accent": { "": 1 }, "age": { "fourties": 
0.14, "thirties": 0.1, "": 0.39, "teens": 0.02, "fifties": 0.04, "seventies": 0.01, "sixties": 0, "twenties": 0.31 }, "gender": { "male": 0.35, "": 0.39, "female": 0.24, "other": 0.02 } }, "users": 1576, "avgDurationSecs": 4.822, "validDurationSecs": 133288.478, "totalHrs": 60.24, "validHrs": 37.02 }, "rm-sursilv": { "size": 275942783, "checksum": "1535575b8dbba6fdfec6752cd568234d9b8731ea678ec8007c940bd6adf7b9d8", "buckets": { "dev": 1200, "invalidated": 637, "other": 2132, "reported": 9, "test": 1175, "train": 1380, "validated": 3755 }, "reportedSentences": 10, "clips": 6524, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.02, "twenties": 0.11, "": 0.64, "teens": 0.06, "fourties": 0.17 }, "gender": { "male": 0.15, "female": 0.2, "": 0.64, "other": 0 } }, "users": 78, "duration": 35885741, "avgDurationSecs": 5.501, "validDurationSecs": 20654.653, "totalHrs": 9.96, "validHrs": 5.73 }, "hsb": { "size": 76664581, "checksum": "8960590eefc82851ddb92a1c1d0d0f178825131a15f1436c876d0b1fa2f0f8a2", "duration": 9760920, "buckets": { "dev": 172, "invalidated": 227, "other": 15, "reported": 56, "test": 379, "train": 808, "validated": 1359 }, "reportedSentences": 57, "clips": 1601, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.57, "": 0.17, "thirties": 0.11, "sixties": 0, "seventies": 0.03, "twenties": 0.11 }, "gender": { "male": 0.83, "": 0.17, "other": 0 } }, "users": 18, "avgDurationSecs": 6.097, "validDurationSecs": 8285.503, "totalHrs": 2.71, "validHrs": 2.3 }, "ro": { "size": 212099990, "checksum": "c20fa58f15e2b0d34428bd83990ed6823bf892cf1d5a29e0fe649af29b1ae4ae", "buckets": { "dev": 396, "invalidated": 434, "other": 1943, "reported": 56, "test": 1372, "train": 2785, "validated": 4557 }, "reportedSentences": 57, "clips": 6934, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.58, "teens": 0.04, "": 0.05, "fourties": 0.03, "sixties": 0.02, "twenties": 0.28, "fifties": 0 }, "gender": { "male": 0.72, "": 0.05, "female": 0.23 } }, "users": 79, 
"duration": 27449616, "avgDurationSecs": 3.959, "validDurationSecs": 18039.789, "totalHrs": 7.62, "validHrs": 5.01 }, "fy-NL": { "size": 728080736, "checksum": "c858a7675d78c7849aecf7081f0a1fe83fa9bde336bee9ca151ce070488730b2", "duration": 99691219, "buckets": { "dev": 2089, "invalidated": 729, "other": 12278, "reported": 268, "test": 2095, "train": 2690, "validated": 6881 }, "reportedSentences": 269, "clips": 19888, "splits": { "accent": { "": 1 }, "age": { "": 0.54, "fifties": 0.1, "thirties": 0.07, "twenties": 0.04, "fourties": 0.08, "sixties": 0.18, "seventies": 0 }, "gender": { "": 0.54, "male": 0.11, "female": 0.36 } }, "users": 222, "avgDurationSecs": 5.013, "validDurationSecs": 34491.919, "totalHrs": 27.69, "validHrs": 9.58 }, "cs": { "size": 811702344, "checksum": "0ffca0531680cf3965c341df4414b489e44513140ad15fc4b51f3a071ad06e7d", "duration": 104759013, "buckets": { "dev": 2584, "invalidated": 539, "other": 1542, "reported": 420, "test": 2574, "train": 3085, "validated": 24019 }, "reportedSentences": 421, "clips": 26100, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.14, "": 0.38, "thirties": 0.13, "teens": 0.01, "twenties": 0.32, "fifties": 0.01, "sixties": 0.01 }, "gender": { "male": 0.6, "": 0.37, "female": 0.02 } }, "users": 252, "avgDurationSecs": 4.014, "validDurationSecs": 96406.388, "totalHrs": 29.09, "validHrs": 26.77 }, "el": { "size": 367902685, "checksum": "09a16643563b5befb7f14cecf73a92fd51e72db3ce35820838d0bfde4e70b707", "duration": 47321468, "buckets": { "dev": 1212, "invalidated": 165, "other": 5401, "reported": 16, "test": 1625, "train": 2331, "validated": 5829 }, "reportedSentences": 17, "clips": 11395, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.66, "fourties": 0.19, "": 0.12, "twenties": 0.02, "fifties": 0.01, "teens": 0 }, "gender": { "male": 0.87, "": 0.12, "other": 0.01, "female": 0 } }, "users": 95, "avgDurationSecs": 4.153, "validDurationSecs": 24206.831, "totalHrs": 13.14, "validHrs": 6.72 }, "rm-vallader": { 
"size": 92093154, "checksum": "1842b18309b7f30c89ddb3e8e38a496605ec45cebe846d5ced3542c9593fbffd", "duration": 11824142, "buckets": { "dev": 323, "invalidated": 350, "other": 608, "reported": 21, "test": 319, "train": 494, "validated": 1142 }, "reportedSentences": 20, "clips": 2100, "splits": { "accent": { "": 1 }, "age": { "": 0.39, "fourties": 0.37, "twenties": 0.17, "thirties": 0.06, "fifties": 0 }, "gender": { "": 0.39, "male": 0.5, "female": 0.1, "other": 0.01 } }, "users": 34, "avgDurationSecs": 5.631, "validDurationSecs": 6430.081, "totalHrs": 3.28, "validHrs": 1.78 }, "pl": { "size": 3262705972, "checksum": "5d97c69a6289fee4d94dc671adbc1b4cfd219d0d0334dde1999cb52970df92b6", "duration": 431016122, "buckets": { "dev": 4458, "invalidated": 4475, "other": 8601, "reported": 415, "test": 4458, "train": 6090, "validated": 87006 }, "reportedSentences": 415, "clips": 100082, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.26, "": 0.23, "teens": 0.01, "thirties": 0.37, "fourties": 0.12, "fifties": 0, "nineties": 0.01, "sixties": 0 }, "gender": { "male": 0.61, "": 0.23, "female": 0.16, "other": 0 } }, "users": 2464, "avgDurationSecs": 4.307, "validDurationSecs": 374702.631, "totalHrs": 119.72, "validHrs": 104.08 }, "as": { "size": 18951900, "checksum": "a8e448a273be7e1ee9a56eb7867b28a77d23f50abd54087fc5880b7d11e21710", "duration": 2411664, "buckets": { "dev": 124, "invalidated": 23, "other": 0, "reported": 1, "test": 41, "train": 270, "validated": 435 }, "reportedSentences": 2, "clips": 458, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.34, "": 0.64, "thirties": 0.02 }, "gender": { "male": 0.36, "": 0.64 } }, "users": 11, "avgDurationSecs": 5.266, "validDurationSecs": 2290.554, "totalHrs": 0.66, "validHrs": 0.63 }, "uk": { "size": 727430378, "checksum": "542b4d93097843fe86493992b512973ad0636841bb030c1c8c59ba816f98dc1f", "duration": 93583387, "buckets": { "dev": 1669, "invalidated": 603, "other": 2373, "reported": 87, "test": 1671, "train": 1862, 
"validated": 16559 }, "reportedSentences": 88, "clips": 19535, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.22, "teens": 0.09, "": 0.13, "fourties": 0.26, "thirties": 0.3, "fifties": 0.01 }, "gender": { "male": 0.7, "female": 0.16, "": 0.14 } }, "users": 235, "avgDurationSecs": 4.791, "validDurationSecs": 79326.711, "totalHrs": 25.99, "validHrs": 22.03 }, "mt": { "size": 397538153, "checksum": "812313d2602b22b4b57df796dba2a38a0e753ba8ddcb86c1b54ea4648202f1b7", "duration": 52279224, "buckets": { "dev": 1408, "invalidated": 309, "other": 5305, "reported": 3, "test": 1583, "train": 1954, "validated": 5463 }, "reportedSentences": 4, "clips": 11077, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.17, "": 0.22, "fourties": 0.18, "thirties": 0.09, "teens": 0.03, "fifties": 0.3, "sixties": 0.01 }, "gender": { "male": 0.26, "": 0.22, "female": 0.51, "other": 0.01 } }, "users": 155, "avgDurationSecs": 4.72, "validDurationSecs": 25783.281, "totalHrs": 14.52, "validHrs": 7.16 }, "ka": { "size": 92193989, "checksum": "224ddc35bec03247e75d6b21aa4117f802288de512746e2c85b19b6e4fc33b02", "duration": 12460944, "buckets": { "dev": 621, "invalidated": 110, "other": 0, "reported": 2, "test": 653, "train": 775, "validated": 2066 }, "reportedSentences": 3, "clips": 2176, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.61, "thirties": 0.11, "": 0.26, "fourties": 0.03 }, "gender": { "male": 0.37, "female": 0.38, "": 0.25 } }, "users": 35, "avgDurationSecs": 5.727, "validDurationSecs": 11831.025, "totalHrs": 3.46, "validHrs": 3.28 }, "pa-IN": { "size": 6413240, "checksum": "1714d955d47fff0eee6319f7e7e4314d9723a4d4f1ee06b340df93e0c6cf1839", "duration": 826512, "buckets": { "dev": 12, "invalidated": 12, "other": 122, "reported": 55, "test": 12, "train": 15, "validated": 39 }, "reportedSentences": 56, "clips": 173, "splits": { "accent": { "": 1 }, "age": { "": 0.2, "fourties": 0.62, "fifties": 0.14, "thirties": 0.03 }, "gender": { "": 0.2, "male": 0.8 } }, "users": 
7, "avgDurationSecs": 4.778, "validDurationSecs": 186.324, "totalHrs": 0.22, "validHrs": 0.05 }, "or": { "size": 161416298, "checksum": "e9ff473d5bfa5e23b0b53989bb5b27ff58fefc3f9e2510ea24f427dda55a6db3", "duration": 20706672, "buckets": { "dev": 127, "invalidated": 60, "other": 3466, "reported": 4, "test": 84, "train": 319, "validated": 530 }, "reportedSentences": 5, "clips": 4056, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.11, "": 0.04, "thirties": 0.85, "fourties": 0 }, "gender": { "male": 0.96, "": 0.04, "female": 0 } }, "users": 30, "avgDurationSecs": 5.105, "validDurationSecs": 2705.753, "totalHrs": 5.75, "validHrs": 0.75 }, "vi": { "size": 7155714, "checksum": "100113373ba344210c526788169db04dffa117d6d10881122310c1321982ea1e", "duration": 919680, "buckets": { "dev": 25, "invalidated": 23, "other": 25, "reported": 0, "test": 57, "train": 90, "validated": 172 }, "reportedSentences": 1, "clips": 220, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.09, "twenties": 0.74, "": 0.15, "teens": 0.02 }, "gender": { "male": 0.69, "": 0.15, "female": 0.16 } }, "users": 14, "avgDurationSecs": 4.18, "validDurationSecs": 719.023, "totalHrs": 0.25, "validHrs": 0.19 } }, "totalDuration": 26015106972, "totalValidDurationSecs": 20419110, "totalHrs": 7226, "totalValidHrs": 5671, "totalClips": 5540535 } ================================================ FILE: datasets/scripted-speech/cv-corpus-5.1-singleword.json ================================================ { "date": "2020-06-22", "locales": { "es": { "clips": 68817, "splits": { "accent": { "surpeninsular": 0.01, "": 0.8, "americacentral": 0.02, "centrosurpeninsular": 0.02, "mexicano": 0.05, "caribe": 0.02, "nortepeninsular": 0.02, "andino": 0.04, "rioplatense": 0.02, "chileno": 0.01, "canario": 0, "filipinas": 0 }, "age": { "sixties": 0.01, "": 0.74, "thirties": 0.05, "fifties": 0.02, "fourties": 0.04, "twenties": 0.11, "teens": 0.04, "seventies": 0, "nineties": 0, "eighties": 0 }, "gender": { "male": 0.17, 
"": 0.74, "female": 0.09, "other": 0 } }, "users": 8968, "duration": 200607380, "buckets": { "dev": 6238, "invalidated": 6461, "other": 40337, "test": 6238, "train": 9979, "validated": 22455 }, "avgDurationSecs": 2.915, "validDurationSecs": 65458.226, "totalHrs": 55.72, "validHrs": 18.18 }, "pl": { "clips": 899, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.22, "thirties": 0.16, "fourties": 0.03, "teens": 0.09, "": 0.48, "sixties": 0.01 }, "gender": { "female": 0.12, "male": 0.38, "": 0.48, "other": 0.01 } }, "users": 86, "duration": 2129666, "buckets": { "dev": 294, "invalidated": 30, "other": 0, "test": 296, "train": 308, "validated": 898 }, "avgDurationSecs": 2.369, "validDurationSecs": 2127.297, "totalHrs": 0.59, "validHrs": 0.59 }, "ca": { "clips": 2279, "splits": { "accent": { "valencian": 0.12, "": 0.44, "central": 0.4, "northern": 0.01, "northwestern": 0.03, "balearic": 0.01 }, "age": { "fifties": 0.1, "": 0.39, "sixties": 0.04, "thirties": 0.14, "fourties": 0.13, "twenties": 0.13, "teens": 0.05, "seventies": 0.02 }, "gender": { "male": 0.45, "": 0.39, "female": 0.15, "other": 0.01 } }, "users": 213, "duration": 5963630, "buckets": { "dev": 733, "invalidated": 122, "other": 0, "test": 733, "train": 784, "validated": 2250 }, "avgDurationSecs": 2.617, "validDurationSecs": 5887.744, "totalHrs": 1.65, "validHrs": 1.63 }, "tr": { "clips": 381, "splits": { "accent": { "": 1 }, "age": { "": 0.46, "fourties": 0.06, "twenties": 0.31, "teens": 0.04, "thirties": 0.14 }, "gender": { "": 0.46, "male": 0.46, "female": 0.07 } }, "users": 46, "duration": 962552, "buckets": { "dev": 114, "invalidated": 36, "other": 21, "test": 112, "train": 125, "validated": 351 }, "avgDurationSecs": 2.526, "validDurationSecs": 886.761, "totalHrs": 0.26, "validHrs": 0.24 }, "ru": { "clips": 1513, "splits": { "accent": { "": 1 }, "age": { "": 0.42, "teens": 0.15, "thirties": 0.15, "twenties": 0.25, "fourties": 0.04 }, "gender": { "": 0.44, "male": 0.5, "female": 0.05, "other": 0.01 
} }, "users": 149, "duration": 3548113, "buckets": { "dev": 478, "invalidated": 80, "other": 0, "test": 480, "train": 518, "validated": 1476 }, "avgDurationSecs": 2.345, "validDurationSecs": 3461.345, "totalHrs": 0.98, "validHrs": 0.96 }, "pt": { "clips": 983, "splits": { "accent": { "": 1 }, "age": { "": 0.39, "fourties": 0.12, "twenties": 0.28, "thirties": 0.17, "fifties": 0.02, "sixties": 0.01, "teens": 0.01 }, "gender": { "": 0.39, "male": 0.54, "female": 0.05, "other": 0.01 } }, "users": 115, "duration": 2792491, "buckets": { "dev": 312, "invalidated": 74, "other": 31, "test": 318, "train": 344, "validated": 974 }, "avgDurationSecs": 2.841, "validDurationSecs": 2766.924, "totalHrs": 0.77, "validHrs": 0.76 }, "nl": { "clips": 457, "splits": { "accent": { "suriname": 0.03, "netherlands": 0.39, "": 0.43, "belgium": 0.15 }, "age": { "": 0.4, "twenties": 0.37, "sixties": 0, "thirties": 0.06, "fifties": 0.06, "fourties": 0.1 }, "gender": { "male": 0.54, "female": 0.06, "": 0.4 } }, "users": 47, "duration": 1118014, "buckets": { "dev": 135, "invalidated": 15, "other": 0, "test": 144, "train": 168, "validated": 447 }, "avgDurationSecs": 2.446, "validDurationSecs": 1093.55, "totalHrs": 0.31, "validHrs": 0.3 }, "de": { "clips": 13133, "splits": { "accent": { "germany": 0.25, "austria": 0.02, "": 0.71, "france": 0, "italy": 0, "switzerland": 0.01, "united_states": 0, "russia": 0, "united_kingdom": 0 }, "age": { "fourties": 0.04, "fifties": 0.06, "twenties": 0.12, "thirties": 0.07, "": 0.63, "seventies": 0.01, "teens": 0.06, "sixties": 0.02, "eighties": 0 }, "gender": { "male": 0.3, "female": 0.06, "": 0.63, "other": 0 } }, "users": 1457, "duration": 35864336, "buckets": { "dev": 3719, "invalidated": 1310, "other": 1, "test": 3717, "train": 4818, "validated": 12254 }, "avgDurationSecs": 2.731, "validDurationSecs": 33463.913, "totalHrs": 9.96, "validHrs": 9.29 }, "ta": { "clips": 221, "splits": { "accent": { "": 1 }, "age": { "": 0.44, "fifties": 0.07, "fourties": 0, 
"twenties": 0.26, "thirties": 0.18, "teens": 0.05 }, "gender": { "": 0.49, "male": 0.35, "female": 0.16 } }, "users": 32, "duration": 738774, "buckets": { "dev": 62, "invalidated": 37, "other": 5, "test": 66, "train": 77, "validated": 205 }, "avgDurationSecs": 3.343, "validDurationSecs": 685.288, "totalHrs": 0.2, "validHrs": 0.19 }, "ar": { "clips": 332, "splits": { "accent": { "": 1 }, "age": { "": 0.4, "thirties": 0.1, "twenties": 0.33, "teens": 0.08, "fifties": 0.04, "fourties": 0.04 }, "gender": { "": 0.39, "male": 0.54, "female": 0.07 } }, "users": 44, "duration": 884372, "buckets": { "dev": 94, "invalidated": 24, "other": 0, "test": 96, "train": 126, "validated": 316 }, "avgDurationSecs": 2.664, "validDurationSecs": 841.752, "totalHrs": 0.24, "validHrs": 0.23 }, "en": { "clips": 40199, "splits": { "accent": { "": 0.76, "us": 0.11, "england": 0.04, "indian": 0.04, "australia": 0.01, "canada": 0.01, "newzealand": 0, "scotland": 0, "hongkong": 0.01, "ireland": 0, "singapore": 0, "philippines": 0, "malaysia": 0, "african": 0, "wales": 0 }, "age": { "": 0.61, "twenties": 0.19, "teens": 0.05, "fourties": 0.04, "thirties": 0.07, "sixties": 0.01, "fifties": 0.02, "seventies": 0.01, "nineties": 0, "eighties": 0 }, "gender": { "": 0.62, "female": 0.09, "male": 0.29, "other": 0.01 } }, "users": 4990, "duration": 117312884, "buckets": { "dev": 7201, "invalidated": 3556, "other": 12452, "test": 7205, "train": 12717, "validated": 27230 }, "avgDurationSecs": 2.918, "validDurationSecs": 79465.405, "totalHrs": 32.58, "validHrs": 22.07 }, "fr": { "clips": 19603, "splits": { "accent": { "france": 0.2, "": 0.77, "germany": 0, "canada": 0.01, "united_kingdom": 0, "belgium": 0.01, "switzerland": 0.01, "reunion": 0, "senegal": 0, "martinique": 0, "congo_brazzaville": 0, "cameroon": 0, "united_states": 0, "madagascar": 0, "romania": 0, "benin": 0 }, "age": { "sixties": 0.01, "": 0.72, "twenties": 0.09, "thirties": 0.05, "fourties": 0.05, "teens": 0.05, "fifties": 0.03, "seventies": 
0 }, "gender": { "male": 0.22, "": 0.72, "female": 0.06, "other": 0.01 } }, "users": 2367, "duration": 54892201, "buckets": { "dev": 4943, "invalidated": 2191, "other": 1484, "test": 4948, "train": 7051, "validated": 16942 }, "avgDurationSecs": 2.8, "validDurationSecs": 47440.885, "totalHrs": 15.24, "validHrs": 13.17 }, "cy": { "clips": 232, "splits": { "accent": { "united_kingdom": 0.63, "": 0.37 }, "age": { "twenties": 0.22, "thirties": 0.13, "": 0.3, "teens": 0.09, "fifties": 0.13, "fourties": 0.09, "sixties": 0.03, "seventies": 0.01 }, "gender": { "female": 0.39, "male": 0.29, "": 0.32 } }, "users": 28, "duration": 699792, "buckets": { "dev": 80, "invalidated": 9, "other": 5, "test": 79, "train": 90, "validated": 249 }, "avgDurationSecs": 3.016, "validDurationSecs": 751.07, "totalHrs": 0.19, "validHrs": 0.2 }, "eo": { "clips": 792, "splits": { "accent": { "": 0.66, "internacia": 0.34 }, "age": { "": 0.45, "fourties": 0.04, "twenties": 0.27, "teens": 0.05, "eighties": 0.02, "sixties": 0.03, "thirties": 0.14, "fifties": 0.02 }, "gender": { "": 0.47, "male": 0.46, "other": 0.02, "female": 0.06 } }, "users": 75, "duration": 1935968, "buckets": { "dev": 246, "invalidated": 32, "other": 0, "test": 248, "train": 276, "validated": 770 }, "avgDurationSecs": 2.444, "validDurationSecs": 1882.191, "totalHrs": 0.53, "validHrs": 0.52 }, "ja": { "clips": 524, "splits": { "accent": { "": 1 }, "age": { "teens": 0.04, "twenties": 0.53, "": 0.29, "fifties": 0.02, "fourties": 0.05, "thirties": 0.07 }, "gender": { "male": 0.43, "female": 0.28, "": 0.27, "other": 0.02 } }, "users": 62, "duration": 1200960, "buckets": { "dev": 161, "invalidated": 22, "other": 0, "test": 162, "train": 180, "validated": 503 }, "avgDurationSecs": 2.292, "validDurationSecs": 1152.83, "totalHrs": 0.33, "validHrs": 0.32 }, "id": { "clips": 95, "splits": { "accent": { "": 1 }, "age": { "": 0.31, "twenties": 0.57, "teens": 0.13 }, "gender": { "": 0.31, "male": 0.49, "female": 0.13, "other": 0.07 } }, 
"users": 10, "duration": 243816, "buckets": { "dev": 22, "invalidated": 9, "other": 0, "test": 31, "train": 48, "validated": 101 }, "avgDurationSecs": 2.566, "validDurationSecs": 259.215, "totalHrs": 0.06, "validHrs": 0.07 }, "eu": { "clips": 404, "splits": { "accent": { "": 0.89, "mendebalekoa": 0.05, "erdialdekoa_nafarra": 0.06 }, "age": { "": 0.86, "teens": 0.02, "fourties": 0.05, "fifties": 0.04, "thirties": 0.03 }, "gender": { "": 0.86, "female": 0.07, "male": 0.07 } }, "users": 60, "duration": 1178544, "buckets": { "dev": 57, "invalidated": 13, "other": 260, "test": 61, "train": 75, "validated": 193 }, "avgDurationSecs": 2.917, "validDurationSecs": 563.017, "totalHrs": 0.32, "validHrs": 0.15 }, "tt": { "clips": 22, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.77, "": 0.23 }, "gender": { "male": 0.77, "": 0.23 } }, "users": 3, "duration": 69672, "buckets": { "dev": 0, "invalidated": 2, "other": 16, "test": 0, "train": 11, "validated": 11 }, "avgDurationSecs": 3.167, "validDurationSecs": 34.836, "totalHrs": 0.01, "validHrs": 0 } }, "overall": { "size": 3251558895, "checksum": "cdf46ebdaeb4699ba6786907a1573ccdde28a114e8351caab03a54a8e7c47b49" }, "totalDuration": 432143165, "totalValidDurationSecs": 248222, "totalHrs": 120, "totalValidHrs": 68 } ================================================ FILE: datasets/scripted-speech/cv-corpus-6.0-2020-12-11.json ================================================ { "date": "2020-12-11", "locales": { "en": { "clips": 1582837, "splits": { "accent": { "": 0.51, "canada": 0.03, "england": 0.08, "us": 0.24, "indian": 0.05, "australia": 0.03, "malaysia": 0, "newzealand": 0.01, "african": 0.01, "ireland": 0.01, "philippines": 0, "singapore": 0, "scotland": 0.02, "hongkong": 0, "bermuda": 0, "southatlandtic": 0, "wales": 0, "other": 0.01 }, "age": { "": 0.37, "twenties": 0.24, "sixties": 0.04, "thirties": 0.14, "teens": 0.06, "seventies": 0.01, "fourties": 0.1, "fifties": 0.04, "eighties": 0, "nineties": 0 }, "gender": { 
"": 0.8, "female": 0.2 } }, "users": 66151, "duration": 7845900691, "buckets": { "dev": 16164, "invalidated": 190028, "other": 167678, "reported": 1731, "test": 16164, "train": 564641, "validated": 1226615 }, "reportedSentences": 1680, "size": 60612746725, "checksum": "f030e6cca6163c841d5d621ced640b0e70a5d6dcd78def30273883ceae5438fb", "avgDurationSecs": 4.957, "validDurationSecs": 6080158.27, "totalHrs": 2179.41, "validHrs": 1688.93 }, "de": { "clips": 607943, "splits": { "accent": { "russia": 0, "germany": 0.65, "": 0.31, "austria": 0.03, "liechtenstein": 0, "switzerland": 0.01, "france": 0, "other": 0, "poland": 0, "united_kingdom": 0, "hungary": 0, "netherlands": 0, "namibia": 0, "slovakia": 0, "united_states": 0, "italy": 0, "czechia": 0, "brazil": 0, "canada": 0, "turkey": 0, "slovenia": 0, "luxembourg": 0, "belgium": 0, "denmark": 0, "greece": 0, "lithuania": 0 }, "age": { "twenties": 0.22, "fourties": 0.2, "": 0.23, "thirties": 0.16, "teens": 0.03, "sixties": 0.03, "fifties": 0.11, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "female": 0.2, "": 0.8 } }, "users": 12655, "buckets": { "dev": 15588, "invalidated": 32806, "other": 10177, "reported": 3708, "test": 15588, "train": 246372, "validated": 565087 }, "reportedSentences": 3698, "duration": 3009518417, "size": 23283795490, "checksum": "d4df4396998d6436f2fe30d45c0e75f1064e187c6fab0d55202bd272a3bdb86b", "avgDurationSecs": 4.95, "validDurationSecs": 2797367.078, "totalHrs": 835.97, "validHrs": 777.04 }, "fr": { "clips": 504180, "splits": { "accent": { "": 0.32, "france": 0.62, "germany": 0, "belgium": 0.02, "switzerland": 0.01, "guadeloupe": 0, "reunion": 0, "monaco": 0, "tunisia": 0, "canada": 0.02, "other": 0, "mayotte": 0, "algeria": 0, "netherlands": 0, "senegal": 0, "martinique": 0, "portugal": 0, "united_states": 0, "cote_d_ivoire": 0, "st_pierre_et_miquelon": 0, "french_guiana": 0, "benin": 0, "italy": 0, "united_kingdom": 0, "madagascar": 0, "cameroon": 0, "new_caledonia": 0, "haiti": 
0, "morocco": 0, "romania": 0, "congo_kinshasa": 0, "congo_brazzaville": 0, "gabon": 0 }, "age": { "twenties": 0.2, "thirties": 0.19, "": 0.3, "teens": 0.04, "fourties": 0.12, "fifties": 0.11, "sixties": 0.04, "seventies": 0.01, "eighties": 0 }, "gender": { "female": 0.17, "": 0.83 } }, "users": 12950, "duration": 2456111530, "buckets": { "dev": 15758, "invalidated": 40182, "other": 5286, "reported": 3178, "test": 15758, "train": 297089, "validated": 459109 }, "reportedSentences": 3150, "size": 19130019144, "checksum": "4c3030771eb49af63157fb0a59ff18e3484ae1d7946c4beb9d9f150b96021b98", "avgDurationSecs": 4.871, "validDurationSecs": 2236548.273, "totalHrs": 682.25, "validHrs": 621.26 }, "cy": { "clips": 94528, "splits": { "accent": { "united_kingdom": 0.52, "": 0.47, "other": 0.01 }, "age": { "fourties": 0.17, "twenties": 0.14, "sixties": 0.07, "fifties": 0.09, "": 0.41, "thirties": 0.1, "seventies": 0.01, "eighties": 0, "teens": 0.02 }, "gender": { "": 0.95, "female": 0.05 } }, "users": 1382, "duration": 447428421, "buckets": { "dev": 4754, "invalidated": 3670, "other": 18029, "reported": 122, "test": 4814, "train": 6841, "validated": 72852 }, "reportedSentences": 123, "size": 3434447665, "checksum": "16959b18a5c5ac84a48e11356ac712290618bc8c8cb38648cb05da49a3171b3d", "avgDurationSecs": 4.733, "validDurationSecs": 344829.631, "totalHrs": 124.28, "validHrs": 95.78 }, "br": { "clips": 20095, "splits": { "accent": { "other": 0, "": 0.74, "kerneveg": 0.2, "gwenedeg": 0.05, "leoneg": 0, "tregerieg": 0 }, "age": { "twenties": 0.17, "": 0.37, "fifties": 0.06, "fourties": 0.08, "thirties": 0.08, "sixties": 0.2, "seventies": 0.03, "teens": 0.01 }, "gender": { "female": 0.17, "": 0.83 } }, "users": 157, "duration": 60895123, "buckets": { "dev": 2158, "invalidated": 777, "other": 10291, "reported": 163, "test": 2157, "train": 2553, "validated": 9027 }, "reportedSentences": 164, "size": 465278116, "checksum": "d852feae9adf63c5218527ebdfd17fbcffa4fc66b5c278fba375ad7d521dc6c8", 
"avgDurationSecs": 3.03, "validDurationSecs": 27355.077, "totalHrs": 16.91, "validHrs": 7.59 }, "cv": { "clips": 11705, "splits": { "accent": { "other": 0, "": 1 }, "age": { "twenties": 0.48, "": 0.27, "fourties": 0.09, "thirties": 0.01, "teens": 0.13, "fifties": 0.02 }, "gender": { "female": 0.29, "": 0.71 } }, "users": 92, "duration": 58562220, "buckets": { "dev": 853, "invalidated": 1348, "other": 6279, "reported": 74, "test": 781, "train": 979, "validated": 4078 }, "reportedSentences": 75, "size": 439332696, "checksum": "d79149ef4ccf3ace3d44ecb9c4e448ab58edc65b7bde6175306a407604da3a31", "avgDurationSecs": 5.003, "validDurationSecs": 20402.967, "totalHrs": 16.26, "validHrs": 5.66 }, "tr": { "clips": 20736, "splits": { "accent": { "": 0.96, "other": 0.04 }, "age": { "": 0.23, "thirties": 0.23, "twenties": 0.47, "teens": 0.04, "fourties": 0.03, "fifties": 0.01, "sixties": 0 }, "gender": { "": 0.58, "female": 0.42 } }, "users": 678, "duration": 80991855, "buckets": { "dev": 1646, "invalidated": 1736, "other": 295, "reported": 72, "test": 1647, "train": 1832, "validated": 18705 }, "reportedSentences": 73, "size": 620830479, "checksum": "0f99593e1110d46dc619d4c61795544d19e061411b03cf34aff64cd5b7493d9c", "avgDurationSecs": 3.906, "validDurationSecs": 73059.059, "totalHrs": 22.49, "validHrs": 20.29 }, "tt": { "clips": 27866, "splits": { "accent": { "": 1 }, "age": { "": 0.18, "thirties": 0.75, "twenties": 0.05, "sixties": 0, "fifties": 0.01, "teens": 0, "fourties": 0, "seventies": 0.01 }, "gender": { "": 0.96, "female": 0.04 } }, "users": 185, "duration": 103646870, "buckets": { "dev": 2127, "invalidated": 287, "other": 1794, "reported": 2, "test": 4489, "train": 11211, "validated": 25785 }, "reportedSentences": 3, "size": 777143015, "checksum": "3b2c57acfac8d8d575d54603ee0b96df15a8d73f61c9d94ff6b84835f8bf8bfa", "avgDurationSecs": 3.719, "validDurationSecs": 95906.644, "totalHrs": 28.79, "validHrs": 26.64 }, "ky": { "clips": 17385, "splits": { "accent": { "": 0.92, 
"other": 0.08 }, "age": { "thirties": 0.13, "": 0.06, "fourties": 0.02, "twenties": 0.78, "teens": 0.01 }, "gender": { "": 0.66, "female": 0.34 } }, "users": 134, "duration": 80915736, "buckets": { "dev": 1509, "invalidated": 992, "other": 6756, "reported": 0, "test": 1525, "train": 1942, "validated": 9637 }, "reportedSentences": 1, "size": 579432347, "checksum": "10413ca4529863e2de2ac4ba0aa975b954bcad1be7fdb0ef11169078d2001c84", "avgDurationSecs": 4.654, "validDurationSecs": 44853.894, "totalHrs": 22.47, "validHrs": 12.45 }, "ga-IE": { "clips": 5891, "splits": { "accent": { "": 0.4, "connachta": 0.27, "mumhain": 0.15, "other": 0.06, "ulaidh": 0.12 }, "age": { "twenties": 0.37, "": 0.19, "thirties": 0.3, "fourties": 0.07, "sixties": 0, "teens": 0.02, "fifties": 0.06 }, "gender": { "female": 0.31, "": 0.69 } }, "users": 101, "duration": 20497419, "buckets": { "dev": 507, "invalidated": 534, "other": 1878, "reported": 4, "test": 496, "train": 541, "validated": 3479 }, "reportedSentences": 5, "size": 156554524, "checksum": "dad7a17fcc8bf7c3e154e08a769d5d550dc23b006d792c103611be3b97e39c2f", "avgDurationSecs": 3.479, "validDurationSecs": 12104.994, "totalHrs": 5.69, "validHrs": 3.36 }, "kab": { "clips": 679785, "splits": { "accent": { "": 0.9, "other": 0.1 }, "age": { "fourties": 0.1, "thirties": 0.3, "": 0.26, "fifties": 0.19, "twenties": 0.12, "eighties": 0, "teens": 0, "sixties": 0.03, "seventies": 0 }, "gender": { "": 0.96, "female": 0.04 } }, "users": 1306, "duration": 2241399940, "buckets": { "dev": 14622, "invalidated": 18134, "other": 88029, "reported": 3092, "test": 14622, "train": 120530, "validated": 573710 }, "reportedSentences": 3052, "size": 17171374359, "checksum": "bb0046c5b368993c16cd3537e5f0ac397aef97f3e54c817f39f57e7dec421e1b", "avgDurationSecs": 3.297, "validDurationSecs": 1891647.447, "totalHrs": 622.61, "validHrs": 525.45 }, "ca": { "clips": 498357, "splits": { "accent": { "valencian": 0.05, "central": 0.67, "": 0.22, "other": 0, "balearic": 0.01, 
"northwestern": 0.03, "northern": 0.01 }, "age": { "thirties": 0.13, "fifties": 0.23, "fourties": 0.16, "twenties": 0.08, "": 0.17, "sixties": 0.22, "teens": 0.01, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "": 0.93, "female": 0.07 } }, "users": 5370, "duration": 2683762302, "buckets": { "dev": 15711, "invalidated": 18521, "other": 68926, "reported": 1189, "test": 15711, "train": 281374, "validated": 412561 }, "reportedSentences": 1185, "size": 20742924739, "checksum": "ff0f4724e6de643a74fd4216f7d9afdab5e833c09d5dd78ab8c4c2f862c114ba", "avgDurationSecs": 5.385, "validDurationSecs": 2221731.929, "totalHrs": 745.48, "validHrs": 617.14 }, "zh-TW": { "clips": 87293, "splits": { "accent": { "": 0.51, "taipei_city": 0.16, "other": 0.09, "kaohsiung_city": 0.03, "new_taipei_city": 0.05, "tainan_city": 0.04, "taichung_city": 0.03, "yilan_county": 0.01, "hong_kong": 0.01, "taoyuan_city": 0.01, "hsinchu_city": 0.02, "chiayi_city": 0, "yunlin_county": 0.02, "nantou_county": 0, "changhua_county": 0.01, "pingtung_county": 0.01, "hsinchu_county": 0, "chiayi_county": 0, "keelung_city": 0, "hualien_county": 0.01, "kinmen_county": 0 }, "age": { "thirties": 0.24, "twenties": 0.36, "teens": 0.04, "": 0.23, "fifties": 0.01, "seventies": 0, "fourties": 0.12, "sixties": 0 }, "gender": { "": 0.77, "female": 0.23 } }, "users": 1444, "duration": 282305146, "buckets": { "dev": 2888, "invalidated": 3747, "other": 21701, "reported": 38, "test": 2888, "train": 3496, "validated": 61845 }, "reportedSentences": 39, "size": 2182782100, "checksum": "b85c7744e65f35e99516082b45128125a396edfc5466710af273828535be9a5c", "avgDurationSecs": 3.234, "validDurationSecs": 200006.435, "totalHrs": 78.41, "validHrs": 55.55 }, "sl": { "clips": 7263, "splits": { "accent": { "other": 0.04, "": 0.96 }, "age": { "twenties": 0.62, "teens": 0.11, "": 0.06, "sixties": 0.03, "fifties": 0.11, "fourties": 0.01, "thirties": 0.06 }, "gender": { "": 0.41, "female": 0.59 } }, "users": 82, "duration": 
28545274, "buckets": { "dev": 534, "invalidated": 140, "other": 2130, "reported": 13, "test": 996, "train": 1972, "validated": 4993 }, "reportedSentences": 14, "size": 222749198, "checksum": "c597481b8d1c60878a96bcda42e17e2a185c314fda876e6fe38eb099d85a05ce", "avgDurationSecs": 3.93, "validDurationSecs": 19623.648, "totalHrs": 7.92, "validHrs": 5.45 }, "it": { "clips": 129115, "splits": { "accent": { "": 0.95, "other": 0.05 }, "age": { "thirties": 0.19, "twenties": 0.23, "": 0.31, "fifties": 0.09, "fourties": 0.14, "seventies": 0.01, "sixties": 0.03, "teens": 0.02, "eighties": 0, "nineties": 0 }, "gender": { "": 0.81, "female": 0.19 } }, "users": 5727, "duration": 717521160, "buckets": { "dev": 12933, "invalidated": 12190, "other": 14386, "reported": 1031, "test": 12938, "train": 58158, "validated": 102741 }, "reportedSentences": 1011, "size": 5585770901, "checksum": "947f2bd67cd202f9fcba1ed9ca4fc7bbaa4fbc43364dc7dad0f6e66a5e502eb6", "avgDurationSecs": 5.557, "validDurationSecs": 570954.897, "totalHrs": 199.31, "validHrs": 158.59 }, "nl": { "clips": 55457, "splits": { "accent": { "": 0.26, "netherlands": 0.61, "belgium": 0.11, "other": 0, "germany": 0, "france": 0, "suriname": 0.01, "curacao": 0 }, "age": { "": 0.29, "twenties": 0.24, "fourties": 0.18, "thirties": 0.14, "teens": 0.04, "fifties": 0.1, "sixties": 0.01 }, "gender": { "": 0.81, "female": 0.19 } }, "users": 1012, "duration": 226181377, "buckets": { "dev": 4927, "invalidated": 3310, "other": 72, "reported": 199, "test": 5703, "train": 9456, "validated": 52441 }, "reportedSentences": 199, "size": 1741815049, "checksum": "c16ab7b20c87410e8d64e3ebd6e20f890d395e231a9f9c34ab759d917582f052", "avgDurationSecs": 4.079, "validDurationSecs": 213880.621, "totalHrs": 62.82, "validHrs": 59.41 }, "cnh": { "clips": 5799, "splits": { "accent": { "": 0.85, "other": 0.15 }, "age": { "": 0.51, "twenties": 0.36, "fourties": 0.01, "teens": 0.02, "thirties": 0.08, "fifties": 0.02 }, "gender": { "": 0.8, "female": 0.2 } }, 
"users": 297, "duration": 20663664, "buckets": { "dev": 743, "invalidated": 472, "other": 2824, "reported": 8, "test": 762, "train": 865, "validated": 2503 }, "reportedSentences": 9, "size": 161334217, "checksum": "1e1b4af7c20e6f1b264e63eab763ecea6417ae216c9601e6bbcf21ec50752353", "avgDurationSecs": 3.563, "validDurationSecs": 8918.978, "totalHrs": 5.73, "validHrs": 2.47 }, "eo": { "clips": 65726, "splits": { "accent": { "": 0.29, "internacia": 0.7, "other": 0 }, "age": { "twenties": 0.32, "thirties": 0.11, "": 0.12, "fourties": 0.31, "fifties": 0.03, "seventies": 0.01, "teens": 0.03, "sixties": 0.02, "eighties": 0.06 }, "gender": { "female": 0.22, "": 0.78 } }, "users": 574, "duration": 369742003, "buckets": { "dev": 8976, "invalidated": 4762, "other": 2676, "reported": 404, "test": 9024, "train": 19764, "validated": 58338 }, "reportedSentences": 402, "size": 2883565367, "checksum": "cd6379c89fa7d57e0cd254a91d1a95fbd36117e774a1f33bace8e1dbaf7e0b1d", "avgDurationSecs": 5.626, "validDurationSecs": 328180.765, "totalHrs": 102.7, "validHrs": 91.16 }, "et": { "clips": 14809, "splits": { "accent": { "": 1 }, "age": { "": 0.15, "thirties": 0.07, "twenties": 0.71, "fourties": 0.07, "fifties": 0.01, "seventies": 0, "teens": 0 }, "gender": { "": 0.48, "female": 0.52 } }, "users": 543, "duration": 98800166, "buckets": { "dev": 2507, "invalidated": 3575, "other": 532, "reported": 107, "test": 2509, "train": 2965, "validated": 10702 }, "reportedSentences": 108, "size": 767173893, "checksum": "98ade2d2c5298f503aa45c3c880ad79a2b39f2f9577bda5da2dae008116aeb00", "avgDurationSecs": 6.672, "validDurationSecs": 71399.782, "totalHrs": 27.44, "validHrs": 19.83 }, "fa": { "clips": 285645, "splits": { "accent": { "": 1 }, "age": { "": 0.21, "twenties": 0.33, "thirties": 0.39, "fifties": 0.02, "fourties": 0.03, "teens": 0.01, "sixties": 0 }, "gender": { "": 0.7, "female": 0.3 } }, "users": 3654, "duration": 1154779892, "buckets": { "dev": 5161, "invalidated": 11806, "other": 20469, 
"reported": 1522, "test": 5161, "train": 7485, "validated": 253592 }, "reportedSentences": 1516, "size": 8884631439, "checksum": "58c4fc315923e0524d4c7710d7a9be4285377f4dbecf1d0caa8e9e95d13db32e", "avgDurationSecs": 4.043, "validDurationSecs": 1025198.909, "totalHrs": 320.77, "validHrs": 284.77 }, "eu": { "clips": 91966, "splits": { "accent": { "mendebalekoa": 0.31, "erdialdekoa_nafarra": 0.3, "": 0.39, "nafarlapurtarra_zuberoatarra": 0.01 }, "age": { "fourties": 0.14, "thirties": 0.08, "fifties": 0.14, "twenties": 0.38, "": 0.22, "teens": 0.03, "sixties": 0.01, "seventies": 0 }, "gender": { "": 0.78, "female": 0.22 } }, "users": 1028, "duration": 472668999, "buckets": { "dev": 5195, "invalidated": 5527, "other": 23026, "reported": 38, "test": 5195, "train": 7555, "validated": 63413 }, "reportedSentences": 39, "size": 3664570179, "checksum": "e5e7edc46a3f9b2567ac257f8f9fd41f35a371bad5862b94e5ca5dc4c6bd2a24", "avgDurationSecs": 5.14, "validDurationSecs": 325917.831, "totalHrs": 131.29, "validHrs": 90.53 }, "es": { "clips": 421193, "splits": { "accent": { "nortepeninsular": 0.1, "": 0.5, "rioplatense": 0.04, "centrosurpeninsular": 0.03, "americacentral": 0.02, "surpeninsular": 0.15, "canario": 0, "andino": 0.05, "mexicano": 0.06, "chileno": 0.02, "caribe": 0.03, "filipinas": 0 }, "age": { "thirties": 0.1, "": 0.32, "fifties": 0.09, "twenties": 0.26, "teens": 0.03, "fourties": 0.06, "sixties": 0.14, "eighties": 0, "seventies": 0, "nineties": 0 }, "gender": { "": 0.84, "female": 0.16 } }, "users": 19480, "duration": 2082967866, "buckets": { "dev": 15077, "invalidated": 40487, "other": 146299, "reported": 1109, "test": 15077, "train": 160481, "validated": 234959 }, "reportedSentences": 1098, "size": 16188722605, "checksum": "0ac532f1b042201d02160e63f35670f91b11589034ad0b79632e3c90fd05d675", "avgDurationSecs": 4.945, "validDurationSecs": 1161966.241, "totalHrs": 578.6, "validHrs": 322.76 }, "zh-CN": { "clips": 50658, "splits": { "accent": { "110000": 0.04, "120000": 0, 
"130000": 0.02, "140000": 0.01, "150000": 0, "210000": 0.01, "220000": 0, "230000": 0.04, "310000": 0.05, "320000": 0.05, "330000": 0.03, "340000": 0.02, "350000": 0.01, "360000": 0.02, "370000": 0.03, "410000": 0.04, "420000": 0.02, "430000": 0.01, "440000": 0.03, "450000": 0.02, "460000": 0.01, "500000": 0.02, "510000": 0.03, "520000": 0, "530000": 0, "610000": 0.01, "620000": 0, "640000": 0, "650000": 0, "710000": 0, "810000": 0, "": 0.45 }, "age": { "": 0.38, "teens": 0.08, "twenties": 0.38, "thirties": 0.13, "fourties": 0.03, "nineties": 0, "fifties": 0, "sixties": 0 }, "gender": { "": 0.68, "female": 0.32 } }, "users": 3501, "duration": 281080968, "buckets": { "dev": 8736, "invalidated": 5357, "other": 8829, "reported": 279, "test": 8772, "train": 18605, "validated": 36472 }, "reportedSentences": 279, "size": 2184559529, "checksum": "b44c46fffadb14a3effb60362295533e1484c1dcb76cbdb0ef04a06c87316667", "avgDurationSecs": 5.549, "validDurationSecs": 202368.531, "totalHrs": 78.07, "validHrs": 56.21 }, "mn": { "clips": 11426, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.23, "": 0.25, "twenties": 0.42, "fourties": 0.01, "teens": 0.02, "nineties": 0.06 }, "gender": { "": 0.79, "female": 0.21 } }, "users": 376, "duration": 62534240, "buckets": { "dev": 1863, "invalidated": 693, "other": 3192, "reported": 2, "test": 1866, "train": 2163, "validated": 7541 }, "reportedSentences": 3, "size": 486368725, "checksum": "34efd592ff52d3b1d17add62f6af785b69fa945769ea539c275b33e6e0438dee", "avgDurationSecs": 5.473, "validDurationSecs": 41271.723, "totalHrs": 17.37, "validHrs": 11.46 }, "sah": { "clips": 3947, "splits": { "accent": { "": 1 }, "age": { "": 0.36, "twenties": 0.02, "fourties": 0.07, "thirties": 0.44, "teens": 0.11, "fifties": 0 }, "gender": { "": 0.88, "female": 0.12 } }, "users": 42, "duration": 23639016, "buckets": { "dev": 581, "invalidated": 73, "other": 1073, "test": 776, "train": 1442, "validated": 2801 }, "size": 181255394, "checksum": 
"45b40288f769f758c6e66c6d663fd934fd06a3064ea2db6e173907946e2ae756", "avgDurationSecs": 5.989, "validDurationSecs": 16775.496, "totalHrs": 6.56, "validHrs": 4.65 }, "dv": { "clips": 12706, "splits": { "accent": { "": 1 }, "age": { "": 0.11, "twenties": 0.19, "thirties": 0.62, "fourties": 0.05, "teens": 0.02 }, "gender": { "": 0.97, "female": 0.03 } }, "users": 167, "duration": 70042959, "buckets": { "dev": 2072, "invalidated": 780, "other": 116, "reported": 2, "test": 2192, "train": 2693, "validated": 11810 }, "reportedSentences": 3, "size": 540487184, "checksum": "1196b9fa4976531e4f5b1828065278865f72be719cef7ebe2681e90a431a0c47", "avgDurationSecs": 5.513, "validDurationSecs": 65103.679, "totalHrs": 19.45, "validHrs": 18.08 }, "rw": { "clips": 1062642, "splits": { "accent": { "": 1 }, "age": { "": 0.02, "twenties": 0.63, "thirties": 0.11, "teens": 0.24, "fourties": 0 }, "gender": { "": 0.61, "female": 0.39 } }, "users": 411, "duration": 5436192485, "buckets": { "dev": 15007, "invalidated": 206525, "other": 27057, "reported": 618, "test": 15733, "train": 511386, "validated": 829060 }, "reportedSentences": 619, "size": 42544959200, "checksum": "8270c36a7f1e0f937567628b259088f19a3e6156e609da775b6af73fb9c770d1", "avgDurationSecs": 5.116, "validDurationSecs": 4241249.397, "totalHrs": 1510.05, "validHrs": 1178.12 }, "sv-SE": { "clips": 16032, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.27, "": 0.16, "teens": 0.07, "fifties": 0.09, "twenties": 0.2, "fourties": 0.22, "sixties": 0 }, "gender": { "": 0.84, "female": 0.16 } }, "users": 222, "duration": 55467663, "buckets": { "dev": 1971, "invalidated": 469, "other": 3082, "reported": 151, "test": 1970, "train": 2374, "validated": 12506 }, "reportedSentences": 152, "size": 421428100, "checksum": "2998270c70ea2c2201a5f87ce4e467529b99408befdf6c39f25f4d93388ff185", "avgDurationSecs": 3.46, "validDurationSecs": 43268.375, "totalHrs": 15.4, "validHrs": 12.01 }, "ru": { "clips": 87534, "splits": { "accent": { "": 1 }, 
"age": { "twenties": 0.46, "teens": 0.08, "": 0.11, "fourties": 0.16, "thirties": 0.13, "fifties": 0.05, "sixties": 0, "seventies": 0 }, "gender": { "female": 0.35, "": 0.65 } }, "users": 1410, "duration": 471270204, "buckets": { "dev": 7958, "invalidated": 3091, "other": 10098, "reported": 142, "test": 8007, "train": 15486, "validated": 74370 }, "reportedSentences": 136, "size": 3655671426, "checksum": "a324e505042f52336d64031a1ffef1b13199210e688b482dfa82871f05cc1511", "avgDurationSecs": 5.384, "validDurationSecs": 400397.161, "totalHrs": 130.9, "validHrs": 111.22 }, "id": { "clips": 15914, "splits": { "accent": { "": 1 }, "age": { "": 0.09, "twenties": 0.42, "thirties": 0.08, "teens": 0.36, "fifties": 0.01, "fourties": 0.04 }, "gender": { "": 0.66, "female": 0.34 } }, "users": 219, "duration": 61607784, "buckets": { "dev": 1801, "invalidated": 528, "other": 6862, "reported": 200, "test": 1818, "train": 2117, "validated": 8558 }, "reportedSentences": 201, "size": 475917179, "checksum": "17d8501b57be65d3a339ee210951c10090321806b54eaa5fba45842a72fe1857", "avgDurationSecs": 3.871, "validDurationSecs": 33130.54, "totalHrs": 17.11, "validHrs": 9.2 }, "ar": { "clips": 56153, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.14, "": 0.42, "twenties": 0.37, "fourties": 0.01, "teens": 0.06, "fifties": 0 }, "gender": { "": 0.73, "female": 0.27 } }, "users": 659, "duration": 227977686, "buckets": { "dev": 7567, "invalidated": 5437, "other": 22528, "reported": 825, "test": 7626, "train": 14173, "validated": 39953 }, "reportedSentences": 457, "size": 1756262997, "checksum": "1a40f4948d0ea92b5ca928877156911bc40c1d5f027fa3abe164c1b6262ae5cf", "avgDurationSecs": 4.06, "validDurationSecs": 162206.694, "totalHrs": 63.32, "validHrs": 45.05 }, "ta": { "clips": 20674, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.45, "thirties": 0.41, "": 0.1, "fourties": 0.01, "seventies": 0, "fifties": 0.01, "teens": 0.01, "sixties": 0 }, "gender": { "female": 0.36, "": 0.64 } }, 
"users": 266, "duration": 88153080, "buckets": { "dev": 1747, "invalidated": 723, "other": 6839, "reported": 257, "test": 1780, "train": 2065, "validated": 13112 }, "reportedSentences": 257, "size": 679765629, "checksum": "29b09471ce309432eb6717b30c8a1521a2b7323c9ccb9967b5b414afe959d470", "avgDurationSecs": 4.264, "validDurationSecs": 55909.025, "totalHrs": 24.48, "validHrs": 15.53 }, "ia": { "clips": 7260, "splits": { "accent": { "": 1 }, "age": { "seventies": 0.24, "fourties": 0.58, "": 0.06, "twenties": 0.07, "thirties": 0.03, "teens": 0, "fifties": 0.01, "sixties": 0 }, "gender": { "": 0.93, "female": 0.07 } }, "users": 35, "duration": 29092476, "buckets": { "dev": 1590, "invalidated": 207, "other": 1184, "reported": 240, "test": 886, "train": 3397, "validated": 5874 }, "reportedSentences": 236, "size": 226498881, "checksum": "5a68c83b7ecbc7470bdcaa79ea38b10e786248eabad06ba668e75040741d7353", "avgDurationSecs": 4.007, "validDurationSecs": 23538.458, "totalHrs": 8.08, "validHrs": 6.53 }, "pt": { "clips": 51679, "splits": { "accent": { "": 1 }, "age": { "": 0.14, "twenties": 0.35, "teens": 0.03, "thirties": 0.32, "fourties": 0.13, "sixties": 0, "fifties": 0.04, "seventies": 0 }, "gender": { "": 0.64, "female": 0.36 } }, "users": 1117, "duration": 226859263, "buckets": { "dev": 4639, "invalidated": 1799, "other": 7929, "reported": 1698, "test": 4642, "train": 6446, "validated": 41986 }, "reportedSentences": 1694, "size": 1704250497, "checksum": "bec46b0f07821023e7a9564c072ba0fd5609e0ba710c8036059d8900caa0c0da", "avgDurationSecs": 4.39, "validDurationSecs": 184309.159, "totalHrs": 63.01, "validHrs": 51.19 }, "lv": { "clips": 8147, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.48, "fourties": 0.03, "": 0.16, "twenties": 0.29, "teens": 0.03, "fifties": 0 }, "gender": { "": 0.7, "female": 0.3 } }, "users": 99, "duration": 27668465, "buckets": { "dev": 1656, "invalidated": 175, "other": 1296, "reported": 20, "test": 1910, "train": 3102, "validated": 6676 }, 
"reportedSentences": 21, "size": 208314325, "checksum": "b8654831ce5814ba179630bdfc2f8d51718462909c3ad2cf3c252d094fa909da", "avgDurationSecs": 3.396, "validDurationSecs": 22672.723, "totalHrs": 7.68, "validHrs": 6.29 }, "ja": { "clips": 4452, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.51, "": 0.3, "teens": 0.04, "fifties": 0.02, "thirties": 0.06, "fourties": 0.06, "sixties": 0 }, "gender": { "female": 0.4, "": 0.6 } }, "users": 232, "duration": 19685913, "buckets": { "dev": 588, "invalidated": 507, "other": 883, "reported": 43, "test": 630, "train": 724, "validated": 3071 }, "reportedSentences": 44, "size": 152880299, "checksum": "e2a4c64e6d9fb77289c4a25d40aff206b6659f8107a25ee66e33d48c680e31d4", "avgDurationSecs": 4.422, "validDurationSecs": 13579.389, "totalHrs": 5.46, "validHrs": 3.77 }, "vot": { "clips": 420, "splits": { "accent": { "": 1 }, "age": { "": 0.26, "twenties": 0.74 }, "gender": { "": 0.26, "female": 0.74 } }, "users": 3, "duration": 1008696, "buckets": { "dev": 0, "invalidated": 60, "other": 352, "test": 0, "train": 8, "validated": 8 }, "size": 7791984, "checksum": "f11fc1ddcc9461de832cb304118993ea5782e0136782aaf99406f6f9f1a51e99", "avgDurationSecs": 2.402, "validDurationSecs": 19.213, "totalHrs": 0.28, "validHrs": 0 }, "ab": { "clips": 791, "splits": { "accent": { "": 1 }, "age": { "seventies": 0.63, "thirties": 0.29, "": 0.08, "teens": 0.01 }, "gender": { "": 1 } }, "users": 14, "duration": 5216688, "buckets": { "dev": 0, "invalidated": 8, "other": 752, "reported": 42, "test": 9, "train": 22, "validated": 31 }, "reportedSentences": 43, "size": 41038479, "checksum": "ad0c6d987ba72d7c47f709b95f5b1fccc8300b378fd05df49c95819a01dc2793", "avgDurationSecs": 6.595, "validDurationSecs": 204.447, "totalHrs": 1.44, "validHrs": 0.05 }, "zh-HK": { "clips": 83664, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.15, "thirties": 0.11, "": 0.4, "teens": 0.02, "fifties": 0.03, "seventies": 0, "sixties": 0, "twenties": 0.29 }, "gender": { "": 
0.84, "female": 0.16 } }, "users": 2536, "duration": 361968900, "buckets": { "dev": 5172, "invalidated": 3201, "other": 37498, "reported": 571, "test": 5173, "train": 7514, "validated": 42965 }, "reportedSentences": 560, "size": 2774151973, "checksum": "f9dfbd13c1721753580f02535859e849d0cbf8d4ca974df34126fb2ad53d4f81", "avgDurationSecs": 4.326, "validDurationSecs": 185886.329, "totalHrs": 100.54, "validHrs": 51.63 }, "rm-sursilv": { "clips": 6524, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.02, "twenties": 0.11, "": 0.64, "teens": 0.06, "fourties": 0.17 }, "gender": { "": 0.93, "female": 0.07 } }, "users": 78, "duration": 35885741, "buckets": { "dev": 1208, "invalidated": 664, "other": 2038, "reported": 9, "test": 1222, "train": 1392, "validated": 3822 }, "reportedSentences": 10, "size": 275949719, "checksum": "4ed2cb43b36cfd2ab917fd9285b76ba3d01ef78872ef51e714fcff2bb4cb0e49", "avgDurationSecs": 5.501, "validDurationSecs": 21023.192, "totalHrs": 9.96, "validHrs": 5.83 }, "hsb": { "clips": 1656, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.55, "": 0.17, "thirties": 0.11, "sixties": 0, "seventies": 0.03, "twenties": 0.11, "fifties": 0.03 }, "gender": { "": 0.89, "female": 0.11 } }, "users": 19, "duration": 10103328, "buckets": { "dev": 172, "invalidated": 227, "other": 62, "reported": 56, "test": 387, "train": 808, "validated": 1367 }, "reportedSentences": 57, "size": 79362015, "checksum": "6096fa84693d8e1abeed2d07271451f59db5e41caf22c53be5824546c60ca523", "avgDurationSecs": 6.101, "validDurationSecs": 8340.126, "totalHrs": 2.8, "validHrs": 2.31 }, "ro": { "clips": 8469, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.51, "teens": 0.05, "": 0.07, "fourties": 0.08, "sixties": 0.01, "twenties": 0.27, "fifties": 0 }, "gender": { "": 0.91, "female": 0.09 } }, "users": 130, "duration": 34024999, "buckets": { "dev": 849, "invalidated": 493, "other": 1946, "reported": 59, "test": 1783, "train": 3394, "validated": 6030 }, "reportedSentences": 
60, "size": 261978819, "checksum": "3cbad11a867402d4ece946c3b34e3f1c804f503947da74847b6757e0cc410c15", "avgDurationSecs": 4.018, "validDurationSecs": 24226.089, "totalHrs": 9.45, "validHrs": 6.72 }, "fy-NL": { "clips": 33075, "splits": { "accent": { "": 1 }, "age": { "": 0.56, "fifties": 0.1, "thirties": 0.06, "twenties": 0.03, "fourties": 0.11, "sixties": 0.15, "seventies": 0 }, "gender": { "": 0.99, "female": 0.01 } }, "users": 466, "duration": 168875163, "buckets": { "dev": 2791, "invalidated": 1032, "other": 21590, "reported": 281, "test": 3017, "train": 3927, "validated": 10473 }, "reportedSentences": 281, "size": 1237741428, "checksum": "34387d3293893cde2603199a7f5beaac939104d617a89cb3dcb99906f306632c", "avgDurationSecs": 5.106, "validDurationSecs": 53473.306, "totalHrs": 46.9, "validHrs": 14.85 }, "cs": { "clips": 38566, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.26, "": 0.3, "thirties": 0.14, "teens": 0.01, "twenties": 0.27, "fifties": 0.02, "sixties": 0.01 }, "gender": { "": 0.74, "female": 0.26 } }, "users": 353, "duration": 164221815, "buckets": { "dev": 4095, "invalidated": 729, "other": 7026, "reported": 585, "test": 4117, "train": 5725, "validated": 30836 }, "reportedSentences": 582, "size": 1271911386, "checksum": "26fedfa2d6bab4b39104a6865e9dd47c181ad4f4b2027f1274d00de4896c7b8a", "avgDurationSecs": 4.258, "validDurationSecs": 131305.914, "totalHrs": 45.61, "validHrs": 36.47 }, "el": { "clips": 11840, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.64, "fourties": 0.19, "": 0.14, "twenties": 0.02, "fifties": 0.01, "teens": 0 }, "gender": { "": 0.97, "female": 0.03 } }, "users": 118, "duration": 49105618, "buckets": { "dev": 1392, "invalidated": 274, "other": 5557, "reported": 18, "test": 1563, "train": 2273, "validated": 6009 }, "reportedSentences": 19, "size": 381566104, "checksum": "1e273d7b34be5c4d7e210dcb78d99172921725ee2e37072ec976acb4fa29d991", "avgDurationSecs": 4.147, "validDurationSecs": 24921.931, "totalHrs": 13.64, 
"validHrs": 6.92 }, "rm-vallader": { "clips": 2417, "splits": { "accent": { "": 1 }, "age": { "": 0.35, "fourties": 0.44, "twenties": 0.15, "thirties": 0.06, "fifties": 0 }, "gender": { "": 0.87, "female": 0.13 } }, "users": 39, "duration": 13869698, "buckets": { "dev": 357, "invalidated": 382, "other": 719, "reported": 21, "test": 378, "train": 574, "validated": 1316 }, "reportedSentences": 20, "size": 108113716, "checksum": "9a85ea50ac56aac6ba3a37bde29f71a8ef033a84113359195b31241870f25ecb", "avgDurationSecs": 5.738, "validDurationSecs": 7551.726, "totalHrs": 3.85, "validHrs": 2.09 }, "pl": { "clips": 108240, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.27, "": 0.24, "teens": 0.02, "thirties": 0.35, "fourties": 0.12, "fifties": 0, "nineties": 0.01, "sixties": 0 }, "gender": { "female": 0.19, "": 0.81 } }, "users": 2647, "duration": 466550750, "buckets": { "dev": 5153, "invalidated": 4623, "other": 12706, "reported": 431, "test": 5153, "train": 7468, "validated": 90911 }, "reportedSentences": 431, "size": 3537006506, "checksum": "c2f85c8e844e35f46b05d6c1a6db25b08abd658fa4662dacdf66e630f3c81123", "avgDurationSecs": 4.31, "validDurationSecs": 391856.94, "totalHrs": 129.59, "validHrs": 108.84 }, "as": { "clips": 535, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.29, "": 0.62, "thirties": 0.08 }, "gender": { "female": 0.29, "": 0.71 } }, "users": 17, "duration": 2835893, "buckets": { "dev": 124, "invalidated": 26, "other": 5, "reported": 2, "test": 110, "train": 270, "validated": 504 }, "reportedSentences": 3, "size": 22225821, "checksum": "9852899831cabe53b45549da8a09de1649505ce852f3cea2fa804baa8b554157", "avgDurationSecs": 5.301, "validDurationSecs": 2671.57, "totalHrs": 0.78, "validHrs": 0.74 }, "uk": { "clips": 31753, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.27, "teens": 0.07, "": 0.19, "fourties": 0.22, "thirties": 0.24, "fifties": 0.01, "sixties": 0 }, "gender": { "female": 0.16, "": 0.84 } }, "users": 459, "duration": 
158034012, "buckets": { "dev": 3239, "invalidated": 1259, "other": 8175, "reported": 104, "test": 3234, "train": 4032, "validated": 22319 }, "reportedSentences": 95, "size": 1218554958, "checksum": "4978874e7dcaba60f32e3698785acac3c8f9ee127ec681b44584dae549b736c8", "avgDurationSecs": 4.977, "validDurationSecs": 111081.193, "totalHrs": 43.89, "validHrs": 30.85 }, "mt": { "clips": 11775, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.17, "": 0.25, "fourties": 0.18, "thirties": 0.08, "teens": 0.03, "fifties": 0.28, "sixties": 0.01 }, "gender": { "female": 0.12, "": 0.88 } }, "users": 171, "duration": 55832112, "buckets": { "dev": 1492, "invalidated": 321, "other": 5633, "reported": 4, "test": 1661, "train": 2015, "validated": 5821 }, "reportedSentences": 5, "size": 425114531, "checksum": "814ebac24593526a378c5b4c14163418306d84ae3a82e88302592704c373628c", "avgDurationSecs": 4.742, "validDurationSecs": 27600.741, "totalHrs": 15.5, "validHrs": 7.66 }, "ka": { "clips": 2458, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.59, "thirties": 0.1, "": 0.26, "fourties": 0.05 }, "gender": { "female": 0.3, "": 0.7 } }, "users": 44, "duration": 14005512, "buckets": { "dev": 454, "invalidated": 138, "other": 75, "reported": 2, "test": 711, "train": 1058, "validated": 2245 }, "reportedSentences": 3, "size": 104279708, "checksum": "099bb42f00b819df3ecf743afc7a2f98d38abe27f7f18ac1164f63d4e0cf52bd", "avgDurationSecs": 5.698, "validDurationSecs": 12791.853, "totalHrs": 3.89, "validHrs": 3.55 }, "pa-IN": { "clips": 1825, "splits": { "accent": { "": 1 }, "age": { "": 0.13, "fourties": 0.06, "fifties": 0.01, "thirties": 0.56, "twenties": 0.23 }, "gender": { "": 0.77, "female": 0.23 } }, "users": 26, "duration": 8904554, "buckets": { "dev": 124, "invalidated": 202, "other": 1175, "reported": 232, "test": 136, "train": 188, "validated": 448 }, "reportedSentences": 231, "size": 69751993, "checksum": "19511cd19e2dcaffedd5be65ac28a41c91dce4dcc8e1b21efb22858236b7bae0", 
"avgDurationSecs": 4.879, "validDurationSecs": 2185.885, "totalHrs": 2.47, "validHrs": 0.6 }, "or": { "clips": 4979, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.13, "": 0.03, "thirties": 0.83, "fourties": 0, "teens": 0 }, "gender": { "female": 0.13, "": 0.87 } }, "users": 34, "duration": 25499688, "buckets": { "dev": 0, "invalidated": 98, "other": 4060, "reported": 4, "test": 129, "train": 691, "validated": 821 }, "reportedSentences": 5, "size": 199087467, "checksum": "9ca9f69fb8425932084ba603c06b887ca259eed89103e2c835ff3fecce75ef04", "avgDurationSecs": 5.121, "validDurationSecs": 4204.709, "totalHrs": 7.08, "validHrs": 1.16 }, "vi": { "clips": 1567, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.05, "twenties": 0.23, "": 0.15, "teens": 0.02, "seventies": 0.01, "fourties": 0, "sixties": 0.53 }, "gender": { "": 0.79, "female": 0.21 } }, "users": 62, "duration": 6755804, "buckets": { "dev": 181, "invalidated": 96, "other": 853, "reported": 7, "test": 201, "train": 236, "validated": 618 }, "reportedSentences": 8, "size": 51929202, "checksum": "0e5ac63f3130b5fd00af0ae18f836abd57d462bfe2d00ce844755474ad395565", "avgDurationSecs": 4.311, "validDurationSecs": 2664.382, "totalHrs": 1.87, "validHrs": 0.74 }, "hu": { "clips": 6911, "splits": { "accent": { "": 1 }, "age": { "teens": 0, "": 0.05, "thirties": 0.14, "twenties": 0.75, "fifties": 0.06 }, "gender": { "": 0.31, "female": 0.69 } }, "users": 46, "duration": 31171321, "buckets": { "dev": 1912, "invalidated": 193, "other": 115, "reported": 31, "test": 2050, "train": 2623, "validated": 6613 }, "reportedSentences": 32, "size": 242765616, "checksum": "10a6c3833a98586792dd6cb1eea5da03c4b5a4c569f9b81d318576430ca83dc5", "avgDurationSecs": 4.51, "validDurationSecs": 29827.224, "totalHrs": 8.65, "validHrs": 8.28 }, "fi": { "clips": 1513, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.4, "": 0.23, "twenties": 0.14, "fourties": 0.17, "teens": 0.04, "fifties": 0.02 }, "gender": { "": 0.86, "female": 
0.14 } }, "users": 27, "duration": 6905415, "buckets": { "dev": 406, "invalidated": 59, "other": 198, "reported": 3, "test": 408, "train": 440, "validated": 1256 }, "reportedSentences": 3, "size": 49881693, "checksum": "e746afdbe5e92874190719f4ea53ccd0d9eb0073681b4797e08d62954fcf3027", "avgDurationSecs": 4.564, "validDurationSecs": 5732.453, "totalHrs": 1.91, "validHrs": 1.59 }, "th": { "clips": 10166, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.3, "": 0.17, "thirties": 0.18, "fourties": 0.3, "teens": 0.05, "fifties": 0.01 }, "gender": { "female": 0.21, "": 0.79 } }, "users": 182, "duration": 44472936, "buckets": { "dev": 1951, "invalidated": 492, "other": 2734, "reported": 274, "test": 2159, "train": 2829, "validated": 6940 }, "reportedSentences": 275, "size": 341305351, "checksum": "7577e16ded6e579e905932f114fba8dfa8ae6e7f1aef465b495c36912d7f6047", "avgDurationSecs": 4.375, "validDurationSecs": 30360.238, "totalHrs": 12.35, "validHrs": 8.43 }, "lg": { "clips": 5615, "splits": { "accent": { "": 1 }, "age": { "": 0.06, "thirties": 0.63, "twenties": 0.31, "fourties": 0 }, "gender": { "": 0.89, "female": 0.11 } }, "users": 77, "duration": 29566467, "buckets": { "dev": 355, "invalidated": 363, "other": 3044, "reported": 1, "test": 699, "train": 1154, "validated": 2213 }, "reportedSentences": 2, "size": 208200104, "checksum": "831f6c489904e2e6d985bf30dbe81b65f324a59dbbfeed654963f48e98449a1a", "avgDurationSecs": 5.266, "validDurationSecs": 11652.821, "totalHrs": 8.21, "validHrs": 3.23 }, "lt": { "clips": 3370, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.48, "": 0.12, "thirties": 0.35, "fifties": 0.04, "sixties": 0.02 }, "gender": { "female": 0.47, "": 0.53 } }, "users": 30, "duration": 17557532, "buckets": { "dev": 326, "invalidated": 147, "other": 1473, "reported": 41, "test": 502, "train": 924, "validated": 1755 }, "reportedSentences": 42, "size": 135303346, "checksum": "9a437a259b5de086a062e891f3445368a7db79eff5ec184d84f7d79ef38f95a7", 
"avgDurationSecs": 5.21, "validDurationSecs": 9143.463, "totalHrs": 4.87, "validHrs": 2.53 }, "hi": { "clips": 618, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.63, "fourties": 0.05, "": 0.27, "thirties": 0.04, "teens": 0.01 }, "gender": { "female": 0.63, "": 0.37 } }, "users": 31, "duration": 2899824, "buckets": { "dev": 139, "invalidated": 63, "other": 130, "reported": 4, "test": 129, "train": 157, "validated": 425 }, "reportedSentences": 5, "size": 21424043, "checksum": "91b14aaa3e5fe004c23dad7221d962a3a8b42cfa599af575a1df2d91228a740d", "avgDurationSecs": 4.692, "validDurationSecs": 1994.216, "totalHrs": 0.8, "validHrs": 0.55 } }, "totalDuration": 33340350739, "totalValidDurationSecs": 26378513, "totalHrs": 9261, "totalValidHrs": 7327, "totalClips": 7023545 } ================================================ FILE: datasets/scripted-speech/cv-corpus-6.0-singleword.json ================================================ { "date": "2020-12-11", "locales": { "es": { "clips": 70038, "splits": { "accent": { "surpeninsular": 0.01, "": 0.8, "americacentral": 0.02, "centrosurpeninsular": 0.02, "mexicano": 0.05, "caribe": 0.02, "nortepeninsular": 0.02, "andino": 0.04, "rioplatense": 0.02, "chileno": 0.01, "canario": 0, "filipinas": 0 }, "age": { "sixties": 0.01, "": 0.74, "thirties": 0.05, "fifties": 0.02, "fourties": 0.04, "twenties": 0.11, "teens": 0.04, "seventies": 0, "nineties": 0, "eighties": 0 }, "gender": { "": 0.91, "female": 0.09 } }, "users": 9052, "duration": 202516646, "reportedSentences": 1100, "buckets": { "dev": 4, "invalidated": 6542, "other": 40513, "reported": 1109, "test": 4, "train": 6, "validated": 22983 }, "avgDurationSecs": 2.892, "validDurationSecs": 66455.925, "totalHrs": 56.25, "validHrs": 18.45 }, "pl": { "clips": 1182, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.25, "thirties": 0.17, "fourties": 0.02, "teens": 0.08, "": 0.47, "sixties": 0.02 }, "gender": { "": 0.79, "female": 0.21 } }, "users": 112, "duration": 2713140, 
"reportedSentences": 431, "buckets": { "dev": 4, "invalidated": 37, "other": 3, "reported": 431, "test": 4, "train": 6, "validated": 1142 }, "avgDurationSecs": 2.295, "validDurationSecs": 2621.325, "totalHrs": 0.75, "validHrs": 0.72 }, "ca": { "clips": 4180, "splits": { "accent": { "valencian": 0.08, "": 0.51, "central": 0.35, "northern": 0.01, "northwestern": 0.03, "balearic": 0.02 }, "age": { "fifties": 0.08, "": 0.52, "sixties": 0.04, "thirties": 0.12, "fourties": 0.1, "twenties": 0.1, "teens": 0.04, "seventies": 0.01 }, "gender": { "": 0.92, "female": 0.08 } }, "users": 387, "duration": 10707094, "reportedSentences": 1187, "buckets": { "dev": 4, "invalidated": 217, "other": 295, "reported": 1189, "test": 4, "train": 6, "validated": 3668 }, "avgDurationSecs": 2.562, "validDurationSecs": 9395.603, "totalHrs": 2.97, "validHrs": 2.6 }, "tr": { "clips": 537, "splits": { "accent": { "": 1 }, "age": { "": 0.5, "fourties": 0.12, "twenties": 0.26, "teens": 0.03, "thirties": 0.1 }, "gender": { "": 0.8, "female": 0.2 } }, "users": 60, "duration": 1278637, "reportedSentences": 73, "buckets": { "dev": 4, "invalidated": 46, "other": 44, "reported": 72, "test": 4, "train": 6, "validated": 447 }, "avgDurationSecs": 2.381, "validDurationSecs": 1064.34, "totalHrs": 0.35, "validHrs": 0.29 }, "ru": { "clips": 2195, "splits": { "accent": { "": 1 }, "age": { "": 0.46, "teens": 0.12, "thirties": 0.13, "twenties": 0.23, "fourties": 0.05, "fifties": 0.01 }, "gender": { "": 0.79, "female": 0.21 } }, "users": 214, "duration": 4977118, "reportedSentences": 137, "buckets": { "dev": 4, "invalidated": 104, "other": 52, "reported": 143, "test": 4, "train": 6, "validated": 2039 }, "avgDurationSecs": 2.267, "validDurationSecs": 4623.391, "totalHrs": 1.38, "validHrs": 1.28 }, "pt": { "clips": 1307, "splits": { "accent": { "": 1 }, "age": { "": 0.4, "fourties": 0.11, "twenties": 0.29, "thirties": 0.16, "fifties": 0.03, "sixties": 0, "teens": 0.02 }, "gender": { "": 0.75, "female": 0.25 } }, 
"users": 140, "duration": 3449335, "reportedSentences": 1694, "buckets": { "dev": 4, "invalidated": 87, "other": 40, "reported": 1698, "test": 4, "train": 6, "validated": 1180 }, "avgDurationSecs": 2.639, "validDurationSecs": 3114.166, "totalHrs": 0.95, "validHrs": 0.86 }, "nl": { "clips": 644, "splits": { "accent": { "suriname": 0.02, "netherlands": 0.39, "": 0.46, "belgium": 0.13 }, "age": { "": 0.5, "twenties": 0.32, "sixties": 0, "thirties": 0.04, "fifties": 0.04, "fourties": 0.1 }, "gender": { "": 0.74, "female": 0.26 } }, "users": 67, "duration": 1531757, "reportedSentences": 202, "buckets": { "dev": 4, "invalidated": 21, "other": 7, "reported": 201, "test": 4, "train": 6, "validated": 616 }, "avgDurationSecs": 2.379, "validDurationSecs": 1465.159, "totalHrs": 0.42, "validHrs": 0.4 }, "de": { "clips": 15090, "splits": { "accent": { "germany": 0.26, "austria": 0.02, "": 0.7, "france": 0, "italy": 0, "switzerland": 0.01, "united_states": 0, "russia": 0, "united_kingdom": 0, "lithuania": 0 }, "age": { "fourties": 0.04, "fifties": 0.05, "twenties": 0.12, "thirties": 0.07, "": 0.63, "seventies": 0.01, "teens": 0.06, "sixties": 0.02, "eighties": 0 }, "gender": { "": 0.89, "female": 0.11 } }, "users": 1618, "duration": 39418281, "reportedSentences": 3698, "buckets": { "dev": 4, "invalidated": 1386, "other": 40, "reported": 3708, "test": 4, "train": 7, "validated": 13664 }, "avgDurationSecs": 2.612, "validDurationSecs": 35693.267, "totalHrs": 10.94, "validHrs": 9.91 }, "ta": { "clips": 341, "splits": { "accent": { "": 1 }, "age": { "": 0.45, "fifties": 0.04, "fourties": 0.04, "twenties": 0.17, "thirties": 0.26, "teens": 0.03 }, "gender": { "": 0.87, "female": 0.13 } }, "users": 39, "duration": 992550, "reportedSentences": 257, "buckets": { "dev": 4, "invalidated": 39, "other": 50, "reported": 257, "test": 4, "train": 6, "validated": 252 }, "avgDurationSecs": 2.911, "validDurationSecs": 733.497, "totalHrs": 0.27, "validHrs": 0.2 }, "ar": { "clips": 564, "splits": { 
"accent": { "": 1 }, "age": { "": 0.51, "thirties": 0.11, "twenties": 0.28, "teens": 0.05, "fifties": 0.02, "fourties": 0.02 }, "gender": { "": 0.79, "female": 0.21 } }, "users": 67, "duration": 1530140, "reportedSentences": 1312, "buckets": { "dev": 4, "invalidated": 46, "other": 33, "reported": 1317, "test": 4, "train": 6, "validated": 485 }, "avgDurationSecs": 2.713, "validDurationSecs": 1315.812, "totalHrs": 0.42, "validHrs": 0.36 }, "en": { "clips": 49370, "splits": { "accent": { "": 0.76, "us": 0.12, "england": 0.04, "indian": 0.04, "australia": 0.01, "canada": 0.02, "newzealand": 0, "scotland": 0, "hongkong": 0.01, "ireland": 0, "singapore": 0, "philippines": 0, "malaysia": 0, "african": 0, "wales": 0 }, "age": { "": 0.61, "twenties": 0.18, "teens": 0.05, "fourties": 0.04, "thirties": 0.07, "sixties": 0.01, "fifties": 0.03, "seventies": 0.01, "nineties": 0, "eighties": 0 }, "gender": { "": 0.86, "female": 0.14 } }, "users": 5876, "duration": 132611224, "reportedSentences": 1698, "buckets": { "dev": 4, "invalidated": 4071, "other": 12571, "reported": 1744, "test": 4, "train": 6, "validated": 32728 }, "avgDurationSecs": 2.686, "validDurationSecs": 87909.665, "totalHrs": 36.83, "validHrs": 24.41 }, "fr": { "clips": 22382, "splits": { "accent": { "france": 0.23, "": 0.74, "germany": 0, "canada": 0.01, "united_kingdom": 0, "belgium": 0.01, "switzerland": 0.01, "reunion": 0, "senegal": 0, "martinique": 0, "congo_brazzaville": 0, "cameroon": 0, "united_states": 0, "madagascar": 0, "romania": 0, "benin": 0, "cote_d_ivoire": 0, "morocco": 0 }, "age": { "sixties": 0.01, "": 0.7, "twenties": 0.1, "thirties": 0.06, "fourties": 0.05, "teens": 0.05, "fifties": 0.03, "seventies": 0 }, "gender": { "": 0.91, "female": 0.09 } }, "users": 2560, "duration": 59131896, "reportedSentences": 3155, "buckets": { "dev": 4, "invalidated": 2345, "other": 20, "reported": 3181, "test": 4, "train": 6, "validated": 20017 }, "avgDurationSecs": 2.642, "validDurationSecs": 52883.708, 
"totalHrs": 16.42, "validHrs": 14.68 }, "cy": { "clips": 553, "splits": { "accent": { "united_kingdom": 0.45, "": 0.55 }, "age": { "twenties": 0.17, "thirties": 0.07, "": 0.53, "teens": 0.04, "fifties": 0.09, "fourties": 0.06, "sixties": 0.02, "seventies": 0.02 }, "gender": { "": 0.94, "female": 0.06 } }, "users": 64, "duration": 1512525, "reportedSentences": 123, "buckets": { "dev": 3, "invalidated": 15, "other": 62, "reported": 122, "test": 3, "train": 4, "validated": 476 }, "avgDurationSecs": 2.735, "validDurationSecs": 1301.92, "totalHrs": 0.42, "validHrs": 0.36 }, "eo": { "clips": 965, "splits": { "accent": { "": 0.67, "internacia": 0.33 }, "age": { "": 0.45, "fourties": 0.03, "twenties": 0.28, "teens": 0.06, "eighties": 0.01, "sixties": 0.02, "thirties": 0.14, "fifties": 0.01 }, "gender": { "": 0.77, "female": 0.23 } }, "users": 91, "duration": 2260824, "reportedSentences": 404, "buckets": { "dev": 3, "invalidated": 41, "other": 23, "reported": 404, "test": 3, "train": 6, "validated": 901 }, "avgDurationSecs": 2.343, "validDurationSecs": 2110.883, "totalHrs": 0.62, "validHrs": 0.58 }, "ja": { "clips": 644, "splits": { "accent": { "": 1 }, "age": { "teens": 0.05, "twenties": 0.5, "": 0.32, "fifties": 0.02, "fourties": 0.04, "thirties": 0.06 }, "gender": { "": 0.68, "female": 0.32 } }, "users": 74, "duration": 1454877, "reportedSentences": 44, "buckets": { "dev": 3, "invalidated": 34, "other": 18, "reported": 43, "test": 3, "train": 6, "validated": 592 }, "avgDurationSecs": 2.259, "validDurationSecs": 1337.402, "totalHrs": 0.4, "validHrs": 0.37 }, "id": { "clips": 171, "splits": { "accent": { "": 1 }, "age": { "": 0.37, "twenties": 0.63 }, "gender": { "": 0.5, "female": 0.5 } }, "users": 20, "duration": 375907, "reportedSentences": 201, "buckets": { "dev": 3, "invalidated": 11, "other": 17, "reported": 200, "test": 3, "train": 6, "validated": 143 }, "avgDurationSecs": 2.198, "validDurationSecs": 314.355, "totalHrs": 0.1, "validHrs": 0.08 }, "eu": { "clips": 
635, "splits": { "accent": { "": 0.8, "mendebalekoa": 0.12, "erdialdekoa_nafarra": 0.08 }, "age": { "": 0.74, "teens": 0.01, "fourties": 0.16, "fifties": 0.04, "thirties": 0.04, "twenties": 0.02 }, "gender": { "": 0.99, "female": 0.01 } }, "users": 80, "duration": 1577809, "reportedSentences": 39, "buckets": { "dev": 2, "invalidated": 13, "other": 406, "reported": 38, "test": 3, "train": 7, "validated": 216 }, "avgDurationSecs": 2.485, "validDurationSecs": 536.704, "totalHrs": 0.43, "validHrs": 0.14 }, "tt": { "clips": 55, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.44, "": 0.55, "twenties": 0.02 }, "gender": { "": 1 } }, "users": 8, "duration": 154248, "reportedSentences": 3, "buckets": { "dev": 4, "invalidated": 2, "other": 18, "reported": 2, "test": 4, "train": 6, "validated": 35 }, "avgDurationSecs": 2.805, "validDurationSecs": 98.158, "totalHrs": 0.04, "validHrs": 0.02 }, "kab": { "clips": 513, "splits": { "accent": { "": 0.98, "other": 0.02 }, "age": { "fourties": 0.08, "thirties": 0.2, "": 0.44, "fifties": 0.05, "twenties": 0.19, "sixties": 0.01, "teens": 0.02 }, "gender": { "": 0.84, "female": 0.16 } }, "users": 53, "duration": 1458225, "reportedSentences": 3088, "buckets": { "dev": 3, "invalidated": 10, "other": 344, "reported": 3092, "test": 3, "train": 6, "validated": 159 }, "avgDurationSecs": 2.843, "validDurationSecs": 451.964, "totalHrs": 0.4, "validHrs": 0.12 }, "br": { "clips": 42, "splits": { "accent": { "gwenedeg": 0.55, "": 0.38, "kerneveg": 0.07 }, "age": { "fourties": 0.05, "twenties": 0.26, "": 0.21, "fifties": 0.4, "teens": 0.07 }, "gender": { "": 0.74, "female": 0.26 } }, "users": 7, "duration": 109416, "reportedSentences": 167, "buckets": { "dev": 0, "invalidated": 2, "other": 35, "reported": 166, "test": 1, "train": 3, "validated": 5 }, "avgDurationSecs": 2.605, "validDurationSecs": 13.026, "totalHrs": 0.03, "validHrs": 0 }, "zh-CN": { "clips": 319, "splits": { "accent": { "140000": 0, "230000": 0.08, "320000": 0.05, "340000": 
0.04, "350000": 0.04, "410000": 0.04, "420000": 0.04, "450000": 0.04, "500000": 0.02, "620000": 0.04, "": 0.63 }, "age": { "thirties": 0.13, "": 0.51, "twenties": 0.21, "teens": 0.09, "fourties": 0.05 }, "gender": { "": 0.88, "female": 0.12 } }, "users": 39, "duration": 778185, "reportedSentences": 279, "buckets": { "dev": 3, "invalidated": 24, "other": 2, "reported": 279, "test": 3, "train": 6, "validated": 293 }, "avgDurationSecs": 2.439, "validDurationSecs": 714.759, "totalHrs": 0.21, "validHrs": 0.19 }, "fy-NL": { "clips": 216, "splits": { "accent": { "": 1 }, "age": { "": 0.84, "sixties": 0.05, "fourties": 0.11 }, "gender": { "": 1 } }, "users": 22, "duration": 650279, "reportedSentences": 281, "buckets": { "dev": 3, "invalidated": 13, "other": 126, "reported": 281, "test": 3, "train": 6, "validated": 77 }, "avgDurationSecs": 3.011, "validDurationSecs": 231.812, "totalHrs": 0.18, "validHrs": 0.06 }, "zh-TW": { "clips": 27, "splits": { "accent": { "": 1 }, "age": { "": 0.37, "teens": 0.44, "twenties": 0.19 }, "gender": { "": 0.81, "female": 0.19 } }, "users": 3, "duration": 61796, "reportedSentences": 39, "buckets": { "dev": 0, "invalidated": 2, "other": 14, "reported": 38, "test": 1, "train": 8, "validated": 11 }, "avgDurationSecs": 2.289, "validDurationSecs": 25.176, "totalHrs": 0.01, "validHrs": 0 }, "cs": { "clips": 542, "splits": { "accent": { "": 1 }, "age": { "teens": 0.04, "": 0.7, "thirties": 0.15, "twenties": 0.04, "fourties": 0.04, "fifties": 0.02 }, "gender": { "": 0.94, "female": 0.06 } }, "users": 54, "duration": 1358979, "reportedSentences": 582, "buckets": { "dev": 3, "invalidated": 14, "other": 0, "reported": 585, "test": 3, "train": 6, "validated": 528 }, "avgDurationSecs": 2.507, "validDurationSecs": 1323.876, "totalHrs": 0.37, "validHrs": 0.36 }, "zh-HK": { "clips": 503, "splits": { "accent": { "": 1 }, "age": { "": 0.74, "twenties": 0.03, "fourties": 0.09, "thirties": 0.14, "teens": 0.01 }, "gender": { "": 0.97, "female": 0.03 } }, "users": 
53, "duration": 1335754, "reportedSentences": 560, "buckets": { "dev": 3, "invalidated": 21, "other": 1, "reported": 571, "test": 3, "train": 6, "validated": 481 }, "avgDurationSecs": 2.656, "validDurationSecs": 1277.331, "totalHrs": 0.37, "validHrs": 0.35 }, "lg": { "clips": 56, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.39, "": 0.39, "thirties": 0.21 }, "gender": { "female": 0.18, "": 0.82 } }, "users": 7, "duration": 163592, "reportedSentences": 2, "buckets": { "dev": 0, "invalidated": 3, "other": 39, "reported": 1, "test": 1, "train": 9, "validated": 14 }, "avgDurationSecs": 2.921, "validDurationSecs": 40.898, "totalHrs": 0.04, "validHrs": 0.01 }, "ka": { "clips": 29, "splits": { "accent": { "": 1 }, "age": { "": 0.59, "twenties": 0.41 }, "gender": { "": 0.59, "female": 0.41 } }, "users": 3, "duration": 74256, "reportedSentences": 3, "buckets": { "dev": 0, "invalidated": 0, "other": 12, "reported": 2, "test": 0, "train": 12, "validated": 17 }, "avgDurationSecs": 2.561, "validDurationSecs": 43.529, "totalHrs": 0.02, "validHrs": 0.01 }, "sv-SE": { "clips": 29, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.48, "": 0.52 }, "gender": { "": 1 } }, "users": 3, "duration": 73608, "reportedSentences": 152, "buckets": { "dev": 0, "invalidated": 0, "other": 6, "reported": 151, "test": 1, "train": 11, "validated": 23 }, "avgDurationSecs": 2.538, "validDurationSecs": 58.379, "totalHrs": 0.02, "validHrs": 0.01 }, "cv": { "clips": 67, "splits": { "accent": { "": 1 }, "age": { "teens": 0.31, "twenties": 0.18, "": 0.33, "fourties": 0.18 }, "gender": { "female": 0.24, "": 0.76 } }, "users": 8, "duration": 186589, "reportedSentences": 75, "buckets": { "dev": 1, "invalidated": 2, "other": 25, "reported": 74, "test": 3, "train": 8, "validated": 40 }, "avgDurationSecs": 2.785, "validDurationSecs": 111.396, "totalHrs": 0.05, "validHrs": 0.03 }, "ky": { "clips": 5, "splits": { "accent": { "": 1 }, "age": { "": 1 }, "gender": { "": 1 } }, "users": 1, "duration": 
15744, "reportedSentences": 1, "buckets": { "dev": 0, "invalidated": 0, "other": 0, "reported": 0, "test": 0, "train": 5, "validated": 5 }, "avgDurationSecs": 3.149, "validDurationSecs": 15.744, "totalHrs": 0, "validHrs": 0 }, "rw": { "clips": 16, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.75, "": 0.25 }, "gender": { "female": 0.75, "": 0.25 } }, "users": 2, "duration": 41736, "reportedSentences": 619, "buckets": { "dev": 0, "invalidated": 2, "other": 3, "reported": 618, "test": 1, "train": 9, "validated": 11 }, "avgDurationSecs": 2.609, "validDurationSecs": 28.694, "totalHrs": 0.01, "validHrs": 0 } }, "overall": { "size": 3562672677, "checksum": "2c3ae4af727d07da13a5437c6cb7a0ada84e4985be125c0ab1b37c4ec1b039a7" }, "totalDuration": 474502167, "totalValidDurationSecs": 277311, "totalHrs": 131, "totalValidHrs": 77, "totalClips": 173217 } ================================================ FILE: datasets/scripted-speech/cv-corpus-6.1-2020-12-11.json ================================================ { "date": "2020-12-11", "locales": { "en": { "reportedSentences": 1762, "size": 60613063630, "checksum": "0f8fdfc4fe715738be94ee49c4fb63d5f1608d2e6a43a2bed80f6cb871171c36", "buckets": { "dev": 16164, "invalidated": 189562, "other": 169895, "reported": 1810, "test": 16164, "train": 564337, "validated": 1224864 }, "clips": 1584321, "splits": { "accent": { "": 0.51, "canada": 0.03, "england": 0.08, "us": 0.24, "indian": 0.05, "australia": 0.03, "malaysia": 0, "newzealand": 0.01, "african": 0.01, "ireland": 0.01, "philippines": 0, "singapore": 0, "scotland": 0.02, "hongkong": 0, "bermuda": 0, "southatlandtic": 0, "wales": 0, "other": 0.01 }, "age": { "": 0.37, "twenties": 0.24, "sixties": 0.04, "thirties": 0.14, "teens": 0.06, "seventies": 0.01, "fourties": 0.1, "fifties": 0.04, "eighties": 0, "nineties": 0 }, "gender": { "": 0.36, "male": 0.47, "female": 0.15, "other": 0.02 } }, "users": 66173, "duration": 7854934999, "avgDurationSecs": 4.958, "validDurationSecs": 
6072776.352, "totalHrs": 2181.92, "validHrs": 1686.88 }, "fa": { "reportedSentences": 1519, "size": 8884585819, "checksum": "5454efe3b2f6d06d51e7177469b7bef9a962adbf7611e3cd21771451112abe6d", "buckets": { "dev": 5213, "invalidated": 11698, "other": 22510, "reported": 1524, "test": 5213, "train": 7593, "validated": 251659 }, "clips": 285867, "splits": { "accent": { "": 1 }, "age": { "": 0.21, "twenties": 0.33, "thirties": 0.39, "fifties": 0.02, "fourties": 0.03, "teens": 0.01, "sixties": 0 }, "gender": { "": 0.18, "male": 0.75, "female": 0.07, "other": 0 } }, "users": 3655, "duration": 1155788252, "avgDurationSecs": 4.043, "validDurationSecs": 1017481.961, "totalHrs": 321.05, "validHrs": 282.63 }, "fr": { "reportedSentences": 3193, "size": 19130141984, "checksum": "719ef964b55d830a095a602aff311db39b77239e9d600b6af646ec2ed57e5e45", "buckets": { "dev": 15763, "invalidated": 40351, "other": 3222, "reported": 3219, "test": 15763, "train": 298982, "validated": 461004 }, "clips": 504577, "splits": { "accent": { "": 0.32, "france": 0.62, "germany": 0, "belgium": 0.02, "switzerland": 0.01, "guadeloupe": 0, "reunion": 0, "monaco": 0, "tunisia": 0, "canada": 0.02, "other": 0, "mayotte": 0, "algeria": 0, "netherlands": 0, "senegal": 0, "martinique": 0, "portugal": 0, "united_states": 0, "cote_d_ivoire": 0, "st_pierre_et_miquelon": 0, "french_guiana": 0, "benin": 0, "italy": 0, "united_kingdom": 0, "madagascar": 0, "cameroon": 0, "new_caledonia": 0, "haiti": 0, "morocco": 0, "romania": 0, "congo_kinshasa": 0, "congo_brazzaville": 0, "gabon": 0 }, "age": { "twenties": 0.2, "thirties": 0.19, "": 0.3, "teens": 0.04, "fourties": 0.12, "fifties": 0.11, "sixties": 0.04, "seventies": 0.01, "eighties": 0 }, "gender": { "male": 0.65, "": 0.24, "female": 0.1, "other": 0.01 } }, "users": 12953, "duration": 2458421254, "avgDurationSecs": 4.872, "validDurationSecs": 2246123.053, "totalHrs": 682.89, "validHrs": 623.92 }, "es": { "reportedSentences": 1101, "size": 16188844718, "checksum": 
"276ca393783cd8b208d56b5032b87c13a40fcadde5b3925596e67c15578d0235", "buckets": { "dev": 15089, "invalidated": 40640, "other": 144791, "reported": 1110, "test": 15089, "train": 161813, "validated": 236314 }, "clips": 421745, "splits": { "accent": { "nortepeninsular": 0.1, "": 0.49, "rioplatense": 0.04, "centrosurpeninsular": 0.03, "americacentral": 0.02, "surpeninsular": 0.15, "canario": 0, "andino": 0.05, "mexicano": 0.06, "chileno": 0.02, "caribe": 0.03, "filipinas": 0 }, "age": { "thirties": 0.1, "": 0.32, "fifties": 0.09, "twenties": 0.26, "teens": 0.03, "fourties": 0.06, "sixties": 0.13, "eighties": 0, "seventies": 0, "nineties": 0 }, "gender": { "male": 0.5, "": 0.32, "other": 0, "female": 0.18 } }, "users": 19484, "duration": 2085846138, "avgDurationSecs": 4.946, "validDurationSecs": 1168750.416, "totalHrs": 579.4, "validHrs": 324.65 }, "sl": { "reportedSentences": 14, "size": 222751292, "checksum": "184cfbfe876a1f1c6317e4e34680c82a940db833afca78203c2929db1768a353", "buckets": { "dev": 556, "invalidated": 92, "other": 2502, "reported": 13, "test": 881, "train": 2038, "validated": 4669 }, "clips": 7263, "splits": { "accent": { "other": 0.04, "": 0.96 }, "age": { "twenties": 0.62, "teens": 0.11, "": 0.06, "sixties": 0.03, "fifties": 0.11, "fourties": 0.01, "thirties": 0.06 }, "gender": { "female": 0.18, "male": 0.76, "": 0.06, "other": 0 } }, "users": 82, "duration": 28545274, "avgDurationSecs": 3.93, "validDurationSecs": 18350.253, "totalHrs": 7.92, "validHrs": 5.09 }, "kab": { "reportedSentences": 3183, "size": 17171606918, "checksum": "d2089107d4f3a84856c457a436a47a883b872022f2085cfad0501469be91fd95", "buckets": { "dev": 14622, "invalidated": 18134, "other": 88021, "reported": 3202, "test": 14622, "train": 120530, "validated": 573718 }, "clips": 679873, "splits": { "accent": { "": 0.9, "other": 0.1 }, "age": { "fourties": 0.1, "thirties": 0.3, "": 0.26, "fifties": 0.19, "twenties": 0.12, "eighties": 0, "teens": 0, "sixties": 0.03, "seventies": 0 }, "gender": 
{ "male": 0.55, "": 0.24, "female": 0.21, "other": 0 } }, "users": 1309, "duration": 2241648880, "avgDurationSecs": 3.297, "validDurationSecs": 1891639.045, "totalHrs": 622.68, "validHrs": 525.45 }, "cy": { "reportedSentences": 127, "size": 3434474658, "checksum": "269da0cbbb2887d1903c0e17bbb71ea9bcd83506ba928fe75c660cb3e52f9a67", "buckets": { "dev": 4776, "invalidated": 3648, "other": 17919, "reported": 126, "test": 4820, "train": 6839, "validated": 72984 }, "clips": 94551, "splits": { "accent": { "united_kingdom": 0.52, "": 0.47, "other": 0.01 }, "age": { "fourties": 0.17, "twenties": 0.14, "sixties": 0.07, "fifties": 0.09, "": 0.41, "thirties": 0.1, "seventies": 0.01, "eighties": 0, "teens": 0.02 }, "gender": { "male": 0.34, "female": 0.25, "": 0.4, "other": 0.01 } }, "users": 1382, "duration": 447561765, "avgDurationSecs": 4.734, "validDurationSecs": 345473.32, "totalHrs": 124.32, "validHrs": 95.96 }, "ca": { "reportedSentences": 1192, "size": 20743110341, "checksum": "a27bec66c151ddb21c1736781b3bca972047cc20c02488bad94d2311c40bc6da", "buckets": { "dev": 15724, "invalidated": 18846, "other": 64446, "reported": 1195, "test": 15724, "train": 285584, "validated": 416701 }, "clips": 499993, "splits": { "accent": { "valencian": 0.05, "central": 0.67, "": 0.22, "other": 0, "balearic": 0.01, "northwestern": 0.03, "northern": 0.01 }, "age": { "thirties": 0.13, "fifties": 0.23, "fourties": 0.16, "twenties": 0.08, "": 0.17, "sixties": 0.22, "teens": 0.01, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "male": 0.58, "": 0.18, "female": 0.24, "other": 0 } }, "users": 5376, "duration": 2693148330, "avgDurationSecs": 5.386, "validDurationSecs": 2244506.628, "totalHrs": 748.09, "validHrs": 623.47 }, "de": { "reportedSentences": 3748, "size": 23283812097, "checksum": "733e6e367da4b9588b4bb175ac45c6c0ec545e41df5494a7ee4a7e4ff3141ef7", "buckets": { "dev": 15588, "invalidated": 32789, "other": 10095, "reported": 3768, "test": 15588, "train": 246525, "validated": 
565186 }, "clips": 608070, "splits": { "accent": { "russia": 0, "germany": 0.65, "": 0.31, "austria": 0.03, "liechtenstein": 0, "switzerland": 0.01, "france": 0, "other": 0, "poland": 0, "united_kingdom": 0, "hungary": 0, "netherlands": 0, "namibia": 0, "slovakia": 0, "united_states": 0, "italy": 0, "czechia": 0, "brazil": 0, "canada": 0, "turkey": 0, "slovenia": 0, "luxembourg": 0, "belgium": 0, "denmark": 0, "greece": 0, "lithuania": 0 }, "age": { "twenties": 0.22, "fourties": 0.2, "": 0.23, "thirties": 0.16, "teens": 0.03, "sixties": 0.03, "fifties": 0.11, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "male": 0.67, "": 0.23, "female": 0.09, "other": 0.01 } }, "users": 12659, "duration": 3010229669, "avgDurationSecs": 4.95, "validDurationSecs": 2797933.899, "totalHrs": 836.17, "validHrs": 777.2 }, "tt": { "reportedSentences": 3, "size": 777153207, "checksum": "89c8d7a49584de720f1790df39e6f07996e2eecb07f6273f4ba2668e9fe4ad46", "buckets": { "dev": 2127, "invalidated": 287, "other": 1798, "reported": 2, "test": 4485, "train": 11211, "validated": 25781 }, "clips": 27866, "splits": { "accent": { "": 1 }, "age": { "": 0.18, "thirties": 0.75, "twenties": 0.05, "sixties": 0, "fifties": 0.01, "teens": 0, "fourties": 0, "seventies": 0.01 }, "gender": { "": 0.18, "male": 0.8, "female": 0.02 } }, "users": 185, "duration": 103646870, "avgDurationSecs": 3.719, "validDurationSecs": 95891.766, "totalHrs": 28.79, "validHrs": 26.63 }, "ta": { "reportedSentences": 257, "size": 679766097, "checksum": "78560d9d608a63ee75c3fdeb7f96f33cf0d85855ba6294b13e945de066eb46d8", "buckets": { "dev": 1779, "invalidated": 594, "other": 7428, "reported": 257, "test": 1781, "train": 2009, "validated": 12652 }, "clips": 20674, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.45, "thirties": 0.41, "": 0.1, "fourties": 0.01, "seventies": 0, "fifties": 0.01, "teens": 0.01, "sixties": 0 }, "gender": { "male": 0.72, "": 0.11, "other": 0.01, "female": 0.17 } }, "users": 266, 
"duration": 88153080, "avgDurationSecs": 4.264, "validDurationSecs": 53947.604, "totalHrs": 24.48, "validHrs": 14.98 }, "ru": { "reportedSentences": 137, "size": 3655676916, "checksum": "dcbb460e58d4afc78047c3801c9eb56d940b388eb350ee3da3de5bfe5a74a025", "buckets": { "dev": 7963, "invalidated": 3056, "other": 10247, "reported": 143, "test": 8007, "train": 15481, "validated": 74256 }, "clips": 87559, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.46, "teens": 0.08, "": 0.11, "fourties": 0.16, "thirties": 0.13, "fifties": 0.05, "sixties": 0, "seventies": 0 }, "gender": { "male": 0.71, "": 0.11, "other": 0, "female": 0.18 } }, "users": 1412, "duration": 471429144, "avgDurationSecs": 5.384, "validDurationSecs": 399804.047, "totalHrs": 130.95, "validHrs": 111.05 }, "nl": { "reportedSentences": 215, "size": 1741827548, "checksum": "048f823408e3bbd16e63111d1b4caecb0102606c440bbdf3e5b6a6bae1e1e3f1", "buckets": { "dev": 4938, "invalidated": 3308, "other": 27, "reported": 215, "test": 5708, "train": 9460, "validated": 52488 }, "clips": 55823, "splits": { "accent": { "": 0.26, "netherlands": 0.61, "belgium": 0.11, "other": 0, "germany": 0, "france": 0, "suriname": 0.01, "curacao": 0 }, "age": { "": 0.29, "twenties": 0.24, "fourties": 0.18, "thirties": 0.14, "teens": 0.04, "fifties": 0.1, "sixties": 0.01 }, "gender": { "": 0.3, "male": 0.58, "female": 0.12, "other": 0 } }, "users": 1012, "duration": 227719441, "avgDurationSecs": 4.079, "validDurationSecs": 214114.935, "totalHrs": 63.25, "validHrs": 59.47 }, "it": { "reportedSentences": 1046, "size": 5585781573, "checksum": "3a75b1631958af1487ee49b13cd27efc951183737ed515832cf714ed20c97808", "buckets": { "dev": 12928, "invalidated": 12189, "other": 14549, "reported": 1052, "test": 12928, "train": 58015, "validated": 102579 }, "clips": 129317, "splits": { "accent": { "": 0.95, "other": 0.05 }, "age": { "thirties": 0.19, "twenties": 0.23, "": 0.31, "fifties": 0.09, "fourties": 0.14, "seventies": 0.01, "sixties": 0.03, 
"teens": 0.02, "eighties": 0, "nineties": 0 }, "gender": { "female": 0.14, "male": 0.54, "": 0.31, "other": 0.01 } }, "users": 5729, "duration": 718680432, "avgDurationSecs": 5.558, "validDurationSecs": 570083.748, "totalHrs": 199.63, "validHrs": 158.35 }, "eu": { "reportedSentences": 39, "size": 3664586106, "checksum": "55b6eaf7ca7c120faa0b60d71c87189b610412334e6b710fe12c2a79489ab06f", "buckets": { "dev": 5172, "invalidated": 5387, "other": 23570, "reported": 38, "test": 5172, "train": 7505, "validated": 63009 }, "clips": 91966, "splits": { "accent": { "mendebalekoa": 0.31, "erdialdekoa_nafarra": 0.3, "": 0.39, "nafarlapurtarra_zuberoatarra": 0.01 }, "age": { "fourties": 0.14, "thirties": 0.08, "fifties": 0.14, "twenties": 0.38, "": 0.22, "teens": 0.03, "sixties": 0.01, "seventies": 0 }, "gender": { "male": 0.5, "female": 0.25, "": 0.23, "other": 0.02 } }, "users": 1028, "duration": 472668999, "avgDurationSecs": 5.14, "validDurationSecs": 323841.43, "totalHrs": 131.29, "validHrs": 89.95 }, "tr": { "reportedSentences": 73, "size": 620848700, "checksum": "b3f266c868b1fe9f76270ba76226b1cdc17f33b3e387e6b44a64d5419f8b9768", "buckets": { "dev": 1647, "invalidated": 1726, "other": 325, "reported": 72, "test": 1647, "train": 1831, "validated": 18685 }, "clips": 20736, "splits": { "accent": { "": 0.96, "other": 0.04 }, "age": { "": 0.23, "thirties": 0.23, "twenties": 0.47, "teens": 0.04, "fourties": 0.03, "fifties": 0.01, "sixties": 0 }, "gender": { "": 0.23, "male": 0.71, "female": 0.06, "other": 0 } }, "users": 678, "duration": 80991855, "avgDurationSecs": 3.906, "validDurationSecs": 72980.942, "totalHrs": 22.49, "validHrs": 20.27 }, "ar": { "reportedSentences": 1797, "size": 1756264615, "checksum": "516b369da8a000c1b98d8f5ee3b90fa12bcc5d5438391fcf01f3d5e78ccdd6fa", "buckets": { "dev": 7517, "invalidated": 6333, "other": 18283, "reported": 1810, "test": 7622, "train": 14227, "validated": 43291 }, "clips": 67907, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.11, 
"": 0.49, "twenties": 0.34, "fourties": 0.01, "teens": 0.05, "fifties": 0 }, "gender": { "female": 0.21, "": 0.48, "male": 0.31, "other": 0 } }, "users": 672, "duration": 278490582, "avgDurationSecs": 4.101, "validDurationSecs": 177538.925, "totalHrs": 77.35, "validHrs": 49.31 }, "zh-TW": { "reportedSentences": 39, "size": 2182836295, "checksum": "67fadf561f8237690d4a4a1d63a9b3ac271b5d05438dc745b7e04282d909460f", "buckets": { "dev": 2895, "invalidated": 3584, "other": 22477, "reported": 38, "test": 2895, "train": 3507, "validated": 61232 }, "clips": 87293, "splits": { "accent": { "": 0.51, "taipei_city": 0.16, "other": 0.09, "kaohsiung_city": 0.03, "new_taipei_city": 0.05, "tainan_city": 0.04, "taichung_city": 0.03, "yilan_county": 0.01, "hong_kong": 0.01, "taoyuan_city": 0.01, "hsinchu_city": 0.02, "chiayi_city": 0, "yunlin_county": 0.02, "nantou_county": 0, "changhua_county": 0.01, "pingtung_county": 0.01, "hsinchu_county": 0, "chiayi_county": 0, "keelung_city": 0, "hualien_county": 0.01, "kinmen_county": 0 }, "age": { "thirties": 0.24, "twenties": 0.36, "teens": 0.04, "": 0.23, "fifties": 0.01, "seventies": 0, "fourties": 0.12, "sixties": 0 }, "gender": { "male": 0.49, "": 0.22, "female": 0.27, "other": 0.03 } }, "users": 1444, "duration": 282305146, "avgDurationSecs": 3.234, "validDurationSecs": 198023.996, "totalHrs": 78.41, "validHrs": 55 }, "br": { "reportedSentences": 167, "size": 465276982, "checksum": "d323d71337055b794c8fe3dcdf5a0dc03d6bf8f7c8c19f96369884410aef4606", "buckets": { "dev": 1997, "invalidated": 623, "other": 10912, "reported": 166, "test": 2087, "train": 2780, "validated": 8560 }, "clips": 20095, "splits": { "accent": { "other": 0, "": 0.74, "kerneveg": 0.2, "gwenedeg": 0.05, "leoneg": 0, "tregerieg": 0 }, "age": { "twenties": 0.17, "": 0.37, "fifties": 0.06, "fourties": 0.08, "thirties": 0.08, "sixties": 0.2, "seventies": 0.03, "teens": 0.01 }, "gender": { "male": 0.6, "": 0.38, "female": 0.02 } }, "users": 157, "duration": 60895123, 
"avgDurationSecs": 3.03, "validDurationSecs": 25939.898, "totalHrs": 16.91, "validHrs": 7.2 }, "pt": { "reportedSentences": 1697, "size": 1704252567, "checksum": "6700de499f728e0e3f3ed4d7005e5b7db27ba2ddc872b21b0b404c3b4859d84b", "buckets": { "dev": 4592, "invalidated": 1740, "other": 8390, "reported": 1701, "test": 4641, "train": 6514, "validated": 41584 }, "clips": 51714, "splits": { "accent": { "": 1 }, "age": { "": 0.14, "twenties": 0.35, "teens": 0.03, "thirties": 0.32, "fourties": 0.13, "sixties": 0, "fifties": 0.04, "seventies": 0 }, "gender": { "": 0.14, "male": 0.81, "female": 0.03, "other": 0.02 } }, "users": 1120, "duration": 226986379, "avgDurationSecs": 4.389, "validDurationSecs": 182523.138, "totalHrs": 63.05, "validHrs": 50.7 }, "eo": { "reportedSentences": 404, "size": 2883560869, "checksum": "c19900010aee0f9eb39416406598509b1cdba136a16318e746b1a64f97d7809c", "buckets": { "dev": 8987, "invalidated": 4736, "other": 2946, "reported": 404, "test": 8969, "train": 19587, "validated": 58094 }, "clips": 65776, "splits": { "accent": { "": 0.29, "internacia": 0.7, "other": 0 }, "age": { "twenties": 0.32, "thirties": 0.11, "": 0.12, "fourties": 0.31, "fifties": 0.03, "seventies": 0.01, "teens": 0.03, "sixties": 0.02, "eighties": 0.06 }, "gender": { "male": 0.74, "": 0.17, "female": 0.08, "other": 0 } }, "users": 574, "duration": 370020067, "avgDurationSecs": 5.625, "validDurationSecs": 326805.305, "totalHrs": 102.78, "validHrs": 90.77 }, "zh-CN": { "reportedSentences": 280, "size": 2184602350, "checksum": "cd8589cac28541f9f996d1954f14c307954f1146ac44a8eadad8e31ebaf1f15e", "buckets": { "dev": 8743, "invalidated": 5305, "other": 8948, "reported": 280, "test": 8760, "train": 18541, "validated": 36405 }, "clips": 50658, "splits": { "accent": { "110000": 0.04, "120000": 0, "130000": 0.02, "140000": 0.01, "150000": 0, "210000": 0.01, "220000": 0, "230000": 0.04, "310000": 0.05, "320000": 0.05, "330000": 0.03, "340000": 0.02, "350000": 0.01, "360000": 0.02, 
"370000": 0.03, "410000": 0.04, "420000": 0.02, "430000": 0.01, "440000": 0.03, "450000": 0.02, "460000": 0.01, "500000": 0.02, "510000": 0.03, "520000": 0, "530000": 0, "610000": 0.01, "620000": 0, "640000": 0, "650000": 0, "710000": 0, "810000": 0, "": 0.45 }, "age": { "": 0.38, "teens": 0.08, "twenties": 0.38, "thirties": 0.13, "fourties": 0.03, "nineties": 0, "fifties": 0, "sixties": 0 }, "gender": { "": 0.37, "male": 0.53, "female": 0.09, "other": 0.01 } }, "users": 3501, "duration": 281080968, "avgDurationSecs": 5.549, "validDurationSecs": 201996.775, "totalHrs": 78.07, "validHrs": 56.11 }, "id": { "reportedSentences": 201, "size": 475918233, "checksum": "71177fa9d2fac29f48db5feabc294f1d6bbcaa0c326b0d1099be66c0b804b245", "buckets": { "dev": 1835, "invalidated": 470, "other": 6782, "reported": 200, "test": 1844, "train": 2130, "validated": 8696 }, "clips": 15948, "splits": { "accent": { "": 1 }, "age": { "": 0.09, "twenties": 0.42, "thirties": 0.08, "teens": 0.36, "fifties": 0.01, "fourties": 0.04 }, "gender": { "": 0.09, "male": 0.62, "female": 0.18, "other": 0.11 } }, "users": 219, "duration": 61766436, "avgDurationSecs": 3.873, "validDurationSecs": 33679.516, "totalHrs": 17.15, "validHrs": 9.35 }, "ia": { "reportedSentences": 236, "size": 226499645, "checksum": "47a137a805ea8ce01f2cf9277739919a824a9fd13468345dfbd84eddb52c02f1", "buckets": { "dev": 1601, "invalidated": 192, "other": 1095, "reported": 240, "test": 899, "train": 3477, "validated": 5978 }, "clips": 7265, "splits": { "accent": { "": 1 }, "age": { "seventies": 0.24, "fourties": 0.58, "": 0.06, "twenties": 0.07, "thirties": 0.03, "teens": 0, "fifties": 0.01, "sixties": 0 }, "gender": { "male": 0.93, "": 0.06, "female": 0.01 } }, "users": 36, "duration": 29112996, "avgDurationSecs": 4.007, "validDurationSecs": 23955.608, "totalHrs": 8.08, "validHrs": 6.65 }, "lv": { "reportedSentences": 21, "size": 208307691, "checksum": "8a4350ccf24884ee1012032bfd5a87e0de50d780b1f8450d1cb52afe3f69c671", "buckets": 
{ "dev": 2002, "invalidated": 143, "other": 1560, "reported": 20, "test": 1882, "train": 2552, "validated": 6444 }, "clips": 8147, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.48, "fourties": 0.03, "": 0.16, "twenties": 0.29, "teens": 0.03, "fifties": 0 }, "gender": { "male": 0.7, "female": 0.13, "": 0.16 } }, "users": 99, "duration": 27668465, "avgDurationSecs": 3.396, "validDurationSecs": 21884.815, "totalHrs": 7.68, "validHrs": 6.07 }, "ja": { "reportedSentences": 44, "size": 152879796, "checksum": "3614cd0d0abac80794351c78183967c83179fab390d7e19cad97758eb85ae558", "buckets": { "dev": 586, "invalidated": 504, "other": 885, "reported": 43, "test": 632, "train": 722, "validated": 3072 }, "clips": 4461, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.51, "": 0.3, "teens": 0.04, "fifties": 0.02, "thirties": 0.06, "fourties": 0.06, "sixties": 0 }, "gender": { "male": 0.72, "": 0.14, "female": 0.14, "other": 0 } }, "users": 235, "duration": 19732425, "avgDurationSecs": 4.423, "validDurationSecs": 13588.435, "totalHrs": 5.48, "validHrs": 3.77 }, "rw": { "reportedSentences": 619, "size": 42545189583, "checksum": "cf8a07059b3713022d487f9a6b8f465271f3457c525a8b350f829f87b0132b41", "buckets": { "dev": 15032, "invalidated": 206790, "other": 22923, "reported": 618, "test": 15724, "train": 515197, "validated": 832929 }, "clips": 1062642, "splits": { "accent": { "": 1 }, "age": { "": 0.02, "twenties": 0.63, "thirties": 0.11, "teens": 0.24, "fourties": 0 }, "gender": { "": 0.09, "male": 0.6, "female": 0.31, "other": 0 } }, "users": 410, "duration": 5436192485, "avgDurationSecs": 5.116, "validDurationSecs": 4261042.167, "totalHrs": 1510.05, "validHrs": 1183.62 }, "sv-SE": { "reportedSentences": 152, "size": 421434184, "checksum": "dc8634dafacb33be00f06e376f6c479d53f84f4834952593c8903f1080535213", "buckets": { "dev": 2019, "invalidated": 462, "other": 3043, "reported": 151, "test": 2027, "train": 2331, "validated": 12552 }, "clips": 16057, "splits": { "accent": 
{ "": 1 }, "age": { "thirties": 0.28, "": 0.16, "teens": 0.07, "fifties": 0.09, "twenties": 0.2, "fourties": 0.22, "sixties": 0 }, "gender": { "male": 0.61, "": 0.16, "female": 0.22, "other": 0.01 } }, "users": 222, "duration": 55571595, "avgDurationSecs": 3.461, "validDurationSecs": 43441.157, "totalHrs": 15.43, "validHrs": 12.06 }, "cnh": { "reportedSentences": 9, "size": 161331331, "checksum": "9c27ce17ea8db73e7a2c8715bdb3a45a40792d6d64238cfbb467a81c6b71d71f", "buckets": { "dev": 756, "invalidated": 433, "other": 2934, "reported": 8, "test": 752, "train": 807, "validated": 2432 }, "clips": 5799, "splits": { "accent": { "": 0.85, "other": 0.15 }, "age": { "": 0.51, "twenties": 0.36, "fourties": 0.01, "teens": 0.02, "thirties": 0.08, "fifties": 0.02 }, "gender": { "": 0.51, "male": 0.33, "female": 0.16 } }, "users": 297, "duration": 20663664, "avgDurationSecs": 3.563, "validDurationSecs": 8665.982, "totalHrs": 5.73, "validHrs": 2.4 }, "et": { "reportedSentences": 108, "size": 767174465, "checksum": "50a861393e4e7013ab71f1b63bca8c42c26dca1519c15a3b9cdb3cb5b6c561a2", "buckets": { "dev": 2507, "invalidated": 3557, "other": 569, "reported": 107, "test": 2509, "train": 2966, "validated": 10683 }, "clips": 14809, "splits": { "accent": { "": 1 }, "age": { "": 0.15, "thirties": 0.07, "twenties": 0.71, "fourties": 0.07, "fifties": 0.01, "seventies": 0, "teens": 0 }, "gender": { "": 0.16, "male": 0.6, "female": 0.24, "other": 0 } }, "users": 543, "duration": 98800166, "avgDurationSecs": 6.672, "validDurationSecs": 71273.021, "totalHrs": 27.44, "validHrs": 19.79 }, "ky": { "reportedSentences": 1, "size": 579440853, "checksum": "6efe0ca5384d0419fcf5fda0e0229a1b5eb80d8eeba2d7528a4c3c9f2593206f", "buckets": { "dev": 1511, "invalidated": 926, "other": 7223, "reported": 0, "test": 1503, "train": 1955, "validated": 9236 }, "clips": 17385, "splits": { "accent": { "": 0.92, "other": 0.08 }, "age": { "thirties": 0.13, "": 0.06, "fourties": 0.02, "twenties": 0.78, "teens": 0.01 }, 
"gender": { "male": 0.46, "": 0.11, "female": 0.43, "other": 0 } }, "users": 134, "duration": 80915736, "avgDurationSecs": 4.654, "validDurationSecs": 42987.503, "totalHrs": 22.47, "validHrs": 11.94 }, "ro": { "reportedSentences": 60, "size": 261978702, "checksum": "450b159e936ef6ff136fcdfad193675caec5b2230d1b6ca24c5cde491ff002cd", "buckets": { "dev": 858, "invalidated": 485, "other": 1945, "reported": 59, "test": 1778, "train": 3399, "validated": 6039 }, "clips": 8469, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.51, "teens": 0.05, "": 0.07, "fourties": 0.08, "sixties": 0.01, "twenties": 0.27, "fifties": 0 }, "gender": { "male": 0.72, "": 0.07, "female": 0.2, "other": 0 } }, "users": 130, "duration": 34024999, "avgDurationSecs": 4.018, "validDurationSecs": 24262.247, "totalHrs": 9.45, "validHrs": 6.73 }, "hsb": { "reportedSentences": 57, "size": 79362060, "checksum": "3dd3d79aaa078ad7955552ebc596e0a8894ffd7a4a88a51b2c8ee80c0e088152", "buckets": { "dev": 172, "invalidated": 227, "other": 62, "reported": 56, "test": 387, "train": 808, "validated": 1367 }, "clips": 1656, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.55, "": 0.17, "thirties": 0.11, "sixties": 0, "seventies": 0.03, "twenties": 0.11, "fifties": 0.03 }, "gender": { "male": 0.83, "": 0.17, "other": 0 } }, "users": 19, "duration": 10103328, "avgDurationSecs": 6.101, "validDurationSecs": 8340.126, "totalHrs": 2.8, "validHrs": 2.31 }, "el": { "reportedSentences": 19, "size": 381570611, "checksum": "86c67e7bda7658a7087b5a1997d140d57957a05bb413a188610db61807c53ee4", "buckets": { "dev": 1401, "invalidated": 185, "other": 5659, "reported": 18, "test": 1522, "train": 2316, "validated": 5996 }, "clips": 11840, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.64, "fourties": 0.19, "": 0.14, "twenties": 0.02, "fifties": 0.01, "teens": 0 }, "gender": { "male": 0.85, "": 0.14, "other": 0.01, "female": 0 } }, "users": 118, "duration": 49105618, "avgDurationSecs": 4.147, "validDurationSecs": 
24868.014, "totalHrs": 13.64, "validHrs": 6.9 }, "cs": { "reportedSentences": 582, "size": 1271909933, "checksum": "68a1d6f27eb7161fdf28da889e7d37e8c86b7aff73b0b6df52edc8359e30ac56", "buckets": { "dev": 4118, "invalidated": 685, "other": 7475, "reported": 585, "test": 4144, "train": 5655, "validated": 30431 }, "clips": 38591, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.26, "": 0.3, "thirties": 0.14, "teens": 0.01, "twenties": 0.27, "fifties": 0.02, "sixties": 0.01 }, "gender": { "male": 0.69, "": 0.29, "female": 0.02 } }, "users": 353, "duration": 164383635, "avgDurationSecs": 4.26, "validDurationSecs": 129625.001, "totalHrs": 45.66, "validHrs": 36 }, "pl": { "reportedSentences": 431, "size": 3537012341, "checksum": "acbf77d36e083e2bcb7152ffb52ab7d1e3e64d33a3f51f106cdff7feff6279aa", "buckets": { "dev": 5153, "invalidated": 4601, "other": 12848, "reported": 431, "test": 5153, "train": 7468, "validated": 90791 }, "clips": 108240, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.27, "": 0.24, "teens": 0.02, "thirties": 0.35, "fourties": 0.12, "fifties": 0, "nineties": 0.01, "sixties": 0 }, "gender": { "male": 0.6, "": 0.24, "female": 0.15, "other": 0 } }, "users": 2647, "duration": 466550750, "avgDurationSecs": 4.31, "validDurationSecs": 391339.7, "totalHrs": 129.59, "validHrs": 108.7 }, "rm-sursilv": { "reportedSentences": 10, "size": 275950479, "checksum": "3cfc4971b6ab8958d7c3d784977690fcc04ebd7570ecf788d5948df84a5481a1", "buckets": { "dev": 1205, "invalidated": 639, "other": 2102, "reported": 9, "test": 1194, "train": 1384, "validated": 3783 }, "clips": 6524, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.02, "twenties": 0.11, "": 0.64, "teens": 0.06, "fourties": 0.17 }, "gender": { "male": 0.15, "female": 0.2, "": 0.64, "other": 0 } }, "users": 78, "duration": 35885741, "avgDurationSecs": 5.501, "validDurationSecs": 20808.669, "totalHrs": 9.96, "validHrs": 5.78 }, "rm-vallader": { "reportedSentences": 20, "size": 108113989, "checksum": 
"4fdb7dc5e20862a636ee7975831b39db29012d615f9139edf2d266b878ce43ae", "buckets": { "dev": 357, "invalidated": 374, "other": 727, "reported": 21, "test": 378, "train": 574, "validated": 1316 }, "clips": 2417, "splits": { "accent": { "": 1 }, "age": { "": 0.35, "fourties": 0.44, "twenties": 0.15, "thirties": 0.06, "fifties": 0 }, "gender": { "": 0.35, "male": 0.44, "female": 0.21, "other": 0.01 } }, "users": 39, "duration": 13869698, "avgDurationSecs": 5.738, "validDurationSecs": 7551.726, "totalHrs": 3.85, "validHrs": 2.09 }, "mn": { "reportedSentences": 3, "size": 486369317, "checksum": "3aebc40d40eb19263576664a981f4bb8b221abeab78c8154adc3d16875c75ec7", "buckets": { "dev": 1837, "invalidated": 667, "other": 3272, "reported": 2, "test": 1862, "train": 2183, "validated": 7487 }, "clips": 11426, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.23, "": 0.25, "twenties": 0.42, "fourties": 0.01, "teens": 0.02, "nineties": 0.06 }, "gender": { "male": 0.35, "": 0.25, "female": 0.34, "other": 0.06 } }, "users": 376, "duration": 62534240, "avgDurationSecs": 5.473, "validDurationSecs": 40976.182, "totalHrs": 17.37, "validHrs": 11.38 }, "zh-HK": { "reportedSentences": 560, "size": 2774145806, "checksum": "8a525ce4664d6647701449d5e72f7d8658cc3a5fabc72e05c6883994fd3c0134", "buckets": { "dev": 5172, "invalidated": 2999, "other": 38830, "reported": 571, "test": 5172, "train": 7506, "validated": 41835 }, "clips": 83664, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.15, "thirties": 0.11, "": 0.4, "teens": 0.02, "fifties": 0.03, "seventies": 0, "sixties": 0, "twenties": 0.29 }, "gender": { "male": 0.4, "": 0.34, "female": 0.24, "other": 0.01 } }, "users": 2536, "duration": 361968900, "avgDurationSecs": 4.326, "validDurationSecs": 180997.429, "totalHrs": 100.54, "validHrs": 50.27 }, "ab": { "reportedSentences": 43, "size": 41038412, "checksum": "801de9c63f740c4d2c821709586921bed216c736e593051306579cf478a54388", "buckets": { "dev": 0, "invalidated": 8, "other": 752, 
"reported": 42, "test": 9, "train": 22, "validated": 31 }, "clips": 791, "splits": { "accent": { "": 1 }, "age": { "seventies": 0.63, "thirties": 0.29, "": 0.08, "teens": 0.01 }, "gender": { "male": 0.64, "female": 0.29, "": 0.08 } }, "users": 14, "duration": 5216688, "avgDurationSecs": 6.595, "validDurationSecs": 204.447, "totalHrs": 1.44, "validHrs": 0.05 }, "cv": { "reportedSentences": 75, "size": 439329081, "checksum": "c3fb84c28a5718f01b91cf1026985b1dcd83bb312d32620f16b5ed4f12fb8c73", "buckets": { "dev": 818, "invalidated": 1282, "other": 6927, "reported": 74, "test": 788, "train": 931, "validated": 3496 }, "clips": 11705, "splits": { "accent": { "other": 0, "": 1 }, "age": { "twenties": 0.48, "": 0.27, "fourties": 0.09, "thirties": 0.01, "teens": 0.13, "fifties": 0.02 }, "gender": { "male": 0.45, "": 0.26, "female": 0.3 } }, "users": 92, "duration": 58562220, "avgDurationSecs": 5.003, "validDurationSecs": 17491.117, "totalHrs": 16.26, "validHrs": 4.85 }, "uk": { "reportedSentences": 323, "size": 1218559031, "checksum": "f3ca0143cd84f5eacb583187052e69efec21c571a426efee91a765a2284519c2", "buckets": { "dev": 3236, "invalidated": 1255, "other": 8161, "reported": 323, "test": 3235, "train": 4035, "validated": 22337 }, "clips": 31753, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.27, "teens": 0.07, "": 0.19, "fourties": 0.22, "thirties": 0.24, "fifties": 0.01, "sixties": 0 }, "gender": { "male": 0.59, "female": 0.22, "": 0.19 } }, "users": 459, "duration": 158034012, "avgDurationSecs": 4.977, "validDurationSecs": 111170.778, "totalHrs": 43.89, "validHrs": 30.88 }, "mt": { "reportedSentences": 5, "size": 425114242, "checksum": "9d53000d7832d130c4d35fb412bfc092ab8de8e763a5d2a528aebf37f052af03", "buckets": { "dev": 1516, "invalidated": 314, "other": 5714, "reported": 4, "test": 1617, "train": 2036, "validated": 5747 }, "clips": 11775, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.17, "": 0.25, "fourties": 0.18, "thirties": 0.08, "teens": 0.03, 
"fifties": 0.28, "sixties": 0.01 }, "gender": { "male": 0.26, "": 0.25, "female": 0.48, "other": 0.01 } }, "users": 171, "duration": 55832112, "avgDurationSecs": 4.742, "validDurationSecs": 27249.864, "totalHrs": 15.5, "validHrs": 7.56 }, "as": { "reportedSentences": 8, "size": 22226465, "checksum": "d9afd6d28e9c837ff0943a94452fb12ce8a7885b38fdeb25fc2912bbe4977f40", "buckets": { "dev": 124, "invalidated": 31, "other": 0, "reported": 7, "test": 110, "train": 270, "validated": 504 }, "clips": 535, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.29, "": 0.62, "thirties": 0.08 }, "gender": { "male": 0.38, "": 0.62 } }, "users": 17, "duration": 2835893, "avgDurationSecs": 5.301, "validDurationSecs": 2671.57, "totalHrs": 0.78, "validHrs": 0.74 }, "ka": { "reportedSentences": 3, "size": 104280554, "checksum": "7677df9d650234306a11bf8518be5807e72e7d5fc440d391304d1b99dd5517f5", "buckets": { "dev": 527, "invalidated": 139, "other": 44, "reported": 2, "test": 656, "train": 1058, "validated": 2275 }, "clips": 2458, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.59, "thirties": 0.1, "": 0.26, "fourties": 0.05 }, "gender": { "male": 0.4, "female": 0.35, "": 0.25 } }, "users": 44, "duration": 14005512, "avgDurationSecs": 5.698, "validDurationSecs": 12962.791, "totalHrs": 3.89, "validHrs": 3.6 }, "fy-NL": { "reportedSentences": 281, "size": 1237743070, "checksum": "ddee4fc3ce52df2379fa4069090d8f5c853155dc0462eb645f6111e2da627297", "buckets": { "dev": 2790, "invalidated": 1031, "other": 21569, "reported": 281, "test": 3020, "train": 3927, "validated": 10495 }, "clips": 33095, "splits": { "accent": { "": 1 }, "age": { "": 0.56, "fifties": 0.1, "thirties": 0.06, "twenties": 0.03, "fourties": 0.11, "sixties": 0.15, "seventies": 0 }, "gender": { "": 0.56, "male": 0.08, "female": 0.36 } }, "users": 467, "duration": 168972795, "avgDurationSecs": 5.106, "validDurationSecs": 53584.212, "totalHrs": 46.93, "validHrs": 14.88 }, "dv": { "reportedSentences": 3, "size": 
540488041, "checksum": "b2c8617df5e7aebd74d88491913ecc6b94066198e875853b0b3847d13e70f419", "buckets": { "dev": 2077, "invalidated": 840, "other": 0, "reported": 2, "test": 2202, "train": 2680, "validated": 11866 }, "clips": 12706, "splits": { "accent": { "": 1 }, "age": { "": 0.11, "twenties": 0.19, "thirties": 0.62, "fourties": 0.05, "teens": 0.02 }, "gender": { "": 0.11, "male": 0.35, "female": 0.53 } }, "users": 167, "duration": 70042959, "avgDurationSecs": 5.513, "validDurationSecs": 65412.384, "totalHrs": 19.45, "validHrs": 18.17 }, "pa-IN": { "reportedSentences": 231, "size": 69748265, "checksum": "d2e30f28a227ecb8209340c4133edf6489f35f8e3d1eb55ff22b96b12f36952c", "buckets": { "dev": 44, "invalidated": 43, "other": 1411, "reported": 232, "test": 116, "train": 211, "validated": 371 }, "clips": 1825, "splits": { "accent": { "": 1 }, "age": { "": 0.13, "fourties": 0.06, "fifties": 0.01, "thirties": 0.56, "twenties": 0.23 }, "gender": { "": 0.13, "male": 0.87 } }, "users": 26, "duration": 8904554, "avgDurationSecs": 4.879, "validDurationSecs": 1810.186, "totalHrs": 2.47, "validHrs": 0.5 }, "vi": { "reportedSentences": 22, "size": 51929480, "checksum": "704bce8031932377cc21c017923ff1e96ebd2be9bd520adcf839f7a0f5f03b6e", "buckets": { "dev": 200, "invalidated": 78, "other": 870, "reported": 22, "test": 198, "train": 221, "validated": 619 }, "clips": 1567, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.05, "twenties": 0.23, "": 0.15, "teens": 0.02, "seventies": 0.01, "fourties": 0, "sixties": 0.53 }, "gender": { "male": 0.82, "": 0.15, "female": 0.02, "other": 0 } }, "users": 62, "duration": 6755804, "avgDurationSecs": 4.311, "validDurationSecs": 2668.693, "totalHrs": 1.87, "validHrs": 0.74 }, "or": { "reportedSentences": 5, "size": 199077358, "checksum": "f3edad30166fe454f4d2b14adeece1434dc4b8eb7b0ece37aac8389b7122218a", "buckets": { "dev": 129, "invalidated": 62, "other": 4302, "reported": 4, "test": 98, "train": 388, "validated": 615 }, "clips": 4979, 
"splits": { "accent": { "": 1 }, "age": { "twenties": 0.13, "": 0.03, "thirties": 0.83, "fourties": 0, "teens": 0 }, "gender": { "male": 0.97, "": 0.03, "female": 0 } }, "users": 34, "duration": 25499688, "avgDurationSecs": 5.121, "validDurationSecs": 3149.69, "totalHrs": 7.08, "validHrs": 0.87 }, "ga-IE": { "reportedSentences": 5, "size": 156553447, "checksum": "27223fc99af6a45f81190ecb90034806991ff3b9e3aa38a7e97caaabbb0a4ddc", "buckets": { "dev": 497, "invalidated": 409, "other": 2130, "reported": 4, "test": 506, "train": 541, "validated": 3352 }, "clips": 5891, "splits": { "accent": { "": 0.4, "connachta": 0.27, "mumhain": 0.15, "other": 0.06, "ulaidh": 0.12 }, "age": { "twenties": 0.37, "": 0.19, "thirties": 0.3, "fourties": 0.07, "sixties": 0, "teens": 0.02, "fifties": 0.06 }, "gender": { "male": 0.65, "": 0.19, "female": 0.17 } }, "users": 101, "duration": 20497419, "avgDurationSecs": 3.479, "validDurationSecs": 11663.104, "totalHrs": 5.69, "validHrs": 3.23 }, "fi": { "reportedSentences": 4, "size": 49882909, "checksum": "eb26d0904beef5ec08cf53267be7e78b8ba5056fd162057d5b085a7cba51f035", "buckets": { "dev": 415, "invalidated": 59, "other": 149, "reported": 3, "test": 428, "train": 460, "validated": 1305 }, "clips": 1513, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.4, "": 0.23, "twenties": 0.14, "fourties": 0.17, "teens": 0.04, "fifties": 0.02 }, "gender": { "male": 0.77, "": 0.23, "female": 0 } }, "users": 27, "duration": 6905415, "avgDurationSecs": 4.564, "validDurationSecs": 5956.092, "totalHrs": 1.91, "validHrs": 1.65 }, "hu": { "reportedSentences": 32, "size": 242758708, "checksum": "61f933155cba6c54c0b76d0ddd2caebd62d69228b7c935382112abe172660953", "buckets": { "dev": 1434, "invalidated": 169, "other": 295, "reported": 31, "test": 1649, "train": 3348, "validated": 6457 }, "clips": 6921, "splits": { "accent": { "": 1 }, "age": { "teens": 0, "": 0.05, "thirties": 0.14, "twenties": 0.75, "fifties": 0.06 }, "gender": { "male": 0.89, "": 0.05, 
"female": 0.06 } }, "users": 47, "duration": 31226113, "avgDurationSecs": 4.512, "validDurationSecs": 29132.641, "totalHrs": 8.67, "validHrs": 8.09 }, "sah": { "size": 181245626, "checksum": "dea1a454813c8f90abcbdf427fa922e1b7a116753deeb410af096ce5f0ae2405", "buckets": { "dev": 405, "invalidated": 66, "other": 1275, "test": 757, "train": 1442, "validated": 2606 }, "clips": 3947, "splits": { "accent": { "": 1 }, "age": { "": 0.36, "twenties": 0.02, "fourties": 0.07, "thirties": 0.44, "teens": 0.11, "fifties": 0 }, "gender": { "": 0.36, "male": 0.54, "female": 0.1 } }, "users": 42, "duration": 23639016, "avgDurationSecs": 5.989, "validDurationSecs": 15607.62, "totalHrs": 6.56, "validHrs": 4.33 }, "vot": { "size": 7792602, "checksum": "7fb07dd25b0575e8cd811bb8d1e5aebd17fdbca079a4ee50d81e0aaaff50f8b0", "buckets": { "dev": 0, "invalidated": 6, "other": 411, "test": 0, "train": 3, "validated": 3 }, "clips": 420, "splits": { "accent": { "": 1 }, "age": { "": 1 }, "gender": { "": 1 } }, "users": 3, "duration": 1008696, "avgDurationSecs": 2.402, "validDurationSecs": 7.205, "totalHrs": 0.28, "validHrs": 0 }, "th": { "size": 341305736, "checksum": "a3d11043c49d3ea8ffb58dfab117cd831dd62a641e0a26ac60eb43e483534f7a", "buckets": { "dev": 1922, "invalidated": 467, "other": 2671, "reported": 275, "test": 2188, "train": 2917, "validated": 7028 }, "clips": 10166, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.3, "": 0.17, "thirties": 0.18, "fourties": 0.3, "teens": 0.05, "fifties": 0.01 }, "gender": { "male": 0.64, "": 0.17, "female": 0.19, "other": 0 } }, "users": 182, "duration": 44472936, "reportedSentences": 276, "avgDurationSecs": 4.375, "validDurationSecs": 30745.209, "totalHrs": 12.35, "validHrs": 8.54 }, "lg": { "size": 208197149, "checksum": "71243c65f638cd7f392fabe22e37cbafbdca4eb5a199210000ae957a88768040", "buckets": { "dev": 384, "invalidated": 290, "other": 3110, "reported": 2, "test": 584, "train": 1250, "validated": 2220 }, "clips": 5620, "splits": { "accent": 
{ "": 1 }, "age": { "": 0.06, "thirties": 0.63, "twenties": 0.31, "fourties": 0 }, "gender": { "": 0.08, "female": 0.79, "male": 0.13 } }, "users": 76, "duration": 29590983, "reportedSentences": 3, "avgDurationSecs": 5.265, "validDurationSecs": 11688.965, "totalHrs": 8.21, "validHrs": 3.24 }, "lt": { "size": 135299706, "checksum": "5ad3d93bc308f58a70e6685f71ae035237ef9caa0922232ac76846f7587bb8aa", "buckets": { "dev": 244, "invalidated": 102, "other": 1629, "reported": 41, "test": 466, "train": 931, "validated": 1644 }, "clips": 3375, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.48, "": 0.12, "thirties": 0.35, "fifties": 0.04, "sixties": 0.02 }, "gender": { "male": 0.86, "": 0.12, "female": 0.03 } }, "users": 30, "duration": 17591228, "reportedSentences": 42, "avgDurationSecs": 5.212, "validDurationSecs": 8568.883, "totalHrs": 4.88, "validHrs": 2.38 }, "hi": { "size": 21424045, "checksum": "5492393b04dd1307a52d93525a7db08fc392c8ba0df553668945152e434f58c9", "buckets": { "dev": 135, "invalidated": 60, "other": 139, "reported": 5, "test": 127, "train": 157, "validated": 419 }, "clips": 618, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.63, "fourties": 0.05, "": 0.27, "thirties": 0.04, "teens": 0.01 }, "gender": { "male": 0.71, "female": 0.02, "": 0.27 } }, "users": 31, "duration": 2899824, "reportedSentences": 5, "avgDurationSecs": 4.692, "validDurationSecs": 1966.062, "totalHrs": 0.8, "validHrs": 0.54 } }, "totalDuration": 33420537391, "totalValidDurationSecs": 26409526, "totalHrs": 9283, "totalValidHrs": 7335, "totalClips": 7040644 } ================================================ FILE: datasets/scripted-speech/cv-corpus-6.1-singleword.json ================================================ { "date": "2021-07-21", "locales": { "es": { "clips": 70038, "splits": { "accent": { "surpeninsular": 0.01, "": 0.8, "americacentral": 0.02, "centrosurpeninsular": 0.02, "mexicano": 0.05, "caribe": 0.02, "nortepeninsular": 0.02, "andino": 0.04, "rioplatense": 
0.02, "chileno": 0.01, "canario": 0, "filipinas": 0 }, "age": { "sixties": 0.01, "": 0.74, "thirties": 0.05, "fifties": 0.02, "fourties": 0.04, "twenties": 0.11, "teens": 0.04, "seventies": 0, "nineties": 0, "eighties": 0 }, "gender": { "male": 0.17, "": 0.74, "female": 0.09, "other": 0 } }, "users": 9052, "duration": 202516646, "buckets": { "dev": 4, "invalidated": 6542, "other": 40514, "reported": 1110, "test": 4, "train": 6, "validated": 22982 }, "reportedSentences": 1101, "avgDurationSecs": 2.892, "validDurationSecs": 66453.033, "totalHrs": 56.25, "validHrs": 18.45 }, "pl": { "clips": 1182, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.25, "thirties": 0.17, "fourties": 0.02, "teens": 0.08, "": 0.47, "sixties": 0.02 }, "gender": { "female": 0.13, "male": 0.4, "": 0.47, "other": 0.01 } }, "users": 112, "duration": 2713140, "buckets": { "dev": 4, "invalidated": 37, "other": 3, "reported": 431, "test": 4, "train": 6, "validated": 1142 }, "reportedSentences": 431, "avgDurationSecs": 2.295, "validDurationSecs": 2621.325, "totalHrs": 0.75, "validHrs": 0.72 }, "ca": { "clips": 4180, "splits": { "accent": { "valencian": 0.08, "": 0.51, "central": 0.35, "northern": 0.01, "northwestern": 0.03, "balearic": 0.02 }, "age": { "fifties": 0.08, "": 0.52, "sixties": 0.04, "thirties": 0.12, "fourties": 0.1, "twenties": 0.1, "teens": 0.04, "seventies": 0.01 }, "gender": { "male": 0.35, "": 0.53, "female": 0.12, "other": 0 } }, "users": 387, "duration": 10707094, "buckets": { "dev": 4, "invalidated": 217, "other": 297, "reported": 1193, "test": 4, "train": 6, "validated": 3666 }, "reportedSentences": 1191, "avgDurationSecs": 2.562, "validDurationSecs": 9390.48, "totalHrs": 2.97, "validHrs": 2.6 }, "tr": { "clips": 537, "splits": { "accent": { "": 1 }, "age": { "": 0.5, "fourties": 0.12, "twenties": 0.26, "teens": 0.03, "thirties": 0.1 }, "gender": { "": 0.5, "male": 0.45, "female": 0.05 } }, "users": 60, "duration": 1278637, "buckets": { "dev": 4, "invalidated": 46, 
"other": 44, "reported": 72, "test": 4, "train": 6, "validated": 447 }, "reportedSentences": 73, "avgDurationSecs": 2.381, "validDurationSecs": 1064.34, "totalHrs": 0.35, "validHrs": 0.29 }, "ru": { "clips": 2195, "splits": { "accent": { "": 1 }, "age": { "": 0.46, "teens": 0.12, "thirties": 0.13, "twenties": 0.23, "fourties": 0.05, "fifties": 0.01 }, "gender": { "": 0.47, "male": 0.46, "female": 0.06, "other": 0.01 } }, "users": 214, "duration": 4977118, "buckets": { "dev": 4, "invalidated": 104, "other": 57, "reported": 143, "test": 4, "train": 6, "validated": 2034 }, "reportedSentences": 137, "avgDurationSecs": 2.267, "validDurationSecs": 4612.054, "totalHrs": 1.38, "validHrs": 1.28 }, "pt": { "clips": 1307, "splits": { "accent": { "": 1 }, "age": { "": 0.4, "fourties": 0.11, "twenties": 0.29, "thirties": 0.16, "fifties": 0.03, "sixties": 0, "teens": 0.02 }, "gender": { "": 0.4, "male": 0.54, "female": 0.05, "other": 0.01 } }, "users": 140, "duration": 3449335, "buckets": { "dev": 4, "invalidated": 87, "other": 42, "reported": 1701, "test": 4, "train": 6, "validated": 1178 }, "reportedSentences": 1697, "avgDurationSecs": 2.639, "validDurationSecs": 3108.888, "totalHrs": 0.95, "validHrs": 0.86 }, "nl": { "clips": 644, "splits": { "accent": { "suriname": 0.02, "netherlands": 0.39, "": 0.46, "belgium": 0.13 }, "age": { "": 0.5, "twenties": 0.32, "sixties": 0, "thirties": 0.04, "fifties": 0.04, "fourties": 0.1 }, "gender": { "male": 0.45, "female": 0.06, "": 0.5 } }, "users": 67, "duration": 1531757, "buckets": { "dev": 4, "invalidated": 21, "other": 7, "reported": 210, "test": 4, "train": 6, "validated": 616 }, "reportedSentences": 211, "avgDurationSecs": 2.379, "validDurationSecs": 1465.159, "totalHrs": 0.42, "validHrs": 0.4 }, "de": { "clips": 15090, "splits": { "accent": { "germany": 0.26, "austria": 0.02, "": 0.7, "france": 0, "italy": 0, "switzerland": 0.01, "united_states": 0, "russia": 0, "united_kingdom": 0, "lithuania": 0 }, "age": { "fourties": 0.04, 
"fifties": 0.05, "twenties": 0.12, "thirties": 0.07, "": 0.63, "seventies": 0.01, "teens": 0.06, "sixties": 0.02, "eighties": 0 }, "gender": { "male": 0.31, "female": 0.06, "": 0.63, "other": 0 } }, "users": 1618, "duration": 39418281, "buckets": { "dev": 4, "invalidated": 1386, "other": 56, "reported": 3721, "test": 4, "train": 7, "validated": 13648 }, "reportedSentences": 3709, "avgDurationSecs": 2.612, "validDurationSecs": 35651.471, "totalHrs": 10.94, "validHrs": 9.9 }, "ta": { "clips": 341, "splits": { "accent": { "": 1 }, "age": { "": 0.45, "fifties": 0.04, "fourties": 0.04, "twenties": 0.17, "thirties": 0.26, "teens": 0.03 }, "gender": { "": 0.49, "male": 0.35, "female": 0.16 } }, "users": 39, "duration": 992550, "buckets": { "dev": 4, "invalidated": 39, "other": 50, "reported": 257, "test": 4, "train": 6, "validated": 252 }, "reportedSentences": 257, "avgDurationSecs": 2.911, "validDurationSecs": 733.497, "totalHrs": 0.27, "validHrs": 0.2 }, "ar": { "clips": 564, "splits": { "accent": { "": 1 }, "age": { "": 0.51, "thirties": 0.11, "twenties": 0.28, "teens": 0.05, "fifties": 0.02, "fourties": 0.02 }, "gender": { "": 0.5, "male": 0.42, "female": 0.08 } }, "users": 67, "duration": 1530140, "buckets": { "dev": 4, "invalidated": 45, "other": 39, "reported": 1750, "test": 4, "train": 6, "validated": 480 }, "reportedSentences": 1745, "avgDurationSecs": 2.713, "validDurationSecs": 1302.247, "totalHrs": 0.42, "validHrs": 0.36 }, "en": { "clips": 49370, "splits": { "accent": { "": 0.76, "us": 0.12, "england": 0.04, "indian": 0.04, "australia": 0.01, "canada": 0.02, "newzealand": 0, "scotland": 0, "hongkong": 0.01, "ireland": 0, "singapore": 0, "philippines": 0, "malaysia": 0, "african": 0, "wales": 0 }, "age": { "": 0.61, "twenties": 0.18, "teens": 0.05, "fourties": 0.04, "thirties": 0.07, "sixties": 0.01, "fifties": 0.03, "seventies": 0.01, "nineties": 0, "eighties": 0 }, "gender": { "": 0.62, "female": 0.09, "male": 0.29, "other": 0.01 } }, "users": 5876, 
"duration": 132611224, "buckets": { "dev": 4, "invalidated": 4071, "other": 12573, "reported": 1806, "test": 4, "train": 6, "validated": 32726 }, "reportedSentences": 1760, "avgDurationSecs": 2.686, "validDurationSecs": 87904.292, "totalHrs": 36.83, "validHrs": 24.41 }, "fr": { "clips": 22382, "splits": { "accent": { "france": 0.23, "": 0.74, "germany": 0, "canada": 0.01, "united_kingdom": 0, "belgium": 0.01, "switzerland": 0.01, "reunion": 0, "senegal": 0, "martinique": 0, "congo_brazzaville": 0, "cameroon": 0, "united_states": 0, "madagascar": 0, "romania": 0, "benin": 0, "cote_d_ivoire": 0, "morocco": 0 }, "age": { "sixties": 0.01, "": 0.7, "twenties": 0.1, "thirties": 0.06, "fourties": 0.05, "teens": 0.05, "fifties": 0.03, "seventies": 0 }, "gender": { "male": 0.24, "": 0.7, "female": 0.06, "other": 0.01 } }, "users": 2560, "duration": 59131896, "buckets": { "dev": 4, "invalidated": 2345, "other": 20, "reported": 3211, "test": 4, "train": 6, "validated": 20017 }, "reportedSentences": 3185, "avgDurationSecs": 2.642, "validDurationSecs": 52883.708, "totalHrs": 16.42, "validHrs": 14.68 }, "cy": { "clips": 553, "splits": { "accent": { "united_kingdom": 0.45, "": 0.55 }, "age": { "twenties": 0.17, "thirties": 0.07, "": 0.53, "teens": 0.04, "fifties": 0.09, "fourties": 0.06, "sixties": 0.02, "seventies": 0.02 }, "gender": { "female": 0.22, "male": 0.24, "": 0.54 } }, "users": 64, "duration": 1512525, "buckets": { "dev": 3, "invalidated": 15, "other": 62, "reported": 126, "test": 3, "train": 4, "validated": 476 }, "reportedSentences": 127, "avgDurationSecs": 2.735, "validDurationSecs": 1301.92, "totalHrs": 0.42, "validHrs": 0.36 }, "eo": { "clips": 965, "splits": { "accent": { "": 0.67, "internacia": 0.33 }, "age": { "": 0.45, "fourties": 0.03, "twenties": 0.28, "teens": 0.06, "eighties": 0.01, "sixties": 0.02, "thirties": 0.14, "fifties": 0.01 }, "gender": { "": 0.46, "male": 0.48, "other": 0.01, "female": 0.05 } }, "users": 91, "duration": 2260824, "buckets": { 
"dev": 3, "invalidated": 41, "other": 27, "reported": 404, "test": 3, "train": 6, "validated": 897 }, "reportedSentences": 404, "avgDurationSecs": 2.343, "validDurationSecs": 2101.512, "totalHrs": 0.62, "validHrs": 0.58 }, "ja": { "clips": 644, "splits": { "accent": { "": 1 }, "age": { "teens": 0.05, "twenties": 0.5, "": 0.32, "fifties": 0.02, "fourties": 0.04, "thirties": 0.06 }, "gender": { "male": 0.43, "female": 0.24, "": 0.31, "other": 0.02 } }, "users": 74, "duration": 1454877, "buckets": { "dev": 3, "invalidated": 34, "other": 18, "reported": 43, "test": 3, "train": 6, "validated": 592 }, "reportedSentences": 44, "avgDurationSecs": 2.259, "validDurationSecs": 1337.402, "totalHrs": 0.4, "validHrs": 0.37 }, "id": { "clips": 171, "splits": { "accent": { "": 1 }, "age": { "": 0.37, "twenties": 0.63 }, "gender": { "": 0.37, "male": 0.5, "female": 0.07, "other": 0.06 } }, "users": 20, "duration": 375907, "buckets": { "dev": 3, "invalidated": 11, "other": 17, "reported": 200, "test": 3, "train": 6, "validated": 143 }, "reportedSentences": 201, "avgDurationSecs": 2.198, "validDurationSecs": 314.355, "totalHrs": 0.1, "validHrs": 0.08 }, "eu": { "clips": 635, "splits": { "accent": { "": 0.8, "mendebalekoa": 0.12, "erdialdekoa_nafarra": 0.08 }, "age": { "": 0.74, "teens": 0.01, "fourties": 0.16, "fifties": 0.04, "thirties": 0.04, "twenties": 0.02 }, "gender": { "": 0.74, "female": 0.09, "male": 0.17 } }, "users": 80, "duration": 1577809, "buckets": { "dev": 2, "invalidated": 13, "other": 406, "reported": 38, "test": 3, "train": 7, "validated": 216 }, "reportedSentences": 39, "avgDurationSecs": 2.485, "validDurationSecs": 536.704, "totalHrs": 0.43, "validHrs": 0.14 }, "tt": { "clips": 55, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.44, "": 0.55, "twenties": 0.02 }, "gender": { "male": 0.44, "": 0.55, "female": 0.02 } }, "users": 8, "duration": 154248, "buckets": { "dev": 4, "invalidated": 2, "other": 18, "reported": 2, "test": 4, "train": 6, "validated": 35 
}, "reportedSentences": 3, "avgDurationSecs": 2.805, "validDurationSecs": 98.158, "totalHrs": 0.04, "validHrs": 0.02 }, "kab": { "clips": 513, "splits": { "accent": { "": 0.98, "other": 0.02 }, "age": { "fourties": 0.08, "thirties": 0.2, "": 0.44, "fifties": 0.05, "twenties": 0.19, "sixties": 0.01, "teens": 0.02 }, "gender": { "male": 0.5, "": 0.45, "female": 0.05 } }, "users": 53, "duration": 1458225, "buckets": { "dev": 3, "invalidated": 11, "other": 274, "reported": 3166, "test": 3, "train": 6, "validated": 228 }, "reportedSentences": 3162, "avgDurationSecs": 2.843, "validDurationSecs": 648.1, "totalHrs": 0.4, "validHrs": 0.18 }, "br": { "clips": 42, "splits": { "accent": { "gwenedeg": 0.55, "": 0.38, "kerneveg": 0.07 }, "age": { "fourties": 0.05, "twenties": 0.26, "": 0.21, "fifties": 0.4, "teens": 0.07 }, "gender": { "male": 0.79, "": 0.21 } }, "users": 7, "duration": 109416, "buckets": { "dev": 0, "invalidated": 0, "other": 40, "reported": 166, "test": 0, "train": 2, "validated": 2 }, "reportedSentences": 167, "avgDurationSecs": 2.605, "validDurationSecs": 5.21, "totalHrs": 0.03, "validHrs": 0 }, "zh-CN": { "clips": 319, "splits": { "accent": { "140000": 0, "230000": 0.08, "320000": 0.05, "340000": 0.04, "350000": 0.04, "410000": 0.04, "420000": 0.04, "450000": 0.04, "500000": 0.02, "620000": 0.04, "": 0.63 }, "age": { "thirties": 0.13, "": 0.51, "twenties": 0.21, "teens": 0.09, "fourties": 0.05 }, "gender": { "male": 0.39, "": 0.51, "female": 0.1 } }, "users": 39, "duration": 778185, "buckets": { "dev": 3, "invalidated": 24, "other": 0, "reported": 280, "test": 3, "train": 6, "validated": 295 }, "reportedSentences": 280, "avgDurationSecs": 2.439, "validDurationSecs": 719.638, "totalHrs": 0.21, "validHrs": 0.19 }, "fy-NL": { "clips": 216, "splits": { "accent": { "": 1 }, "age": { "": 0.84, "sixties": 0.05, "fourties": 0.11 }, "gender": { "": 0.84, "female": 0.1, "male": 0.06 } }, "users": 22, "duration": 650279, "buckets": { "dev": 3, "invalidated": 13, 
"other": 126, "reported": 281, "test": 3, "train": 6, "validated": 77 }, "reportedSentences": 281, "avgDurationSecs": 3.011, "validDurationSecs": 231.812, "totalHrs": 0.18, "validHrs": 0.06 }, "zh-TW": { "clips": 27, "splits": { "accent": { "": 1 }, "age": { "": 0.37, "teens": 0.44, "twenties": 0.19 }, "gender": { "": 0.37, "male": 0.63 } }, "users": 3, "duration": 61796, "buckets": { "dev": 0, "invalidated": 0, "other": 17, "reported": 38, "test": 0, "train": 8, "validated": 10 }, "reportedSentences": 39, "avgDurationSecs": 2.289, "validDurationSecs": 22.887, "totalHrs": 0.01, "validHrs": 0 }, "cs": { "clips": 542, "splits": { "accent": { "": 1 }, "age": { "teens": 0.04, "": 0.7, "thirties": 0.15, "twenties": 0.04, "fourties": 0.04, "fifties": 0.02 }, "gender": { "male": 0.24, "": 0.7, "female": 0.06 } }, "users": 54, "duration": 1358979, "buckets": { "dev": 3, "invalidated": 14, "other": 0, "reported": 585, "test": 3, "train": 6, "validated": 528 }, "reportedSentences": 582, "avgDurationSecs": 2.507, "validDurationSecs": 1323.876, "totalHrs": 0.37, "validHrs": 0.36 }, "zh-HK": { "clips": 503, "splits": { "accent": { "": 1 }, "age": { "": 0.74, "twenties": 0.03, "fourties": 0.09, "thirties": 0.14, "teens": 0.01 }, "gender": { "": 0.74, "male": 0.2, "female": 0.07 } }, "users": 53, "duration": 1335754, "buckets": { "dev": 3, "invalidated": 21, "other": 0, "reported": 571, "test": 3, "train": 6, "validated": 482 }, "reportedSentences": 560, "avgDurationSecs": 2.656, "validDurationSecs": 1279.987, "totalHrs": 0.37, "validHrs": 0.35 }, "lg": { "clips": 56, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.39, "": 0.39, "thirties": 0.21 }, "gender": { "male": 0.18, "": 0.61, "female": 0.21 } }, "users": 7, "duration": 163592, "buckets": { "dev": 0, "invalidated": 3, "other": 41, "reported": 1, "test": 1, "train": 9, "validated": 12 }, "reportedSentences": 2, "avgDurationSecs": 2.921, "validDurationSecs": 35.055, "totalHrs": 0.04, "validHrs": 0 }, "ka": { "clips": 
29, "splits": { "accent": { "": 1 }, "age": { "": 0.59, "twenties": 0.41 }, "gender": { "": 0.59, "male": 0.41 } }, "users": 3, "duration": 74256, "buckets": { "dev": 0, "invalidated": 0, "other": 0, "reported": 2, "test": 0, "train": 12, "validated": 29 }, "reportedSentences": 3, "avgDurationSecs": 2.561, "validDurationSecs": 74.256, "totalHrs": 0.02, "validHrs": 0.02 }, "sv-SE": { "clips": 29, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.48, "": 0.52 }, "gender": { "female": 0.41, "": 0.52, "male": 0.07 } }, "users": 3, "duration": 73608, "buckets": { "dev": 0, "invalidated": 0, "other": 6, "reported": 151, "test": 1, "train": 11, "validated": 23 }, "reportedSentences": 152, "avgDurationSecs": 2.538, "validDurationSecs": 58.379, "totalHrs": 0.02, "validHrs": 0.01 }, "cv": { "clips": 67, "splits": { "accent": { "": 1 }, "age": { "teens": 0.31, "twenties": 0.18, "": 0.33, "fourties": 0.18 }, "gender": { "female": 0.24, "male": 0.43, "": 0.33 } }, "users": 8, "duration": 186589, "buckets": { "dev": 1, "invalidated": 2, "other": 25, "reported": 74, "test": 3, "train": 8, "validated": 40 }, "reportedSentences": 75, "avgDurationSecs": 2.785, "validDurationSecs": 111.396, "totalHrs": 0.05, "validHrs": 0.03 }, "ky": { "clips": 5, "splits": { "accent": { "": 1 }, "age": { "": 1 }, "gender": { "": 1 } }, "users": 1, "duration": 15744, "buckets": { "dev": 0, "invalidated": 0, "other": 0, "reported": 0, "test": 0, "train": 5, "validated": 5 }, "reportedSentences": 1, "avgDurationSecs": 3.149, "validDurationSecs": 15.744, "totalHrs": 0, "validHrs": 0 }, "rw": { "clips": 16, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.75, "": 0.25 }, "gender": { "male": 0.75, "": 0.25 } }, "users": 2, "duration": 41736, "buckets": { "dev": 0, "invalidated": 0, "other": 0, "reported": 618, "test": 0, "train": 12, "validated": 16 }, "reportedSentences": 619, "avgDurationSecs": 2.609, "validDurationSecs": 41.736, "totalHrs": 0.01, "validHrs": 0.01 } }, "overall": { "size": 
3564521974, "checksum": "738186f52f92f1ffb01d61ff5abfabcd012f7d6a3def2d4c1822c366e600c2df" }, "totalDuration": 474502167, "totalValidDurationSecs": 277448, "totalHrs": 131, "totalValidHrs": 77, "totalClips": 173217 } ================================================ FILE: datasets/scripted-speech/cv-corpus-7.0-2021-07-21.json ================================================ { "date": "2021-07-21", "locales": { "en": { "buckets": { "dev": 16284, "invalidated": 220015, "other": 220176, "reported": 2732, "test": 16284, "train": 759975, "validated": 1425784 }, "reportedSentences": 2679, "duration": 9493711987, "clips": 1865975, "splits": { "accent": { "": 0.51, "canada": 0.03, "england": 0.08, "us": 0.23, "indian": 0.07, "australia": 0.03, "malaysia": 0, "newzealand": 0.01, "african": 0.01, "ireland": 0.01, "philippines": 0, "singapore": 0, "scotland": 0.02, "hongkong": 0, "bermuda": 0, "southatlandtic": 0, "wales": 0, "other": 0.01 }, "age": { "": 0.37, "twenties": 0.24, "sixties": 0.04, "thirties": 0.13, "teens": 0.06, "seventies": 0.01, "fourties": 0.1, "fifties": 0.04, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male": 0.45, "female": 0.15, "other": 0.02 } }, "users": 75879, "size": 70115911389, "checksum": "eb1e30cc9e0e17ff0b8839c02709a6af12f13e36b1f2d0f8cd9cbd88baa387cf", "avgDurationSecs": 5.088, "validDurationSecs": 7254107.183, "totalHrs": 2637.14, "validHrs": 2015.02 }, "fa": { "buckets": { "dev": 7111, "invalidated": 12124, "other": 24321, "reported": 1635, "test": 7111, "train": 12420, "validated": 261539 }, "reportedSentences": 1628, "duration": 1205816984, "clips": 297984, "splits": { "accent": { "": 1 }, "age": { "": 0.22, "twenties": 0.33, "thirties": 0.39, "fifties": 0.02, "fourties": 0.03, "teens": 0.02, "sixties": 0 }, "gender": { "": 0.19, "male": 0.74, "female": 0.07, "other": 0 } }, "users": 3835, "size": 9173036961, "checksum": "2ed8723a1406f17de626336e944ef59dd153b0e157508e76bc5e9060e271f74d", "avgDurationSecs": 4.047, 
"validDurationSecs": 1058339.267, "totalHrs": 334.94, "validHrs": 293.98 }, "fr": { "buckets": { "dev": 15941, "invalidated": 48074, "other": 15092, "reported": 4969, "test": 15941, "train": 379101, "validated": 542288 }, "reportedSentences": 4907, "duration": 3002987494, "clips": 605454, "splits": { "accent": { "": 0.32, "france": 0.63, "germany": 0, "belgium": 0.02, "switzerland": 0.01, "guadeloupe": 0, "reunion": 0, "monaco": 0, "tunisia": 0, "canada": 0.02, "other": 0, "mayotte": 0, "algeria": 0, "netherlands": 0, "senegal": 0, "martinique": 0, "portugal": 0, "united_states": 0, "cote_d_ivoire": 0, "st_pierre_et_miquelon": 0, "french_guiana": 0, "benin": 0, "italy": 0, "united_kingdom": 0, "madagascar": 0, "cameroon": 0, "new_caledonia": 0, "haiti": 0, "morocco": 0, "romania": 0, "congo_kinshasa": 0, "congo_brazzaville": 0, "gabon": 0, "burundi": 0, "luxembourg": 0, "ireland": 0, "mali": 0 }, "age": { "twenties": 0.19, "thirties": 0.18, "": 0.33, "teens": 0.03, "fourties": 0.12, "fifties": 0.1, "sixties": 0.04, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "male": 0.63, "": 0.26, "female": 0.11, "other": 0.01 } }, "users": 15391, "size": 22312655903, "checksum": "5f272cbe0f916b4bd6191f84e8017cbc66faca2c709896301cca69fde95287d5", "avgDurationSecs": 4.96, "validDurationSecs": 2689690.847, "totalHrs": 834.16, "validHrs": 747.13 }, "es": { "buckets": { "dev": 15338, "invalidated": 45088, "other": 174407, "reported": 1427, "test": 15338, "train": 196006, "validated": 271010 }, "reportedSentences": 1418, "duration": 2461740750, "clips": 490505, "splits": { "accent": { "nortepeninsular": 0.1, "": 0.47, "rioplatense": 0.04, "centrosurpeninsular": 0.03, "americacentral": 0.02, "surpeninsular": 0.17, "canario": 0, "andino": 0.06, "mexicano": 0.06, "chileno": 0.02, "caribe": 0.03, "filipinas": 0 }, "age": { "thirties": 0.1, "": 0.32, "fifties": 0.09, "twenties": 0.24, "teens": 0.03, "fourties": 0.06, "sixties": 0.16, "eighties": 0, "seventies": 0, 
"nineties": 0 }, "gender": { "male": 0.51, "": 0.32, "other": 0, "female": 0.17 } }, "users": 21761, "size": 18379233074, "checksum": "22915f54e1dd4c86155ffc34bf30a1ddb8eb98c04f35b467d23f7e669d0f8301", "avgDurationSecs": 5.019, "validDurationSecs": 1360141.814, "totalHrs": 683.81, "validHrs": 377.81 }, "sl": { "buckets": { "dev": 1211, "invalidated": 183, "other": 586, "reported": 24, "test": 1184, "train": 1347, "validated": 8517 }, "reportedSentences": 25, "duration": 36033166, "clips": 9286, "splits": { "accent": { "other": 0.03, "": 0.97 }, "age": { "twenties": 0.53, "teens": 0.1, "": 0.14, "sixties": 0.09, "fifties": 0.09, "fourties": 0.01, "thirties": 0.05 }, "gender": { "female": 0.16, "male": 0.7, "": 0.14, "other": 0 } }, "users": 113, "size": 265780682, "checksum": "a6888de32a421dad513df87e52719ebdaa866aa08a748e34892d14922acc914c", "avgDurationSecs": 3.88, "validDurationSecs": 33049.157, "totalHrs": 10, "validHrs": 9.18 }, "kab": { "buckets": { "dev": 14860, "invalidated": 18900, "other": 97594, "reported": 4304, "test": 14860, "train": 138897, "validated": 594182 }, "reportedSentences": 4300, "duration": 2362024060, "clips": 710676, "splits": { "accent": { "": 0.9, "other": 0.1 }, "age": { "fourties": 0.09, "thirties": 0.3, "": 0.27, "fifties": 0.19, "twenties": 0.12, "eighties": 0, "teens": 0, "sixties": 0.03, "seventies": 0 }, "gender": { "male": 0.55, "": 0.25, "female": 0.2, "other": 0 } }, "users": 1408, "size": 17857634451, "checksum": "efadc5795461d8c12ae3a2367bf1731191f893e1682e7fc85866a6c8177a9669", "avgDurationSecs": 3.324, "validDurationSecs": 1974841.109, "totalHrs": 656.11, "validHrs": 548.56 }, "cy": { "buckets": { "dev": 4850, "invalidated": 4003, "other": 19215, "reported": 148, "test": 4850, "train": 6845, "validated": 83149 }, "reportedSentences": 149, "duration": 510753249, "clips": 106367, "splits": { "accent": { "united_kingdom": 0.53, "": 0.46, "other": 0.01 }, "age": { "fourties": 0.16, "twenties": 0.14, "sixties": 0.07, "fifties": 
0.08, "": 0.42, "thirties": 0.09, "seventies": 0.01, "eighties": 0, "teens": 0.02 }, "gender": { "male": 0.33, "female": 0.25, "": 0.41, "other": 0.01 } }, "users": 1655, "size": 3800150610, "checksum": "1942276a10d0184a605fcb23fc6b6e1163b7b5be8b23d3596c8385563303f5cc", "avgDurationSecs": 4.802, "validDurationSecs": 399265.015, "totalHrs": 141.87, "validHrs": 110.9 }, "ca": { "buckets": { "dev": 15957, "invalidated": 26754, "other": 57721, "reported": 1636, "test": 15959, "train": 389949, "validated": 521535 }, "reportedSentences": 1626, "duration": 3303096258, "clips": 606010, "splits": { "accent": { "valencian": 0.05, "central": 0.7, "": 0.21, "other": 0, "balearic": 0.01, "northwestern": 0.03, "northern": 0.01, "learner_es": 0 }, "age": { "thirties": 0.12, "fifties": 0.22, "fourties": 0.14, "twenties": 0.07, "": 0.17, "sixties": 0.25, "teens": 0.01, "seventies": 0.04, "eighties": 0, "nineties": 0 }, "gender": { "male": 0.62, "": 0.18, "female": 0.21, "other": 0 } }, "users": 6095, "size": 24307605172, "checksum": "14e0f140fcacc2db0b1ef3ea63c7f29904d087dd1f44173ecc173bac15b4e6ef", "avgDurationSecs": 5.451, "validDurationSecs": 2842659.869, "totalHrs": 917.52, "validHrs": 789.62 }, "de": { "buckets": { "dev": 15907, "invalidated": 41074, "other": 8836, "reported": 5862, "test": 15907, "train": 360664, "validated": 684794 }, "reportedSentences": 5839, "duration": 3727824149, "clips": 734704, "splits": { "accent": { "russia": 0, "germany": 0.63, "": 0.32, "austria": 0.03, "liechtenstein": 0, "switzerland": 0.01, "france": 0, "other": 0, "poland": 0, "united_kingdom": 0, "hungary": 0, "netherlands": 0, "namibia": 0, "slovakia": 0, "united_states": 0, "italy": 0, "czechia": 0, "brazil": 0, "canada": 0, "turkey": 0, "slovenia": 0, "luxembourg": 0, "belgium": 0, "denmark": 0, "greece": 0, "lithuania": 0, "finland": 0, "bulgaria": 0 }, "age": { "twenties": 0.21, "fourties": 0.19, "": 0.28, "thirties": 0.16, "teens": 0.03, "sixties": 0.03, "fifties": 0.1, "seventies": 0, 
"eighties": 0, "nineties": 0 }, "gender": { "male": 0.62, "": 0.29, "female": 0.09, "other": 0.01 } }, "users": 15620, "size": 27463071197, "checksum": "2a7ae9e77c61bb8810b0f102757030709ea81bdf79033199e019c2efa90124bb", "avgDurationSecs": 5.074, "validDurationSecs": 3474585.153, "totalHrs": 1035.5, "validHrs": 965.16 }, "tt": { "buckets": { "dev": 2839, "invalidated": 329, "other": 825, "reported": 3, "test": 4686, "train": 10408, "validated": 27468 }, "reportedSentences": 4, "duration": 107001962, "clips": 28622, "splits": { "accent": { "": 1 }, "age": { "": 0.19, "thirties": 0.73, "twenties": 0.05, "sixties": 0, "fifties": 0.01, "teens": 0, "fourties": 0, "seventies": 0.01 }, "gender": { "": 0.19, "male": 0.79, "female": 0.02 } }, "users": 203, "size": 794848701, "checksum": "722d8c923d4b95236c73a5f9e46f39a959ade98b38191863bf81635cdbedf336", "avgDurationSecs": 3.738, "validDurationSecs": 102687.789, "totalHrs": 29.72, "validHrs": 28.52 }, "ta": { "buckets": { "dev": 11244, "invalidated": 4734, "other": 5693, "reported": 3086, "test": 11245, "train": 35340, "validated": 115644 }, "reportedSentences": 3086, "duration": 778360200, "clips": 126071, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.11, "thirties": 0.13, "": 0.59, "fourties": 0.04, "seventies": 0.04, "fifties": 0.02, "teens": 0.05, "sixties": 0.01, "eighties": 0 }, "gender": { "male": 0.22, "": 0.57, "other": 0, "female": 0.21 } }, "users": 580, "size": 4726923179, "checksum": "0a6dcc5016ce5a1df7f950443ffcdd408dacb2884a721a8de85600fcdd84e4ab", "avgDurationSecs": 6.174, "validDurationSecs": 713984.08, "totalHrs": 216.21, "validHrs": 198.32 }, "ru": { "buckets": { "dev": 8422, "invalidated": 5469, "other": 10810, "reported": 238, "test": 8425, "train": 17070, "validated": 99709 }, "reportedSentences": 232, "duration": 623025432, "clips": 115988, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.44, "teens": 0.09, "": 0.17, "fourties": 0.15, "thirties": 0.12, "fifties": 0.04, "sixties": 0, 
"seventies": 0 }, "gender": { "male": 0.66, "": 0.17, "other": 0, "female": 0.17 } }, "users": 2136, "size": 4536990731, "checksum": "41e88ff75969f43082d15270231b95e6f83151cafdc571fff9584f441c4e2b0a", "avgDurationSecs": 5.371, "validDurationSecs": 535583.36, "totalHrs": 173.06, "validHrs": 148.77 }, "nl": { "buckets": { "dev": 9911, "invalidated": 4641, "other": 27, "reported": 294, "test": 10131, "train": 26121, "validated": 79188 }, "reportedSentences": 295, "duration": 355904749, "clips": 83856, "splits": { "accent": { "": 0.24, "netherlands": 0.67, "belgium": 0.08, "other": 0, "germany": 0, "france": 0, "suriname": 0.01, "curacao": 0, "aruba": 0 }, "age": { "": 0.41, "twenties": 0.23, "fourties": 0.15, "thirties": 0.1, "teens": 0.03, "fifties": 0.08, "sixties": 0.01, "nineties": 0 }, "gender": { "": 0.42, "male": 0.48, "female": 0.11, "other": 0 } }, "users": 1352, "size": 2490744175, "checksum": "5c8e3bd85eab4b1a4fdcf17ed776433277b60d520cffd5524649d2c3f3ab354f", "avgDurationSecs": 4.244, "validDurationSecs": 336092.65, "totalHrs": 98.86, "validHrs": 93.35 }, "it": { "buckets": { "dev": 14764, "invalidated": 16248, "other": 2947, "reported": 4913, "test": 14765, "train": 131041, "validated": 194732 }, "reportedSentences": 4909, "duration": 1142277348, "clips": 213927, "splits": { "accent": { "": 0.96, "other": 0.04 }, "age": { "thirties": 0.16, "twenties": 0.21, "": 0.3, "fifties": 0.14, "fourties": 0.14, "seventies": 0, "sixties": 0.03, "teens": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "female": 0.12, "male": 0.58, "": 0.3, "other": 0 } }, "users": 6407, "size": 8054153738, "checksum": "c2ccf746740c0ca0e4cf2f57e9b749a2c8574d0828e6ba73c34e921136fef91b", "avgDurationSecs": 5.34, "validDurationSecs": 1039784.378, "totalHrs": 317.29, "validHrs": 288.82 }, "eu": { "buckets": { "dev": 5358, "invalidated": 5435, "other": 23339, "reported": 41, "test": 5397, "train": 8027, "validated": 63942 }, "reportedSentences": 42, "duration": 476900331, "clips": 92716, 
"splits": { "accent": { "mendebalekoa": 0.31, "erdialdekoa_nafarra": 0.3, "": 0.39, "nafarlapurtarra_zuberoatarra": 0.01 }, "age": { "fourties": 0.14, "thirties": 0.08, "fifties": 0.14, "twenties": 0.38, "": 0.22, "teens": 0.03, "sixties": 0.01, "seventies": 0 }, "gender": { "male": 0.5, "female": 0.25, "": 0.23, "other": 0.02 } }, "users": 1050, "size": 3689025098, "checksum": "d80cb1f8fd062559e60bee82723323a426100ebe1cf61d3cbe911dbc29428980", "avgDurationSecs": 5.144, "validDurationSecs": 328896.425, "totalHrs": 132.47, "validHrs": 91.36 }, "tr": { "buckets": { "dev": 3209, "invalidated": 2350, "other": 4448, "reported": 239, "test": 3209, "train": 3979, "validated": 29643 }, "reportedSentences": 240, "duration": 136532475, "clips": 36441, "splits": { "accent": { "": 0.98, "other": 0.02 }, "age": { "": 0.26, "thirties": 0.17, "twenties": 0.47, "teens": 0.03, "fourties": 0.02, "fifties": 0.04, "sixties": 0.01 }, "gender": { "": 0.26, "male": 0.68, "female": 0.06, "other": 0 } }, "users": 960, "size": 939617663, "checksum": "59ad7c27129a4dc4cb55d03135f2a16ad2d5db73178adc576c3416a63ae45bd9", "avgDurationSecs": 3.747, "validDurationSecs": 111062.599, "totalHrs": 37.92, "validHrs": 30.85 }, "ar": { "buckets": { "dev": 10146, "invalidated": 14507, "other": 29886, "reported": 1957, "test": 10284, "train": 27172, "validated": 74115 }, "reportedSentences": 1950, "duration": 493814574, "clips": 118508, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.12, "": 0.58, "twenties": 0.26, "fourties": 0.01, "teens": 0.03, "fifties": 0, "sixties": 0, "nineties": 0 }, "gender": { "female": 0.17, "": 0.57, "male": 0.25, "other": 0 } }, "users": 1052, "size": 2947038610, "checksum": "5fdfae60464ec9298eccaf2b567494133c2592ef9de596145f1d308439fc840a", "avgDurationSecs": 4.167, "validDurationSecs": 308832.038, "totalHrs": 137.17, "validHrs": 85.78 }, "zh-TW": { "buckets": { "dev": 3827, "invalidated": 4064, "other": 23642, "reported": 81, "test": 3827, "train": 4972, "validated": 
67491 }, "reportedSentences": 82, "duration": 308410222, "clips": 95197, "splits": { "accent": { "": 0.49, "taipei_city": 0.16, "other": 0.08, "kaohsiung_city": 0.02, "new_taipei_city": 0.05, "tainan_city": 0.04, "taichung_city": 0.03, "yilan_county": 0.01, "hsinchu_county": 0, "hong_kong": 0.01, "taoyuan_city": 0.02, "hsinchu_city": 0.02, "chiayi_city": 0.01, "yunlin_county": 0.02, "nantou_county": 0, "changhua_county": 0.02, "pingtung_county": 0.01, "chiayi_county": 0, "keelung_city": 0, "hualien_county": 0.01, "kinmen_county": 0 }, "age": { "thirties": 0.22, "twenties": 0.37, "teens": 0.05, "": 0.24, "fifties": 0.01, "seventies": 0, "fourties": 0.12, "sixties": 0 }, "gender": { "male": 0.49, "": 0.23, "female": 0.26, "other": 0.03 } }, "users": 1584, "size": 2333969324, "checksum": "9f7e8b2cf631b0a837f15fb3fa4fddc75e503441a2111de6db94b90af273d397", "avgDurationSecs": 3.24, "validDurationSecs": 218650.948, "totalHrs": 85.66, "validHrs": 60.73 }, "br": { "buckets": { "dev": 2107, "invalidated": 735, "other": 11396, "reported": 182, "test": 2079, "train": 2683, "validated": 10818 }, "reportedSentences": 183, "duration": 70487467, "clips": 22949, "splits": { "accent": { "other": 0, "": 0.74, "kerneveg": 0.19, "gwenedeg": 0.05, "leoneg": 0.02, "tregerieg": 0 }, "age": { "twenties": 0.25, "": 0.33, "fifties": 0.06, "fourties": 0.07, "thirties": 0.08, "sixties": 0.18, "seventies": 0.02, "teens": 0.01 }, "gender": { "male": 0.64, "": 0.34, "female": 0.02 } }, "users": 170, "size": 521023883, "checksum": "4269d55e5882d47e4d3a2b8356120fc5ef1b02d3d68d7541ef72b741aef6e81b", "avgDurationSecs": 3.071, "validDurationSecs": 33227.305, "totalHrs": 19.57, "validHrs": 9.22 }, "pt": { "buckets": { "dev": 7066, "invalidated": 3100, "other": 19962, "reported": 2128, "test": 7077, "train": 12325, "validated": 71203 }, "reportedSentences": 2123, "duration": 403495315, "clips": 94265, "splits": { "accent": { "": 1 }, "age": { "": 0.19, "twenties": 0.39, "teens": 0.03, "thirties": 0.26, 
"fourties": 0.1, "sixties": 0, "fifties": 0.03, "seventies": 0 }, "gender": { "": 0.19, "male": 0.75, "female": 0.04, "other": 0.02 } }, "users": 2038, "size": 2724725035, "checksum": "5c136caea5899b123a390822ff18519469e26e54dc695cd2c146d9880a489def", "avgDurationSecs": 4.28, "validDurationSecs": 304779.896, "totalHrs": 112.08, "validHrs": 84.66 }, "eo": { "buckets": { "dev": 14667, "invalidated": 48298, "other": 9240, "reported": 1325, "test": 14663, "train": 123729, "validated": 445748 }, "reportedSentences": 1322, "duration": 3040825687, "clips": 503286, "splits": { "accent": { "": 0.41, "internacia": 0.59, "other": 0 }, "age": { "twenties": 0.49, "thirties": 0.16, "": 0.18, "fourties": 0.07, "fifties": 0.01, "seventies": 0, "teens": 0.08, "sixties": 0, "eighties": 0.01 }, "gender": { "male": 0.72, "": 0.19, "female": 0.09, "other": 0 } }, "users": 1066, "size": 18560215763, "checksum": "33becb581e25635ee56f67ab446e5ce31a957e10e69c49feedcefe24ef22bd6e", "avgDurationSecs": 6.042, "validDurationSecs": 2693184.329, "totalHrs": 844.67, "validHrs": 748.1 }, "zh-CN": { "buckets": { "dev": 9338, "invalidated": 5883, "other": 10014, "reported": 383, "test": 9338, "train": 21279, "validated": 42488 }, "reportedSentences": 383, "duration": 312880236, "clips": 58385, "splits": { "accent": { "110000": 0.05, "120000": 0, "130000": 0.02, "140000": 0.02, "150000": 0, "210000": 0.02, "220000": 0, "230000": 0.04, "310000": 0.04, "320000": 0.05, "330000": 0.04, "340000": 0.02, "350000": 0.01, "360000": 0.02, "370000": 0.03, "410000": 0.03, "420000": 0.02, "430000": 0.01, "440000": 0.03, "450000": 0.02, "460000": 0.01, "500000": 0.02, "510000": 0.03, "520000": 0, "530000": 0, "610000": 0.01, "620000": 0, "640000": 0, "650000": 0, "710000": 0, "810000": 0, "": 0.44 }, "age": { "": 0.38, "teens": 0.09, "twenties": 0.37, "thirties": 0.12, "fourties": 0.03, "nineties": 0, "fifties": 0, "sixties": 0 }, "gender": { "": 0.38, "male": 0.51, "female": 0.1, "other": 0.01 } }, "users": 3792, 
"size": 2369273560, "checksum": "e95b9b8320234dc8310fbf58d9068d3dd51bec7c234e878b522daee7eb19a726", "avgDurationSecs": 5.359, "validDurationSecs": 227689.569, "totalHrs": 86.91, "validHrs": 63.24 }, "id": { "buckets": { "dev": 2879, "invalidated": 2232, "other": 20193, "reported": 243, "test": 3038, "train": 3890, "validated": 20695 }, "reportedSentences": 244, "duration": 178210812, "clips": 43120, "splits": { "accent": { "": 1 }, "age": { "": 0.26, "twenties": 0.4, "thirties": 0.06, "teens": 0.25, "fifties": 0, "fourties": 0.02 }, "gender": { "": 0.26, "male": 0.41, "female": 0.28, "other": 0.05 } }, "users": 340, "size": 1147111673, "checksum": "a7226a14c7aea42a714490932ea86c1543d2973d36d8b7b1db391ee9c68c1f79", "avgDurationSecs": 4.133, "validDurationSecs": 85530.444, "totalHrs": 49.5, "validHrs": 23.75 }, "ia": { "buckets": { "dev": 1805, "invalidated": 313, "other": 2988, "reported": 258, "test": 1602, "train": 5079, "validated": 10629 }, "reportedSentences": 254, "duration": 57838476, "clips": 13930, "splits": { "accent": { "": 1 }, "age": { "seventies": 0.21, "fourties": 0.3, "": 0.39, "twenties": 0.05, "thirties": 0.02, "teens": 0, "fifties": 0.02, "sixties": 0 }, "gender": { "male": 0.6, "": 0.39, "female": 0.01 } }, "users": 53, "size": 394424550, "checksum": "31df0a17185d80fadcb5fd7157d7b4f5508b9d126fde872dfc90e3e986147e50", "avgDurationSecs": 4.152, "validDurationSecs": 44132.46, "totalHrs": 16.06, "validHrs": 12.25 }, "lv": { "buckets": { "dev": 1766, "invalidated": 156, "other": 1287, "reported": 23, "test": 2084, "train": 3197, "validated": 7268 }, "reportedSentences": 24, "duration": 29698001, "clips": 8711, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.48, "fourties": 0.03, "": 0.17, "twenties": 0.29, "teens": 0.03, "fifties": 0 }, "gender": { "male": 0.69, "female": 0.14, "": 0.17 } }, "users": 113, "size": 220119979, "checksum": "415f5afd12b0de1de6f72dfa165d7ff6cc3b06a73744924dfcebe94c7a198c1d", "avgDurationSecs": 3.409, 
"validDurationSecs": 24778.449, "totalHrs": 8.24, "validHrs": 6.88 }, "ja": { "buckets": { "dev": 2038, "invalidated": 1529, "other": 843, "reported": 98, "test": 3676, "train": 6391, "validated": 21026 }, "reportedSentences": 98, "duration": 107886741, "clips": 23398, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.37, "": 0.24, "teens": 0.04, "fifties": 0.01, "thirties": 0.09, "fourties": 0.26, "sixties": 0 }, "gender": { "male": 0.53, "": 0.2, "female": 0.26, "other": 0 } }, "users": 397, "size": 670230519, "checksum": "9a1f5ff0dc2b7c75c58dff4df83679a3777a3ddc9558cc3bf05456773340f45f", "avgDurationSecs": 4.611, "validDurationSecs": 96949.595, "totalHrs": 29.96, "validHrs": 26.93 }, "rw": { "buckets": { "dev": 16315, "invalidated": 214508, "other": 43621, "reported": 618, "test": 16175, "train": 926594, "validated": 1354163 }, "reportedSentences": 619, "duration": 8118818057, "clips": 1612292, "splits": { "accent": { "": 1 }, "age": { "": 0.03, "twenties": 0.63, "thirties": 0.12, "teens": 0.19, "fourties": 0.02, "fifties": 0 }, "gender": { "": 0.09, "male": 0.57, "female": 0.34, "other": 0 } }, "users": 1023, "size": 58295067948, "checksum": "90936e232efb021cedaff7a284100631de6ce6d2a6ba7107c120c5893d2163bc", "avgDurationSecs": 5.036, "validDurationSecs": 6818989.995, "totalHrs": 2255.22, "validHrs": 1894.16 }, "sv-SE": { "buckets": { "dev": 4599, "invalidated": 1143, "other": 8581, "reported": 561, "test": 4620, "train": 6431, "validated": 32630 }, "reportedSentences": 562, "duration": 164469579, "clips": 42354, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.26, "": 0.18, "teens": 0.03, "fifties": 0.03, "twenties": 0.11, "fourties": 0.38, "sixties": 0 }, "gender": { "male": 0.49, "": 0.18, "female": 0.33, "other": 0 } }, "users": 682, "size": 1053186277, "checksum": "38fc97b07c29b7da0941e0c294e2487f294bd2c6055a541e636f20d4834fec32", "avgDurationSecs": 3.883, "validDurationSecs": 126709.221, "totalHrs": 45.68, "validHrs": 35.19 }, "cnh": { 
"buckets": { "dev": 761, "invalidated": 435, "other": 2914, "reported": 8, "test": 760, "train": 814, "validated": 2452 }, "reportedSentences": 9, "duration": 20673132, "clips": 5801, "splits": { "accent": { "": 0.85, "other": 0.15 }, "age": { "": 0.51, "twenties": 0.36, "fourties": 0.01, "teens": 0.02, "thirties": 0.08, "fifties": 0.02 }, "gender": { "": 0.51, "male": 0.33, "female": 0.16 } }, "users": 298, "size": 161377408, "checksum": "5e7640d4ce9707fe90b8d4acac21b1fca70cf23ebfe80884968ee40477b24281", "avgDurationSecs": 3.564, "validDurationSecs": 8738.238, "totalHrs": 5.74, "validHrs": 2.42 }, "et": { "buckets": { "dev": 2608, "invalidated": 5412, "other": 653, "reported": 441, "test": 2609, "train": 3097, "validated": 17238 }, "reportedSentences": 439, "duration": 157084742, "clips": 23303, "splits": { "accent": { "": 1 }, "age": { "": 0.19, "thirties": 0.07, "twenties": 0.69, "fourties": 0.05, "fifties": 0, "seventies": 0, "teens": 0 }, "gender": { "": 0.19, "male": 0.56, "female": 0.24, "other": 0 } }, "users": 718, "size": 1107863883, "checksum": "3ebda1aa5712c53f331bde49c3acea014db87ad5f45c48249ce932fe2d8a6057", "avgDurationSecs": 6.741, "validDurationSecs": 116200.78, "totalHrs": 43.63, "validHrs": 32.27 }, "ky": { "buckets": { "dev": 1613, "invalidated": 5390, "other": 320, "reported": 32, "test": 1613, "train": 1787, "validated": 29334 }, "reportedSentences": 33, "duration": 159320640, "clips": 35044, "splits": { "accent": { "": 0.95, "other": 0.05 }, "age": { "thirties": 0.08, "": 0.06, "fourties": 0.01, "twenties": 0.67, "teens": 0.19 }, "gender": { "male": 0.54, "": 0.11, "female": 0.36, "other": 0 } }, "users": 227, "size": 1032606586, "checksum": "7773a0d6f88360a288774c4cb55a38fc6e4660c8ed0653df25755de08ac66d37", "avgDurationSecs": 4.546, "validDurationSecs": 133361.25, "totalHrs": 44.25, "validHrs": 37.04 }, "ro": { "buckets": { "dev": 2661, "invalidated": 658, "other": 18357, "reported": 127, "test": 3172, "train": 4513, "validated": 10352 }, 
"reportedSentences": 128, "duration": 114205063, "clips": 29367, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.16, "teens": 0.02, "": 0.1, "fourties": 0.03, "sixties": 0, "twenties": 0.68, "fifties": 0.01 }, "gender": { "male": 0.73, "": 0.09, "female": 0.18, "other": 0 } }, "users": 302, "size": 728306654, "checksum": "928f4de2157d286a2fa72a8f59f0a979d178f76a11f1dbe9a398042ae840bb52", "avgDurationSecs": 3.889, "validDurationSecs": 40257.8, "totalHrs": 31.72, "validHrs": 11.18 }, "hsb": { "buckets": { "dev": 172, "invalidated": 227, "other": 47, "reported": 56, "test": 402, "train": 808, "validated": 1382 }, "reportedSentences": 57, "duration": 10103328, "clips": 1656, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.55, "": 0.17, "thirties": 0.11, "sixties": 0, "seventies": 0.03, "twenties": 0.11, "fifties": 0.03 }, "gender": { "male": 0.83, "": 0.17, "other": 0 } }, "users": 19, "size": 79355401, "checksum": "d1f29fdb22db8d7d7a1048110da577369254cdee553549357c3a057d2a70e7a5", "avgDurationSecs": 6.101, "validDurationSecs": 8431.642, "totalHrs": 2.8, "validHrs": 2.34 }, "el": { "buckets": { "dev": 1626, "invalidated": 516, "other": 8001, "reported": 61, "test": 1703, "train": 1975, "validated": 11637 }, "reportedSentences": 62, "duration": 83217454, "clips": 20154, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.45, "fourties": 0.16, "": 0.25, "twenties": 0.09, "fifties": 0.03, "teens": 0.01, "sixties": 0 }, "gender": { "male": 0.72, "": 0.25, "other": 0.02, "female": 0.02 } }, "users": 271, "size": 579685993, "checksum": "ab478fdf4d40377a802eba960f854e33a9b38cdce796bb4611374e8a9593297f", "avgDurationSecs": 4.129, "validDurationSecs": 48050.09, "totalHrs": 23.11, "validHrs": 13.34 }, "cs": { "buckets": { "dev": 6451, "invalidated": 1092, "other": 10399, "reported": 665, "test": 6449, "train": 10756, "validated": 41570 }, "reportedSentences": 662, "duration": 228937467, "clips": 53061, "splits": { "accent": { "": 1 }, "age": { "fourties": 
0.21, "": 0.36, "thirties": 0.13, "teens": 0.01, "twenties": 0.27, "fifties": 0.02, "sixties": 0, "seventies": 0 }, "gender": { "male": 0.62, "": 0.35, "female": 0.02 } }, "users": 484, "size": 1648345476, "checksum": "cd7362c3cec0a9470f9ed0f282ec98f114291726ee548463a4641e78f7b144f7", "avgDurationSecs": 4.315, "validDurationSecs": 179358.295, "totalHrs": 63.59, "validHrs": 49.82 }, "pl": { "buckets": { "dev": 6724, "invalidated": 5268, "other": 14002, "reported": 492, "test": 6724, "train": 11286, "validated": 105669 }, "reportedSentences": 492, "duration": 549521030, "clips": 124939, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.29, "": 0.25, "teens": 0.02, "thirties": 0.32, "fourties": 0.12, "fifties": 0.01, "nineties": 0.01, "sixties": 0 }, "gender": { "male": 0.59, "": 0.25, "female": 0.15, "other": 0.01 } }, "users": 2918, "size": 4016577061, "checksum": "bf78e061160228cd31d8e9b5fcf0ef81d9adf552a78ffb3018fb9ffb0c343f11", "avgDurationSecs": 4.398, "validDurationSecs": 464765.507, "totalHrs": 152.64, "validHrs": 129.1 }, "rm-sursilv": { "buckets": { "dev": 1170, "invalidated": 655, "other": 2133, "reported": 9, "test": 1240, "train": 1505, "validated": 3915 }, "reportedSentences": 10, "duration": 36887225, "clips": 6703, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.03, "twenties": 0.11, "": 0.64, "teens": 0.06, "fourties": 0.17 }, "gender": { "male": 0.16, "female": 0.2, "": 0.64, "other": 0 } }, "users": 80, "size": 281830812, "checksum": "fc1052e2d0e726cc8755f25a628c6d19889d1d53fe4682b972de34295581acb6", "avgDurationSecs": 5.503, "validDurationSecs": 21544.605, "totalHrs": 10.24, "validHrs": 5.98 }, "rm-vallader": { "buckets": { "dev": 368, "invalidated": 391, "other": 742, "reported": 30, "test": 421, "train": 642, "validated": 1438 }, "reportedSentences": 29, "duration": 14887814, "clips": 2571, "splits": { "accent": { "": 1 }, "age": { "": 0.36, "fourties": 0.41, "twenties": 0.14, "thirties": 0.06, "fifties": 0, "sixties": 0.03 }, 
"gender": { "": 0.36, "male": 0.44, "female": 0.2, "other": 0.01 } }, "users": 49, "size": 114127527, "checksum": "ef2a1b540077fcffe0d3c11910068c425f27f015ab4f3999c5b12f7636d7e707", "avgDurationSecs": 5.791, "validDurationSecs": 8326.984, "totalHrs": 4.13, "validHrs": 2.31 }, "mn": { "buckets": { "dev": 1819, "invalidated": 713, "other": 3231, "reported": 17, "test": 1880, "train": 2184, "validated": 8012 }, "reportedSentences": 18, "duration": 65414060, "clips": 11956, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.23, "": 0.26, "twenties": 0.41, "fourties": 0.01, "teens": 0.02, "nineties": 0.06 }, "gender": { "male": 0.35, "": 0.26, "female": 0.32, "other": 0.06 } }, "users": 435, "size": 502934499, "checksum": "ac4c2ec4b1b39992b1709f03ca5e34e20390398d99eb6d5182201e22487d8cb2", "avgDurationSecs": 5.471, "validDurationSecs": 43835.518, "totalHrs": 18.17, "validHrs": 12.17 }, "zh-HK": { "buckets": { "dev": 5369, "invalidated": 3430, "other": 11285, "reported": 587, "test": 5369, "train": 7927, "validated": 81797 }, "reportedSentences": 576, "duration": 409273656, "clips": 96512, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.13, "thirties": 0.11, "": 0.42, "teens": 0.02, "fifties": 0.03, "seventies": 0, "sixties": 0, "twenties": 0.29 }, "gender": { "male": 0.4, "": 0.37, "female": 0.22, "other": 0.01 } }, "users": 2656, "size": 3048576629, "checksum": "0cc748decf1edf5050000cec615d50b76e5b3bdd4981d3216bcc75f9e3621329", "avgDurationSecs": 4.241, "validDurationSecs": 346872.485, "totalHrs": 113.68, "validHrs": 96.35 }, "ab": { "buckets": { "dev": 299, "invalidated": 259, "other": 860, "reported": 122, "test": 301, "train": 405, "validated": 1166 }, "reportedSentences": 122, "duration": 14665608, "clips": 2285, "splits": { "accent": { "": 1 }, "age": { "seventies": 0.22, "thirties": 0.34, "": 0.06, "teens": 0.11, "twenties": 0.01, "fifties": 0.14, "sixties": 0.02, "fourties": 0.1 }, "gender": { "male": 0.48, "female": 0.46, "": 0.06 } }, "users": 35, 
"size": 96828631, "checksum": "e9c7e1321547dffa0ca987264c6d44ed4eb948ee34e3959fecd835db4341c692", "avgDurationSecs": 6.418, "validDurationSecs": 7483.632, "totalHrs": 4.07, "validHrs": 2.07 }, "cv": { "buckets": { "dev": 582, "invalidated": 1357, "other": 8260, "reported": 138, "test": 810, "train": 1212, "validated": 3756 }, "reportedSentences": 134, "duration": 66740268, "clips": 13373, "splits": { "accent": { "other": 0, "": 1 }, "age": { "twenties": 0.49, "": 0.26, "fourties": 0.08, "thirties": 0.01, "teens": 0.15, "fifties": 0.02 }, "gender": { "male": 0.49, "": 0.23, "female": 0.28 } }, "users": 93, "size": 486325136, "checksum": "0c341ac2370063f3610f108f2f126698701ae43432132f3520161f8e1e1c6e24", "avgDurationSecs": 4.991, "validDurationSecs": 18744.967, "totalHrs": 18.53, "validHrs": 5.2 }, "uk": { "buckets": { "dev": 4332, "invalidated": 1782, "other": 5956, "reported": 501, "test": 4332, "train": 5861, "validated": 41069 }, "reportedSentences": 502, "duration": 240050364, "clips": 48807, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.27, "teens": 0.06, "": 0.27, "fourties": 0.16, "thirties": 0.24, "fifties": 0, "sixties": 0 }, "gender": { "male": 0.55, "female": 0.18, "": 0.27 } }, "users": 615, "size": 1694344342, "checksum": "42b93daf79a4136869c98408a16ab0125fb3f55c6c780312745f1fcde7e8b0e7", "avgDurationSecs": 4.918, "validDurationSecs": 201992.099, "totalHrs": 66.68, "validHrs": 56.1 }, "mt": { "buckets": { "dev": 1550, "invalidated": 318, "other": 5963, "reported": 6, "test": 1646, "train": 1976, "validated": 6195 }, "reportedSentences": 7, "duration": 59261112, "clips": 12476, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.17, "": 0.27, "fourties": 0.17, "thirties": 0.09, "teens": 0.03, "fifties": 0.26, "sixties": 0.01 }, "gender": { "male": 0.26, "": 0.27, "female": 0.46, "other": 0.01 } }, "users": 200, "size": 444369609, "checksum": "f2c95d42415041396e89bf048d4179e61978970e3ee130d48d2fcefe3eeeca99", "avgDurationSecs": 4.75, 
"validDurationSecs": 29426.306, "totalHrs": 16.46, "validHrs": 8.17 }, "as": { "buckets": { "dev": 116, "invalidated": 45, "other": 0, "reported": 9, "test": 281, "train": 508, "validated": 905 }, "reportedSentences": 10, "duration": 5042009, "clips": 950, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.4, "": 0.55, "thirties": 0.05 }, "gender": { "male": 0.47, "": 0.53 } }, "users": 37, "size": 34898250, "checksum": "41c279c83d3c23e401092fd44518f74e957b63182ade5a447405dffe78775b93", "avgDurationSecs": 5.307, "validDurationSecs": 4803.177, "totalHrs": 1.4, "validHrs": 1.33 }, "ka": { "buckets": { "dev": 1182, "invalidated": 252, "other": 44, "reported": 31, "test": 1225, "train": 1504, "validated": 4562 }, "reportedSentences": 32, "duration": 25834176, "clips": 4858, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.41, "thirties": 0.21, "": 0.34, "fourties": 0.02, "fifties": 0, "teens": 0.01 }, "gender": { "male": 0.45, "female": 0.2, "": 0.35 } }, "users": 114, "size": 173494246, "checksum": "2bcf23b5c159bd4cdd3f5044953544d0d38ef2aaf15b39a098199ecd46e931b5", "avgDurationSecs": 5.318, "validDurationSecs": 24260.089, "totalHrs": 7.17, "validHrs": 6.73 }, "fy-NL": { "buckets": { "dev": 3021, "invalidated": 1799, "other": 31399, "reported": 366, "test": 3014, "train": 3706, "validated": 21520 }, "reportedSentences": 364, "duration": 272269827, "clips": 54718, "splits": { "accent": { "": 1 }, "age": { "": 0.58, "fifties": 0.13, "thirties": 0.04, "twenties": 0.02, "fourties": 0.08, "sixties": 0.14, "seventies": 0 }, "gender": { "": 0.59, "male": 0.08, "female": 0.34 } }, "users": 678, "size": 1821742478, "checksum": "6fe37b543b08c0fd2ae734943bfdea05ce3b36d52bd34bf621de830d62393e3b", "avgDurationSecs": 4.976, "validDurationSecs": 107080.79, "totalHrs": 75.63, "validHrs": 29.74 }, "dv": { "buckets": { "dev": 2116, "invalidated": 1389, "other": 13422, "reported": 44, "test": 2224, "train": 2694, "validated": 23484 }, "reportedSentences": 45, "duration": 
194045823, "clips": 38295, "splits": { "accent": { "": 1 }, "age": { "": 0.22, "twenties": 0.18, "thirties": 0.38, "fourties": 0.21, "teens": 0.01, "nineties": 0 }, "gender": { "": 0.22, "male": 0.29, "female": 0.5 } }, "users": 271, "size": 1258575355, "checksum": "30cea79852196258aa8f6246ebbaf9c971d2a6d5d45c46b3893bb275d6ab2319", "avgDurationSecs": 5.067, "validDurationSecs": 118996.53, "totalHrs": 53.9, "validHrs": 33.05 }, "pa-IN": { "buckets": { "dev": 255, "invalidated": 67, "other": 1303, "reported": 233, "test": 343, "train": 526, "validated": 1124 }, "reportedSentences": 232, "duration": 11979854, "clips": 2494, "splits": { "accent": { "": 1 }, "age": { "": 0.23, "fourties": 0.05, "fifties": 0.01, "thirties": 0.44, "twenties": 0.27, "sixties": 0, "teens": 0 }, "gender": { "": 0.23, "male": 0.77, "female": 0 } }, "users": 46, "size": 87690513, "checksum": "30f5448b1fec91152f29bd1b14df15763931b3b557cad11653e631b4a0f1294d", "avgDurationSecs": 4.803, "validDurationSecs": 5399.1, "totalHrs": 3.32, "validHrs": 1.49 }, "vi": { "buckets": { "dev": 0, "invalidated": 158, "other": 10293, "reported": 160, "test": 761, "train": 2170, "validated": 2933 }, "reportedSentences": 159, "duration": 53409536, "clips": 13384, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.02, "twenties": 0.17, "": 0.2, "teens": 0.23, "seventies": 0, "fourties": 0, "sixties": 0.37 }, "gender": { "male": 0.54, "": 0.2, "female": 0.23, "other": 0.03 } }, "users": 138, "size": 310698285, "checksum": "a8b4fe0a1dbbb2d6828ce55a7c463e63f3a30f27dfb8674dcac4b737563d0f72", "avgDurationSecs": 3.991, "validDurationSecs": 11704.286, "totalHrs": 14.83, "validHrs": 3.25 }, "or": { "buckets": { "dev": 133, "invalidated": 82, "other": 4642, "reported": 4, "test": 112, "train": 404, "validated": 666 }, "reportedSentences": 5, "duration": 27507192, "clips": 5390, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.16, "": 0.04, "thirties": 0.8, "fourties": 0, "teens": 0 }, "gender": { "male": 0.96, 
"": 0.04, "female": 0 } }, "users": 41, "size": 210775441, "checksum": "d6660c2a0b55e2270b32a6ec11f9d200a6df69cf4e0cc8b118c2dd4ceec71bb2", "avgDurationSecs": 5.103, "validDurationSecs": 3398.848, "totalHrs": 7.64, "validHrs": 0.94 }, "ga-IE": { "buckets": { "dev": 506, "invalidated": 711, "other": 3489, "reported": 9, "test": 509, "train": 529, "validated": 4045 }, "reportedSentences": 10, "duration": 28701207, "clips": 8245, "splits": { "accent": { "": 0.54, "connachta": 0.21, "mumhain": 0.12, "other": 0.04, "ulaidh": 0.09 }, "age": { "twenties": 0.27, "": 0.38, "thirties": 0.24, "fourties": 0.05, "sixties": 0, "teens": 0.01, "fifties": 0.05 }, "gender": { "male": 0.48, "": 0.38, "female": 0.13 } }, "users": 132, "size": 203941551, "checksum": "04eb7f57d9f5459c03900259f91233f9fe2234ef961b1a0537c5f21c6ad7ff66", "avgDurationSecs": 3.481, "validDurationSecs": 14080.823, "totalHrs": 7.97, "validHrs": 3.91 }, "fi": { "buckets": { "dev": 1493, "invalidated": 159, "other": 4230, "reported": 35, "test": 1599, "train": 2134, "validated": 5303 }, "reportedSentences": 36, "duration": 44351211, "clips": 9692, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.14, "": 0.32, "twenties": 0.11, "fourties": 0.4, "teens": 0.01, "fifties": 0.01 }, "gender": { "male": 0.26, "": 0.32, "female": 0.41, "other": 0 } }, "users": 139, "size": 268332961, "checksum": "2f8ff33359e55c628551bc3be9339c7e19281f378e1574e8c0814582c9b0a545", "avgDurationSecs": 4.576, "validDurationSecs": 24266.867, "totalHrs": 12.31, "validHrs": 6.74 }, "hu": { "buckets": { "dev": 3058, "invalidated": 525, "other": 3169, "reported": 74, "test": 3427, "train": 6014, "validated": 12532 }, "reportedSentences": 75, "duration": 78705469, "clips": 16226, "splits": { "accent": { "": 1 }, "age": { "teens": 0.09, "": 0.28, "thirties": 0.13, "twenties": 0.41, "fifties": 0.07, "fourties": 0.01, "sixties": 0.01 }, "gender": { "male": 0.62, "": 0.28, "female": 0.1 } }, "users": 169, "size": 518917337, "checksum": 
"2969cd2cf7701b5e91d134fea9f5a768511a32c2c945fb6cbfba99e61371ac5c", "avgDurationSecs": 4.851, "validDurationSecs": 60787.436, "totalHrs": 21.86, "validHrs": 16.88 }, "th": { "buckets": { "dev": 9712, "invalidated": 7529, "other": 90315, "reported": 2155, "test": 9712, "train": 23332, "validated": 107747 }, "reportedSentences": 2150, "duration": 918994080, "clips": 205591, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.2, "": 0.44, "thirties": 0.06, "fourties": 0.05, "teens": 0.05, "fifties": 0.2, "eighties": 0, "sixties": 0 }, "gender": { "male": 0.3, "": 0.44, "female": 0.24, "other": 0.01 } }, "users": 7212, "size": 5370380496, "checksum": "6aeb12c420ff786cec5ff355d685480a2d56ae408987d5b67f638f0ba28f0dc7", "avgDurationSecs": 4.47, "validDurationSecs": 481630.301, "totalHrs": 255.27, "validHrs": 133.78 }, "lt": { "buckets": { "dev": 3246, "invalidated": 528, "other": 1906, "reported": 117, "test": 3431, "train": 4743, "validated": 11423 }, "reportedSentences": 117, "duration": 71550188, "clips": 13857, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.34, "": 0.24, "thirties": 0.27, "fifties": 0.06, "sixties": 0.01, "teens": 0.03, "fourties": 0.05 }, "gender": { "male": 0.62, "": 0.24, "female": 0.15 } }, "users": 232, "size": 440325435, "checksum": "57d8a1ddcf6b890a8beb8d233353c07273cb1028adb58dd712210837fd828874", "avgDurationSecs": 5.163, "validDurationSecs": 58982.305, "totalHrs": 19.87, "validHrs": 16.38 }, "lg": { "buckets": { "dev": 3549, "invalidated": 2195, "other": 29407, "reported": 1354, "test": 4276, "train": 6626, "validated": 20190 }, "reportedSentences": 1344, "duration": 288018279, "clips": 51792, "splits": { "accent": { "": 1 }, "age": { "": 0.34, "thirties": 0.32, "twenties": 0.32, "fourties": 0.01, "fifties": 0.01 }, "gender": { "": 0.35, "female": 0.35, "male": 0.3 } }, "users": 260, "size": 1722860000, "checksum": "c4baed47d1a7f0c27345af792a266e27c4c262fe8801b8aeab43eeb51e373f4b", "avgDurationSecs": 5.561, "validDurationSecs": 
112277.747, "totalHrs": 80, "validHrs": 31.18 }, "hi": { "buckets": { "dev": 2020, "invalidated": 485, "other": 2126, "reported": 89, "test": 2095, "train": 2691, "validated": 6812 }, "reportedSentences": 90, "duration": 42645048, "clips": 9423, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.44, "fourties": 0.02, "": 0.31, "thirties": 0.19, "teens": 0.01, "fifties": 0.02 }, "gender": { "male": 0.65, "female": 0.04, "": 0.31 } }, "users": 214, "size": 252728592, "checksum": "d701712db4156ed62b77e689f4b201483192a81a22ddd8cc899b17ab242a53c4", "avgDurationSecs": 4.526, "validDurationSecs": 30828.618, "totalHrs": 11.84, "validHrs": 8.56 }, "bas": { "buckets": { "dev": 264, "invalidated": 423, "other": 330, "reported": 7, "test": 375, "train": 744, "validated": 1383 }, "reportedSentences": 8, "duration": 9398988, "clips": 2136, "splits": { "accent": { "": 1 }, "age": { "": 0.99, "fourties": 0.01 }, "gender": { "": 0.99, "female": 0.01 } }, "users": 28, "size": 52096262, "checksum": "46ee0a775b7bfaedb75fa95814bd6ecffb3456a221a1a28ff20e4e7dcee6ee72", "avgDurationSecs": 4.4, "validDurationSecs": 6085.581, "totalHrs": 2.61, "validHrs": 1.69 }, "sk": { "buckets": { "dev": 1340, "invalidated": 162, "other": 1531, "reported": 16, "test": 2173, "train": 3508, "validated": 9738 }, "reportedSentences": 17, "duration": 45524628, "clips": 11431, "splits": { "accent": { "": 1 }, "age": { "": 0.59, "thirties": 0.23, "twenties": 0.05, "fourties": 0.12, "teens": 0 }, "gender": { "": 0.58, "male": 0.38, "female": 0.01, "other": 0.02 } }, "users": 82, "size": 259299408, "checksum": "24bcf5e37d04b55d25f5299a1d569ca51223c0cf562d5ffceaef6f7dde5b6ca0", "avgDurationSecs": 3.983, "validDurationSecs": 38782.156, "totalHrs": 12.64, "validHrs": 10.77 }, "kmr": { "buckets": { "dev": 2358, "invalidated": 1452, "other": 4301, "reported": 576, "test": 2395, "train": 2858, "validated": 37023 }, "reportedSentences": 577, "duration": 189219096, "clips": 42776, "splits": { "accent": { "": 1 }, 
"age": { "": 0.5, "twenties": 0.3, "thirties": 0.06, "fourties": 0.04, "fifties": 0.09, "teens": 0.02, "sixties": 0 }, "gender": { "": 0.5, "male": 0.34, "female": 0.17 } }, "users": 278, "size": 1008380764, "checksum": "b9a736b9309659e7b4a728c271d8231fab48161927fc098dfca40383eb479728", "avgDurationSecs": 4.423, "validDurationSecs": 163770.773, "totalHrs": 52.56, "validHrs": 45.49 }, "bg": { "buckets": { "dev": 1104, "invalidated": 161, "other": 598, "reported": 26, "test": 953, "train": 1394, "validated": 3455 }, "reportedSentences": 27, "duration": 22042296, "clips": 4214, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.31, "thirties": 0.06, "": 0.37, "twenties": 0.25, "teens": 0.01 }, "gender": { "male": 0.59, "female": 0.04, "": 0.37 } }, "users": 39, "size": 129479983, "checksum": "7dd16e8748be72284498f084d93289d6d985eb5ad4a34c98431540be230fad7e", "avgDurationSecs": 5.231, "validDurationSecs": 18072.172, "totalHrs": 6.12, "validHrs": 5.02 }, "kk": { "buckets": { "dev": 154, "invalidated": 37, "other": 471, "reported": 0, "test": 173, "train": 204, "validated": 532 }, "reportedSentences": 1, "duration": 5195052, "clips": 1040, "splits": { "accent": { "": 1 }, "age": { "": 0.49, "thirties": 0.03, "twenties": 0.31, "teens": 0.07, "fifties": 0.1 }, "gender": { "": 0.5, "male": 0.47, "female": 0.03 } }, "users": 66, "size": 29597030, "checksum": "1ea0f3832aafffa09148f191e12abe66f3b0d1e547a7ca55ed25091cab4f0554", "avgDurationSecs": 4.995, "validDurationSecs": 2657.469, "totalHrs": 1.44, "validHrs": 0.73 }, "ba": { "buckets": { "dev": 14481, "invalidated": 7834, "other": 5703, "reported": 857, "test": 14466, "train": 114090, "validated": 201011 }, "reportedSentences": 854, "duration": 949797828, "clips": 214548, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.17, "": 0.3, "fourties": 0.06, "fifties": 0.05, "twenties": 0.17, "sixties": 0.19, "seventies": 0, "teens": 0.04 }, "gender": { "male": 0.3, "": 0.3, "female": 0.4 } }, "users": 861, "size": 
5325656457, "checksum": "bf91393c637f0b41a750e4597c6facf518bf1f5b2f8c8e321dab164ce2485ca1", "avgDurationSecs": 4.427, "validDurationSecs": 889869.918, "totalHrs": 263.83, "validHrs": 247.18 }, "gl": { "buckets": { "dev": 1728, "invalidated": 235, "other": 102, "reported": 83, "test": 1716, "train": 1942, "validated": 5417 }, "reportedSentences": 84, "duration": 28208628, "clips": 5754, "splits": { "accent": { "": 0.68, "normativo": 0.14, "neofalante": 0.11, "oriental": 0.01, "atlantico": 0.01, "central": 0.05 }, "age": { "": 0.4, "thirties": 0.36, "fifties": 0.04, "twenties": 0.13, "fourties": 0.07, "teens": 0 }, "gender": { "": 0.42, "male": 0.53, "female": 0.05 } }, "users": 106, "size": 161168322, "checksum": "0bd735e18863ecc1f37edd030083aeba2b12d2089abf9b96bf98d45fb8b705a2", "avgDurationSecs": 4.902, "validDurationSecs": 26556.506, "totalHrs": 7.83, "validHrs": 7.37 }, "ug": { "buckets": { "dev": 2642, "invalidated": 1398, "other": 382, "reported": 133, "test": 2620, "train": 3213, "validated": 24981 }, "reportedSentences": 128, "duration": 159344568, "clips": 26761, "splits": { "accent": { "": 1 }, "age": { "": 0.63, "fifties": 0.01, "twenties": 0.2, "thirties": 0.11, "fourties": 0.04, "teens": 0.01, "eighties": 0 }, "gender": { "": 0.63, "male": 0.29, "female": 0.08, "other": 0 } }, "users": 296, "size": 929601304, "checksum": "0a67532dd6fa7868ccad9db50ec7d8ce31f678a4eab26b76b901103b37a8f1b7", "avgDurationSecs": 5.954, "validDurationSecs": 148745.811, "totalHrs": 44.26, "validHrs": 41.31 }, "hy-AM": { "buckets": { "dev": 218, "invalidated": 33, "other": 795, "reported": 22, "test": 212, "train": 336, "validated": 767 }, "reportedSentences": 23, "duration": 10063260, "clips": 1595, "splits": { "accent": { "": 1 }, "age": { "": 0.43, "thirties": 0.15, "twenties": 0.32, "fifties": 0.07, "teens": 0.02 }, "gender": { "": 0.43, "male": 0.3, "female": 0.27 } }, "users": 28, "size": 59015187, "checksum": 
"d9729faa39606059881e7edc93441903102da42699403c51c2f323dc284dae8d", "avgDurationSecs": 6.309, "validDurationSecs": 4839.198, "totalHrs": 2.79, "validHrs": 1.34 }, "be": { "buckets": { "dev": 14577, "invalidated": 8859, "other": 45254, "reported": 2307, "test": 14574, "train": 117578, "validated": 182881 }, "reportedSentences": 2304, "duration": 1283979096, "clips": 236994, "splits": { "accent": { "": 1 }, "age": { "": 0.46, "fourties": 0.11, "thirties": 0.23, "twenties": 0.16, "teens": 0.03, "fifties": 0, "sixties": 0, "seventies": 0 }, "gender": { "": 0.43, "male": 0.23, "female": 0.33, "other": 0 } }, "users": 3831, "size": 7305605035, "checksum": "c53b56a7604a87c3d93053eab1e79e1b624ae476147c106d853931d34fe57f38", "avgDurationSecs": 5.418, "validDurationSecs": 990807.282, "totalHrs": 356.66, "validHrs": 275.22 }, "ur": { "buckets": { "dev": 97, "invalidated": 26, "other": 884, "reported": 3, "test": 142, "train": 281, "validated": 520 }, "reportedSentences": 4, "duration": 5935644, "clips": 1430, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.42, "": 0.41, "fourties": 0.02, "thirties": 0.12, "teens": 0.03 }, "gender": { "male": 0.42, "": 0.41, "female": 0.17 } }, "users": 25, "size": 34411520, "checksum": "b6b56bb2335a7b34a905f739cdab67ce549eaac11ce0e91e2b105c840d9642ab", "avgDurationSecs": 4.151, "validDurationSecs": 2158.416, "totalHrs": 1.64, "validHrs": 0.59 }, "gn": { "buckets": { "dev": 36, "invalidated": 45, "other": 904, "reported": 13, "test": 134, "train": 262, "validated": 432 }, "reportedSentences": 14, "duration": 6191748, "clips": 1381, "splits": { "accent": { "": 1 }, "age": { "": 0.49, "twenties": 0.46, "thirties": 0.05 }, "gender": { "": 0.49, "male": 0.35, "female": 0.16 } }, "users": 55, "size": 33178619, "checksum": "243d01829f902a848d73f4775aab5ebccc575905c11cfe814099ec93e896cd45", "avgDurationSecs": 4.484, "validDurationSecs": 1936.883, "totalHrs": 1.71, "validHrs": 0.53 }, "sr": { "buckets": { "dev": 77, "invalidated": 7, "other": 
398, "reported": 6, "test": 74, "train": 88, "validated": 239 }, "reportedSentences": 7, "duration": 1858392, "clips": 644, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.62, "": 0.12, "fifties": 0.02, "fourties": 0.23 }, "gender": { "male": 0.47, "": 0.12, "female": 0.41 } }, "users": 11, "size": 9879034, "checksum": "995be7360053b1e977890dbbf4657140ec667fef991a3b5807ca05a85260bd03", "avgDurationSecs": 2.886, "validDurationSecs": 689.683, "totalHrs": 0.51, "validHrs": 0.19 }, "uz": { "buckets": { "dev": 0, "invalidated": 3, "other": 368, "reported": 8, "test": 34, "train": 127, "validated": 161 }, "reportedSentences": 9, "duration": 2892636, "clips": 532, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.27, "": 0.73 }, "gender": { "male": 0.08, "": 0.73, "female": 0.18 } }, "users": 7, "size": 14900815, "checksum": "07844d32518a9b0a4ae98b1bdb7c4f1228dbfe5af6abcbc9e265fabaeee107df", "avgDurationSecs": 5.437, "validDurationSecs": 875.403, "totalHrs": 0.8, "validHrs": 0.24 }, "sah": { "buckets": { "dev": 414, "invalidated": 68, "other": 1235, "test": 800, "train": 1443, "validated": 2659 }, "duration": 23744460, "clips": 3962, "splits": { "accent": { "": 1 }, "age": { "": 0.36, "twenties": 0.02, "fourties": 0.07, "thirties": 0.44, "teens": 0.11, "fifties": 0 }, "gender": { "": 0.36, "male": 0.54, "female": 0.1 } }, "users": 45, "size": 181815287, "checksum": "2a02e09d1adc4b9a441a5cd73126490a55f29ef088e568f9bd7cf9861e9be7fd", "avgDurationSecs": 5.993, "validDurationSecs": 15935.517, "totalHrs": 6.59, "validHrs": 4.42 }, "vot": { "buckets": { "dev": 0, "invalidated": 97, "other": 320, "test": 0, "train": 9, "validated": 9 }, "duration": 1025976, "clips": 426, "splits": { "accent": { "": 1 }, "age": { "": 1 }, "gender": { "": 1 } }, "users": 5, "size": 7892226, "checksum": "b6afcc4df1866ce837ae5e09190450831730cb6bfbf5cf342bce41cfeed6087b", "avgDurationSecs": 2.408, "validDurationSecs": 21.676, "totalHrs": 0.28, "validHrs": 0 }, "az": { "buckets": { 
"dev": 0, "invalidated": 0, "other": 28, "test": 0, "train": 0, "validated": 0 }, "duration": 184392, "clips": 28, "splits": { "accent": { "": 1 }, "age": { "": 1 }, "gender": { "": 1 } }, "users": 4, "size": 1083579, "checksum": "542fb43f6c91de84972ab4aaac11b5e98b043527109e44f6ea50e61b880fde88", "avgDurationSecs": 6.585, "validDurationSecs": 0, "totalHrs": 0.05, "validHrs": 0 }, "ha": { "buckets": { "dev": 0, "invalidated": 38, "other": 938, "test": 149, "train": 1386, "validated": 1535 }, "duration": 10919520, "clips": 2511, "splits": { "accent": { "": 1 }, "age": { "": 0.05, "thirties": 0.94, "twenties": 0.01, "fourties": 0 }, "gender": { "": 0.05, "male": 0.15, "female": 0.81 } }, "users": 17, "size": 63636525, "checksum": "52cb845e46b6d752fdac6ede4200440a916bf3a0d106845bfe985e2e18928107", "avgDurationSecs": 4.349, "validDurationSecs": 6675.214, "totalHrs": 3.03, "validHrs": 1.85 } }, "totalDuration": 50059648831, "totalValidDurationSecs": 40292591, "totalHrs": 13905, "totalValidHrs": 11192 } ================================================ FILE: datasets/scripted-speech/cv-corpus-7.0-singleword.json ================================================ { "locales": { "es": { "clips": 70038, "splits": { "accent": { "surpeninsular": 0.01, "": 0.8, "americacentral": 0.02, "centrosurpeninsular": 0.02, "mexicano": 0.05, "caribe": 0.02, "nortepeninsular": 0.02, "andino": 0.04, "rioplatense": 0.02, "chileno": 0.01, "canario": 0, "filipinas": 0 }, "age": { "sixties": 0.01, "": 0.74, "thirties": 0.05, "fifties": 0.02, "fourties": 0.04, "twenties": 0.11, "teens": 0.04, "seventies": 0, "nineties": 0, "eighties": 0 }, "gender": { "male": 0.17, "": 0.74, "female": 0.09, "other": 0 } }, "users": 9052, "buckets": { "dev": 4, "invalidated": 6542, "other": 40514, "test": 4, "train": 6, "validated": 22982 }, "duration": 202516646, "avgDurationSecs": 2.892, "validDurationSecs": 66453.033, "totalHrs": 56.25, "validHrs": 18.45 }, "pl": { "clips": 1182, "splits": { "accent": { "": 1 }, 
"age": { "twenties": 0.25, "thirties": 0.17, "fourties": 0.02, "teens": 0.08, "": 0.47, "sixties": 0.02 }, "gender": { "female": 0.13, "male": 0.4, "": 0.47, "other": 0.01 } }, "users": 112, "buckets": { "dev": 4, "invalidated": 37, "other": 3, "test": 4, "train": 6, "validated": 1142 }, "duration": 2713140, "avgDurationSecs": 2.295, "validDurationSecs": 2621.325, "totalHrs": 0.75, "validHrs": 0.72 }, "ca": { "clips": 4180, "splits": { "accent": { "valencian": 0.08, "": 0.51, "central": 0.35, "northern": 0.01, "northwestern": 0.03, "balearic": 0.02 }, "age": { "fifties": 0.08, "": 0.52, "sixties": 0.04, "thirties": 0.12, "fourties": 0.1, "twenties": 0.1, "teens": 0.04, "seventies": 0.01 }, "gender": { "male": 0.35, "": 0.53, "female": 0.12, "other": 0 } }, "users": 387, "buckets": { "dev": 4, "invalidated": 217, "other": 297, "test": 4, "train": 6, "validated": 3666 }, "duration": 10707094, "avgDurationSecs": 2.562, "validDurationSecs": 9390.48, "totalHrs": 2.97, "validHrs": 2.6 }, "tr": { "clips": 537, "splits": { "accent": { "": 1 }, "age": { "": 0.5, "fourties": 0.12, "twenties": 0.26, "teens": 0.03, "thirties": 0.1 }, "gender": { "": 0.5, "male": 0.45, "female": 0.05 } }, "users": 60, "buckets": { "dev": 4, "invalidated": 46, "other": 44, "test": 4, "train": 6, "validated": 447 }, "duration": 1278637, "avgDurationSecs": 2.381, "validDurationSecs": 1064.34, "totalHrs": 0.35, "validHrs": 0.29 }, "ru": { "clips": 2195, "splits": { "accent": { "": 1 }, "age": { "": 0.46, "teens": 0.12, "thirties": 0.13, "twenties": 0.23, "fourties": 0.05, "fifties": 0.01 }, "gender": { "": 0.47, "male": 0.46, "female": 0.06, "other": 0.01 } }, "users": 214, "buckets": { "dev": 4, "invalidated": 104, "other": 57, "test": 4, "train": 6, "validated": 2034 }, "duration": 4977118, "avgDurationSecs": 2.267, "validDurationSecs": 4612.054, "totalHrs": 1.38, "validHrs": 1.28 }, "pt": { "clips": 1307, "splits": { "accent": { "": 1 }, "age": { "": 0.4, "fourties": 0.11, "twenties": 0.29, 
"thirties": 0.16, "fifties": 0.03, "sixties": 0, "teens": 0.02 }, "gender": { "": 0.4, "male": 0.54, "female": 0.05, "other": 0.01 } }, "users": 140, "buckets": { "dev": 4, "invalidated": 87, "other": 42, "test": 4, "train": 6, "validated": 1178 }, "duration": 3449335, "avgDurationSecs": 2.639, "validDurationSecs": 3108.888, "totalHrs": 0.95, "validHrs": 0.86 }, "nl": { "clips": 644, "splits": { "accent": { "suriname": 0.02, "netherlands": 0.39, "": 0.46, "belgium": 0.13 }, "age": { "": 0.5, "twenties": 0.32, "sixties": 0, "thirties": 0.04, "fifties": 0.04, "fourties": 0.1 }, "gender": { "male": 0.45, "female": 0.06, "": 0.5 } }, "users": 67, "buckets": { "dev": 4, "invalidated": 21, "other": 7, "test": 4, "train": 6, "validated": 616 }, "duration": 1531757, "avgDurationSecs": 2.379, "validDurationSecs": 1465.159, "totalHrs": 0.42, "validHrs": 0.4 }, "de": { "clips": 15090, "splits": { "accent": { "germany": 0.26, "austria": 0.02, "": 0.7, "france": 0, "italy": 0, "switzerland": 0.01, "united_states": 0, "russia": 0, "united_kingdom": 0, "lithuania": 0 }, "age": { "fourties": 0.04, "fifties": 0.05, "twenties": 0.12, "thirties": 0.07, "": 0.63, "seventies": 0.01, "teens": 0.06, "sixties": 0.02, "eighties": 0 }, "gender": { "male": 0.31, "female": 0.06, "": 0.63, "other": 0 } }, "users": 1618, "buckets": { "dev": 4, "invalidated": 1386, "other": 56, "test": 4, "train": 7, "validated": 13648 }, "duration": 39418281, "avgDurationSecs": 2.612, "validDurationSecs": 35651.471, "totalHrs": 10.94, "validHrs": 9.9 }, "ta": { "clips": 341, "splits": { "accent": { "": 1 }, "age": { "": 0.45, "fifties": 0.04, "fourties": 0.04, "twenties": 0.17, "thirties": 0.26, "teens": 0.03 }, "gender": { "": 0.49, "male": 0.35, "female": 0.16 } }, "users": 39, "buckets": { "dev": 4, "invalidated": 39, "other": 50, "test": 4, "train": 6, "validated": 252 }, "duration": 992550, "avgDurationSecs": 2.911, "validDurationSecs": 733.497, "totalHrs": 0.27, "validHrs": 0.2 }, "ar": { "clips": 564, 
"splits": { "accent": { "": 1 }, "age": { "": 0.51, "thirties": 0.11, "twenties": 0.28, "teens": 0.05, "fifties": 0.02, "fourties": 0.02 }, "gender": { "": 0.5, "male": 0.42, "female": 0.08 } }, "users": 67, "buckets": { "dev": 4, "invalidated": 45, "other": 39, "test": 4, "train": 6, "validated": 480 }, "duration": 1530140, "avgDurationSecs": 2.713, "validDurationSecs": 1302.247, "totalHrs": 0.42, "validHrs": 0.36 }, "en": { "clips": 49370, "splits": { "accent": { "": 0.75, "us": 0.12, "england": 0.04, "indian": 0.04, "australia": 0.01, "canada": 0.02, "newzealand": 0, "scotland": 0, "hongkong": 0.01, "ireland": 0, "singapore": 0, "philippines": 0, "malaysia": 0, "african": 0, "wales": 0 }, "age": { "": 0.61, "twenties": 0.18, "teens": 0.05, "fourties": 0.04, "thirties": 0.07, "sixties": 0.01, "fifties": 0.03, "seventies": 0.01, "nineties": 0, "eighties": 0 }, "gender": { "": 0.62, "female": 0.09, "male": 0.29, "other": 0.01 } }, "users": 5876, "buckets": { "dev": 4, "invalidated": 4071, "other": 12573, "test": 4, "train": 6, "validated": 32726 }, "duration": 132611224, "avgDurationSecs": 2.686, "validDurationSecs": 87904.292, "totalHrs": 36.83, "validHrs": 24.41 }, "fr": { "clips": 22382, "splits": { "accent": { "france": 0.23, "": 0.74, "germany": 0, "canada": 0.01, "united_kingdom": 0, "belgium": 0.01, "switzerland": 0.01, "reunion": 0, "senegal": 0, "martinique": 0, "congo_brazzaville": 0, "cameroon": 0, "united_states": 0, "madagascar": 0, "romania": 0, "benin": 0, "cote_d_ivoire": 0, "morocco": 0 }, "age": { "sixties": 0.01, "": 0.7, "twenties": 0.1, "thirties": 0.06, "fourties": 0.05, "teens": 0.05, "fifties": 0.03, "seventies": 0 }, "gender": { "male": 0.24, "": 0.7, "female": 0.06, "other": 0.01 } }, "users": 2560, "buckets": { "dev": 4, "invalidated": 2345, "other": 20, "test": 4, "train": 6, "validated": 20017 }, "duration": 59131896, "avgDurationSecs": 2.642, "validDurationSecs": 52883.708, "totalHrs": 16.42, "validHrs": 14.68 }, "cy": { "clips": 553, 
"splits": { "accent": { "united_kingdom": 0.47, "": 0.53 }, "age": { "twenties": 0.17, "thirties": 0.07, "": 0.53, "teens": 0.04, "fifties": 0.09, "fourties": 0.06, "sixties": 0.02, "seventies": 0.02 }, "gender": { "female": 0.22, "male": 0.24, "": 0.54 } }, "users": 64, "buckets": { "dev": 3, "invalidated": 15, "other": 62, "test": 3, "train": 4, "validated": 476 }, "duration": 1512525, "avgDurationSecs": 2.735, "validDurationSecs": 1301.92, "totalHrs": 0.42, "validHrs": 0.36 }, "eo": { "clips": 965, "splits": { "accent": { "": 0.67, "internacia": 0.33 }, "age": { "": 0.45, "fourties": 0.03, "twenties": 0.28, "teens": 0.06, "eighties": 0.01, "sixties": 0.02, "thirties": 0.14, "fifties": 0.01 }, "gender": { "": 0.46, "male": 0.48, "other": 0.01, "female": 0.05 } }, "users": 91, "buckets": { "dev": 3, "invalidated": 41, "other": 27, "test": 3, "train": 6, "validated": 897 }, "duration": 2260824, "avgDurationSecs": 2.343, "validDurationSecs": 2101.512, "totalHrs": 0.62, "validHrs": 0.58 }, "ja": { "clips": 644, "splits": { "accent": { "": 1 }, "age": { "teens": 0.05, "twenties": 0.5, "": 0.32, "fifties": 0.02, "fourties": 0.04, "thirties": 0.06 }, "gender": { "male": 0.43, "female": 0.24, "": 0.31, "other": 0.02 } }, "users": 74, "buckets": { "dev": 3, "invalidated": 34, "other": 18, "test": 3, "train": 6, "validated": 592 }, "duration": 1454877, "avgDurationSecs": 2.259, "validDurationSecs": 1337.402, "totalHrs": 0.4, "validHrs": 0.37 }, "id": { "clips": 171, "splits": { "accent": { "": 1 }, "age": { "": 0.37, "twenties": 0.63 }, "gender": { "": 0.37, "male": 0.5, "female": 0.07, "other": 0.06 } }, "users": 20, "buckets": { "dev": 3, "invalidated": 11, "other": 17, "test": 3, "train": 6, "validated": 143 }, "duration": 375907, "avgDurationSecs": 2.198, "validDurationSecs": 314.355, "totalHrs": 0.1, "validHrs": 0.08 }, "eu": { "clips": 635, "splits": { "accent": { "": 0.8, "mendebalekoa": 0.12, "erdialdekoa_nafarra": 0.08 }, "age": { "": 0.74, "teens": 0.01, 
"fourties": 0.16, "fifties": 0.04, "thirties": 0.04, "twenties": 0.02 }, "gender": { "": 0.74, "female": 0.09, "male": 0.17 } }, "users": 80, "buckets": { "dev": 2, "invalidated": 13, "other": 406, "test": 3, "train": 7, "validated": 216 }, "duration": 1577809, "avgDurationSecs": 2.485, "validDurationSecs": 536.704, "totalHrs": 0.43, "validHrs": 0.14 }, "tt": { "clips": 55, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.44, "": 0.55, "twenties": 0.02 }, "gender": { "male": 0.44, "": 0.55, "female": 0.02 } }, "users": 8, "buckets": { "dev": 4, "invalidated": 2, "other": 17, "test": 4, "train": 6, "validated": 36 }, "duration": 154248, "avgDurationSecs": 2.805, "validDurationSecs": 100.962, "totalHrs": 0.04, "validHrs": 0.02 }, "kab": { "clips": 1529, "splits": { "accent": { "": 0.98, "other": 0.02 }, "age": { "fourties": 0.06, "thirties": 0.13, "": 0.59, "fifties": 0.02, "twenties": 0.17, "sixties": 0.02, "teens": 0.01 }, "gender": { "male": 0.35, "": 0.6, "female": 0.06 } }, "users": 168, "buckets": { "dev": 3, "invalidated": 150, "other": 160, "test": 3, "train": 6, "validated": 1219 }, "duration": 4224717, "avgDurationSecs": 2.763, "validDurationSecs": 3368.169, "totalHrs": 1.17, "validHrs": 0.93 }, "br": { "clips": 44, "splits": { "accent": { "gwenedeg": 0.52, "": 0.41, "kerneveg": 0.07 }, "age": { "fourties": 0.05, "twenties": 0.3, "": 0.2, "fifties": 0.39, "teens": 0.07 }, "gender": { "male": 0.8, "": 0.2 } }, "users": 8, "buckets": { "dev": 0, "invalidated": 0, "other": 42, "test": 0, "train": 2, "validated": 2 }, "duration": 113952, "avgDurationSecs": 2.59, "validDurationSecs": 5.18, "totalHrs": 0.03, "validHrs": 0 }, "zh-CN": { "clips": 2731, "splits": { "accent": { "110000": 0.06, "120000": 0, "130000": 0.01, "140000": 0.01, "210000": 0.01, "230000": 0.02, "310000": 0, "320000": 0.03, "330000": 0.04, "340000": 0.02, "350000": 0.02, "360000": 0.01, "370000": 0.02, "410000": 0.01, "420000": 0.02, "430000": 0.01, "440000": 0.03, "450000": 0, 
"460000": 0, "500000": 0, "510000": 0.01, "530000": 0.01, "610000": 0.01, "620000": 0.01, "650000": 0.01, "710000": 0, "": 0.63 }, "age": { "thirties": 0.06, "": 0.6, "twenties": 0.23, "teens": 0.08, "fourties": 0.02, "fifties": 0.01 }, "gender": { "male": 0.27, "": 0.61, "female": 0.1, "other": 0.02 } }, "users": 338, "buckets": { "dev": 3, "invalidated": 216, "other": 0, "test": 3, "train": 6, "validated": 2515 }, "duration": 6478785, "avgDurationSecs": 2.372, "validDurationSecs": 5966.366, "totalHrs": 1.79, "validHrs": 1.65 }, "fy-NL": { "clips": 216, "splits": { "accent": { "": 1 }, "age": { "": 0.84, "sixties": 0.05, "fourties": 0.11 }, "gender": { "": 0.84, "female": 0.1, "male": 0.06 } }, "users": 22, "buckets": { "dev": 3, "invalidated": 13, "other": 126, "test": 3, "train": 6, "validated": 77 }, "duration": 650279, "avgDurationSecs": 3.011, "validDurationSecs": 231.812, "totalHrs": 0.18, "validHrs": 0.06 }, "zh-TW": { "clips": 29, "splits": { "accent": { "": 0.97, "new_taipei_city": 0.03 }, "age": { "": 0.38, "teens": 0.41, "twenties": 0.21 }, "gender": { "": 0.38, "male": 0.62 } }, "users": 5, "buckets": { "dev": 0, "invalidated": 1, "other": 0, "test": 2, "train": 10, "validated": 28 }, "duration": 66620, "avgDurationSecs": 2.297, "validDurationSecs": 64.323, "totalHrs": 0.01, "validHrs": 0.01 }, "cs": { "clips": 1962, "splits": { "accent": { "": 1 }, "age": { "teens": 0.05, "": 0.68, "thirties": 0.11, "twenties": 0.13, "fourties": 0.03, "fifties": 0.01, "seventies": 0.01 }, "gender": { "male": 0.29, "": 0.67, "female": 0.04 } }, "users": 198, "buckets": { "dev": 3, "invalidated": 34, "other": 0, "test": 3, "train": 6, "validated": 1928 }, "duration": 4678287, "avgDurationSecs": 2.384, "validDurationSecs": 4597.216, "totalHrs": 1.29, "validHrs": 1.27 }, "zh-HK": { "clips": 1623, "splits": { "accent": { "": 1 }, "age": { "": 0.71, "twenties": 0.11, "fourties": 0.06, "thirties": 0.08, "teens": 0.02, "fifties": 0.01, "sixties": 0.01 }, "gender": { "": 0.71, 
"male": 0.2, "female": 0.1 } }, "users": 177, "buckets": { "dev": 3, "invalidated": 93, "other": 0, "test": 3, "train": 6, "validated": 1530 }, "duration": 4064590, "avgDurationSecs": 2.504, "validDurationSecs": 3831.684, "totalHrs": 1.12, "validHrs": 1.06 }, "lg": { "clips": 58, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.4, "": 0.38, "thirties": 0.22 }, "gender": { "male": 0.19, "": 0.59, "female": 0.22 } }, "users": 9, "buckets": { "dev": 3, "invalidated": 3, "other": 38, "test": 3, "train": 5, "validated": 17 }, "duration": 168596, "avgDurationSecs": 2.907, "validDurationSecs": 49.416, "totalHrs": 0.04, "validHrs": 0.01 }, "ka": { "clips": 656, "splits": { "accent": { "": 1 }, "age": { "": 0.71, "twenties": 0.2, "thirties": 0.03, "fifties": 0.01, "teens": 0.04 }, "gender": { "": 0.71, "male": 0.21, "female": 0.08 } }, "users": 73, "buckets": { "dev": 3, "invalidated": 31, "other": 12, "test": 3, "train": 6, "validated": 613 }, "duration": 1587372, "avgDurationSecs": 2.42, "validDurationSecs": 1483.322, "totalHrs": 0.44, "validHrs": 0.41 }, "sv-SE": { "clips": 34, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.47, "": 0.47, "thirties": 0.06 }, "gender": { "female": 0.35, "": 0.47, "male": 0.18 } }, "users": 6, "buckets": { "dev": 0, "invalidated": 0, "other": 5, "test": 0, "train": 12, "validated": 29 }, "duration": 86892, "avgDurationSecs": 2.556, "validDurationSecs": 74.114, "totalHrs": 0.02, "validHrs": 0.02 }, "cv": { "clips": 125, "splits": { "accent": { "": 1 }, "age": { "teens": 0.26, "twenties": 0.19, "": 0.45, "fourties": 0.1 }, "gender": { "female": 0.22, "male": 0.42, "": 0.35 } }, "users": 13, "buckets": { "dev": 3, "invalidated": 2, "other": 37, "test": 3, "train": 6, "validated": 86 }, "duration": 310861, "avgDurationSecs": 2.487, "validDurationSecs": 213.872, "totalHrs": 0.08, "validHrs": 0.05 }, "ky": { "clips": 946, "splits": { "accent": { "": 0.99, "other": 0.01 }, "age": { "": 0.27, "thirties": 0.05, "twenties": 0.46, 
"teens": 0.23 }, "gender": { "": 0.29, "male": 0.51, "female": 0.19 } }, "users": 96, "buckets": { "dev": 3, "invalidated": 72, "other": 88, "test": 3, "train": 6, "validated": 786 }, "duration": 2327196, "avgDurationSecs": 2.46, "validDurationSecs": 1933.59, "totalHrs": 0.64, "validHrs": 0.53 }, "rw": { "clips": 6921, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.58, "": 0.13, "fourties": 0.05, "teens": 0.09, "thirties": 0.13, "fifties": 0.01 }, "gender": { "male": 0.48, "": 0.15, "female": 0.37 } }, "users": 609, "buckets": { "dev": 3, "invalidated": 23, "other": 6306, "test": 3, "train": 6, "validated": 592 }, "duration": 16715172, "avgDurationSecs": 2.415, "validDurationSecs": 1429.762, "totalHrs": 4.64, "validHrs": 0.39 }, "or": { "clips": 80, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.06, "": 0.47, "thirties": 0.06, "twenties": 0.4 }, "gender": { "male": 0.61, "": 0.39 } }, "users": 9, "buckets": { "dev": 3, "invalidated": 3, "other": 48, "test": 3, "train": 6, "validated": 29 }, "duration": 221652, "avgDurationSecs": 2.771, "validDurationSecs": 80.349, "totalHrs": 0.06, "validHrs": 0.02 }, "ab": { "clips": 221, "splits": { "accent": { "": 1 }, "age": { "": 0.17, "twenties": 0.07, "fifties": 0.22, "thirties": 0.29, "teens": 0.11, "sixties": 0.1, "fourties": 0.05 }, "gender": { "": 0.17, "male": 0.45, "female": 0.38 } }, "users": 21, "buckets": { "dev": 3, "invalidated": 11, "other": 37, "test": 3, "train": 6, "validated": 173 }, "duration": 553140, "avgDurationSecs": 2.503, "validDurationSecs": 433.001, "totalHrs": 0.15, "validHrs": 0.12 }, "th": { "clips": 31, "splits": { "accent": { "": 1 }, "age": { "fifties": 0.45, "": 0.1, "twenties": 0.29, "thirties": 0.1, "fourties": 0.06 }, "gender": { "male": 0.58, "": 0.1, "female": 0.32 } }, "users": 11, "buckets": { "dev": 3, "invalidated": 0, "other": 8, "test": 3, "train": 6, "validated": 23 }, "duration": 73332, "avgDurationSecs": 2.366, "validDurationSecs": 54.408, "totalHrs": 0.02, 
"validHrs": 0.01 } }, "overall": { "size": 3766437719, "checksum": "f96d00524f8859a8bd154eb98822ec06a9d451e5902a419deb5e1472f1d2c3ff" }, "totalDuration": 510515451, "totalValidDurationSecs": 296699, "totalHrs": 141, "totalValidHrs": 82 } ================================================ FILE: datasets/scripted-speech/cv-corpus-8.0-2022-01-19.json ================================================ { "date": "2022-01-19", "locales": { "en": { "duration": 10390463635, "buckets": { "dev": 16326, "invalidated": 239065, "other": 251332, "reported": 3558, "test": 16326, "train": 864448, "validated": 1530385 }, "reportedSentences": 3500, "clips": 2020782, "splits": { "accent": { "": 1 }, "age": { "": 0.37, "twenties": 0.24, "sixties": 0.04, "thirties": 0.13, "teens": 0.06, "seventies": 0.01, "fourties": 0.1, "fifties": 0.04, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male": 0.46, "female": 0.16, "other": 0.02 } }, "users": 79398, "size": 75356163484, "checksum": "8b82525e6adb8382e28eabfed1beeedd3f315c1d3cdf7445a3ff33743f42025d", "avgDurationSecs": 5.142, "validDurationSecs": 7868938.703, "totalHrs": 2886.23, "validHrs": 2185.81 }, "fa": { "buckets": { "dev": 9728, "invalidated": 13124, "other": 30224, "reported": 1949, "test": 9728, "train": 23425, "validated": 284178 }, "reportedSentences": 1941, "clips": 327526, "splits": { "accent": { "": 1 }, "age": { "": 0.24, "twenties": 0.32, "thirties": 0.38, "fifties": 0.02, "fourties": 0.02, "teens": 0.02, "sixties": 0 }, "gender": { "": 0.21, "male": 0.72, "female": 0.07, "other": 0 } }, "users": 4016, "duration": 1316470688, "size": 9804861437, "checksum": "5b351ade3858fd9636691569dc9918e7270fedbc1a68fc47b4fe82bf6d34fd9a", "avgDurationSecs": 4.019, "validDurationSecs": 1142236.058, "totalHrs": 365.68, "validHrs": 317.28 }, "fr": { "buckets": { "dev": 16021, "invalidated": 53389, "other": 1444, "reported": 5989, "test": 16021, "train": 430464, "validated": 596858 }, "duration": 3247351570, "reportedSentences": 5914, 
"clips": 651691, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.18, "thirties": 0.18, "": 0.35, "teens": 0.03, "fourties": 0.13, "fifties": 0.09, "sixties": 0.03, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "male": 0.62, "": 0.27, "female": 0.11, "other": 0.01 } }, "users": 16082, "size": 23747554124, "checksum": "0e11673e55bbcf30244d5da0c3a8ab07f7deb96ae98b3d2ae752b51c2caa0ea2", "avgDurationSecs": 4.983, "validDurationSecs": 2974120.808, "totalHrs": 902.04, "validHrs": 826.14 }, "es": { "buckets": { "dev": 15440, "invalidated": 47683, "other": 191609, "reported": 1632, "test": 15440, "train": 214001, "validated": 289210 }, "duration": 2661738714, "reportedSentences": 1623, "clips": 528502, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.11, "": 0.32, "fifties": 0.08, "twenties": 0.25, "teens": 0.03, "fourties": 0.06, "sixties": 0.15, "eighties": 0, "seventies": 0, "nineties": 0 }, "gender": { "male": 0.51, "": 0.32, "other": 0.01, "female": 0.17 } }, "users": 22741, "size": 19549932331, "checksum": "b138e13fd63ffdf8668120ad457376c257756ca8e4bcad6485185683fa2f520b", "avgDurationSecs": 5.036, "validDurationSecs": 1456572.451, "totalHrs": 739.37, "validHrs": 404.6 }, "sl": { "buckets": { "dev": 1229, "invalidated": 207, "other": 832, "reported": 28, "test": 1193, "train": 1377, "validated": 8975 }, "duration": 38735110, "reportedSentences": 29, "clips": 10014, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.5, "teens": 0.09, "": 0.18, "sixties": 0.08, "fifties": 0.08, "fourties": 0.02, "thirties": 0.05 }, "gender": { "female": 0.15, "male": 0.67, "": 0.18, "other": 0 } }, "users": 125, "size": 281432145, "checksum": "38a3bfd9e5dfe83cd5d6eb44851b8416449cc0675e965b4016a1ad2365010062", "avgDurationSecs": 3.868, "validDurationSecs": 34716.159, "totalHrs": 10.75, "validHrs": 9.64 }, "kab": { "buckets": { "dev": 14874, "invalidated": 19127, "other": 100690, "reported": 4798, "test": 14874, "train": 140104, "validated": 596082 }, 
"duration": 2380327432, "reportedSentences": 4793, "clips": 715899, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.09, "thirties": 0.3, "": 0.27, "fifties": 0.19, "twenties": 0.12, "eighties": 0, "teens": 0, "sixties": 0.03, "seventies": 0 }, "gender": { "male": 0.55, "": 0.25, "female": 0.2, "other": 0 } }, "users": 1431, "size": 17959985919, "checksum": "45834be25ed44de041ca41c722344bc0743bae556b24e58278771e9a9db1969c", "avgDurationSecs": 3.325, "validDurationSecs": 1981942.057, "totalHrs": 661.2, "validHrs": 550.53 }, "cy": { "buckets": { "dev": 5131, "invalidated": 4226, "other": 17588, "reported": 149, "test": 5144, "train": 7459, "validated": 86743 }, "duration": 524128257, "reportedSentences": 150, "clips": 108557, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.16, "twenties": 0.14, "sixties": 0.07, "fifties": 0.09, "": 0.42, "thirties": 0.09, "seventies": 0.01, "eighties": 0, "teens": 0.02 }, "gender": { "male": 0.33, "female": 0.25, "": 0.41, "other": 0.01 } }, "users": 1695, "size": 3878708024, "checksum": "b087632306ff2fb11da1753c5adfda39634c727abc7e027be13c6c0d652b71a1", "avgDurationSecs": 4.828, "validDurationSecs": 418807.239, "totalHrs": 145.59, "validHrs": 116.33 }, "ca": { "buckets": { "dev": 16077, "invalidated": 37122, "other": 42468, "reported": 2172, "test": 16078, "train": 475770, "validated": 607599 }, "duration": 3732735162, "reportedSentences": 2131, "clips": 687189, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.11, "fifties": 0.21, "fourties": 0.13, "twenties": 0.07, "": 0.17, "sixties": 0.28, "teens": 0.01, "seventies": 0.03, "eighties": 0, "nineties": 0 }, "gender": { "male": 0.63, "": 0.18, "female": 0.19, "other": 0 } }, "users": 6665, "size": 26810695121, "checksum": "9ce115587d1641da7dba3b3a8251809d71c9c08e539bfcc3a2c0a17a518b2299", "avgDurationSecs": 5.432, "validDurationSecs": 3300411.025, "totalHrs": 1036.87, "validHrs": 916.78 }, "de": { "buckets": { "dev": 16007, "invalidated": 44525, "other": 5115, 
"reported": 6726, "test": 16007, "train": 420163, "validated": 746482 }, "duration": 4080408545, "reportedSentences": 6702, "clips": 796122, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.2, "fourties": 0.18, "": 0.31, "thirties": 0.15, "teens": 0.03, "sixties": 0.03, "fifties": 0.1, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "male": 0.59, "": 0.31, "female": 0.09, "other": 0.01 } }, "users": 16390, "size": 29533045352, "checksum": "ed2ff5a0564f5e4c0da9606d9fb10ef9189d3c369a9046cdde95d2f1ba5b4e0e", "avgDurationSecs": 5.125, "validDurationSecs": 3825985.881, "totalHrs": 1133.44, "validHrs": 1062.77 }, "tt": { "buckets": { "dev": 2812, "invalidated": 351, "other": 186, "reported": 3, "test": 5086, "train": 10062, "validated": 28102 }, "duration": 107079434, "reportedSentences": 4, "clips": 28639, "splits": { "accent": { "": 1 }, "age": { "": 0.19, "thirties": 0.73, "twenties": 0.05, "sixties": 0, "fifties": 0.01, "teens": 0, "fourties": 0, "seventies": 0.01 }, "gender": { "": 0.19, "male": 0.79, "female": 0.02 } }, "users": 206, "size": 795268101, "checksum": "31f43dac431fcef8e07ca6f96320d9425ca4bfbf862c6a1fd38dea10b6cc5db2", "avgDurationSecs": 3.739, "validDurationSecs": 105071.625, "totalHrs": 29.74, "validHrs": 29.18 }, "ta": { "buckets": { "dev": 11473, "invalidated": 5460, "other": 66259, "reported": 3190, "test": 11499, "train": 38077, "validated": 126014 }, "duration": 1229526168, "reportedSentences": 3190, "clips": 197733, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.08, "thirties": 0.1, "": 0.7, "fourties": 0.03, "seventies": 0.02, "fifties": 0.02, "teens": 0.04, "sixties": 0, "eighties": 0 }, "gender": { "male": 0.16, "": 0.7, "other": 0, "female": 0.14 } }, "users": 679, "size": 7385758869, "checksum": "afc50e741d646f789f4781af0447e7ac7e352a3816ba6957052e55666285a486", "avgDurationSecs": 6.218, "validDurationSecs": 783569.311, "totalHrs": 341.53, "validHrs": 217.65 }, "ru": { "buckets": { "dev": 9415, "invalidated": 
6385, "other": 15202, "reported": 282, "test": 9419, "train": 21712, "validated": 111904 }, "duration": 698374512, "reportedSentences": 276, "clips": 133491, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.42, "teens": 0.08, "": 0.19, "fourties": 0.15, "thirties": 0.13, "fifties": 0.03, "sixties": 0, "seventies": 0 }, "gender": { "male": 0.64, "": 0.2, "other": 0, "female": 0.16 } }, "users": 2452, "size": 4974445991, "checksum": "5dc41389cd248c8e80568751048da64145f876778959f6ef34ca15858eeeb304", "avgDurationSecs": 5.232, "validDurationSecs": 585439.478, "totalHrs": 193.99, "validHrs": 162.62 }, "nl": { "buckets": { "dev": 10477, "invalidated": 4910, "other": 1443, "reported": 302, "test": 10477, "train": 28286, "validated": 82265 }, "duration": 380247373, "reportedSentences": 303, "clips": 88618, "splits": { "accent": { "": 1 }, "age": { "": 0.41, "twenties": 0.22, "fourties": 0.15, "thirties": 0.1, "teens": 0.02, "fifties": 0.08, "sixties": 0.01, "nineties": 0, "eighties": 0, "seventies": 0 }, "gender": { "": 0.42, "male": 0.47, "female": 0.11, "other": 0 } }, "users": 1462, "size": 2632034245, "checksum": "a173515c0f7c99559ef5a446c26d0c16e91d72201ffd64bd68703b7776fee2c1", "avgDurationSecs": 4.291, "validDurationSecs": 352987.544, "totalHrs": 105.62, "validHrs": 98.05 }, "it": { "buckets": { "dev": 14905, "invalidated": 16853, "other": 27, "reported": 5147, "test": 14905, "train": 142934, "validated": 209027 }, "duration": 1208481924, "reportedSentences": 5143, "clips": 225907, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.16, "twenties": 0.2, "": 0.3, "fifties": 0.16, "fourties": 0.14, "seventies": 0, "sixties": 0.02, "teens": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "female": 0.12, "male": 0.58, "": 0.29, "other": 0 } }, "users": 6576, "size": 8431167904, "checksum": "55d734e2269fb843b3d63e27fcd8ca51349551db1dedf1f9e3d6bc3f1ae0e1ef", "avgDurationSecs": 5.349, "validDurationSecs": 1118182.93, "totalHrs": 335.68, "validHrs": 310.6 }, 
"eu": { "buckets": { "dev": 6463, "invalidated": 5726, "other": 26030, "reported": 42, "test": 6463, "train": 10567, "validated": 68653 }, "duration": 520773963, "reportedSentences": 43, "clips": 100409, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.13, "thirties": 0.07, "fifties": 0.13, "twenties": 0.36, "": 0.24, "teens": 0.03, "sixties": 0.02, "seventies": 0 }, "gender": { "male": 0.47, "female": 0.26, "": 0.25, "other": 0.02 } }, "users": 1192, "size": 3939191632, "checksum": "c63d79e13a292e9cff9a4dbb84b1abaa7f8f9ed48f5eb5173df2811247366db2", "avgDurationSecs": 5.187, "validDurationSecs": 356070.62, "totalHrs": 144.65, "validHrs": 98.9 }, "tr": { "buckets": { "dev": 8110, "invalidated": 3182, "other": 151, "reported": 287, "test": 8339, "train": 16948, "validated": 63236 }, "duration": 246710007, "reportedSentences": 288, "clips": 66569, "splits": { "accent": { "": 1 }, "age": { "": 0.33, "thirties": 0.11, "twenties": 0.28, "teens": 0.02, "fourties": 0.04, "fifties": 0.1, "sixties": 0.08, "eighties": 0, "seventies": 0.04 }, "gender": { "": 0.33, "male": 0.46, "female": 0.21, "other": 0 } }, "users": 1228, "size": 1550870485, "checksum": "7a82a2cf91f91fafd67f16a15676e09f2ba92a6b8a99a495bb3438386f87e794", "avgDurationSecs": 3.706, "validDurationSecs": 234357.644, "totalHrs": 68.53, "validHrs": 65.09 }, "ar": { "buckets": { "dev": 10386, "invalidated": 14733, "other": 30308, "reported": 2029, "test": 10388, "train": 27823, "validated": 75110 }, "duration": 500993000, "reportedSentences": 2022, "clips": 120151, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.12, "": 0.58, "twenties": 0.26, "fourties": 0.01, "teens": 0.03, "fifties": 0, "sixties": 0, "nineties": 0 }, "gender": { "female": 0.17, "": 0.57, "male": 0.25, "other": 0 } }, "users": 1216, "size": 2987783290, "checksum": "b208528232109044d218035e65a91803c79319bfbcb50b616b7a1a9a7421fede", "avgDurationSecs": 4.17, "validDurationSecs": 306616.003, "totalHrs": 139.17, "validHrs": 85.17 }, 
"zh-TW": { "buckets": { "dev": 4200, "invalidated": 4269, "other": 25517, "reported": 85, "test": 4200, "train": 5619, "validated": 69509 }, "duration": 321779398, "reportedSentences": 86, "clips": 99295, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.22, "twenties": 0.36, "teens": 0.05, "": 0.24, "fifties": 0.01, "seventies": 0, "fourties": 0.11, "sixties": 0 }, "gender": { "male": 0.49, "": 0.23, "female": 0.25, "other": 0.02 } }, "users": 1695, "size": 2410877453, "checksum": "b0fe7534d4a45cc75d4ddccbdec1120d3016a36532ee29629fa50280dc66db77", "avgDurationSecs": 3.241, "validDurationSecs": 225253.68, "totalHrs": 89.38, "validHrs": 62.57 }, "br": { "buckets": { "dev": 2157, "invalidated": 756, "other": 11281, "reported": 182, "test": 2147, "train": 2568, "validated": 11127 }, "duration": 71230579, "reportedSentences": 183, "clips": 23164, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.25, "": 0.33, "fifties": 0.06, "fourties": 0.07, "thirties": 0.08, "sixties": 0.18, "seventies": 0.02, "teens": 0.01 }, "gender": { "male": 0.65, "": 0.34, "female": 0.02 } }, "users": 171, "size": 525334515, "checksum": "5a7fbf4904a5ecca643637672154e270562b97d806b731dfe652ef366d3bcdba", "avgDurationSecs": 3.075, "validDurationSecs": 34216.139, "totalHrs": 19.78, "validHrs": 9.5 }, "pt": { "buckets": { "dev": 8302, "invalidated": 4152, "other": 11486, "reported": 2225, "test": 8301, "train": 16575, "validated": 95429 }, "duration": 469418983, "reportedSentences": 2220, "clips": 111067, "splits": { "accent": { "": 1 }, "age": { "": 0.2, "twenties": 0.41, "teens": 0.03, "thirties": 0.23, "fourties": 0.1, "sixties": 0, "fifties": 0.03, "seventies": 0 }, "gender": { "": 0.2, "male": 0.74, "female": 0.04, "other": 0.02 } }, "users": 2365, "size": 3098993125, "checksum": "0cb4b6d603d4277bd279609404f127040ed2446f6f7d5d6cab890f036f351fd1", "avgDurationSecs": 4.226, "validDurationSecs": 403325.777, "totalHrs": 130.39, "validHrs": 112.03 }, "eo": { "buckets": { "dev": 14902, 
"invalidated": 127227, "other": 126649, "reported": 2091, "test": 14915, "train": 143982, "validated": 847678 }, "duration": 6683979355, "reportedSentences": 2090, "clips": 1101554, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.57, "thirties": 0.12, "": 0.19, "fourties": 0.04, "fifties": 0.02, "seventies": 0, "teens": 0.05, "sixties": 0, "eighties": 0 }, "gender": { "male": 0.69, "": 0.2, "female": 0.11, "other": 0 } }, "users": 1415, "size": 39920892777, "checksum": "0a69c50e958736afe155c8f9044016c9c90e024524369741dd2bc30dfd036a4b", "avgDurationSecs": 6.068, "validDurationSecs": 5068560, "totalHrs": 1856.66111111, "validHrs": 1407.93 }, "zh-CN": { "buckets": { "dev": 9688, "invalidated": 6424, "other": 12312, "reported": 415, "test": 9698, "train": 23261, "validated": 46743 }, "duration": 343002480, "reportedSentences": 414, "clips": 65479, "splits": { "accent": { "": 1 }, "age": { "": 0.38, "teens": 0.09, "twenties": 0.39, "thirties": 0.11, "fourties": 0.03, "nineties": 0, "fifties": 0, "sixties": 0 }, "gender": { "": 0.38, "male": 0.52, "female": 0.09, "other": 0.01 } }, "users": 4013, "size": 2545234598, "checksum": "3e47195ee95e41bf256c42c457d1726fd1d8b5f5a0c36bf0efe4d78568ac52eb", "avgDurationSecs": 5.238, "validDurationSecs": 244856.594, "totalHrs": 95.27, "validHrs": 68.01 }, "id": { "buckets": { "dev": 3207, "invalidated": 2442, "other": 22385, "reported": 250, "test": 3608, "train": 5032, "validated": 22874 }, "duration": 193962072, "reportedSentences": 251, "clips": 47701, "splits": { "accent": { "": 1 }, "age": { "": 0.26, "twenties": 0.39, "thirties": 0.06, "teens": 0.27, "fifties": 0, "fourties": 0.02 }, "gender": { "": 0.26, "male": 0.41, "female": 0.29, "other": 0.04 } }, "users": 394, "size": 1237534480, "checksum": "58c934a8798c3d29c84bc99e8f425c43ddaefb3f945766094134c4769a790fce", "avgDurationSecs": 4.066, "validDurationSecs": 93010.386, "totalHrs": 53.87, "validHrs": 25.83 }, "ia": { "buckets": { "dev": 1786, "invalidated": 326, 
"other": 2863, "reported": 263, "test": 1708, "train": 5050, "validated": 11145 }, "duration": 59854908, "reportedSentences": 259, "clips": 14334, "splits": { "accent": { "": 1 }, "age": { "seventies": 0.22, "fourties": 0.3, "": 0.39, "twenties": 0.05, "thirties": 0.02, "teens": 0, "fifties": 0.03, "sixties": 0 }, "gender": { "male": 0.61, "": 0.39, "female": 0.01 } }, "users": 57, "size": 406246505, "checksum": "4bae7a76fdac182ad6844c7d037589770e2cc3a4bdd70731a898eab64a890632", "avgDurationSecs": 4.176, "validDurationSecs": 46538.506, "totalHrs": 16.62, "validHrs": 12.92 }, "lv": { "buckets": { "dev": 1829, "invalidated": 165, "other": 1218, "reported": 23, "test": 2148, "train": 3134, "validated": 7542 }, "duration": 30428333, "reportedSentences": 24, "clips": 8925, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.49, "fourties": 0.03, "": 0.17, "twenties": 0.28, "teens": 0.03, "fifties": 0 }, "gender": { "male": 0.7, "female": 0.13, "": 0.17 } }, "users": 115, "size": 224375158, "checksum": "037fbf62bd6a4e8703ce6fa6bd7b414825e388e7532c3b59d621ff983cb7f246", "avgDurationSecs": 3.409, "validDurationSecs": 25713.22, "totalHrs": 8.45, "validHrs": 7.14 }, "ja": { "buckets": { "dev": 4124, "invalidated": 1988, "other": 115, "reported": 121, "test": 4483, "train": 6499, "validated": 30814 }, "duration": 156864273, "reportedSentences": 121, "clips": 32917, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.34, "": 0.23, "teens": 0.04, "fifties": 0.01, "thirties": 0.1, "fourties": 0.28, "sixties": 0 }, "gender": { "male": 0.5, "": 0.21, "female": 0.29, "other": 0 } }, "users": 550, "size": 958012363, "checksum": "6185cda4aee9c4a7e3c18739ed48c2ac84e235f5bc2a7f7bdd458a981d3f1f86", "avgDurationSecs": 4.765, "validDurationSecs": 146842.535, "totalHrs": 43.57, "validHrs": 40.78 }, "rw": { "buckets": { "dev": 15988, "invalidated": 227700, "other": 47331, "reported": 623, "test": 16213, "train": 1003810, "validated": 1438253 }, "duration": 8579999777, 
"reportedSentences": 624, "clips": 1713284, "splits": { "accent": { "": 1 }, "age": { "": 0.05, "twenties": 0.61, "thirties": 0.12, "teens": 0.2, "fourties": 0.02, "fifties": 0 }, "gender": { "": 0.1, "male": 0.57, "female": 0.33, "other": 0 } }, "users": 1055, "size": 60993257948, "checksum": "b454b08caadfa678564b80fbd0a20bdea09508bac4da79a7204e4325838aa933", "avgDurationSecs": 5.008, "validDurationSecs": 7202664.835, "totalHrs": 2383.33, "validHrs": 2000.74 }, "sv-SE": { "buckets": { "dev": 4764, "invalidated": 1288, "other": 5849, "reported": 566, "test": 4843, "train": 6926, "validated": 37302 }, "duration": 174893451, "reportedSentences": 567, "clips": 44439, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.25, "": 0.18, "teens": 0.03, "fifties": 0.03, "twenties": 0.11, "fourties": 0.39, "sixties": 0, "seventies": 0 }, "gender": { "male": 0.47, "": 0.18, "female": 0.34, "other": 0 } }, "users": 718, "size": 1113702755, "checksum": "8cf722f710eea600fefea628d88ab534cb375103fac38474d70c8fbd5f4f4eb1", "avgDurationSecs": 3.936, "validDurationSecs": 146805.183, "totalHrs": 48.58, "validHrs": 40.77 }, "cnh": { "duration": 20673132, "buckets": { "dev": 761, "invalidated": 435, "other": 2913, "reported": 8, "test": 761, "train": 814, "validated": 2453 }, "reportedSentences": 9, "clips": 5801, "splits": { "accent": { "": 1 }, "age": { "": 0.51, "twenties": 0.36, "fourties": 0.01, "teens": 0.02, "thirties": 0.08, "fifties": 0.02 }, "gender": { "": 0.51, "male": 0.33, "female": 0.16 } }, "users": 298, "size": 161377131, "checksum": "a5bc2a2505a3cebf92c76d15d3bd822a583cfaaa1f9fa46275b3e7b41ac34ec6", "avgDurationSecs": 3.564, "validDurationSecs": 8741.802, "totalHrs": 5.74, "validHrs": 2.42 }, "et": { "duration": 158860334, "buckets": { "dev": 2613, "invalidated": 5461, "other": 805, "reported": 450, "test": 2613, "train": 3103, "validated": 17303 }, "reportedSentences": 448, "clips": 23569, "splits": { "accent": { "": 1 }, "age": { "": 0.2, "thirties": 0.07, 
"twenties": 0.68, "fourties": 0.05, "fifties": 0, "seventies": 0, "teens": 0 }, "gender": { "": 0.2, "male": 0.56, "female": 0.24, "other": 0 } }, "users": 723, "size": 1118010789, "checksum": "b277d1c98c6f46abdb0867e582aa5ea37cd788b868f4537144dbdc5499b52d65", "avgDurationSecs": 6.74, "validDurationSecs": 116626.092, "totalHrs": 44.12, "validHrs": 32.39 }, "ky": { "duration": 159748788, "buckets": { "dev": 1613, "invalidated": 5557, "other": 109, "reported": 32, "test": 1613, "train": 1787, "validated": 29492 }, "reportedSentences": 33, "clips": 35158, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.08, "": 0.06, "fourties": 0.01, "twenties": 0.67, "teens": 0.19 }, "gender": { "male": 0.54, "": 0.11, "female": 0.36, "other": 0 } }, "users": 234, "size": 1034900324, "checksum": "05ddbd6beddc51c341b4d414723d4408a8c1e9c21167744d02110ce0df273235", "avgDurationSecs": 4.544, "validDurationSecs": 134003.961, "totalHrs": 44.37, "validHrs": 37.22 }, "ro": { "duration": 133037923, "buckets": { "dev": 3683, "invalidated": 850, "other": 18388, "reported": 283, "test": 3843, "train": 5158, "validated": 14381 }, "reportedSentences": 284, "clips": 33619, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.14, "teens": 0.02, "": 0.11, "fourties": 0.05, "sixties": 0, "twenties": 0.66, "fifties": 0.01, "eighties": 0 }, "gender": { "male": 0.73, "": 0.1, "female": 0.16, "other": 0.01 } }, "users": 332, "size": 837935454, "checksum": "7c7f81d93b7c591a61040aea88ff0b216f21e4d77f5a99ca2817a307793fdbe3", "avgDurationSecs": 3.957, "validDurationSecs": 56908.842, "totalHrs": 36.95, "validHrs": 15.8 }, "hsb": { "duration": 10103328, "buckets": { "dev": 172, "invalidated": 228, "other": 30, "reported": 56, "test": 418, "train": 808, "validated": 1398 }, "reportedSentences": 57, "clips": 1656, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.55, "": 0.17, "thirties": 0.11, "sixties": 0, "seventies": 0.03, "twenties": 0.11, "fifties": 0.03 }, "gender": { "male": 0.83, "": 
0.17, "other": 0 } }, "users": 19, "size": 79355786, "checksum": "2920dfc81bd6083572b7a91a9cf5a98aa2d9a5f0346e559084f27546cc32512d", "avgDurationSecs": 6.101, "validDurationSecs": 8529.259, "totalHrs": 2.8, "validHrs": 2.36 }, "el": { "duration": 93490738, "buckets": { "dev": 1690, "invalidated": 734, "other": 8084, "reported": 63, "test": 1681, "train": 1936, "validated": 13850 }, "reportedSentences": 64, "clips": 22668, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.41, "fourties": 0.14, "": 0.27, "twenties": 0.13, "fifties": 0.03, "teens": 0.01, "sixties": 0 }, "gender": { "male": 0.68, "": 0.27, "other": 0.02, "female": 0.03 } }, "users": 312, "size": 638647250, "checksum": "bcaaf3fa149e4ac9f670a8271c77be8a4d14fc9607bda6e9cf6ae35593af908e", "avgDurationSecs": 4.124, "validDurationSecs": 57122.231, "totalHrs": 25.96, "validHrs": 15.86 }, "cs": { "duration": 244312851, "buckets": { "dev": 6950, "invalidated": 1173, "other": 9639, "reported": 693, "test": 7267, "train": 13250, "validated": 45752 }, "reportedSentences": 690, "clips": 56564, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.2, "": 0.36, "thirties": 0.13, "teens": 0.01, "twenties": 0.28, "fifties": 0.02, "sixties": 0, "seventies": 0 }, "gender": { "male": 0.62, "": 0.35, "female": 0.03 } }, "users": 525, "size": 1734765011, "checksum": "7512363c0b7fa44931b5e9402a4e8f27041fab88b353e7a3344e3654f4cb3883", "avgDurationSecs": 4.319, "validDurationSecs": 197613.351, "totalHrs": 67.86, "validHrs": 54.89 }, "pl": { "duration": 584124014, "buckets": { "dev": 7748, "invalidated": 5762, "other": 10418, "reported": 523, "test": 7749, "train": 14504, "validated": 114864 }, "reportedSentences": 523, "clips": 131044, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.28, "": 0.24, "teens": 0.02, "thirties": 0.33, "fourties": 0.12, "fifties": 0.01, "nineties": 0.01, "sixties": 0 }, "gender": { "male": 0.6, "": 0.25, "female": 0.14, "other": 0.01 } }, "users": 3026, "size": 4217805363, "checksum": 
"4c67cc1917503fde008510eec0117ccd79f90540c8cd0d4cfba25f9c6a06d94e", "avgDurationSecs": 4.457, "validDurationSecs": 512002.234, "totalHrs": 162.25, "validHrs": 142.22 }, "rm-sursilv": { "duration": 37259753, "buckets": { "dev": 1296, "invalidated": 659, "other": 2102, "reported": 9, "test": 1255, "train": 1454, "validated": 4005 }, "reportedSentences": 10, "clips": 6766, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.03, "twenties": 0.1, "": 0.64, "teens": 0.06, "fourties": 0.17 }, "gender": { "male": 0.16, "female": 0.2, "": 0.64, "other": 0 } }, "users": 84, "size": 284030804, "checksum": "b58ba0a7b617abb3a2f30d8b6c62d481c03f7b1cadcc9e5562f18336bf301421", "avgDurationSecs": 5.507, "validDurationSecs": 22055.175, "totalHrs": 10.34, "validHrs": 6.12 }, "rm-vallader": { "duration": 14917514, "buckets": { "dev": 372, "invalidated": 391, "other": 726, "reported": 30, "test": 427, "train": 653, "validated": 1459 }, "reportedSentences": 29, "clips": 2576, "splits": { "accent": { "": 1 }, "age": { "": 0.36, "fourties": 0.41, "twenties": 0.14, "thirties": 0.06, "fifties": 0, "sixties": 0.03 }, "gender": { "": 0.36, "male": 0.44, "female": 0.2, "other": 0.01 } }, "users": 50, "size": 114303398, "checksum": "c1fb6a7fe372b0eb1b32dfac2da331f4df967ae860c6bdf2f8771a52d817bc70", "avgDurationSecs": 5.791, "validDurationSecs": 8449.011, "totalHrs": 4.14, "validHrs": 2.34 }, "mn": { "duration": 66850604, "buckets": { "dev": 1829, "invalidated": 739, "other": 3306, "reported": 17, "test": 1882, "train": 2178, "validated": 8180 }, "reportedSentences": 18, "clips": 12225, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.23, "": 0.27, "twenties": 0.41, "fourties": 0.01, "teens": 0.02, "nineties": 0.06, "fifties": 0 }, "gender": { "male": 0.36, "": 0.27, "female": 0.32, "other": 0.06 } }, "users": 451, "size": 511311464, "checksum": "4918ec6e8de6f0c167130e1a11edf0811c0a033fa22d6552e69f09a3bd9f3731", "avgDurationSecs": 5.468, "validDurationSecs": 44731.12, "totalHrs": 
18.56, "validHrs": 12.42 }, "zh-HK": { "duration": 433845852, "buckets": { "dev": 5563, "invalidated": 3763, "other": 13963, "reported": 612, "test": 5563, "train": 8360, "validated": 84696 }, "reportedSentences": 601, "clips": 102422, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.13, "thirties": 0.11, "": 0.41, "teens": 0.02, "fifties": 0.02, "seventies": 0, "sixties": 0, "twenties": 0.31 }, "gender": { "male": 0.41, "": 0.37, "female": 0.22, "other": 0.01 } }, "users": 2738, "size": 3189158718, "checksum": "0cae705ca0b0d86f608bc61b9872e7f7ba61c4015cc1c279d33daf94fdbcdac3", "avgDurationSecs": 4.236, "validDurationSecs": 358760.894, "totalHrs": 120.51, "validHrs": 99.65 }, "ab": { "duration": 300948948, "buckets": { "dev": 9187, "invalidated": 5218, "other": 11702, "reported": 219, "test": 9184, "train": 20815, "validated": 41777 }, "reportedSentences": 218, "clips": 58697, "splits": { "accent": { "": 1 }, "age": { "seventies": 0.01, "thirties": 0.13, "": 0.19, "teens": 0.28, "twenties": 0.19, "fifties": 0.06, "sixties": 0.05, "fourties": 0.09, "eighties": 0.01 }, "gender": { "male": 0.18, "female": 0.64, "": 0.18 } }, "users": 392, "size": 1718935472, "checksum": "fbfddfeddfe3b6f95bc7ee0d1cf293ac6e5e8b00ba3266c53ed7493476f771d3", "avgDurationSecs": 5.127, "validDurationSecs": 214197.39, "totalHrs": 83.59, "validHrs": 59.49 }, "cv": { "duration": 81658092, "buckets": { "dev": 1205, "invalidated": 1745, "other": 5132, "reported": 142, "test": 1203, "train": 1366, "validated": 9505 }, "reportedSentences": 138, "clips": 16382, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.45, "": 0.23, "fourties": 0.07, "thirties": 0.01, "teens": 0.23, "fifties": 0.01 }, "gender": { "male": 0.51, "": 0.2, "female": 0.29 } }, "users": 101, "size": 573369798, "checksum": "b62162fb5588f579be9629551c28d3ea80d9c09343fe361c9bafdc588c6d2ae8", "avgDurationSecs": 4.985, "validDurationSecs": 47378.84, "totalHrs": 22.68, "validHrs": 13.16 }, "uk": { "duration": 276258228, 
"buckets": { "dev": 5802, "invalidated": 2148, "other": 7714, "reported": 574, "test": 5802, "train": 8915, "validated": 47063 }, "reportedSentences": 575, "clips": 56925, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.28, "teens": 0.06, "": 0.28, "fourties": 0.14, "thirties": 0.24, "fifties": 0, "sixties": 0 }, "gender": { "male": 0.54, "female": 0.17, "": 0.29 } }, "users": 684, "size": 1905832841, "checksum": "0c7aa9dcd83282471753e83131a0e0d94ec8c1052395330f5ce7af972f66fa9e", "avgDurationSecs": 4.853, "validDurationSecs": 228397.734, "totalHrs": 76.73, "validHrs": 63.44 }, "mt": { "duration": 60956460, "buckets": { "dev": 1596, "invalidated": 319, "other": 6235, "reported": 6, "test": 1625, "train": 1952, "validated": 6318 }, "reportedSentences": 7, "clips": 12872, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.19, "": 0.26, "fourties": 0.17, "thirties": 0.09, "teens": 0.03, "fifties": 0.26, "sixties": 0.01 }, "gender": { "male": 0.25, "": 0.26, "female": 0.48, "other": 0.01 } }, "users": 203, "size": 453931362, "checksum": "dfc42d48aa7b4b02e75ae7e13f6ef764b97efcc2bfbc30eaaf17707753b0ab9b", "avgDurationSecs": 4.736, "validDurationSecs": 29919.431, "totalHrs": 16.93, "validHrs": 8.31 }, "as": { "duration": 5118797, "buckets": { "dev": 116, "invalidated": 47, "other": 0, "reported": 9, "test": 294, "train": 508, "validated": 918 }, "reportedSentences": 10, "clips": 965, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.39, "": 0.56, "thirties": 0.05 }, "gender": { "male": 0.46, "": 0.54 } }, "users": 38, "size": 35350705, "checksum": "488309b938dbcb4027d387a39cafa17bf03816ef129ee2b68c3c5ca01cfc6824", "avgDurationSecs": 5.304, "validDurationSecs": 4869.488, "totalHrs": 1.42, "validHrs": 1.35 }, "ka": { "duration": 29256840, "buckets": { "dev": 1348, "invalidated": 349, "other": 7, "reported": 34, "test": 1345, "train": 1655, "validated": 5100 }, "reportedSentences": 35, "clips": 5456, "splits": { "accent": { "": 1 }, "age": { "twenties": 
0.39, "thirties": 0.24, "": 0.34, "fourties": 0.03, "fifties": 0, "teens": 0.01 }, "gender": { "male": 0.46, "female": 0.2, "": 0.34 } }, "users": 127, "size": 193547750, "checksum": "0424f971399d08f1fd9ed0b2badf1310d7e4b5cb4a8bac3c63905d91b98ac896", "avgDurationSecs": 5.362, "validDurationSecs": 27347.853, "totalHrs": 8.12, "validHrs": 7.59 }, "fy-NL": { "duration": 453115107, "buckets": { "dev": 3024, "invalidated": 2900, "other": 52267, "reported": 402, "test": 3024, "train": 3697, "validated": 35880 }, "reportedSentences": 400, "clips": 91047, "splits": { "accent": { "": 1 }, "age": { "": 0.59, "fifties": 0.12, "thirties": 0.04, "twenties": 0.02, "fourties": 0.07, "sixties": 0.14, "seventies": 0.01, "teens": 0, "eighties": 0 }, "gender": { "": 0.6, "male": 0.1, "female": 0.3 } }, "users": 1132, "size": 2839357525, "checksum": "42f0bebb57cb56a715be8423aff6ef97c338b58aaa8efa4b6f766b14937f489a", "avgDurationSecs": 4.977, "validDurationSecs": 178564.588, "totalHrs": 125.86, "validHrs": 49.6 }, "dv": { "duration": 210442743, "buckets": { "dev": 2244, "invalidated": 1538, "other": 14430, "reported": 49, "test": 2243, "train": 2624, "validated": 25774 }, "reportedSentences": 50, "clips": 41742, "splits": { "accent": { "": 1 }, "age": { "": 0.22, "twenties": 0.18, "thirties": 0.36, "fourties": 0.23, "teens": 0.01, "nineties": 0 }, "gender": { "": 0.22, "male": 0.28, "female": 0.5 } }, "users": 300, "size": 1353683004, "checksum": "1e7a5b82997ca76d37ee256e8c666f42eb01cbbf5a9dc6e24fa577c2b4ecff16", "avgDurationSecs": 5.042, "validDurationSecs": 129939.899, "totalHrs": 58.45, "validHrs": 36.09 }, "pa-IN": { "duration": 12071510, "buckets": { "dev": 266, "invalidated": 71, "other": 1227, "reported": 233, "test": 360, "train": 590, "validated": 1216 }, "reportedSentences": 232, "clips": 2514, "splits": { "accent": { "": 1 }, "age": { "": 0.24, "fourties": 0.04, "fifties": 0.01, "thirties": 0.44, "twenties": 0.27, "sixties": 0, "teens": 0 }, "gender": { "": 0.24, "male": 
0.76, "female": 0 } }, "users": 47, "size": 88232608, "checksum": "5728ab2b9391ff9ac7c744646bcb7cd1433796ec69aaac2cc61cb0b08feee7b5", "avgDurationSecs": 4.802, "validDurationSecs": 5838.885, "totalHrs": 3.35, "validHrs": 1.62 }, "vi": { "duration": 60732188, "buckets": { "dev": 0, "invalidated": 285, "other": 10954, "reported": 171, "test": 1120, "train": 2722, "validated": 4066 }, "reportedSentences": 170, "clips": 15305, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.02, "twenties": 0.18, "": 0.24, "teens": 0.21, "seventies": 0, "fourties": 0.02, "sixties": 0.33 }, "gender": { "male": 0.52, "": 0.24, "female": 0.21, "other": 0.02 } }, "users": 200, "size": 352797985, "checksum": "38782df109852d3cbc6a7b788bfa3a745648c1886a4e81acd2a600b529a4fbe5", "avgDurationSecs": 3.968, "validDurationSecs": 16134.406, "totalHrs": 16.87, "validHrs": 4.48 }, "or": { "duration": 32312652, "buckets": { "dev": 306, "invalidated": 146, "other": 5184, "reported": 4, "test": 213, "train": 447, "validated": 1065 }, "reportedSentences": 5, "clips": 6395, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.16, "": 0.09, "thirties": 0.75, "fourties": 0, "teens": 0 }, "gender": { "male": 0.92, "": 0.08, "female": 0 } }, "users": 79, "size": 237971623, "checksum": "bc2310cfae8ad38401c83dc43d75dec3686c82e98b22dca9cd0ab3034d93d50b", "avgDurationSecs": 5.053, "validDurationSecs": 5381.231, "totalHrs": 8.97, "validHrs": 1.49 }, "ga-IE": { "duration": 31746555, "buckets": { "dev": 512, "invalidated": 778, "other": 3811, "reported": 9, "test": 509, "train": 534, "validated": 4412 }, "reportedSentences": 10, "clips": 9001, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.25, "": 0.37, "thirties": 0.26, "fourties": 0.04, "sixties": 0.01, "teens": 0.02, "fifties": 0.05 }, "gender": { "male": 0.49, "": 0.37, "female": 0.13, "other": 0 } }, "users": 153, "size": 221761496, "checksum": "800e4bdc1823758925bb1cd5cc680c64fc97cd1d34aa78fcac40ec37b8caaf3c", "avgDurationSecs": 3.527, 
"validDurationSecs": 15561.138, "totalHrs": 8.81, "validHrs": 4.32 }, "fi": { "duration": 53956767, "buckets": { "dev": 1430, "invalidated": 187, "other": 4976, "reported": 38, "test": 1739, "train": 2334, "validated": 6699 }, "reportedSentences": 39, "clips": 11862, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.13, "": 0.38, "twenties": 0.1, "fourties": 0.37, "teens": 0.01, "fifties": 0.02, "seventies": 0 }, "gender": { "male": 0.23, "": 0.38, "female": 0.39, "other": 0 } }, "users": 171, "size": 322898980, "checksum": "53c0c6e9c8fb523486ca8c4d8c20409dff6cff4a622f6ee5fb8c817748b3418c", "avgDurationSecs": 4.549, "validDurationSecs": 30471.791, "totalHrs": 14.98, "validHrs": 8.46 }, "hu": { "duration": 88030549, "buckets": { "dev": 3865, "invalidated": 745, "other": 2600, "reported": 85, "test": 4020, "train": 6703, "validated": 14621 }, "reportedSentences": 86, "clips": 17966, "splits": { "accent": { "": 1 }, "age": { "teens": 0.08, "": 0.28, "thirties": 0.14, "twenties": 0.4, "fifties": 0.06, "fourties": 0.02, "sixties": 0.01 }, "gender": { "male": 0.6, "": 0.28, "female": 0.11 } }, "users": 197, "size": 572439425, "checksum": "76663918e570d3fdcd8b3dc7e8642e06a8b508ba19347fd8721542af5821fce0", "avgDurationSecs": 4.9, "validDurationSecs": 71640.58, "totalHrs": 24.45, "validHrs": 19.9 }, "th": { "duration": 1224838524, "buckets": { "dev": 10769, "invalidated": 7934, "other": 159310, "reported": 3799, "test": 10769, "train": 30519, "validated": 119980 }, "reportedSentences": 3799, "clips": 287224, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.22, "": 0.4, "thirties": 0.06, "fourties": 0.04, "teens": 0.05, "fifties": 0.24, "eighties": 0, "sixties": 0 }, "gender": { "male": 0.39, "": 0.4, "female": 0.21, "other": 0.01 } }, "users": 7414, "size": 7129246288, "checksum": "b30339fc5e6c55651f5b0978f959b49f8aaaaaa446a853e8aa05e526c5cc9baf", "avgDurationSecs": 4.264, "validDurationSecs": 511642.92, "totalHrs": 340.23, "validHrs": 142.12 }, "lt": { 
"duration": 73294964, "buckets": { "dev": 3370, "invalidated": 551, "other": 1502, "reported": 122, "test": 3647, "train": 5143, "validated": 12163 }, "reportedSentences": 122, "clips": 14216, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.33, "": 0.24, "thirties": 0.28, "fifties": 0.05, "sixties": 0.01, "teens": 0.03, "fourties": 0.05 }, "gender": { "male": 0.62, "": 0.24, "female": 0.14 } }, "users": 249, "size": 450546574, "checksum": "43a12031682ea130a3ea2bcfb3d3622a4776d0dc83ff70b25e3bfd6db76edfb4", "avgDurationSecs": 5.156, "validDurationSecs": 62710.091, "totalHrs": 20.35, "validHrs": 17.41 }, "lg": { "duration": 1719856143, "buckets": { "dev": 12660, "invalidated": 38083, "other": 5946, "reported": 6039, "test": 12716, "train": 54975, "validated": 252176 }, "reportedSentences": 6034, "clips": 296205, "splits": { "accent": { "": 1 }, "age": { "": 0.27, "thirties": 0.22, "twenties": 0.41, "fourties": 0.05, "fifties": 0.03, "teens": 0.01, "nineties": 0, "sixties": 0.01 }, "gender": { "": 0.27, "female": 0.4, "male": 0.34 } }, "users": 486, "size": 10082077359, "checksum": "4628670760d5fc2f646422944229297ce4cd3fd969cba05374184786f052c45e", "avgDurationSecs": 5.806, "validDurationSecs": 1464210.404, "totalHrs": 477.73, "validHrs": 406.72 }, "hi": { "duration": 60712692, "buckets": { "dev": 2175, "invalidated": 617, "other": 3280, "reported": 103, "test": 2693, "train": 4001, "validated": 8887 }, "reportedSentences": 104, "clips": 12784, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.34, "fourties": 0.03, "": 0.31, "thirties": 0.28, "teens": 0.01, "fifties": 0.01, "sixties": 0 }, "gender": { "male": 0.65, "female": 0.04, "": 0.32 } }, "users": 276, "size": 358891308, "checksum": "58430ad90346a5a2bd60fc6608fc5a6a2531c46cc660b924137652a27f3e479d", "avgDurationSecs": 4.749, "validDurationSecs": 42205.389, "totalHrs": 16.86, "validHrs": 11.72 }, "bas": { "duration": 9448128, "buckets": { "dev": 457, "invalidated": 482, "other": 4, "reported": 7, 
"test": 440, "train": 763, "validated": 1660 }, "reportedSentences": 8, "clips": 2146, "splits": { "accent": { "": 1 }, "age": { "": 0.99, "fourties": 0.01 }, "gender": { "": 0.99, "female": 0.01 } }, "users": 30, "size": 52383598, "checksum": "9be95b0329c472a29d5aff9524a74362181f19454058c6697141e5968c05f367", "avgDurationSecs": 4.403, "validDurationSecs": 7308.431, "totalHrs": 2.62, "validHrs": 2.03 }, "sk": { "duration": 68047884, "buckets": { "dev": 2290, "invalidated": 662, "other": 421, "reported": 30, "test": 2217, "train": 2991, "validated": 15929 }, "reportedSentences": 31, "clips": 17012, "splits": { "accent": { "": 1 }, "age": { "": 0.52, "thirties": 0.23, "twenties": 0.04, "fourties": 0.11, "teens": 0.11 }, "gender": { "": 0.52, "male": 0.37, "female": 0.09, "other": 0.02 } }, "users": 133, "size": 380843781, "checksum": "1e15ba6684a14ad6f198969891ae09f6f8c7665650469414dd3c45c04cac1cf6", "avgDurationSecs": 4, "validDurationSecs": 63715.891, "totalHrs": 18.9, "validHrs": 17.69 }, "kmr": { "duration": 191414844, "buckets": { "dev": 2375, "invalidated": 1540, "other": 3652, "reported": 576, "test": 2401, "train": 2835, "validated": 38096 }, "reportedSentences": 577, "clips": 43288, "splits": { "accent": { "": 1 }, "age": { "": 0.5, "twenties": 0.31, "thirties": 0.06, "fourties": 0.04, "fifties": 0.09, "teens": 0.02, "sixties": 0 }, "gender": { "": 0.5, "male": 0.34, "female": 0.16 } }, "users": 295, "size": 1020029130, "checksum": "73b8936cc9e62cf1ed6bd2c8ec8666b0b8ac59669aa917676347a3a24ce8c206", "avgDurationSecs": 4.422, "validDurationSecs": 168456.383, "totalHrs": 53.17, "validHrs": 46.79 }, "bg": { "duration": 42329448, "buckets": { "dev": 600, "invalidated": 346, "other": 1964, "reported": 68, "test": 1700, "train": 3076, "validated": 5387 }, "reportedSentences": 69, "clips": 7697, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.34, "thirties": 0.03, "": 0.43, "twenties": 0.19, "teens": 0.01, "sixties": 0 }, "gender": { "male": 0.55, "female": 
0.02, "": 0.43 } }, "users": 60, "size": 247038973, "checksum": "063a26e5267a0f4c804fcef64341c4e2255eb2987bff3082cfa79ef4f20f542e", "avgDurationSecs": 5.499, "validDurationSecs": 29625.664, "totalHrs": 11.75, "validHrs": 8.22 }, "kk": { "duration": 6257556, "buckets": { "dev": 316, "invalidated": 188, "other": 0, "reported": 1, "test": 336, "train": 401, "validated": 1058 }, "reportedSentences": 2, "clips": 1246, "splits": { "accent": { "": 1 }, "age": { "": 0.54, "thirties": 0.03, "twenties": 0.26, "teens": 0.06, "fifties": 0.11 }, "gender": { "": 0.55, "male": 0.42, "female": 0.03 } }, "users": 75, "size": 35863149, "checksum": "dc3fd78b5a18010dc1edbe0fa2f875aaf288d6c35a7a7aa7b731159f47ab9255", "avgDurationSecs": 5.022, "validDurationSecs": 5313.398, "totalHrs": 1.73, "validHrs": 1.47 }, "ba": { "duration": 956435580, "buckets": { "dev": 14577, "invalidated": 7855, "other": 624, "reported": 862, "test": 14563, "train": 118847, "validated": 207662 }, "reportedSentences": 859, "clips": 216141, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.17, "": 0.3, "fourties": 0.06, "fifties": 0.05, "twenties": 0.17, "sixties": 0.2, "seventies": 0, "teens": 0.04 }, "gender": { "male": 0.3, "": 0.3, "female": 0.4 } }, "users": 866, "size": 5364795092, "checksum": "313d21aec123d5d8842d36e9e7b523087430f036b7a2b78d266b4899f49dd026", "avgDurationSecs": 4.425, "validDurationSecs": 918915.548, "totalHrs": 265.67, "validHrs": 255.25 }, "gl": { "duration": 54358920, "buckets": { "dev": 2240, "invalidated": 290, "other": 3397, "reported": 163, "test": 2258, "train": 3062, "validated": 7618 }, "reportedSentences": 164, "clips": 11305, "splits": { "accent": { "": 1 }, "age": { "": 0.36, "thirties": 0.42, "fifties": 0.09, "twenties": 0.07, "fourties": 0.05, "teens": 0, "sixties": 0.01 }, "gender": { "": 0.37, "male": 0.41, "female": 0.21, "other": 0.01 } }, "users": 130, "size": 312424426, "checksum": "e4b45d11f9df9115c4cc9facf81f1f8cedf985dcc0d11d49b35cba5a9b7d3827", 
"avgDurationSecs": 4.808, "validDurationSecs": 36630.363, "totalHrs": 15.09, "validHrs": 10.17 }, "ug": { "duration": 230330232, "buckets": { "dev": 2742, "invalidated": 1930, "other": 546, "reported": 180, "test": 2744, "train": 3292, "validated": 35711 }, "reportedSentences": 181, "clips": 38187, "splits": { "accent": { "": 1 }, "age": { "": 0.65, "fifties": 0.02, "twenties": 0.14, "thirties": 0.1, "fourties": 0.07, "teens": 0, "eighties": 0.01 }, "gender": { "": 0.65, "male": 0.25, "female": 0.09, "other": 0 } }, "users": 382, "size": 1342350136, "checksum": "24f4eb359b09c1e93b9bc00d7f3784b63295f73b02afe871db53fb3564618004", "avgDurationSecs": 6.032, "validDurationSecs": 215395.892, "totalHrs": 63.98, "validHrs": 59.83 }, "hy-AM": { "duration": 12582828, "buckets": { "dev": 229, "invalidated": 54, "other": 932, "reported": 24, "test": 335, "train": 499, "validated": 1064 }, "reportedSentences": 25, "clips": 2050, "splits": { "accent": { "": 1 }, "age": { "": 0.36, "thirties": 0.18, "twenties": 0.3, "fifties": 0.05, "teens": 0.11 }, "gender": { "": 0.36, "male": 0.29, "female": 0.35 } }, "users": 32, "size": 73751770, "checksum": "eef313003c5d83e0a716625a31114ccf8d996e35ddf36fa9bfefc920f350b999", "avgDurationSecs": 6.138, "validDurationSecs": 6530.795, "totalHrs": 3.49, "validHrs": 1.81 }, "be": { "duration": 3555221004, "buckets": { "dev": 15803, "invalidated": 22178, "other": 40596, "reported": 3083, "test": 15801, "train": 314305, "validated": 677936 }, "reportedSentences": 3082, "clips": 740710, "splits": { "accent": { "": 1 }, "age": { "": 0.77, "fourties": 0.07, "thirties": 0.08, "twenties": 0.06, "teens": 0.01, "fifties": 0, "sixties": 0, "seventies": 0 }, "gender": { "": 0.76, "male": 0.1, "female": 0.14, "other": 0 } }, "users": 6160, "size": 20224853323, "checksum": "90845533f7bc31c0eed59f1a4f88639879a793b8b2637d996d73a930713aad40", "avgDurationSecs": 4.8, "validDurationSecs": 3253921.652, "totalHrs": 987.56, "validHrs": 903.86 }, "ur": { "duration": 
12235824, "buckets": { "dev": 341, "invalidated": 119, "other": 1650, "reported": 25, "test": 341, "train": 469, "validated": 1151 }, "reportedSentences": 25, "clips": 2920, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.33, "": 0.3, "fourties": 0.26, "thirties": 0.08, "teens": 0.03 }, "gender": { "male": 0.57, "": 0.3, "female": 0.13 } }, "users": 48, "size": 71258354, "checksum": "f9a7c509c2152d4dbbf7187b477817c413cd1b725d5749f19cbdff01babee427", "avgDurationSecs": 4.19, "validDurationSecs": 4823.094, "totalHrs": 3.39, "validHrs": 1.33 }, "gn": { "duration": 8325792, "buckets": { "dev": 93, "invalidated": 61, "other": 1277, "reported": 13, "test": 159, "train": 293, "validated": 545 }, "reportedSentences": 14, "clips": 1883, "splits": { "accent": { "": 1 }, "age": { "": 0.58, "twenties": 0.39, "thirties": 0.04 }, "gender": { "": 0.58, "male": 0.31, "female": 0.12 } }, "users": 58, "size": 45681477, "checksum": "f992e3e60efa3b39ceb04a530b3787b0ca170e8e0f85cbca7ac66ff5d5c2c989", "avgDurationSecs": 4.422, "validDurationSecs": 2409.749, "totalHrs": 2.31, "validHrs": 0.66 }, "sr": { "duration": 6546024, "buckets": { "dev": 572, "invalidated": 31, "other": 405, "reported": 15, "test": 598, "train": 706, "validated": 1878 }, "reportedSentences": 16, "clips": 2314, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.69, "": 0.14, "fifties": 0.01, "fourties": 0.11, "thirties": 0.04, "teens": 0 }, "gender": { "male": 0.39, "": 0.14, "female": 0.47 } }, "users": 51, "size": 36210414, "checksum": "03458e5beb7f3a99134aa1cafe086c1a1d39d123a1ad24b55c439935a6939b74", "avgDurationSecs": 2.829, "validDurationSecs": 5312.633, "totalHrs": 1.81, "validHrs": 1.47 }, "uz": { "duration": 817646544, "buckets": { "dev": 10849, "invalidated": 11276, "other": 119461, "reported": 1570, "test": 11598, "train": 39456, "validated": 72541 }, "reportedSentences": 1559, "clips": 203278, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.38, "": 0.42, "thirties": 0.01, "teens": 
0.18, "fifties": 0, "fourties": 0.01, "nineties": 0 }, "gender": { "male": 0.42, "": 0.42, "female": 0.17 } }, "users": 1355, "size": 4555245955, "checksum": "5c66de6387d70c0660d2ad06539bafced11544ad029d2c8b02fddc21389c3e21", "avgDurationSecs": 4.022, "validDurationSecs": 291782.18, "totalHrs": 227.12, "validHrs": 81.05 }, "mr": { "duration": 18138924, "buckets": { "dev": 269, "invalidated": 1064, "other": 1081, "reported": 27, "test": 306, "train": 429, "validated": 1004 }, "reportedSentences": 28, "clips": 3149, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.63, "sixties": 0.02, "twenties": 0.3, "": 0.05 }, "gender": { "male": 0.72, "female": 0.23, "": 0.05 } }, "users": 14, "size": 106764053, "checksum": "511fc22b802978cdee292034d7c6332b7d031134eabb5987b2e31037694be31f", "avgDurationSecs": 5.76, "validDurationSecs": 5783.258, "totalHrs": 5.03, "validHrs": 1.6 }, "da": { "duration": 25078464, "buckets": { "dev": 1259, "invalidated": 205, "other": 93, "reported": 100, "test": 1390, "train": 1748, "validated": 5460 }, "reportedSentences": 101, "clips": 5758, "splits": { "accent": { "": 1 }, "age": { "": 0.41, "thirties": 0.3, "twenties": 0.15, "sixties": 0, "fourties": 0.09, "fifties": 0.05, "teens": 0 }, "gender": { "": 0.41, "female": 0.1, "male": 0.49 } }, "users": 137, "size": 143294716, "checksum": "3105db00bf917f3ea91b5fb43e6270c4014a56f9b247341571f67f6fbb66c37e", "avgDurationSecs": 4.355, "validDurationSecs": 23780.551, "totalHrs": 6.96, "validHrs": 6.6 }, "myv": { "duration": 8153208, "buckets": { "dev": 349, "invalidated": 4, "other": 298, "reported": 9, "test": 294, "train": 479, "validated": 1124 }, "reportedSentences": 10, "clips": 1426, "splits": { "accent": { "": 1 }, "age": { "sixties": 0.29, "": 0.34, "thirties": 0.34, "twenties": 0.04 }, "gender": { "male": 0.66, "": 0.34 } }, "users": 5, "size": 47661126, "checksum": "f61f4fed7c1e8a32c5b58fcde3db2fc2cc8e743bb9f5162191670ddcf23022fc", "avgDurationSecs": 5.718, "validDurationSecs": 
6426.512, "totalHrs": 2.26, "validHrs": 1.78 }, "nn-NO": { "duration": 1388556, "buckets": { "dev": 49, "invalidated": 2, "other": 57, "reported": 4, "test": 73, "train": 107, "validated": 234 }, "reportedSentences": 5, "clips": 293, "splits": { "accent": { "": 1 }, "age": { "": 0.44, "thirties": 0.51, "twenties": 0.05 }, "gender": { "": 0.44, "female": 0.51, "male": 0.05 } }, "users": 13, "size": 8143783, "checksum": "fe619255f01ea4badb373b4d0edbacd2146ae002967a28082724afd543306ea9", "avgDurationSecs": 4.739, "validDurationSecs": 1108.949, "totalHrs": 0.38, "validHrs": 0.3 }, "ha": { "duration": 37282500, "buckets": { "dev": 0, "invalidated": 152, "other": 5614, "reported": 13, "test": 892, "train": 1941, "validated": 2833 }, "reportedSentences": 13, "clips": 8599, "splits": { "accent": { "": 1 }, "age": { "": 0.18, "thirties": 0.76, "twenties": 0.03, "fourties": 0, "fifties": 0.03 }, "gender": { "": 0.18, "male": 0.53, "female": 0.29 } }, "users": 25, "size": 218110720, "checksum": "e3bc71d73407b0db85e5254750f4eb8dd184d457513ddaf9544f324f3ce797db", "avgDurationSecs": 4.336, "validDurationSecs": 12282.977, "totalHrs": 10.35, "validHrs": 3.41 }, "ckb": { "duration": 134665092, "buckets": { "dev": 4137, "invalidated": 1057, "other": 116, "reported": 1602, "test": 4192, "train": 6004, "validated": 33957 }, "reportedSentences": 1603, "clips": 35130, "splits": { "accent": { "": 1 }, "age": { "": 0.45, "thirties": 0.13, "twenties": 0.39, "fourties": 0.02, "teens": 0.01, "fifties": 0 }, "gender": { "": 0.43, "male": 0.51, "female": 0.05, "other": 0.01 } }, "users": 209, "size": 727060125, "checksum": "f1a81c037ea3e44b47bd762740ac30dc9b630166dae9c5f23a8f74967bdb9861", "avgDurationSecs": 3.833, "validDurationSecs": 130168.589, "totalHrs": 37.4, "validHrs": 36.15 }, "ml": { "duration": 9095544, "buckets": { "dev": 0, "invalidated": 0, "other": 1926, "reported": 106, "test": 0, "train": 302, "validated": 302 }, "reportedSentences": 107, "clips": 2228, "splits": { "accent": { 
"": 1 }, "age": { "": 0.48, "twenties": 0.52 }, "gender": { "": 0.48, "male": 0.52 } }, "users": 10, "size": 53097264, "checksum": "64919d4bdbb3b439840afbfe7fc2698a670da9ffba41d56dacf42d1587fd8608", "avgDurationSecs": 4.082, "validDurationSecs": 1232.879, "totalHrs": 2.52, "validHrs": 0.34 }, "mdf": { "duration": 1697688, "buckets": { "dev": 45, "invalidated": 4, "other": 85, "reported": 4, "test": 72, "train": 115, "validated": 232 }, "reportedSentences": 5, "clips": 321, "splits": { "accent": { "": 1 }, "age": { "sixties": 0.06, "": 0.57, "fourties": 0.37 }, "gender": { "male": 0.06, "": 0.57, "female": 0.37 } }, "users": 8, "size": 9972105, "checksum": "f8f2506fa98a1243ec4dd0c848959ab60ed140aed852820ddf5a999a2662f0d7", "avgDurationSecs": 5.289, "validDurationSecs": 1226.989, "totalHrs": 0.47, "validHrs": 0.34 }, "sw": { "duration": 2359703628, "buckets": { "dev": 8805, "invalidated": 7756, "other": 337702, "reported": 199, "test": 8941, "train": 19606, "validated": 99733 }, "reportedSentences": 194, "clips": 445191, "splits": { "accent": { "": 1 }, "age": { "": 0.28, "twenties": 0.45, "thirties": 0.15, "teens": 0, "fifties": 0.06, "fourties": 0.05, "sixties": 0.01 }, "gender": { "": 0.25, "male": 0.41, "female": 0.34 } }, "users": 288, "size": 13818063727, "checksum": "fc84436e79671fa55241525aba63d0d53a85a24967f60a15a691f8f831704630", "avgDurationSecs": 5.3, "validDurationSecs": 528627.762, "totalHrs": 655.47, "validHrs": 146.84 }, "sat": { "duration": 1729728, "buckets": { "dev": 0, "invalidated": 10, "other": 178, "reported": 5, "test": 46, "train": 106, "validated": 152 }, "reportedSentences": 6, "clips": 340, "splits": { "accent": { "": 1 }, "age": { "": 0.15, "twenties": 0.57, "fourties": 0.03, "fifties": 0.03, "teens": 0.03, "thirties": 0.19 }, "gender": { "": 0.1, "male": 0.87, "female": 0.03 } }, "users": 8, "size": 9017013, "checksum": "4e813435cfae5e88f90e96fa9966022be4ea6b4f37fa798b7f61d53a67671191", "avgDurationSecs": 5.087, "validDurationSecs": 
773.29, "totalHrs": 0.48, "validHrs": 0.21 }, "sah": { "duration": 23990556, "buckets": { "dev": 1083, "invalidated": 90, "other": 32, "test": 1165, "train": 1567, "validated": 3881 }, "clips": 4003, "splits": { "accent": { "": 1 }, "age": { "": 0.36, "twenties": 0.02, "fourties": 0.07, "thirties": 0.44, "teens": 0.11, "fifties": 0 }, "gender": { "": 0.36, "male": 0.54, "female": 0.1 } }, "users": 48, "size": 183316054, "checksum": "0e6cef17a832f265a6964dfbecbf6c2355f1c4652bc143a1714e43b35e089d54", "avgDurationSecs": 5.993, "validDurationSecs": 23259.392, "totalHrs": 6.66, "validHrs": 6.46 }, "vot": { "duration": 1025976, "buckets": { "dev": 0, "invalidated": 322, "other": 10, "test": 6, "train": 88, "validated": 94 }, "clips": 426, "splits": { "accent": { "": 1 }, "age": { "": 0.25, "twenties": 0.73, "teens": 0.01 }, "gender": { "": 0.25, "male": 0.75 } }, "users": 5, "size": 7892848, "checksum": "b4a512348d76caac3fca3f8e31047d6f6fbc832e75a0fb7ee9a897bd11446c10", "avgDurationSecs": 2.408, "validDurationSecs": 226.389, "totalHrs": 0.28, "validHrs": 0.06 }, "az": { "duration": 493344, "buckets": { "dev": 15, "invalidated": 20, "other": 0, "test": 18, "train": 39, "validated": 72 }, "clips": 92, "splits": { "accent": { "": 1 }, "age": { "": 0.3, "twenties": 0.66, "fourties": 0.03 }, "gender": { "": 0.3, "male": 0.7 } }, "users": 10, "size": 2871153, "checksum": "9855029a8cedcce51a6b028795ac89040699ff34bbd632b6ab34651fa2f2838b", "avgDurationSecs": 5.362, "validDurationSecs": 386.095, "totalHrs": 0.13, "validHrs": 0.1 }, "mk": { "duration": 691272, "buckets": { "dev": 0, "invalidated": 4, "other": 44, "test": 0, "train": 91, "validated": 91 }, "clips": 139, "splits": { "accent": { "": 1 }, "age": { "": 1 }, "gender": { "": 1 } }, "users": 3, "size": 4058513, "checksum": "d7bbfc7f39bd1643c6570478dbed4f115e93f5ec9bc5f77d27256013ae16f4b8", "avgDurationSecs": 4.973, "validDurationSecs": 452.559, "totalHrs": 0.19, "validHrs": 0.12 }, "ig": { "duration": 32472, "buckets": { 
"dev": 0, "invalidated": 0, "other": 5, "test": 0, "train": 0, "validated": 0 }, "clips": 5, "splits": { "accent": { "": 1 }, "age": { "": 1 }, "gender": { "": 1 } }, "users": 1, "size": 192599, "checksum": "5a7666730a84b5a2b24ea32b1251a6827d7de0ea60120573796a4ecbe8bc1888", "avgDurationSecs": 6.494, "validDurationSecs": 0, "totalHrs": 0, "validHrs": 0 } }, "totalDuration": 65676839253, "totalValidDurationSecs": 50841646, "totalHrs": 18243, "totalValidHrs": 14122 } ================================================ FILE: datasets/scripted-speech/cv-corpus-9.0-2022-04-27.json ================================================ { "date": "2022-04-27", "locales": { "en": { "duration": 10632910099, "buckets": { "dev": 16335, "invalidated": 242945, "other": 267228, "reported": 3903, "test": 16335, "train": 890116, "validated": 1556254 }, "reportedSentences": 3842, "clips": 2066427, "splits": { "accent": { "": 1 }, "age": { "": 0.37, "twenties": 0.24, "sixties": 0.04, "thirties": 0.13, "teens": 0.06, "seventies": 0.01, "fourties": 0.1, "fifties": 0.05, "eighties": 0, "nineties": 0 }, "gender": { "": 0.37, "male": 0.46, "female": 0.15, "other": 0.02 } }, "users": 81085, "size": 76768058135, "checksum": "7c789a325e1d9f16137c1aeab9f45a1fde049a39b8fbf49e9ef4a8827ae67eae", "avgDurationSecs": 5.146, "validDurationSecs": 8007787.777, "totalHrs": 2953.58, "validHrs": 2224.38 }, "eu": { "reportedSentences": 44, "buckets": { "dev": 6558, "invalidated": 5785, "other": 26668, "reported": 43, "test": 6558, "train": 10825, "validated": 69111 }, "duration": 527089551, "clips": 101564, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.13, "thirties": 0.07, "fifties": 0.14, "twenties": 0.35, "": 0.25, "teens": 0.03, "sixties": 0.02, "seventies": 0 }, "gender": { "male": 0.47, "female": 0.26, "": 0.25, "other": 0.02 } }, "users": 1201, "size": 3975713611, "checksum": "d28b239b95019469c3fa3a64587906efac9e152f2cb4a19d3175d9fa7f4f2e18", "avgDurationSecs": 5.19, "validDurationSecs": 
358667.303, "totalHrs": 146.41, "validHrs": 99.62 }, "tr": { "reportedSentences": 306, "buckets": { "dev": 8313, "invalidated": 3262, "other": 151, "reported": 305, "test": 8362, "train": 16862, "validated": 66078 }, "duration": 256673727, "clips": 69491, "splits": { "accent": { "": 1 }, "age": { "": 0.33, "thirties": 0.1, "twenties": 0.28, "teens": 0.02, "fourties": 0.04, "fifties": 0.09, "sixties": 0.09, "eighties": 0, "seventies": 0.04 }, "gender": { "": 0.33, "male": 0.46, "female": 0.2, "other": 0 } }, "users": 1264, "size": 1608944247, "checksum": "7064e9f756fb93b4d0c1b0dae619280d3470c5e95cb7e5406953ac4888dbd8d7", "avgDurationSecs": 3.694, "validDurationSecs": 244067.383, "totalHrs": 71.29, "validHrs": 67.79 }, "ar": { "reportedSentences": 2033, "buckets": { "dev": 10220, "invalidated": 14873, "other": 30302, "reported": 2040, "test": 10421, "train": 28139, "validated": 75322 }, "duration": 502349703, "clips": 120497, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.12, "": 0.58, "twenties": 0.26, "fourties": 0.01, "teens": 0.03, "fifties": 0, "sixties": 0, "nineties": 0 }, "gender": { "female": 0.17, "": 0.57, "male": 0.25, "other": 0 } }, "users": 1237, "size": 2995697303, "checksum": "d29c81a1801ba01af97b7a90548dfd0409b9efa9b9d1291e28deee90267a746b", "avgDurationSecs": 4.169, "validDurationSecs": 314017, "totalHrs": 139.54, "validHrs": 87.23 }, "zh-TW": { "reportedSentences": 139, "buckets": { "dev": 4618, "invalidated": 4482, "other": 37593, "reported": 138, "test": 4618, "train": 6393, "validated": 74894 }, "duration": 380039242, "clips": 116969, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.2, "twenties": 0.33, "teens": 0.05, "": 0.28, "fifties": 0.04, "seventies": 0, "fourties": 0.1, "sixties": 0 }, "gender": { "male": 0.46, "": 0.27, "female": 0.25, "other": 0.02 } }, "users": 2002, "size": 2719746719, "checksum": "b40023ca8f5113a3766ba2d0844c3e846ff48eb1d4408d9ae912d4f9caa183eb", "avgDurationSecs": 3.249, "validDurationSecs": 
243335.063, "totalHrs": 105.56, "validHrs": 67.59 }, "br": { "duration": 71324827, "reportedSentences": 183, "buckets": { "dev": 2161, "invalidated": 756, "other": 11283, "reported": 182, "test": 2150, "train": 2561, "validated": 11152 }, "clips": 23191, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.25, "": 0.33, "fifties": 0.06, "fourties": 0.07, "thirties": 0.08, "sixties": 0.18, "seventies": 0.02, "teens": 0.01 }, "gender": { "male": 0.65, "": 0.34, "female": 0.02 } }, "users": 172, "size": 525880271, "checksum": "7a49666d12df795d95c5d64e827970e40863f26dfb87ceae61d14819788e6cfa", "avgDurationSecs": 3.076, "validDurationSecs": 34298.412, "totalHrs": 19.81, "validHrs": 9.52 }, "pt": { "duration": 484853047, "reportedSentences": 2305, "buckets": { "dev": 8528, "invalidated": 4416, "other": 11900, "reported": 2310, "test": 8539, "train": 17559, "validated": 99173 }, "clips": 115489, "splits": { "accent": { "": 1 }, "age": { "": 0.2, "twenties": 0.42, "teens": 0.03, "thirties": 0.22, "fourties": 0.1, "sixties": 0, "fifties": 0.03, "seventies": 0 }, "gender": { "": 0.2, "male": 0.74, "female": 0.04, "other": 0.02 } }, "users": 2453, "size": 3187978215, "checksum": "1a748f21ba194d57ecd6418cce9692a198eeabcb44ba6e2b7a34435efc6625a4", "avgDurationSecs": 4.198, "validDurationSecs": 416354.209, "totalHrs": 134.68, "validHrs": 115.65 }, "eo": { "duration": 6712729612, "reportedSentences": 2113, "buckets": { "dev": 14911, "invalidated": 127285, "other": 130550, "reported": 2115, "test": 14915, "train": 143973, "validated": 848434 }, "clips": 1106269, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.57, "thirties": 0.12, "": 0.19, "fourties": 0.04, "fifties": 0.02, "seventies": 0, "teens": 0.05, "sixties": 0, "eighties": 0 }, "gender": { "male": 0.69, "": 0.2, "female": 0.11, "other": 0 } }, "users": 1498, "size": 40086922305, "checksum": "5b341a961cf9f680d5c97d0be4f817fedbdfe695a3cf18825363c4c9104899b7", "avgDurationSecs": 6.068, "validDurationSecs": 5148297, 
"totalHrs": 1864.65, "validHrs": 1430.08 }, "zh-CN": { "reportedSentences": 445, "buckets": { "dev": 9740, "invalidated": 6689, "other": 14066, "reported": 458, "test": 9747, "train": 23539, "validated": 48516 }, "duration": 358872900, "clips": 69271, "splits": { "accent": { "": 1 }, "age": { "": 0.39, "teens": 0.09, "twenties": 0.38, "thirties": 0.11, "fourties": 0.03, "nineties": 0, "fifties": 0, "sixties": 0 }, "gender": { "": 0.39, "male": 0.51, "female": 0.1, "other": 0.01 } }, "users": 4233, "size": 2637465454, "checksum": "36faeffbdbbdeab6a8994ceddc3df326503777754141f9135c24fd87d1a301a0", "avgDurationSecs": 5.181, "validDurationSecs": 251347.283, "totalHrs": 99.68, "validHrs": 69.81 }, "id": { "reportedSentences": 261, "buckets": { "dev": 3218, "invalidated": 2454, "other": 22787, "reported": 260, "test": 3622, "train": 5043, "validated": 23132 }, "duration": 196639788, "clips": 48373, "splits": { "accent": { "": 1 }, "age": { "": 0.26, "twenties": 0.39, "thirties": 0.07, "teens": 0.26, "fifties": 0, "fourties": 0.02 }, "gender": { "": 0.26, "male": 0.41, "female": 0.29, "other": 0.04 } }, "users": 416, "size": 1253048208, "checksum": "874e959e2ca1aacc502ff969a3e54de792dd41e4f672ae1fd9d38213f4bf4139", "avgDurationSecs": 4.065, "validDurationSecs": 94033.274, "totalHrs": 54.62, "validHrs": 26.12 }, "ia": { "reportedSentences": 259, "buckets": { "dev": 1790, "invalidated": 328, "other": 2707, "reported": 263, "test": 1722, "train": 5056, "validated": 11338 }, "duration": 60049596, "clips": 14373, "splits": { "accent": { "": 1 }, "age": { "seventies": 0.22, "fourties": 0.3, "": 0.39, "twenties": 0.05, "thirties": 0.02, "teens": 0, "fifties": 0.03, "sixties": 0 }, "gender": { "male": 0.61, "": 0.39, "female": 0.01 } }, "users": 59, "size": 407391806, "checksum": "520a355dd5a0d8980cc5fc22d07e95db39eda4967d13d32aef77fd306a0c6979", "avgDurationSecs": 4.178, "validDurationSecs": 47369.535, "totalHrs": 16.68, "validHrs": 13.15 }, "lv": { "duration": 30488525, 
"reportedSentences": 26, "buckets": { "dev": 1841, "invalidated": 166, "other": 1204, "reported": 25, "test": 2152, "train": 3123, "validated": 7570 }, "clips": 8940, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.49, "fourties": 0.03, "": 0.17, "twenties": 0.28, "teens": 0.03, "fifties": 0 }, "gender": { "male": 0.7, "female": 0.13, "": 0.17 } }, "users": 115, "size": 224727245, "checksum": "9b967d04e672fe34fd7b7d3356fd2224be03b8d25d516f962b66ce66f2be5ca6", "avgDurationSecs": 3.41, "validDurationSecs": 25816.346, "totalHrs": 8.46, "validHrs": 7.17 }, "ja": { "duration": 177935145, "reportedSentences": 129, "buckets": { "dev": 4445, "invalidated": 2195, "other": 310, "reported": 129, "test": 4495, "train": 6212, "validated": 34715 }, "clips": 37220, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.32, "": 0.24, "teens": 0.04, "fifties": 0.01, "thirties": 0.1, "fourties": 0.28, "sixties": 0, "seventies": 0 }, "gender": { "male": 0.52, "": 0.21, "female": 0.26, "other": 0 } }, "users": 619, "size": 1081598283, "checksum": "6d142a6969698a8505fa32e5fd4636e502116ded6d0e6b49e66340480cb70150", "avgDurationSecs": 4.781, "validDurationSecs": 165959.66, "totalHrs": 49.42, "validHrs": 46.09 }, "rw": { "duration": 8580385049, "reportedSentences": 624, "buckets": { "dev": 15988, "invalidated": 227719, "other": 47336, "reported": 623, "test": 16213, "train": 1003811, "validated": 1438343 }, "clips": 1713398, "splits": { "accent": { "": 1 }, "age": { "": 0.05, "twenties": 0.61, "thirties": 0.12, "teens": 0.2, "fourties": 0.02, "fifties": 0 }, "gender": { "": 0.1, "male": 0.57, "female": 0.33, "other": 0 } }, "users": 1068, "size": 60996878145, "checksum": "9bfafd887f7a2a73ba0f819e932cf2836b8dd336ff0c0cd4999b904dda08df87", "avgDurationSecs": 5.008, "validDurationSecs": 7202959.717, "totalHrs": 2383.44, "validHrs": 2000.82 }, "sv-SE": { "reportedSentences": 571, "buckets": { "dev": 5005, "invalidated": 1335, "other": 5357, "reported": 570, "test": 5045, "train": 
7302, "validated": 38398 }, "duration": 177932535, "clips": 45090, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.25, "": 0.18, "teens": 0.03, "fifties": 0.03, "twenties": 0.12, "fourties": 0.38, "sixties": 0, "seventies": 0 }, "gender": { "male": 0.47, "": 0.19, "female": 0.34, "other": 0 } }, "users": 738, "size": 1131052430, "checksum": "064c1aace5a33e08d035568573fc4f36426c3e7110d6a8482124ceca1ebc6c66", "avgDurationSecs": 3.946, "validDurationSecs": 151524.805, "totalHrs": 49.42, "validHrs": 42.09 }, "cnh": { "duration": 20673132, "reportedSentences": 9, "buckets": { "dev": 761, "invalidated": 436, "other": 2907, "reported": 8, "test": 763, "train": 817, "validated": 2458 }, "clips": 5801, "splits": { "accent": { "": 1 }, "age": { "": 0.51, "twenties": 0.36, "fourties": 0.01, "teens": 0.02, "thirties": 0.08, "fifties": 0.02 }, "gender": { "": 0.51, "male": 0.33, "female": 0.16 } }, "users": 298, "size": 161375852, "checksum": "2ee28e4c1e5c953b1ed85e42f80f96d71f5185a78a29123bdbaf2aed4a3685f0", "avgDurationSecs": 3.564, "validDurationSecs": 8759.62, "totalHrs": 5.74, "validHrs": 2.43 }, "et": { "duration": 177339854, "reportedSentences": 461, "buckets": { "dev": 2636, "invalidated": 6094, "other": 129, "reported": 464, "test": 2636, "train": 3135, "validated": 20013 }, "clips": 26236, "splits": { "accent": { "": 1 }, "age": { "": 0.2, "thirties": 0.08, "twenties": 0.68, "fourties": 0.04, "fifties": 0, "seventies": 0, "teens": 0 }, "gender": { "": 0.2, "male": 0.55, "female": 0.24, "other": 0 } }, "users": 793, "size": 1227039248, "checksum": "0d90ac5db32f4cdc1d228a7f28d1aded44fdd2155c6770bba177ddadb6290caa", "avgDurationSecs": 6.759, "validDurationSecs": 135276.052, "totalHrs": 49.26, "validHrs": 37.57 }, "ky": { "duration": 161279904, "reportedSentences": 35, "buckets": { "dev": 1613, "invalidated": 5582, "other": 262, "reported": 34, "test": 1613, "train": 1787, "validated": 29662 }, "clips": 35506, "splits": { "accent": { "": 1 }, "age": { "thirties": 
0.08, "": 0.07, "fourties": 0.01, "twenties": 0.66, "teens": 0.18 }, "gender": { "male": 0.53, "": 0.11, "female": 0.35, "other": 0 } }, "users": 239, "size": 1042617186, "checksum": "aa1f7ea3fcf417af988009de613a4185216793858f606c80f8cb4622a984a5a7", "avgDurationSecs": 4.542, "validDurationSecs": 134734.538, "totalHrs": 44.79, "validHrs": 37.42 }, "ro": { "duration": 134129767, "reportedSentences": 298, "buckets": { "dev": 3801, "invalidated": 855, "other": 18353, "reported": 297, "test": 3827, "train": 5076, "validated": 14653 }, "clips": 33861, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.14, "teens": 0.02, "": 0.11, "fourties": 0.06, "sixties": 0, "twenties": 0.66, "fifties": 0.01, "eighties": 0 }, "gender": { "male": 0.73, "": 0.11, "female": 0.16, "other": 0.01 } }, "users": 346, "size": 844195813, "checksum": "69ee370178e60bd581c38805b46abc5a9c0687fe5bd89b0b983181cb5fa931d1", "avgDurationSecs": 3.961, "validDurationSecs": 58043.279, "totalHrs": 37.25, "validHrs": 16.12 }, "hsb": { "duration": 10103328, "reportedSentences": 57, "buckets": { "dev": 172, "invalidated": 228, "other": 17, "reported": 56, "test": 431, "train": 808, "validated": 1411 }, "clips": 1656, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.55, "": 0.17, "thirties": 0.11, "sixties": 0, "seventies": 0.03, "twenties": 0.11, "fifties": 0.03 }, "gender": { "male": 0.83, "": 0.17, "other": 0 } }, "users": 19, "size": 79356270, "checksum": "8b82e5155d6f1d1b5bbf832db01212956b96fd3540b65f6cc564382b0478232c", "avgDurationSecs": 6.101, "validDurationSecs": 8608.572, "totalHrs": 2.8, "validHrs": 2.39 }, "el": { "duration": 95865010, "reportedSentences": 65, "buckets": { "dev": 1695, "invalidated": 758, "other": 8255, "reported": 64, "test": 1686, "train": 1926, "validated": 14223 }, "clips": 23236, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.4, "fourties": 0.14, "": 0.28, "twenties": 0.14, "fifties": 0.03, "teens": 0.01, "sixties": 0 }, "gender": { "male": 0.67, "": 0.28, 
"other": 0.02, "female": 0.03 } }, "users": 325, "size": 652538106, "checksum": "2265f0030dbdf412d04fbe0c866f4417d804f636d2b7859ea0297f17d264223d", "avgDurationSecs": 4.126, "validDurationSecs": 58679.981, "totalHrs": 26.62, "validHrs": 16.29 }, "cs": { "duration": 250513527, "reportedSentences": 693, "buckets": { "dev": 7051, "invalidated": 1231, "other": 9110, "reported": 696, "test": 7536, "train": 14289, "validated": 47484 }, "clips": 57825, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.2, "": 0.36, "thirties": 0.14, "teens": 0.01, "twenties": 0.27, "fifties": 0.02, "sixties": 0, "seventies": 0 }, "gender": { "male": 0.62, "": 0.35, "female": 0.03 } }, "users": 534, "size": 1768374191, "checksum": "addd26310d181309a55dbc22f00ad3bfabcbbcbddc59d7661a0c5d9af6a6aa99", "avgDurationSecs": 4.332, "validDurationSecs": 205713.52, "totalHrs": 69.58, "validHrs": 57.14 }, "pl": { "duration": 593355386, "reportedSentences": 528, "buckets": { "dev": 8155, "invalidated": 6004, "other": 7115, "reported": 528, "test": 8155, "train": 15997, "validated": 119647 }, "clips": 132766, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.28, "": 0.24, "teens": 0.02, "thirties": 0.33, "fourties": 0.12, "fifties": 0.01, "nineties": 0.01, "sixties": 0 }, "gender": { "male": 0.6, "": 0.25, "female": 0.14, "other": 0.01 } }, "users": 3062, "size": 4272080136, "checksum": "0fe371030664d372c821312fa3baff27d90e553d4553a2f6b1d1566e4e56e8c3", "avgDurationSecs": 4.469, "validDurationSecs": 534724.19, "totalHrs": 164.82, "validHrs": 148.53 }, "rm-sursilv": { "duration": 38420393, "reportedSentences": 10, "buckets": { "dev": 1334, "invalidated": 670, "other": 2160, "reported": 9, "test": 1315, "train": 1526, "validated": 4176 }, "clips": 7006, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.03, "twenties": 0.1, "": 0.64, "teens": 0.06, "fourties": 0.18 }, "gender": { "male": 0.17, "female": 0.19, "": 0.64, "other": 0 } }, "users": 85, "size": 290857567, "checksum": 
"67a9e201cb078d43832af37bdd3a2def823aecb8c74fc5f4045ff2f0e633ed7b", "avgDurationSecs": 5.484, "validDurationSecs": 22900.879, "totalHrs": 10.67, "validHrs": 6.36 }, "rm-vallader": { "duration": 15074402, "reportedSentences": 29, "buckets": { "dev": 375, "invalidated": 391, "other": 724, "reported": 30, "test": 435, "train": 664, "validated": 1481 }, "clips": 2596, "splits": { "accent": { "": 1 }, "age": { "": 0.36, "fourties": 0.41, "twenties": 0.14, "thirties": 0.06, "fifties": 0, "sixties": 0.03 }, "gender": { "": 0.36, "male": 0.44, "female": 0.19, "other": 0.01 } }, "users": 51, "size": 115232789, "checksum": "0e05b05e19d8a6c7df38840cf9af511a15d105d2455ae0323f542a13d1bc51a7", "avgDurationSecs": 5.807, "validDurationSecs": 8599.842, "totalHrs": 4.18, "validHrs": 2.38 }, "mn": { "duration": 67682312, "reportedSentences": 19, "buckets": { "dev": 1852, "invalidated": 742, "other": 3411, "reported": 18, "test": 1867, "train": 2177, "validated": 8208 }, "clips": 12361, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.23, "": 0.27, "twenties": 0.42, "fourties": 0.01, "teens": 0.02, "nineties": 0.06, "fifties": 0 }, "gender": { "male": 0.36, "": 0.27, "female": 0.31, "other": 0.06 } }, "users": 465, "size": 516172228, "checksum": "ce3e8ce228c263b937d063ca7f77d1fd200139b0685d84f9947852c1c774a2ca", "avgDurationSecs": 5.475, "validDurationSecs": 44942.676, "totalHrs": 18.8, "validHrs": 12.48 }, "zh-HK": { "duration": 451153356, "reportedSentences": 615, "buckets": { "dev": 5578, "invalidated": 4045, "other": 15366, "reported": 626, "test": 5578, "train": 8392, "validated": 87390 }, "clips": 106801, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.13, "thirties": 0.11, "": 0.41, "teens": 0.02, "fifties": 0.03, "seventies": 0, "sixties": 0, "twenties": 0.31 }, "gender": { "male": 0.41, "": 0.37, "female": 0.22, "other": 0.01 } }, "users": 2869, "size": 3287174684, "checksum": "02132780a56946a6069129e8c15541616de2ebe789e58599215028dc604bb618", 
"avgDurationSecs": 4.224, "validDurationSecs": 369156.579, "totalHrs": 125.32, "validHrs": 102.54 }, "ab": { "duration": 301136940, "reportedSentences": 218, "buckets": { "dev": 9151, "invalidated": 5268, "other": 11583, "reported": 219, "test": 9115, "train": 21027, "validated": 41903 }, "clips": 58754, "splits": { "accent": { "": 1 }, "age": { "seventies": 0.01, "thirties": 0.13, "": 0.19, "teens": 0.28, "twenties": 0.19, "fifties": 0.06, "sixties": 0.05, "fourties": 0.09, "eighties": 0.01 }, "gender": { "male": 0.18, "female": 0.64, "": 0.18 } }, "users": 397, "size": 1720010158, "checksum": "bdd127169cd25a2c40d27e04a124b98a9962bcc70fdf31ed6877ba4050c8c1b6", "avgDurationSecs": 5.125, "validDurationSecs": 214769.057, "totalHrs": 83.64, "validHrs": 59.65 }, "cv": { "duration": 84820368, "reportedSentences": 138, "buckets": { "dev": 1282, "invalidated": 1889, "other": 1261, "reported": 142, "test": 1266, "train": 1416, "validated": 13793 }, "clips": 16943, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.47, "": 0.22, "fourties": 0.07, "thirties": 0.01, "teens": 0.22, "fifties": 0.01 }, "gender": { "male": 0.51, "": 0.19, "female": 0.29 } }, "users": 102, "size": 590791689, "checksum": "d98d872e530e3142105ae3c8e014c834f51bacce64d3bd5b40a5644933a90c40", "avgDurationSecs": 5.006, "validDurationSecs": 69050.778, "totalHrs": 23.56, "validHrs": 19.18 }, "uk": { "duration": 287407536, "reportedSentences": 579, "buckets": { "dev": 6316, "invalidated": 2238, "other": 7740, "reported": 578, "test": 6320, "train": 10207, "validated": 49567 }, "clips": 59545, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.27, "teens": 0.07, "": 0.27, "fourties": 0.13, "thirties": 0.25, "fifties": 0, "sixties": 0 }, "gender": { "male": 0.56, "female": 0.16, "": 0.27 } }, "users": 696, "size": 1971028073, "checksum": "f806bb2600f1aa2b4314593854fe38fc3f36bab1308d0a27ec1db0eef7649b0c", "avgDurationSecs": 4.827, "validDurationSecs": 239246.441, "totalHrs": 79.83, "validHrs": 66.45 
}, "mt": { "duration": 61045812, "reportedSentences": 8, "buckets": { "dev": 1596, "invalidated": 319, "other": 6237, "reported": 7, "test": 1627, "train": 1951, "validated": 6331 }, "clips": 12887, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.19, "": 0.26, "fourties": 0.17, "thirties": 0.09, "teens": 0.03, "fifties": 0.26, "sixties": 0.01 }, "gender": { "male": 0.25, "": 0.26, "female": 0.48, "other": 0.01 } }, "users": 204, "size": 454454984, "checksum": "cf1045f68501fecb4977d198909b2b09a9e11eed24796baff4464ca4105e8d3c", "avgDurationSecs": 4.737, "validDurationSecs": 29989.993, "totalHrs": 16.95, "validHrs": 8.33 }, "as": { "duration": 5201165, "reportedSentences": 10, "buckets": { "dev": 116, "invalidated": 47, "other": 10, "reported": 9, "test": 299, "train": 508, "validated": 923 }, "clips": 980, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.4, "": 0.55, "thirties": 0.05, "teens": 0.01 }, "gender": { "male": 0.46, "": 0.53, "female": 0 } }, "users": 41, "size": 35835442, "checksum": "4a4710429af3e14f4f7d4dd58b8a2c18164bdf1538b598d6385f09bdccef8d3a", "avgDurationSecs": 5.307, "validDurationSecs": 4898.648, "totalHrs": 1.44, "validHrs": 1.36 }, "ka": { "duration": 29727684, "reportedSentences": 37, "buckets": { "dev": 1353, "invalidated": 362, "other": 0, "reported": 36, "test": 1363, "train": 1686, "validated": 5207 }, "clips": 5569, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.38, "thirties": 0.24, "": 0.34, "fourties": 0.02, "fifties": 0, "teens": 0.01 }, "gender": { "male": 0.46, "female": 0.2, "": 0.35 } }, "users": 132, "size": 196283323, "checksum": "f499863ec4f7697cfb15f46730b630bc2123e8e044950c30b41912d1e897aebb", "avgDurationSecs": 5.338, "validDurationSecs": 27795.304, "totalHrs": 8.25, "validHrs": 7.72 }, "fy-NL": { "duration": 456026103, "reportedSentences": 400, "buckets": { "dev": 3024, "invalidated": 2912, "other": 52677, "reported": 402, "test": 3024, "train": 3699, "validated": 36036 }, "clips": 91625, "splits": 
{ "accent": { "": 1 }, "age": { "": 0.59, "fifties": 0.12, "thirties": 0.04, "twenties": 0.02, "fourties": 0.07, "sixties": 0.14, "seventies": 0.01, "teens": 0, "eighties": 0 }, "gender": { "": 0.6, "male": 0.1, "female": 0.3 } }, "users": 1138, "size": 2856459160, "checksum": "300f2e5662aec1a3b5a402c6aa8a597c555be69b81fbe6d94a54358befc437b2", "avgDurationSecs": 4.977, "validDurationSecs": 179354.506, "totalHrs": 126.67, "validHrs": 49.82 }, "dv": { "duration": 210812751, "reportedSentences": 50, "buckets": { "dev": 2242, "invalidated": 1539, "other": 14415, "reported": 49, "test": 2248, "train": 2623, "validated": 25854 }, "clips": 41808, "splits": { "accent": { "": 1 }, "age": { "": 0.22, "twenties": 0.18, "thirties": 0.36, "fourties": 0.23, "teens": 0.01, "nineties": 0 }, "gender": { "": 0.22, "male": 0.28, "female": 0.5 } }, "users": 308, "size": 1355645101, "checksum": "4fc327c12b947f9a5e9fd9feebfcfa95a3896d79db98dc3bb227266eff04f56f", "avgDurationSecs": 5.042, "validDurationSecs": 130366.266, "totalHrs": 58.55, "validHrs": 36.21 }, "pa-IN": { "duration": 12812318, "reportedSentences": 232, "buckets": { "dev": 279, "invalidated": 74, "other": 1243, "reported": 233, "test": 388, "train": 671, "validated": 1338 }, "clips": 2655, "splits": { "accent": { "": 1 }, "age": { "": 0.23, "fourties": 0.04, "fifties": 0.05, "thirties": 0.41, "twenties": 0.26, "sixties": 0, "teens": 0 }, "gender": { "": 0.23, "male": 0.77, "female": 0 } }, "users": 51, "size": 92598301, "checksum": "a536ccde38a695f9a4d43b5f1a0b2f5a85c6a4d0801cc5d6341f136ab86131c5", "avgDurationSecs": 4.826, "validDurationSecs": 6456.829, "totalHrs": 3.55, "validHrs": 1.79 }, "vi": { "duration": 62852480, "reportedSentences": 178, "buckets": { "dev": 185, "invalidated": 313, "other": 11250, "reported": 179, "test": 1087, "train": 2645, "validated": 4267 }, "clips": 15830, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.02, "twenties": 0.18, "": 0.24, "teens": 0.22, "seventies": 0, "fourties": 0.02, 
"sixties": 0.32 }, "gender": { "male": 0.53, "": 0.24, "female": 0.21, "other": 0.02 } }, "users": 219, "size": 365213095, "checksum": "dd4553ac5cabeee244c71a1ad4909f4a0f723e4b33399bc852578b0e0958ff15", "avgDurationSecs": 3.97, "validDurationSecs": 16941.979, "totalHrs": 17.45, "validHrs": 4.7 }, "or": { "duration": 33971316, "reportedSentences": 5, "buckets": { "dev": 309, "invalidated": 153, "other": 5442, "reported": 4, "test": 218, "train": 477, "validated": 1136 }, "clips": 6731, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.17, "": 0.08, "thirties": 0.74, "fourties": 0, "teens": 0 }, "gender": { "male": 0.92, "": 0.08, "female": 0 } }, "users": 85, "size": 247540742, "checksum": "8d1833393a267c07db96cf403dffbd841caaf79d8152e2e7270742f6b0fd4cd7", "avgDurationSecs": 5.047, "validDurationSecs": 5733.385, "totalHrs": 9.43, "validHrs": 1.59 }, "ga-IE": { "duration": 32223447, "reportedSentences": 10, "buckets": { "dev": 512, "invalidated": 797, "other": 3813, "reported": 9, "test": 512, "train": 532, "validated": 4526 }, "clips": 9136, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.25, "": 0.37, "thirties": 0.26, "fourties": 0.04, "sixties": 0.01, "teens": 0.02, "fifties": 0.05 }, "gender": { "male": 0.49, "": 0.37, "female": 0.14, "other": 0 } }, "users": 157, "size": 224532834, "checksum": "e7dfcbff72b543dac3f3d330f656f441973360ce3813b917af4139616cc30aa0", "avgDurationSecs": 3.527, "validDurationSecs": 15963.586, "totalHrs": 8.95, "validHrs": 4.43 }, "fi": { "duration": 55337331, "reportedSentences": 39, "buckets": { "dev": 1433, "invalidated": 189, "other": 5046, "reported": 38, "test": 1761, "train": 2314, "validated": 6929 }, "clips": 12164, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.13, "": 0.38, "twenties": 0.1, "fourties": 0.36, "teens": 0.01, "fifties": 0.02, "seventies": 0 }, "gender": { "male": 0.23, "": 0.38, "female": 0.38, "other": 0 } }, "users": 178, "size": 330869041, "checksum": 
"6de0884a012e3e3f329f558920cdbfd9cb4051de620b385d11f9eae0f109668d", "avgDurationSecs": 4.549, "validDurationSecs": 31521.898, "totalHrs": 15.37, "validHrs": 8.75 }, "hu": { "duration": 91660753, "reportedSentences": 87, "buckets": { "dev": 4139, "invalidated": 794, "other": 2286, "reported": 86, "test": 4582, "train": 6835, "validated": 15592 }, "clips": 18672, "splits": { "accent": { "": 1 }, "age": { "teens": 0.08, "": 0.29, "thirties": 0.15, "twenties": 0.39, "fifties": 0.06, "fourties": 0.02, "sixties": 0.01 }, "gender": { "male": 0.6, "": 0.29, "female": 0.11 } }, "users": 209, "size": 593607072, "checksum": "b6c20484d966f8127cb1630e92b4ad54bfb5a93e933332f20045853679f39796", "avgDurationSecs": 4.909, "validDurationSecs": 76541.049, "totalHrs": 25.46, "validHrs": 21.26 }, "th": { "duration": 1304114448, "reportedSentences": 3861, "buckets": { "dev": 10826, "invalidated": 8226, "other": 178341, "reported": 3862, "test": 10826, "train": 30991, "validated": 125227 }, "clips": 311794, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.2, "": 0.44, "thirties": 0.05, "fourties": 0.04, "teens": 0.05, "fifties": 0.22, "eighties": 0, "sixties": 0 }, "gender": { "male": 0.36, "": 0.44, "female": 0.19, "other": 0.01 } }, "users": 7460, "size": 7590672263, "checksum": "2a06cc0ebb4f988e83d434c3c6abeed26d983513fe9fd14db3499a65101a759c", "avgDurationSecs": 4.183, "validDurationSecs": 523776.404, "totalHrs": 362.25, "validHrs": 145.49 }, "lt": { "duration": 73483892, "reportedSentences": 122, "buckets": { "dev": 3377, "invalidated": 551, "other": 1512, "reported": 122, "test": 3666, "train": 5143, "validated": 12189 }, "clips": 14252, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.33, "": 0.24, "thirties": 0.28, "fifties": 0.05, "sixties": 0.01, "teens": 0.03, "fourties": 0.05 }, "gender": { "male": 0.62, "": 0.24, "female": 0.14 } }, "users": 252, "size": 451652341, "checksum": "4a1d8d6f0465c785f8acf4c2eb2b3120626895aa36bb187c034c8d9bb5188e26", 
"avgDurationSecs": 5.156, "validDurationSecs": 62846.98, "totalHrs": 20.41, "validHrs": 17.45 }, "lg": { "duration": 1720023723, "reportedSentences": 6034, "buckets": { "dev": 12660, "invalidated": 38088, "other": 5933, "reported": 6039, "test": 12717, "train": 55005, "validated": 252207 }, "clips": 296228, "splits": { "accent": { "": 1 }, "age": { "": 0.27, "thirties": 0.22, "twenties": 0.41, "fourties": 0.05, "fifties": 0.03, "teens": 0.01, "nineties": 0, "sixties": 0.01 }, "gender": { "": 0.27, "female": 0.4, "male": 0.34 } }, "users": 487, "size": 10083037056, "checksum": "249e7fd866620807d9567948abebdf268f445081f22ba25c4cb13a62c87812bd", "avgDurationSecs": 5.806, "validDurationSecs": 1464419.377, "totalHrs": 477.78, "validHrs": 406.78 }, "hi": { "duration": 62575584, "reportedSentences": 104, "buckets": { "dev": 2064, "invalidated": 638, "other": 3295, "reported": 103, "test": 2839, "train": 4217, "validated": 9140 }, "clips": 13073, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.34, "fourties": 0.03, "": 0.33, "thirties": 0.28, "teens": 0.01, "fifties": 0.01, "sixties": 0 }, "gender": { "male": 0.63, "female": 0.04, "": 0.33 } }, "users": 299, "size": 369876544, "checksum": "18a2826d53ced567c72cfddf98fd9ac27e79506af6d897a5339cfc46f0f19787", "avgDurationSecs": 4.787, "validDurationSecs": 43749.777, "totalHrs": 17.38, "validHrs": 12.15 }, "bas": { "duration": 9991980, "reportedSentences": 8, "buckets": { "dev": 457, "invalidated": 483, "other": 109, "reported": 7, "test": 444, "train": 763, "validated": 1664 }, "clips": 2256, "splits": { "accent": { "": 1 }, "age": { "": 0.98, "fourties": 0.01, "teens": 0.01 }, "gender": { "": 0.98, "female": 0.02 } }, "users": 32, "size": 55579555, "checksum": "b289a4d3740c92cb17d5caa25bcbb3303f7c583f9019ead72553c2c044eaafad", "avgDurationSecs": 4.429, "validDurationSecs": 7369.971, "totalHrs": 2.77, "validHrs": 2.04 }, "sk": { "duration": 68353272, "reportedSentences": 31, "buckets": { "dev": 2258, "invalidated": 
691, "other": 211, "reported": 30, "test": 2366, "train": 2879, "validated": 16180 }, "clips": 17082, "splits": { "accent": { "": 1 }, "age": { "": 0.52, "thirties": 0.23, "twenties": 0.04, "fourties": 0.11, "teens": 0.11 }, "gender": { "": 0.52, "male": 0.38, "female": 0.09, "other": 0.02 } }, "users": 136, "size": 382625533, "checksum": "7e603516b30793784726c7752b539282cc2c851d0e47b2dfe3c371ecdd751d0e", "avgDurationSecs": 4.001, "validDurationSecs": 64743.938, "totalHrs": 18.98, "validHrs": 17.98 }, "kmr": { "duration": 191911824, "reportedSentences": 578, "buckets": { "dev": 2396, "invalidated": 1542, "other": 3403, "reported": 577, "test": 2380, "train": 2835, "validated": 38448 }, "clips": 43393, "splits": { "accent": { "": 1 }, "age": { "": 0.5, "twenties": 0.31, "thirties": 0.06, "fourties": 0.04, "fifties": 0.09, "teens": 0.02, "sixties": 0 }, "gender": { "": 0.5, "male": 0.34, "female": 0.16 } }, "users": 299, "size": 1022772194, "checksum": "b469b04d1da7f47d5a9d1b32c7f533cc1921678c8e43bb22300e670e7201474a", "avgDurationSecs": 4.423, "validDurationSecs": 170041.846, "totalHrs": 53.3, "validHrs": 47.23 }, "bg": { "duration": 46011060, "reportedSentences": 146, "buckets": { "dev": 902, "invalidated": 375, "other": 2109, "reported": 145, "test": 1813, "train": 3149, "validated": 5875 }, "clips": 8359, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.33, "thirties": 0.07, "": 0.41, "twenties": 0.18, "teens": 0.01, "sixties": 0 }, "gender": { "male": 0.52, "female": 0.06, "": 0.41 } }, "users": 65, "size": 268561655, "checksum": "93d3702a7a91e2e58bafd8c66fcb406be321d70504c5cba95611b34b13ca07df", "avgDurationSecs": 5.504, "validDurationSecs": 32338.196, "totalHrs": 12.78, "validHrs": 8.98 }, "kk": { "duration": 6555816, "reportedSentences": 8, "buckets": { "dev": 341, "invalidated": 191, "other": 7, "reported": 7, "test": 365, "train": 401, "validated": 1112 }, "clips": 1310, "splits": { "accent": { "": 1 }, "age": { "": 0.53, "thirties": 0.03, 
"twenties": 0.28, "teens": 0.06, "fifties": 0.1 }, "gender": { "": 0.54, "male": 0.43, "female": 0.03 } }, "users": 79, "size": 37614021, "checksum": "028c7a95b3402f5b51df4edc086670593625d1ce26f8cbdbf6c14273c88c3b6a", "avgDurationSecs": 5.004, "validDurationSecs": 5564.937, "totalHrs": 1.82, "validHrs": 1.54 }, "ba": { "duration": 958205736, "reportedSentences": 863, "buckets": { "dev": 14560, "invalidated": 7884, "other": 60, "reported": 866, "test": 14509, "train": 118993, "validated": 208559 }, "clips": 216503, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.17, "": 0.3, "fourties": 0.06, "fifties": 0.05, "twenties": 0.17, "sixties": 0.2, "seventies": 0, "teens": 0.04 }, "gender": { "male": 0.3, "": 0.3, "female": 0.4 } }, "users": 888, "size": 5375041473, "checksum": "12b85d10cfae0784937717b2213b667210f9e2bb7e37c38d06c62b34920a0d57", "avgDurationSecs": 4.426, "validDurationSecs": 923046.933, "totalHrs": 266.16, "validHrs": 256.4 }, "gl": { "duration": 56512476, "reportedSentences": 166, "buckets": { "dev": 2382, "invalidated": 304, "other": 3305, "reported": 165, "test": 2375, "train": 3291, "validated": 8106 }, "clips": 11715, "splits": { "accent": { "": 1 }, "age": { "": 0.37, "thirties": 0.41, "fifties": 0.09, "twenties": 0.07, "fourties": 0.04, "teens": 0, "sixties": 0.01 }, "gender": { "": 0.38, "male": 0.4, "female": 0.21, "other": 0.01 } }, "users": 140, "size": 324986608, "checksum": "4ab20faa2d8e9811bf6ab4135daefa61a998225db481a3918a8e117e2a1e5b0e", "avgDurationSecs": 4.824, "validDurationSecs": 39102.871, "totalHrs": 15.69, "validHrs": 10.86 }, "ug": { "duration": 235420236, "reportedSentences": 181, "buckets": { "dev": 2746, "invalidated": 1932, "other": 62, "reported": 180, "test": 2746, "train": 3292, "validated": 37038 }, "clips": 39032, "splits": { "accent": { "": 1 }, "age": { "": 0.64, "fifties": 0.02, "twenties": 0.14, "thirties": 0.12, "fourties": 0.07, "teens": 0, "eighties": 0.01 }, "gender": { "": 0.64, "male": 0.27, "female": 
0.09, "other": 0 } }, "users": 385, "size": 1372345697, "checksum": "4d904c8244451208b4563a525413aae4d0d44a7e1643bfdc1e42bf8a12de66f4", "avgDurationSecs": 6.031, "validDurationSecs": 223393.49, "totalHrs": 65.39, "validHrs": 62.05 }, "hy-AM": { "duration": 16162056, "reportedSentences": 26, "buckets": { "dev": 344, "invalidated": 89, "other": 1246, "reported": 25, "test": 369, "train": 595, "validated": 1309 }, "clips": 2644, "splits": { "accent": { "": 1 }, "age": { "": 0.37, "thirties": 0.14, "twenties": 0.36, "fifties": 0.04, "teens": 0.08 }, "gender": { "": 0.37, "male": 0.23, "female": 0.4 } }, "users": 60, "size": 94866525, "checksum": "7d9535bd096dd60bac0f692b5621a3995b3f6c26add8b1bd63e3e76bca730a30", "avgDurationSecs": 6.113, "validDurationSecs": 8001.563, "totalHrs": 4.48, "validHrs": 2.22 }, "be": { "duration": 3797989380, "reportedSentences": 3097, "buckets": { "dev": 15866, "invalidated": 25367, "other": 574, "reported": 3098, "test": 15877, "train": 346139, "validated": 771750 }, "clips": 797691, "splits": { "accent": { "": 1 }, "age": { "": 0.78, "fourties": 0.07, "thirties": 0.08, "twenties": 0.06, "teens": 0.01, "fifties": 0, "sixties": 0, "seventies": 0 }, "gender": { "": 0.78, "male": 0.1, "female": 0.13, "other": 0 } }, "users": 6316, "size": 21530210404, "checksum": "6ec07dc257128cf65b80ce4a5de429bdb2e43d6052b589c4859783faada0d9e0", "avgDurationSecs": 4.761, "validDurationSecs": 3674478.343, "totalHrs": 1054.99, "validHrs": 1020.68 }, "ur": { "duration": 48370572, "reportedSentences": 35, "buckets": { "dev": 2864, "invalidated": 903, "other": 121, "reported": 37, "test": 2796, "train": 3674, "validated": 10411 }, "clips": 11435, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.71, "": 0.18, "fourties": 0.07, "thirties": 0.02, "teens": 0.01, "fifties": 0 }, "gender": { "male": 0.61, "": 0.18, "female": 0.21 } }, "users": 108, "size": 283099171, "checksum": "53ee69b3a42ceabacbb88d5a7831d5e6f8748bb84fde4087b6a1e5877f653d4b", 
"avgDurationSecs": 4.23, "validDurationSecs": 44039.005, "totalHrs": 13.43, "validHrs": 12.23 }, "gn": { "duration": 8978220, "reportedSentences": 21, "buckets": { "dev": 100, "invalidated": 62, "other": 1430, "reported": 20, "test": 163, "train": 295, "validated": 558 }, "clips": 2050, "splits": { "accent": { "": 1 }, "age": { "": 0.6, "twenties": 0.37, "thirties": 0.03 }, "gender": { "": 0.6, "male": 0.29, "female": 0.11 } }, "users": 62, "size": 49435800, "checksum": "9dc4b54c18770a50f6d0d56faec37cbe3e95ddf70890791d7277f15d8a002af6", "avgDurationSecs": 4.38, "validDurationSecs": 2443.828, "totalHrs": 2.49, "validHrs": 0.67 }, "sr": { "duration": 6616980, "reportedSentences": 19, "buckets": { "dev": 516, "invalidated": 38, "other": 0, "reported": 18, "test": 742, "train": 1037, "validated": 2297 }, "clips": 2335, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.69, "": 0.15, "fifties": 0.01, "fourties": 0.11, "thirties": 0.04, "teens": 0 }, "gender": { "male": 0.39, "": 0.15, "female": 0.46 } }, "users": 53, "size": 36628440, "checksum": "c80dbab34150c44621bcd2340f2aad60f7711b989b726f0aab6dc5655c532299", "avgDurationSecs": 2.834, "validDurationSecs": 6509.295, "totalHrs": 1.83, "validHrs": 1.8 }, "uz": { "duration": 833326524, "reportedSentences": 1664, "buckets": { "dev": 10819, "invalidated": 12183, "other": 115624, "reported": 1681, "test": 12007, "train": 44479, "validated": 78437 }, "clips": 206244, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.38, "": 0.42, "thirties": 0.01, "teens": 0.18, "fifties": 0, "fourties": 0.01, "nineties": 0 }, "gender": { "male": 0.42, "": 0.42, "female": 0.16 } }, "users": 1503, "size": 4644040593, "checksum": "7ed210d6c97d6fee3bd57c6d1f85e11a32d7a7237a89d09470ab04e8e98373e4", "avgDurationSecs": 4.04, "validDurationSecs": 316923.802, "totalHrs": 231.47, "validHrs": 88.03 }, "mr": { "duration": 88055640, "reportedSentences": 38, "buckets": { "dev": 1794, "invalidated": 2094, "other": 1896, "reported": 37, "test": 
1806, "train": 2125, "validated": 10331 }, "clips": 14321, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.14, "sixties": 0, "twenties": 0.26, "": 0.05, "teens": 0.55 }, "gender": { "male": 0.19, "female": 0.76, "": 0.05 } }, "users": 70, "size": 516203389, "checksum": "3312d72c9a10f8762cbe0a5c7903faf3cb6b74a446c3028408c47b32d4f6b4cc", "avgDurationSecs": 6.149, "validDurationSecs": 63522.297, "totalHrs": 24.45, "validHrs": 17.64 }, "da": { "duration": 30950316, "reportedSentences": 114, "buckets": { "dev": 1602, "invalidated": 264, "other": 387, "reported": 113, "test": 1583, "train": 2038, "validated": 6509 }, "clips": 7160, "splits": { "accent": { "": 1 }, "age": { "": 0.37, "thirties": 0.3, "twenties": 0.21, "sixties": 0, "fourties": 0.07, "fifties": 0.04, "teens": 0 }, "gender": { "": 0.37, "female": 0.09, "male": 0.54 } }, "users": 169, "size": 177172753, "checksum": "a57080a05c26bfa96cd4c9c0d83b01961781930fcd1a1bd7c3bb47ea377bae37", "avgDurationSecs": 4.323, "validDurationSecs": 28136.258, "totalHrs": 8.59, "validHrs": 7.81 }, "myv": { "duration": 8582580, "reportedSentences": 15, "buckets": { "dev": 351, "invalidated": 4, "other": 324, "reported": 14, "test": 297, "train": 522, "validated": 1173 }, "clips": 1501, "splits": { "accent": { "": 1 }, "age": { "sixties": 0.3, "": 0.34, "thirties": 0.32, "twenties": 0.03 }, "gender": { "male": 0.66, "": 0.34 } }, "users": 5, "size": 50163913, "checksum": "3b5c0c8f99951e6ae6c20064a7d1e9204f92b3a9fa10c54cc636931cbde0cf9a", "avgDurationSecs": 5.718, "validDurationSecs": 6707.106, "totalHrs": 2.38, "validHrs": 1.86 }, "nn-NO": { "duration": 2319192, "reportedSentences": 5, "buckets": { "dev": 96, "invalidated": 15, "other": 14, "reported": 4, "test": 150, "train": 234, "validated": 485 }, "clips": 514, "splits": { "accent": { "": 1 }, "age": { "": 0.39, "thirties": 0.34, "twenties": 0.27 }, "gender": { "": 0.39, "female": 0.29, "male": 0.27, "other": 0.05 } }, "users": 18, "size": 13594849, "checksum": 
"3efa85f8616fffda1e35a76d8a1fe8a93b6136e31fd1ae092ec5e6eb179061fe", "avgDurationSecs": 4.512, "validDurationSecs": 2188.343, "totalHrs": 0.64, "validHrs": 0.6 }, "ha": { "duration": 38472084, "reportedSentences": 14, "buckets": { "dev": 0, "invalidated": 153, "other": 5876, "reported": 14, "test": 896, "train": 1941, "validated": 2837 }, "clips": 8866, "splits": { "accent": { "": 1 }, "age": { "": 0.18, "thirties": 0.77, "twenties": 0.02, "fourties": 0, "fifties": 0.03 }, "gender": { "": 0.18, "male": 0.54, "female": 0.28 } }, "users": 27, "size": 225075429, "checksum": "395c80ab6bacae21042ff7360976b7bde7c3cec805ca97dac1761138dfbac9e7", "avgDurationSecs": 4.339, "validDurationSecs": 12310.546, "totalHrs": 10.68, "validHrs": 3.41 }, "ckb": { "duration": 189151776, "reportedSentences": 1750, "buckets": { "dev": 4070, "invalidated": 1414, "other": 3434, "reported": 1749, "test": 4311, "train": 6403, "validated": 45048 }, "clips": 49896, "splits": { "accent": { "": 1 }, "age": { "": 0.41, "thirties": 0.11, "twenties": 0.42, "fourties": 0.01, "teens": 0.01, "fifties": 0.03 }, "gender": { "": 0.38, "male": 0.56, "female": 0.06, "other": 0 } }, "users": 250, "size": 1015400895, "checksum": "6aa0b9d17f31a6f8fa0b23fb90e1134e75c23f104acc2b69c704f0adcdd9ea30", "avgDurationSecs": 3.791, "validDurationSecs": 170773.393, "totalHrs": 52.54, "validHrs": 47.43 }, "ml": { "duration": 9444672, "reportedSentences": 107, "buckets": { "dev": 0, "invalidated": 1, "other": 1914, "reported": 106, "test": 24, "train": 366, "validated": 390 }, "clips": 2305, "splits": { "accent": { "": 1 }, "age": { "": 0.47, "twenties": 0.53 }, "gender": { "": 0.47, "male": 0.53 } }, "users": 16, "size": 54752972, "checksum": "57e0639b25530a0feb23530b857858782828833cd203a09018aadf99cf7376d8", "avgDurationSecs": 4.097, "validDurationSecs": 1598.014, "totalHrs": 2.62, "validHrs": 0.44 }, "mdf": { "duration": 1791720, "reportedSentences": 6, "buckets": { "dev": 47, "invalidated": 5, "other": 98, "reported": 5, 
"test": 73, "train": 116, "validated": 236 }, "clips": 339, "splits": { "accent": { "": 1 }, "age": { "sixties": 0.06, "": 0.59, "fourties": 0.35 }, "gender": { "male": 0.06, "": 0.59, "female": 0.35 } }, "users": 10, "size": 10525562, "checksum": "a2d5ad04d92814618b20fb80881afa151784fb893f94e020976d7618f3f1a6b5", "avgDurationSecs": 5.285, "validDurationSecs": 1247.333, "totalHrs": 0.49, "validHrs": 0.34 }, "sw": { "duration": 2589824160, "reportedSentences": 363, "buckets": { "dev": 9008, "invalidated": 10070, "other": 353864, "reported": 368, "test": 8934, "train": 19861, "validated": 120101 }, "clips": 484035, "splits": { "accent": { "": 1 }, "age": { "": 0.29, "twenties": 0.45, "thirties": 0.14, "teens": 0, "fifties": 0.06, "fourties": 0.05, "sixties": 0.01 }, "gender": { "": 0.26, "male": 0.39, "female": 0.35, "other": 0 } }, "users": 569, "size": 15139558447, "checksum": "d124fdb0e12074adfa137f69f28f35b9d8d7cdcdf13f4feae0d39bd252c07a65", "avgDurationSecs": 5.35, "validDurationSecs": 642599.133, "totalHrs": 719.39, "validHrs": 178.49 }, "sat": { "duration": 2042928, "reportedSentences": 7, "buckets": { "dev": 0, "invalidated": 12, "other": 242, "reported": 6, "test": 47, "train": 109, "validated": 156 }, "clips": 410, "splits": { "accent": { "": 1 }, "age": { "": 0.13, "twenties": 0.62, "fourties": 0.02, "fifties": 0.02, "teens": 0.02, "thirties": 0.18 }, "gender": { "": 0.09, "male": 0.89, "female": 0.02 } }, "users": 8, "size": 10811844, "checksum": "4c00be164ac7ac124d3bed39ef87c82cec3f8bc911aefee451b802d65bd443e8", "avgDurationSecs": 4.983, "validDurationSecs": 777.309, "totalHrs": 0.56, "validHrs": 0.21 }, "tig": { "duration": 74016, "reportedSentences": 1, "buckets": { "dev": 0, "invalidated": 8, "other": 0, "reported": 0, "test": 0, "train": 10, "validated": 10 }, "clips": 18, "splits": { "accent": { "": 1 }, "age": { "": 0.72, "twenties": 0.28 }, "gender": { "": 0.72, "male": 0.28 } }, "users": 4, "size": 436155, "checksum": 
"37fdd8b0cfded4c466b3c32d4d7cc670165383ef77c303b5e69b2c937b981729", "avgDurationSecs": 4.112, "validDurationSecs": 41.12, "totalHrs": 0.02, "validHrs": 0.01 }, "ig": { "duration": 29924424, "reportedSentences": 5, "buckets": { "dev": 0, "invalidated": 1, "other": 5481, "reported": 5, "test": 2, "train": 5, "validated": 7 }, "clips": 5489, "splits": { "accent": { "": 1 }, "age": { "": 1, "twenties": 0.33, "teens": 0.05, "eighties": 0, "thirties": 0.04, "sixties": 0.02 }, "gender": { "": 1, "male": 0.14, "female": 0.31 } }, "users": 96, "size": 174792302, "checksum": "921d62787743f854bf771f3f3b9492b167292992d9d3565a535b3f943bdc0c9c", "avgDurationSecs": 5.452, "validDurationSecs": 38.162, "totalHrs": 8.31, "validHrs": 0.01 }, "nan-tw": { "duration": 11365740, "reportedSentences": 20, "buckets": { "dev": 161, "invalidated": 82, "other": 3211, "reported": 19, "test": 285, "train": 503, "validated": 950 }, "clips": 4243, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.16, "": 0.2, "twenties": 0.52, "fourties": 0.1, "teens": 0.01, "fifties": 0 }, "gender": { "male": 0.43, "": 0.2, "other": 0.27, "female": 0.09 } }, "users": 55, "size": 65348573, "checksum": "275abb62dd72bfe1a6db9092851da16e308d8b172402bfee5c82e7094de48feb", "avgDurationSecs": 2.679, "validDurationSecs": 2544.769, "totalHrs": 3.15, "validHrs": 0.7 }, "mhr": { "duration": 236876904, "reportedSentences": 32, "buckets": { "dev": 9152, "invalidated": 1405, "other": 6723, "reported": 31, "test": 9074, "train": 21828, "validated": 41094 }, "clips": 49222, "splits": { "accent": { "": 1 }, "age": { "fifties": 0.1, "": 0.11, "sixties": 0.11, "thirties": 0.19, "fourties": 0.21, "twenties": 0.24, "teens": 0.05 }, "gender": { "male": 0.23, "": 0.11, "female": 0.66 } }, "users": 180, "size": 1341912846, "checksum": "39621e9ca4d12e2872546131f1eca3197324943ee29bba657f10489f098cc67a", "avgDurationSecs": 4.812, "validDurationSecs": 197761.56, "totalHrs": 65.79, "validHrs": 54.93 }, "bn": { "duration": 1438112808, 
"reportedSentences": 693, "buckets": { "dev": 7748, "invalidated": 5844, "other": 192522, "reported": 717, "test": 7748, "train": 14503, "validated": 32754 }, "clips": 231120, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.02, "twenties": 0.22, "": 0.72, "teens": 0.04, "fourties": 0 }, "gender": { "male": 0.24, "": 0.72, "female": 0.04, "other": 0 } }, "users": 19863, "size": 8262390506, "checksum": "599a5f7c9e55a297928da390345a19180b279a1f013081e7255a657fc99f98d5", "avgDurationSecs": 6.222, "validDurationSecs": 203807.316, "totalHrs": 399.47, "validHrs": 56.61 }, "tok": { "duration": 25490268, "reportedSentences": 88, "buckets": { "dev": 1076, "invalidated": 129, "other": 1787, "reported": 87, "test": 1326, "train": 2242, "validated": 5250 }, "clips": 7166, "splits": { "accent": { "": 1 }, "age": { "": 0.51, "twenties": 0.23, "teens": 0.16, "thirties": 0.09 }, "gender": { "": 0.51, "male": 0.37, "other": 0.01, "female": 0.11 } }, "users": 50, "size": 148247234, "checksum": "9e6d7dd7728199a2de17d97c79e87fe8c522b8c1575b595d2dcfca9a38e36d53", "avgDurationSecs": 3.557, "validDurationSecs": 18674.84, "totalHrs": 7.08, "validHrs": 5.18 }, "yue": { "duration": 58318992, "reportedSentences": 96, "buckets": { "dev": 1648, "invalidated": 877, "other": 5871, "reported": 95, "test": 1636, "train": 1854, "validated": 6882 }, "clips": 13630, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.35, "": 0.36, "twenties": 0.14, "fourties": 0.08, "sixties": 0, "fifties": 0, "teens": 0.06 }, "gender": { "male": 0.15, "": 0.49, "female": 0.31, "other": 0.05 } }, "users": 332, "size": 331489433, "checksum": "6aec37d62a600b641d5377d25b956e5f6918cf9abe05debbdfdfe3ff8ff35624", "avgDurationSecs": 4.279, "validDurationSecs": 29446.17, "totalHrs": 16.19, "validHrs": 8.17 }, "sah": { "duration": 24207744, "reportedSentences": 1, "buckets": { "dev": 1083, "invalidated": 99, "other": 26, "reported": 0, "test": 1191, "train": 1570, "validated": 3912 }, "clips": 4037, "splits": { 
"accent": { "": 1 }, "age": { "": 0.37, "twenties": 0.03, "fourties": 0.07, "thirties": 0.43, "teens": 0.1, "fifties": 0 }, "gender": { "": 0.37, "male": 0.54, "female": 0.1 } }, "users": 52, "size": 184519417, "checksum": "3e6bfeba4c17c80eb7c2e5b5cd95ba958747001ed9a49d52d86333c5332ada5b", "avgDurationSecs": 5.996, "validDurationSecs": 23458.185, "totalHrs": 6.72, "validHrs": 6.51 }, "fa": { "buckets": { "dev": 9852, "invalidated": 13220, "other": 34785, "reported": 2078, "test": 9852, "train": 24150, "validated": 290215 }, "reportedSentences": 2070, "duration": 1356155252, "clips": 338220, "splits": { "accent": { "": 1 }, "age": { "": 0.24, "twenties": 0.31, "thirties": 0.37, "fifties": 0.02, "fourties": 0.03, "teens": 0.03, "sixties": 0 }, "gender": { "": 0.21, "male": 0.72, "female": 0.07, "other": 0 } }, "users": 4058, "size": 10032969547, "checksum": "62f07bed6cb1d28e51575c684bcb6eea1b50e553f50640d8c82576fb150c01fe", "avgDurationSecs": 4.01, "validDurationSecs": 1163670.382, "totalHrs": 376.7, "validHrs": 323.24 }, "fr": { "buckets": { "dev": 16041, "invalidated": 54752, "other": 1757, "reported": 6205, "test": 16041, "train": 445537, "validated": 612070 }, "duration": 3334982374, "reportedSentences": 6129, "clips": 668579, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.18, "thirties": 0.17, "": 0.35, "teens": 0.03, "fourties": 0.13, "fifties": 0.09, "sixties": 0.03, "seventies": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "male": 0.62, "": 0.27, "female": 0.1, "other": 0.01 } }, "users": 16291, "size": 24260798182, "checksum": "6e5326e0cd1841ed3e5fceeb46e71d39c1cb653968f0e888c62d0369cc06cb9d", "avgDurationSecs": 4.988, "validDurationSecs": 3053106.157, "totalHrs": 926.38, "validHrs": 848.08 }, "es": { "buckets": { "dev": 15449, "invalidated": 47995, "other": 197465, "reported": 1690, "test": 15449, "train": 215745, "validated": 290972 }, "duration": 2704659930, "reportedSentences": 1677, "clips": 536432, "splits": { "accent": { "": 1 }, "age": 
{ "thirties": 0.11, "": 0.32, "fifties": 0.08, "twenties": 0.25, "teens": 0.03, "fourties": 0.06, "sixties": 0.15, "eighties": 0, "seventies": 0, "nineties": 0 }, "gender": { "male": 0.51, "": 0.32, "other": 0.01, "female": 0.17 } }, "users": 23013, "size": 19800467043, "checksum": "61b8bd40123962d5e98e7740b11a5f0b3abb8691c44632e0f24b30833d41e3f1", "avgDurationSecs": 5.042, "validDurationSecs": 1467064.435, "totalHrs": 751.29, "validHrs": 407.51 }, "sl": { "buckets": { "dev": 1223, "invalidated": 225, "other": 905, "reported": 28, "test": 1219, "train": 1384, "validated": 9291 }, "duration": 40129030, "reportedSentences": 29, "clips": 10421, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.52, "teens": 0.09, "": 0.17, "sixties": 0.08, "fifties": 0.08, "fourties": 0.02, "thirties": 0.05 }, "gender": { "female": 0.15, "male": 0.67, "": 0.17, "other": 0 } }, "users": 128, "size": 289537155, "checksum": "394a3c8c1fe1c24ae7b63ccab3428a5d87e5f51730984038f09254a2793e88b5", "avgDurationSecs": 3.851, "validDurationSecs": 35777.643, "totalHrs": 11.14, "validHrs": 9.93 }, "kab": { "buckets": { "dev": 14875, "invalidated": 19148, "other": 101261, "reported": 4807, "test": 14875, "train": 140213, "validated": 596320 }, "duration": 2382988048, "reportedSentences": 4802, "clips": 716729, "splits": { "accent": { "": 1 }, "age": { "fourties": 0.09, "thirties": 0.3, "": 0.27, "fifties": 0.19, "twenties": 0.12, "eighties": 0, "teens": 0, "sixties": 0.03, "seventies": 0 }, "gender": { "male": 0.55, "": 0.25, "female": 0.2, "other": 0 } }, "users": 1441, "size": 17975235335, "checksum": "9eb1cbf3d9d83c9659b581cbe4088bc9f275443fcfeafbdc09f6dfc9ef875b0c", "avgDurationSecs": 3.325, "validDurationSecs": 1982650.95, "totalHrs": 661.94, "validHrs": 550.73 }, "cy": { "buckets": { "dev": 5192, "invalidated": 4283, "other": 17960, "reported": 152, "test": 5202, "train": 7578, "validated": 87106 }, "duration": 529187805, "reportedSentences": 153, "clips": 109349, "splits": { "accent": { 
"": 1 }, "age": { "fourties": 0.16, "twenties": 0.13, "sixties": 0.07, "fifties": 0.09, "": 0.42, "thirties": 0.09, "seventies": 0.01, "eighties": 0, "teens": 0.02 }, "gender": { "male": 0.33, "female": 0.25, "": 0.41, "other": 0.01 } }, "users": 1705, "size": 3908225447, "checksum": "dcc085ccdf2790020c305db1f56d19f67035cdf784746fd3c658c8d90b8388fb", "avgDurationSecs": 4.839, "validDurationSecs": 421544.165, "totalHrs": 146.99, "validHrs": 117.09 }, "ca": { "buckets": { "dev": 16230, "invalidated": 56440, "other": 456885, "reported": 3754, "test": 16230, "train": 657479, "validated": 803439 }, "duration": 7362990642, "reportedSentences": 3686, "clips": 1316764, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.07, "fifties": 0.17, "fourties": 0.11, "twenties": 0.05, "": 0.34, "sixties": 0.22, "teens": 0.01, "seventies": 0.03, "eighties": 0, "nineties": 0 }, "gender": { "male": 0.45, "": 0.34, "female": 0.21, "other": 0 } }, "users": 26053, "size": 47095083457, "checksum": "30993c465a078f4144f1399553d92506d72cbb42ae3f00e34f69f704f0f5b39f", "avgDurationSecs": 5.592, "validDurationSecs": 4492615.107, "totalHrs": 2045.27, "validHrs": 1247.94 }, "de": { "buckets": { "dev": 16033, "invalidated": 45660, "other": 5246, "reported": 7276, "test": 16033, "train": 439325, "validated": 765790 }, "duration": 4197654857, "reportedSentences": 7212, "clips": 816696, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.2, "fourties": 0.17, "": 0.31, "thirties": 0.15, "teens": 0.03, "sixties": 0.03, "fifties": 0.1, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "male": 0.59, "": 0.31, "female": 0.09, "other": 0.01 } }, "users": 16645, "size": 30220261910, "checksum": "a8f088b7a55f9754c165c218655b0a7131d3f2e17f56d8cbc9ba6eca32228a81", "avgDurationSecs": 5.14, "validDurationSecs": 3936008.151, "totalHrs": 1166.01, "validHrs": 1093.33 }, "tt": { "buckets": { "dev": 3069, "invalidated": 378, "other": 0, "reported": 3, "test": 5055, "train": 9840, "validated": 28452 
}, "duration": 107943146, "reportedSentences": 4, "clips": 28830, "splits": { "accent": { "": 1 }, "age": { "": 0.19, "thirties": 0.73, "twenties": 0.05, "sixties": 0, "fifties": 0.01, "teens": 0, "fourties": 0, "seventies": 0.01 }, "gender": { "": 0.19, "male": 0.79, "female": 0.02 } }, "users": 215, "size": 800243818, "checksum": "14d199b9b94afb46db45f8db6e159b6cc63dab09d778db4fcfa15f0246498f52", "avgDurationSecs": 3.744, "validDurationSecs": 106527.866, "totalHrs": 29.98, "validHrs": 29.59 }, "ta": { "buckets": { "dev": 11563, "invalidated": 5515, "other": 76526, "reported": 3229, "test": 11712, "train": 39758, "validated": 128031 }, "duration": 1302133236, "reportedSentences": 3229, "clips": 210072, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.08, "thirties": 0.09, "": 0.71, "fourties": 0.03, "seventies": 0.02, "fifties": 0.03, "teens": 0.04, "sixties": 0, "eighties": 0 }, "gender": { "male": 0.16, "": 0.7, "other": 0, "female": 0.13 } }, "users": 730, "size": 7812683439, "checksum": "c8ac4adfb3e1bc0f8cec4ed15cd61ec12bc0784036f5edb0cc7528a2277dcc77", "avgDurationSecs": 6.199, "validDurationSecs": 793601.338, "totalHrs": 361.7, "validHrs": 220.44 }, "ru": { "buckets": { "dev": 9457, "invalidated": 6549, "other": 18733, "reported": 290, "test": 9456, "train": 21909, "validated": 115007 }, "duration": 725508648, "reportedSentences": 284, "clips": 140289, "splits": { "accent": { "": 1 }, "age": { "twenties": 0.41, "teens": 0.08, "": 0.2, "fourties": 0.14, "thirties": 0.13, "fifties": 0.03, "sixties": 0, "seventies": 0 }, "gender": { "male": 0.63, "": 0.21, "other": 0, "female": 0.16 } }, "users": 2585, "size": 5132638847, "checksum": "f3e386715e55b5e293b77ccda252d1c4c5458fb5f1290f54e241e7fe45c0bd60", "avgDurationSecs": 5.172, "validDurationSecs": 594762.049, "totalHrs": 201.53, "validHrs": 165.21 }, "nl": { "buckets": { "dev": 10368, "invalidated": 4995, "other": 2233, "reported": 310, "test": 10552, "train": 29031, "validated": 82976 }, "duration": 
388289017, "reportedSentences": 311, "clips": 90204, "splits": { "accent": { "": 1 }, "age": { "": 0.41, "twenties": 0.22, "fourties": 0.14, "thirties": 0.11, "teens": 0.02, "fifties": 0.08, "sixties": 0.01, "nineties": 0, "eighties": 0, "seventies": 0 }, "gender": { "": 0.42, "male": 0.47, "female": 0.11, "other": 0 } }, "users": 1488, "size": 2678633936, "checksum": "23467ca7212bedde6f5b53689cb76e8a78d7502e937d990c64f304cf4f472dbc", "avgDurationSecs": 4.305, "validDurationSecs": 357175.618, "totalHrs": 107.85, "validHrs": 99.21 }, "it": { "buckets": { "dev": 14935, "invalidated": 16988, "other": 27, "reported": 5204, "test": 14934, "train": 145968, "validated": 212264 }, "duration": 1226597196, "reportedSentences": 5200, "clips": 229279, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.16, "twenties": 0.2, "": 0.29, "fifties": 0.16, "fourties": 0.14, "seventies": 0, "sixties": 0.02, "teens": 0.01, "eighties": 0, "nineties": 0 }, "gender": { "female": 0.12, "male": 0.59, "": 0.29, "other": 0 } }, "users": 6640, "size": 8536806025, "checksum": "3c97337d99684cda8c3c383b68555f1124fd8646df61e03b7578212580d71f1b", "avgDurationSecs": 5.35, "validDurationSecs": 1135570.319, "totalHrs": 340.72, "validHrs": 315.43 }, "vot": { "duration": 1025976, "buckets": { "dev": 0, "invalidated": 324, "other": 0, "test": 6, "train": 96, "validated": 102 }, "clips": 426, "splits": { "accent": { "": 1 }, "age": { "": 0.25, "twenties": 0.73, "teens": 0.01 }, "gender": { "": 0.25, "male": 0.75 } }, "users": 5, "size": 7892306, "checksum": "5ca8a934d111c6ea330df3c1f52c45fbec0dbbae0ff4fe83e97c32b2f6294b60", "avgDurationSecs": 2.408, "validDurationSecs": 245.656, "totalHrs": 0.28, "validHrs": 0.06 }, "az": { "duration": 604476, "buckets": { "dev": 15, "invalidated": 28, "other": 0, "test": 26, "train": 39, "validated": 80 }, "clips": 108, "splits": { "accent": { "": 1 }, "age": { "": 0.41, "twenties": 0.56, "fourties": 0.03 }, "gender": { "": 0.41, "male": 0.59 } }, "users": 13, 
"size": 3529914, "checksum": "8fabb36856ca872e56a5cb9ada9029086c05600a809524afda3e8883a687640a", "avgDurationSecs": 5.597, "validDurationSecs": 447.76, "totalHrs": 0.16, "validHrs": 0.12 }, "mk": { "duration": 729000, "buckets": { "dev": 0, "invalidated": 7, "other": 9, "test": 15, "train": 114, "validated": 129 }, "clips": 145, "splits": { "accent": { "": 1 }, "age": { "thirties": 0.38, "": 1, "twenties": 0.14, "teens": 0.1 }, "gender": { "male": 0.62, "": 1 } }, "users": 5, "size": 4286542, "checksum": "e49da4f1b244b00f67879b48877ad85b9659ff2b050c34d1427e220a9b67ebb9", "avgDurationSecs": 5.028, "validDurationSecs": 648.559, "totalHrs": 0.2, "validHrs": 0.18 } }, "totalDuration": 72782088097, "totalValidDurationSecs": 53904443, "totalHrs": 20217, "totalValidHrs": 14973 } ================================================ FILE: datasets/spontaneous-speech/.gitkeep ================================================ ================================================ FILE: datasets/spontaneous-speech/CHANGELOG.md ================================================ # Spontaneous Speech (SPS) Changelog ## Dataset Changes in Corpus 3.0 The following changes affect SPS datasets starting with Corpus 3.0. For bundler internals, see the [SPS Bundler documentation](https://github.com/common-voice/spontaneous-speech/tree/main/spontaneous-speech-bundler). ### New files in archive - **Datasheets**: each locale archive now includes a `README.md` datasheet with language description, statistics, and community context (generated from [cv-datasheets](https://github.com/common-voice/cv-datasheets), schema v2.0.0). Full releases only. - **QA summary**: `ss-corpus-{locale}.qa-summary.json` with processing metrics per locale. 
### TSV field changes

- added `age`, `gender`, `accents`, `variant` columns -- demographics are cross-referenced from SCS profiles when the contributor has one, with SCS data taking priority
- added `char_per_sec` column -- characters per second of transcription relative to audio duration
- added `quality_tags` column -- pipe-separated quality flags (see [Quality Tags](README.md#quality-tags) in the README)
- disfluency markers in `transcription` are now standardized to `[disfluency]` tags

### Statistics structure

The per-locale statistics JSON differs from SCS (relevant for programmatic consumers):

- `duration` is a nested object: `total_ms`, `total_hrs`, `validated_ms`, `validated_hrs`, `avg_ms`, `min_ms`, `max_ms`, `avg_chars_per_sec`
- `buckets` contains per-split detail: `train`, `dev`, `test`, each with `clips`, `users`, `duration_ms`, `duration_hrs`
- `demographics` replaces the SCS `splits` key, with `age`, `gender`, `accent`, and `variant` breakdowns
- SPS-specific objects: `questions`, `audios`, `transcriptions`, `reported` (with `reasons` breakdown)
- accents and variants use machine tokens in statistics; non-predefined values are grouped as `user_defined`
- `generated_at` timestamp is included

## Current Release

### [Corpus 3.0](sps-corpus-3.0-2026-03-09.json)

- **Date released**: 18 March 2026
- **Clip cut-off date**: 09 March 2026
- **Total hours**: 508
- **Total validated hours**: 269
- **Number of languages**: 72

**New languages since last release**: Danish (`da`), Esperanto (`eo`), Croatian (`hr`), Javanese (`jv`), Dutch (`nl`), Pashto (`ps`), Rakhine (`rki`), Sinhala (`si`), Sundanese (`su`), Tudaga (`tuq`)

## Past Releases

### [Corpus 2.0](sps-corpus-2.0-2025-12-05.json)

- **Date released**: 17 December 2025
- **Clip cut-off date**: 05 December 2025
- **Total hours**: 454
- **Total validated hours**: 268
- **Number of languages**: 62

**New languages since last release**: Spanish (`es`), Tashlhiyt (`shi`), Shona (`sn`), Thai (`th`)

### [Corpus 1.0](sps-corpus-1.0-2025-09-05.json)

First Spontaneous Speech release.

- **Date released**: 17 September 2025
- **Clip cut-off date**: 05 September 2025
- **Total hours**: 428
- **Total validated hours**: 263
- **Number of languages**: 58

**Languages**: Arvanitika (`aat`), Adyghe (`ady`), Gheg Albanian (`aln`), Aragonese (`an`), Bashkir (`ba`), Basaa (`bas`), Betawi (`bew`), Breton (`br`), Bodo (`brx`), Sabah Bisaya (`bsy`), Bukusu (`bxk`), Catalan (`ca`), Eastern Min (`cdo`), Chiga (`cgg`), Heng Hua (`cpx`), Welsh (`cy`), German (`de`), Cypriot Greek (`el-CY`), English (`en`), French (`fr`), Frisian (`fy-NL`), Irish (`ga-IE`), Galician (`gl`), Alsatian (`gsw`), Manx (`gv`), Gorani (`hac`), Wixarika (`hch`), Georgian (`ka`), Kabardian (`kbd`), Nubi (`kcn`), Konzo (`koo`), Kelabit (`kzi`), Lendu (`led`), Ligurian (`lij`), Kenyi (`lke`), Thur (`lth`), Latvian (`lv`), Mixteco Yucuhiti (`meh`), Melanau (`mel`), Michoacan Mazahua (`mmc`), Sabah Malay (`msi`), Bahasa Malay (`ms-MY`), Western Penan (`pne`), Puno Quechua (`qxp`), Russian (`ru`), Ruuli (`ruc`), Amba (`rwm`), Scots (`sco`), Serian Bidayuh (`sdo`), Sena (`seh`), Sa'ban (`snv`), Toba Qom (`tob`), Papantla Totonac (`top`), Turkish (`tr`), Rutoro (`ttj`), Kuku (`ukv`), Ushojo (`ush`), Kenyah (`xkl`)

================================================
FILE: datasets/spontaneous-speech/README.md
================================================
# Spontaneous Speech (SPS)

Spontaneous Speech is a newer Common Voice modality where contributors respond to open-ended questions in their own words, producing natural, unscripted audio. The community validates recordings, transcribes the audio, and reviews the transcriptions. Releases are produced using the SPS Bundler.

All audio contributions are released under the [CC-0 license](https://creativecommons.org/publicdomain/zero/1.0/).
Clips are only removed at the request of the contributor, and problematic content flagged by the community via the Report button is also excluded from the datasets.

## Release History

See the full [Changelog](CHANGELOG.md) for detailed release notes and new languages per release.

### Total and Validated Hours

```mermaid
---
config:
    xyChart:
        width: 600
        height: 350
---
xychart-beta
    title "Spontaneous Speech: Total vs Validated Hours"
    x-axis ["v1.0","v2.0","v3.0"]
    y-axis "Hours" 0 --> 600
    bar [428,454,508]
    bar [263,268,269]
```

### Contributors

```mermaid
---
config:
    xyChart:
        width: 600
        height: 350
---
xychart-beta
    title "Spontaneous Speech: Total Contributors"
    x-axis ["v1.0","v2.0","v3.0"]
    y-axis "Users" 0 --> 2500
    bar [1060,1571,2043]
```

### Dataset Count

```mermaid
---
config:
    xyChart:
        width: 600
        height: 250
---
xychart-beta
    title "Spontaneous Speech: Dataset Count per Release"
    x-axis ["v1.0","v2.0","v3.0"]
    y-axis "Datasets" 0 --> 85
    line [58,62,72]
```

### Release Summary

| Release | Clip Cut-off Date | Languages | Total Hours | Validated Hours |
| ------- | ----------------- | --------: | ----------: | --------------: |
| v1.0    | 2025-09-05        |        58 |         428 |             263 |
| v2.0    | 2025-12-05        |        62 |         454 |             268 |
| v3.0    | 2026-03-09        |        72 |         508 |             269 |

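The table can also be read as a ratio: validated hours grow more slowly than total hours, so the validated share drops from roughly 61% in v1.0 to roughly 53% in v3.0. A minimal JavaScript sketch of that arithmetic follows. The figures are copied from the table; `releases` and `validatedSharePct` are illustrative names and are not part of the `helpers/` scripts.

```javascript
// Figures copied from the Release Summary table.
const releases = [
  { release: 'v1.0', totalHrs: 428, validatedHrs: 263 },
  { release: 'v2.0', totalHrs: 454, validatedHrs: 268 },
  { release: 'v3.0', totalHrs: 508, validatedHrs: 269 },
];

// Hypothetical helper: percentage of total hours that are validated,
// rounded to one decimal place. Returns 0 for an empty release.
function validatedSharePct({ totalHrs, validatedHrs }) {
  if (totalHrs === 0) return 0;
  return Math.round((validatedHrs / totalHrs) * 1000) / 10;
}

const shares = releases.map((r) => ({
  release: r.release,
  pct: validatedSharePct(r),
}));

console.log(shares);
```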
## About the Statistics

Statistics for each release are stored as JSON files in this directory. Durations are measured in milliseconds and file sizes in bytes unless otherwise noted.

Key differences from Scripted Speech statistics:

- Demographics are under a `demographics` object (not `splits`)
- Duration is a nested object with `total_ms`, `validated_ms`, `avg_ms`, etc.
- Buckets (`train`/`dev`/`test`) include per-bucket `clips`, `users`, `duration_ms`, `duration_hrs`
- SPS-specific fields: `questions`, `audios`, `transcriptions`, `reported.reasons`

## Archive Structure

Each release produces a full data archive per locale.

Naming: `sps-corpus-{version}-{YYYY-MM-DD}-{locale}.tar.gz`

```txt
sps-corpus-{version}-{YYYY-MM-DD}-{locale}/
├── README.md                              (locale-specific datasheet)
├── audios/
│   └── spontaneous-speech-{locale}-*.mp3
├── ss-corpus-{locale}.tsv                 (main data file)
├── ss-reported-audios-{locale}.tsv        (reported/flagged audios)
└── ss-corpus-{locale}.qa-summary.json     (quality assurance summary)
```

## TSV Fields

### Main Data File: `ss-corpus-{locale}.tsv`

Each row represents a single audio recording:

- `client_id` -- hashed UUID of the speaker
- `audio_id` -- numeric identifier for the audio
- `audio_file` -- filename (e.g., `spontaneous-speech-en-1.mp3`)
- `duration_ms` -- audio duration in milliseconds
- `prompt_id` -- numeric identifier for the question/prompt
- `prompt` -- the question text asked to the speaker
- `transcription` -- transcription of the speaker's response (may contain `[disfluency]`, `[noise]`, etc. tags)
- `votes` -- number of validation votes received
- `age` -- age bracket of the speaker\* (since v3.0, cross-referenced from SCS profiles when available)
- `gender` -- gender of the speaker\* (since v3.0, cross-referenced from SCS profiles when available)
- `accents` -- accent codes (comma-separated; since v3.0, cross-referenced from SCS profiles when available)
- `variant` -- language variant codes of the speaker (since v3.0, cross-referenced from SCS profiles when available)
- `language` -- language name
- `prompt_upvotes` -- number of upvotes on the prompt
- `prompt_reports` -- number of reports on the prompt
- `is_edited` -- `0` or `1`, indicates if transcription was edited
- `split` -- dataset partition: `train`, `dev`, `test`, or `unassigned`
- `char_per_sec` -- characters per second of transcription relative to audio duration
- `quality_tags` -- pipe-separated quality flags applied during post-processing (see [Quality Tags](#quality-tags) below)

\*For a full list of age and gender options, see the [demographics spec](https://github.com/common-voice/spontaneous-speech/blob/main/web/src/stores/demographics.ts). These are only reported if the speaker opted in.

### Reported Audios File: `ss-reported-audios-{locale}.tsv`

Each row represents a reported audio clip:

- `client_id` -- hashed UUID of the reporter
- `audio_id` -- numeric identifier
- `audio_file` -- filename
- `duration_ms` -- audio duration
- `prompt_id` -- prompt identifier
- `prompt` -- prompt text
- `reason` -- report reason: `other`, `different_language`, `personally_identifiable_information`, `offensive_speech`
- `comment` -- free-text comment from the reporter
- `language` -- language name

Note: reported audios (regardless of reason) are excluded from the main corpus TSV. However, their metadata and audio files are still present in the release archive under `ss-reported-audios-{locale}.tsv` and `audios/`.
`personally_identifiable_information` reports will be excluded in the following releases.

### QA Summary File: `ss-corpus-{locale}.qa-summary.json`

Quality assurance metadata documenting the processing pipeline:

- Whether disfluency markers were applied to transcriptions
- How many rows were affected by each processing step
- Quality tagging results and problem clip counts

### Quality Tags

The `quality_tags` field contains pipe-separated flags assigned during the post-processing QA step. A clip may have zero, one, or multiple tags. Tags are informational and do not exclude clips from the dataset.

#### Audio duration tags

| Tag           | Condition             | Description                     |
| ------------- | --------------------- | ------------------------------- |
| `short-audio` | duration < 2,000 ms   | Audio is shorter than 2 seconds |
| `long-audio`  | duration > 300,000 ms | Audio is longer than 5 minutes  |

#### Transcription quality tags

| Tag                    | Condition        | Description                                                 |
| ---------------------- | ---------------- | ----------------------------------------------------------- |
| `transcription-length` | chars/sec < 3.0  | Transcription is unusually short relative to audio duration |
| `speech-rate`          | chars/sec > 30.0 | Transcription is unusually long relative to audio duration  |

#### Script and language tags

| Tag                               | Description                                                               |
| --------------------------------- | ------------------------------------------------------------------------- |
| `non-allowed-script`              | Transcription uses a writing system not in the language's allowed scripts |
| `mixed-script-words`              | A single word/token contains characters from multiple writing systems     |
| `mixed-script-transcription`      | Transcription contains tokens from multiple writing systems               |
| `dataset-language-audio-mismatch` | Audio language verification did not match the expected dataset language   |

## Data Pipeline

SPS data goes through a multi-stage community pipeline:

1. **Question submission** -- community members submit open-ended prompts/questions
2. **Question validation** -- questions are upvoted or reported by the community
3. **Recording** -- contributors answer validated questions spontaneously
4. **Transcription** -- community members transcribe the audio
5. **Transcription validation** -- transcriptions are reviewed, possibly edited, and accepted when ready

### Question status categories in statistics (`questions`)

- `validated` -- questions accepted by the community (sufficient upvotes)
- `invalidated` -- questions rejected (reported or downvoted)
- `other` -- questions not yet fully validated
- `has_audio` -- questions that have at least one recording
- `avg_recordings_per_question` -- average number of audio recordings per question

### Audio status categories in statistics (`audios`)

- `transcribed_validated` -- audio with reviewed and accepted transcriptions
- `transcribed_pending` -- audio transcribed but not yet validated
- `not_transcribed` -- audio without any transcription yet

### Transcription status categories in statistics (`transcriptions`)

- `validated` -- transcriptions that have been reviewed and accepted
- `not_yet_validated` -- transcriptions awaiting review
- `edited` -- transcriptions that were modified during validation
- `edited_pct` -- percentage of transcriptions that were edited

Dataset split (`train`/`dev`/`test`) is assigned only to audio with validated transcriptions. Other audio has the `split` field set to `unassigned`.
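
The status categories above can be cross-checked within a single locale's statistics object. The JavaScript sketch below is illustrative only: `summarizeLocale` is a hypothetical helper, not one of the scripts in `helpers/`, and the sample values are copied from the `ady` entry of `sps-corpus-1.0-2025-09-05.json`. It assumes that the three `audios` categories add up to `audios.total` and that the two `transcriptions` categories add up to `transcriptions.total`. Both hold for the sample entry, but the bundler may not guarantee them.

```javascript
// Hypothetical helper: summarizes how far a locale has progressed
// through the pipeline, using the field names documented in this README.
function summarizeLocale(stats) {
  const { audios, transcriptions, buckets } = stats;

  // Assumption: each audio is counted in exactly one status category.
  const audioSum =
    audios.transcribed_validated +
    audios.transcribed_pending +
    audios.not_transcribed;

  // Clips that received a train/dev/test split.
  const splitClips = ['train', 'dev', 'test'].reduce(
    (sum, name) => sum + buckets[name].clips,
    0
  );

  return {
    audiosConsistent: audioSum === audios.total,
    transcriptionsConsistent:
      transcriptions.validated + transcriptions.not_yet_validated ===
      transcriptions.total,
    splitClips,
    // Audio counted in the statistics that has no train/dev/test split.
    clipsWithoutSplit: audios.total - splitClips,
    validatedShare:
      audios.total > 0 ? audios.transcribed_validated / audios.total : 0,
  };
}

// Sample values from the `ady` locale in sps-corpus-1.0-2025-09-05.json.
const ady = {
  audios: { total: 783, transcribed_validated: 685, transcribed_pending: 16, not_transcribed: 82 },
  transcriptions: { total: 701, validated: 685, not_yet_validated: 16 },
  buckets: {
    train: { clips: 272 },
    dev: { clips: 215 },
    test: { clips: 198 },
  },
};

const summary = summarizeLocale(ady);
console.log(summary);
```

For `ady`, the split clips equal `transcribed_validated`. That is not true of every locale: the `an` entry in the same file has 19 validated transcriptions but zero clips in every bucket, so `splitClips` should be treated as at most `transcribed_validated`, not always equal to it.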
================================================ FILE: datasets/spontaneous-speech/sps-corpus-1.0-2025-09-05.json ================================================ { "locales": { "aat": { "locale": "aat", "clips": 334, "users": 5, "questions": { "total": 111, "validated": 0, "invalidated": 0, "other": 111, "has_audio": 111, "avg_recordings_per_question": 3.01 }, "audios": { "total": 334, "transcribed_validated": 0, "transcribed_pending": 194, "not_transcribed": 140 }, "transcriptions": { "total": 194, "validated": 0, "not_yet_validated": 194, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 8290188, "total_hrs": 2.3, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 24821, "min_ms": 3996, "max_ms": 193536, "avg_chars_per_sec": 6.45 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0.7006, "eighties": 0.2994, "nineties": 0 }, "gender": { "": 0.1587, "male_masculine": 0, "female_feminine": 0.8413, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 334 }, "variant": { "": 334 } }, "reported": { "clips": 0, "reasons": {} }, "size": 49122938, "checksum": "a6a91a2be715dffe04d0275cf2cbddf49ee5c54a9eb1109a5ec95b9c0863af1e", "generated_at": "2026-03-10T09:37:08.423Z" }, "ady": { "locale": "ady", "clips": 783, "users": 22, "questions": { "total": 127, "validated": 33, "invalidated": 0, "other": 94, "has_audio": 127, "avg_recordings_per_question": 6.17 }, "audios": { "total": 783, "transcribed_validated": 685, "transcribed_pending": 16, "not_transcribed": 82 }, "transcriptions": { "total": 701, "validated": 685, "not_yet_validated": 16, "edited": 327, "edited_pct": 46.65 }, "duration": { "total_ms": 20066400, 
"total_hrs": 5.57, "validated_ms": 16741944, "validated_hrs": 4.65, "avg_ms": 25628, "min_ms": 1116, "max_ms": 132048, "avg_chars_per_sec": 7.69 }, "buckets": { "train": { "clips": 272, "users": 14, "duration_ms": 4545504, "duration_hrs": 1.26 }, "dev": { "clips": 215, "users": 4, "duration_ms": 5478588, "duration_hrs": 1.52 }, "test": { "clips": 198, "users": 4, "duration_ms": 6717852, "duration_hrs": 1.86 } }, "demographics": { "age": { "": 0.0358, "teens": 0, "twenties": 0.1379, "thirties": 0.1967, "fourties": 0.152, "fifties": 0.3602, "sixties": 0.1175, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.0817, "male_masculine": 0.2503, "female_feminine": 0.6679, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "abzakh": 53, "bzhedug": 75, "hatikuay": 0, "shapsug": 206, "temirgoy": 80, "ubykh": 0, "user_defined": 10, "": 370 }, "variant": { "ady-Cyrl": 410, "ady-Cyrl-JOR": 0, "ady-Cyrl-SY": 0, "ady-Cyrl-TR": 264, "ady-Latn-TR-t-ady-cyrl": 0, "ady-RU": 7, "": 102 } }, "reported": { "clips": 3, "reasons": { "other": 2, "different_language": 1 } }, "size": 111656904, "checksum": "55761b24af39ae56ac2f99c24e9839240ffef3d0335589419c1595b94302dc74", "generated_at": "2026-03-10T09:38:16.916Z" }, "aln": { "locale": "aln", "clips": 1654, "users": 14, "questions": { "total": 144, "validated": 0, "invalidated": 0, "other": 144, "has_audio": 144, "avg_recordings_per_question": 11.49 }, "audios": { "total": 1654, "transcribed_validated": 1516, "transcribed_pending": 0, "not_transcribed": 138 }, "transcriptions": { "total": 1516, "validated": 1516, "not_yet_validated": 0, "edited": 103, "edited_pct": 6.79 }, "duration": { "total_ms": 39111120, "total_hrs": 10.86, "validated_ms": 37793736, "validated_hrs": 10.49, "avg_ms": 23646, "min_ms": 396, "max_ms": 73368, "avg_chars_per_sec": 6.9 }, "buckets": { "train": { "clips": 938, "users": 7, "duration_ms": 24081624, "duration_hrs": 6.68 }, "dev": { "clips": 297, "users": 3, 
"duration_ms": 6782436, "duration_hrs": 1.88 }, "test": { "clips": 281, "users": 3, "duration_ms": 6929676, "duration_hrs": 1.92 } }, "demographics": { "age": { "": 0.0707, "teens": 0.1608, "twenties": 0.0036, "thirties": 0.0871, "fourties": 0.4317, "fifties": 0.2461, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.4426, "male_masculine": 0.0738, "female_feminine": 0.4837, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 1654 }, "variant": { "": 1654 } }, "reported": { "clips": 95, "reasons": { "other": 95 } }, "size": 212131485, "checksum": "c67f2cb15357d90700553bd8e7b9c343c59e51aabed7a573e23a8dae29dc0b51", "generated_at": "2026-03-10T09:31:32.422Z" }, "an": { "locale": "an", "clips": 19, "users": 1, "questions": { "total": 211, "validated": 0, "invalidated": 0, "other": 211, "has_audio": 19, "avg_recordings_per_question": 1 }, "audios": { "total": 19, "transcribed_validated": 19, "transcribed_pending": 0, "not_transcribed": 0 }, "transcriptions": { "total": 19, "validated": 19, "not_yet_validated": 0, "edited": 7, "edited_pct": 36.84 }, "duration": { "total_ms": 395352, "total_hrs": 0.1, "validated_ms": 395352, "validated_hrs": 0.1, "avg_ms": 20808, "min_ms": 7740, "max_ms": 46368, "avg_chars_per_sec": 9.21 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 0, "fourties": 0, "fifties": 1, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0, "male_masculine": 0, "female_feminine": 1, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "user_defined": 19 }, "variant": { "": 19 } }, "reported": { "clips": 0, "reasons": {} }, "size": 2360487, "checksum": 
"42a74ff19cae153dd6b0ff7be39801b21e2c9f1da49f949b49e2b02af40c9e61", "generated_at": "2026-03-10T09:34:14.884Z" }, "ba": { "locale": "ba", "clips": 120, "users": 3, "questions": { "total": 167, "validated": 0, "invalidated": 0, "other": 167, "has_audio": 106, "avg_recordings_per_question": 1.13 }, "audios": { "total": 120, "transcribed_validated": 6, "transcribed_pending": 114, "not_transcribed": 0 }, "transcriptions": { "total": 120, "validated": 6, "not_yet_validated": 114, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 871308, "total_hrs": 0.24, "validated_ms": 31176, "validated_hrs": 0, "avg_ms": 7261, "min_ms": 1836, "max_ms": 31536, "avg_chars_per_sec": 7.35 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 0.0333, "fourties": 0.1417, "fifties": 0.825, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0, "male_masculine": 1, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 120 }, "variant": { "": 120 } }, "reported": { "clips": 0, "reasons": {} }, "size": 5169853, "checksum": "4c29dd518bb61e449cb32690e4cdbfffd8f9675aa3022fecc5df7361c227c909", "generated_at": "2026-03-10T10:02:53.631Z" }, "bas": { "locale": "bas", "clips": 773, "users": 11, "questions": { "total": 74, "validated": 0, "invalidated": 0, "other": 74, "has_audio": 74, "avg_recordings_per_question": 10.45 }, "audios": { "total": 773, "transcribed_validated": 734, "transcribed_pending": 38, "not_transcribed": 1 }, "transcriptions": { "total": 772, "validated": 734, "not_yet_validated": 38, "edited": 500, "edited_pct": 64.77 }, "duration": { "total_ms": 19354320, "total_hrs": 5.37, "validated_ms": 18232452, "validated_hrs": 5.06, "avg_ms": 
25038, "min_ms": 4680, "max_ms": 64116, "avg_chars_per_sec": 8.86 }, "buckets": { "train": { "clips": 256, "users": 4, "duration_ms": 6984828, "duration_hrs": 1.94 }, "dev": { "clips": 200, "users": 3, "duration_ms": 5688144, "duration_hrs": 1.58 }, "test": { "clips": 278, "users": 4, "duration_ms": 5559480, "duration_hrs": 1.54 } }, "demographics": { "age": { "": 0, "teens": 0.1889, "twenties": 0, "thirties": 0.1863, "fourties": 0.5304, "fifties": 0.0944, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.5705, "male_masculine": 0.4295, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 773 }, "variant": { "": 773 } }, "reported": { "clips": 0, "reasons": {} }, "size": 115061095, "checksum": "43722623752fe9b6e0409107309515dd26e655560afe900666d6876c07ce774a", "generated_at": "2026-03-10T10:02:36.086Z" }, "bew": { "locale": "bew", "clips": 1336, "users": 21, "questions": { "total": 200, "validated": 0, "invalidated": 0, "other": 200, "has_audio": 199, "avg_recordings_per_question": 6.71 }, "audios": { "total": 1336, "transcribed_validated": 1270, "transcribed_pending": 12, "not_transcribed": 54 }, "transcriptions": { "total": 1282, "validated": 1270, "not_yet_validated": 12, "edited": 1127, "edited_pct": 87.91 }, "duration": { "total_ms": 37777068, "total_hrs": 10.49, "validated_ms": 35235684, "validated_hrs": 9.78, "avg_ms": 28276, "min_ms": 3168, "max_ms": 156996, "avg_chars_per_sec": 9.27 }, "buckets": { "train": { "clips": 907, "users": 13, "duration_ms": 24028920, "duration_hrs": 6.67 }, "dev": { "clips": 150, "users": 3, "duration_ms": 5609268, "duration_hrs": 1.55 }, "test": { "clips": 213, "users": 5, "duration_ms": 5597496, "duration_hrs": 1.55 } }, "demographics": { "age": { "": 0.6557, "teens": 0, "twenties": 0.1198, "thirties": 0.0756, "fourties": 0.149, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.6557, 
"male_masculine": 0.3443, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 1336 }, "variant": { "": 1336 } }, "reported": { "clips": 0, "reasons": {} }, "size": 224868778, "checksum": "d7d9a4f36f8f1bc317bc5f743e79fa13036723874c272bb70a3a4dfdd35d9663", "generated_at": "2026-03-10T09:05:59.435Z" }, "br": { "locale": "br", "clips": 35, "users": 5, "questions": { "total": 61, "validated": 1, "invalidated": 0, "other": 60, "has_audio": 24, "avg_recordings_per_question": 1.46 }, "audios": { "total": 35, "transcribed_validated": 29, "transcribed_pending": 5, "not_transcribed": 1 }, "transcriptions": { "total": 34, "validated": 29, "not_yet_validated": 5, "edited": 3, "edited_pct": 8.82 }, "duration": { "total_ms": 443160, "total_hrs": 0.12, "validated_ms": 322452, "validated_hrs": 0.08, "avg_ms": 12662, "min_ms": 1980, "max_ms": 37692, "avg_chars_per_sec": 7.59 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 0.0857, "fourties": 0.1143, "fifties": 0.0857, "sixties": 0.2571, "seventies": 0.4571, "eighties": 0, "nineties": 0 }, "gender": { "": 0.8857, "male_masculine": 0, "female_feminine": 0.1143, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "gwenedeg": 4, "kerneveg": 6, "leoneg": 16, "tregerieg": 9 }, "variant": { "": 35 } }, "reported": { "clips": 0, "reasons": {} }, "size": 2513508, "checksum": "be3ae0fb766dc77f4bb997e3601c1dfd8e9ef9a055c74a0a8ff69ea23405df24", "generated_at": "2026-03-10T10:01:26.995Z" }, "brx": { "locale": "brx", "clips": 69, "users": 1, "questions": { "total": 69, "validated": 3, "invalidated": 0, "other": 66, "has_audio": 69, "avg_recordings_per_question": 1 }, "audios": { "total": 69, 
"transcribed_validated": 25, "transcribed_pending": 44, "not_transcribed": 0 }, "transcriptions": { "total": 69, "validated": 25, "not_yet_validated": 44, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 232344, "total_hrs": 0.06, "validated_ms": 84816, "validated_hrs": 0.02, "avg_ms": 3367, "min_ms": 2376, "max_ms": 7128, "avg_chars_per_sec": 9.17 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 0, "fourties": 1, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0, "male_masculine": 0, "female_feminine": 1, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 69 }, "variant": { "": 69 } }, "reported": { "clips": 0, "reasons": {} }, "size": 1365002, "checksum": "5d0ddfb79050f07ac3485cd3da8e8f928730208534b1124818df38b27cb881f5", "generated_at": "2026-03-10T10:03:02.846Z" }, "bsy": { "locale": "bsy", "clips": 1741, "users": 25, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 14.51 }, "audios": { "total": 1741, "transcribed_validated": 0, "transcribed_pending": 1741, "not_transcribed": 0 }, "transcriptions": { "total": 1741, "validated": 0, "not_yet_validated": 1741, "edited": 2, "edited_pct": 0.11 }, "duration": { "total_ms": 38892996, "total_hrs": 10.8, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 22339, "min_ms": 1764, "max_ms": 154764, "avg_chars_per_sec": 8.19 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.7777, 
"teens": 0, "twenties": 0.0638, "thirties": 0, "fourties": 0.0402, "fifties": 0.0632, "sixties": 0.0551, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.7777, "male_masculine": 0.104, "female_feminine": 0.1183, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 1741 }, "variant": { "": 1741 } }, "reported": { "clips": 26, "reasons": { "other": 21, "different_language": 4, "personally_identifiable_information": 1 } }, "size": 232052509, "checksum": "76e007955e912e4eb09df61e186394f062a7f9e289415b80daaf25d443592196", "generated_at": "2026-03-10T08:54:59.484Z" }, "bxk": { "locale": "bxk", "clips": 3214, "users": 27, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 26.78 }, "audios": { "total": 3214, "transcribed_validated": 2215, "transcribed_pending": 0, "not_transcribed": 999 }, "transcriptions": { "total": 2215, "validated": 2215, "not_yet_validated": 0, "edited": 398, "edited_pct": 17.97 }, "duration": { "total_ms": 57135060, "total_hrs": 15.87, "validated_ms": 39742308, "validated_hrs": 11.03, "avg_ms": 17777, "min_ms": 540, "max_ms": 72180, "avg_chars_per_sec": 8.12 }, "buckets": { "train": { "clips": 1397, "users": 14, "duration_ms": 27126180, "duration_hrs": 7.53 }, "dev": { "clips": 454, "users": 5, "duration_ms": 6595524, "duration_hrs": 1.83 }, "test": { "clips": 364, "users": 5, "duration_ms": 6020604, "duration_hrs": 1.67 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.0373, "thirties": 0.1733, "fourties": 0.257, "fifties": 0.3158, "sixties": 0.2166, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.1758, "male_masculine": 0.5756, "female_feminine": 0.2486, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 3214 }, "variant": { "": 3214 } }, "reported": { "clips": 40, "reasons": { "other": 36, "different_language": 2, 
"personally_identifiable_information": 2 } }, "size": 275066218, "checksum": "c93995b06df7f335c9b6b41f7f0b7b8331cd4df2e164cc72142b50d7fc1fb53b", "generated_at": "2026-03-10T09:42:51.545Z" }, "ca": { "locale": "ca", "clips": 128, "users": 11, "questions": { "total": 60, "validated": 0, "invalidated": 0, "other": 60, "has_audio": 57, "avg_recordings_per_question": 2.25 }, "audios": { "total": 128, "transcribed_validated": 115, "transcribed_pending": 12, "not_transcribed": 1 }, "transcriptions": { "total": 127, "validated": 115, "not_yet_validated": 12, "edited": 34, "edited_pct": 26.77 }, "duration": { "total_ms": 1990872, "total_hrs": 0.55, "validated_ms": 1696320, "validated_hrs": 0.47, "avg_ms": 15554, "min_ms": 2556, "max_ms": 72180, "avg_chars_per_sec": 7.55 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.1172, "teens": 0, "twenties": 0, "thirties": 0.125, "fourties": 0.4766, "fifties": 0.1172, "sixties": 0.1641, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.1172, "male_masculine": 0.5781, "female_feminine": 0.3047, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "balearic": 0, "central": 3, "learner_es": 0, "learner_other": 0, "northern": 0, "northwestern": 0, "valencian": 28, "user_defined": 32, "": 65 }, "variant": { "ca-algueres": 0, "ca-balear": 0, "ca-central": 84, "ca-northern": 0, "ca-nwestern": 0, "ca-valencia-alacant": 0, "ca-valencia-central": 1, "ca-valencia-northern": 0, "ca-valencia-southern": 0, "ca-valencia-tortosi": 0, "": 43 } }, "reported": { "clips": 2, "reasons": { "other": 2 } }, "size": 11793208, "checksum": "95990dd2fe559764897f5d10df3925e018c185106dd6f6e5799baf5c3559c1ea", "generated_at": "2026-03-10T09:58:05.245Z" }, "cdo": { "locale": "cdo", "clips": 1032, 
"users": 31, "questions": { "total": 119, "validated": 0, "invalidated": 0, "other": 119, "has_audio": 119, "avg_recordings_per_question": 8.67 }, "audios": { "total": 1032, "transcribed_validated": 107, "transcribed_pending": 925, "not_transcribed": 0 }, "transcriptions": { "total": 1032, "validated": 107, "not_yet_validated": 925, "edited": 73, "edited_pct": 7.07 }, "duration": { "total_ms": 35155944, "total_hrs": 9.76, "validated_ms": 3289824, "validated_hrs": 0.91, "avg_ms": 34066, "min_ms": 720, "max_ms": 476568, "avg_chars_per_sec": 2.67 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.8043, "teens": 0, "twenties": 0, "thirties": 0, "fourties": 0.1143, "fifties": 0.0814, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.8043, "male_masculine": 0.0814, "female_feminine": 0.1143, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 1032 }, "variant": { "": 1032 } }, "reported": { "clips": 4, "reasons": { "other": 4 } }, "size": 201037880, "checksum": "a37c144f3b53c0dd7d090eb4f0db61d5de228a86c2caf1152718d2a2832719c0", "generated_at": "2026-03-10T08:40:03.524Z" }, "cgg": { "locale": "cgg", "clips": 3085, "users": 27, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 25.71 }, "audios": { "total": 3085, "transcribed_validated": 2711, "transcribed_pending": 0, "not_transcribed": 374 }, "transcriptions": { "total": 2711, "validated": 2711, "not_yet_validated": 0, "edited": 516, "edited_pct": 19.03 }, "duration": { "total_ms": 42117228, "total_hrs": 11.69, "validated_ms": 36681696, "validated_hrs": 10.18, "avg_ms": 13652, "min_ms": 936, "max_ms": 86976, "avg_chars_per_sec": 8.61 }, "buckets": { "train": { 
"clips": 1885, "users": 19, "duration_ms": 24438780, "duration_hrs": 6.78 }, "dev": { "clips": 536, "users": 5, "duration_ms": 6629544, "duration_hrs": 1.84 }, "test": { "clips": 290, "users": 3, "duration_ms": 5613372, "duration_hrs": 1.55 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.5825, "thirties": 0.2716, "fourties": 0.1459, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.1164, "male_masculine": 0.2726, "female_feminine": 0.611, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 3085 }, "variant": { "": 3085 } }, "reported": { "clips": 0, "reasons": {} }, "size": 245560537, "checksum": "1d392574c4db95ad567b537722fc445e74dca603fd477a9a3c5a5fa5abf1f323", "generated_at": "2026-03-10T09:50:44.351Z" }, "cpx": { "locale": "cpx", "clips": 2007, "users": 29, "questions": { "total": 119, "validated": 0, "invalidated": 0, "other": 119, "has_audio": 119, "avg_recordings_per_question": 16.87 }, "audios": { "total": 2007, "transcribed_validated": 13, "transcribed_pending": 1991, "not_transcribed": 3 }, "transcriptions": { "total": 2004, "validated": 13, "not_yet_validated": 1991, "edited": 3, "edited_pct": 0.15 }, "duration": { "total_ms": 38396232, "total_hrs": 10.66, "validated_ms": 116244, "validated_hrs": 0.03, "avg_ms": 19131, "min_ms": 720, "max_ms": 83160, "avg_chars_per_sec": 3.23 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.7997, "teens": 0, "twenties": 0, "thirties": 0, "fourties": 0.0877, "fifties": 0.0593, "sixties": 0.0249, "seventies": 0.0284, "eighties": 0, "nineties": 0 }, "gender": { "": 0.7997, "male_masculine": 0.1221, "female_feminine": 0.0782, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, 
"accent": { "": 2007 }, "variant": { "": 2007 } }, "reported": { "clips": 110, "reasons": { "other": 45, "different_language": 65 } }, "size": 236701684, "checksum": "f718cba85b54913e82ab2e6cb01bdd4028ff250ffc895bcea1e9d10a9ca4f830", "generated_at": "2026-03-10T08:38:33.518Z" }, "cy": { "locale": "cy", "clips": 11, "users": 2, "questions": { "total": 60, "validated": 0, "invalidated": 0, "other": 60, "has_audio": 11, "avg_recordings_per_question": 1 }, "audios": { "total": 11, "transcribed_validated": 0, "transcribed_pending": 3, "not_transcribed": 8 }, "transcriptions": { "total": 3, "validated": 0, "not_yet_validated": 3, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 631980, "total_hrs": 0.17, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 57453, "min_ms": 2808, "max_ms": 162288, "avg_chars_per_sec": 6.68 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.7273, "thirties": 0, "fourties": 0, "fifties": 0.2727, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2727, "male_masculine": 0, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0.7273, "do_not_wish_to_say": 0 }, "accent": { "united_kingdom": 3, "user_defined": 8 }, "variant": { "cy-midwales": 0, "cy-northeas": 0, "cy-northwes": 8, "cy-southeas": 0, "cy-southwes": 0, "cy-wladfa": 0, "": 3 } }, "reported": { "clips": 0, "reasons": {} }, "size": 3758012, "checksum": "96d9cd94224d5f0c883563f1b99e93add44fafb1b211305dcc1211ae5a81314c", "generated_at": "2026-03-10T09:57:53.001Z" }, "de": { "locale": "de", "clips": 48, "users": 4, "questions": { "total": 60, "validated": 0, "invalidated": 0, "other": 60, "has_audio": 33, "avg_recordings_per_question": 1.45 }, "audios": { "total": 48, "transcribed_validated": 11, 
"transcribed_pending": 0, "not_transcribed": 37 }, "transcriptions": { "total": 11, "validated": 11, "not_yet_validated": 0, "edited": 2, "edited_pct": 18.18 }, "duration": { "total_ms": 1633248, "total_hrs": 0.45, "validated_ms": 158220, "validated_hrs": 0.04, "avg_ms": 34026, "min_ms": 3024, "max_ms": 160704, "avg_chars_per_sec": 10.64 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.2917, "thirties": 0, "fourties": 0.4792, "fifties": 0.2292, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0, "male_masculine": 0.7708, "female_feminine": 0.2292, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "austria": 0, "belgium": 0, "brazil": 0, "bulgaria": 0, "canada": 0, "czechia": 0, "denmark": 0, "finland": 0, "france": 0, "germany": 47, "greece": 0, "hungary": 0, "italy": 0, "kazakhstan": 0, "liechtenstein": 0, "lithuania": 0, "luxembourg": 0, "namibia": 0, "netherlands": 0, "paraguay": 0, "poland": 0, "romania": 0, "russia": 0, "slovakia": 0, "slovenia": 0, "switzerland": 1, "turkey": 0, "united_kingdom": 0, "united_states": 0 }, "variant": { "": 48 } }, "reported": { "clips": 1, "reasons": { "other": 1 } }, "size": 7767802, "checksum": "e6a3333f4e07c118f9f8dbf18b56b95bd9ef5a92e7910caf4072eab367a92bf0", "generated_at": "2026-03-10T10:03:07.880Z" }, "el-CY": { "locale": "el-CY", "clips": 1221, "users": 10, "questions": { "total": 146, "validated": 0, "invalidated": 0, "other": 146, "has_audio": 146, "avg_recordings_per_question": 8.36 }, "audios": { "total": 1221, "transcribed_validated": 1217, "transcribed_pending": 3, "not_transcribed": 1 }, "transcriptions": { "total": 1220, "validated": 1217, "not_yet_validated": 3, "edited": 1196, "edited_pct": 98.03 }, 
"duration": { "total_ms": 37345716, "total_hrs": 10.37, "validated_ms": 37167084, "validated_hrs": 10.32, "avg_ms": 30586, "min_ms": 3456, "max_ms": 271080, "avg_chars_per_sec": 9.52 }, "buckets": { "train": { "clips": 661, "users": 6, "duration_ms": 21519972, "duration_hrs": 5.97 }, "dev": { "clips": 145, "users": 1, "duration_ms": 7065432, "duration_hrs": 1.96 }, "test": { "clips": 411, "users": 3, "duration_ms": 8581680, "duration_hrs": 2.38 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.6585, "thirties": 0.1196, "fourties": 0.1753, "fifties": 0, "sixties": 0, "seventies": 0.0467, "eighties": 0, "nineties": 0 }, "gender": { "": 0, "male_masculine": 0.4144, "female_feminine": 0.5856, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 1221 }, "variant": { "": 1221 } }, "reported": { "clips": 0, "reasons": {} }, "size": 221752068, "checksum": "8f54f0a39046b0806de1cde01d721de0df1f0bf430597ee0ce240510e7cc08d2", "generated_at": "2026-03-10T09:35:41.303Z" }, "en": { "locale": "en", "clips": 1573, "users": 149, "questions": { "total": 333, "validated": 183, "invalidated": 9, "other": 141, "has_audio": 290, "avg_recordings_per_question": 5.42 }, "audios": { "total": 1573, "transcribed_validated": 1274, "transcribed_pending": 10, "not_transcribed": 289 }, "transcriptions": { "total": 1284, "validated": 1274, "not_yet_validated": 10, "edited": 183, "edited_pct": 14.25 }, "duration": { "total_ms": 25225920, "total_hrs": 7, "validated_ms": 19513224, "validated_hrs": 5.42, "avg_ms": 16037, "min_ms": 1404, "max_ms": 356796, "avg_chars_per_sec": 7.57 }, "buckets": { "train": { "clips": 496, "users": 36, "duration_ms": 6890652, "duration_hrs": 1.91 }, "dev": { "clips": 322, "users": 27, "duration_ms": 6861168, "duration_hrs": 1.9 }, "test": { "clips": 456, "users": 71, "duration_ms": 5761404, "duration_hrs": 1.6 } }, "demographics": { "age": { "": 0.0687, "teens": 0.0381, "twenties": 0.3586, "thirties": 0.1869, 
"fourties": 0.0509, "fifties": 0.0483, "sixties": 0.0896, "seventies": 0.0706, "eighties": 0.0884, "nineties": 0 }, "gender": { "": 0.1666, "male_masculine": 0.3783, "female_feminine": 0.3465, "intersex": 0, "transgender": 0.0108, "non-binary": 0.0127, "do_not_wish_to_say": 0.0852 }, "accent": { "african": 8, "australia": 24, "bermuda": 0, "canada": 4, "england": 116, "hongkong": 0, "indian": 61, "ireland": 3, "malaysia": 4, "newzealand": 0, "philippines": 0, "scotland": 1, "singapore": 8, "southatlandtic": 0, "us": 334, "wales": 0, "user_defined": 103, "": 980 }, "variant": { "": 1573 } }, "reported": { "clips": 342, "reasons": { "other": 32, "different_language": 308, "personally_identifiable_information": 2 } }, "size": 182952527, "checksum": "7ae0442efd434f27a019311a59d80734bb75b5f218e8998a83bd3d87b8e2ba8c", "generated_at": "2026-03-10T08:35:59.409Z" }, "fr": { "locale": "fr", "clips": 133, "users": 14, "questions": { "total": 119, "validated": 19, "invalidated": 3, "other": 97, "has_audio": 81, "avg_recordings_per_question": 1.64 }, "audios": { "total": 133, "transcribed_validated": 104, "transcribed_pending": 0, "not_transcribed": 29 }, "transcriptions": { "total": 104, "validated": 104, "not_yet_validated": 0, "edited": 29, "edited_pct": 27.88 }, "duration": { "total_ms": 2142792, "total_hrs": 0.59, "validated_ms": 1637280, "validated_hrs": 0.45, "avg_ms": 16111, "min_ms": 1512, "max_ms": 78408, "avg_chars_per_sec": 9.08 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.0602, "teens": 0.0075, "twenties": 0.0827, "thirties": 0.3459, "fourties": 0.0075, "fifties": 0.0075, "sixties": 0.0075, "seventies": 0.4812, "eighties": 0, "nineties": 0 }, "gender": { "": 0.0677, "male_masculine": 0.5414, "female_feminine": 0.3759, "intersex": 0, 
"transgender": 0.015, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "algeria": 0, "belgium": 0, "benin": 0, "cameroon": 0, "canada": 0, "central_african_republic": 0, "comoros": 0, "congo_kinshasa": 0, "cote_d_ivoire": 0, "cyprus": 0, "djibouti": 0, "equatorial_guinea": 0, "fr-metro-east": 0, "fr-metro-north": 0, "fr-metro-south": 0, "fr-metro-west": 0, "french_guiana": 0, "french_polynesia": 0, "germany": 0, "guadeloupe": 0, "haiti": 0, "italy": 0, "luxembourg": 0, "madagascar": 0, "mali": 0, "martinique": 0, "mauritania": 0, "mauritius": 0, "mayotte": 0, "monaco": 0, "morocco": 0, "netherlands": 0, "new_caledonia": 0, "niger": 0, "reunion": 0, "romania": 0, "senegal": 0, "seychelles": 0, "st_barthelemy": 0, "st_martin": 0, "st_pierre_et_miquelon": 0, "switzerland": 0, "syria": 0, "togo": 0, "tunisia": 0, "united_kingdom": 0, "united_states": 0, "vanuatu": 0, "wallis_et_futuna": 0, "": 133 }, "variant": { "fr-asia": 0, "fr-droum": 0, "fr-europe": 0, "fr-metro": 117, "fr-nafrica": 0, "fr-namerica": 0, "fr-safrica": 0, "fr-samerica": 0, "": 16 } }, "reported": { "clips": 0, "reasons": {} }, "size": 12664315, "checksum": "fab84c1c72a8fc382d50244117d5c617cd4481197c8bd1cc458fcaa88d65ab18", "generated_at": "2026-03-10T09:01:06.118Z" }, "fy-NL": { "locale": "fy-NL", "clips": 3, "users": 1, "questions": { "total": 71, "validated": 0, "invalidated": 0, "other": 71, "has_audio": 3, "avg_recordings_per_question": 1 }, "audios": { "total": 3, "transcribed_validated": 0, "transcribed_pending": 0, "not_transcribed": 3 }, "transcriptions": { "total": 0, "validated": 0, "not_yet_validated": 0, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 56196, "total_hrs": 0.01, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 18732, "min_ms": 16956, "max_ms": 21312, "avg_chars_per_sec": 0 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, 
"users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 0, "fourties": 0, "fifties": 1, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0, "male_masculine": 0, "female_feminine": 1, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "user_defined": 3 }, "variant": { "": 3 } }, "reported": { "clips": 0, "reasons": {} }, "size": 330660, "checksum": "0416b05a2b22e42db01c714c8009aba7b33ce5c27ebf1485f58d61430550a8d7", "generated_at": "2026-03-10T10:03:24.430Z" }, "ga-IE": { "locale": "ga-IE", "clips": 17, "users": 1, "questions": { "total": 104, "validated": 0, "invalidated": 0, "other": 104, "has_audio": 17, "avg_recordings_per_question": 1 }, "audios": { "total": 17, "transcribed_validated": 3, "transcribed_pending": 4, "not_transcribed": 10 }, "transcriptions": { "total": 7, "validated": 3, "not_yet_validated": 4, "edited": 1, "edited_pct": 14.29 }, "duration": { "total_ms": 256716, "total_hrs": 0.07, "validated_ms": 51336, "validated_hrs": 0.01, "avg_ms": 15101, "min_ms": 6336, "max_ms": 28188, "avg_chars_per_sec": 10.22 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 0, "fourties": 1, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0, "male_masculine": 1, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "cainteoir_dúchais_gaeltacht": 0, "cainteoir_dúchais_lasmuigh_den_ghaeltacht": 0, "cainteoir_líofa_ní_ó_dhúchas": 0, "foghlaimeoir_fásta_ó_bhéarla": 0, "foghlaimeoir_fásta_ó_theanga_eile": 0, "foghlaimeoir_i_ngaelscoil": 0, "": 17 }, "variant": { "ga-IE-caighd": 0, 
"ga-IE-chonnact": 0, "ga-IE-mumhan": 0, "ga-IE-uladh": 0, "": 17 } }, "reported": { "clips": 0, "reasons": {} }, "size": 1507997, "checksum": "a61fad97542e085aafa5686e643e4bc2221941c6aa26167a9cf85d476fbde209", "generated_at": "2026-03-10T10:02:57.075Z" }, "gl": { "locale": "gl", "clips": 211, "users": 14, "questions": { "total": 286, "validated": 17, "invalidated": 0, "other": 269, "has_audio": 153, "avg_recordings_per_question": 1.38 }, "audios": { "total": 211, "transcribed_validated": 81, "transcribed_pending": 128, "not_transcribed": 2 }, "transcriptions": { "total": 209, "validated": 81, "not_yet_validated": 128, "edited": 46, "edited_pct": 22.01 }, "duration": { "total_ms": 3905676, "total_hrs": 1.08, "validated_ms": 1093860, "validated_hrs": 0.3, "avg_ms": 18510, "min_ms": 2376, "max_ms": 85536, "avg_chars_per_sec": 9.27 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.2512, "thirties": 0.0284, "fourties": 0.673, "fifties": 0.0427, "sixties": 0.0047, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.0047, "male_masculine": 0.3555, "female_feminine": 0.5071, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.1327 }, "accent": { "atlantico": 42, "central": 6, "neofalante": 30, "normativo": 82, "oriental": 66, "user_defined": 1 }, "variant": { "": 211 } }, "reported": { "clips": 0, "reasons": {} }, "size": 22899970, "checksum": "4c227fcf1ce26325598edcd716395cc0483dae1a93270b7d523b7e3ece4eaaa4", "generated_at": "2026-03-10T10:01:42.499Z" }, "gsw": { "locale": "gsw", "clips": 3, "users": 1, "questions": { "total": 61, "validated": 0, "invalidated": 0, "other": 61, "has_audio": 3, "avg_recordings_per_question": 1 }, "audios": { "total": 3, "transcribed_validated": 3, 
"transcribed_pending": 0, "not_transcribed": 0 }, "transcriptions": { "total": 3, "validated": 3, "not_yet_validated": 0, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 31644, "total_hrs": 0, "validated_ms": 31644, "validated_hrs": 0, "avg_ms": 10548, "min_ms": 5328, "max_ms": 19260, "avg_chars_per_sec": 7.82 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 0, "fourties": 1, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0, "male_masculine": 0, "female_feminine": 1, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 3 }, "variant": { "gsw-FR-hochalem": 0, "gsw-FR-nordalem": 3, "gsw-FR-rhinfran": 0, "gsw-FR-sudnalem": 0, "gsw-FR-sudrfran": 0 } }, "reported": { "clips": 0, "reasons": {} }, "size": 191181, "checksum": "5aaac40ea0ccef68ab7788150589518f36ba998d03efc9bdfa9c480b42411ff5", "generated_at": "2026-03-10T09:57:54.995Z" }, "gv": { "locale": "gv", "clips": 130, "users": 1, "questions": { "total": 250, "validated": 0, "invalidated": 0, "other": 250, "has_audio": 130, "avg_recordings_per_question": 1 }, "audios": { "total": 130, "transcribed_validated": 0, "transcribed_pending": 0, "not_transcribed": 130 }, "transcriptions": { "total": 0, "validated": 0, "not_yet_validated": 0, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 2770956, "total_hrs": 0.76, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 21315, "min_ms": 3780, "max_ms": 50220, "avg_chars_per_sec": 0 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, 
"demographics": { "age": { "": 0, "teens": 0, "twenties": 1, "thirties": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0, "male_masculine": 1, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 130 }, "variant": { "": 130 } }, "reported": { "clips": 0, "reasons": {} }, "size": 16142644, "checksum": "a03f6e1e20580f5672cd64d56ca2f236d97f4f9bdd7b97f41a6592773f0c8523", "generated_at": "2026-03-10T09:37:22.164Z" }, "hac": { "locale": "hac", "clips": 421, "users": 19, "questions": { "total": 39, "validated": 0, "invalidated": 0, "other": 39, "has_audio": 39, "avg_recordings_per_question": 10.79 }, "audios": { "total": 421, "transcribed_validated": 14, "transcribed_pending": 13, "not_transcribed": 394 }, "transcriptions": { "total": 27, "validated": 14, "not_yet_validated": 13, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 40291704, "total_hrs": 11.19, "validated_ms": 1334844, "validated_hrs": 0.37, "avg_ms": 95705, "min_ms": 1296, "max_ms": 680256, "avg_chars_per_sec": 6.29 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.2138, "teens": 0, "twenties": 0, "thirties": 0.0926, "fourties": 0.4584, "fifties": 0.1425, "sixties": 0, "seventies": 0.0926, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2922, "male_masculine": 0.1568, "female_feminine": 0.5511, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 421 }, "variant": { "": 421 } }, "reported": { "clips": 0, "reasons": {} }, "size": 236285452, "checksum": "4768ec5a6dd98c045e801bd0decf9fdf753a95ab43213d2f62721c6e3f5dbc66", "generated_at": "2026-03-10T09:36:29.327Z" }, "hch": { "locale": "hch", "clips": 1553, "users": 10, 
"questions": { "total": 236, "validated": 0, "invalidated": 0, "other": 236, "has_audio": 236, "avg_recordings_per_question": 6.58 }, "audios": { "total": 1553, "transcribed_validated": 1549, "transcribed_pending": 0, "not_transcribed": 4 }, "transcriptions": { "total": 1549, "validated": 1549, "not_yet_validated": 0, "edited": 1524, "edited_pct": 98.39 }, "duration": { "total_ms": 36508932, "total_hrs": 10.14, "validated_ms": 36443448, "validated_hrs": 10.12, "avg_ms": 23509, "min_ms": 72, "max_ms": 114300, "avg_chars_per_sec": 7.68 }, "buckets": { "train": { "clips": 585, "users": 4, "duration_ms": 20945268, "duration_hrs": 5.81 }, "dev": { "clips": 472, "users": 3, "duration_ms": 6214176, "duration_hrs": 1.72 }, "test": { "clips": 492, "users": 3, "duration_ms": 9284004, "duration_hrs": 2.57 } }, "demographics": { "age": { "": 0.0064, "teens": 0, "twenties": 0.1442, "thirties": 0.3039, "fourties": 0, "fifties": 0.2415, "sixties": 0.3039, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3999, "male_masculine": 0, "female_feminine": 0.6001, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 1553 }, "variant": { "": 1553 } }, "reported": { "clips": 0, "reasons": {} }, "size": 209204237, "checksum": "57a6fda2639f0799fd6bfe326e833aa1fb8b66aa9ddc00f1886ef59561a24f46", "generated_at": "2026-03-10T09:04:17.025Z" }, "ka": { "locale": "ka", "clips": 118, "users": 7, "questions": { "total": 79, "validated": 0, "invalidated": 0, "other": 79, "has_audio": 58, "avg_recordings_per_question": 2.03 }, "audios": { "total": 118, "transcribed_validated": 60, "transcribed_pending": 5, "not_transcribed": 53 }, "transcriptions": { "total": 65, "validated": 60, "not_yet_validated": 5, "edited": 10, "edited_pct": 15.38 }, "duration": { "total_ms": 2118204, "total_hrs": 0.58, "validated_ms": 512712, "validated_hrs": 0.14, "avg_ms": 17951, "min_ms": 3060, "max_ms": 107748, "avg_chars_per_sec": 9.49 }, "buckets": { "train": { 
"clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.1356, "thirties": 0.339, "fourties": 0.4492, "fifties": 0, "sixties": 0.0763, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0, "male_masculine": 0.0508, "female_feminine": 0.9492, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 118 }, "variant": { "": 118 } }, "reported": { "clips": 0, "reasons": {} }, "size": 12173523, "checksum": "eafa229be3fc1c36ba9d733af4a2094f995d783c3098d59593cfa40ea452613a", "generated_at": "2026-03-10T09:57:49.647Z" }, "kbd": { "locale": "kbd", "clips": 1061, "users": 25, "questions": { "total": 155, "validated": 0, "invalidated": 0, "other": 155, "has_audio": 154, "avg_recordings_per_question": 6.89 }, "audios": { "total": 1061, "transcribed_validated": 790, "transcribed_pending": 0, "not_transcribed": 271 }, "transcriptions": { "total": 790, "validated": 790, "not_yet_validated": 0, "edited": 243, "edited_pct": 30.76 }, "duration": { "total_ms": 39836592, "total_hrs": 11.06, "validated_ms": 29519928, "validated_hrs": 8.19, "avg_ms": 37546, "min_ms": 1908, "max_ms": 307440, "avg_chars_per_sec": 8.86 }, "buckets": { "train": { "clips": 453, "users": 4, "duration_ms": 16756164, "duration_hrs": 4.65 }, "dev": { "clips": 185, "users": 10, "duration_ms": 6281424, "duration_hrs": 1.74 }, "test": { "clips": 152, "users": 9, "duration_ms": 6482340, "duration_hrs": 1.8 } }, "demographics": { "age": { "": 0.033, "teens": 0.0047, "twenties": 0.114, "thirties": 0.2149, "fourties": 0.2912, "fifties": 0.3176, "sixties": 0.0245, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.0763, "male_masculine": 0.6315, "female_feminine": 0.2922, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, 
"accent": { "bakhsan": 29, "besleney": 0, "kuban": 0, "kuban_zelenchuk": 20, "malka": 1, "mozdok": 0, "terek": 2, "user_defined": 26, "": 984 }, "variant": { "kbd-Cyrl": 760, "kbd-Cyrl-JOR": 0, "kbd-Cyrl-SY": 0, "kbd-Cyrl-TR": 26, "kbd-Latn-TR-t-kbd-cyrl": 0, "kbd-RU": 119, "": 156 } }, "reported": { "clips": 35, "reasons": { "different_language": 1, "other": 34 } }, "size": 149649028, "checksum": "fa05e3a8914335681312ca1a060c7a097d8a91d062cfb1dae99c239485a1c855", "generated_at": "2026-03-10T09:39:25.321Z" }, "kcn": { "locale": "kcn", "clips": 2715, "users": 26, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 22.63 }, "audios": { "total": 2715, "transcribed_validated": 1894, "transcribed_pending": 8, "not_transcribed": 813 }, "transcriptions": { "total": 1902, "validated": 1894, "not_yet_validated": 8, "edited": 355, "edited_pct": 18.66 }, "duration": { "total_ms": 50285700, "total_hrs": 13.96, "validated_ms": 34896636, "validated_hrs": 9.69, "avg_ms": 18521, "min_ms": 108, "max_ms": 104436, "avg_chars_per_sec": 6.73 }, "buckets": { "train": { "clips": 1277, "users": 18, "duration_ms": 22826664, "duration_hrs": 6.34 }, "dev": { "clips": 323, "users": 4, "duration_ms": 5514696, "duration_hrs": 1.53 }, "test": { "clips": 294, "users": 4, "duration_ms": 6555276, "duration_hrs": 1.82 } }, "demographics": { "age": { "": 0.1738, "teens": 0, "twenties": 0.0431, "thirties": 0.2431, "fourties": 0.1271, "fifties": 0.2936, "sixties": 0.0773, "seventies": 0, "eighties": 0.042, "nineties": 0 }, "gender": { "": 0.1738, "male_masculine": 0.54, "female_feminine": 0.2862, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 2715 }, "variant": { "": 2715 } }, "reported": { "clips": 284, "reasons": { "other": 261, "personally_identifiable_information": 6, "offensive_speech": 5, "different_language": 12 } }, "size": 316922962, "checksum": 
"a218b84fed5b17d16357079a2d948e7b0feee335dbf37e26b3e988ce891be58c", "generated_at": "2026-03-10T09:17:40.158Z" }, "koo": { "locale": "koo", "clips": 3233, "users": 28, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 26.94 }, "audios": { "total": 3233, "transcribed_validated": 2211, "transcribed_pending": 10, "not_transcribed": 1012 }, "transcriptions": { "total": 2221, "validated": 2211, "not_yet_validated": 10, "edited": 172, "edited_pct": 7.74 }, "duration": { "total_ms": 53618940, "total_hrs": 14.89, "validated_ms": 36472068, "validated_hrs": 10.13, "avg_ms": 16585, "min_ms": 288, "max_ms": 112968, "avg_chars_per_sec": 7.42 }, "buckets": { "train": { "clips": 1416, "users": 17, "duration_ms": 23430456, "duration_hrs": 6.5 }, "dev": { "clips": 407, "users": 5, "duration_ms": 6664176, "duration_hrs": 1.85 }, "test": { "clips": 388, "users": 5, "duration_ms": 6377436, "duration_hrs": 1.77 } }, "demographics": { "age": { "": 0.0371, "teens": 0, "twenties": 0.2221, "thirties": 0.5193, "fourties": 0.1847, "fifties": 0.0368, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3699, "male_masculine": 0.2595, "female_feminine": 0.3706, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 3233 }, "variant": { "": 3233 } }, "reported": { "clips": 1, "reasons": { "other": 1 } }, "size": 294385119, "checksum": "37e125b331a99f2ac29beb2ef311c767db18efe915f0a5f913d90081a6645ed6", "generated_at": "2026-03-10T09:46:39.681Z" }, "kzi": { "locale": "kzi", "clips": 1717, "users": 21, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 14.31 }, "audios": { "total": 1717, "transcribed_validated": 59, "transcribed_pending": 1658, "not_transcribed": 0 }, "transcriptions": { "total": 1717, "validated": 59, "not_yet_validated": 1658, "edited": 126, "edited_pct": 7.34 }, 
"duration": { "total_ms": 34264404, "total_hrs": 9.51, "validated_ms": 812340, "validated_hrs": 0.22, "avg_ms": 19956, "min_ms": 720, "max_ms": 69876, "avg_chars_per_sec": 7.62 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.2656, "teens": 0, "twenties": 0, "thirties": 0, "fourties": 0, "fifties": 0.2097, "sixties": 0.3069, "seventies": 0.2178, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2656, "male_masculine": 0.3722, "female_feminine": 0.3623, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 1717 }, "variant": { "": 1717 } }, "reported": { "clips": 0, "reasons": {} }, "size": 203763706, "checksum": "cc50a63eefb848789165e29fd3bc3d250da9b9c0c692347a715cb0fc915556c8", "generated_at": "2026-03-10T08:49:52.425Z" }, "led": { "locale": "led", "clips": 2851, "users": 26, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 23.76 }, "audios": { "total": 2851, "transcribed_validated": 1897, "transcribed_pending": 30, "not_transcribed": 924 }, "transcriptions": { "total": 1927, "validated": 1897, "not_yet_validated": 30, "edited": 425, "edited_pct": 22.06 }, "duration": { "total_ms": 54529056, "total_hrs": 15.14, "validated_ms": 36313488, "validated_hrs": 10.08, "avg_ms": 19126, "min_ms": 720, "max_ms": 54036, "avg_chars_per_sec": 6.21 }, "buckets": { "train": { "clips": 1213, "users": 14, "duration_ms": 22815108, "duration_hrs": 6.33 }, "dev": { "clips": 302, "users": 4, "duration_ms": 6386436, "duration_hrs": 1.77 }, "test": { "clips": 382, "users": 5, "duration_ms": 7111944, "duration_hrs": 1.97 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.0972, "thirties": 0.483, "fourties": 0.2525, "fifties": 0.0421, "sixties": 
0.1252, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3076, "male_masculine": 0.524, "female_feminine": 0.1684, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 2851 }, "variant": { "": 2851 } }, "reported": { "clips": 0, "reasons": {} }, "size": 283174342, "checksum": "8f1ae2cadf04f680d37a35278a02b1db1777bb93ebaa9ef0c0d4fde9a3157afa", "generated_at": "2026-03-10T10:01:03.660Z" }, "lij": { "locale": "lij", "clips": 229, "users": 4, "questions": { "total": 169, "validated": 2, "invalidated": 0, "other": 167, "has_audio": 162, "avg_recordings_per_question": 1.41 }, "audios": { "total": 229, "transcribed_validated": 210, "transcribed_pending": 12, "not_transcribed": 7 }, "transcriptions": { "total": 222, "validated": 210, "not_yet_validated": 12, "edited": 51, "edited_pct": 22.97 }, "duration": { "total_ms": 6478236, "total_hrs": 1.79, "validated_ms": 5513724, "validated_hrs": 1.53, "avg_ms": 28289, "min_ms": 6408, "max_ms": 124920, "avg_chars_per_sec": 6.22 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 0.0437, "fourties": 0, "fifties": 0, "sixties": 0.2751, "seventies": 0.6812, "eighties": 0, "nineties": 0 }, "gender": { "": 0.6812, "male_masculine": 0.0437, "female_feminine": 0.2751, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "user_defined": 2, "": 227 }, "variant": { "": 229 } }, "reported": { "clips": 0, "reasons": {} }, "size": 38537874, "checksum": "b74bb6f75cfbbdf87b1502ef102085dd181d8d4e81622774fba3d47a2ffd3886", "generated_at": "2026-03-10T09:47:17.230Z" }, "lke": { "locale": "lke", "clips": 2743, "users": 26, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, 
"has_audio": 120, "avg_recordings_per_question": 22.86 }, "audios": { "total": 2743, "transcribed_validated": 2269, "transcribed_pending": 11, "not_transcribed": 463 }, "transcriptions": { "total": 2280, "validated": 2269, "not_yet_validated": 11, "edited": 879, "edited_pct": 38.55 }, "duration": { "total_ms": 44597196, "total_hrs": 12.38, "validated_ms": 35831160, "validated_hrs": 9.95, "avg_ms": 16259, "min_ms": 180, "max_ms": 47736, "avg_chars_per_sec": 7.55 }, "buckets": { "train": { "clips": 1309, "users": 16, "duration_ms": 22933656, "duration_hrs": 6.37 }, "dev": { "clips": 485, "users": 5, "duration_ms": 6178860, "duration_hrs": 1.71 }, "test": { "clips": 475, "users": 5, "duration_ms": 6718644, "duration_hrs": 1.86 } }, "demographics": { "age": { "": 0.4342, "teens": 0.031, "twenties": 0.0824, "thirties": 0.214, "fourties": 0.1659, "fifties": 0.0725, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.4342, "male_masculine": 0.4834, "female_feminine": 0.0824, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 2743 }, "variant": { "": 2743 } }, "reported": { "clips": 32, "reasons": { "different_language": 18, "other": 13, "personally_identifiable_information": 1 } }, "size": 266911582, "checksum": "00b7d3282dab81ab459eb9039b3d0b1c38eff0a39d18ca60a48516e6e05d9c63", "generated_at": "2026-03-10T09:14:11.894Z" }, "lth": { "locale": "lth", "clips": 3126, "users": 29, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 26.05 }, "audios": { "total": 3126, "transcribed_validated": 1809, "transcribed_pending": 2, "not_transcribed": 1315 }, "transcriptions": { "total": 1811, "validated": 1809, "not_yet_validated": 2, "edited": 855, "edited_pct": 47.21 }, "duration": { "total_ms": 55331244, "total_hrs": 15.36, "validated_ms": 32886324, "validated_hrs": 9.13, "avg_ms": 17700, "min_ms": 1620, "max_ms": 1324080, 
"avg_chars_per_sec": 7.35 }, "buckets": { "train": { "clips": 1131, "users": 20, "duration_ms": 20921364, "duration_hrs": 5.81 }, "dev": { "clips": 335, "users": 4, "duration_ms": 5920884, "duration_hrs": 1.64 }, "test": { "clips": 343, "users": 5, "duration_ms": 6044076, "duration_hrs": 1.67 } }, "demographics": { "age": { "": 0.247, "teens": 0, "twenties": 0, "thirties": 0.1942, "fourties": 0.1897, "fifties": 0.1891, "sixties": 0.1801, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3986, "male_masculine": 0.23, "female_feminine": 0.3714, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 3126 }, "variant": { "": 3126 } }, "reported": { "clips": 1170, "reasons": { "other": 510, "different_language": 516, "personally_identifiable_information": 142, "offensive_speech": 2 } }, "size": 334148009, "checksum": "2f1ecced59991f4aadd60919c89c97f8e91afe67a2594518f81972cfd1000cf8", "generated_at": "2026-03-10T09:26:06.544Z" }, "lv": { "locale": "lv", "clips": 18, "users": 4, "questions": { "total": 101, "validated": 0, "invalidated": 0, "other": 101, "has_audio": 18, "avg_recordings_per_question": 1 }, "audios": { "total": 18, "transcribed_validated": 4, "transcribed_pending": 1, "not_transcribed": 13 }, "transcriptions": { "total": 5, "validated": 4, "not_yet_validated": 1, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 720612, "total_hrs": 0.2, "validated_ms": 127548, "validated_hrs": 0.03, "avg_ms": 40034, "min_ms": 6516, "max_ms": 82116, "avg_chars_per_sec": 8.85 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 0.8333, "fourties": 0.1111, "fifties": 0.0556, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0, 
"male_masculine": 0.1667, "female_feminine": 0.8333, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "user_defined": 2, "": 16 }, "variant": { "": 18 } }, "reported": { "clips": 0, "reasons": {} }, "size": 4290939, "checksum": "9d142c6aabb6e65d1ca9d25026a196230328171500b6153520c0ed8358bdc30b", "generated_at": "2026-03-10T09:51:04.355Z" }, "meh": { "locale": "meh", "clips": 1057, "users": 16, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 8.81 }, "audios": { "total": 1057, "transcribed_validated": 1048, "transcribed_pending": 0, "not_transcribed": 9 }, "transcriptions": { "total": 1048, "validated": 1048, "not_yet_validated": 0, "edited": 328, "edited_pct": 31.3 }, "duration": { "total_ms": 36567144, "total_hrs": 10.15, "validated_ms": 36197496, "validated_hrs": 10.05, "avg_ms": 34595, "min_ms": 3924, "max_ms": 234864, "avg_chars_per_sec": 6.89 }, "buckets": { "train": { "clips": 678, "users": 9, "duration_ms": 24660936, "duration_hrs": 6.85 }, "dev": { "clips": 165, "users": 3, "duration_ms": 5510736, "duration_hrs": 1.53 }, "test": { "clips": 205, "users": 4, "duration_ms": 6025824, "duration_hrs": 1.67 } }, "demographics": { "age": { "": 0.1987, "teens": 0, "twenties": 0.1627, "thirties": 0.1079, "fourties": 0.1135, "fifties": 0, "sixties": 0.3264, "seventies": 0.0908, "eighties": 0, "nineties": 0 }, "gender": { "": 0.193, "male_masculine": 0.0057, "female_feminine": 0.8013, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 1057 }, "variant": { "": 1057 } }, "reported": { "clips": 1, "reasons": { "other": 1 } }, "size": 212173060, "checksum": "0cc25f08a07693dc606c24863a6b1390ae9510772631f01e2a327c8f4955f1da", "generated_at": "2026-03-10T09:02:20.483Z" }, "mel": { "locale": "mel", "clips": 1624, "users": 35, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, 
"avg_recordings_per_question": 13.53 }, "audios": { "total": 1624, "transcribed_validated": 28, "transcribed_pending": 1554, "not_transcribed": 42 }, "transcriptions": { "total": 1582, "validated": 28, "not_yet_validated": 1554, "edited": 13, "edited_pct": 0.82 }, "duration": { "total_ms": 37180800, "total_hrs": 10.32, "validated_ms": 562824, "validated_hrs": 0.15, "avg_ms": 22895, "min_ms": 756, "max_ms": 93348, "avg_chars_per_sec": 7.52 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.9803, "teens": 0, "twenties": 0, "thirties": 0, "fourties": 0.0197, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.9803, "male_masculine": 0, "female_feminine": 0.0197, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 1624 }, "variant": { "": 1624 } }, "reported": { "clips": 0, "reasons": {} }, "size": 219256654, "checksum": "313aa8e48583828751d6314e229c41cde6467bfcd427156237e1fd5225f4f333", "generated_at": "2026-03-10T08:45:24.311Z" }, "mmc": { "locale": "mmc", "clips": 871, "users": 12, "questions": { "total": 195, "validated": 0, "invalidated": 0, "other": 195, "has_audio": 195, "avg_recordings_per_question": 4.47 }, "audios": { "total": 871, "transcribed_validated": 871, "transcribed_pending": 0, "not_transcribed": 0 }, "transcriptions": { "total": 871, "validated": 871, "not_yet_validated": 0, "edited": 305, "edited_pct": 35.02 }, "duration": { "total_ms": 40443984, "total_hrs": 11.23, "validated_ms": 40443984, "validated_hrs": 11.23, "avg_ms": 46434, "min_ms": 3348, "max_ms": 332820, "avg_chars_per_sec": 5.27 }, "buckets": { "train": { "clips": 604, "users": 5, "duration_ms": 29286504, "duration_hrs": 8.13 }, "dev": { "clips": 90, "users": 4, "duration_ms": 5635080, 
"duration_hrs": 1.56 }, "test": { "clips": 177, "users": 3, "duration_ms": 5522400, "duration_hrs": 1.53 } }, "demographics": { "age": { "": 0.07, "teens": 0, "twenties": 0.5029, "thirties": 0, "fourties": 0, "fifties": 0.2227, "sixties": 0.2044, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3226, "male_masculine": 0, "female_feminine": 0.6774, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 871 }, "variant": { "": 871 } }, "reported": { "clips": 0, "reasons": {} }, "size": 237398246, "checksum": "b4566530def143385e309279e8ee04bb7597b160f9ff76a3e1938752e01e0e0d", "generated_at": "2026-03-10T09:09:14.880Z" }, "ms-MY": { "locale": "ms-MY", "clips": 1144, "users": 22, "questions": { "total": 119, "validated": 0, "invalidated": 0, "other": 119, "has_audio": 119, "avg_recordings_per_question": 9.61 }, "audios": { "total": 1144, "transcribed_validated": 1054, "transcribed_pending": 80, "not_transcribed": 10 }, "transcriptions": { "total": 1134, "validated": 1054, "not_yet_validated": 80, "edited": 94, "edited_pct": 8.29 }, "duration": { "total_ms": 22271760, "total_hrs": 6.18, "validated_ms": 20302020, "validated_hrs": 5.63, "avg_ms": 19468, "min_ms": 3096, "max_ms": 208476, "avg_chars_per_sec": 8.59 }, "buckets": { "train": { "clips": 429, "users": 8, "duration_ms": 7527600, "duration_hrs": 2.09 }, "dev": { "clips": 387, "users": 7, "duration_ms": 6780132, "duration_hrs": 1.88 }, "test": { "clips": 238, "users": 5, "duration_ms": 5994288, "duration_hrs": 1.66 } }, "demographics": { "age": { "": 0.3558, "teens": 0, "twenties": 0.1136, "thirties": 0.2649, "fourties": 0.1399, "fifties": 0, "sixties": 0.0664, "seventies": 0.0594, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3558, "male_masculine": 0.2045, "female_feminine": 0.4318, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.0079 }, "accent": { "": 1144 }, "variant": { "": 1144 } }, "reported": { "clips": 1, "reasons": { 
"offensive_speech": 1 } }, "size": 132515505, "checksum": "36cd8fa26e32968140cf75e4b1ba1a2fd89092c54eb17c4febc7f8e0b708fa84", "generated_at": "2026-03-10T08:41:33.989Z" }, "msi": { "locale": "msi", "clips": 2277, "users": 33, "questions": { "total": 119, "validated": 0, "invalidated": 0, "other": 119, "has_audio": 119, "avg_recordings_per_question": 19.13 }, "audios": { "total": 2277, "transcribed_validated": 4, "transcribed_pending": 2273, "not_transcribed": 0 }, "transcriptions": { "total": 2277, "validated": 4, "not_yet_validated": 2273, "edited": 3, "edited_pct": 0.13 }, "duration": { "total_ms": 48764700, "total_hrs": 13.54, "validated_ms": 70020, "validated_hrs": 0.01, "avg_ms": 21416, "min_ms": 864, "max_ms": 89316, "avg_chars_per_sec": 8.5 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.7686, "teens": 0, "twenties": 0.0654, "thirties": 0, "fourties": 0.0733, "fifties": 0.0487, "sixties": 0.0439, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.7686, "male_masculine": 0.061, "female_feminine": 0.1704, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 2277 }, "variant": { "": 2277 } }, "reported": { "clips": 45, "reasons": { "other": 39, "different_language": 6 } }, "size": 291427160, "checksum": "c1c030119787578d841723fe908f014daa74bcb7633eea6a7cf9dca1554fc068", "generated_at": "2026-03-10T08:52:41.659Z" }, "pne": { "locale": "pne", "clips": 2595, "users": 24, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 21.63 }, "audios": { "total": 2595, "transcribed_validated": 2594, "transcribed_pending": 0, "not_transcribed": 1 }, "transcriptions": { "total": 2594, "validated": 2594, "not_yet_validated": 0, 
"edited": 424, "edited_pct": 16.35 }, "duration": { "total_ms": 43702704, "total_hrs": 12.13, "validated_ms": 43675992, "validated_hrs": 12.13, "avg_ms": 16841, "min_ms": 1800, "max_ms": 36324, "avg_chars_per_sec": 8.44 }, "buckets": { "train": { "clips": 1782, "users": 17, "duration_ms": 30897684, "duration_hrs": 8.58 }, "dev": { "clips": 452, "users": 4, "duration_ms": 7123068, "duration_hrs": 1.97 }, "test": { "clips": 360, "users": 3, "duration_ms": 5655240, "duration_hrs": 1.57 } }, "demographics": { "age": { "": 0.3083, "teens": 0.185, "twenties": 0.2593, "thirties": 0.1272, "fourties": 0.074, "fifties": 0.0462, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3083, "male_masculine": 0.3048, "female_feminine": 0.3869, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 2595 }, "variant": { "": 2595 } }, "reported": { "clips": 1, "reasons": { "other": 1 } }, "size": 260101926, "checksum": "1355a7e8ee034120b3e4b7b94a069eb0187bf9d80c609a03b93080f5603ec959", "generated_at": "2026-03-10T09:00:37.296Z" }, "qxp": { "locale": "qxp", "clips": 1065, "users": 11, "questions": { "total": 152, "validated": 109, "invalidated": 1, "other": 42, "has_audio": 150, "avg_recordings_per_question": 7.1 }, "audios": { "total": 1065, "transcribed_validated": 1065, "transcribed_pending": 0, "not_transcribed": 0 }, "transcriptions": { "total": 1065, "validated": 1065, "not_yet_validated": 0, "edited": 19, "edited_pct": 1.78 }, "duration": { "total_ms": 18584100, "total_hrs": 5.16, "validated_ms": 18584100, "validated_hrs": 5.16, "avg_ms": 17450, "min_ms": 2268, "max_ms": 74160, "avg_chars_per_sec": 8.19 }, "buckets": { "train": { "clips": 336, "users": 3, "duration_ms": 5307696, "duration_hrs": 1.47 }, "dev": { "clips": 426, "users": 3, "duration_ms": 7163064, "duration_hrs": 1.98 }, "test": { "clips": 303, "users": 5, "duration_ms": 6113340, "duration_hrs": 1.69 } }, "demographics": { "age": { "": 0, "teens": 0, 
"twenties": 0.8404, "thirties": 0.1596, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0, "male_masculine": 0.4, "female_feminine": 0.6, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "user_defined": 809, "": 256 }, "variant": { "": 1065 } }, "reported": { "clips": 3, "reasons": { "other": 2, "different_language": 1 } }, "size": 103003265, "checksum": "c64c6b83541198f9c1ab96a8efa370f24f81165efe20579d01c343b1ad015972", "generated_at": "2026-03-10T09:34:04.900Z" }, "ru": { "locale": "ru", "clips": 103, "users": 4, "questions": { "total": 83, "validated": 23, "invalidated": 0, "other": 60, "has_audio": 52, "avg_recordings_per_question": 1.98 }, "audios": { "total": 103, "transcribed_validated": 102, "transcribed_pending": 1, "not_transcribed": 0 }, "transcriptions": { "total": 103, "validated": 102, "not_yet_validated": 1, "edited": 12, "edited_pct": 11.65 }, "duration": { "total_ms": 1838268, "total_hrs": 0.51, "validated_ms": 1819368, "validated_hrs": 0.5, "avg_ms": 17847, "min_ms": 3096, "max_ms": 45900, "avg_chars_per_sec": 9.4 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.0097, "teens": 0, "twenties": 0.8252, "thirties": 0.165, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.4078, "male_masculine": 0.4272, "female_feminine": 0.165, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "user_defined": 34, "": 86 }, "variant": { "": 103 } }, "reported": { "clips": 1, "reasons": { "different_language": 1 } }, "size": 10531917, "checksum": "a7c6e2ce909bd9ffd4fd57cf9edd52a66a4a37d4243dacde789aa723793d40ce", "generated_at": "2026-03-10T10:03:17.610Z" }, 
"ruc": { "locale": "ruc", "clips": 2822, "users": 26, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 23.52 }, "audios": { "total": 2822, "transcribed_validated": 1659, "transcribed_pending": 0, "not_transcribed": 1163 }, "transcriptions": { "total": 1659, "validated": 1659, "not_yet_validated": 0, "edited": 181, "edited_pct": 10.91 }, "duration": { "total_ms": 63901116, "total_hrs": 17.75, "validated_ms": 36522720, "validated_hrs": 10.14, "avg_ms": 22644, "min_ms": 720, "max_ms": 130536, "avg_chars_per_sec": 8.65 }, "buckets": { "train": { "clips": 1092, "users": 17, "duration_ms": 24617880, "duration_hrs": 6.83 }, "dev": { "clips": 270, "users": 4, "duration_ms": 5777352, "duration_hrs": 1.6 }, "test": { "clips": 297, "users": 4, "duration_ms": 6127488, "duration_hrs": 1.7 } }, "demographics": { "age": { "": 0.5152, "teens": 0, "twenties": 0.0422, "thirties": 0.2807, "fourties": 0.0801, "fifties": 0, "sixties": 0.0404, "seventies": 0.0415, "eighties": 0, "nineties": 0 }, "gender": { "": 0.5152, "male_masculine": 0.404, "female_feminine": 0.0808, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 2822 }, "variant": { "": 2822 } }, "reported": { "clips": 59, "reasons": { "other": 57, "personally_identifiable_information": 2 } }, "size": 383710860, "checksum": "d51357321312db9e7686b8a9083062fa9e2bfa20c0028602c343f2f89333a9a9", "generated_at": "2026-03-10T09:21:09.184Z" }, "rwm": { "locale": "rwm", "clips": 2427, "users": 21, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 20.23 }, "audios": { "total": 2427, "transcribed_validated": 1801, "transcribed_pending": 51, "not_transcribed": 575 }, "transcriptions": { "total": 1852, "validated": 1801, "not_yet_validated": 51, "edited": 444, "edited_pct": 23.97 }, "duration": { "total_ms": 49025628, "total_hrs": 13.61, 
"validated_ms": 36494424, "validated_hrs": 10.13, "avg_ms": 20200, "min_ms": 216, "max_ms": 94428, "avg_chars_per_sec": 7.66 }, "buckets": { "train": { "clips": 1226, "users": 14, "duration_ms": 24692616, "duration_hrs": 6.85 }, "dev": { "clips": 269, "users": 3, "duration_ms": 5928444, "duration_hrs": 1.64 }, "test": { "clips": 306, "users": 3, "duration_ms": 5873364, "duration_hrs": 1.63 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.0466, "thirties": 0.2299, "fourties": 0.1887, "fifties": 0.2909, "sixties": 0.0968, "seventies": 0.1471, "eighties": 0, "nineties": 0 }, "gender": { "": 0.1846, "male_masculine": 0.8154, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 2427 }, "variant": { "": 2427 } }, "reported": { "clips": 57, "reasons": { "different_language": 14, "personally_identifiable_information": 2, "offensive_speech": 1, "other": 40 } }, "size": 279534903, "checksum": "ae39060bd89260a57de43e9353f45d08f70c9265ec55450498370d4e66e800e1", "generated_at": "2026-03-10T09:53:51.754Z" }, "sco": { "locale": "sco", "clips": 715, "users": 21, "questions": { "total": 47, "validated": 0, "invalidated": 0, "other": 47, "has_audio": 47, "avg_recordings_per_question": 15.21 }, "audios": { "total": 715, "transcribed_validated": 680, "transcribed_pending": 0, "not_transcribed": 35 }, "transcriptions": { "total": 680, "validated": 680, "not_yet_validated": 0, "edited": 190, "edited_pct": 27.94 }, "duration": { "total_ms": 40234608, "total_hrs": 11.17, "validated_ms": 38478564, "validated_hrs": 10.68, "avg_ms": 56272, "min_ms": 576, "max_ms": 298656, "avg_chars_per_sec": 10.78 }, "buckets": { "train": { "clips": 507, "users": 15, "duration_ms": 26807220, "duration_hrs": 7.44 }, "dev": { "clips": 74, "users": 3, "duration_ms": 5893704, "duration_hrs": 1.63 }, "test": { "clips": 99, "users": 3, "duration_ms": 5777640, "duration_hrs": 1.6 } }, "demographics": { "age": { "": 0.2867, "teens": 0, 
"twenties": 0.0587, "thirties": 0.1944, "fourties": 0.0406, "fifties": 0.1664, "sixties": 0.1902, "seventies": 0.0629, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2867, "male_masculine": 0.2881, "female_feminine": 0.386, "intersex": 0, "transgender": 0, "non-binary": 0.0392, "do_not_wish_to_say": 0 }, "accent": { "": 715 }, "variant": { "": 715 } }, "reported": { "clips": 0, "reasons": {} }, "size": 239564764, "checksum": "cd3f952cd71a93eebada8feb874ddbb744668c7185d086bb25d3374fd2634f19", "generated_at": "2026-03-10T09:32:39.136Z" }, "sdo": { "locale": "sdo", "clips": 1389, "users": 25, "questions": { "total": 119, "validated": 0, "invalidated": 0, "other": 119, "has_audio": 119, "avg_recordings_per_question": 11.67 }, "audios": { "total": 1389, "transcribed_validated": 217, "transcribed_pending": 1172, "not_transcribed": 0 }, "transcriptions": { "total": 1389, "validated": 217, "not_yet_validated": 1172, "edited": 493, "edited_pct": 35.49 }, "duration": { "total_ms": 35331228, "total_hrs": 9.81, "validated_ms": 4778640, "validated_hrs": 1.32, "avg_ms": 25436, "min_ms": 1224, "max_ms": 120924, "avg_chars_per_sec": 7.62 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.9323, "teens": 0, "twenties": 0.0194, "thirties": 0, "fourties": 0, "fifties": 0, "sixties": 0.0482, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.9323, "male_masculine": 0.0677, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 1389 }, "variant": { "": 1389 } }, "reported": { "clips": 9, "reasons": { "different_language": 2, "other": 7 } }, "size": 211031338, "checksum": "ae04d9d4ecf500515466c90396d37c7db81c4878eadb1f5e9e6457fe47c8ea3c", "generated_at": "2026-03-10T08:43:17.727Z" }, "seh": { 
"locale": "seh", "clips": 27, "users": 1, "questions": { "total": 60, "validated": 0, "invalidated": 2, "other": 58, "has_audio": 27, "avg_recordings_per_question": 1 }, "audios": { "total": 27, "transcribed_validated": 1, "transcribed_pending": 2, "not_transcribed": 24 }, "transcriptions": { "total": 3, "validated": 1, "not_yet_validated": 2, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 1333008, "total_hrs": 0.37, "validated_ms": 13248, "validated_hrs": 0, "avg_ms": 49371, "min_ms": 12276, "max_ms": 117828, "avg_chars_per_sec": 8.7 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 1, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 27 }, "variant": { "": 27 } }, "reported": { "clips": 0, "reasons": {} }, "size": 6718106, "checksum": "05b964ab6a4459ec822639e3efe117db4cb672c7e36d5fd2cb46d0ef86b1bb19", "generated_at": "2026-03-10T10:03:22.111Z" }, "snv": { "locale": "snv", "clips": 2036, "users": 30, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 16.97 }, "audios": { "total": 2036, "transcribed_validated": 6, "transcribed_pending": 1101, "not_transcribed": 929 }, "transcriptions": { "total": 1107, "validated": 6, "not_yet_validated": 1101, "edited": 69, "edited_pct": 6.23 }, "duration": { "total_ms": 37699092, "total_hrs": 10.47, "validated_ms": 87876, "validated_hrs": 0.02, "avg_ms": 18516, "min_ms": 1008, "max_ms": 74448, "avg_chars_per_sec": 5.89 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, 
"duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.6179, "teens": 0, "twenties": 0, "thirties": 0.001, "fourties": 0, "fifties": 0.2883, "sixties": 0.0928, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.6179, "male_masculine": 0.1248, "female_feminine": 0.2574, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 2036 }, "variant": { "": 2036 } }, "reported": { "clips": 0, "reasons": {} }, "size": 223721938, "checksum": "8337b783429af574f771426ef1b4f8c30ed8b79d0b39a584c6504736681e62f2", "generated_at": "2026-03-10T08:57:31.036Z" }, "tob": { "locale": "tob", "clips": 1572, "users": 25, "questions": { "total": 136, "validated": 0, "invalidated": 0, "other": 136, "has_audio": 136, "avg_recordings_per_question": 11.56 }, "audios": { "total": 1572, "transcribed_validated": 1540, "transcribed_pending": 0, "not_transcribed": 32 }, "transcriptions": { "total": 1540, "validated": 1540, "not_yet_validated": 0, "edited": 540, "edited_pct": 35.06 }, "duration": { "total_ms": 36023976, "total_hrs": 10, "validated_ms": 34841304, "validated_hrs": 9.67, "avg_ms": 22916, "min_ms": 108, "max_ms": 191916, "avg_chars_per_sec": 5.23 }, "buckets": { "train": { "clips": 994, "users": 15, "duration_ms": 22964940, "duration_hrs": 6.37 }, "dev": { "clips": 195, "users": 4, "duration_ms": 6082668, "duration_hrs": 1.68 }, "test": { "clips": 351, "users": 6, "duration_ms": 5793696, "duration_hrs": 1.6 } }, "demographics": { "age": { "": 0.4198, "teens": 0, "twenties": 0, "thirties": 0.0827, "fourties": 0.0884, "fifties": 0.2455, "sixties": 0.1635, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.6654, "male_masculine": 0.1718, "female_feminine": 0.1628, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 1572 }, "variant": { "": 
1572 } }, "reported": { "clips": 0, "reasons": {} }, "size": 181803326, "checksum": "9c37f72b2135973dd83543f8044926722ceb026a411c09e12e56088f409aee71", "generated_at": "2026-03-10T09:08:00.228Z" }, "top": { "locale": "top", "clips": 417, "users": 10, "questions": { "total": 150, "validated": 0, "invalidated": 0, "other": 150, "has_audio": 143, "avg_recordings_per_question": 2.92 }, "audios": { "total": 417, "transcribed_validated": 413, "transcribed_pending": 0, "not_transcribed": 4 }, "transcriptions": { "total": 413, "validated": 413, "not_yet_validated": 0, "edited": 159, "edited_pct": 38.5 }, "duration": { "total_ms": 37808280, "total_hrs": 10.5, "validated_ms": 37316664, "validated_hrs": 10.36, "avg_ms": 90667, "min_ms": 5256, "max_ms": 498996, "avg_chars_per_sec": 5.83 }, "buckets": { "train": { "clips": 264, "users": 4, "duration_ms": 24848172, "duration_hrs": 6.9 }, "dev": { "clips": 82, "users": 3, "duration_ms": 6530112, "duration_hrs": 1.81 }, "test": { "clips": 67, "users": 3, "duration_ms": 5938380, "duration_hrs": 1.64 } }, "demographics": { "age": { "": 0.0671, "teens": 0, "twenties": 0, "thirties": 0.4269, "fourties": 0.1823, "fifties": 0.048, "sixties": 0.2758, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.247, "male_masculine": 0.3237, "female_feminine": 0.4293, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 417 }, "variant": { "": 417 } }, "reported": { "clips": 0, "reasons": {} }, "size": 216204384, "checksum": "283dd8795460c572df64fd567139826fc2f589805309bf4ce9e771fe1aeed9b5", "generated_at": "2026-03-10T09:10:04.197Z" }, "tr": { "locale": "tr", "clips": 25, "users": 8, "questions": { "total": 140, "validated": 0, "invalidated": 0, "other": 140, "has_audio": 25, "avg_recordings_per_question": 1 }, "audios": { "total": 25, "transcribed_validated": 22, "transcribed_pending": 0, "not_transcribed": 3 }, "transcriptions": { "total": 22, "validated": 22, "not_yet_validated": 0, 
"edited": 8, "edited_pct": 36.36 }, "duration": { "total_ms": 556632, "total_hrs": 0.15, "validated_ms": 537984, "validated_hrs": 0.14, "avg_ms": 22265, "min_ms": 4428, "max_ms": 56628, "avg_chars_per_sec": 8.96 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.08, "thirties": 0.12, "fourties": 0, "fifties": 0.12, "sixties": 0.6, "seventies": 0.08, "eighties": 0, "nineties": 0 }, "gender": { "": 0.08, "male_masculine": 0.64, "female_feminine": 0.28, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "user_defined": 16, "": 9 }, "variant": { "": 25 } }, "reported": { "clips": 0, "reasons": {} }, "size": 3310709, "checksum": "e48c00b8c5bc8a12ca28b88fec7170d7a02b057d535c64688497afd5e354258e", "generated_at": "2026-03-10T09:34:18.454Z" }, "ttj": { "locale": "ttj", "clips": 3100, "users": 26, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 25.83 }, "audios": { "total": 3100, "transcribed_validated": 1882, "transcribed_pending": 0, "not_transcribed": 1218 }, "transcriptions": { "total": 1882, "validated": 1882, "not_yet_validated": 0, "edited": 783, "edited_pct": 41.6 }, "duration": { "total_ms": 60386832, "total_hrs": 16.77, "validated_ms": 36131184, "validated_hrs": 10.03, "avg_ms": 19480, "min_ms": 3960, "max_ms": 82080, "avg_chars_per_sec": 9.05 }, "buckets": { "train": { "clips": 1262, "users": 14, "duration_ms": 24084936, "duration_hrs": 6.69 }, "dev": { "clips": 359, "users": 5, "duration_ms": 6441552, "duration_hrs": 1.78 }, "test": { "clips": 261, "users": 3, "duration_ms": 5604696, "duration_hrs": 1.55 } }, "demographics": { "age": { "": 0, "teens": 0.0381, "twenties": 0.2316, "thirties": 0.1906, 
"fourties": 0.3097, "fifties": 0.1913, "sixties": 0.0387, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0, "male_masculine": 0.1935, "female_feminine": 0.8065, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 3100 }, "variant": { "": 3100 } }, "reported": { "clips": 0, "reasons": {} }, "size": 287333739, "checksum": "e9925c4096b02524a0a1bf734266d22c109bc6bccf76ce758433190330211c25", "generated_at": "2026-03-10T09:57:20.734Z" }, "ukv": { "locale": "ukv", "clips": 2570, "users": 22, "questions": { "total": 119, "validated": 0, "invalidated": 19, "other": 100, "has_audio": 119, "avg_recordings_per_question": 21.6 }, "audios": { "total": 2570, "transcribed_validated": 2202, "transcribed_pending": 4, "not_transcribed": 364 }, "transcriptions": { "total": 2206, "validated": 2202, "not_yet_validated": 4, "edited": 966, "edited_pct": 43.79 }, "duration": { "total_ms": 42464052, "total_hrs": 11.79, "validated_ms": 36133524, "validated_hrs": 10.03, "avg_ms": 16523, "min_ms": 1908, "max_ms": 114768, "avg_chars_per_sec": 6.77 }, "buckets": { "train": { "clips": 1498, "users": 15, "duration_ms": 23123592, "duration_hrs": 6.42 }, "dev": { "clips": 403, "users": 4, "duration_ms": 6130152, "duration_hrs": 1.7 }, "test": { "clips": 301, "users": 3, "duration_ms": 6879780, "duration_hrs": 1.91 } }, "demographics": { "age": { "": 0.4564, "teens": 0, "twenties": 0.1786, "thirties": 0.0891, "fourties": 0.2296, "fifties": 0.0463, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.4564, "male_masculine": 0.1362, "female_feminine": 0.4074, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 2570 }, "variant": { "": 2570 } }, "reported": { "clips": 35, "reasons": { "different_language": 5, "offensive_speech": 1, "other": 29 } }, "size": 249801641, "checksum": "f333b71ef8741d5a30588a9bc2950a2982e2ab10be587dd9129118dfc0e0b8c1", "generated_at": 
"2026-03-10T09:29:19.305Z" }, "ush": { "locale": "ush", "clips": 664, "users": 10, "questions": { "total": 118, "validated": 0, "invalidated": 0, "other": 118, "has_audio": 118, "avg_recordings_per_question": 5.63 }, "audios": { "total": 664, "transcribed_validated": 644, "transcribed_pending": 7, "not_transcribed": 13 }, "transcriptions": { "total": 651, "validated": 644, "not_yet_validated": 7, "edited": 64, "edited_pct": 9.83 }, "duration": { "total_ms": 18281844, "total_hrs": 5.07, "validated_ms": 17802792, "validated_hrs": 4.94, "avg_ms": 27533, "min_ms": 180, "max_ms": 149760, "avg_chars_per_sec": 11.45 }, "buckets": { "train": { "clips": 204, "users": 3, "duration_ms": 5713992, "duration_hrs": 1.58 }, "dev": { "clips": 203, "users": 3, "duration_ms": 6383160, "duration_hrs": 1.77 }, "test": { "clips": 237, "users": 4, "duration_ms": 5705640, "duration_hrs": 1.58 } }, "demographics": { "age": { "": 0.1988, "teens": 0.1747, "twenties": 0.4714, "thirties": 0.003, "fourties": 0.1521, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.9985, "male_masculine": 0.0015, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 664 }, "variant": { "": 664 } }, "reported": { "clips": 1, "reasons": { "other": 1 } }, "size": 108218394, "checksum": "c49aa97b202b93844349b5ae3c6f6091638199c6105d68c3471469590c236ff1", "generated_at": "2026-03-10T09:11:03.979Z" }, "xkl": { "locale": "xkl", "clips": 1795, "users": 23, "questions": { "total": 120, "validated": 0, "invalidated": 1, "other": 119, "has_audio": 120, "avg_recordings_per_question": 14.96 }, "audios": { "total": 1795, "transcribed_validated": 259, "transcribed_pending": 1516, "not_transcribed": 20 }, "transcriptions": { "total": 1775, "validated": 259, "not_yet_validated": 1516, "edited": 302, "edited_pct": 17.01 }, "duration": { "total_ms": 37481616, "total_hrs": 10.41, "validated_ms": 5204916, "validated_hrs": 1.44, 
"avg_ms": 20881, "min_ms": 1764, "max_ms": 86400, "avg_chars_per_sec": 6.94 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.6301, "teens": 0, "twenties": 0.1142, "thirties": 0.0875, "fourties": 0.0635, "fifties": 0, "sixties": 0.0507, "seventies": 0.054, "eighties": 0, "nineties": 0 }, "gender": { "": 0.6301, "male_masculine": 0.093, "female_feminine": 0.2769, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 1795 }, "variant": { "": 1795 } }, "reported": { "clips": 0, "reasons": {} }, "size": 223056735, "checksum": "ff0d167329c87962c807f1eb9c8cd0c5f86df5097d616d738bced1ae22e73242", "generated_at": "2026-03-10T08:47:41.895Z" } }, "totalDurationMs": 1542692628, "totalValidDurationMs": 946680516, "totalHrs": 428, "totalValidHrs": 263 } ================================================ FILE: datasets/spontaneous-speech/sps-corpus-2.0-2025-12-05.json ================================================ { "locales": { "aat": { "locale": "aat", "clips": 334, "users": 5, "questions": { "total": 111, "validated": 0, "invalidated": 0, "other": 111, "has_audio": 111, "avg_recordings_per_question": 3.01 }, "audios": { "total": 334, "transcribed_validated": 0, "transcribed_pending": 194, "not_transcribed": 140 }, "transcriptions": { "total": 194, "validated": 0, "not_yet_validated": 194, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 8290188, "total_hrs": 2.3, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 24821, "min_ms": 3996, "max_ms": 193536, "avg_chars_per_sec": 6.45 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, 
"demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0.7006, "eighties": 0.2994, "nineties": 0 }, "gender": { "": 0.1587, "male_masculine": 0, "female_feminine": 0.8413, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 334 }, "variant": { "": 334 } }, "reported": { "clips": 0, "reasons": {} }, "size": 49124780, "checksum": "48ee23994b948101fd418489b0d80e0b1adb09546db223c10b70da78c96de1da", "generated_at": "2026-03-10T10:35:11.395Z" }, "ady": { "locale": "ady", "clips": 798, "users": 24, "questions": { "total": 128, "validated": 34, "invalidated": 0, "other": 94, "has_audio": 127, "avg_recordings_per_question": 6.28 }, "audios": { "total": 798, "transcribed_validated": 688, "transcribed_pending": 24, "not_transcribed": 86 }, "transcriptions": { "total": 712, "validated": 688, "not_yet_validated": 24, "edited": 328, "edited_pct": 46.07 }, "duration": { "total_ms": 20295216, "total_hrs": 5.63, "validated_ms": 16773084, "validated_hrs": 4.65, "avg_ms": 25433, "min_ms": 1116, "max_ms": 132048, "avg_chars_per_sec": 7.67 }, "buckets": { "train": { "clips": 160, "users": 7, "duration_ms": 3540096, "duration_hrs": 0.98 }, "dev": { "clips": 245, "users": 7, "duration_ms": 7173036, "duration_hrs": 1.99 }, "test": { "clips": 283, "users": 9, "duration_ms": 6059952, "duration_hrs": 1.68 } }, "demographics": { "age": { "": 0.0351, "teens": 0, "twenties": 0.1353, "thirties": 0.2068, "fourties": 0.1491, "fifties": 0.3584, "sixties": 0.1153, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.0802, "male_masculine": 0.2506, "female_feminine": 0.6692, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "abzakh": 67, "bzhedug": 76, "hatikuay": 0, "shapsug": 216, "temirgoy": 90, "ubykh": 0, "user_defined": 10, "": 370 }, "variant": { "ady-Cyrl": 410, "ady-Cyrl-JOR": 0, "ady-Cyrl-SY": 0, "ady-Cyrl-TR": 268, 
"ady-Latn-TR-t-ady-cyrl": 0, "ady-RU": 18, "": 102 } }, "reported": { "clips": 4, "reasons": { "other": 3, "different_language": 1 } }, "size": 113215512, "checksum": "91b78725405609f2988d32db73af136a2ee4de6ca962aa82637ee6d757e00bb1", "generated_at": "2026-03-10T10:35:25.341Z" }, "aln": { "locale": "aln", "clips": 1654, "users": 14, "questions": { "total": 144, "validated": 0, "invalidated": 0, "other": 144, "has_audio": 144, "avg_recordings_per_question": 11.49 }, "audios": { "total": 1654, "transcribed_validated": 1516, "transcribed_pending": 0, "not_transcribed": 138 }, "transcriptions": { "total": 1516, "validated": 1516, "not_yet_validated": 0, "edited": 103, "edited_pct": 6.79 }, "duration": { "total_ms": 39111120, "total_hrs": 10.86, "validated_ms": 37793736, "validated_hrs": 10.49, "avg_ms": 23646, "min_ms": 396, "max_ms": 73368, "avg_chars_per_sec": 6.9 }, "buckets": { "train": { "clips": 926, "users": 7, "duration_ms": 24455952, "duration_hrs": 6.79 }, "dev": { "clips": 291, "users": 3, "duration_ms": 6968376, "duration_hrs": 1.93 }, "test": { "clips": 299, "users": 3, "duration_ms": 6369408, "duration_hrs": 1.76 } }, "demographics": { "age": { "": 0.0707, "teens": 0.1608, "twenties": 0.0036, "thirties": 0.0871, "fourties": 0.4317, "fifties": 0.2461, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.4426, "male_masculine": 0.0738, "female_feminine": 0.4837, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 1654 }, "variant": { "": 1654 } }, "reported": { "clips": 95, "reasons": { "other": 95 } }, "size": 212137972, "checksum": "889a5397f2288d3baa851d1249d630688f3b018e90ed6f6948d81a9aa4b462d0", "generated_at": "2026-03-10T10:33:06.153Z" }, "an": { "locale": "an", "clips": 19, "users": 1, "questions": { "total": 211, "validated": 0, "invalidated": 0, "other": 211, "has_audio": 19, "avg_recordings_per_question": 1 }, "audios": { "total": 19, "transcribed_validated": 19, 
"transcribed_pending": 0, "not_transcribed": 0 }, "transcriptions": { "total": 19, "validated": 19, "not_yet_validated": 0, "edited": 7, "edited_pct": 36.84 }, "duration": { "total_ms": 395352, "total_hrs": 0.1, "validated_ms": 395352, "validated_hrs": 0.1, "avg_ms": 20808, "min_ms": 7740, "max_ms": 46368, "avg_chars_per_sec": 9.21 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 0, "fourties": 0, "fifties": 1, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0, "male_masculine": 0, "female_feminine": 1, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "user_defined": 19 }, "variant": { "": 19 } }, "reported": { "clips": 0, "reasons": {} }, "size": 2360597, "checksum": "115db4d7b8f2961e6cc3cd7690c8aac579c07c9c0f714365ec0ca2de02de431e", "generated_at": "2026-03-10T10:34:13.754Z" }, "ba": { "locale": "ba", "clips": 127, "users": 4, "questions": { "total": 167, "validated": 0, "invalidated": 0, "other": 167, "has_audio": 108, "avg_recordings_per_question": 1.18 }, "audios": { "total": 127, "transcribed_validated": 6, "transcribed_pending": 115, "not_transcribed": 6 }, "transcriptions": { "total": 121, "validated": 6, "not_yet_validated": 115, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 901872, "total_hrs": 0.25, "validated_ms": 31176, "validated_hrs": 0, "avg_ms": 7101, "min_ms": 1836, "max_ms": 31536, "avg_chars_per_sec": 7.3 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.0551, 
"thirties": 0.0315, "fourties": 0.1339, "fifties": 0.7795, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0, "male_masculine": 1, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 127 }, "variant": { "": 127 } }, "reported": { "clips": 0, "reasons": {} }, "size": 5348629, "checksum": "a0aadfb90e36a8d93ec75a45a511fe0246567d8090a186ca83883a8392f09437", "generated_at": "2026-03-10T10:40:20.927Z" }, "bas": { "locale": "bas", "clips": 773, "users": 11, "questions": { "total": 74, "validated": 0, "invalidated": 0, "other": 74, "has_audio": 74, "avg_recordings_per_question": 10.45 }, "audios": { "total": 773, "transcribed_validated": 734, "transcribed_pending": 38, "not_transcribed": 1 }, "transcriptions": { "total": 772, "validated": 734, "not_yet_validated": 38, "edited": 500, "edited_pct": 64.77 }, "duration": { "total_ms": 19354320, "total_hrs": 5.37, "validated_ms": 18232452, "validated_hrs": 5.06, "avg_ms": 25038, "min_ms": 4680, "max_ms": 64116, "avg_chars_per_sec": 8.86 }, "buckets": { "train": { "clips": 202, "users": 3, "duration_ms": 4931604, "duration_hrs": 1.36 }, "dev": { "clips": 252, "users": 4, "duration_ms": 6487056, "duration_hrs": 1.8 }, "test": { "clips": 280, "users": 4, "duration_ms": 6813792, "duration_hrs": 1.89 } }, "demographics": { "age": { "": 0, "teens": 0.1889, "twenties": 0, "thirties": 0.1863, "fourties": 0.5304, "fifties": 0.0944, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.5705, "male_masculine": 0.4295, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 773 }, "variant": { "": 773 } }, "reported": { "clips": 0, "reasons": {} }, "size": 115065154, "checksum": "b01311d068d5b473f43fb341d4089ef28b2ec5314836d0562bc2fc47c0768278", "generated_at": "2026-03-10T10:40:10.979Z" }, "bew": { "locale": "bew", "clips": 1336, "users": 21, "questions": { 
"total": 200, "validated": 0, "invalidated": 0, "other": 200, "has_audio": 199, "avg_recordings_per_question": 6.71 }, "audios": { "total": 1336, "transcribed_validated": 1270, "transcribed_pending": 12, "not_transcribed": 54 }, "transcriptions": { "total": 1282, "validated": 1270, "not_yet_validated": 12, "edited": 1127, "edited_pct": 87.91 }, "duration": { "total_ms": 37777068, "total_hrs": 10.49, "validated_ms": 35235684, "validated_hrs": 9.78, "avg_ms": 28276, "min_ms": 3168, "max_ms": 156996, "avg_chars_per_sec": 9.27 }, "buckets": { "train": { "clips": 772, "users": 13, "duration_ms": 23049540, "duration_hrs": 6.4 }, "dev": { "clips": 292, "users": 4, "duration_ms": 6589116, "duration_hrs": 1.83 }, "test": { "clips": 206, "users": 4, "duration_ms": 5597028, "duration_hrs": 1.55 } }, "demographics": { "age": { "": 0.6557, "teens": 0, "twenties": 0.1198, "thirties": 0.0756, "fourties": 0.149, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.6557, "male_masculine": 0.3443, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 1336 }, "variant": { "": 1336 } }, "reported": { "clips": 0, "reasons": {} }, "size": 224874219, "checksum": "ee501a0396630ec76946d4c6ed37ab456ad73a4f8076436ce6125513324f0482", "generated_at": "2026-03-10T10:28:02.626Z" }, "br": { "locale": "br", "clips": 247, "users": 14, "questions": { "total": 76, "validated": 5, "invalidated": 0, "other": 71, "has_audio": 62, "avg_recordings_per_question": 3.98 }, "audios": { "total": 247, "transcribed_validated": 73, "transcribed_pending": 55, "not_transcribed": 119 }, "transcriptions": { "total": 128, "validated": 73, "not_yet_validated": 55, "edited": 12, "edited_pct": 9.38 }, "duration": { "total_ms": 2525256, "total_hrs": 0.7, "validated_ms": 698256, "validated_hrs": 0.19, "avg_ms": 10224, "min_ms": 1980, "max_ms": 49788, "avg_chars_per_sec": 6.68 }, "buckets": { "train": { "clips": 0, "users": 
0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.0283, "thirties": 0.2713, "fourties": 0.2794, "fifties": 0.0364, "sixties": 0.1984, "seventies": 0.1862, "eighties": 0, "nineties": 0 }, "gender": { "": 0.9798, "male_masculine": 0, "female_feminine": 0.0202, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "gwenedeg": 4, "kerneveg": 65, "leoneg": 117, "tregerieg": 116, "user_defined": 45, "": 4 }, "variant": { "": 247 } }, "reported": { "clips": 0, "reasons": {} }, "size": 14297127, "checksum": "e9af399415508f15d82a4dfd00e705f0725dbd22c6649da49dff406c953e3f49", "generated_at": "2026-03-10T10:39:57.658Z" }, "brx": { "locale": "brx", "clips": 69, "users": 1, "questions": { "total": 69, "validated": 3, "invalidated": 0, "other": 66, "has_audio": 69, "avg_recordings_per_question": 1 }, "audios": { "total": 69, "transcribed_validated": 25, "transcribed_pending": 44, "not_transcribed": 0 }, "transcriptions": { "total": 69, "validated": 25, "not_yet_validated": 44, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 232344, "total_hrs": 0.06, "validated_ms": 84816, "validated_hrs": 0.02, "avg_ms": 3367, "min_ms": 2376, "max_ms": 7128, "avg_chars_per_sec": 9.17 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 0, "fourties": 1, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0, "male_masculine": 0, "female_feminine": 1, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 69 }, "variant": { "": 
69 } }, "reported": { "clips": 0, "reasons": {} }, "size": 1364967, "checksum": "ec0bdfde5fb74854cd95f3e90552a2efbccc1a98a1ca3d2fd6a479bac56b1b2b", "generated_at": "2026-03-10T10:40:25.622Z" }, "bsy": { "locale": "bsy", "clips": 1741, "users": 25, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 14.51 }, "audios": { "total": 1741, "transcribed_validated": 0, "transcribed_pending": 1741, "not_transcribed": 0 }, "transcriptions": { "total": 1741, "validated": 0, "not_yet_validated": 1741, "edited": 2, "edited_pct": 0.11 }, "duration": { "total_ms": 38892996, "total_hrs": 10.8, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 22339, "min_ms": 1764, "max_ms": 154764, "avg_chars_per_sec": 8.19 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.7777, "teens": 0, "twenties": 0.0638, "thirties": 0, "fourties": 0.0402, "fifties": 0.0632, "sixties": 0.0551, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.7777, "male_masculine": 0.104, "female_feminine": 0.1183, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 1741 }, "variant": { "": 1741 } }, "reported": { "clips": 26, "reasons": { "other": 21, "different_language": 4, "personally_identifiable_information": 1 } }, "size": 232062329, "checksum": "5c75c7ba569755ac8f66c371aa7a9d8316c3ae4d9a0eb88f2dba636efe33d155", "generated_at": "2026-03-10T10:25:39.405Z" }, "bxk": { "locale": "bxk", "clips": 3214, "users": 27, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 26.78 }, "audios": { "total": 3214, "transcribed_validated": 2215, "transcribed_pending": 0, "not_transcribed": 999 }, "transcriptions": { 
"total": 2215, "validated": 2215, "not_yet_validated": 0, "edited": 398, "edited_pct": 17.97 }, "duration": { "total_ms": 57135060, "total_hrs": 15.87, "validated_ms": 39742308, "validated_hrs": 11.03, "avg_ms": 17777, "min_ms": 540, "max_ms": 72180, "avg_chars_per_sec": 8.12 }, "buckets": { "train": { "clips": 1554, "users": 16, "duration_ms": 27796392, "duration_hrs": 7.72 }, "dev": { "clips": 286, "users": 3, "duration_ms": 5855004, "duration_hrs": 1.62 }, "test": { "clips": 375, "users": 5, "duration_ms": 6090912, "duration_hrs": 1.69 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.0373, "thirties": 0.1733, "fourties": 0.257, "fifties": 0.3158, "sixties": 0.2166, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.1758, "male_masculine": 0.5756, "female_feminine": 0.2486, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 3214 }, "variant": { "": 3214 } }, "reported": { "clips": 40, "reasons": { "other": 36, "different_language": 2, "personally_identifiable_information": 2 } }, "size": 275080770, "checksum": "e2f31b9f055c0004cff0751eca808093ccb4b8125173830e2f9a37f47bd18e28", "generated_at": "2026-03-10T10:36:10.698Z" }, "ca": { "locale": "ca", "clips": 140, "users": 12, "questions": { "total": 74, "validated": 14, "invalidated": 0, "other": 60, "has_audio": 57, "avg_recordings_per_question": 2.46 }, "audios": { "total": 140, "transcribed_validated": 123, "transcribed_pending": 16, "not_transcribed": 1 }, "transcriptions": { "total": 139, "validated": 123, "not_yet_validated": 16, "edited": 42, "edited_pct": 30.22 }, "duration": { "total_ms": 2156976, "total_hrs": 0.59, "validated_ms": 1783692, "validated_hrs": 0.49, "avg_ms": 15407, "min_ms": 2556, "max_ms": 72180, "avg_chars_per_sec": 7.62 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, 
"duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.1071, "teens": 0, "twenties": 0, "thirties": 0.1143, "fourties": 0.4357, "fifties": 0.1929, "sixties": 0.15, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.1071, "male_masculine": 0.6143, "female_feminine": 0.2786, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "balearic": 0, "central": 3, "learner_es": 0, "learner_other": 0, "northern": 0, "northwestern": 0, "valencian": 28, "user_defined": 32, "": 77 }, "variant": { "ca-algueres": 0, "ca-balear": 0, "ca-central": 84, "ca-northern": 0, "ca-nwestern": 0, "ca-valencia-alacant": 0, "ca-valencia-central": 1, "ca-valencia-northern": 0, "ca-valencia-southern": 0, "ca-valencia-tortosi": 0, "": 55 } }, "reported": { "clips": 2, "reasons": { "other": 2 } }, "size": 12781819, "checksum": "f6f68284c9312c36cd9719c3f14916fb23f85528cefe4862e62ecea0ce076ad8", "generated_at": "2026-03-10T10:39:19.894Z" }, "cdo": { "locale": "cdo", "clips": 1032, "users": 31, "questions": { "total": 120, "validated": 1, "invalidated": 0, "other": 119, "has_audio": 119, "avg_recordings_per_question": 8.67 }, "audios": { "total": 1032, "transcribed_validated": 107, "transcribed_pending": 925, "not_transcribed": 0 }, "transcriptions": { "total": 1032, "validated": 107, "not_yet_validated": 925, "edited": 73, "edited_pct": 7.07 }, "duration": { "total_ms": 35155944, "total_hrs": 9.76, "validated_ms": 3289824, "validated_hrs": 0.91, "avg_ms": 34066, "min_ms": 720, "max_ms": 476568, "avg_chars_per_sec": 2.67 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.8043, "teens": 0, "twenties": 0, "thirties": 0, "fourties": 0.1143, "fifties": 0.0814, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, 
"gender": { "": 0.8043, "male_masculine": 0.0814, "female_feminine": 0.1143, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 1032 }, "variant": { "": 1032 } }, "reported": { "clips": 4, "reasons": { "other": 4 } }, "size": 201042622, "checksum": "b001b68d524d3ea62e7e9cbebbb0df82c0b02fc7f620dc6f362d88f4e3fc09bf", "generated_at": "2026-03-10T10:22:35.283Z" }, "cgg": { "locale": "cgg", "clips": 3085, "users": 27, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 25.71 }, "audios": { "total": 3085, "transcribed_validated": 2711, "transcribed_pending": 0, "not_transcribed": 374 }, "transcriptions": { "total": 2711, "validated": 2711, "not_yet_validated": 0, "edited": 516, "edited_pct": 19.03 }, "duration": { "total_ms": 42117228, "total_hrs": 11.69, "validated_ms": 36681696, "validated_hrs": 10.18, "avg_ms": 13652, "min_ms": 936, "max_ms": 86976, "avg_chars_per_sec": 8.61 }, "buckets": { "train": { "clips": 1743, "users": 17, "duration_ms": 25241472, "duration_hrs": 7.01 }, "dev": { "clips": 533, "users": 6, "duration_ms": 5940900, "duration_hrs": 1.65 }, "test": { "clips": 435, "users": 4, "duration_ms": 5499324, "duration_hrs": 1.52 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.5825, "thirties": 0.2716, "fourties": 0.1459, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.1164, "male_masculine": 0.2726, "female_feminine": 0.611, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 3085 }, "variant": { "": 3085 } }, "reported": { "clips": 0, "reasons": {} }, "size": 245575238, "checksum": "4eee2d8c256f620e6739ccf655571a452330f0f5b86a5d6c506afbf7e427b0b2", "generated_at": "2026-03-10T10:37:27.387Z" }, "cpx": { "locale": "cpx", "clips": 2007, "users": 29, "questions": { "total": 119, "validated": 0, "invalidated": 0, "other": 119, "has_audio": 119, 
"avg_recordings_per_question": 16.87 }, "audios": { "total": 2007, "transcribed_validated": 13, "transcribed_pending": 1991, "not_transcribed": 3 }, "transcriptions": { "total": 2004, "validated": 13, "not_yet_validated": 1991, "edited": 3, "edited_pct": 0.15 }, "duration": { "total_ms": 38396232, "total_hrs": 10.66, "validated_ms": 116244, "validated_hrs": 0.03, "avg_ms": 19131, "min_ms": 720, "max_ms": 83160, "avg_chars_per_sec": 3.23 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.7997, "teens": 0, "twenties": 0, "thirties": 0, "fourties": 0.0877, "fifties": 0.0593, "sixties": 0.0249, "seventies": 0.0284, "eighties": 0, "nineties": 0 }, "gender": { "": 0.7997, "male_masculine": 0.1221, "female_feminine": 0.0782, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 2007 }, "variant": { "": 2007 } }, "reported": { "clips": 110, "reasons": { "other": 45, "different_language": 65 } }, "size": 236713275, "checksum": "5b1be27e7e7e38635becc0dad7b088664e31bf534ec25bc071248016334666af", "generated_at": "2026-03-10T10:22:07.488Z" }, "cy": { "locale": "cy", "clips": 11, "users": 2, "questions": { "total": 60, "validated": 0, "invalidated": 0, "other": 60, "has_audio": 11, "avg_recordings_per_question": 1 }, "audios": { "total": 11, "transcribed_validated": 0, "transcribed_pending": 3, "not_transcribed": 8 }, "transcriptions": { "total": 3, "validated": 0, "not_yet_validated": 3, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 631980, "total_hrs": 0.17, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 57453, "min_ms": 2808, "max_ms": 162288, "avg_chars_per_sec": 6.68 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, 
"duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.7273, "thirties": 0, "fourties": 0, "fifties": 0.2727, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2727, "male_masculine": 0, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0.7273, "do_not_wish_to_say": 0 }, "accent": { "united_kingdom": 3, "user_defined": 8 }, "variant": { "cy-midwales": 0, "cy-northeas": 0, "cy-northwes": 8, "cy-southeas": 0, "cy-southwes": 0, "cy-wladfa": 0, "": 3 } }, "reported": { "clips": 0, "reasons": {} }, "size": 3758062, "checksum": "c690331e3d14a2802e806325eb6d643f6a0f8c7b1c0f5445a21d2b1c0bbb2784", "generated_at": "2026-03-10T10:39:03.859Z" }, "de": { "locale": "de", "clips": 214, "users": 17, "questions": { "total": 92, "validated": 26, "invalidated": 6, "other": 60, "has_audio": 79, "avg_recordings_per_question": 2.71 }, "audios": { "total": 214, "transcribed_validated": 35, "transcribed_pending": 5, "not_transcribed": 174 }, "transcriptions": { "total": 40, "validated": 35, "not_yet_validated": 5, "edited": 12, "edited_pct": 30 }, "duration": { "total_ms": 4286088, "total_hrs": 1.19, "validated_ms": 356688, "validated_hrs": 0.09, "avg_ms": 20028, "min_ms": 3024, "max_ms": 160704, "avg_chars_per_sec": 10.55 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0.1308, "twenties": 0.0841, "thirties": 0.1402, "fourties": 0.1449, "fifties": 0.3271, "sixties": 0.1729, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2944, "male_masculine": 0.2523, "female_feminine": 0.4533, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "austria": 0, "belgium": 0, 
"brazil": 0, "bulgaria": 0, "canada": 0, "czechia": 0, "denmark": 0, "finland": 0, "france": 0, "germany": 146, "greece": 0, "hungary": 0, "italy": 0, "kazakhstan": 0, "liechtenstein": 0, "lithuania": 0, "luxembourg": 0, "namibia": 0, "netherlands": 0, "paraguay": 0, "poland": 0, "romania": 0, "russia": 0, "slovakia": 0, "slovenia": 0, "switzerland": 2, "turkey": 0, "united_kingdom": 0, "united_states": 0, "user_defined": 1, "": 66 }, "variant": { "": 214 } }, "reported": { "clips": 3, "reasons": { "other": 2, "different_language": 1 } }, "size": 23283107, "checksum": "519c20bfa76f10507b7f539622e997dff86e3f82107353e0257add3448b4fca2", "generated_at": "2026-03-10T10:40:28.536Z" }, "el-CY": { "locale": "el-CY", "clips": 1221, "users": 10, "questions": { "total": 146, "validated": 0, "invalidated": 0, "other": 146, "has_audio": 146, "avg_recordings_per_question": 8.36 }, "audios": { "total": 1221, "transcribed_validated": 1217, "transcribed_pending": 3, "not_transcribed": 1 }, "transcriptions": { "total": 1220, "validated": 1217, "not_yet_validated": 3, "edited": 1196, "edited_pct": 98.03 }, "duration": { "total_ms": 37345716, "total_hrs": 10.37, "validated_ms": 37167084, "validated_hrs": 10.32, "avg_ms": 30586, "min_ms": 3456, "max_ms": 271080, "avg_chars_per_sec": 9.52 }, "buckets": { "train": { "clips": 824, "users": 7, "duration_ms": 21533544, "duration_hrs": 5.98 }, "dev": { "clips": 280, "users": 2, "duration_ms": 9810576, "duration_hrs": 2.72 }, "test": { "clips": 113, "users": 1, "duration_ms": 5822964, "duration_hrs": 1.61 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.6585, "thirties": 0.1196, "fourties": 0.1753, "fifties": 0, "sixties": 0, "seventies": 0.0467, "eighties": 0, "nineties": 0 }, "gender": { "": 0, "male_masculine": 0.4144, "female_feminine": 0.5856, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 1221 }, "variant": { "": 1221 } }, "reported": { "clips": 0, "reasons": {} }, "size": 
221759064, "checksum": "bfa04ae84aa856c2516053ea35236b096d95db5350c441ca301652ebdc88d6c4", "generated_at": "2026-03-10T10:34:27.098Z" }, "en": { "locale": "en", "clips": 5531, "users": 509, "questions": { "total": 423, "validated": 245, "invalidated": 31, "other": 147, "has_audio": 337, "avg_recordings_per_question": 16.41 }, "audios": { "total": 5531, "transcribed_validated": 2246, "transcribed_pending": 60, "not_transcribed": 3225 }, "transcriptions": { "total": 2306, "validated": 2246, "not_yet_validated": 60, "edited": 266, "edited_pct": 11.54 }, "duration": { "total_ms": 56428956, "total_hrs": 15.67, "validated_ms": 29016288, "validated_hrs": 8.06, "avg_ms": 10202, "min_ms": 108, "max_ms": 400320, "avg_chars_per_sec": 7.33 }, "buckets": { "train": { "clips": 1412, "users": 194, "duration_ms": 17534088, "duration_hrs": 4.87 }, "dev": { "clips": 445, "users": 63, "duration_ms": 6045444, "duration_hrs": 1.67 }, "test": { "clips": 389, "users": 78, "duration_ms": 5436756, "duration_hrs": 1.51 } }, "demographics": { "age": { "": 0.0805, "teens": 0.1204, "twenties": 0.5126, "thirties": 0.1054, "fourties": 0.0212, "fifties": 0.0213, "sixties": 0.0879, "seventies": 0.0251, "eighties": 0.0251, "nineties": 0.0005 }, "gender": { "": 0.454, "male_masculine": 0.1408, "female_feminine": 0.3366, "intersex": 0.03, "transgender": 0.0072, "non-binary": 0.0042, "do_not_wish_to_say": 0.0271 }, "accent": { "african": 20, "australia": 24, "bermuda": 0, "canada": 4, "england": 166, "hongkong": 0, "indian": 62, "ireland": 3, "malaysia": 4, "newzealand": 0, "philippines": 1, "scotland": 1, "singapore": 8, "southatlandtic": 0, "us": 394, "wales": 0, "user_defined": 837, "": 4617 }, "variant": { "": 5531 } }, "reported": { "clips": 791, "reasons": { "other": 80, "different_language": 708, "personally_identifiable_information": 2, "offensive_speech": 1 } }, "size": 379355907, "checksum": "f0bd9c2e482cd609226d81b36138478717d13f63a7f9580ecc0490ea7215ba2c", "generated_at": 
"2026-03-10T10:21:26.013Z" }, "es": { "locale": "es", "clips": 12, "users": 3, "questions": { "total": 90, "validated": 25, "invalidated": 0, "other": 65, "has_audio": 12, "avg_recordings_per_question": 1 }, "audios": { "total": 12, "transcribed_validated": 12, "transcribed_pending": 0, "not_transcribed": 0 }, "transcriptions": { "total": 12, "validated": 12, "not_yet_validated": 0, "edited": 2, "edited_pct": 16.67 }, "duration": { "total_ms": 110988, "total_hrs": 0.03, "validated_ms": 110988, "validated_hrs": 0.03, "avg_ms": 9249, "min_ms": 1980, "max_ms": 24480, "avg_chars_per_sec": 8.9 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 0.9167, "fourties": 0.0833, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.1667, "male_masculine": 0.8333, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "americacentral": 0, "andino": 0, "canario": 0, "caribe": 0, "centrosurpeninsular": 0, "chileno": 0, "filipinas": 0, "mexicano": 0, "nortepeninsular": 1, "rioplatense": 11, "surpeninsular": 0 }, "variant": { "": 12 } }, "reported": { "clips": 0, "reasons": {} }, "size": 656518, "checksum": "d7bc8a338b19239861905e8e028190bcf35fcb706b36387f2a950157e9b61686", "generated_at": "2026-03-10T10:40:48.732Z" }, "fr": { "locale": "fr", "clips": 345, "users": 36, "questions": { "total": 147, "validated": 46, "invalidated": 3, "other": 98, "has_audio": 129, "avg_recordings_per_question": 2.67 }, "audios": { "total": 345, "transcribed_validated": 151, "transcribed_pending": 0, "not_transcribed": 194 }, "transcriptions": { "total": 151, "validated": 151, "not_yet_validated": 0, "edited": 34, "edited_pct": 22.52 }, "duration": { 
"total_ms": 4789044, "total_hrs": 1.33, "validated_ms": 2149884, "validated_hrs": 0.59, "avg_ms": 13881, "min_ms": 288, "max_ms": 90180, "avg_chars_per_sec": 8.98 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.0232, "teens": 0.0029, "twenties": 0.2377, "thirties": 0.1797, "fourties": 0.0319, "fifties": 0.0986, "sixties": 0.0261, "seventies": 0.4, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2609, "male_masculine": 0.4377, "female_feminine": 0.258, "intersex": 0, "transgender": 0.0145, "non-binary": 0.029, "do_not_wish_to_say": 0 }, "accent": { "algeria": 1, "belgium": 0, "benin": 0, "cameroon": 0, "canada": 0, "central_african_republic": 0, "comoros": 0, "congo_kinshasa": 0, "cote_d_ivoire": 0, "cyprus": 0, "djibouti": 0, "equatorial_guinea": 0, "fr-metro-east": 10, "fr-metro-north": 0, "fr-metro-south": 0, "fr-metro-west": 0, "french_guiana": 0, "french_polynesia": 0, "germany": 3, "guadeloupe": 11, "haiti": 0, "italy": 0, "luxembourg": 0, "madagascar": 0, "mali": 0, "martinique": 0, "mauritania": 0, "mauritius": 0, "mayotte": 0, "monaco": 0, "morocco": 0, "netherlands": 0, "new_caledonia": 0, "niger": 0, "reunion": 0, "romania": 0, "senegal": 0, "seychelles": 0, "st_barthelemy": 0, "st_martin": 0, "st_pierre_et_miquelon": 0, "switzerland": 0, "syria": 0, "togo": 0, "tunisia": 0, "united_kingdom": 0, "united_states": 0, "vanuatu": 0, "wallis_et_futuna": 0, "user_defined": 1, "": 319 }, "variant": { "fr-asia": 0, "fr-droum": 11, "fr-europe": 0, "fr-metro": 260, "fr-nafrica": 1, "fr-namerica": 0, "fr-safrica": 0, "fr-samerica": 0, "": 73 } }, "reported": { "clips": 13, "reasons": { "different_language": 12, "other": 1 } }, "size": 27023733, "checksum": "bf6ebcb60018f44568b3ec4234e0b61ef9ed7e8f6a877622b94695fc2ce5d4d1", "generated_at": 
"2026-03-10T10:26:58.096Z" }, "fy-NL": { "locale": "fy-NL", "clips": 3, "users": 1, "questions": { "total": 71, "validated": 0, "invalidated": 0, "other": 71, "has_audio": 3, "avg_recordings_per_question": 1 }, "audios": { "total": 3, "transcribed_validated": 0, "transcribed_pending": 0, "not_transcribed": 3 }, "transcriptions": { "total": 0, "validated": 0, "not_yet_validated": 0, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 56196, "total_hrs": 0.01, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 18732, "min_ms": 16956, "max_ms": 21312, "avg_chars_per_sec": 0 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 0, "fourties": 0, "fifties": 1, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0, "male_masculine": 0, "female_feminine": 1, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "user_defined": 3 }, "variant": { "": 3 } }, "reported": { "clips": 0, "reasons": {} }, "size": 330683, "checksum": "41b81872d0d50d9e656240e875e46f54cbde4aef53bb9944088809148bbf61fa", "generated_at": "2026-03-10T10:40:45.341Z" }, "ga-IE": { "locale": "ga-IE", "clips": 36, "users": 4, "questions": { "total": 104, "validated": 0, "invalidated": 0, "other": 104, "has_audio": 32, "avg_recordings_per_question": 1.13 }, "audios": { "total": 36, "transcribed_validated": 4, "transcribed_pending": 4, "not_transcribed": 28 }, "transcriptions": { "total": 8, "validated": 4, "not_yet_validated": 4, "edited": 1, "edited_pct": 12.5 }, "duration": { "total_ms": 558684, "total_hrs": 0.15, "validated_ms": 55152, "validated_hrs": 0.01, "avg_ms": 15519, "min_ms": 3816, "max_ms": 54108, "avg_chars_per_sec": 10.09 }, "buckets": { "train": { "clips": 0, "users": 0, 
"duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 0.0278, "fourties": 0.7222, "fifties": 0.25, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.0556, "male_masculine": 0.7222, "female_feminine": 0.2222, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "cainteoir_dúchais_gaeltacht": 10, "cainteoir_dúchais_lasmuigh_den_ghaeltacht": 0, "cainteoir_líofa_ní_ó_dhúchas": 0, "foghlaimeoir_fásta_ó_bhéarla": 0, "foghlaimeoir_fásta_ó_theanga_eile": 0, "foghlaimeoir_i_ngaelscoil": 0, "": 26 }, "variant": { "ga-IE-caighd": 0, "ga-IE-chonnact": 3, "ga-IE-mumhan": 7, "ga-IE-uladh": 0, "": 26 } }, "reported": { "clips": 0, "reasons": {} }, "size": 3287934, "checksum": "5e8e0aad581272ede03149504e45a0f7e9b63b8b96ee581d9145f36adf14d892", "generated_at": "2026-03-10T10:40:23.533Z" }, "gl": { "locale": "gl", "clips": 238, "users": 15, "questions": { "total": 288, "validated": 19, "invalidated": 0, "other": 269, "has_audio": 166, "avg_recordings_per_question": 1.43 }, "audios": { "total": 238, "transcribed_validated": 85, "transcribed_pending": 151, "not_transcribed": 2 }, "transcriptions": { "total": 236, "validated": 85, "not_yet_validated": 151, "edited": 50, "edited_pct": 21.19 }, "duration": { "total_ms": 4197852, "total_hrs": 1.16, "validated_ms": 1125684, "validated_hrs": 0.31, "avg_ms": 17638, "min_ms": 2376, "max_ms": 85536, "avg_chars_per_sec": 9.18 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.3361, "thirties": 0.0252, "fourties": 0.5966, "fifties": 
0.0378, "sixties": 0.0042, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.0084, "male_masculine": 0.3151, "female_feminine": 0.4496, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.2269 }, "accent": { "atlantico": 42, "central": 6, "neofalante": 30, "normativo": 108, "oriental": 66, "user_defined": 1, "": 1 }, "variant": { "": 238 } }, "reported": { "clips": 0, "reasons": {} }, "size": 24639400, "checksum": "498d576480ef587cc0990f7387123cea133f86d9822257f16f5e715a9f102313", "generated_at": "2026-03-10T10:40:02.292Z" }, "gsw": { "locale": "gsw", "clips": 843, "users": 47, "questions": { "total": 65, "validated": 3, "invalidated": 0, "other": 62, "has_audio": 63, "avg_recordings_per_question": 13.38 }, "audios": { "total": 843, "transcribed_validated": 49, "transcribed_pending": 122, "not_transcribed": 672 }, "transcriptions": { "total": 171, "validated": 49, "not_yet_validated": 122, "edited": 46, "edited_pct": 26.9 }, "duration": { "total_ms": 16778808, "total_hrs": 4.66, "validated_ms": 667548, "validated_hrs": 0.18, "avg_ms": 19904, "min_ms": 468, "max_ms": 141732, "avg_chars_per_sec": 7.42 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.0712, "teens": 0, "twenties": 0.0558, "thirties": 0.0451, "fourties": 0.1507, "fifties": 0.1862, "sixties": 0.3665, "seventies": 0.1246, "eighties": 0, "nineties": 0 }, "gender": { "": 0.6489, "male_masculine": 0, "female_feminine": 0.3333, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.0178 }, "accent": { "user_defined": 32, "": 811 }, "variant": { "gsw-FR-hochalem": 11, "gsw-FR-nordalem": 159, "gsw-FR-rhinfran": 0, "gsw-FR-sudnalem": 3, "gsw-FR-sudrfran": 19, "": 651 } }, "reported": { "clips": 25, "reasons": { "other": 18, 
"offensive_speech": 3, "different_language": 4 } }, "size": 90423508, "checksum": "df3fd61706f7793c967641467e8f620c69f2a336c3bcb2f207069a5f915f7b7d", "generated_at": "2026-03-10T10:39:10.430Z" }, "gv": { "locale": "gv", "clips": 130, "users": 1, "questions": { "total": 250, "validated": 0, "invalidated": 0, "other": 250, "has_audio": 130, "avg_recordings_per_question": 1 }, "audios": { "total": 130, "transcribed_validated": 0, "transcribed_pending": 0, "not_transcribed": 130 }, "transcriptions": { "total": 0, "validated": 0, "not_yet_validated": 0, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 2770956, "total_hrs": 0.76, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 21315, "min_ms": 3780, "max_ms": 50220, "avg_chars_per_sec": 0 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 1, "thirties": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0, "male_masculine": 1, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 130 }, "variant": { "": 130 } }, "reported": { "clips": 0, "reasons": {} }, "size": 16143419, "checksum": "0a4e4175d081d0eb493eed442ff82827b174d7b530c055447542d0fad5757712", "generated_at": "2026-03-10T10:35:17.213Z" }, "hac": { "locale": "hac", "clips": 421, "users": 19, "questions": { "total": 39, "validated": 0, "invalidated": 0, "other": 39, "has_audio": 39, "avg_recordings_per_question": 10.79 }, "audios": { "total": 421, "transcribed_validated": 14, "transcribed_pending": 13, "not_transcribed": 394 }, "transcriptions": { "total": 27, "validated": 14, "not_yet_validated": 13, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 40291704, "total_hrs": 11.19, 
"validated_ms": 1334844, "validated_hrs": 0.37, "avg_ms": 95705, "min_ms": 1296, "max_ms": 680256, "avg_chars_per_sec": 6.29 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.2138, "teens": 0, "twenties": 0, "thirties": 0.0926, "fourties": 0.4584, "fifties": 0.1425, "sixties": 0, "seventies": 0.0926, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2922, "male_masculine": 0.1568, "female_feminine": 0.5511, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 421 }, "variant": { "": 421 } }, "reported": { "clips": 0, "reasons": {} }, "size": 236288634, "checksum": "35bef7f0231b3e8be57aa371274df8170bfde340c8db352ea1d8e4144233d6bb", "generated_at": "2026-03-10T10:34:52.611Z" }, "hch": { "locale": "hch", "clips": 1553, "users": 10, "questions": { "total": 236, "validated": 0, "invalidated": 0, "other": 236, "has_audio": 236, "avg_recordings_per_question": 6.58 }, "audios": { "total": 1553, "transcribed_validated": 1549, "transcribed_pending": 0, "not_transcribed": 4 }, "transcriptions": { "total": 1549, "validated": 1549, "not_yet_validated": 0, "edited": 1524, "edited_pct": 98.39 }, "duration": { "total_ms": 36508932, "total_hrs": 10.14, "validated_ms": 36443448, "validated_hrs": 10.12, "avg_ms": 23509, "min_ms": 72, "max_ms": 114300, "avg_chars_per_sec": 7.68 }, "buckets": { "train": { "clips": 960, "users": 7, "duration_ms": 23137632, "duration_hrs": 6.42 }, "dev": { "clips": 355, "users": 2, "duration_ms": 5656320, "duration_hrs": 1.57 }, "test": { "clips": 234, "users": 1, "duration_ms": 7649496, "duration_hrs": 2.12 } }, "demographics": { "age": { "": 0.0064, "teens": 0, "twenties": 0.1442, "thirties": 0.3039, "fourties": 0, "fifties": 0.2415, "sixties": 0.3039, "seventies": 0, "eighties": 0, "nineties": 0 
}, "gender": { "": 0.3999, "male_masculine": 0, "female_feminine": 0.6001, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 1553 }, "variant": { "": 1553 } }, "reported": { "clips": 0, "reasons": {} }, "size": 209212158, "checksum": "6885a279a8b6c508ddace32db5f5472f867790b74c7e662f95d8c3cc22f9cb1e", "generated_at": "2026-03-10T10:27:36.396Z" }, "ka": { "locale": "ka", "clips": 118, "users": 7, "questions": { "total": 79, "validated": 0, "invalidated": 0, "other": 79, "has_audio": 58, "avg_recordings_per_question": 2.03 }, "audios": { "total": 118, "transcribed_validated": 60, "transcribed_pending": 5, "not_transcribed": 53 }, "transcriptions": { "total": 65, "validated": 60, "not_yet_validated": 5, "edited": 10, "edited_pct": 15.38 }, "duration": { "total_ms": 2118204, "total_hrs": 0.58, "validated_ms": 512712, "validated_hrs": 0.14, "avg_ms": 17951, "min_ms": 3060, "max_ms": 107748, "avg_chars_per_sec": 9.49 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.1356, "thirties": 0.339, "fourties": 0.4492, "fifties": 0, "sixties": 0.0763, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0, "male_masculine": 0.0508, "female_feminine": 0.9492, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 118 }, "variant": { "": 118 } }, "reported": { "clips": 0, "reasons": {} }, "size": 12174033, "checksum": "c99215c8ed9059488950e645a827f767fba6490786d3d6d31aa93ce1676f6a9c", "generated_at": "2026-03-10T10:39:00.903Z" }, "kbd": { "locale": "kbd", "clips": 1210, "users": 37, "questions": { "total": 155, "validated": 0, "invalidated": 0, "other": 155, "has_audio": 154, "avg_recordings_per_question": 7.86 }, "audios": { "total": 1210, 
"transcribed_validated": 793, "transcribed_pending": 0, "not_transcribed": 417 }, "transcriptions": { "total": 793, "validated": 793, "not_yet_validated": 0, "edited": 244, "edited_pct": 30.77 }, "duration": { "total_ms": 43729992, "total_hrs": 12.14, "validated_ms": 29586564, "validated_hrs": 8.21, "avg_ms": 36140, "min_ms": 1908, "max_ms": 307440, "avg_chars_per_sec": 8.86 }, "buckets": { "train": { "clips": 488, "users": 13, "duration_ms": 17948664, "duration_hrs": 4.98 }, "dev": { "clips": 134, "users": 7, "duration_ms": 5407488, "duration_hrs": 1.5 }, "test": { "clips": 171, "users": 6, "duration_ms": 6230412, "duration_hrs": 1.73 } }, "demographics": { "age": { "": 0.0471, "teens": 0.0041, "twenties": 0.1471, "thirties": 0.2289, "fourties": 0.2694, "fifties": 0.2818, "sixties": 0.0215, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.1149, "male_masculine": 0.5537, "female_feminine": 0.3314, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "bakhsan": 29, "besleney": 0, "kuban": 0, "kuban_zelenchuk": 20, "malka": 1, "mozdok": 0, "terek": 63, "user_defined": 26, "": 1072 }, "variant": { "kbd-Cyrl": 356, "kbd-Cyrl-JOR": 0, "kbd-Cyrl-SY": 0, "kbd-Cyrl-TR": 26, "kbd-Latn-TR-t-kbd-cyrl": 0, "kbd-RU": 119, "": 709 } }, "reported": { "clips": 35, "reasons": { "different_language": 1, "other": 34 } }, "size": 172429844, "checksum": "795534d686475115d327f518a742bcd09e1e323a6f7aaa2d70f12a5fc60bad45", "generated_at": "2026-03-10T10:35:42.446Z" }, "kcn": { "locale": "kcn", "clips": 2715, "users": 26, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 22.63 }, "audios": { "total": 2715, "transcribed_validated": 1894, "transcribed_pending": 8, "not_transcribed": 813 }, "transcriptions": { "total": 1902, "validated": 1894, "not_yet_validated": 8, "edited": 355, "edited_pct": 18.66 }, "duration": { "total_ms": 50285700, "total_hrs": 13.96, 
"validated_ms": 34896636, "validated_hrs": 9.69, "avg_ms": 18521, "min_ms": 108, "max_ms": 104436, "avg_chars_per_sec": 6.73 }, "buckets": { "train": { "clips": 1273, "users": 18, "duration_ms": 22732056, "duration_hrs": 6.31 }, "dev": { "clips": 297, "users": 4, "duration_ms": 5900004, "duration_hrs": 1.63 }, "test": { "clips": 324, "users": 4, "duration_ms": 6264576, "duration_hrs": 1.74 } }, "demographics": { "age": { "": 0.1738, "teens": 0, "twenties": 0.0431, "thirties": 0.2431, "fourties": 0.1271, "fifties": 0.2936, "sixties": 0.0773, "seventies": 0, "eighties": 0.042, "nineties": 0 }, "gender": { "": 0.1738, "male_masculine": 0.54, "female_feminine": 0.2862, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 2715 }, "variant": { "": 2715 } }, "reported": { "clips": 284, "reasons": { "other": 261, "personally_identifiable_information": 6, "offensive_speech": 5, "different_language": 12 } }, "size": 316936092, "checksum": "c3169227b39082a30e20878d7f7ce7ae62663200137032d44308b60ea8916c0e", "generated_at": "2026-03-10T10:30:33.708Z" }, "koo": { "locale": "koo", "clips": 3233, "users": 28, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 26.94 }, "audios": { "total": 3233, "transcribed_validated": 2211, "transcribed_pending": 10, "not_transcribed": 1012 }, "transcriptions": { "total": 2221, "validated": 2211, "not_yet_validated": 10, "edited": 172, "edited_pct": 7.74 }, "duration": { "total_ms": 53618940, "total_hrs": 14.89, "validated_ms": 36472068, "validated_hrs": 10.13, "avg_ms": 16585, "min_ms": 288, "max_ms": 112968, "avg_chars_per_sec": 7.42 }, "buckets": { "train": { "clips": 1470, "users": 18, "duration_ms": 25086276, "duration_hrs": 6.96 }, "dev": { "clips": 406, "users": 5, "duration_ms": 5415588, "duration_hrs": 1.5 }, "test": { "clips": 335, "users": 4, "duration_ms": 5970204, "duration_hrs": 1.65 } }, "demographics": { "age": { "": 
0.0371, "teens": 0, "twenties": 0.2221, "thirties": 0.5193, "fourties": 0.1847, "fifties": 0.0368, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3699, "male_masculine": 0.2595, "female_feminine": 0.3706, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 3233 }, "variant": { "": 3233 } }, "reported": { "clips": 1, "reasons": { "other": 1 } }, "size": 294397664, "checksum": "757707603f9e3203261129ccbf247d7cc59c129fb168f05b38ae42eae9003b6b", "generated_at": "2026-03-10T10:36:45.785Z" }, "kzi": { "locale": "kzi", "clips": 1717, "users": 21, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 14.31 }, "audios": { "total": 1717, "transcribed_validated": 59, "transcribed_pending": 1658, "not_transcribed": 0 }, "transcriptions": { "total": 1717, "validated": 59, "not_yet_validated": 1658, "edited": 126, "edited_pct": 7.34 }, "duration": { "total_ms": 34264404, "total_hrs": 9.51, "validated_ms": 812340, "validated_hrs": 0.22, "avg_ms": 19956, "min_ms": 720, "max_ms": 69876, "avg_chars_per_sec": 7.62 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.2656, "teens": 0, "twenties": 0, "thirties": 0, "fourties": 0, "fifties": 0.2097, "sixties": 0.3069, "seventies": 0.2178, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2656, "male_masculine": 0.3722, "female_feminine": 0.3623, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 1717 }, "variant": { "": 1717 } }, "reported": { "clips": 0, "reasons": {} }, "size": 203771740, "checksum": "02f2b2fdf68f940aaadc516b28fca449b9ea6c2572288f37587980d5314b972c", "generated_at": "2026-03-10T10:24:38.238Z" }, "led": { "locale": "led", 
"clips": 2851, "users": 26, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 23.76 }, "audios": { "total": 2851, "transcribed_validated": 1897, "transcribed_pending": 30, "not_transcribed": 924 }, "transcriptions": { "total": 1927, "validated": 1897, "not_yet_validated": 30, "edited": 425, "edited_pct": 22.06 }, "duration": { "total_ms": 54529056, "total_hrs": 15.14, "validated_ms": 36313488, "validated_hrs": 10.08, "avg_ms": 19126, "min_ms": 720, "max_ms": 54036, "avg_chars_per_sec": 6.21 }, "buckets": { "train": { "clips": 1211, "users": 15, "duration_ms": 24216120, "duration_hrs": 6.72 }, "dev": { "clips": 412, "users": 5, "duration_ms": 6677172, "duration_hrs": 1.85 }, "test": { "clips": 274, "users": 3, "duration_ms": 5420196, "duration_hrs": 1.5 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.0972, "thirties": 0.483, "fourties": 0.2525, "fifties": 0.0421, "sixties": 0.1252, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3076, "male_masculine": 0.524, "female_feminine": 0.1684, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 2851 }, "variant": { "": 2851 } }, "reported": { "clips": 0, "reasons": {} }, "size": 283191629, "checksum": "b0163c35661493e673d9013d8f4afdfa7f96dfe7da55ccfd8328adfe34cfd4de", "generated_at": "2026-03-10T10:39:35.510Z" }, "lij": { "locale": "lij", "clips": 294, "users": 5, "questions": { "total": 169, "validated": 2, "invalidated": 0, "other": 167, "has_audio": 164, "avg_recordings_per_question": 1.79 }, "audios": { "total": 294, "transcribed_validated": 223, "transcribed_pending": 13, "not_transcribed": 58 }, "transcriptions": { "total": 236, "validated": 223, "not_yet_validated": 13, "edited": 57, "edited_pct": 24.15 }, "duration": { "total_ms": 8522964, "total_hrs": 2.36, "validated_ms": 5941584, "validated_hrs": 1.65, "avg_ms": 28990, "min_ms": 6408, "max_ms": 124920, 
"avg_chars_per_sec": 6.37 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 0.2551, "fourties": 0, "fifties": 0, "sixties": 0.2143, "seventies": 0.5306, "eighties": 0, "nineties": 0 }, "gender": { "": 0.5306, "male_masculine": 0.2551, "female_feminine": 0.2143, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "user_defined": 2, "": 292 }, "variant": { "": 294 } }, "reported": { "clips": 0, "reasons": {} }, "size": 50705108, "checksum": "c223ee0d10cf30c35f319d1e84f1b603749f3d6f48474bcc102cf7f303aec590", "generated_at": "2026-03-10T10:37:10.223Z" }, "lke": { "locale": "lke", "clips": 2743, "users": 26, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 22.86 }, "audios": { "total": 2743, "transcribed_validated": 2269, "transcribed_pending": 11, "not_transcribed": 463 }, "transcriptions": { "total": 2280, "validated": 2269, "not_yet_validated": 11, "edited": 879, "edited_pct": 38.55 }, "duration": { "total_ms": 44597196, "total_hrs": 12.38, "validated_ms": 35831160, "validated_hrs": 9.95, "avg_ms": 16259, "min_ms": 180, "max_ms": 47736, "avg_chars_per_sec": 7.55 }, "buckets": { "train": { "clips": 1516, "users": 18, "duration_ms": 23093208, "duration_hrs": 6.41 }, "dev": { "clips": 390, "users": 4, "duration_ms": 5780268, "duration_hrs": 1.6 }, "test": { "clips": 363, "users": 4, "duration_ms": 6957684, "duration_hrs": 1.93 } }, "demographics": { "age": { "": 0.4342, "teens": 0.031, "twenties": 0.0824, "thirties": 0.214, "fourties": 0.1659, "fifties": 0.0725, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.4342, "male_masculine": 0.4834, "female_feminine": 0.0824, 
"intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 2743 }, "variant": { "": 2743 } }, "reported": { "clips": 32, "reasons": { "different_language": 18, "other": 13, "personally_identifiable_information": 1 } }, "size": 266926215, "checksum": "fde9388c9f97c46081e7b92bbececb873166cb3be4de766fa1ce63f34d87bbab", "generated_at": "2026-03-10T10:29:59.838Z" }, "lth": { "locale": "lth", "clips": 3126, "users": 29, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 26.05 }, "audios": { "total": 3126, "transcribed_validated": 1809, "transcribed_pending": 2, "not_transcribed": 1315 }, "transcriptions": { "total": 1811, "validated": 1809, "not_yet_validated": 2, "edited": 855, "edited_pct": 47.21 }, "duration": { "total_ms": 55331244, "total_hrs": 15.36, "validated_ms": 32886324, "validated_hrs": 9.13, "avg_ms": 17700, "min_ms": 1620, "max_ms": 1324080, "avg_chars_per_sec": 7.35 }, "buckets": { "train": { "clips": 1315, "users": 21, "duration_ms": 20581236, "duration_hrs": 5.71 }, "dev": { "clips": 264, "users": 4, "duration_ms": 5490288, "duration_hrs": 1.52 }, "test": { "clips": 230, "users": 4, "duration_ms": 6814800, "duration_hrs": 1.89 } }, "demographics": { "age": { "": 0.247, "teens": 0, "twenties": 0, "thirties": 0.1942, "fourties": 0.1897, "fifties": 0.1891, "sixties": 0.1801, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3986, "male_masculine": 0.23, "female_feminine": 0.3714, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 3126 }, "variant": { "": 3126 } }, "reported": { "clips": 1170, "reasons": { "other": 510, "different_language": 516, "personally_identifiable_information": 142, "offensive_speech": 2 } }, "size": 334166158, "checksum": "30b8d20e4d5ee60a1823d8895f3753ac9cd9ed0c3c1332300d7c6391520da58a", "generated_at": "2026-03-10T10:31:59.633Z" }, "lv": { "locale": "lv", "clips": 23, 
"users": 6, "questions": { "total": 101, "validated": 0, "invalidated": 0, "other": 101, "has_audio": 22, "avg_recordings_per_question": 1.05 }, "audios": { "total": 23, "transcribed_validated": 4, "transcribed_pending": 2, "not_transcribed": 17 }, "transcriptions": { "total": 6, "validated": 4, "not_yet_validated": 2, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 901584, "total_hrs": 0.25, "validated_ms": 127548, "validated_hrs": 0.03, "avg_ms": 39199, "min_ms": 6516, "max_ms": 86796, "avg_chars_per_sec": 9.01 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0.087, "twenties": 0, "thirties": 0.7826, "fourties": 0.087, "fifties": 0.0435, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.087, "male_masculine": 0.1304, "female_feminine": 0.7826, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "user_defined": 4, "": 19 }, "variant": { "": 23 } }, "reported": { "clips": 0, "reasons": {} }, "size": 5368121, "checksum": "024ad144c7e044ca6eec7e5670fd0b1d9291dad2ecbf5bb9090eaf5d56636f6b", "generated_at": "2026-03-10T10:37:46.392Z" }, "meh": { "locale": "meh", "clips": 1057, "users": 16, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 8.81 }, "audios": { "total": 1057, "transcribed_validated": 1048, "transcribed_pending": 0, "not_transcribed": 9 }, "transcriptions": { "total": 1048, "validated": 1048, "not_yet_validated": 0, "edited": 328, "edited_pct": 31.3 }, "duration": { "total_ms": 36567144, "total_hrs": 10.15, "validated_ms": 36197496, "validated_hrs": 10.05, "avg_ms": 34595, "min_ms": 3924, "max_ms": 234864, "avg_chars_per_sec": 6.89 }, "buckets": { "train": { "clips": 633, "users": 
9, "duration_ms": 22695012, "duration_hrs": 6.3 }, "dev": { "clips": 241, "users": 4, "duration_ms": 6798384, "duration_hrs": 1.88 }, "test": { "clips": 174, "users": 3, "duration_ms": 6704100, "duration_hrs": 1.86 } }, "demographics": { "age": { "": 0.1987, "teens": 0, "twenties": 0.1627, "thirties": 0.1079, "fourties": 0.1135, "fifties": 0, "sixties": 0.3264, "seventies": 0.0908, "eighties": 0, "nineties": 0 }, "gender": { "": 0.193, "male_masculine": 0.0057, "female_feminine": 0.8013, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 1057 }, "variant": { "": 1057 } }, "reported": { "clips": 1, "reasons": { "other": 1 } }, "size": 212178342, "checksum": "3c3880ed2fcfbb74545fe3e13ddd8787681af2ab8c33a1f11a1cdfddd941ded4", "generated_at": "2026-03-10T10:27:10.720Z" }, "mel": { "locale": "mel", "clips": 1624, "users": 35, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 13.53 }, "audios": { "total": 1624, "transcribed_validated": 28, "transcribed_pending": 1554, "not_transcribed": 42 }, "transcriptions": { "total": 1582, "validated": 28, "not_yet_validated": 1554, "edited": 13, "edited_pct": 0.82 }, "duration": { "total_ms": 37180800, "total_hrs": 10.32, "validated_ms": 562824, "validated_hrs": 0.15, "avg_ms": 22895, "min_ms": 756, "max_ms": 93348, "avg_chars_per_sec": 7.52 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.9803, "teens": 0, "twenties": 0, "thirties": 0, "fourties": 0.0197, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.9803, "male_masculine": 0, "female_feminine": 0.0197, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 1624 }, 
"variant": { "": 1624 } }, "reported": { "clips": 0, "reasons": {} }, "size": 219265416, "checksum": "8d9f382b992e273fa03e33dd2d91db774b05f8b1b6695ad26cafaa5d13239c5c", "generated_at": "2026-03-10T10:23:44.690Z" }, "mmc": { "locale": "mmc", "clips": 871, "users": 12, "questions": { "total": 195, "validated": 0, "invalidated": 0, "other": 195, "has_audio": 195, "avg_recordings_per_question": 4.47 }, "audios": { "total": 871, "transcribed_validated": 871, "transcribed_pending": 0, "not_transcribed": 0 }, "transcriptions": { "total": 871, "validated": 871, "not_yet_validated": 0, "edited": 305, "edited_pct": 35.02 }, "duration": { "total_ms": 40443984, "total_hrs": 11.23, "validated_ms": 40443984, "validated_hrs": 11.23, "avg_ms": 46434, "min_ms": 3348, "max_ms": 332820, "avg_chars_per_sec": 5.27 }, "buckets": { "train": { "clips": 405, "users": 7, "duration_ms": 13599252, "duration_hrs": 3.77 }, "dev": { "clips": 218, "users": 2, "duration_ms": 14436576, "duration_hrs": 4.01 }, "test": { "clips": 248, "users": 3, "duration_ms": 12408156, "duration_hrs": 3.44 } }, "demographics": { "age": { "": 0.07, "teens": 0, "twenties": 0.5029, "thirties": 0, "fourties": 0, "fifties": 0.2227, "sixties": 0.2044, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3226, "male_masculine": 0, "female_feminine": 0.6774, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 871 }, "variant": { "": 871 } }, "reported": { "clips": 0, "reasons": {} }, "size": 237402720, "checksum": "e8a4c32ddd188c0bc322e1ef0812921063b8fc65cea36f3476a11f1976289f72", "generated_at": "2026-03-10T10:28:52.240Z" }, "ms-MY": { "locale": "ms-MY", "clips": 1147, "users": 23, "questions": { "total": 119, "validated": 0, "invalidated": 0, "other": 119, "has_audio": 119, "avg_recordings_per_question": 9.64 }, "audios": { "total": 1147, "transcribed_validated": 1054, "transcribed_pending": 80, "not_transcribed": 13 }, "transcriptions": { "total": 1134, "validated": 
1054, "not_yet_validated": 80, "edited": 94, "edited_pct": 8.29 }, "duration": { "total_ms": 22281516, "total_hrs": 6.18, "validated_ms": 20302020, "validated_hrs": 5.63, "avg_ms": 19426, "min_ms": 1800, "max_ms": 208476, "avg_chars_per_sec": 8.59 }, "buckets": { "train": { "clips": 498, "users": 7, "duration_ms": 8224596, "duration_hrs": 2.28 }, "dev": { "clips": 332, "users": 8, "duration_ms": 5972724, "duration_hrs": 1.65 }, "test": { "clips": 224, "users": 5, "duration_ms": 6104700, "duration_hrs": 1.69 } }, "demographics": { "age": { "": 0.3575, "teens": 0, "twenties": 0.1133, "thirties": 0.2642, "fourties": 0.1395, "fifties": 0, "sixties": 0.0663, "seventies": 0.0593, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3548, "male_masculine": 0.204, "female_feminine": 0.4333, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.0078 }, "accent": { "": 1147 }, "variant": { "": 1147 } }, "reported": { "clips": 1, "reasons": { "offensive_speech": 1 } }, "size": 132575839, "checksum": "407b028ff272b3533775c8b3b16d3104f0a92fa2dd53a19d6221db6f50c4ddde", "generated_at": "2026-03-10T10:22:57.135Z" }, "msi": { "locale": "msi", "clips": 2277, "users": 33, "questions": { "total": 119, "validated": 0, "invalidated": 0, "other": 119, "has_audio": 119, "avg_recordings_per_question": 19.13 }, "audios": { "total": 2277, "transcribed_validated": 4, "transcribed_pending": 2273, "not_transcribed": 0 }, "transcriptions": { "total": 2277, "validated": 4, "not_yet_validated": 2273, "edited": 3, "edited_pct": 0.13 }, "duration": { "total_ms": 48764700, "total_hrs": 13.54, "validated_ms": 70020, "validated_hrs": 0.01, "avg_ms": 21416, "min_ms": 864, "max_ms": 89316, "avg_chars_per_sec": 8.5 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.7686, 
"teens": 0, "twenties": 0.0654, "thirties": 0, "fourties": 0.0733, "fifties": 0.0487, "sixties": 0.0439, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.7686, "male_masculine": 0.061, "female_feminine": 0.1704, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 2277 }, "variant": { "": 2277 } }, "reported": { "clips": 45, "reasons": { "other": 39, "different_language": 6 } }, "size": 291439315, "checksum": "7dc243987f1b19adfe195f72c0721521eca832d6fe1e902283ed768d29cb5ff2", "generated_at": "2026-03-10T10:25:07.241Z" }, "pne": { "locale": "pne", "clips": 2595, "users": 24, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 21.63 }, "audios": { "total": 2595, "transcribed_validated": 2594, "transcribed_pending": 0, "not_transcribed": 1 }, "transcriptions": { "total": 2594, "validated": 2594, "not_yet_validated": 0, "edited": 424, "edited_pct": 16.35 }, "duration": { "total_ms": 43702704, "total_hrs": 12.13, "validated_ms": 43675992, "validated_hrs": 12.13, "avg_ms": 16841, "min_ms": 1800, "max_ms": 36324, "avg_chars_per_sec": 8.44 }, "buckets": { "train": { "clips": 1918, "users": 18, "duration_ms": 31906008, "duration_hrs": 8.86 }, "dev": { "clips": 360, "users": 3, "duration_ms": 5661972, "duration_hrs": 1.57 }, "test": { "clips": 316, "users": 3, "duration_ms": 6108012, "duration_hrs": 1.69 } }, "demographics": { "age": { "": 0.3083, "teens": 0.185, "twenties": 0.2593, "thirties": 0.1272, "fourties": 0.074, "fifties": 0.0462, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3083, "male_masculine": 0.3048, "female_feminine": 0.3869, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 2595 }, "variant": { "": 2595 } }, "reported": { "clips": 1, "reasons": { "other": 1 } }, "size": 260113564, "checksum": 
"ce74972035d8326b4a0d64708a9f4e8490f58b420d8aea02d184bf0adb354212", "generated_at": "2026-03-10T10:26:36.953Z" }, "qxp": { "locale": "qxp", "clips": 2208, "users": 26, "questions": { "total": 152, "validated": 109, "invalidated": 1, "other": 42, "has_audio": 150, "avg_recordings_per_question": 14.72 }, "audios": { "total": 2208, "transcribed_validated": 1067, "transcribed_pending": 21, "not_transcribed": 1120 }, "transcriptions": { "total": 1088, "validated": 1067, "not_yet_validated": 21, "edited": 19, "edited_pct": 1.75 }, "duration": { "total_ms": 33818832, "total_hrs": 9.39, "validated_ms": 18602136, "validated_hrs": 5.16, "avg_ms": 15317, "min_ms": 1620, "max_ms": 83088, "avg_chars_per_sec": 8.2 }, "buckets": { "train": { "clips": 317, "users": 3, "duration_ms": 6199236, "duration_hrs": 1.72 }, "dev": { "clips": 405, "users": 6, "duration_ms": 6604020, "duration_hrs": 1.83 }, "test": { "clips": 345, "users": 4, "duration_ms": 5798880, "duration_hrs": 1.61 } }, "demographics": { "age": { "": 0, "teens": 0.096, "twenties": 0.5199, "thirties": 0.2332, "fourties": 0.0824, "fifties": 0.0376, "sixties": 0.0308, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.337, "male_masculine": 0.1929, "female_feminine": 0.4701, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "user_defined": 919, "": 1289 }, "variant": { "": 2208 } }, "reported": { "clips": 3, "reasons": { "other": 2, "different_language": 1 } }, "size": 188194421, "checksum": "efad07419fb4e73c2a96c4162df315ecefe46d2d26c846df72bd89c31655c1bf", "generated_at": "2026-03-10T10:33:58.831Z" }, "ru": { "locale": "ru", "clips": 426, "users": 12, "questions": { "total": 656, "validated": 587, "invalidated": 1, "other": 68, "has_audio": 274, "avg_recordings_per_question": 1.55 }, "audios": { "total": 426, "transcribed_validated": 393, "transcribed_pending": 26, "not_transcribed": 7 }, "transcriptions": { "total": 419, "validated": 393, "not_yet_validated": 26, 
"edited": 64, "edited_pct": 15.27 }, "duration": { "total_ms": 9062784, "total_hrs": 2.51, "validated_ms": 7943832, "validated_hrs": 2.2, "avg_ms": 21274, "min_ms": 2448, "max_ms": 94608, "avg_chars_per_sec": 9.61 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 127, "users": 5, "duration_ms": 2535984, "duration_hrs": 0.7 }, "test": { "clips": 266, "users": 6, "duration_ms": 5407848, "duration_hrs": 1.5 } }, "demographics": { "age": { "": 0.0023, "teens": 0.0023, "twenties": 0.939, "thirties": 0.0563, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.7934, "male_masculine": 0.1127, "female_feminine": 0.0939, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "user_defined": 206, "": 237 }, "variant": { "": 426 } }, "reported": { "clips": 12, "reasons": { "different_language": 10, "offensive_speech": 1, "other": 1 } }, "size": 53113790, "checksum": "915a73442a0f657cefa630968442981b7af4df27a2e578ca73772ae57d057231", "generated_at": "2026-03-10T10:40:34.917Z" }, "ruc": { "locale": "ruc", "clips": 2822, "users": 26, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 23.52 }, "audios": { "total": 2822, "transcribed_validated": 1659, "transcribed_pending": 0, "not_transcribed": 1163 }, "transcriptions": { "total": 1659, "validated": 1659, "not_yet_validated": 0, "edited": 181, "edited_pct": 10.91 }, "duration": { "total_ms": 63901116, "total_hrs": 17.75, "validated_ms": 36522720, "validated_hrs": 10.14, "avg_ms": 22644, "min_ms": 720, "max_ms": 130536, "avg_chars_per_sec": 8.65 }, "buckets": { "train": { "clips": 1044, "users": 16, "duration_ms": 24276636, "duration_hrs": 6.74 }, "dev": { "clips": 339, "users": 5, "duration_ms": 6304788, "duration_hrs": 1.75 }, "test": { "clips": 276, "users": 4, "duration_ms": 5941296, "duration_hrs": 1.65 } 
}, "demographics": { "age": { "": 0.5152, "teens": 0, "twenties": 0.0422, "thirties": 0.2807, "fourties": 0.0801, "fifties": 0, "sixties": 0.0404, "seventies": 0.0415, "eighties": 0, "nineties": 0 }, "gender": { "": 0.5152, "male_masculine": 0.404, "female_feminine": 0.0808, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 2822 }, "variant": { "": 2822 } }, "reported": { "clips": 59, "reasons": { "other": 57, "personally_identifiable_information": 2 } }, "size": 383722665, "checksum": "f387d213c923318809a6418dd68b66a9af7dbbf0ba9509a611a1f0abf5f2208c", "generated_at": "2026-03-10T10:31:14.860Z" }, "rwm": { "locale": "rwm", "clips": 2427, "users": 21, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 20.23 }, "audios": { "total": 2427, "transcribed_validated": 1801, "transcribed_pending": 51, "not_transcribed": 575 }, "transcriptions": { "total": 1852, "validated": 1801, "not_yet_validated": 51, "edited": 444, "edited_pct": 23.97 }, "duration": { "total_ms": 49025628, "total_hrs": 13.61, "validated_ms": 36494424, "validated_hrs": 10.13, "avg_ms": 20200, "min_ms": 216, "max_ms": 94428, "avg_chars_per_sec": 7.66 }, "buckets": { "train": { "clips": 1178, "users": 13, "duration_ms": 24755148, "duration_hrs": 6.87 }, "dev": { "clips": 326, "users": 4, "duration_ms": 5933196, "duration_hrs": 1.64 }, "test": { "clips": 297, "users": 3, "duration_ms": 5806080, "duration_hrs": 1.61 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.0466, "thirties": 0.2299, "fourties": 0.1887, "fifties": 0.2909, "sixties": 0.0968, "seventies": 0.1471, "eighties": 0, "nineties": 0 }, "gender": { "": 0.1846, "male_masculine": 0.8154, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 2427 }, "variant": { "": 2427 } }, "reported": { "clips": 57, "reasons": { "different_language": 14, 
"personally_identifiable_information": 2, "offensive_speech": 1, "other": 40 } }, "size": 279547441, "checksum": "e27ac69db4a9361b046856054fc10c6894979c3ba517d1141bf76086d4f6576c", "generated_at": "2026-03-10T10:38:01.996Z" }, "sco": { "locale": "sco", "clips": 715, "users": 21, "questions": { "total": 47, "validated": 0, "invalidated": 0, "other": 47, "has_audio": 47, "avg_recordings_per_question": 15.21 }, "audios": { "total": 715, "transcribed_validated": 680, "transcribed_pending": 0, "not_transcribed": 35 }, "transcriptions": { "total": 680, "validated": 680, "not_yet_validated": 0, "edited": 190, "edited_pct": 27.94 }, "duration": { "total_ms": 40234608, "total_hrs": 11.17, "validated_ms": 38478564, "validated_hrs": 10.68, "avg_ms": 56272, "min_ms": 576, "max_ms": 298656, "avg_chars_per_sec": 10.78 }, "buckets": { "train": { "clips": 416, "users": 14, "duration_ms": 26790444, "duration_hrs": 7.44 }, "dev": { "clips": 141, "users": 4, "duration_ms": 5500656, "duration_hrs": 1.52 }, "test": { "clips": 123, "users": 3, "duration_ms": 6187464, "duration_hrs": 1.71 } }, "demographics": { "age": { "": 0.2867, "teens": 0, "twenties": 0.0587, "thirties": 0.1944, "fourties": 0.0406, "fifties": 0.1664, "sixties": 0.1902, "seventies": 0.0629, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2867, "male_masculine": 0.2881, "female_feminine": 0.386, "intersex": 0, "transgender": 0, "non-binary": 0.0392, "do_not_wish_to_say": 0 }, "accent": { "": 715 }, "variant": { "": 715 } }, "reported": { "clips": 0, "reasons": {} }, "size": 239568183, "checksum": "dcbe448a86637ac30a3456312edf2fb6df8bc176d8ac41a4d8f28aa157e14f62", "generated_at": "2026-03-10T10:33:32.424Z" }, "sdo": { "locale": "sdo", "clips": 1389, "users": 25, "questions": { "total": 119, "validated": 0, "invalidated": 0, "other": 119, "has_audio": 119, "avg_recordings_per_question": 11.67 }, "audios": { "total": 1389, "transcribed_validated": 217, "transcribed_pending": 1172, "not_transcribed": 0 }, 
"transcriptions": { "total": 1389, "validated": 217, "not_yet_validated": 1172, "edited": 493, "edited_pct": 35.49 }, "duration": { "total_ms": 35331228, "total_hrs": 9.81, "validated_ms": 4778640, "validated_hrs": 1.32, "avg_ms": 25436, "min_ms": 1224, "max_ms": 120924, "avg_chars_per_sec": 7.62 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.9323, "teens": 0, "twenties": 0.0194, "thirties": 0, "fourties": 0, "fifties": 0, "sixties": 0.0482, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.9323, "male_masculine": 0.0677, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 1389 }, "variant": { "": 1389 } }, "reported": { "clips": 9, "reasons": { "different_language": 2, "other": 7 } }, "size": 211038973, "checksum": "188f01511311f05a7604fc8e2441a00a39fa9488d7d02d178d99396a92c9ba20", "generated_at": "2026-03-10T10:23:18.235Z" }, "seh": { "locale": "seh", "clips": 290, "users": 5, "questions": { "total": 61, "validated": 0, "invalidated": 2, "other": 59, "has_audio": 58, "avg_recordings_per_question": 5 }, "audios": { "total": 290, "transcribed_validated": 1, "transcribed_pending": 2, "not_transcribed": 287 }, "transcriptions": { "total": 3, "validated": 1, "not_yet_validated": 2, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 5832180, "total_hrs": 1.62, "validated_ms": 13248, "validated_hrs": 0, "avg_ms": 20111, "min_ms": 1368, "max_ms": 117828, "avg_chars_per_sec": 8.7 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 
0.2, "thirties": 0.4, "fourties": 0, "fifties": 0.2, "sixties": 0.2, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.6, "male_masculine": 0, "female_feminine": 0.4, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 290 }, "variant": { "": 290 } }, "reported": { "clips": 0, "reasons": {} }, "size": 25941825, "checksum": "cce46904b704a5c27f6504b5de915cb3b8e1d95e5e562089da992df6c84eec71", "generated_at": "2026-03-10T10:40:41.806Z" }, "shi": { "locale": "shi", "clips": 39, "users": 4, "questions": { "total": 60, "validated": 0, "invalidated": 0, "other": 60, "has_audio": 34, "avg_recordings_per_question": 1.15 }, "audios": { "total": 39, "transcribed_validated": 2, "transcribed_pending": 2, "not_transcribed": 35 }, "transcriptions": { "total": 4, "validated": 2, "not_yet_validated": 2, "edited": 1, "edited_pct": 25 }, "duration": { "total_ms": 1151064, "total_hrs": 0.31, "validated_ms": 26496, "validated_hrs": 0, "avg_ms": 29514, "min_ms": 5040, "max_ms": 90540, "avg_chars_per_sec": 7.59 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.8205, "thirties": 0.1795, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.8205, "male_masculine": 0.1795, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 39 }, "variant": { "": 39 } }, "reported": { "clips": 0, "reasons": {} }, "size": 6835393, "checksum": "0cdf30c152fe7f7789d576caf528a10d06f18473501ce2ed81bed95be5dcfcec", "generated_at": "2026-03-10T10:40:50.733Z" }, "sn": { "locale": "sn", "clips": 6, "users": 2, "questions": { "total": 60, "validated": 0, "invalidated": 0, "other": 60, "has_audio": 6, 
"avg_recordings_per_question": 1 }, "audios": { "total": 6, "transcribed_validated": 0, "transcribed_pending": 0, "not_transcribed": 6 }, "transcriptions": { "total": 0, "validated": 0, "not_yet_validated": 0, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 270108, "total_hrs": 0.07, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 45018, "min_ms": 13536, "max_ms": 74880, "avg_chars_per_sec": 0 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.1667, "thirties": 0, "fourties": 0, "fifties": 0.8333, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.1667, "male_masculine": 0, "female_feminine": 0.8333, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 6 }, "variant": { "": 6 } }, "reported": { "clips": 0, "reasons": {} }, "size": 1606987, "checksum": "478a2bfb0534934224bdebdc74a87240b386d15ff887081cec96387e14738eec", "generated_at": "2026-03-10T10:40:47.092Z" }, "snv": { "locale": "snv", "clips": 2036, "users": 30, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 16.97 }, "audios": { "total": 2036, "transcribed_validated": 6, "transcribed_pending": 1101, "not_transcribed": 929 }, "transcriptions": { "total": 1107, "validated": 6, "not_yet_validated": 1101, "edited": 69, "edited_pct": 6.23 }, "duration": { "total_ms": 37699092, "total_hrs": 10.47, "validated_ms": 87876, "validated_hrs": 0.02, "avg_ms": 18516, "min_ms": 1008, "max_ms": 74448, "avg_chars_per_sec": 5.89 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, 
"duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.6179, "teens": 0, "twenties": 0, "thirties": 0.001, "fourties": 0, "fifties": 0.2883, "sixties": 0.0928, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.6179, "male_masculine": 0.1248, "female_feminine": 0.2574, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 2036 }, "variant": { "": 2036 } }, "reported": { "clips": 0, "reasons": {} }, "size": 223733097, "checksum": "76b8662fa4cfdbb6cb7de2b2bc5ef69de89831ecd7a0d7be277682763d95d461", "generated_at": "2026-03-10T10:26:07.526Z" }, "th": { "locale": "th", "clips": 6, "users": 1, "questions": { "total": 61, "validated": 1, "invalidated": 0, "other": 60, "has_audio": 6, "avg_recordings_per_question": 1 }, "audios": { "total": 6, "transcribed_validated": 3, "transcribed_pending": 0, "not_transcribed": 3 }, "transcriptions": { "total": 3, "validated": 3, "not_yet_validated": 0, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 19584, "total_hrs": 0, "validated_ms": 11124, "validated_hrs": 0, "avg_ms": 3264, "min_ms": 2016, "max_ms": 4248, "avg_chars_per_sec": 6.41 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 1, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "user_defined": 6 }, "variant": { "": 6 } }, "reported": { "clips": 0, "reasons": {} }, "size": 91262, "checksum": "e3963ea75d6beb75a9c94170eaebeea8041da154633ee65940bef52a28674d6a", "generated_at": "2026-03-10T10:40:52.932Z" }, "tob": { "locale": "tob", "clips": 1572, 
"users": 25, "questions": { "total": 136, "validated": 0, "invalidated": 0, "other": 136, "has_audio": 136, "avg_recordings_per_question": 11.56 }, "audios": { "total": 1572, "transcribed_validated": 1540, "transcribed_pending": 0, "not_transcribed": 32 }, "transcriptions": { "total": 1540, "validated": 1540, "not_yet_validated": 0, "edited": 540, "edited_pct": 35.06 }, "duration": { "total_ms": 36023976, "total_hrs": 10, "validated_ms": 34841304, "validated_hrs": 9.67, "avg_ms": 22916, "min_ms": 108, "max_ms": 191916, "avg_chars_per_sec": 5.23 }, "buckets": { "train": { "clips": 894, "users": 13, "duration_ms": 23177052, "duration_hrs": 6.43 }, "dev": { "clips": 400, "users": 7, "duration_ms": 5558544, "duration_hrs": 1.54 }, "test": { "clips": 246, "users": 5, "duration_ms": 6105708, "duration_hrs": 1.69 } }, "demographics": { "age": { "": 0.4198, "teens": 0, "twenties": 0, "thirties": 0.0827, "fourties": 0.0884, "fifties": 0.2455, "sixties": 0.1635, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.6654, "male_masculine": 0.1718, "female_feminine": 0.1628, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 1572 }, "variant": { "": 1572 } }, "reported": { "clips": 0, "reasons": {} }, "size": 181812009, "checksum": "c8c21036c7e04c0e3a809056396dceed8a0c4d598ca7b95fcc29ede579a20278", "generated_at": "2026-03-10T10:28:27.993Z" }, "top": { "locale": "top", "clips": 417, "users": 10, "questions": { "total": 150, "validated": 0, "invalidated": 0, "other": 150, "has_audio": 143, "avg_recordings_per_question": 2.92 }, "audios": { "total": 417, "transcribed_validated": 413, "transcribed_pending": 0, "not_transcribed": 4 }, "transcriptions": { "total": 413, "validated": 413, "not_yet_validated": 0, "edited": 159, "edited_pct": 38.5 }, "duration": { "total_ms": 37808280, "total_hrs": 10.5, "validated_ms": 37316664, "validated_hrs": 10.36, "avg_ms": 90667, "min_ms": 5256, "max_ms": 498996, "avg_chars_per_sec": 5.83 }, 
"buckets": { "train": { "clips": 272, "users": 4, "duration_ms": 24670332, "duration_hrs": 6.85 }, "dev": { "clips": 67, "users": 3, "duration_ms": 5938380, "duration_hrs": 1.64 }, "test": { "clips": 74, "users": 3, "duration_ms": 6707952, "duration_hrs": 1.86 } }, "demographics": { "age": { "": 0.0671, "teens": 0, "twenties": 0, "thirties": 0.4269, "fourties": 0.1823, "fifties": 0.048, "sixties": 0.2758, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.247, "male_masculine": 0.3237, "female_feminine": 0.4293, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 417 }, "variant": { "": 417 } }, "reported": { "clips": 0, "reasons": {} }, "size": 216206173, "checksum": "fd17e356a343266e23bffdb926106b187f04063a5134862815a64583046695c2", "generated_at": "2026-03-10T10:29:18.092Z" }, "tr": { "locale": "tr", "clips": 31, "users": 8, "questions": { "total": 140, "validated": 0, "invalidated": 0, "other": 140, "has_audio": 30, "avg_recordings_per_question": 1.03 }, "audios": { "total": 31, "transcribed_validated": 22, "transcribed_pending": 0, "not_transcribed": 9 }, "transcriptions": { "total": 22, "validated": 22, "not_yet_validated": 0, "edited": 8, "edited_pct": 36.36 }, "duration": { "total_ms": 744588, "total_hrs": 0.2, "validated_ms": 537984, "validated_hrs": 0.14, "avg_ms": 24019, "min_ms": 4428, "max_ms": 56628, "avg_chars_per_sec": 8.96 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.0645, "thirties": 0.0968, "fourties": 0, "fifties": 0.0968, "sixties": 0.6774, "seventies": 0.0645, "eighties": 0, "nineties": 0 }, "gender": { "": 0.0645, "male_masculine": 0.7097, "female_feminine": 0.2258, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 
}, "accent": { "user_defined": 22, "": 9 }, "variant": { "": 31 } }, "reported": { "clips": 0, "reasons": {} }, "size": 4421750, "checksum": "aaa966a9c14c5ea09227ac2624452292f2270707806507c93ecc702cd499377f", "generated_at": "2026-03-10T10:34:15.837Z" }, "ttj": { "locale": "ttj", "clips": 3100, "users": 26, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 25.83 }, "audios": { "total": 3100, "transcribed_validated": 1882, "transcribed_pending": 0, "not_transcribed": 1218 }, "transcriptions": { "total": 1882, "validated": 1882, "not_yet_validated": 0, "edited": 783, "edited_pct": 41.6 }, "duration": { "total_ms": 60386832, "total_hrs": 16.77, "validated_ms": 36131184, "validated_hrs": 10.03, "avg_ms": 19480, "min_ms": 3960, "max_ms": 82080, "avg_chars_per_sec": 9.05 }, "buckets": { "train": { "clips": 1233, "users": 15, "duration_ms": 23966496, "duration_hrs": 6.65 }, "dev": { "clips": 286, "users": 3, "duration_ms": 5530608, "duration_hrs": 1.53 }, "test": { "clips": 363, "users": 4, "duration_ms": 6634080, "duration_hrs": 1.84 } }, "demographics": { "age": { "": 0, "teens": 0.0381, "twenties": 0.2316, "thirties": 0.1906, "fourties": 0.3097, "fifties": 0.1913, "sixties": 0.0387, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0, "male_masculine": 0.1935, "female_feminine": 0.8065, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 3100 }, "variant": { "": 3100 } }, "reported": { "clips": 0, "reasons": {} }, "size": 287350913, "checksum": "d4012356ae6216200e30c098ea84f564460310d41cb6c8d26694474a916bb265", "generated_at": "2026-03-10T10:38:36.856Z" }, "ukv": { "locale": "ukv", "clips": 2570, "users": 22, "questions": { "total": 119, "validated": 0, "invalidated": 19, "other": 100, "has_audio": 119, "avg_recordings_per_question": 21.6 }, "audios": { "total": 2570, "transcribed_validated": 2202, "transcribed_pending": 4, 
"not_transcribed": 364 }, "transcriptions": { "total": 2206, "validated": 2202, "not_yet_validated": 4, "edited": 966, "edited_pct": 43.79 }, "duration": { "total_ms": 42464052, "total_hrs": 11.79, "validated_ms": 36133524, "validated_hrs": 10.03, "avg_ms": 16523, "min_ms": 1908, "max_ms": 114768, "avg_chars_per_sec": 6.77 }, "buckets": { "train": { "clips": 1493, "users": 15, "duration_ms": 24758676, "duration_hrs": 6.87 }, "dev": { "clips": 303, "users": 3, "duration_ms": 5554800, "duration_hrs": 1.54 }, "test": { "clips": 406, "users": 4, "duration_ms": 5820048, "duration_hrs": 1.61 } }, "demographics": { "age": { "": 0.4564, "teens": 0, "twenties": 0.1786, "thirties": 0.0891, "fourties": 0.2296, "fifties": 0.0463, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.4564, "male_masculine": 0.1362, "female_feminine": 0.4074, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 2570 }, "variant": { "": 2570 } }, "reported": { "clips": 35, "reasons": { "different_language": 5, "offensive_speech": 1, "other": 29 } }, "size": 249814400, "checksum": "9ea19dd2628982cee0ab0021aa55a1b5cf8087e34b8ce6e75df5ea992cfe51b9", "generated_at": "2026-03-10T10:32:37.829Z" }, "ush": { "locale": "ush", "clips": 664, "users": 10, "questions": { "total": 118, "validated": 0, "invalidated": 0, "other": 118, "has_audio": 118, "avg_recordings_per_question": 5.63 }, "audios": { "total": 664, "transcribed_validated": 644, "transcribed_pending": 7, "not_transcribed": 13 }, "transcriptions": { "total": 651, "validated": 644, "not_yet_validated": 7, "edited": 64, "edited_pct": 9.83 }, "duration": { "total_ms": 18281844, "total_hrs": 5.07, "validated_ms": 17802792, "validated_hrs": 4.94, "avg_ms": 27533, "min_ms": 180, "max_ms": 149760, "avg_chars_per_sec": 11.45 }, "buckets": { "train": { "clips": 130, "users": 2, "duration_ms": 5001192, "duration_hrs": 1.38 }, "dev": { "clips": 206, "users": 5, "duration_ms": 5771844, 
"duration_hrs": 1.6 }, "test": { "clips": 308, "users": 3, "duration_ms": 7029756, "duration_hrs": 1.95 } }, "demographics": { "age": { "": 0.1988, "teens": 0.1747, "twenties": 0.4714, "thirties": 0.003, "fourties": 0.1521, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.9985, "male_masculine": 0.0015, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 664 }, "variant": { "": 664 } }, "reported": { "clips": 1, "reasons": { "other": 1 } }, "size": 108221984, "checksum": "08a9ee5d9e2599d56c1888a7c508bf13743b2ba031da095208661fbc16e2b52d", "generated_at": "2026-03-10T10:29:38.202Z" }, "xkl": { "locale": "xkl", "clips": 1795, "users": 23, "questions": { "total": 120, "validated": 0, "invalidated": 1, "other": 119, "has_audio": 120, "avg_recordings_per_question": 14.96 }, "audios": { "total": 1795, "transcribed_validated": 259, "transcribed_pending": 1516, "not_transcribed": 20 }, "transcriptions": { "total": 1775, "validated": 259, "not_yet_validated": 1516, "edited": 302, "edited_pct": 17.01 }, "duration": { "total_ms": 37481616, "total_hrs": 10.41, "validated_ms": 5204916, "validated_hrs": 1.44, "avg_ms": 20881, "min_ms": 1764, "max_ms": 86400, "avg_chars_per_sec": 6.94 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.6301, "teens": 0, "twenties": 0.1142, "thirties": 0.0875, "fourties": 0.0635, "fifties": 0, "sixties": 0.0507, "seventies": 0.054, "eighties": 0, "nineties": 0 }, "gender": { "": 0.6301, "male_masculine": 0.093, "female_feminine": 0.2769, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 1795 }, "variant": { "": 1795 } }, "reported": { "clips": 0, "reasons": {} }, "size": 223067335, 
"checksum": "6c1dfa6003126faac8527164cf22b0baafa59bdd355b0b53d763e8a0d15a62b9", "generated_at": "2026-03-10T10:24:11.894Z" } }, "totalDurationMs": 1633870620, "totalValidDurationMs": 964846116, "totalHrs": 454, "totalValidHrs": 268 } ================================================ FILE: datasets/spontaneous-speech/sps-corpus-2.0-delta-2025-12-05.json ================================================ { "locales": { "ady": { "locale": "ady", "clips": 26, "users": 6, "questions": { "total": 128, "validated": 34, "invalidated": 0, "other": 94, "has_audio": 127, "avg_recordings_per_question": 0.2 }, "audios": { "total": 26, "transcribed_validated": 14, "transcribed_pending": 8, "not_transcribed": 4 }, "transcriptions": { "total": 22, "validated": 14, "not_yet_validated": 8, "edited": 3, "edited_pct": 13.64 }, "duration": { "total_ms": 400212, "total_hrs": 0.11, "validated_ms": 202536, "validated_hrs": 0.05, "avg_ms": 15393, "min_ms": 3060, "max_ms": 45288, "avg_chars_per_sec": 6.22 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.0769, "thirties": 0.5385, "fourties": 0, "fifties": 0.3077, "sixties": 0.0769, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0, "male_masculine": 0.3846, "female_feminine": 0.6154, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "abzakh": 18, "bzhedug": 1, "hatikuay": 0, "shapsug": 12, "temirgoy": 10, "ubykh": 0, "": 5 }, "variant": { "ady-Cyrl": 5, "ady-Cyrl-JOR": 0, "ady-Cyrl-SY": 0, "ady-Cyrl-TR": 10, "ady-Latn-TR-t-ady-cyrl": 0, "ady-RU": 11 } }, "reported": { "clips": 1, "reasons": { "other": 1 } }, "size": 2570786, "checksum": "f84d0861bbf5bd3e3099bd82c97bda7ad3dbb7a7077c4fa54f0e0cba2c78c409", "generated_at": "2026-03-10T10:14:10.252Z" 
}, "ba": { "locale": "ba", "clips": 27, "users": 3, "questions": { "total": 167, "validated": 0, "invalidated": 0, "other": 167, "has_audio": 108, "avg_recordings_per_question": 0.25 }, "audios": { "total": 27, "transcribed_validated": 0, "transcribed_pending": 21, "not_transcribed": 6 }, "transcriptions": { "total": 21, "validated": 0, "not_yet_validated": 21, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 249012, "total_hrs": 0.06, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 9223, "min_ms": 3096, "max_ms": 30528, "avg_chars_per_sec": 7.8 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.2593, "thirties": 0, "fourties": 0.1852, "fifties": 0.5556, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0, "male_masculine": 1, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 27 }, "variant": { "": 27 } }, "reported": { "clips": 0, "reasons": {} }, "size": 1473766, "checksum": "265543699d2fa8b2953ddf444361fb49183cd3e5128bfec8fd31e945d4b4c88c", "generated_at": "2026-03-10T10:16:26.117Z" }, "br": { "locale": "br", "clips": 221, "users": 12, "questions": { "total": 76, "validated": 5, "invalidated": 0, "other": 71, "has_audio": 62, "avg_recordings_per_question": 3.56 }, "audios": { "total": 221, "transcribed_validated": 51, "transcribed_pending": 52, "not_transcribed": 118 }, "transcriptions": { "total": 103, "validated": 51, "not_yet_validated": 52, "edited": 9, "edited_pct": 8.74 }, "duration": { "total_ms": 2229948, "total_hrs": 0.61, "validated_ms": 476136, "validated_hrs": 0.13, "avg_ms": 10090, "min_ms": 1980, "max_ms": 49788, "avg_chars_per_sec": 6.44 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, 
"duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.0317, "thirties": 0.2896, "fourties": 0.2986, "fifties": 0.0271, "sixties": 0.2127, "seventies": 0.1403, "eighties": 0, "nineties": 0 }, "gender": { "": 0.991, "male_masculine": 0, "female_feminine": 0.009, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "gwenedeg": 1, "kerneveg": 59, "leoneg": 102, "tregerieg": 114, "user_defined": 45, "": 4 }, "variant": { "": 221 } }, "reported": { "clips": 0, "reasons": {} }, "size": 12614503, "checksum": "b9bdff44e6432953e31a108155952379153e063c44fc9394eac6140722e9d04a", "generated_at": "2026-03-10T10:14:41.238Z" }, "ca": { "locale": "ca", "clips": 82, "users": 8, "questions": { "total": 74, "validated": 14, "invalidated": 0, "other": 60, "has_audio": 57, "avg_recordings_per_question": 1.44 }, "audios": { "total": 82, "transcribed_validated": 66, "transcribed_pending": 16, "not_transcribed": 0 }, "transcriptions": { "total": 82, "validated": 66, "not_yet_validated": 16, "edited": 36, "edited_pct": 43.9 }, "duration": { "total_ms": 1323936, "total_hrs": 0.36, "validated_ms": 973620, "validated_hrs": 0.27, "avg_ms": 16146, "min_ms": 2916, "max_ms": 72180, "avg_chars_per_sec": 6.88 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.0488, "teens": 0, "twenties": 0, "thirties": 0.1098, "fourties": 0.622, "fifties": 0.1951, "sixties": 0.0244, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.0488, "male_masculine": 0.5488, "female_feminine": 0.4024, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { 
"balearic": 0, "central": 0, "learner_es": 0, "learner_other": 0, "northern": 0, "northwestern": 0, "valencian": 19, "user_defined": 32, "": 31 }, "variant": { "ca-algueres": 0, "ca-balear": 0, "ca-central": 42, "ca-northern": 0, "ca-nwestern": 0, "ca-valencia-alacant": 0, "ca-valencia-central": 0, "ca-valencia-northern": 0, "ca-valencia-southern": 0, "ca-valencia-tortosi": 0, "": 40 } }, "reported": { "clips": 2, "reasons": { "other": 2 } }, "size": 7989910, "checksum": "5bf32b2e9b52a6d91cf5701d3029ce75c60f6a68988be5a51d5c43b73b27ea95", "generated_at": "2026-03-10T10:14:48.510Z" }, "cy": { "locale": "cy", "clips": 3, "users": 1, "questions": { "total": 60, "validated": 0, "invalidated": 0, "other": 60, "has_audio": 11, "avg_recordings_per_question": 0.27 }, "audios": { "total": 3, "transcribed_validated": 0, "transcribed_pending": 3, "not_transcribed": 0 }, "transcriptions": { "total": 3, "validated": 0, "not_yet_validated": 3, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 10944, "total_hrs": 0, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 3648, "min_ms": 2808, "max_ms": 4320, "avg_chars_per_sec": 6.68 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 0, "fourties": 0, "fifties": 1, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "united_kingdom": 3 }, "variant": { "cy-midwales": 0, "cy-northeas": 0, "cy-northwes": 0, "cy-southeas": 0, "cy-southwes": 0, "cy-wladfa": 0, "": 3 } }, "reported": { "clips": 0, "reasons": {} }, "size": 64595, "checksum": "f3058406b27173cf0123602f7a1c78b0691d81350c0d0d8a821f1297f39675f0", "generated_at": 
"2026-03-10T10:14:23.599Z" }, "de": { "locale": "de", "clips": 175, "users": 16, "questions": { "total": 92, "validated": 26, "invalidated": 6, "other": 60, "has_audio": 79, "avg_recordings_per_question": 2.22 }, "audios": { "total": 175, "transcribed_validated": 33, "transcribed_pending": 5, "not_transcribed": 137 }, "transcriptions": { "total": 38, "validated": 33, "not_yet_validated": 5, "edited": 12, "edited_pct": 31.58 }, "duration": { "total_ms": 2799468, "total_hrs": 0.77, "validated_ms": 345096, "validated_hrs": 0.09, "avg_ms": 15997, "min_ms": 3096, "max_ms": 140976, "avg_chars_per_sec": 10.52 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0.16, "twenties": 0.0229, "thirties": 0.1714, "fourties": 0.0686, "fifties": 0.3657, "sixties": 0.2114, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.36, "male_masculine": 0.12, "female_feminine": 0.52, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "austria": 0, "belgium": 0, "brazil": 0, "bulgaria": 0, "canada": 0, "czechia": 0, "denmark": 0, "finland": 0, "france": 0, "germany": 107, "greece": 0, "hungary": 0, "italy": 0, "kazakhstan": 0, "liechtenstein": 0, "lithuania": 0, "luxembourg": 0, "namibia": 0, "netherlands": 0, "paraguay": 0, "poland": 0, "romania": 0, "russia": 0, "slovakia": 0, "slovenia": 0, "switzerland": 2, "turkey": 0, "united_kingdom": 0, "united_states": 0, "user_defined": 1, "": 66 }, "variant": { "": 175 } }, "reported": { "clips": 2, "reasons": { "other": 1, "different_language": 1 } }, "size": 16352084, "checksum": "b9d4b61353489801efe87ef6809a3801d5cc1b2b2b1699569b99aa56cc0adf76", "generated_at": "2026-03-10T10:16:21.402Z" }, "en": { "locale": "en", "clips": 4291, "users": 421, "questions": { "total": 
423, "validated": 245, "invalidated": 31, "other": 147, "has_audio": 337, "avg_recordings_per_question": 12.73 }, "audios": { "total": 4291, "transcribed_validated": 1303, "transcribed_pending": 52, "not_transcribed": 2936 }, "transcriptions": { "total": 1355, "validated": 1303, "not_yet_validated": 52, "edited": 104, "edited_pct": 7.68 }, "duration": { "total_ms": 36590184, "total_hrs": 10.16, "validated_ms": 14872500, "validated_hrs": 4.13, "avg_ms": 8527, "min_ms": 108, "max_ms": 400320, "avg_chars_per_sec": 7.1 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.0792, "teens": 0.1454, "twenties": 0.5635, "thirties": 0.0755, "fourties": 0.0114, "fifties": 0.0151, "sixties": 0.0918, "seventies": 0.0091, "eighties": 0.0082, "nineties": 0.0007 }, "gender": { "": 0.5362, "male_masculine": 0.0727, "female_feminine": 0.3319, "intersex": 0.0387, "transgender": 0.0077, "non-binary": 0.0016, "do_not_wish_to_say": 0.0112 }, "accent": { "african": 15, "australia": 3, "bermuda": 0, "canada": 1, "england": 85, "hongkong": 0, "indian": 12, "ireland": 0, "malaysia": 0, "newzealand": 0, "philippines": 1, "scotland": 0, "singapore": 2, "southatlandtic": 0, "us": 153, "wales": 0, "user_defined": 759, "": 3815 }, "variant": { "": 4291 } }, "reported": { "clips": 502, "reasons": { "different_language": 451, "other": 50, "offensive_speech": 1 } }, "size": 233642883, "checksum": "c7c62294bca14ea2ff6cf7946d2b3640e1e48a573cd17a16b4dcce33b000c3d8", "generated_at": "2026-03-10T10:11:45.381Z" }, "es": { "locale": "es", "clips": 12, "users": 3, "questions": { "total": 90, "validated": 25, "invalidated": 0, "other": 65, "has_audio": 12, "avg_recordings_per_question": 1 }, "audios": { "total": 12, "transcribed_validated": 12, "transcribed_pending": 0, "not_transcribed": 0 }, 
"transcriptions": { "total": 12, "validated": 12, "not_yet_validated": 0, "edited": 2, "edited_pct": 16.67 }, "duration": { "total_ms": 110988, "total_hrs": 0.03, "validated_ms": 110988, "validated_hrs": 0.03, "avg_ms": 9249, "min_ms": 1980, "max_ms": 24480, "avg_chars_per_sec": 8.9 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 0.9167, "fourties": 0.0833, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.1667, "male_masculine": 0.8333, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "americacentral": 0, "andino": 0, "canario": 0, "caribe": 0, "centrosurpeninsular": 0, "chileno": 0, "filipinas": 0, "mexicano": 0, "nortepeninsular": 1, "rioplatense": 11, "surpeninsular": 0 }, "variant": { "": 12 } }, "reported": { "clips": 0, "reasons": {} }, "size": 655122, "checksum": "b5d640fbf709248a497da66103dfd3a2edfa5f402d9ef7e2106a53db085ac8ab", "generated_at": "2026-03-10T10:15:58.344Z" }, "fr": { "locale": "fr", "clips": 232, "users": 29, "questions": { "total": 147, "validated": 46, "invalidated": 3, "other": 98, "has_audio": 129, "avg_recordings_per_question": 1.8 }, "audios": { "total": 232, "transcribed_validated": 67, "transcribed_pending": 0, "not_transcribed": 165 }, "transcriptions": { "total": 67, "validated": 67, "not_yet_validated": 0, "edited": 12, "edited_pct": 17.91 }, "duration": { "total_ms": 3082932, "total_hrs": 0.85, "validated_ms": 949284, "validated_hrs": 0.26, "avg_ms": 13289, "min_ms": 288, "max_ms": 90180, "avg_chars_per_sec": 9.59 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 
0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.3233, "thirties": 0.1078, "fourties": 0.0431, "fifties": 0.1422, "sixties": 0.0345, "seventies": 0.3491, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3491, "male_masculine": 0.3966, "female_feminine": 0.1983, "intersex": 0, "transgender": 0.0129, "non-binary": 0.0431, "do_not_wish_to_say": 0 }, "accent": { "algeria": 1, "belgium": 0, "benin": 0, "cameroon": 0, "canada": 0, "central_african_republic": 0, "comoros": 0, "congo_kinshasa": 0, "cote_d_ivoire": 0, "cyprus": 0, "djibouti": 0, "equatorial_guinea": 0, "fr-metro-east": 10, "fr-metro-north": 0, "fr-metro-south": 0, "fr-metro-west": 0, "french_guiana": 0, "french_polynesia": 0, "germany": 3, "guadeloupe": 11, "haiti": 0, "italy": 0, "luxembourg": 0, "madagascar": 0, "mali": 0, "martinique": 0, "mauritania": 0, "mauritius": 0, "mayotte": 0, "monaco": 0, "morocco": 0, "netherlands": 0, "new_caledonia": 0, "niger": 0, "reunion": 0, "romania": 0, "senegal": 0, "seychelles": 0, "st_barthelemy": 0, "st_martin": 0, "st_pierre_et_miquelon": 0, "switzerland": 0, "syria": 0, "togo": 0, "tunisia": 0, "united_kingdom": 0, "united_states": 0, "vanuatu": 0, "wallis_et_futuna": 0, "user_defined": 1, "": 206 }, "variant": { "fr-asia": 0, "fr-droum": 11, "fr-europe": 0, "fr-metro": 163, "fr-nafrica": 1, "fr-namerica": 0, "fr-safrica": 0, "fr-samerica": 0, "": 57 } }, "reported": { "clips": 13, "reasons": { "different_language": 12, "other": 1 } }, "size": 16894840, "checksum": "318551526204692734d49dca0de1e59e645d20eb348eae29d6cceaa9a10e19e0", "generated_at": "2026-03-10T10:12:26.199Z" }, "ga-IE": { "locale": "ga-IE", "clips": 23, "users": 4, "questions": { "total": 104, "validated": 0, "invalidated": 0, "other": 104, "has_audio": 32, "avg_recordings_per_question": 0.72 }, "audios": { "total": 23, "transcribed_validated": 1, "transcribed_pending": 4, "not_transcribed": 18 }, 
"transcriptions": { "total": 5, "validated": 1, "not_yet_validated": 4, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 347508, "total_hrs": 0.09, "validated_ms": 3816, "validated_hrs": 0, "avg_ms": 15109, "min_ms": 3816, "max_ms": 54108, "avg_chars_per_sec": 10 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 0.0435, "fourties": 0.5652, "fifties": 0.3913, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.087, "male_masculine": 0.5652, "female_feminine": 0.3478, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "cainteoir_dúchais_gaeltacht": 10, "cainteoir_dúchais_lasmuigh_den_ghaeltacht": 0, "cainteoir_líofa_ní_ó_dhúchas": 0, "foghlaimeoir_fásta_ó_bhéarla": 0, "foghlaimeoir_fásta_ó_theanga_eile": 0, "foghlaimeoir_i_ngaelscoil": 0, "": 13 }, "variant": { "ga-IE-caighd": 0, "ga-IE-chonnact": 3, "ga-IE-mumhan": 7, "ga-IE-uladh": 0, "": 13 } }, "reported": { "clips": 0, "reasons": {} }, "size": 2043473, "checksum": "8a347c255a6de102f9e2a8335f1d5197aa0fbead23fadcb5499ee3f95ab30ba7", "generated_at": "2026-03-10T10:16:29.146Z" }, "gl": { "locale": "gl", "clips": 87, "users": 13, "questions": { "total": 288, "validated": 19, "invalidated": 0, "other": 269, "has_audio": 166, "avg_recordings_per_question": 0.52 }, "audios": { "total": 87, "transcribed_validated": 43, "transcribed_pending": 44, "not_transcribed": 0 }, "transcriptions": { "total": 87, "validated": 43, "not_yet_validated": 44, "edited": 28, "edited_pct": 32.18 }, "duration": { "total_ms": 1130868, "total_hrs": 0.31, "validated_ms": 452304, "validated_hrs": 0.12, "avg_ms": 12998, "min_ms": 2520, "max_ms": 85536, "avg_chars_per_sec": 8.89 }, "buckets": { "train": { "clips": 0, 
"users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.4943, "thirties": 0.023, "fourties": 0.4253, "fifties": 0.0575, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.0115, "male_masculine": 0.3333, "female_feminine": 0.2644, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.3908 }, "accent": { "atlantico": 1, "central": 4, "neofalante": 10, "normativo": 57, "oriental": 20, "": 1 }, "variant": { "": 87 } }, "reported": { "clips": 0, "reasons": {} }, "size": 6548126, "checksum": "02f06bc63c23014703ac1f8baa3325d7125fe68c54194b42634e64d0d54b852e", "generated_at": "2026-03-10T10:16:04.651Z" }, "gsw": { "locale": "gsw", "clips": 840, "users": 47, "questions": { "total": 65, "validated": 3, "invalidated": 0, "other": 62, "has_audio": 63, "avg_recordings_per_question": 13.33 }, "audios": { "total": 840, "transcribed_validated": 46, "transcribed_pending": 122, "not_transcribed": 672 }, "transcriptions": { "total": 168, "validated": 46, "not_yet_validated": 122, "edited": 46, "edited_pct": 27.38 }, "duration": { "total_ms": 16747164, "total_hrs": 4.65, "validated_ms": 635904, "validated_hrs": 0.17, "avg_ms": 19937, "min_ms": 468, "max_ms": 141732, "avg_chars_per_sec": 7.41 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.0714, "teens": 0, "twenties": 0.056, "thirties": 0.0452, "fourties": 0.1476, "fifties": 0.1869, "sixties": 0.3679, "seventies": 0.125, "eighties": 0, "nineties": 0 }, "gender": { "": 0.6512, "male_masculine": 0, "female_feminine": 0.331, "intersex": 0, "transgender": 0, "non-binary": 0, 
"do_not_wish_to_say": 0.0179 }, "accent": { "user_defined": 32, "": 808 }, "variant": { "gsw-FR-hochalem": 11, "gsw-FR-nordalem": 156, "gsw-FR-rhinfran": 0, "gsw-FR-sudnalem": 3, "gsw-FR-sudrfran": 19, "": 651 } }, "reported": { "clips": 25, "reasons": { "other": 18, "offensive_speech": 3, "different_language": 4 } }, "size": 89921631, "checksum": "13fb06a91b915f608236d9902b1ad9d852f4fa6166ef36d19530cd06c393a1f0", "generated_at": "2026-03-10T10:15:48.329Z" }, "kbd": { "locale": "kbd", "clips": 150, "users": 16, "questions": { "total": 155, "validated": 0, "invalidated": 0, "other": 155, "has_audio": 154, "avg_recordings_per_question": 0.97 }, "audios": { "total": 150, "transcribed_validated": 4, "transcribed_pending": 0, "not_transcribed": 146 }, "transcriptions": { "total": 4, "validated": 4, "not_yet_validated": 0, "edited": 2, "edited_pct": 50 }, "duration": { "total_ms": 3917736, "total_hrs": 1.08, "validated_ms": 90972, "validated_hrs": 0.02, "avg_ms": 26118, "min_ms": 3348, "max_ms": 130104, "avg_chars_per_sec": 8.15 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.1467, "teens": 0, "twenties": 0.38, "thirties": 0.3267, "fourties": 0.1133, "fifties": 0.0333, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3867, "male_masculine": 0, "female_feminine": 0.6133, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "bakhsan": 0, "besleney": 0, "kuban": 0, "kuban_zelenchuk": 1, "malka": 0, "mozdok": 0, "terek": 61, "": 88 }, "variant": { "kbd-Cyrl": 104, "kbd-Cyrl-JOR": 0, "kbd-Cyrl-SY": 0, "kbd-Cyrl-TR": 0, "kbd-Latn-TR-t-kbd-cyrl": 0, "kbd-RU": 1, "": 45 } }, "reported": { "clips": 0, "reasons": {} }, "size": 22871408, "checksum": 
"152b91e7eec0b0a10739e4514b66d7db232d4cb24f224f68cb8616d389dd3e2c", "generated_at": "2026-03-10T10:14:05.400Z" }, "lij": { "locale": "lij", "clips": 95, "users": 4, "questions": { "total": 169, "validated": 2, "invalidated": 0, "other": 167, "has_audio": 164, "avg_recordings_per_question": 0.58 }, "audios": { "total": 95, "transcribed_validated": 38, "transcribed_pending": 6, "not_transcribed": 51 }, "transcriptions": { "total": 44, "validated": 38, "not_yet_validated": 6, "edited": 28, "edited_pct": 63.64 }, "duration": { "total_ms": 3261420, "total_hrs": 0.9, "validated_ms": 1367064, "validated_hrs": 0.37, "avg_ms": 34331, "min_ms": 8532, "max_ms": 118044, "avg_chars_per_sec": 7.37 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 0.6947, "fourties": 0, "fifties": 0, "sixties": 0.2421, "seventies": 0.0632, "eighties": 0, "nineties": 0 }, "gender": { "": 0.0632, "male_masculine": 0.6947, "female_feminine": 0.2421, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 95 }, "variant": { "": 95 } }, "reported": { "clips": 0, "reasons": {} }, "size": 19370634, "checksum": "e597882a56aa8c6197c189e494d4534fc2e53a8819fd8e020234cd6512ba5622", "generated_at": "2026-03-10T10:14:18.123Z" }, "lv": { "locale": "lv", "clips": 6, "users": 3, "questions": { "total": 101, "validated": 0, "invalidated": 0, "other": 101, "has_audio": 22, "avg_recordings_per_question": 0.27 }, "audios": { "total": 6, "transcribed_validated": 1, "transcribed_pending": 1, "not_transcribed": 4 }, "transcriptions": { "total": 2, "validated": 1, "not_yet_validated": 1, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 204948, "total_hrs": 0.05, "validated_ms": 23976, "validated_hrs": 0, "avg_ms": 34158, 
"min_ms": 13500, "max_ms": 86796, "avg_chars_per_sec": 8.26 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0.3333, "twenties": 0, "thirties": 0.6667, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3333, "male_masculine": 0, "female_feminine": 0.6667, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "user_defined": 2, "": 4 }, "variant": { "": 6 } }, "reported": { "clips": 0, "reasons": {} }, "size": 1217992, "checksum": "1471e019fe03acbe65d668ff9cf09bf9f16fb0963cd9948d07f68ae90c15c52e", "generated_at": "2026-03-10T10:14:21.722Z" }, "ms-MY": { "locale": "ms-MY", "clips": 3, "users": 1, "questions": { "total": 119, "validated": 0, "invalidated": 0, "other": 119, "has_audio": 119, "avg_recordings_per_question": 0.03 }, "audios": { "total": 3, "transcribed_validated": 0, "transcribed_pending": 0, "not_transcribed": 3 }, "transcriptions": { "total": 0, "validated": 0, "not_yet_validated": 0, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 9756, "total_hrs": 0, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 3252, "min_ms": 1800, "max_ms": 4500, "avg_chars_per_sec": 0 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 1, "teens": 0, "twenties": 0, "thirties": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0, "male_masculine": 0, "female_feminine": 1, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 3 }, "variant": 
{ "": 3 } }, "reported": { "clips": 0, "reasons": {} }, "size": 54764, "checksum": "07fd5c7bfc41df2ce568b88dea0613fb862df720e63d855e68b22a799042909c", "generated_at": "2026-03-10T10:12:08.853Z" }, "qxp": { "locale": "qxp", "clips": 1143, "users": 15, "questions": { "total": 152, "validated": 109, "invalidated": 1, "other": 42, "has_audio": 150, "avg_recordings_per_question": 7.62 }, "audios": { "total": 1143, "transcribed_validated": 2, "transcribed_pending": 21, "not_transcribed": 1120 }, "transcriptions": { "total": 23, "validated": 2, "not_yet_validated": 21, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 15234732, "total_hrs": 4.23, "validated_ms": 18036, "validated_hrs": 0, "avg_ms": 13329, "min_ms": 1620, "max_ms": 83088, "avg_chars_per_sec": 8.58 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0.1855, "twenties": 0.2213, "thirties": 0.3018, "fourties": 0.1592, "fifties": 0.0726, "sixties": 0.0595, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.6509, "male_masculine": 0, "female_feminine": 0.3491, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "user_defined": 110, "": 1033 }, "variant": { "": 1143 } }, "reported": { "clips": 0, "reasons": {} }, "size": 84965376, "checksum": "bbcb25e717a8a092b6ae42d3df0e7db9ab4f66f9d5759b639a6181417c5b5611", "generated_at": "2026-03-10T10:13:46.123Z" }, "ru": { "locale": "ru", "clips": 415, "users": 11, "questions": { "total": 656, "validated": 587, "invalidated": 1, "other": 68, "has_audio": 274, "avg_recordings_per_question": 1.51 }, "audios": { "total": 415, "transcribed_validated": 382, "transcribed_pending": 26, "not_transcribed": 7 }, "transcriptions": { "total": 408, "validated": 382, "not_yet_validated": 26, "edited": 
64, "edited_pct": 15.69 }, "duration": { "total_ms": 8960688, "total_hrs": 2.48, "validated_ms": 7841736, "validated_hrs": 2.17, "avg_ms": 21592, "min_ms": 2448, "max_ms": 94608, "avg_chars_per_sec": 9.68 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0.0024, "twenties": 0.9639, "thirties": 0.0337, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.812, "male_masculine": 0.1157, "female_feminine": 0.0723, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "user_defined": 186, "": 236 }, "variant": { "": 415 } }, "reported": { "clips": 12, "reasons": { "different_language": 10, "offensive_speech": 1, "other": 1 } }, "size": 52373629, "checksum": "65439081a67b7125ec452dcad08acac51b4d02894bacc88c4a9b2e3cdd013b2e", "generated_at": "2026-03-10T10:16:59.633Z" }, "seh": { "locale": "seh", "clips": 263, "users": 5, "questions": { "total": 61, "validated": 0, "invalidated": 2, "other": 59, "has_audio": 58, "avg_recordings_per_question": 4.53 }, "audios": { "total": 263, "transcribed_validated": 0, "transcribed_pending": 0, "not_transcribed": 263 }, "transcriptions": { "total": 0, "validated": 0, "not_yet_validated": 0, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 4499172, "total_hrs": 1.24, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 17107, "min_ms": 1368, "max_ms": 78696, "avg_chars_per_sec": 0 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.2205, "thirties": 0.3384, "fourties": 0, 
"fifties": 0.2205, "sixties": 0.2205, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.5589, "male_masculine": 0, "female_feminine": 0.4411, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 263 }, "variant": { "": 263 } }, "reported": { "clips": 0, "reasons": {} }, "size": 19111834, "checksum": "f21898694fcc641c0319563f33c8b94fd13f501030ac019136a6d6eca8604b4e", "generated_at": "2026-03-10T10:17:22.579Z" }, "shi": { "locale": "shi", "clips": 39, "users": 4, "questions": { "total": 60, "validated": 0, "invalidated": 0, "other": 60, "has_audio": 34, "avg_recordings_per_question": 1.15 }, "audios": { "total": 39, "transcribed_validated": 2, "transcribed_pending": 2, "not_transcribed": 35 }, "transcriptions": { "total": 4, "validated": 2, "not_yet_validated": 2, "edited": 1, "edited_pct": 25 }, "duration": { "total_ms": 1151064, "total_hrs": 0.31, "validated_ms": 26496, "validated_hrs": 0, "avg_ms": 29514, "min_ms": 5040, "max_ms": 90540, "avg_chars_per_sec": 7.59 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.8205, "thirties": 0.1795, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.8205, "male_masculine": 0.1795, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 39 }, "variant": { "": 39 } }, "reported": { "clips": 0, "reasons": {} }, "size": 6822400, "checksum": "ecf23b5617e304f34d05947e45402c6efdd014e0f5a490fb887defb1aa76bd6d", "generated_at": "2026-03-10T10:17:32.728Z" }, "sn": { "locale": "sn", "clips": 6, "users": 2, "questions": { "total": 60, "validated": 0, "invalidated": 0, "other": 60, "has_audio": 6, 
"avg_recordings_per_question": 1 }, "audios": { "total": 6, "transcribed_validated": 0, "transcribed_pending": 0, "not_transcribed": 6 }, "transcriptions": { "total": 0, "validated": 0, "not_yet_validated": 0, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 270108, "total_hrs": 0.07, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 45018, "min_ms": 13536, "max_ms": 74880, "avg_chars_per_sec": 0 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.1667, "thirties": 0, "fourties": 0, "fifties": 0.8333, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.1667, "male_masculine": 0, "female_feminine": 0.8333, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 6 }, "variant": { "": 6 } }, "reported": { "clips": 0, "reasons": {} }, "size": 1603300, "checksum": "df0b38ca79e41df69c1118f92e7348a7d056b0f1b8c2fc00065beb5a2f7f8bc4", "generated_at": "2026-03-10T10:17:26.362Z" }, "th": { "locale": "th", "clips": 6, "users": 1, "questions": { "total": 61, "validated": 1, "invalidated": 0, "other": 60, "has_audio": 6, "avg_recordings_per_question": 1 }, "audios": { "total": 6, "transcribed_validated": 3, "transcribed_pending": 0, "not_transcribed": 3 }, "transcriptions": { "total": 3, "validated": 3, "not_yet_validated": 0, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 19584, "total_hrs": 0, "validated_ms": 11124, "validated_hrs": 0, "avg_ms": 3264, "min_ms": 2016, "max_ms": 4248, "avg_chars_per_sec": 6.41 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, 
"demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 1, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "user_defined": 6 }, "variant": { "": 6 } }, "reported": { "clips": 0, "reasons": {} }, "size": 90200, "checksum": "02839bff1e273695f9c71bde23713144f8482f7657867fae4c96e7a6a1b5101c", "generated_at": "2026-03-10T10:17:28.394Z" }, "tr": { "locale": "tr", "clips": 7, "users": 2, "questions": { "total": 140, "validated": 0, "invalidated": 0, "other": 140, "has_audio": 30, "avg_recordings_per_question": 0.23 }, "audios": { "total": 7, "transcribed_validated": 1, "transcribed_pending": 0, "not_transcribed": 6 }, "transcriptions": { "total": 1, "validated": 1, "not_yet_validated": 0, "edited": 1, "edited_pct": 100 }, "duration": { "total_ms": 192384, "total_hrs": 0.05, "validated_ms": 4428, "validated_hrs": 0, "avg_ms": 27483, "min_ms": 4428, "max_ms": 40500, "avg_chars_per_sec": 7.45 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 0.1429, "fourties": 0, "fifties": 0, "sixties": 0.8571, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.1429, "male_masculine": 0.8571, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "user_defined": 6, "": 1 }, "variant": { "": 7 } }, "reported": { "clips": 0, "reasons": {} }, "size": 1134902, "checksum": "44486e47011ae3e37b92a9f4d48b84b462c436baccc54b7eac0299126ef1a3ca", "generated_at": "2026-03-10T10:12:30.029Z" } }, "totalDurationMs": 102744756, "totalValidDurationMs": 28406016, "totalHrs": 
28, "totalValidHrs": 8 } ================================================ FILE: datasets/spontaneous-speech/sps-corpus-3.0-2026-03-09.json ================================================ { "locales": { "aat": { "locale": "aat", "clips": 334, "users": 5, "questions": { "total": 111, "validated": 0, "invalidated": 0, "other": 111, "has_audio": 111, "avg_recordings_per_question": 3.01 }, "audios": { "total": 334, "transcribed_validated": 0, "transcribed_pending": 194, "not_transcribed": 140 }, "transcriptions": { "total": 194, "validated": 0, "not_yet_validated": 194, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 8290188, "total_hrs": 2.3, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 24821, "min_ms": 3996, "max_ms": 193536, "avg_chars_per_sec": 6.45 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0.7006, "eighties": 0.2994, "nineties": 0 }, "gender": { "": 0.1587, "male_masculine": 0, "female_feminine": 0.8413, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 334 }, "variant": { "": 334 } }, "reported": { "clips": 0, "reasons": {} }, "size": 49125514, "checksum": "c464a404031cc74f41b10e2eb9a23110d688838f700df4f350c28310c51b09de", "generated_at": "2026-03-15T02:21:35.297Z" }, "ady": { "locale": "ady", "clips": 836, "users": 27, "questions": { "total": 128, "validated": 34, "invalidated": 0, "other": 94, "has_audio": 128, "avg_recordings_per_question": 6.53 }, "audios": { "total": 836, "transcribed_validated": 694, "transcribed_pending": 44, "not_transcribed": 98 }, "transcriptions": { "total": 738, "validated": 694, "not_yet_validated": 44, "edited": 329, "edited_pct": 44.58 }, "duration": { 
"total_ms": 20579508, "total_hrs": 5.71, "validated_ms": 16833672, "validated_hrs": 4.67, "avg_ms": 24617, "min_ms": 1116, "max_ms": 132048, "avg_chars_per_sec": 7.65 }, "buckets": { "train": { "clips": 235, "users": 13, "duration_ms": 5042916, "duration_hrs": 1.4 }, "dev": { "clips": 276, "users": 9, "duration_ms": 5557176, "duration_hrs": 1.54 }, "test": { "clips": 183, "users": 3, "duration_ms": 6233580, "duration_hrs": 1.73 } }, "demographics": { "age": { "": 0.0335, "teens": 0, "twenties": 0.1304, "thirties": 0.1974, "fourties": 0.1459, "fifties": 0.3457, "sixties": 0.11, "seventies": 0.0371, "eighties": 0, "nineties": 0 }, "gender": { "": 0.1136, "male_masculine": 0.2428, "female_feminine": 0.6435, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "abzakh": 67, "bzhedug": 79, "hatikuay": 0, "shapsug": 219, "temirgoy": 91, "ubykh": 0, "user_defined": 13, "": 401 }, "variant": { "ady-Cyrl": 413, "ady-Cyrl-JOR": 0, "ady-Cyrl-SY": 0, "ady-Cyrl-TR": 271, "ady-Latn-TR-t-ady-cyrl": 0, "ady-RU": 19, "": 133 } }, "reported": { "clips": 4, "reasons": { "other": 3, "different_language": 1 } }, "size": 114901829, "checksum": "076e7e9e6b7e926ef0dd69f3a215800d0823d03dcfb20207407b9c8d9247b8b2", "generated_at": "2026-03-15T02:21:45.803Z" }, "aln": { "locale": "aln", "clips": 1654, "users": 14, "questions": { "total": 144, "validated": 0, "invalidated": 0, "other": 144, "has_audio": 144, "avg_recordings_per_question": 11.49 }, "audios": { "total": 1654, "transcribed_validated": 1516, "transcribed_pending": 0, "not_transcribed": 138 }, "transcriptions": { "total": 1516, "validated": 1516, "not_yet_validated": 0, "edited": 103, "edited_pct": 6.79 }, "duration": { "total_ms": 39111120, "total_hrs": 10.86, "validated_ms": 37793736, "validated_hrs": 10.49, "avg_ms": 23646, "min_ms": 396, "max_ms": 73368, "avg_chars_per_sec": 6.9 }, "buckets": { "train": { "clips": 934, "users": 7, "duration_ms": 24303456, "duration_hrs": 6.75 }, "dev": { 
"clips": 300, "users": 3, "duration_ms": 6777828, "duration_hrs": 1.88 }, "test": { "clips": 282, "users": 3, "duration_ms": 6712452, "duration_hrs": 1.86 } }, "demographics": { "age": { "": 0.0707, "teens": 0.1608, "twenties": 0.0036, "thirties": 0.0871, "fourties": 0.4317, "fifties": 0.2461, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.4426, "male_masculine": 0.0738, "female_feminine": 0.4837, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 1654 }, "variant": { "": 1654 } }, "reported": { "clips": 95, "reasons": { "other": 95 } }, "size": 212143711, "checksum": "7b54e0f54e000d8321f5c2b6d4e56248bff11b9b524f29c98ee353a9be8c2809", "generated_at": "2026-03-15T02:19:26.495Z" }, "an": { "locale": "an", "clips": 240, "users": 3, "questions": { "total": 334, "validated": 112, "invalidated": 0, "other": 222, "has_audio": 220, "avg_recordings_per_question": 1.09 }, "audios": { "total": 240, "transcribed_validated": 29, "transcribed_pending": 10, "not_transcribed": 201 }, "transcriptions": { "total": 39, "validated": 29, "not_yet_validated": 10, "edited": 13, "edited_pct": 33.33 }, "duration": { "total_ms": 6663672, "total_hrs": 1.85, "validated_ms": 615132, "validated_hrs": 0.17, "avg_ms": 27765, "min_ms": 288, "max_ms": 103536, "avg_chars_per_sec": 8.95 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 0.0042, "fourties": 0, "fifties": 0.0792, "sixties": 0.9167, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.9208, "male_masculine": 0, "female_feminine": 0.0792, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "user_defined": 239, "": 1 }, "variant": { "": 240 } }, "reported": { 
"clips": 0, "reasons": {} }, "size": 39629268, "checksum": "2e0531949c96982e07b4c9238c979827777cb9a5df964fbf13a37f402c1e59e5", "generated_at": "2026-03-15T02:20:52.902Z" }, "ba": { "locale": "ba", "clips": 127, "users": 4, "questions": { "total": 167, "validated": 0, "invalidated": 0, "other": 167, "has_audio": 108, "avg_recordings_per_question": 1.18 }, "audios": { "total": 127, "transcribed_validated": 6, "transcribed_pending": 115, "not_transcribed": 6 }, "transcriptions": { "total": 121, "validated": 6, "not_yet_validated": 115, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 901872, "total_hrs": 0.25, "validated_ms": 31176, "validated_hrs": 0, "avg_ms": 7101, "min_ms": 1836, "max_ms": 31536, "avg_chars_per_sec": 7.3 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.0551, "thirties": 0.0315, "fourties": 0.1339, "fifties": 0.7795, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0, "male_masculine": 1, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 127 }, "variant": { "": 127 } }, "reported": { "clips": 0, "reasons": {} }, "size": 5349385, "checksum": "84c7d331308134980392c9c719e8514dd138188154a3ba95f7c0ba126c6e2ee8", "generated_at": "2026-03-15T02:25:12.471Z" }, "bas": { "locale": "bas", "clips": 773, "users": 11, "questions": { "total": 74, "validated": 0, "invalidated": 0, "other": 74, "has_audio": 74, "avg_recordings_per_question": 10.45 }, "audios": { "total": 773, "transcribed_validated": 734, "transcribed_pending": 38, "not_transcribed": 1 }, "transcriptions": { "total": 772, "validated": 734, "not_yet_validated": 38, "edited": 500, "edited_pct": 64.77 }, "duration": { "total_ms": 19354320, "total_hrs": 
5.37, "validated_ms": 18232452, "validated_hrs": 5.06, "avg_ms": 25038, "min_ms": 4680, "max_ms": 64116, "avg_chars_per_sec": 8.86 }, "buckets": { "train": { "clips": 203, "users": 3, "duration_ms": 5405004, "duration_hrs": 1.5 }, "dev": { "clips": 252, "users": 4, "duration_ms": 6090300, "duration_hrs": 1.69 }, "test": { "clips": 279, "users": 4, "duration_ms": 6737148, "duration_hrs": 1.87 } }, "demographics": { "age": { "": 0, "teens": 0.1889, "twenties": 0, "thirties": 0.1863, "fourties": 0.5304, "fifties": 0.0944, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.5705, "male_masculine": 0.4295, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 773 }, "variant": { "": 773 } }, "reported": { "clips": 0, "reasons": {} }, "size": 115066454, "checksum": "b272db162fe71fecc98fcb9831a97f52a125fbfa82c65ad3c8997a0a3014e1d4", "generated_at": "2026-03-15T02:25:05.701Z" }, "bew": { "locale": "bew", "clips": 1336, "users": 21, "questions": { "total": 200, "validated": 0, "invalidated": 0, "other": 200, "has_audio": 199, "avg_recordings_per_question": 6.71 }, "audios": { "total": 1336, "transcribed_validated": 1270, "transcribed_pending": 12, "not_transcribed": 54 }, "transcriptions": { "total": 1282, "validated": 1270, "not_yet_validated": 12, "edited": 1127, "edited_pct": 87.91 }, "duration": { "total_ms": 37777068, "total_hrs": 10.49, "validated_ms": 35235684, "validated_hrs": 9.78, "avg_ms": 28276, "min_ms": 3168, "max_ms": 156996, "avg_chars_per_sec": 9.27 }, "buckets": { "train": { "clips": 764, "users": 12, "duration_ms": 22322088, "duration_hrs": 6.2 }, "dev": { "clips": 206, "users": 3, "duration_ms": 5832036, "duration_hrs": 1.62 }, "test": { "clips": 300, "users": 6, "duration_ms": 7081560, "duration_hrs": 1.96 } }, "demographics": { "age": { "": 0.6557, "teens": 0, "twenties": 0.1198, "thirties": 0.0756, "fourties": 0.149, "fifties": 0, "sixties": 0, "seventies": 0, 
"eighties": 0, "nineties": 0 }, "gender": { "": 0.6557, "male_masculine": 0.3443, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 1336 }, "variant": { "": 1336 } }, "reported": { "clips": 0, "reasons": {} }, "size": 224878963, "checksum": "52513690f628cc6317d872fa1fa653346b52b975a61c6ea6ecbc99a508648a2d", "generated_at": "2026-03-15T02:16:05.371Z" }, "br": { "locale": "br", "clips": 248, "users": 14, "questions": { "total": 80, "validated": 5, "invalidated": 0, "other": 75, "has_audio": 62, "avg_recordings_per_question": 4 }, "audios": { "total": 248, "transcribed_validated": 73, "transcribed_pending": 55, "not_transcribed": 120 }, "transcriptions": { "total": 128, "validated": 73, "not_yet_validated": 55, "edited": 12, "edited_pct": 9.38 }, "duration": { "total_ms": 2574252, "total_hrs": 0.71, "validated_ms": 698256, "validated_hrs": 0.19, "avg_ms": 10380, "min_ms": 1980, "max_ms": 49788, "avg_chars_per_sec": 6.68 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.0282, "thirties": 0.2702, "fourties": 0.2782, "fifties": 0.0363, "sixties": 0.1976, "seventies": 0.1895, "eighties": 0, "nineties": 0 }, "gender": { "": 0.9798, "male_masculine": 0, "female_feminine": 0.0202, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "gwenedeg": 4, "kerneveg": 65, "leoneg": 118, "tregerieg": 116, "user_defined": 45, "": 4 }, "variant": { "": 248 } }, "reported": { "clips": 0, "reasons": {} }, "size": 14584929, "checksum": "7d574ef0f39243eb963e48a4cce08cbfcc1afc72675952da2f1a463e83bc1864", "generated_at": "2026-03-15T02:24:49.075Z" }, "brx": { "locale": "brx", "clips": 69, "users": 1, "questions": { "total": 69, "validated": 3, 
"invalidated": 0, "other": 66, "has_audio": 69, "avg_recordings_per_question": 1 }, "audios": { "total": 69, "transcribed_validated": 25, "transcribed_pending": 44, "not_transcribed": 0 }, "transcriptions": { "total": 69, "validated": 25, "not_yet_validated": 44, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 232344, "total_hrs": 0.06, "validated_ms": 84816, "validated_hrs": 0.02, "avg_ms": 3367, "min_ms": 2376, "max_ms": 7128, "avg_chars_per_sec": 9.17 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 0, "fourties": 1, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0, "male_masculine": 0, "female_feminine": 1, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 69 }, "variant": { "": 69 } }, "reported": { "clips": 0, "reasons": {} }, "size": 1365271, "checksum": "2eeaf0f8103ced555f9e976be3f5648cf64dacc1e451152024d00576351a9810", "generated_at": "2026-03-15T02:25:15.870Z" }, "bsy": { "locale": "bsy", "clips": 1741, "users": 25, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 14.51 }, "audios": { "total": 1741, "transcribed_validated": 0, "transcribed_pending": 1741, "not_transcribed": 0 }, "transcriptions": { "total": 1741, "validated": 0, "not_yet_validated": 1741, "edited": 2, "edited_pct": 0.11 }, "duration": { "total_ms": 38892996, "total_hrs": 10.8, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 22339, "min_ms": 1764, "max_ms": 154764, "avg_chars_per_sec": 8.19 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { 
"clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.7777, "teens": 0, "twenties": 0.0638, "thirties": 0, "fourties": 0.0402, "fifties": 0.0632, "sixties": 0.0551, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.7777, "male_masculine": 0.104, "female_feminine": 0.1183, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 1741 }, "variant": { "": 1741 } }, "reported": { "clips": 26, "reasons": { "other": 21, "different_language": 4, "personally_identifiable_information": 1 } }, "size": 232063736, "checksum": "2156cc6f859ca8c483ffd69c4262ba82aaa54f5772e399283086a2a5c87f87bb", "generated_at": "2026-03-15T02:14:31.177Z" }, "bxk": { "locale": "bxk", "clips": 3214, "users": 27, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 26.78 }, "audios": { "total": 3214, "transcribed_validated": 2215, "transcribed_pending": 0, "not_transcribed": 999 }, "transcriptions": { "total": 2215, "validated": 2215, "not_yet_validated": 0, "edited": 398, "edited_pct": 17.97 }, "duration": { "total_ms": 57135060, "total_hrs": 15.87, "validated_ms": 39742308, "validated_hrs": 11.03, "avg_ms": 17777, "min_ms": 540, "max_ms": 72180, "avg_chars_per_sec": 8.12 }, "buckets": { "train": { "clips": 1423, "users": 15, "duration_ms": 26653680, "duration_hrs": 7.4 }, "dev": { "clips": 452, "users": 5, "duration_ms": 6319404, "duration_hrs": 1.75 }, "test": { "clips": 340, "users": 4, "duration_ms": 6769224, "duration_hrs": 1.88 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.0373, "thirties": 0.1733, "fourties": 0.257, "fifties": 0.3158, "sixties": 0.2166, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.1758, "male_masculine": 0.5756, "female_feminine": 0.2486, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 3214 }, "variant": { "": 3214 } }, 
"reported": { "clips": 40, "reasons": { "other": 36, "different_language": 2, "personally_identifiable_information": 2 } }, "size": 275084779, "checksum": "6bc60c69d6f39e56a03b85cbe358d5e07053dfd0e792e463538715c4d5443410", "generated_at": "2026-03-15T02:22:16.290Z" }, "ca": { "locale": "ca", "clips": 149, "users": 15, "questions": { "total": 74, "validated": 14, "invalidated": 0, "other": 60, "has_audio": 60, "avg_recordings_per_question": 2.48 }, "audios": { "total": 149, "transcribed_validated": 123, "transcribed_pending": 19, "not_transcribed": 7 }, "transcriptions": { "total": 142, "validated": 123, "not_yet_validated": 19, "edited": 42, "edited_pct": 29.58 }, "duration": { "total_ms": 2357676, "total_hrs": 0.65, "validated_ms": 1783692, "validated_hrs": 0.49, "avg_ms": 15823, "min_ms": 2556, "max_ms": 72180, "avg_chars_per_sec": 7.57 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.1007, "teens": 0, "twenties": 0.0268, "thirties": 0.1074, "fourties": 0.4094, "fifties": 0.1812, "sixties": 0.1745, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.1208, "male_masculine": 0.5772, "female_feminine": 0.302, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "balearic": 2, "central": 12, "learner_es": 0, "learner_other": 0, "northern": 0, "northwestern": 0, "valencian": 28, "user_defined": 32, "": 77 }, "variant": { "ca-algueres": 0, "ca-balear": 2, "ca-central": 91, "ca-northern": 0, "ca-nwestern": 0, "ca-valencia-alacant": 0, "ca-valencia-central": 1, "ca-valencia-northern": 0, "ca-valencia-southern": 0, "ca-valencia-tortosi": 0, "": 55 } }, "reported": { "clips": 2, "reasons": { "other": 2 } }, "size": 13972010, "checksum": "51073a7f9376d33b5b68fe300506675d4f3768870c79f35b4aae5c7f07f81dd3", 
"generated_at": "2026-03-15T02:24:23.648Z" }, "cdo": { "locale": "cdo", "clips": 1034, "users": 32, "questions": { "total": 120, "validated": 1, "invalidated": 0, "other": 119, "has_audio": 119, "avg_recordings_per_question": 8.69 }, "audios": { "total": 1034, "transcribed_validated": 107, "transcribed_pending": 927, "not_transcribed": 0 }, "transcriptions": { "total": 1034, "validated": 107, "not_yet_validated": 927, "edited": 73, "edited_pct": 7.06 }, "duration": { "total_ms": 35166960, "total_hrs": 9.76, "validated_ms": 3289824, "validated_hrs": 0.91, "avg_ms": 34011, "min_ms": 720, "max_ms": 476568, "avg_chars_per_sec": 2.67 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.8027, "teens": 0, "twenties": 0, "thirties": 0, "fourties": 0.1161, "fifties": 0.0812, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.8046, "male_masculine": 0.0812, "female_feminine": 0.1141, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 1034 }, "variant": { "": 1034 } }, "reported": { "clips": 4, "reasons": { "other": 4 } }, "size": 201109488, "checksum": "eeb97347bfafc0caf1ce8a7774ec3d5e92f1395e20f28686884599a41cb0f06b", "generated_at": "2026-03-15T02:12:29.085Z" }, "cgg": { "locale": "cgg", "clips": 3085, "users": 27, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 25.71 }, "audios": { "total": 3085, "transcribed_validated": 2711, "transcribed_pending": 0, "not_transcribed": 374 }, "transcriptions": { "total": 2711, "validated": 2711, "not_yet_validated": 0, "edited": 516, "edited_pct": 19.03 }, "duration": { "total_ms": 42117228, "total_hrs": 11.69, "validated_ms": 36681696, "validated_hrs": 10.18, "avg_ms": 13652, 
"min_ms": 936, "max_ms": 86976, "avg_chars_per_sec": 8.61 }, "buckets": { "train": { "clips": 1772, "users": 18, "duration_ms": 25631136, "duration_hrs": 7.11 }, "dev": { "clips": 398, "users": 4, "duration_ms": 5415480, "duration_hrs": 1.5 }, "test": { "clips": 541, "users": 5, "duration_ms": 5635080, "duration_hrs": 1.56 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.5825, "thirties": 0.2716, "fourties": 0.1459, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.1164, "male_masculine": 0.2726, "female_feminine": 0.611, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 3085 }, "variant": { "": 3085 } }, "reported": { "clips": 0, "reasons": {} }, "size": 245576787, "checksum": "0ac4a7bf4818a83c2c93c011c9730ea6b5c10255801efef8d27b2e57cd9fd8e5", "generated_at": "2026-03-15T02:23:05.992Z" }, "cpx": { "locale": "cpx", "clips": 2007, "users": 29, "questions": { "total": 119, "validated": 0, "invalidated": 0, "other": 119, "has_audio": 119, "avg_recordings_per_question": 16.87 }, "audios": { "total": 2007, "transcribed_validated": 13, "transcribed_pending": 1991, "not_transcribed": 3 }, "transcriptions": { "total": 2004, "validated": 13, "not_yet_validated": 1991, "edited": 3, "edited_pct": 0.15 }, "duration": { "total_ms": 38396232, "total_hrs": 10.66, "validated_ms": 116244, "validated_hrs": 0.03, "avg_ms": 19131, "min_ms": 720, "max_ms": 83160, "avg_chars_per_sec": 3.23 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.7997, "teens": 0, "twenties": 0, "thirties": 0, "fourties": 0.0877, "fifties": 0.0593, "sixties": 0.0249, "seventies": 0.0284, "eighties": 0, "nineties": 0 }, "gender": { "": 0.7997, "male_masculine": 0.1221, "female_feminine": 0.0782, 
"intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 2007 }, "variant": { "": 2007 } }, "reported": { "clips": 110, "reasons": { "other": 45, "different_language": 65 } }, "size": 236716275, "checksum": "942634671d85789e6692d40f9f4a4d4eab9c66b5040d0a78bd7576578e3a766c", "generated_at": "2026-03-15T02:12:10.952Z" }, "cy": { "locale": "cy", "clips": 16, "users": 3, "questions": { "total": 60, "validated": 0, "invalidated": 0, "other": 60, "has_audio": 16, "avg_recordings_per_question": 1 }, "audios": { "total": 16, "transcribed_validated": 0, "transcribed_pending": 3, "not_transcribed": 13 }, "transcriptions": { "total": 3, "validated": 0, "not_yet_validated": 3, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 757908, "total_hrs": 0.21, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 47369, "min_ms": 2808, "max_ms": 162288, "avg_chars_per_sec": 6.68 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.8125, "thirties": 0, "fourties": 0, "fifties": 0.1875, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.1875, "male_masculine": 0, "female_feminine": 0.3125, "intersex": 0, "transgender": 0, "non-binary": 0.5, "do_not_wish_to_say": 0 }, "accent": { "united_kingdom": 8, "user_defined": 8 }, "variant": { "cy-midwales": 0, "cy-northeas": 0, "cy-northwes": 8, "cy-southeas": 0, "cy-southwes": 0, "cy-wladfa": 0, "": 8 } }, "reported": { "clips": 0, "reasons": {} }, "size": 4504784, "checksum": "b315b922be5e7c08277a9d42ab52d8eae6e71d5cd0411f196f05959bdd71de78", "generated_at": "2026-03-15T02:24:10.021Z" }, "da": { "locale": "da", "clips": 1, "users": 1, "questions": { "total": 94, "validated": 6, "invalidated": 0, "other": 88, "has_audio": 1, 
"avg_recordings_per_question": 1 }, "audios": { "total": 1, "transcribed_validated": 0, "transcribed_pending": 0, "not_transcribed": 1 }, "transcriptions": { "total": 0, "validated": 0, "not_yet_validated": 0, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 10260, "total_hrs": 0, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 10260, "min_ms": 10260, "max_ms": 10260, "avg_chars_per_sec": 0 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 1, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 1 }, "variant": { "": 1 } }, "reported": { "clips": 0, "reasons": {} }, "size": 63285, "checksum": "6efb7a44323e2867aa7c86383dff79ed695d0c9edd34d1baac9f22532f7ca7ae", "generated_at": "2026-03-15T02:25:57.788Z" }, "de": { "locale": "de", "clips": 229, "users": 23, "questions": { "total": 97, "validated": 30, "invalidated": 6, "other": 61, "has_audio": 81, "avg_recordings_per_question": 2.83 }, "audios": { "total": 229, "transcribed_validated": 35, "transcribed_pending": 5, "not_transcribed": 189 }, "transcriptions": { "total": 40, "validated": 35, "not_yet_validated": 5, "edited": 12, "edited_pct": 30 }, "duration": { "total_ms": 4475880, "total_hrs": 1.24, "validated_ms": 356688, "validated_hrs": 0.09, "avg_ms": 19545, "min_ms": 3024, "max_ms": 160704, "avg_chars_per_sec": 10.55 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, 
"demographics": { "age": { "": 0, "teens": 0.131, "twenties": 0.1179, "thirties": 0.1397, "fourties": 0.1354, "fifties": 0.3144, "sixties": 0.1616, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2882, "male_masculine": 0.262, "female_feminine": 0.4454, "intersex": 0, "transgender": 0.0044, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "austria": 0, "belgium": 0, "brazil": 0, "bulgaria": 0, "canada": 0, "czechia": 0, "denmark": 0, "finland": 0, "france": 0, "germany": 158, "greece": 0, "hungary": 0, "italy": 0, "kazakhstan": 0, "liechtenstein": 0, "lithuania": 0, "luxembourg": 0, "namibia": 0, "netherlands": 0, "paraguay": 0, "poland": 0, "romania": 0, "russia": 0, "slovakia": 0, "slovenia": 0, "switzerland": 6, "turkey": 0, "united_kingdom": 0, "united_states": 0, "user_defined": 6, "": 69 }, "variant": { "": 229 } }, "reported": { "clips": 3, "reasons": { "other": 2, "different_language": 1 } }, "size": 24406428, "checksum": "ff97500d1002a935f0d47c2b4472bb7b9d697e61bcb9c59ac9e5ecc41d6401f6", "generated_at": "2026-03-15T02:25:17.960Z" }, "el-CY": { "locale": "el-CY", "clips": 1221, "users": 10, "questions": { "total": 146, "validated": 0, "invalidated": 0, "other": 146, "has_audio": 146, "avg_recordings_per_question": 8.36 }, "audios": { "total": 1221, "transcribed_validated": 1217, "transcribed_pending": 3, "not_transcribed": 1 }, "transcriptions": { "total": 1220, "validated": 1217, "not_yet_validated": 3, "edited": 1196, "edited_pct": 98.03 }, "duration": { "total_ms": 37345716, "total_hrs": 10.37, "validated_ms": 37167084, "validated_hrs": 10.32, "avg_ms": 30586, "min_ms": 3456, "max_ms": 271080, "avg_chars_per_sec": 9.52 }, "buckets": { "train": { "clips": 880, "users": 7, "duration_ms": 26170668, "duration_hrs": 7.26 }, "dev": { "clips": 134, "users": 1, "duration_ms": 5520672, "duration_hrs": 1.53 }, "test": { "clips": 203, "users": 2, "duration_ms": 5475744, "duration_hrs": 1.52 } }, "demographics": { "age": { "": 0, "teens": 0, 
"twenties": 0.6585, "thirties": 0.1196, "fourties": 0.1753, "fifties": 0, "sixties": 0, "seventies": 0.0467, "eighties": 0, "nineties": 0 }, "gender": { "": 0, "male_masculine": 0.4144, "female_feminine": 0.5856, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 1221 }, "variant": { "": 1221 } }, "reported": { "clips": 0, "reasons": {} }, "size": 221759876, "checksum": "52e5da645b4e12985558f07cef17a8ae3216e810bd2886e889ea6b3eae1a6ca0", "generated_at": "2026-03-15T02:21:04.604Z" }, "en": { "locale": "en", "clips": 7084, "users": 628, "questions": { "total": 527, "validated": 330, "invalidated": 38, "other": 159, "has_audio": 448, "avg_recordings_per_question": 15.81 }, "audios": { "total": 7084, "transcribed_validated": 2309, "transcribed_pending": 70, "not_transcribed": 4705 }, "transcriptions": { "total": 2379, "validated": 2309, "not_yet_validated": 70, "edited": 275, "edited_pct": 11.56 }, "duration": { "total_ms": 73538100, "total_hrs": 20.42, "validated_ms": 29466000, "validated_hrs": 8.18, "avg_ms": 10381, "min_ms": 108, "max_ms": 400320, "avg_chars_per_sec": 7.27 }, "buckets": { "train": { "clips": 1397, "users": 214, "duration_ms": 17163900, "duration_hrs": 4.76 }, "dev": { "clips": 490, "users": 84, "duration_ms": 6009084, "duration_hrs": 1.66 }, "test": { "clips": 422, "users": 51, "duration_ms": 6293016, "duration_hrs": 1.74 } }, "demographics": { "age": { "": 0.0776, "teens": 0.1145, "twenties": 0.4904, "thirties": 0.1184, "fourties": 0.0208, "fifties": 0.0645, "sixties": 0.074, "seventies": 0.0198, "eighties": 0.0196, "nineties": 0.0004 }, "gender": { "": 0.4581, "male_masculine": 0.1165, "female_feminine": 0.3656, "intersex": 0.0234, "transgender": 0.0058, "non-binary": 0.004, "do_not_wish_to_say": 0.0267 }, "accent": { "african": 20, "australia": 33, "bermuda": 0, "canada": 7, "england": 258, "hongkong": 0, "indian": 65, "ireland": 3, "malaysia": 4, "newzealand": 0, "philippines": 1, "scotland": 1, 
"singapore": 8, "southatlandtic": 0, "us": 444, "wales": 0, "user_defined": 837, "": 6031 }, "variant": { "": 7084 } }, "reported": { "clips": 828, "reasons": { "other": 81, "different_language": 744, "personally_identifiable_information": 2, "offensive_speech": 1 } }, "size": 481371909, "checksum": "21b69fbfa31fb146e16ffb27d6f9e7be4ef46a54ca0b89f9a59d2bf4c0d4bfd9", "generated_at": "2026-03-15T02:11:38.798Z" }, "eo": { "locale": "eo", "clips": 66, "users": 3, "questions": { "total": 81, "validated": 2, "invalidated": 0, "other": 79, "has_audio": 55, "avg_recordings_per_question": 1.2 }, "audios": { "total": 66, "transcribed_validated": 11, "transcribed_pending": 7, "not_transcribed": 48 }, "transcriptions": { "total": 18, "validated": 11, "not_yet_validated": 7, "edited": 2, "edited_pct": 11.11 }, "duration": { "total_ms": 2209212, "total_hrs": 0.61, "validated_ms": 79308, "validated_hrs": 0.02, "avg_ms": 33473, "min_ms": 2340, "max_ms": 164376, "avg_chars_per_sec": 5.58 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.7424, "thirties": 0.2576, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.7424, "male_masculine": 0.2576, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "internacia": 6, "user_defined": 33, "": 49 }, "variant": { "": 66 } }, "reported": { "clips": 0, "reasons": {} }, "size": 13118366, "checksum": "c86837ebac6f20ccd03c12f45daa2dacf120c6de783e7c983b87aeb08c1fbc3a", "generated_at": "2026-03-15T02:25:59.454Z" }, "es": { "locale": "es", "clips": 332, "users": 12, "questions": { "total": 124, "validated": 55, "invalidated": 0, "other": 69, "has_audio": 119, "avg_recordings_per_question": 
2.79 }, "audios": { "total": 332, "transcribed_validated": 14, "transcribed_pending": 3, "not_transcribed": 315 }, "transcriptions": { "total": 17, "validated": 14, "not_yet_validated": 3, "edited": 4, "edited_pct": 23.53 }, "duration": { "total_ms": 2276136, "total_hrs": 0.63, "validated_ms": 125136, "validated_hrs": 0.03, "avg_ms": 6856, "min_ms": 180, "max_ms": 30456, "avg_chars_per_sec": 8.27 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0.6627, "twenties": 0.1024, "thirties": 0.0783, "fourties": 0.1446, "fifties": 0.012, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.7831, "male_masculine": 0.0753, "female_feminine": 0.1386, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.003 }, "accent": { "americacentral": 0, "andino": 117, "canario": 0, "caribe": 0, "centrosurpeninsular": 0, "chileno": 0, "filipinas": 0, "mexicano": 0, "nortepeninsular": 117, "rioplatense": 11, "surpeninsular": 4, "user_defined": 116, "": 83 }, "variant": { "": 332 } }, "reported": { "clips": 3, "reasons": { "different_language": 3 } }, "size": 13131569, "checksum": "21e86b551f54c81ed2ef692640fc7dffc3b8ee12f4b15b00e2a618a7a386928f", "generated_at": "2026-03-15T02:25:34.161Z" }, "fr": { "locale": "fr", "clips": 445, "users": 46, "questions": { "total": 155, "validated": 54, "invalidated": 3, "other": 98, "has_audio": 138, "avg_recordings_per_question": 3.22 }, "audios": { "total": 445, "transcribed_validated": 152, "transcribed_pending": 0, "not_transcribed": 293 }, "transcriptions": { "total": 152, "validated": 152, "not_yet_validated": 0, "edited": 34, "edited_pct": 22.37 }, "duration": { "total_ms": 5924268, "total_hrs": 1.64, "validated_ms": 2156508, "validated_hrs": 0.59, "avg_ms": 13313, "min_ms": 
288, "max_ms": 90180, "avg_chars_per_sec": 9 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.0202, "teens": 0.0022, "twenties": 0.1843, "thirties": 0.1798, "fourties": 0.0337, "fifties": 0.1258, "sixties": 0.0315, "seventies": 0.3933, "eighties": 0.0292, "nineties": 0 }, "gender": { "": 0.3079, "male_masculine": 0.3798, "female_feminine": 0.2787, "intersex": 0, "transgender": 0.0112, "non-binary": 0.0225, "do_not_wish_to_say": 0 }, "accent": { "algeria": 1, "belgium": 0, "benin": 0, "cameroon": 0, "canada": 0, "central_african_republic": 0, "comoros": 0, "congo_kinshasa": 0, "cote_d_ivoire": 0, "cyprus": 0, "djibouti": 0, "equatorial_guinea": 0, "fr-metro-east": 25, "fr-metro-north": 0, "fr-metro-south": 32, "fr-metro-west": 0, "french_guiana": 0, "french_polynesia": 0, "germany": 5, "guadeloupe": 11, "haiti": 0, "italy": 0, "luxembourg": 0, "madagascar": 0, "mali": 0, "martinique": 0, "mauritania": 0, "mauritius": 0, "mayotte": 0, "monaco": 0, "morocco": 0, "netherlands": 0, "new_caledonia": 0, "niger": 0, "reunion": 0, "romania": 0, "senegal": 0, "seychelles": 0, "st_barthelemy": 0, "st_martin": 0, "st_pierre_et_miquelon": 0, "switzerland": 0, "syria": 0, "togo": 0, "tunisia": 0, "united_kingdom": 0, "united_states": 0, "vanuatu": 0, "wallis_et_futuna": 0, "user_defined": 1, "": 370 }, "variant": { "fr-asia": 0, "fr-droum": 11, "fr-europe": 0, "fr-metro": 333, "fr-nafrica": 1, "fr-namerica": 0, "fr-safrica": 0, "fr-samerica": 0, "": 100 } }, "reported": { "clips": 13, "reasons": { "different_language": 12, "other": 1 } }, "size": 33561267, "checksum": "a19a7362ca57c547e215e4efec5a76c2c9985c5f9b5ee9f61f4681aed516bdd3", "generated_at": "2026-03-15T02:15:23.169Z" }, "fy-NL": { "locale": "fy-NL", "clips": 3, "users": 1, "questions": { "total": 71, 
"validated": 0, "invalidated": 0, "other": 71, "has_audio": 3, "avg_recordings_per_question": 1 }, "audios": { "total": 3, "transcribed_validated": 0, "transcribed_pending": 0, "not_transcribed": 3 }, "transcriptions": { "total": 0, "validated": 0, "not_yet_validated": 0, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 56196, "total_hrs": 0.01, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 18732, "min_ms": 16956, "max_ms": 21312, "avg_chars_per_sec": 0 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 0, "fourties": 0, "fifties": 1, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0, "male_masculine": 0, "female_feminine": 1, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "user_defined": 3 }, "variant": { "": 3 } }, "reported": { "clips": 0, "reasons": {} }, "size": 331004, "checksum": "5217925533c94a77e3f3d36a1bf96167b5f07134df100ab8530a683faad7ad00", "generated_at": "2026-03-15T02:25:30.797Z" }, "ga-IE": { "locale": "ga-IE", "clips": 36, "users": 4, "questions": { "total": 104, "validated": 0, "invalidated": 0, "other": 104, "has_audio": 32, "avg_recordings_per_question": 1.13 }, "audios": { "total": 36, "transcribed_validated": 4, "transcribed_pending": 4, "not_transcribed": 28 }, "transcriptions": { "total": 8, "validated": 4, "not_yet_validated": 4, "edited": 1, "edited_pct": 12.5 }, "duration": { "total_ms": 558684, "total_hrs": 0.15, "validated_ms": 55152, "validated_hrs": 0.01, "avg_ms": 15519, "min_ms": 3816, "max_ms": 54108, "avg_chars_per_sec": 10.09 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": 
{ "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 0.0278, "fourties": 0.7222, "fifties": 0.25, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.0556, "male_masculine": 0.7222, "female_feminine": 0.2222, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "cainteoir_dúchais_gaeltacht": 10, "cainteoir_dúchais_lasmuigh_den_ghaeltacht": 0, "cainteoir_líofa_ní_ó_dhúchas": 0, "foghlaimeoir_fásta_ó_bhéarla": 0, "foghlaimeoir_fásta_ó_theanga_eile": 0, "foghlaimeoir_i_ngaelscoil": 0, "": 26 }, "variant": { "ga-IE-caighd": 0, "ga-IE-chonnact": 3, "ga-IE-mumhan": 7, "ga-IE-uladh": 0, "": 26 } }, "reported": { "clips": 0, "reasons": {} }, "size": 3288315, "checksum": "60fea304b93c4ec8b720a507700121cb638f380cba805d81c02594f54c6ee57d", "generated_at": "2026-03-15T02:25:14.184Z" }, "gl": { "locale": "gl", "clips": 2180, "users": 109, "questions": { "total": 313, "validated": 43, "invalidated": 0, "other": 270, "has_audio": 310, "avg_recordings_per_question": 7.03 }, "audios": { "total": 2180, "transcribed_validated": 96, "transcribed_pending": 931, "not_transcribed": 1153 }, "transcriptions": { "total": 1027, "validated": 96, "not_yet_validated": 931, "edited": 52, "edited_pct": 5.06 }, "duration": { "total_ms": 18519732, "total_hrs": 5.14, "validated_ms": 1172700, "validated_hrs": 0.32, "avg_ms": 8495, "min_ms": 180, "max_ms": 497700, "avg_chars_per_sec": 7.63 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.0693, "teens": 0.4321, "twenties": 0.0367, "thirties": 0.0693, "fourties": 0.3761, "fifties": 0.0161, "sixties": 0.0005, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.4014, 
"male_masculine": 0.045, "female_feminine": 0.5161, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.0376 }, "accent": { "atlantico": 148, "central": 155, "neofalante": 130, "normativo": 646, "oriental": 77, "user_defined": 12, "": 1028 }, "variant": { "": 2180 } }, "reported": { "clips": 1, "reasons": { "other": 1 } }, "size": 104637681, "checksum": "874679fe7c278189e38c078e82c21becb0dece937b1f9c084bf325518d0a72d3", "generated_at": "2026-03-15T02:24:55.877Z" }, "gsw": { "locale": "gsw", "clips": 1122, "users": 70, "questions": { "total": 75, "validated": 11, "invalidated": 0, "other": 64, "has_audio": 71, "avg_recordings_per_question": 15.8 }, "audios": { "total": 1122, "transcribed_validated": 50, "transcribed_pending": 139, "not_transcribed": 933 }, "transcriptions": { "total": 189, "validated": 50, "not_yet_validated": 139, "edited": 47, "edited_pct": 24.87 }, "duration": { "total_ms": 22098636, "total_hrs": 6.13, "validated_ms": 669456, "validated_hrs": 0.18, "avg_ms": 19696, "min_ms": 468, "max_ms": 144828, "avg_chars_per_sec": 7.3 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.0624, "teens": 0, "twenties": 0.0446, "thirties": 0.0455, "fourties": 0.1194, "fifties": 0.1658, "sixties": 0.4545, "seventies": 0.1078, "eighties": 0, "nineties": 0 }, "gender": { "": 0.5339, "male_masculine": 0, "female_feminine": 0.4528, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.0134 }, "accent": { "user_defined": 57, "": 1065 }, "variant": { "gsw-FR-hochalem": 11, "gsw-FR-nordalem": 207, "gsw-FR-rhinfran": 24, "gsw-FR-sudnalem": 4, "gsw-FR-sudrfran": 19, "": 857 } }, "reported": { "clips": 26, "reasons": { "other": 18, "offensive_speech": 3, "different_language": 5 } }, "size": 121740825, "checksum": 
"88fe1202639de5915a3801f5582c90bde23668b10bf6711f0cef66e78708498b", "generated_at": "2026-03-15T02:24:15.660Z" }, "gv": { "locale": "gv", "clips": 130, "users": 1, "questions": { "total": 250, "validated": 0, "invalidated": 0, "other": 250, "has_audio": 130, "avg_recordings_per_question": 1 }, "audios": { "total": 130, "transcribed_validated": 0, "transcribed_pending": 0, "not_transcribed": 130 }, "transcriptions": { "total": 0, "validated": 0, "not_yet_validated": 0, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 2770956, "total_hrs": 0.76, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 21315, "min_ms": 3780, "max_ms": 50220, "avg_chars_per_sec": 0 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 1, "thirties": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0, "male_masculine": 1, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 130 }, "variant": { "": 130 } }, "reported": { "clips": 0, "reasons": {} }, "size": 16143715, "checksum": "e8e08575fefc0c6d05a15d5b614f179732c0ec0662ded42af4bed3eedf60f166", "generated_at": "2026-03-15T02:21:39.572Z" }, "hac": { "locale": "hac", "clips": 421, "users": 19, "questions": { "total": 39, "validated": 0, "invalidated": 0, "other": 39, "has_audio": 39, "avg_recordings_per_question": 10.79 }, "audios": { "total": 421, "transcribed_validated": 14, "transcribed_pending": 13, "not_transcribed": 394 }, "transcriptions": { "total": 27, "validated": 14, "not_yet_validated": 13, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 40291704, "total_hrs": 11.19, "validated_ms": 1334844, "validated_hrs": 0.37, "avg_ms": 95705, "min_ms": 1296, "max_ms": 
680256, "avg_chars_per_sec": 6.29 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.2138, "teens": 0, "twenties": 0, "thirties": 0.0926, "fourties": 0.4584, "fifties": 0.1425, "sixties": 0, "seventies": 0.0926, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2922, "male_masculine": 0.1568, "female_feminine": 0.5511, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 421 }, "variant": { "": 421 } }, "reported": { "clips": 0, "reasons": {} }, "size": 236289904, "checksum": "5f2f258a5e1ddaf892fcfc4a3ef6e01066dad87363e16dd415aed0a3e1d8de6a", "generated_at": "2026-03-15T02:21:22.360Z" }, "hch": { "locale": "hch", "clips": 1553, "users": 10, "questions": { "total": 236, "validated": 0, "invalidated": 0, "other": 236, "has_audio": 236, "avg_recordings_per_question": 6.58 }, "audios": { "total": 1553, "transcribed_validated": 1549, "transcribed_pending": 0, "not_transcribed": 4 }, "transcriptions": { "total": 1549, "validated": 1549, "not_yet_validated": 0, "edited": 1524, "edited_pct": 98.39 }, "duration": { "total_ms": 36508932, "total_hrs": 10.14, "validated_ms": 36443448, "validated_hrs": 10.12, "avg_ms": 23509, "min_ms": 72, "max_ms": 114300, "avg_chars_per_sec": 7.68 }, "buckets": { "train": { "clips": 599, "users": 4, "duration_ms": 11369592, "duration_hrs": 3.15 }, "dev": { "clips": 491, "users": 4, "duration_ms": 12522456, "duration_hrs": 3.47 }, "test": { "clips": 459, "users": 2, "duration_ms": 12551400, "duration_hrs": 3.48 } }, "demographics": { "age": { "": 0.0064, "teens": 0, "twenties": 0.1442, "thirties": 0.3039, "fourties": 0, "fifties": 0.2415, "sixties": 0.3039, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3999, "male_masculine": 0, "female_feminine": 0.6001, "intersex": 0, 
"transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 1553 }, "variant": { "": 1553 } }, "reported": { "clips": 0, "reasons": {} }, "size": 209213660, "checksum": "62ce26ff87e2fabd078fa560459194cd21933aa3d53c8dffbf0409ed122184ee", "generated_at": "2026-03-15T02:15:48.604Z" }, "hr": { "locale": "hr", "clips": 4, "users": 1, "questions": { "total": 60, "validated": 0, "invalidated": 0, "other": 60, "has_audio": 4, "avg_recordings_per_question": 1 }, "audios": { "total": 4, "transcribed_validated": 0, "transcribed_pending": 2, "not_transcribed": 2 }, "transcriptions": { "total": 2, "validated": 0, "not_yet_validated": 2, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 52056, "total_hrs": 0.01, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 13014, "min_ms": 6156, "max_ms": 24732, "avg_chars_per_sec": 7.04 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 1, "thirties": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0, "male_masculine": 0, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 1 }, "accent": { "": 4 }, "variant": { "": 4 } }, "reported": { "clips": 0, "reasons": {} }, "size": 291957, "checksum": "7a50b54e981d0114913e77fd68b4b3b649646df78e1e43d1369dae0f132a042f", "generated_at": "2026-03-15T02:26:11.813Z" }, "jv": { "locale": "jv", "clips": 32, "users": 3, "questions": { "total": 315, "validated": 0, "invalidated": 0, "other": 315, "has_audio": 31, "avg_recordings_per_question": 1.03 }, "audios": { "total": 32, "transcribed_validated": 0, "transcribed_pending": 14, "not_transcribed": 18 }, "transcriptions": { "total": 14, "validated": 0, "not_yet_validated": 14, "edited": 0, 
"edited_pct": 0 }, "duration": { "total_ms": 647892, "total_hrs": 0.17, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 20247, "min_ms": 8316, "max_ms": 31176, "avg_chars_per_sec": 10.03 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 0.7188, "fourties": 0.2813, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 32 }, "variant": { "jv-kulon": 0, "jv-tengah": 0, "jv-timur": 0, "": 32 } }, "reported": { "clips": 0, "reasons": {} }, "size": 3852678, "checksum": "8fb7ce5cfe2c0f7d243c5ee8b6dba6a52560f01b88364b463da2eed930bfe633", "generated_at": "2026-03-15T02:26:08.092Z" }, "ka": { "locale": "ka", "clips": 118, "users": 7, "questions": { "total": 79, "validated": 0, "invalidated": 0, "other": 79, "has_audio": 58, "avg_recordings_per_question": 2.03 }, "audios": { "total": 118, "transcribed_validated": 60, "transcribed_pending": 5, "not_transcribed": 53 }, "transcriptions": { "total": 65, "validated": 60, "not_yet_validated": 5, "edited": 10, "edited_pct": 15.38 }, "duration": { "total_ms": 2118204, "total_hrs": 0.58, "validated_ms": 512712, "validated_hrs": 0.14, "avg_ms": 17951, "min_ms": 3060, "max_ms": 107748, "avg_chars_per_sec": 9.49 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.1356, "thirties": 0.339, "fourties": 0.4492, "fifties": 0, "sixties": 0.0763, "seventies": 0, "eighties": 
0, "nineties": 0 }, "gender": { "": 0, "male_masculine": 0.0508, "female_feminine": 0.9492, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 118 }, "variant": { "": 118 } }, "reported": { "clips": 0, "reasons": {} }, "size": 12174471, "checksum": "fb1007e2f6dc578d7a25534810aacc52c0eb56d8ebe71a0e80dba347a5285c7b", "generated_at": "2026-03-15T02:24:07.914Z" }, "kbd": { "locale": "kbd", "clips": 1287, "users": 42, "questions": { "total": 155, "validated": 0, "invalidated": 0, "other": 155, "has_audio": 155, "avg_recordings_per_question": 8.3 }, "audios": { "total": 1287, "transcribed_validated": 793, "transcribed_pending": 0, "not_transcribed": 494 }, "transcriptions": { "total": 793, "validated": 793, "not_yet_validated": 0, "edited": 244, "edited_pct": 30.77 }, "duration": { "total_ms": 44881560, "total_hrs": 12.46, "validated_ms": 29586564, "validated_hrs": 8.21, "avg_ms": 34873, "min_ms": 1908, "max_ms": 307440, "avg_chars_per_sec": 8.86 }, "buckets": { "train": { "clips": 493, "users": 6, "duration_ms": 17981316, "duration_hrs": 4.99 }, "dev": { "clips": 140, "users": 9, "duration_ms": 5402808, "duration_hrs": 1.5 }, "test": { "clips": 160, "users": 11, "duration_ms": 6202440, "duration_hrs": 1.72 } }, "demographics": { "age": { "": 0.0443, "teens": 0.0039, "twenties": 0.1437, "thirties": 0.2238, "fourties": 0.2991, "fifties": 0.265, "sixties": 0.0202, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.1228, "male_masculine": 0.5237, "female_feminine": 0.3535, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "bakhsan": 36, "besleney": 0, "kuban": 0, "kuban_zelenchuk": 20, "malka": 1, "mozdok": 0, "terek": 70, "user_defined": 26, "": 1135 }, "variant": { "kbd-Cyrl": 385, "kbd-Cyrl-JOR": 0, "kbd-Cyrl-SY": 0, "kbd-Cyrl-TR": 26, "kbd-Latn-TR-t-kbd-cyrl": 0, "kbd-RU": 119, "": 757 } }, "reported": { "clips": 35, "reasons": { "different_language": 1, "other": 34 } }, 
"size": 179225339, "checksum": "eafabe1a3e560965ba13b3d947f2ca6c82fcd79322d9ec6927b0fa0b90429b9d", "generated_at": "2026-03-15T02:21:57.585Z" }, "kcn": { "locale": "kcn", "clips": 2715, "users": 26, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 22.63 }, "audios": { "total": 2715, "transcribed_validated": 1894, "transcribed_pending": 8, "not_transcribed": 813 }, "transcriptions": { "total": 1902, "validated": 1894, "not_yet_validated": 8, "edited": 355, "edited_pct": 18.66 }, "duration": { "total_ms": 50285700, "total_hrs": 13.96, "validated_ms": 34896636, "validated_hrs": 9.69, "avg_ms": 18521, "min_ms": 108, "max_ms": 104436, "avg_chars_per_sec": 6.73 }, "buckets": { "train": { "clips": 1266, "users": 17, "duration_ms": 23090040, "duration_hrs": 6.41 }, "dev": { "clips": 318, "users": 4, "duration_ms": 6204384, "duration_hrs": 1.72 }, "test": { "clips": 310, "users": 5, "duration_ms": 5602212, "duration_hrs": 1.55 } }, "demographics": { "age": { "": 0.1738, "teens": 0, "twenties": 0.0431, "thirties": 0.2431, "fourties": 0.1271, "fifties": 0.2936, "sixties": 0.0773, "seventies": 0, "eighties": 0.042, "nineties": 0 }, "gender": { "": 0.1738, "male_masculine": 0.54, "female_feminine": 0.2862, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 2715 }, "variant": { "": 2715 } }, "reported": { "clips": 284, "reasons": { "other": 261, "personally_identifiable_information": 6, "offensive_speech": 5, "different_language": 12 } }, "size": 316940236, "checksum": "034cf7c6b8b2e94006200e4722bff94092f040ad5549087a3a4d21532d743e28", "generated_at": "2026-03-15T02:17:46.195Z" }, "koo": { "locale": "koo", "clips": 3233, "users": 28, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 26.94 }, "audios": { "total": 3233, "transcribed_validated": 2211, "transcribed_pending": 10, 
"not_transcribed": 1012 }, "transcriptions": { "total": 2221, "validated": 2211, "not_yet_validated": 10, "edited": 172, "edited_pct": 7.74 }, "duration": { "total_ms": 53618940, "total_hrs": 14.89, "validated_ms": 36472068, "validated_hrs": 10.13, "avg_ms": 16585, "min_ms": 288, "max_ms": 112968, "avg_chars_per_sec": 7.42 }, "buckets": { "train": { "clips": 1478, "users": 18, "duration_ms": 23556888, "duration_hrs": 6.54 }, "dev": { "clips": 392, "users": 5, "duration_ms": 6318216, "duration_hrs": 1.75 }, "test": { "clips": 341, "users": 4, "duration_ms": 6596964, "duration_hrs": 1.83 } }, "demographics": { "age": { "": 0.0371, "teens": 0, "twenties": 0.2221, "thirties": 0.5193, "fourties": 0.1847, "fifties": 0.0368, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3699, "male_masculine": 0.2595, "female_feminine": 0.3706, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 3233 }, "variant": { "": 3233 } }, "reported": { "clips": 1, "reasons": { "other": 1 } }, "size": 294405447, "checksum": "e9280b23618ff0c9311117c6767dd760c41d3aba42cba0db3ad7e24a8e015fc5", "generated_at": "2026-03-15T02:22:39.007Z" }, "kzi": { "locale": "kzi", "clips": 1717, "users": 21, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 14.31 }, "audios": { "total": 1717, "transcribed_validated": 59, "transcribed_pending": 1658, "not_transcribed": 0 }, "transcriptions": { "total": 1717, "validated": 59, "not_yet_validated": 1658, "edited": 126, "edited_pct": 7.34 }, "duration": { "total_ms": 34264404, "total_hrs": 9.51, "validated_ms": 812340, "validated_hrs": 0.22, "avg_ms": 19956, "min_ms": 720, "max_ms": 69876, "avg_chars_per_sec": 7.62 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, 
"duration_hrs": 0 } }, "demographics": { "age": { "": 0.2656, "teens": 0, "twenties": 0, "thirties": 0, "fourties": 0, "fifties": 0.2097, "sixties": 0.3069, "seventies": 0.2178, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2656, "male_masculine": 0.3722, "female_feminine": 0.3623, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 1717 }, "variant": { "": 1717 } }, "reported": { "clips": 0, "reasons": {} }, "size": 203774050, "checksum": "206001f1fa8feb1d8c1e8b3f1d6090e01eb138ca537446372282cb8f6ed41c56", "generated_at": "2026-03-15T02:13:50.664Z" }, "led": { "locale": "led", "clips": 2851, "users": 26, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 23.76 }, "audios": { "total": 2851, "transcribed_validated": 1897, "transcribed_pending": 30, "not_transcribed": 924 }, "transcriptions": { "total": 1927, "validated": 1897, "not_yet_validated": 30, "edited": 425, "edited_pct": 22.06 }, "duration": { "total_ms": 54529056, "total_hrs": 15.14, "validated_ms": 36313488, "validated_hrs": 10.08, "avg_ms": 19126, "min_ms": 720, "max_ms": 54036, "avg_chars_per_sec": 6.21 }, "buckets": { "train": { "clips": 1210, "users": 14, "duration_ms": 22821948, "duration_hrs": 6.33 }, "dev": { "clips": 327, "users": 5, "duration_ms": 6471144, "duration_hrs": 1.79 }, "test": { "clips": 360, "users": 4, "duration_ms": 7020396, "duration_hrs": 1.95 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.0972, "thirties": 0.483, "fourties": 0.2525, "fifties": 0.0421, "sixties": 0.1252, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3076, "male_masculine": 0.524, "female_feminine": 0.1684, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 2851 }, "variant": { "": 2851 } }, "reported": { "clips": 0, "reasons": {} }, "size": 283191275, "checksum": 
"f405bf63c75a1bc2fe13197f0ebd4020e08a2503ee9c98471a56ddd07458dc59", "generated_at": "2026-03-15T02:24:34.372Z" }, "lij": { "locale": "lij", "clips": 294, "users": 5, "questions": { "total": 169, "validated": 2, "invalidated": 0, "other": 167, "has_audio": 164, "avg_recordings_per_question": 1.79 }, "audios": { "total": 294, "transcribed_validated": 223, "transcribed_pending": 13, "not_transcribed": 58 }, "transcriptions": { "total": 236, "validated": 223, "not_yet_validated": 13, "edited": 57, "edited_pct": 24.15 }, "duration": { "total_ms": 8522964, "total_hrs": 2.36, "validated_ms": 5941584, "validated_hrs": 1.65, "avg_ms": 28990, "min_ms": 6408, "max_ms": 124920, "avg_chars_per_sec": 6.37 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 0.2551, "fourties": 0, "fifties": 0, "sixties": 0.2143, "seventies": 0.5306, "eighties": 0, "nineties": 0 }, "gender": { "": 0.5306, "male_masculine": 0.2551, "female_feminine": 0.2143, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "user_defined": 2, "": 292 }, "variant": { "": 294 } }, "reported": { "clips": 0, "reasons": {} }, "size": 50705895, "checksum": "3c5e9c2a23a67c3638336961aafd62735b58a78fd305696731cbafdbe5101ec3", "generated_at": "2026-03-15T02:22:54.987Z" }, "lke": { "locale": "lke", "clips": 2743, "users": 26, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 22.86 }, "audios": { "total": 2743, "transcribed_validated": 2269, "transcribed_pending": 11, "not_transcribed": 463 }, "transcriptions": { "total": 2280, "validated": 2269, "not_yet_validated": 11, "edited": 879, "edited_pct": 38.55 }, "duration": { "total_ms": 44597196, "total_hrs": 
12.38, "validated_ms": 35831160, "validated_hrs": 9.95, "avg_ms": 16259, "min_ms": 180, "max_ms": 47736, "avg_chars_per_sec": 7.55 }, "buckets": { "train": { "clips": 1526, "users": 18, "duration_ms": 23658552, "duration_hrs": 6.57 }, "dev": { "clips": 399, "users": 4, "duration_ms": 6733404, "duration_hrs": 1.87 }, "test": { "clips": 344, "users": 4, "duration_ms": 5439204, "duration_hrs": 1.51 } }, "demographics": { "age": { "": 0.4342, "teens": 0.031, "twenties": 0.0824, "thirties": 0.214, "fourties": 0.1659, "fifties": 0.0725, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.4342, "male_masculine": 0.4834, "female_feminine": 0.0824, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 2743 }, "variant": { "": 2743 } }, "reported": { "clips": 32, "reasons": { "different_language": 18, "other": 13, "personally_identifiable_information": 1 } }, "size": 266929548, "checksum": "301052c77618b738d44e13470de9f540bad32e3d5ed95801e1e498df51b91900", "generated_at": "2026-03-15T02:17:23.311Z" }, "lth": { "locale": "lth", "clips": 3126, "users": 29, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 26.05 }, "audios": { "total": 3126, "transcribed_validated": 1809, "transcribed_pending": 2, "not_transcribed": 1315 }, "transcriptions": { "total": 1811, "validated": 1809, "not_yet_validated": 2, "edited": 855, "edited_pct": 47.21 }, "duration": { "total_ms": 55331244, "total_hrs": 15.36, "validated_ms": 32886324, "validated_hrs": 9.13, "avg_ms": 17700, "min_ms": 1620, "max_ms": 1324080, "avg_chars_per_sec": 7.35 }, "buckets": { "train": { "clips": 1248, "users": 20, "duration_ms": 20861100, "duration_hrs": 5.79 }, "dev": { "clips": 385, "users": 5, "duration_ms": 6080112, "duration_hrs": 1.68 }, "test": { "clips": 176, "users": 4, "duration_ms": 5945112, "duration_hrs": 1.65 } }, "demographics": { "age": { "": 0.247, "teens": 0, 
"twenties": 0, "thirties": 0.1942, "fourties": 0.1897, "fifties": 0.1891, "sixties": 0.1801, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3986, "male_masculine": 0.23, "female_feminine": 0.3714, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 3126 }, "variant": { "": 3126 } }, "reported": { "clips": 1170, "reasons": { "other": 510, "different_language": 516, "personally_identifiable_information": 142, "offensive_speech": 2 } }, "size": 334171163, "checksum": "13323eb30b6ac778b075a0b2b4c6193c759e7676284e3414a8383c354849ac34", "generated_at": "2026-03-15T02:18:43.179Z" }, "lv": { "locale": "lv", "clips": 56, "users": 8, "questions": { "total": 101, "validated": 0, "invalidated": 0, "other": 101, "has_audio": 48, "avg_recordings_per_question": 1.17 }, "audios": { "total": 56, "transcribed_validated": 5, "transcribed_pending": 5, "not_transcribed": 46 }, "transcriptions": { "total": 10, "validated": 5, "not_yet_validated": 5, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 1969128, "total_hrs": 0.54, "validated_ms": 162036, "validated_hrs": 0.04, "avg_ms": 35163, "min_ms": 4788, "max_ms": 165960, "avg_chars_per_sec": 8.9 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0.0357, "twenties": 0.3036, "thirties": 0.3214, "fourties": 0.0357, "fifties": 0.3036, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3214, "male_masculine": 0.0536, "female_feminine": 0.625, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "user_defined": 4, "": 52 }, "variant": { "": 56 } }, "reported": { "clips": 0, "reasons": {} }, "size": 11715393, "checksum": "70fa3b43febc2faa0522b6113e251fa7ee6a15ada154ff910db5308d14314c8d", 
"generated_at": "2026-03-15T02:23:18.861Z" }, "meh": { "locale": "meh", "clips": 1057, "users": 16, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 8.81 }, "audios": { "total": 1057, "transcribed_validated": 1048, "transcribed_pending": 0, "not_transcribed": 9 }, "transcriptions": { "total": 1048, "validated": 1048, "not_yet_validated": 0, "edited": 328, "edited_pct": 31.3 }, "duration": { "total_ms": 36567144, "total_hrs": 10.15, "validated_ms": 36197496, "validated_hrs": 10.05, "avg_ms": 34595, "min_ms": 3924, "max_ms": 234864, "avg_chars_per_sec": 6.89 }, "buckets": { "train": { "clips": 532, "users": 7, "duration_ms": 23360148, "duration_hrs": 6.48 }, "dev": { "clips": 288, "users": 5, "duration_ms": 6596352, "duration_hrs": 1.83 }, "test": { "clips": 228, "users": 4, "duration_ms": 6240996, "duration_hrs": 1.73 } }, "demographics": { "age": { "": 0.1987, "teens": 0, "twenties": 0.1627, "thirties": 0.1079, "fourties": 0.1135, "fifties": 0, "sixties": 0.3264, "seventies": 0.0908, "eighties": 0, "nineties": 0 }, "gender": { "": 0.193, "male_masculine": 0.0057, "female_feminine": 0.8013, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 1057 }, "variant": { "": 1057 } }, "reported": { "clips": 1, "reasons": { "other": 1 } }, "size": 212179312, "checksum": "4c45dec572ca5b40a7ec0c058720a0ec1290b87c3a31aab80e5b178251aeee1d", "generated_at": "2026-03-15T02:15:32.072Z" }, "mel": { "locale": "mel", "clips": 1624, "users": 35, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 13.53 }, "audios": { "total": 1624, "transcribed_validated": 28, "transcribed_pending": 1554, "not_transcribed": 42 }, "transcriptions": { "total": 1582, "validated": 28, "not_yet_validated": 1554, "edited": 13, "edited_pct": 0.82 }, "duration": { "total_ms": 37180800, "total_hrs": 10.32, "validated_ms": 
562824, "validated_hrs": 0.15, "avg_ms": 22895, "min_ms": 756, "max_ms": 93348, "avg_chars_per_sec": 7.52 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.9803, "teens": 0, "twenties": 0, "thirties": 0, "fourties": 0.0197, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.9803, "male_masculine": 0, "female_feminine": 0.0197, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 1624 }, "variant": { "": 1624 } }, "reported": { "clips": 0, "reasons": {} }, "size": 219268263, "checksum": "5d4af6b1977147aa94af60aef1af32128544fa2cc58e54d8afed202d530fa0ff", "generated_at": "2026-03-15T02:13:15.004Z" }, "mmc": { "locale": "mmc", "clips": 871, "users": 12, "questions": { "total": 195, "validated": 0, "invalidated": 0, "other": 195, "has_audio": 195, "avg_recordings_per_question": 4.47 }, "audios": { "total": 871, "transcribed_validated": 871, "transcribed_pending": 0, "not_transcribed": 0 }, "transcriptions": { "total": 871, "validated": 871, "not_yet_validated": 0, "edited": 305, "edited_pct": 35.02 }, "duration": { "total_ms": 40443984, "total_hrs": 11.23, "validated_ms": 40443984, "validated_hrs": 11.23, "avg_ms": 46434, "min_ms": 3348, "max_ms": 332820, "avg_chars_per_sec": 5.27 }, "buckets": { "train": { "clips": 607, "users": 6, "duration_ms": 29374560, "duration_hrs": 8.15 }, "dev": { "clips": 183, "users": 3, "duration_ms": 5540400, "duration_hrs": 1.53 }, "test": { "clips": 81, "users": 3, "duration_ms": 5529024, "duration_hrs": 1.53 } }, "demographics": { "age": { "": 0.07, "teens": 0, "twenties": 0.5029, "thirties": 0, "fourties": 0, "fifties": 0.2227, "sixties": 0.2044, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3226, "male_masculine": 0, 
"female_feminine": 0.6774, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 871 }, "variant": { "": 871 } }, "reported": { "clips": 0, "reasons": {} }, "size": 237404606, "checksum": "8f946fd3238d7216570d5e633c6532519f07a549f78add7022b353709fcfc885", "generated_at": "2026-03-15T02:16:37.997Z" }, "ms-MY": { "locale": "ms-MY", "clips": 1150, "users": 24, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 119, "avg_recordings_per_question": 9.66 }, "audios": { "total": 1150, "transcribed_validated": 1054, "transcribed_pending": 80, "not_transcribed": 16 }, "transcriptions": { "total": 1134, "validated": 1054, "not_yet_validated": 80, "edited": 94, "edited_pct": 8.29 }, "duration": { "total_ms": 22315860, "total_hrs": 6.19, "validated_ms": 20302020, "validated_hrs": 5.63, "avg_ms": 19405, "min_ms": 1800, "max_ms": 208476, "avg_chars_per_sec": 8.59 }, "buckets": { "train": { "clips": 425, "users": 9, "duration_ms": 8071380, "duration_hrs": 2.24 }, "dev": { "clips": 333, "users": 6, "duration_ms": 6789744, "duration_hrs": 1.88 }, "test": { "clips": 296, "users": 5, "duration_ms": 5440896, "duration_hrs": 1.51 } }, "demographics": { "age": { "": 0.3591, "teens": 0, "twenties": 0.113, "thirties": 0.2635, "fourties": 0.1391, "fifties": 0, "sixties": 0.0661, "seventies": 0.0591, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3565, "male_masculine": 0.2035, "female_feminine": 0.4322, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.0078 }, "accent": { "": 1150 }, "variant": { "": 1150 } }, "reported": { "clips": 1, "reasons": { "offensive_speech": 1 } }, "size": 132747123, "checksum": "714bf168d393ee00a478c56a78359a5d30b6137cb6be4e2b901f3bf2b786e065", "generated_at": "2026-03-15T02:12:44.058Z" }, "msi": { "locale": "msi", "clips": 2277, "users": 33, "questions": { "total": 119, "validated": 0, "invalidated": 0, "other": 119, "has_audio": 119, 
"avg_recordings_per_question": 19.13 }, "audios": { "total": 2277, "transcribed_validated": 4, "transcribed_pending": 2273, "not_transcribed": 0 }, "transcriptions": { "total": 2277, "validated": 4, "not_yet_validated": 2273, "edited": 3, "edited_pct": 0.13 }, "duration": { "total_ms": 48764700, "total_hrs": 13.54, "validated_ms": 70020, "validated_hrs": 0.01, "avg_ms": 21416, "min_ms": 864, "max_ms": 89316, "avg_chars_per_sec": 8.5 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.7686, "teens": 0, "twenties": 0.0654, "thirties": 0, "fourties": 0.0733, "fifties": 0.0487, "sixties": 0.0439, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.7686, "male_masculine": 0.061, "female_feminine": 0.1704, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 2277 }, "variant": { "": 2277 } }, "reported": { "clips": 45, "reasons": { "other": 39, "different_language": 6 } }, "size": 291438520, "checksum": "0b0349d2c7d5c1b79c9826df2a1d8efdcaadf2836e1fc20f31e42a688f46bb4c", "generated_at": "2026-03-15T02:14:09.370Z" }, "nl": { "locale": "nl", "clips": 20, "users": 2, "questions": { "total": 98, "validated": 0, "invalidated": 0, "other": 98, "has_audio": 19, "avg_recordings_per_question": 1.05 }, "audios": { "total": 20, "transcribed_validated": 0, "transcribed_pending": 1, "not_transcribed": 19 }, "transcriptions": { "total": 1, "validated": 0, "not_yet_validated": 1, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 437904, "total_hrs": 0.12, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 21895, "min_ms": 3312, "max_ms": 54144, "avg_chars_per_sec": 8.48 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, 
"duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 0.6, "fourties": 0.4, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0, "male_masculine": 1, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "aruba": 0, "belgium": 0, "curacao": 0, "france": 0, "germany": 0, "indonesia": 0, "namibia": 0, "netherlands": 20, "sint_maarten": 0, "south_africa": 0, "suriname": 0 }, "variant": { "": 20 } }, "reported": { "clips": 0, "reasons": {} }, "size": 2541866, "checksum": "d28ad97f96b5d47fb1fe7a80d4a84b228e89a3dfbace751d62c3f26e9c3c4f99", "generated_at": "2026-03-15T02:26:01.759Z" }, "pne": { "locale": "pne", "clips": 2595, "users": 24, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 21.63 }, "audios": { "total": 2595, "transcribed_validated": 2594, "transcribed_pending": 0, "not_transcribed": 1 }, "transcriptions": { "total": 2594, "validated": 2594, "not_yet_validated": 0, "edited": 424, "edited_pct": 16.35 }, "duration": { "total_ms": 43702704, "total_hrs": 12.13, "validated_ms": 43675992, "validated_hrs": 12.13, "avg_ms": 16841, "min_ms": 1800, "max_ms": 36324, "avg_chars_per_sec": 8.44 }, "buckets": { "train": { "clips": 1869, "users": 17, "duration_ms": 31354020, "duration_hrs": 8.7 }, "dev": { "clips": 365, "users": 4, "duration_ms": 6040440, "duration_hrs": 1.67 }, "test": { "clips": 360, "users": 3, "duration_ms": 6281532, "duration_hrs": 1.74 } }, "demographics": { "age": { "": 0.3083, "teens": 0.185, "twenties": 0.2593, "thirties": 0.1272, "fourties": 0.074, "fifties": 0.0462, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3083, "male_masculine": 0.3048, "female_feminine": 0.3869, "intersex": 0, "transgender": 0, "non-binary": 0, 
"do_not_wish_to_say": 0 }, "accent": { "": 2595 }, "variant": { "": 2595 } }, "reported": { "clips": 1, "reasons": { "other": 1 } }, "size": 260116985, "checksum": "7db45325aae5a9346baf560bf157a6cd629cccc05aa82d20f58589050f773849", "generated_at": "2026-03-15T02:15:08.200Z" }, "ps": { "locale": "ps", "clips": 827, "users": 68, "questions": { "total": 192, "validated": 56, "invalidated": 1, "other": 135, "has_audio": 176, "avg_recordings_per_question": 4.7 }, "audios": { "total": 827, "transcribed_validated": 85, "transcribed_pending": 13, "not_transcribed": 729 }, "transcriptions": { "total": 98, "validated": 85, "not_yet_validated": 13, "edited": 31, "edited_pct": 31.63 }, "duration": { "total_ms": 36875016, "total_hrs": 10.24, "validated_ms": 2126160, "validated_hrs": 0.59, "avg_ms": 44589, "min_ms": 1296, "max_ms": 352548, "avg_chars_per_sec": 13.28 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.006, "teens": 0.0266, "twenties": 0.5586, "thirties": 0.347, "fourties": 0.0375, "fifties": 0.0218, "sixties": 0.0024, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.4474, "male_masculine": 0.0145, "female_feminine": 0.5381, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "user_defined": 108, "": 719 }, "variant": { "": 827 } }, "reported": { "clips": 4, "reasons": { "other": 4 } }, "size": 204618790, "checksum": "ca0c43a7911175ae8105aac2f045d3adf3806173822148056c78311ae72a44c6", "generated_at": "2026-03-15T02:25:47.481Z" }, "qxp": { "locale": "qxp", "clips": 7286, "users": 110, "questions": { "total": 164, "validated": 110, "invalidated": 2, "other": 52, "has_audio": 150, "avg_recordings_per_question": 48.57 }, "audios": { "total": 7286, "transcribed_validated": 1074, "transcribed_pending": 
148, "not_transcribed": 6064 }, "transcriptions": { "total": 1222, "validated": 1074, "not_yet_validated": 148, "edited": 20, "edited_pct": 1.64 }, "duration": { "total_ms": 127113048, "total_hrs": 35.3, "validated_ms": 18658944, "validated_hrs": 5.18, "avg_ms": 17446, "min_ms": 720, "max_ms": 1715616, "avg_chars_per_sec": 8.02 }, "buckets": { "train": { "clips": 322, "users": 8, "duration_ms": 6237900, "duration_hrs": 1.73 }, "dev": { "clips": 388, "users": 6, "duration_ms": 6408612, "duration_hrs": 1.78 }, "test": { "clips": 364, "users": 6, "duration_ms": 6012432, "duration_hrs": 1.67 } }, "demographics": { "age": { "": 0, "teens": 0.0487, "twenties": 0.398, "thirties": 0.2763, "fourties": 0.088, "fifties": 0.1046, "sixties": 0.064, "seventies": 0.0205, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2346, "male_masculine": 0.0586, "female_feminine": 0.7068, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "user_defined": 1901, "": 5412 }, "variant": { "": 7286 } }, "reported": { "clips": 3, "reasons": { "other": 2, "different_language": 1 } }, "size": 732900443, "checksum": "d0f4e6a29c2ba3f4a954903b4f75534cb5cfd4e303591e4a3d232adac494933b", "generated_at": "2026-03-15T02:20:16.728Z" }, "rki": { "locale": "rki", "clips": 182, "users": 12, "questions": { "total": 180, "validated": 55, "invalidated": 0, "other": 125, "has_audio": 90, "avg_recordings_per_question": 2.02 }, "audios": { "total": 182, "transcribed_validated": 66, "transcribed_pending": 12, "not_transcribed": 104 }, "transcriptions": { "total": 78, "validated": 66, "not_yet_validated": 12, "edited": 18, "edited_pct": 23.08 }, "duration": { "total_ms": 1993392, "total_hrs": 0.55, "validated_ms": 432324, "validated_hrs": 0.12, "avg_ms": 10953, "min_ms": 1908, "max_ms": 106380, "avg_chars_per_sec": 9.85 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, 
"test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.011, "teens": 0.0055, "twenties": 0.1593, "thirties": 0.8187, "fourties": 0, "fifties": 0.0055, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.9121, "male_masculine": 0, "female_feminine": 0.0879, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 182 }, "variant": { "": 182 } }, "reported": { "clips": 0, "reasons": {} }, "size": 11745372, "checksum": "e0b0b1e0115ec77bd574384d496743776a90bcabc15dae33ca78491aaea9bd4b", "generated_at": "2026-03-15T02:26:03.522Z" }, "ru": { "locale": "ru", "clips": 513, "users": 15, "questions": { "total": 759, "validated": 666, "invalidated": 1, "other": 92, "has_audio": 331, "avg_recordings_per_question": 1.55 }, "audios": { "total": 513, "transcribed_validated": 427, "transcribed_pending": 64, "not_transcribed": 22 }, "transcriptions": { "total": 491, "validated": 427, "not_yet_validated": 64, "edited": 86, "edited_pct": 17.52 }, "duration": { "total_ms": 11187108, "total_hrs": 3.1, "validated_ms": 8443224, "validated_hrs": 2.34, "avg_ms": 21807, "min_ms": 2448, "max_ms": 94608, "avg_chars_per_sec": 9.44 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 151, "users": 7, "duration_ms": 3041604, "duration_hrs": 0.84 }, "test": { "clips": 276, "users": 6, "duration_ms": 5401620, "duration_hrs": 1.5 } }, "demographics": { "age": { "": 0.0039, "teens": 0.0643, "twenties": 0.8772, "thirties": 0.0546, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.8285, "male_masculine": 0.0936, "female_feminine": 0.078, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "user_defined": 206, "": 324 }, "variant": { "": 513 } }, "reported": { "clips": 12, "reasons": { "different_language": 10, "offensive_speech": 1, 
"other": 1 } }, "size": 65680879, "checksum": "8ab325f675b7d89f63f91c0f5a371c8dbe5692b06c8df6e43ea7bd75140f8734", "generated_at": "2026-03-15T02:25:23.074Z" }, "ruc": { "locale": "ruc", "clips": 2822, "users": 26, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 23.52 }, "audios": { "total": 2822, "transcribed_validated": 1659, "transcribed_pending": 0, "not_transcribed": 1163 }, "transcriptions": { "total": 1659, "validated": 1659, "not_yet_validated": 0, "edited": 181, "edited_pct": 10.91 }, "duration": { "total_ms": 63901116, "total_hrs": 17.75, "validated_ms": 36522720, "validated_hrs": 10.14, "avg_ms": 22644, "min_ms": 720, "max_ms": 130536, "avg_chars_per_sec": 8.65 }, "buckets": { "train": { "clips": 1032, "users": 15, "duration_ms": 23366088, "duration_hrs": 6.49 }, "dev": { "clips": 309, "users": 5, "duration_ms": 6953112, "duration_hrs": 1.93 }, "test": { "clips": 318, "users": 5, "duration_ms": 6203520, "duration_hrs": 1.72 } }, "demographics": { "age": { "": 0.5152, "teens": 0, "twenties": 0.0422, "thirties": 0.2807, "fourties": 0.0801, "fifties": 0, "sixties": 0.0404, "seventies": 0.0415, "eighties": 0, "nineties": 0 }, "gender": { "": 0.5152, "male_masculine": 0.404, "female_feminine": 0.0808, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 2822 }, "variant": { "": 2822 } }, "reported": { "clips": 59, "reasons": { "other": 57, "personally_identifiable_information": 2 } }, "size": 383728305, "checksum": "0aa852d42fb50f8db253755d76fc64f3e8ee079433813db41b558b07b97dbcbf", "generated_at": "2026-03-15T02:18:13.385Z" }, "rwm": { "locale": "rwm", "clips": 2427, "users": 21, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 20.23 }, "audios": { "total": 2427, "transcribed_validated": 1801, "transcribed_pending": 51, "not_transcribed": 575 }, "transcriptions": { 
"total": 1852, "validated": 1801, "not_yet_validated": 51, "edited": 444, "edited_pct": 23.97 }, "duration": { "total_ms": 49025628, "total_hrs": 13.61, "validated_ms": 36494424, "validated_hrs": 10.13, "avg_ms": 20200, "min_ms": 216, "max_ms": 94428, "avg_chars_per_sec": 7.66 }, "buckets": { "train": { "clips": 1143, "users": 13, "duration_ms": 24153408, "duration_hrs": 6.7 }, "dev": { "clips": 384, "users": 4, "duration_ms": 6095916, "duration_hrs": 1.69 }, "test": { "clips": 274, "users": 3, "duration_ms": 6245100, "duration_hrs": 1.73 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.0466, "thirties": 0.2299, "fourties": 0.1887, "fifties": 0.2909, "sixties": 0.0968, "seventies": 0.1471, "eighties": 0, "nineties": 0 }, "gender": { "": 0.1846, "male_masculine": 0.8154, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 2427 }, "variant": { "": 2427 } }, "reported": { "clips": 57, "reasons": { "different_language": 14, "personally_identifiable_information": 2, "offensive_speech": 1, "other": 40 } }, "size": 279548783, "checksum": "3b6cdb89b0c0de8fe4c348099fe0e54ef4f50f92542597ff0ee448f448240cd9", "generated_at": "2026-03-15T02:23:29.296Z" }, "sco": { "locale": "sco", "clips": 715, "users": 21, "questions": { "total": 47, "validated": 0, "invalidated": 0, "other": 47, "has_audio": 47, "avg_recordings_per_question": 15.21 }, "audios": { "total": 715, "transcribed_validated": 680, "transcribed_pending": 0, "not_transcribed": 35 }, "transcriptions": { "total": 680, "validated": 680, "not_yet_validated": 0, "edited": 190, "edited_pct": 27.94 }, "duration": { "total_ms": 40234608, "total_hrs": 11.17, "validated_ms": 38478564, "validated_hrs": 10.68, "avg_ms": 56272, "min_ms": 576, "max_ms": 298656, "avg_chars_per_sec": 10.78 }, "buckets": { "train": { "clips": 421, "users": 13, "duration_ms": 25207452, "duration_hrs": 7 }, "dev": { "clips": 153, "users": 5, "duration_ms": 6894612, 
"duration_hrs": 1.91 }, "test": { "clips": 106, "users": 3, "duration_ms": 6376500, "duration_hrs": 1.77 } }, "demographics": { "age": { "": 0.2867, "teens": 0, "twenties": 0.0587, "thirties": 0.1944, "fourties": 0.0406, "fifties": 0.1664, "sixties": 0.1902, "seventies": 0.0629, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2867, "male_masculine": 0.2881, "female_feminine": 0.386, "intersex": 0, "transgender": 0, "non-binary": 0.0392, "do_not_wish_to_say": 0 }, "accent": { "": 715 }, "variant": { "": 715 } }, "reported": { "clips": 0, "reasons": {} }, "size": 239569384, "checksum": "b1b262805f6db8691c795a911783c8d85cff948e0be47a4c71fbcdd04e2917ff", "generated_at": "2026-03-15T02:19:43.894Z" }, "sdo": { "locale": "sdo", "clips": 1389, "users": 25, "questions": { "total": 119, "validated": 0, "invalidated": 0, "other": 119, "has_audio": 119, "avg_recordings_per_question": 11.67 }, "audios": { "total": 1389, "transcribed_validated": 217, "transcribed_pending": 1172, "not_transcribed": 0 }, "transcriptions": { "total": 1389, "validated": 217, "not_yet_validated": 1172, "edited": 493, "edited_pct": 35.49 }, "duration": { "total_ms": 35331228, "total_hrs": 9.81, "validated_ms": 4778640, "validated_hrs": 1.32, "avg_ms": 25436, "min_ms": 1224, "max_ms": 120924, "avg_chars_per_sec": 7.62 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.9323, "teens": 0, "twenties": 0.0194, "thirties": 0, "fourties": 0, "fifties": 0, "sixties": 0.0482, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.9323, "male_masculine": 0.0677, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 1389 }, "variant": { "": 1389 } }, "reported": { "clips": 9, "reasons": { "different_language": 2, "other": 7 } 
}, "size": 211038687, "checksum": "7c175c6a85056180b828ee987bbf527d0887c98f529483f8b8e732ca22e453a2", "generated_at": "2026-03-15T02:12:57.770Z" }, "seh": { "locale": "seh", "clips": 290, "users": 5, "questions": { "total": 61, "validated": 0, "invalidated": 2, "other": 59, "has_audio": 58, "avg_recordings_per_question": 5 }, "audios": { "total": 290, "transcribed_validated": 1, "transcribed_pending": 2, "not_transcribed": 287 }, "transcriptions": { "total": 3, "validated": 1, "not_yet_validated": 2, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 5832180, "total_hrs": 1.62, "validated_ms": 13248, "validated_hrs": 0, "avg_ms": 20111, "min_ms": 1368, "max_ms": 117828, "avg_chars_per_sec": 8.7 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.2, "thirties": 0.4, "fourties": 0, "fifties": 0.2, "sixties": 0.2, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.6, "male_masculine": 0, "female_feminine": 0.4, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 290 }, "variant": { "": 290 } }, "reported": { "clips": 0, "reasons": {} }, "size": 25942400, "checksum": "4b97105e3d9200db427a8a729c71944c1f9494b905b8b1f0913ceb38ed5245af", "generated_at": "2026-03-15T02:25:28.122Z" }, "shi": { "locale": "shi", "clips": 230, "users": 10, "questions": { "total": 62, "validated": 0, "invalidated": 0, "other": 62, "has_audio": 60, "avg_recordings_per_question": 3.83 }, "audios": { "total": 230, "transcribed_validated": 2, "transcribed_pending": 4, "not_transcribed": 224 }, "transcriptions": { "total": 6, "validated": 2, "not_yet_validated": 4, "edited": 1, "edited_pct": 16.67 }, "duration": { "total_ms": 6360948, "total_hrs": 1.76, "validated_ms": 26496, "validated_hrs": 0, 
"avg_ms": 27656, "min_ms": 3780, "max_ms": 105228, "avg_chars_per_sec": 7.35 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.5696, "thirties": 0.1913, "fourties": 0, "fifties": 0.2391, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.8087, "male_masculine": 0.0304, "female_feminine": 0.1609, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 230 }, "variant": { "": 230 } }, "reported": { "clips": 0, "reasons": {} }, "size": 37243603, "checksum": "4ebd69c0d48a4728d90b5aa489d5d2c948b646fd8b181381941c78c40d52b9b0", "generated_at": "2026-03-15T02:25:37.391Z" }, "si": { "locale": "si", "clips": 21, "users": 1, "questions": { "total": 60, "validated": 0, "invalidated": 0, "other": 60, "has_audio": 21, "avg_recordings_per_question": 1 }, "audios": { "total": 21, "transcribed_validated": 0, "transcribed_pending": 0, "not_transcribed": 21 }, "transcriptions": { "total": 0, "validated": 0, "not_yet_validated": 0, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 446148, "total_hrs": 0.12, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 21245, "min_ms": 9396, "max_ms": 28224, "avg_chars_per_sec": 0 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 0, "fourties": 0, "fifties": 1, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0, "male_masculine": 0, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 1 }, "accent": { "": 21 
}, "variant": { "": 21 } }, "reported": { "clips": 0, "reasons": {} }, "size": 2641297, "checksum": "b86398f7142b555ef900f5a2c8326e93dc98cf049e1cb470570bd02141a98350", "generated_at": "2026-03-15T02:26:13.101Z" }, "sn": { "locale": "sn", "clips": 6, "users": 2, "questions": { "total": 60, "validated": 0, "invalidated": 0, "other": 60, "has_audio": 6, "avg_recordings_per_question": 1 }, "audios": { "total": 6, "transcribed_validated": 0, "transcribed_pending": 0, "not_transcribed": 6 }, "transcriptions": { "total": 0, "validated": 0, "not_yet_validated": 0, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 270108, "total_hrs": 0.07, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 45018, "min_ms": 13536, "max_ms": 74880, "avg_chars_per_sec": 0 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.1667, "thirties": 0, "fourties": 0, "fifties": 0.8333, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.1667, "male_masculine": 0, "female_feminine": 0.8333, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 6 }, "variant": { "": 6 } }, "reported": { "clips": 0, "reasons": {} }, "size": 1608898, "checksum": "d2a20e361c08383d60440001a8dc384fcdd1967498b55a9af79cad696605b05d", "generated_at": "2026-03-15T02:25:32.128Z" }, "snv": { "locale": "snv", "clips": 2036, "users": 30, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 16.97 }, "audios": { "total": 2036, "transcribed_validated": 6, "transcribed_pending": 1101, "not_transcribed": 929 }, "transcriptions": { "total": 1107, "validated": 6, "not_yet_validated": 1101, "edited": 69, "edited_pct": 6.23 }, "duration": { "total_ms": 
37699092, "total_hrs": 10.47, "validated_ms": 87876, "validated_hrs": 0.02, "avg_ms": 18516, "min_ms": 1008, "max_ms": 74448, "avg_chars_per_sec": 5.89 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.6179, "teens": 0, "twenties": 0, "thirties": 0.001, "fourties": 0, "fifties": 0.2883, "sixties": 0.0928, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.6179, "male_masculine": 0.1248, "female_feminine": 0.2574, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 2036 }, "variant": { "": 2036 } }, "reported": { "clips": 0, "reasons": {} }, "size": 223732735, "checksum": "2726dedfcfce7aa90abdc7e49d6a334336285890f407bd86ae2123360d0a6c28", "generated_at": "2026-03-15T02:14:49.199Z" }, "su": { "locale": "su", "clips": 207, "users": 2, "questions": { "total": 131, "validated": 0, "invalidated": 0, "other": 131, "has_audio": 131, "avg_recordings_per_question": 1.58 }, "audios": { "total": 207, "transcribed_validated": 14, "transcribed_pending": 159, "not_transcribed": 34 }, "transcriptions": { "total": 173, "validated": 14, "not_yet_validated": 159, "edited": 1, "edited_pct": 0.58 }, "duration": { "total_ms": 2258640, "total_hrs": 0.62, "validated_ms": 124524, "validated_hrs": 0.03, "avg_ms": 10911, "min_ms": 3348, "max_ms": 34956, "avg_chars_per_sec": 9.35 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 1, "thirties": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3671, "male_masculine": 0, 
"female_feminine": 0.6329, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 207 }, "variant": { "": 207 } }, "reported": { "clips": 0, "reasons": {} }, "size": 13420586, "checksum": "2c9d6cc31e53e046a1cd1b6d48adf8f4dbfe23bd0b7a36929e30609e4d038bd0", "generated_at": "2026-03-15T02:26:09.911Z" }, "th": { "locale": "th", "clips": 15, "users": 2, "questions": { "total": 61, "validated": 1, "invalidated": 0, "other": 60, "has_audio": 14, "avg_recordings_per_question": 1.07 }, "audios": { "total": 15, "transcribed_validated": 3, "transcribed_pending": 0, "not_transcribed": 12 }, "transcriptions": { "total": 3, "validated": 3, "not_yet_validated": 0, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 170316, "total_hrs": 0.04, "validated_ms": 11124, "validated_hrs": 0, "avg_ms": 11354, "min_ms": 2016, "max_ms": 22716, "avg_chars_per_sec": 6.41 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.6, "thirties": 0.4, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.4, "male_masculine": 0.6, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "user_defined": 6, "": 9 }, "variant": { "": 15 } }, "reported": { "clips": 0, "reasons": {} }, "size": 962784, "checksum": "75417443e28da7067a7f408dee5c639b4d39c02b4dd17f8a42e99edfb9eea3e3", "generated_at": "2026-03-15T02:25:40.571Z" }, "tob": { "locale": "tob", "clips": 1572, "users": 25, "questions": { "total": 136, "validated": 0, "invalidated": 0, "other": 136, "has_audio": 136, "avg_recordings_per_question": 11.56 }, "audios": { "total": 1572, "transcribed_validated": 1540, "transcribed_pending": 0, "not_transcribed": 32 }, 
"transcriptions": { "total": 1540, "validated": 1540, "not_yet_validated": 0, "edited": 540, "edited_pct": 35.06 }, "duration": { "total_ms": 36023976, "total_hrs": 10, "validated_ms": 34841304, "validated_hrs": 9.67, "avg_ms": 22916, "min_ms": 108, "max_ms": 191916, "avg_chars_per_sec": 5.23 }, "buckets": { "train": { "clips": 939, "users": 14, "duration_ms": 22775652, "duration_hrs": 6.32 }, "dev": { "clips": 197, "users": 4, "duration_ms": 6448464, "duration_hrs": 1.79 }, "test": { "clips": 404, "users": 7, "duration_ms": 5617188, "duration_hrs": 1.56 } }, "demographics": { "age": { "": 0.4198, "teens": 0, "twenties": 0, "thirties": 0.0827, "fourties": 0.0884, "fifties": 0.2455, "sixties": 0.1635, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.6654, "male_masculine": 0.1718, "female_feminine": 0.1628, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 1572 }, "variant": { "": 1572 } }, "reported": { "clips": 0, "reasons": {} }, "size": 181814535, "checksum": "5b5723329825280db762c27cba69a19008175ffe394d0527b05485febecef3ce", "generated_at": "2026-03-15T02:16:21.896Z" }, "top": { "locale": "top", "clips": 417, "users": 10, "questions": { "total": 150, "validated": 0, "invalidated": 0, "other": 150, "has_audio": 143, "avg_recordings_per_question": 2.92 }, "audios": { "total": 417, "transcribed_validated": 413, "transcribed_pending": 0, "not_transcribed": 4 }, "transcriptions": { "total": 413, "validated": 413, "not_yet_validated": 0, "edited": 159, "edited_pct": 38.5 }, "duration": { "total_ms": 37808280, "total_hrs": 10.5, "validated_ms": 37316664, "validated_hrs": 10.36, "avg_ms": 90667, "min_ms": 5256, "max_ms": 498996, "avg_chars_per_sec": 5.83 }, "buckets": { "train": { "clips": 165, "users": 5, "duration_ms": 13259592, "duration_hrs": 3.68 }, "dev": { "clips": 152, "users": 3, "duration_ms": 10601640, "duration_hrs": 2.94 }, "test": { "clips": 96, "users": 2, "duration_ms": 13455432, 
"duration_hrs": 3.73 } }, "demographics": { "age": { "": 0.0671, "teens": 0, "twenties": 0, "thirties": 0.4269, "fourties": 0.1823, "fifties": 0.048, "sixties": 0.2758, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.247, "male_masculine": 0.3237, "female_feminine": 0.4293, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 417 }, "variant": { "": 417 } }, "reported": { "clips": 0, "reasons": {} }, "size": 216207428, "checksum": "be877fb9c9a63aae4ba3a8f30e242026056fc011b7431f84577df7cd2f8b2dd8", "generated_at": "2026-03-15T02:16:55.267Z" }, "tr": { "locale": "tr", "clips": 46, "users": 10, "questions": { "total": 141, "validated": 1, "invalidated": 0, "other": 140, "has_audio": 41, "avg_recordings_per_question": 1.12 }, "audios": { "total": 46, "transcribed_validated": 22, "transcribed_pending": 0, "not_transcribed": 24 }, "transcriptions": { "total": 22, "validated": 22, "not_yet_validated": 0, "edited": 8, "edited_pct": 36.36 }, "duration": { "total_ms": 981360, "total_hrs": 0.27, "validated_ms": 537984, "validated_hrs": 0.14, "avg_ms": 21334, "min_ms": 4428, "max_ms": 56628, "avg_chars_per_sec": 8.96 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.1087, "teens": 0.0652, "twenties": 0.1957, "thirties": 0.0652, "fourties": 0, "fifties": 0.0652, "sixties": 0.4565, "seventies": 0.0435, "eighties": 0, "nineties": 0 }, "gender": { "": 0.1957, "male_masculine": 0.5, "female_feminine": 0.3043, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "user_defined": 29, "": 20 }, "variant": { "": 46 } }, "reported": { "clips": 1, "reasons": { "different_language": 1 } }, "size": 5850321, "checksum": 
"6558c28c7ac469ccad730cdd2908f2cd507cf765ecd034c08198b28467a6bce3", "generated_at": "2026-03-15T02:20:56.416Z" }, "ttj": { "locale": "ttj", "clips": 3100, "users": 26, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 120, "avg_recordings_per_question": 25.83 }, "audios": { "total": 3100, "transcribed_validated": 1882, "transcribed_pending": 0, "not_transcribed": 1218 }, "transcriptions": { "total": 1882, "validated": 1882, "not_yet_validated": 0, "edited": 783, "edited_pct": 41.6 }, "duration": { "total_ms": 60386832, "total_hrs": 16.77, "validated_ms": 36131184, "validated_hrs": 10.03, "avg_ms": 19480, "min_ms": 3960, "max_ms": 82080, "avg_chars_per_sec": 9.05 }, "buckets": { "train": { "clips": 1245, "users": 15, "duration_ms": 23408460, "duration_hrs": 6.5 }, "dev": { "clips": 373, "users": 4, "duration_ms": 6805008, "duration_hrs": 1.89 }, "test": { "clips": 264, "users": 3, "duration_ms": 5917716, "duration_hrs": 1.64 } }, "demographics": { "age": { "": 0, "teens": 0.0381, "twenties": 0.2316, "thirties": 0.1906, "fourties": 0.3097, "fifties": 0.1913, "sixties": 0.0387, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0, "male_masculine": 0.1935, "female_feminine": 0.8065, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 3100 }, "variant": { "": 3100 } }, "reported": { "clips": 0, "reasons": {} }, "size": 287354889, "checksum": "c8c5b079c42c416d0586097d91cfc1f14c600a64fe09c09dceebfb77190e50e0", "generated_at": "2026-03-15T02:23:52.123Z" }, "tuq": { "locale": "tuq", "clips": 87, "users": 3, "questions": { "total": 70, "validated": 0, "invalidated": 0, "other": 70, "has_audio": 70, "avg_recordings_per_question": 1.24 }, "audios": { "total": 87, "transcribed_validated": 0, "transcribed_pending": 0, "not_transcribed": 87 }, "transcriptions": { "total": 0, "validated": 0, "not_yet_validated": 0, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 2484000, 
"total_hrs": 0.69, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 28552, "min_ms": 1080, "max_ms": 109476, "avg_chars_per_sec": 0 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.1954, "thirties": 0, "fourties": 0, "fifties": 0.8046, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 87 }, "variant": { "": 87 } }, "reported": { "clips": 0, "reasons": {} }, "size": 14274775, "checksum": "6282c9dfd903b9ed8d4c5a93f3aeb6a78a888de3b326dced92f019e46beacead", "generated_at": "2026-03-15T02:26:05.860Z" }, "ukv": { "locale": "ukv", "clips": 2570, "users": 22, "questions": { "total": 119, "validated": 0, "invalidated": 19, "other": 100, "has_audio": 119, "avg_recordings_per_question": 21.6 }, "audios": { "total": 2570, "transcribed_validated": 2202, "transcribed_pending": 4, "not_transcribed": 364 }, "transcriptions": { "total": 2206, "validated": 2202, "not_yet_validated": 4, "edited": 966, "edited_pct": 43.79 }, "duration": { "total_ms": 42464052, "total_hrs": 11.79, "validated_ms": 36133524, "validated_hrs": 10.03, "avg_ms": 16523, "min_ms": 1908, "max_ms": 114768, "avg_chars_per_sec": 6.77 }, "buckets": { "train": { "clips": 1403, "users": 14, "duration_ms": 24026040, "duration_hrs": 6.67 }, "dev": { "clips": 406, "users": 4, "duration_ms": 5558796, "duration_hrs": 1.54 }, "test": { "clips": 393, "users": 4, "duration_ms": 6548688, "duration_hrs": 1.81 } }, "demographics": { "age": { "": 0.4564, "teens": 0, "twenties": 0.1786, "thirties": 0.0891, "fourties": 0.2296, "fifties": 0.0463, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 
0.4564, "male_masculine": 0.1362, "female_feminine": 0.4074, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 2570 }, "variant": { "": 2570 } }, "reported": { "clips": 35, "reasons": { "different_language": 5, "offensive_speech": 1, "other": 29 } }, "size": 249818408, "checksum": "7f295172d261c98abb4a020f3b552db10d2b5c09355b15ef1d17793624bf1cdd", "generated_at": "2026-03-15T02:19:07.597Z" }, "ush": { "locale": "ush", "clips": 664, "users": 10, "questions": { "total": 118, "validated": 0, "invalidated": 0, "other": 118, "has_audio": 118, "avg_recordings_per_question": 5.63 }, "audios": { "total": 664, "transcribed_validated": 644, "transcribed_pending": 7, "not_transcribed": 13 }, "transcriptions": { "total": 651, "validated": 644, "not_yet_validated": 7, "edited": 64, "edited_pct": 9.83 }, "duration": { "total_ms": 18281844, "total_hrs": 5.07, "validated_ms": 17802792, "validated_hrs": 4.94, "avg_ms": 27533, "min_ms": 180, "max_ms": 149760, "avg_chars_per_sec": 11.45 }, "buckets": { "train": { "clips": 139, "users": 3, "duration_ms": 5187024, "duration_hrs": 1.44 }, "dev": { "clips": 214, "users": 4, "duration_ms": 6052680, "duration_hrs": 1.68 }, "test": { "clips": 291, "users": 3, "duration_ms": 6563088, "duration_hrs": 1.82 } }, "demographics": { "age": { "": 0.1988, "teens": 0.1747, "twenties": 0.4714, "thirties": 0.003, "fourties": 0.1521, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.9985, "male_masculine": 0.0015, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 664 }, "variant": { "": 664 } }, "reported": { "clips": 1, "reasons": { "other": 1 } }, "size": 108223224, "checksum": "5b55c54d43e8aad47671918540fc0f9738eeb9265a2dd12cb3c4e8c00d54e5e3", "generated_at": "2026-03-15T02:17:09.171Z" }, "xkl": { "locale": "xkl", "clips": 1795, "users": 23, "questions": { "total": 120, "validated": 0, "invalidated": 
1, "other": 119, "has_audio": 120, "avg_recordings_per_question": 14.96 }, "audios": { "total": 1795, "transcribed_validated": 259, "transcribed_pending": 1516, "not_transcribed": 20 }, "transcriptions": { "total": 1775, "validated": 259, "not_yet_validated": 1516, "edited": 302, "edited_pct": 17.01 }, "duration": { "total_ms": 37481616, "total_hrs": 10.41, "validated_ms": 5204916, "validated_hrs": 1.44, "avg_ms": 20881, "min_ms": 1764, "max_ms": 86400, "avg_chars_per_sec": 6.94 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.6301, "teens": 0, "twenties": 0.1142, "thirties": 0.0875, "fourties": 0.0635, "fifties": 0, "sixties": 0.0507, "seventies": 0.054, "eighties": 0, "nineties": 0 }, "gender": { "": 0.6301, "male_masculine": 0.093, "female_feminine": 0.2769, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 1795 }, "variant": { "": 1795 } }, "reported": { "clips": 0, "reasons": {} }, "size": 223067345, "checksum": "3c231c3d19d4e4e9182476ee2e6bff09dd3410c088d4fdf33bbbc0cda7959da5", "generated_at": "2026-03-15T02:13:32.585Z" } }, "totalDurationMs": 1831734792, "totalValidDurationMs": 968998896, "totalHrs": 508, "totalValidHrs": 269 } ================================================ FILE: datasets/spontaneous-speech/sps-corpus-3.0-delta-2026-03-09.json ================================================ { "locales": { "ady": { "locale": "ady", "clips": 71, "users": 13, "questions": { "total": 128, "validated": 34, "invalidated": 0, "other": 94, "has_audio": 128, "avg_recordings_per_question": 0.55 }, "audios": { "total": 71, "transcribed_validated": 15, "transcribed_pending": 44, "not_transcribed": 12 }, "transcriptions": { "total": 59, "validated": 15, "not_yet_validated": 44, "edited": 3, "edited_pct": 5.08 
}, "duration": { "total_ms": 866736, "total_hrs": 0.24, "validated_ms": 190980, "validated_hrs": 0.05, "avg_ms": 12208, "min_ms": 2988, "max_ms": 50796, "avg_chars_per_sec": 7.1 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.0563, "thirties": 0.1408, "fourties": 0.0704, "fifties": 0.2394, "sixties": 0.0563, "seventies": 0.4366, "eighties": 0, "nineties": 0 }, "gender": { "": 0.4507, "male_masculine": 0.2817, "female_feminine": 0.2676, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "abzakh": 17, "bzhedug": 7, "hatikuay": 0, "shapsug": 16, "temirgoy": 7, "ubykh": 0, "user_defined": 5, "": 36 }, "variant": { "ady-Cyrl": 10, "ady-Cyrl-JOR": 0, "ady-Cyrl-SY": 0, "ady-Cyrl-TR": 21, "ady-Latn-TR-t-ady-cyrl": 0, "ady-RU": 8, "": 32 } }, "reported": { "clips": 3, "reasons": { "other": 2, "different_language": 1 } }, "size": 5342216, "checksum": "1011243fccd070a49cd2bed1abc71c654147afdf01fcfa220f72154e6f21fbed", "generated_at": "2026-03-15T02:04:47.473Z" }, "an": { "locale": "an", "clips": 221, "users": 2, "questions": { "total": 334, "validated": 112, "invalidated": 0, "other": 222, "has_audio": 220, "avg_recordings_per_question": 1 }, "audios": { "total": 221, "transcribed_validated": 10, "transcribed_pending": 10, "not_transcribed": 201 }, "transcriptions": { "total": 20, "validated": 10, "not_yet_validated": 10, "edited": 6, "edited_pct": 30 }, "duration": { "total_ms": 6268320, "total_hrs": 1.74, "validated_ms": 219780, "validated_hrs": 0.06, "avg_ms": 28363, "min_ms": 288, "max_ms": 103536, "avg_chars_per_sec": 8.7 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": 
{ "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 0.0045, "fourties": 0, "fifties": 0, "sixties": 0.9955, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "user_defined": 220, "": 1 }, "variant": { "": 221 } }, "reported": { "clips": 0, "reasons": {} }, "size": 37190086, "checksum": "4424a1d187b02c5828c7f3eeada29b22d207f9cb0e4a6cdc9d38f13aba5d1ff9", "generated_at": "2026-03-15T02:01:39.194Z" }, "br": { "locale": "br", "clips": 2, "users": 2, "questions": { "total": 80, "validated": 5, "invalidated": 0, "other": 75, "has_audio": 62, "avg_recordings_per_question": 0.03 }, "audios": { "total": 2, "transcribed_validated": 0, "transcribed_pending": 1, "not_transcribed": 1 }, "transcriptions": { "total": 1, "validated": 0, "not_yet_validated": 1, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 52524, "total_hrs": 0.01, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 26262, "min_ms": 3528, "max_ms": 48996, "avg_chars_per_sec": 5.95 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 0, "fourties": 0, "fifties": 0.5, "sixties": 0, "seventies": 0.5, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "gwenedeg": 0, "kerneveg": 0, "leoneg": 2, "tregerieg": 0 }, "variant": { "": 2 } }, "reported": { "clips": 0, "reasons": {} }, "size": 308211, "checksum": "a0aecc35007f659107507f4827bd0764ce181f6d456c60ee54f9dbe5905741d3", "generated_at": 
"2026-03-15T02:04:54.499Z" }, "ca": { "locale": "ca", "clips": 9, "users": 3, "questions": { "total": 74, "validated": 14, "invalidated": 0, "other": 60, "has_audio": 60, "avg_recordings_per_question": 0.15 }, "audios": { "total": 9, "transcribed_validated": 0, "transcribed_pending": 3, "not_transcribed": 6 }, "transcriptions": { "total": 3, "validated": 0, "not_yet_validated": 3, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 200700, "total_hrs": 0.05, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 22300, "min_ms": 3276, "max_ms": 39528, "avg_chars_per_sec": 5.17 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.4444, "thirties": 0, "fourties": 0, "fifties": 0, "sixties": 0.5556, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.3333, "male_masculine": 0, "female_feminine": 0.6667, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "balearic": 2, "central": 9, "learner_es": 0, "learner_other": 0, "northern": 0, "northwestern": 0, "valencian": 0 }, "variant": { "ca-algueres": 0, "ca-balear": 2, "ca-central": 7, "ca-northern": 0, "ca-nwestern": 0, "ca-valencia-alacant": 0, "ca-valencia-central": 0, "ca-valencia-northern": 0, "ca-valencia-southern": 0, "ca-valencia-tortosi": 0 } }, "reported": { "clips": 0, "reasons": {} }, "size": 1187087, "checksum": "ae878b6ba6ec707dd885911fb78dc7ec91ac9967f81844bc837561ba35986565", "generated_at": "2026-03-15T02:04:56.050Z" }, "cdo": { "locale": "cdo", "clips": 2, "users": 1, "questions": { "total": 120, "validated": 1, "invalidated": 0, "other": 119, "has_audio": 119, "avg_recordings_per_question": 0.02 }, "audios": { "total": 2, "transcribed_validated": 0, "transcribed_pending": 2, "not_transcribed": 0 }, 
"transcriptions": { "total": 2, "validated": 0, "not_yet_validated": 2, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 11016, "total_hrs": 0, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 5508, "min_ms": 3996, "max_ms": 7020, "avg_chars_per_sec": 1.48 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 0, "fourties": 1, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 2 }, "variant": { "": 2 } }, "reported": { "clips": 0, "reasons": {} }, "size": 65722, "checksum": "57cca02074fc25611d893ccc6ea3617fa0af565d763509299a97fd7933a83182", "generated_at": "2026-03-15T02:01:22.489Z" }, "cy": { "locale": "cy", "clips": 5, "users": 1, "questions": { "total": 60, "validated": 0, "invalidated": 0, "other": 60, "has_audio": 16, "avg_recordings_per_question": 0.31 }, "audios": { "total": 5, "transcribed_validated": 0, "transcribed_pending": 0, "not_transcribed": 5 }, "transcriptions": { "total": 0, "validated": 0, "not_yet_validated": 0, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 125928, "total_hrs": 0.03, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 25186, "min_ms": 17928, "max_ms": 36756, "avg_chars_per_sec": 0 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 1, "thirties": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, 
"gender": { "": 0, "male_masculine": 0, "female_feminine": 1, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "united_kingdom": 5 }, "variant": { "cy-midwales": 0, "cy-northeas": 0, "cy-northwes": 0, "cy-southeas": 0, "cy-southwes": 0, "cy-wladfa": 0, "": 5 } }, "reported": { "clips": 0, "reasons": {} }, "size": 745371, "checksum": "4ec793ca1ce2e121de330d5c15133a6dd1c1fa59d5e1c1f9e88f15bab4f9f321", "generated_at": "2026-03-15T02:04:51.989Z" }, "da": { "locale": "da", "clips": 1, "users": 1, "questions": { "total": 94, "validated": 6, "invalidated": 0, "other": 88, "has_audio": 1, "avg_recordings_per_question": 1 }, "audios": { "total": 1, "transcribed_validated": 0, "transcribed_pending": 0, "not_transcribed": 1 }, "transcriptions": { "total": 0, "validated": 0, "not_yet_validated": 0, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 10260, "total_hrs": 0, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 10260, "min_ms": 10260, "max_ms": 10260, "avg_chars_per_sec": 0 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 1, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 1 }, "variant": { "": 1 } }, "reported": { "clips": 0, "reasons": {} }, "size": 61145, "checksum": "25894a541d09d6b6cfe002ab27c2c310ca55d37eadc6598935b85687e17f1775", "generated_at": "2026-03-15T02:04:53.295Z" }, "de": { "locale": "de", "clips": 26, "users": 12, "questions": { "total": 97, "validated": 30, "invalidated": 6, "other": 61, "has_audio": 81, "avg_recordings_per_question": 0.32 }, "audios": { 
"total": 26, "transcribed_validated": 7, "transcribed_pending": 4, "not_transcribed": 15 }, "transcriptions": { "total": 11, "validated": 7, "not_yet_validated": 4, "edited": 2, "edited_pct": 18.18 }, "duration": { "total_ms": 296928, "total_hrs": 0.08, "validated_ms": 56988, "validated_hrs": 0.01, "avg_ms": 11420, "min_ms": 3240, "max_ms": 42408, "avg_chars_per_sec": 10.26 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0.1154, "twenties": 0.3846, "thirties": 0.1538, "fourties": 0, "fifties": 0.1538, "sixties": 0.1923, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2692, "male_masculine": 0.2692, "female_feminine": 0.4231, "intersex": 0, "transgender": 0.0385, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "austria": 0, "belgium": 0, "brazil": 0, "bulgaria": 0, "canada": 0, "czechia": 0, "denmark": 0, "finland": 0, "france": 0, "germany": 19, "greece": 0, "hungary": 0, "italy": 0, "kazakhstan": 0, "liechtenstein": 0, "lithuania": 0, "luxembourg": 0, "namibia": 0, "netherlands": 0, "paraguay": 0, "poland": 0, "romania": 0, "russia": 0, "slovakia": 0, "slovenia": 0, "switzerland": 4, "turkey": 0, "united_kingdom": 0, "united_states": 0, "user_defined": 5, "": 7 }, "variant": { "": 26 } }, "reported": { "clips": 0, "reasons": {} }, "size": 1746690, "checksum": "73de26bbb7b2dde05476189812aff81958337c1a73b66efd801da84f948067c6", "generated_at": "2026-03-15T02:06:23.996Z" }, "en": { "locale": "en", "clips": 2180, "users": 294, "questions": { "total": 527, "validated": 330, "invalidated": 38, "other": 159, "has_audio": 448, "avg_recordings_per_question": 4.87 }, "audios": { "total": 2180, "transcribed_validated": 617, "transcribed_pending": 54, "not_transcribed": 1509 }, "transcriptions": { "total": 671, 
"validated": 617, "not_yet_validated": 54, "edited": 50, "edited_pct": 7.45 }, "duration": { "total_ms": 21202704, "total_hrs": 5.88, "validated_ms": 4152168, "validated_hrs": 1.15, "avg_ms": 9726, "min_ms": 180, "max_ms": 294408, "avg_chars_per_sec": 6.81 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.0839, "teens": 0.1064, "twenties": 0.4587, "thirties": 0.1284, "fourties": 0.0161, "fifties": 0.1564, "sixties": 0.0486, "seventies": 0.0005, "eighties": 0.0005, "nineties": 0.0005 }, "gender": { "": 0.5083, "male_masculine": 0.0307, "female_feminine": 0.4326, "intersex": 0.005, "transgender": 0.0023, "non-binary": 0.0028, "do_not_wish_to_say": 0.0183 }, "accent": { "african": 3, "australia": 9, "bermuda": 0, "canada": 3, "england": 104, "hongkong": 0, "indian": 3, "ireland": 0, "malaysia": 0, "newzealand": 0, "philippines": 0, "scotland": 0, "singapore": 0, "southatlandtic": 0, "us": 61, "wales": 0, "user_defined": 232, "": 1957 }, "variant": { "": 2180 } }, "reported": { "clips": 331, "reasons": { "different_language": 315, "other": 15, "offensive_speech": 1 } }, "size": 137813177, "checksum": "699e64e34bbfdd2d9571219b24e8867e7b838b63f17c8b265dbd87fd821d3602", "generated_at": "2026-03-15T02:01:13.073Z" }, "eo": { "locale": "eo", "clips": 66, "users": 3, "questions": { "total": 81, "validated": 2, "invalidated": 0, "other": 79, "has_audio": 55, "avg_recordings_per_question": 1.2 }, "audios": { "total": 66, "transcribed_validated": 11, "transcribed_pending": 7, "not_transcribed": 48 }, "transcriptions": { "total": 18, "validated": 11, "not_yet_validated": 7, "edited": 2, "edited_pct": 11.11 }, "duration": { "total_ms": 2209212, "total_hrs": 0.61, "validated_ms": 79308, "validated_hrs": 0.02, "avg_ms": 33473, "min_ms": 2340, "max_ms": 164376, 
"avg_chars_per_sec": 5.58 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.7424, "thirties": 0.2576, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.7424, "male_masculine": 0.2576, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "internacia": 6, "user_defined": 33, "": 49 }, "variant": { "": 66 } }, "reported": { "clips": 0, "reasons": {} }, "size": 13089902, "checksum": "6461ab1c10cac639a468a6e8f81b0206ea57ef738ce3208d16624b92b92971f4", "generated_at": "2026-03-15T02:07:27.357Z" }, "es": { "locale": "es", "clips": 320, "users": 9, "questions": { "total": 124, "validated": 55, "invalidated": 0, "other": 69, "has_audio": 119, "avg_recordings_per_question": 2.69 }, "audios": { "total": 320, "transcribed_validated": 2, "transcribed_pending": 3, "not_transcribed": 315 }, "transcriptions": { "total": 5, "validated": 2, "not_yet_validated": 3, "edited": 2, "edited_pct": 40 }, "duration": { "total_ms": 2165148, "total_hrs": 0.6, "validated_ms": 14148, "validated_hrs": 0, "avg_ms": 6766, "min_ms": 180, "max_ms": 30456, "avg_chars_per_sec": 6.76 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0.6875, "twenties": 0.1062, "thirties": 0.0469, "fourties": 0.1469, "fifties": 0.0125, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.8063, "male_masculine": 0.0469, "female_feminine": 0.1437, "intersex": 0, "transgender": 0, "non-binary": 0, 
"do_not_wish_to_say": 0.0031 }, "accent": { "americacentral": 0, "andino": 117, "canario": 0, "caribe": 0, "centrosurpeninsular": 0, "chileno": 0, "filipinas": 0, "mexicano": 0, "nortepeninsular": 116, "rioplatense": 0, "surpeninsular": 4, "user_defined": 116, "": 83 }, "variant": { "": 320 } }, "reported": { "clips": 3, "reasons": { "different_language": 3 } }, "size": 12438875, "checksum": "ed9d7b98fcd7f7e0cbc9103fb9b33cf91341c89c762feed72a019840a59b6218", "generated_at": "2026-03-15T02:05:18.999Z" }, "fr": { "locale": "fr", "clips": 106, "users": 15, "questions": { "total": 155, "validated": 54, "invalidated": 3, "other": 98, "has_audio": 138, "avg_recordings_per_question": 0.77 }, "audios": { "total": 106, "transcribed_validated": 7, "transcribed_pending": 0, "not_transcribed": 99 }, "transcriptions": { "total": 7, "validated": 7, "not_yet_validated": 0, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 1168560, "total_hrs": 0.32, "validated_ms": 39960, "validated_hrs": 0.01, "avg_ms": 11024, "min_ms": 2088, "max_ms": 84384, "avg_chars_per_sec": 9.12 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.0094, "teens": 0, "twenties": 0.0377, "thirties": 0.1792, "fourties": 0.0377, "fifties": 0.217, "sixties": 0.0472, "seventies": 0.3491, "eighties": 0.1226, "nineties": 0 }, "gender": { "": 0.4434, "male_masculine": 0.1981, "female_feminine": 0.3302, "intersex": 0, "transgender": 0.0094, "non-binary": 0.0189, "do_not_wish_to_say": 0 }, "accent": { "algeria": 0, "belgium": 0, "benin": 0, "cameroon": 0, "canada": 0, "central_african_republic": 0, "comoros": 0, "congo_kinshasa": 0, "cote_d_ivoire": 0, "cyprus": 0, "djibouti": 0, "equatorial_guinea": 0, "fr-metro-east": 17, "fr-metro-north": 0, "fr-metro-south": 32, "fr-metro-west": 0, 
"french_guiana": 0, "french_polynesia": 0, "germany": 2, "guadeloupe": 0, "haiti": 0, "italy": 0, "luxembourg": 0, "madagascar": 0, "mali": 0, "martinique": 0, "mauritania": 0, "mauritius": 0, "mayotte": 0, "monaco": 0, "morocco": 0, "netherlands": 0, "new_caledonia": 0, "niger": 0, "reunion": 0, "romania": 0, "senegal": 0, "seychelles": 0, "st_barthelemy": 0, "st_martin": 0, "st_pierre_et_miquelon": 0, "switzerland": 0, "syria": 0, "togo": 0, "tunisia": 0, "united_kingdom": 0, "united_states": 0, "vanuatu": 0, "wallis_et_futuna": 0, "": 55 }, "variant": { "fr-asia": 0, "fr-droum": 0, "fr-europe": 0, "fr-metro": 78, "fr-nafrica": 0, "fr-namerica": 0, "fr-safrica": 0, "fr-samerica": 0, "": 28 } }, "reported": { "clips": 1, "reasons": { "different_language": 1 } }, "size": 6910708, "checksum": "337cba8dc4c1b3b63f22ce77494448a59af8a759d4c54e0ca78ff51037b2b205", "generated_at": "2026-03-15T02:01:28.786Z" }, "gl": { "locale": "gl", "clips": 2094, "users": 109, "questions": { "total": 313, "validated": 43, "invalidated": 0, "other": 270, "has_audio": 310, "avg_recordings_per_question": 6.75 }, "audios": { "total": 2094, "transcribed_validated": 38, "transcribed_pending": 905, "not_transcribed": 1151 }, "transcriptions": { "total": 943, "validated": 38, "not_yet_validated": 905, "edited": 18, "edited_pct": 1.91 }, "duration": { "total_ms": 17206776, "total_hrs": 4.77, "validated_ms": 494676, "validated_hrs": 0.13, "avg_ms": 8217, "min_ms": 180, "max_ms": 497700, "avg_chars_per_sec": 7.49 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.0721, "teens": 0.4499, "twenties": 0.0258, "thirties": 0.0712, "fourties": 0.3663, "fifties": 0.0143, "sixties": 0.0005, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.4179, "male_masculine": 0.0277, 
"female_feminine": 0.522, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.0325 }, "accent": { "atlantico": 143, "central": 151, "neofalante": 119, "normativo": 605, "oriental": 42, "user_defined": 12, "": 1028 }, "variant": { "": 2094 } }, "reported": { "clips": 1, "reasons": { "other": 1 } }, "size": 96754096, "checksum": "e17f4a81143b46085eb6b3f77a22916ad118adf5a940b0290e61ce41b0284dd1", "generated_at": "2026-03-15T02:06:16.288Z" }, "gsw": { "locale": "gsw", "clips": 336, "users": 53, "questions": { "total": 75, "validated": 11, "invalidated": 0, "other": 64, "has_audio": 71, "avg_recordings_per_question": 4.73 }, "audios": { "total": 336, "transcribed_validated": 2, "transcribed_pending": 73, "not_transcribed": 261 }, "transcriptions": { "total": 75, "validated": 2, "not_yet_validated": 73, "edited": 8, "edited_pct": 10.67 }, "duration": { "total_ms": 6395220, "total_hrs": 1.77, "validated_ms": 6444, "validated_hrs": 0, "avg_ms": 19033, "min_ms": 1008, "max_ms": 144828, "avg_chars_per_sec": 6.76 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.0327, "teens": 0, "twenties": 0.0208, "thirties": 0.0446, "fourties": 0.0417, "fifties": 0.122, "sixties": 0.6756, "seventies": 0.0625, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2619, "male_masculine": 0, "female_feminine": 0.7381, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "user_defined": 26, "": 310 }, "variant": { "gsw-FR-hochalem": 0, "gsw-FR-nordalem": 54, "gsw-FR-rhinfran": 24, "gsw-FR-sudnalem": 1, "gsw-FR-sudrfran": 2, "": 255 } }, "reported": { "clips": 3, "reasons": { "different_language": 3 } }, "size": 37266943, "checksum": "4e9bfe87ca5543908841d416c780ce94a1e6f6cd5c3650be6c524f2aec0bd977", "generated_at": 
"2026-03-15T02:05:06.989Z" }, "hr": { "locale": "hr", "clips": 4, "users": 1, "questions": { "total": 60, "validated": 0, "invalidated": 0, "other": 60, "has_audio": 4, "avg_recordings_per_question": 1 }, "audios": { "total": 4, "transcribed_validated": 0, "transcribed_pending": 2, "not_transcribed": 2 }, "transcriptions": { "total": 2, "validated": 0, "not_yet_validated": 2, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 52056, "total_hrs": 0.01, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 13014, "min_ms": 6156, "max_ms": 24732, "avg_chars_per_sec": 7.04 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 1, "thirties": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0, "male_masculine": 0, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 1 }, "accent": { "": 4 }, "variant": { "": 4 } }, "reported": { "clips": 0, "reasons": {} }, "size": 289156, "checksum": "dcc1a5157a53d4d0e88646ec6d1272e465d2ab2fe4c5570057254677ef578e79", "generated_at": "2026-03-15T02:07:52.816Z" }, "jv": { "locale": "jv", "clips": 32, "users": 3, "questions": { "total": 315, "validated": 0, "invalidated": 0, "other": 315, "has_audio": 31, "avg_recordings_per_question": 1.03 }, "audios": { "total": 32, "transcribed_validated": 0, "transcribed_pending": 14, "not_transcribed": 18 }, "transcriptions": { "total": 14, "validated": 0, "not_yet_validated": 14, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 647892, "total_hrs": 0.17, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 20247, "min_ms": 8316, "max_ms": 31176, "avg_chars_per_sec": 10.03 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 
}, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 0.7188, "fourties": 0.2813, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 32 }, "variant": { "jv-kulon": 0, "jv-tengah": 0, "jv-timur": 0, "": 32 } }, "reported": { "clips": 0, "reasons": {} }, "size": 3844316, "checksum": "b3ddb18698da5b7ab8649557bcef674d993dbeea0bffa89c6ec991cd61aad11d", "generated_at": "2026-03-15T02:07:50.956Z" }, "kbd": { "locale": "kbd", "clips": 77, "users": 6, "questions": { "total": 155, "validated": 0, "invalidated": 0, "other": 155, "has_audio": 155, "avg_recordings_per_question": 0.5 }, "audios": { "total": 77, "transcribed_validated": 0, "transcribed_pending": 0, "not_transcribed": 77 }, "transcriptions": { "total": 0, "validated": 0, "not_yet_validated": 0, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 1151568, "total_hrs": 0.31, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 14955, "min_ms": 2376, "max_ms": 54468, "avg_chars_per_sec": 0 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.0909, "thirties": 0.1429, "fourties": 0.7662, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.2468, "male_masculine": 0.0519, "female_feminine": 0.7013, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "bakhsan": 7, "besleney": 0, "kuban": 0, "kuban_zelenchuk": 0, "malka": 0, "mozdok": 0, "terek": 7, "": 63 }, 
"variant": { "kbd-Cyrl": 29, "kbd-Cyrl-JOR": 0, "kbd-Cyrl-SY": 0, "kbd-Cyrl-TR": 0, "kbd-Latn-TR-t-kbd-cyrl": 0, "kbd-RU": 0, "": 48 } }, "reported": { "clips": 0, "reasons": {} }, "size": 6783481, "checksum": "f2d8a757004f1556ce08cc8357dfb43dd3c7894f58adfc404828c812a7063415", "generated_at": "2026-03-15T02:04:43.635Z" }, "lv": { "locale": "lv", "clips": 36, "users": 4, "questions": { "total": 101, "validated": 0, "invalidated": 0, "other": 101, "has_audio": 48, "avg_recordings_per_question": 0.75 }, "audios": { "total": 36, "transcribed_validated": 2, "transcribed_pending": 5, "not_transcribed": 29 }, "transcriptions": { "total": 7, "validated": 2, "not_yet_validated": 5, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 1204308, "total_hrs": 0.33, "validated_ms": 115128, "validated_hrs": 0.03, "avg_ms": 33453, "min_ms": 4788, "max_ms": 165960, "avg_chars_per_sec": 8.99 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0.0278, "twenties": 0.4722, "thirties": 0.0556, "fourties": 0, "fifties": 0.4444, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.4722, "male_masculine": 0, "female_feminine": 0.5278, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "user_defined": 1, "": 35 }, "variant": { "": 36 } }, "reported": { "clips": 0, "reasons": {} }, "size": 7150196, "checksum": "debc5423bcb04df3dd9fc4ef08a37c74fe944677705039ca5bd5e58400adf2c3", "generated_at": "2026-03-15T02:04:50.382Z" }, "ms-MY": { "locale": "ms-MY", "clips": 5, "users": 2, "questions": { "total": 120, "validated": 0, "invalidated": 0, "other": 120, "has_audio": 119, "avg_recordings_per_question": 0.04 }, "audios": { "total": 5, "transcribed_validated": 0, "transcribed_pending": 2, 
"not_transcribed": 3 }, "transcriptions": { "total": 2, "validated": 0, "not_yet_validated": 2, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 54324, "total_hrs": 0.01, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 10865, "min_ms": 5940, "max_ms": 14040, "avg_chars_per_sec": 10.39 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.6, "teens": 0, "twenties": 0.4, "thirties": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.6, "male_masculine": 0, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0.4 }, "accent": { "": 5 }, "variant": { "": 5 } }, "reported": { "clips": 0, "reasons": {} }, "size": 289119, "checksum": "987f8699a405e7c2f2331c554b4c7e8a4199258187dfc063c089784f85a23c0a", "generated_at": "2026-03-15T02:01:23.934Z" }, "nl": { "locale": "nl", "clips": 20, "users": 2, "questions": { "total": 98, "validated": 0, "invalidated": 0, "other": 98, "has_audio": 19, "avg_recordings_per_question": 1.05 }, "audios": { "total": 20, "transcribed_validated": 0, "transcribed_pending": 1, "not_transcribed": 19 }, "transcriptions": { "total": 1, "validated": 0, "not_yet_validated": 1, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 437904, "total_hrs": 0.12, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 21895, "min_ms": 3312, "max_ms": 54144, "avg_chars_per_sec": 8.48 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 0.6, "fourties": 0.4, "fifties": 0, "sixties": 0, 
"seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0, "male_masculine": 1, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "aruba": 0, "belgium": 0, "curacao": 0, "france": 0, "germany": 0, "indonesia": 0, "namibia": 0, "netherlands": 20, "sint_maarten": 0, "south_africa": 0, "suriname": 0 }, "variant": { "": 20 } }, "reported": { "clips": 0, "reasons": {} }, "size": 2533832, "checksum": "c73d1b6d2073a72e8f6484cf51a17cc87ed6d133bf77e284815bb7578fdf2163", "generated_at": "2026-03-15T02:07:24.131Z" }, "ps": { "locale": "ps", "clips": 827, "users": 68, "questions": { "total": 192, "validated": 56, "invalidated": 1, "other": 135, "has_audio": 176, "avg_recordings_per_question": 4.7 }, "audios": { "total": 827, "transcribed_validated": 85, "transcribed_pending": 13, "not_transcribed": 729 }, "transcriptions": { "total": 98, "validated": 85, "not_yet_validated": 13, "edited": 31, "edited_pct": 31.63 }, "duration": { "total_ms": 36875016, "total_hrs": 10.24, "validated_ms": 2126160, "validated_hrs": 0.59, "avg_ms": 44589, "min_ms": 1296, "max_ms": 352548, "avg_chars_per_sec": 13.28 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.006, "teens": 0.0266, "twenties": 0.5586, "thirties": 0.347, "fourties": 0.0375, "fifties": 0.0218, "sixties": 0.0024, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.4474, "male_masculine": 0.0145, "female_feminine": 0.5381, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "user_defined": 108, "": 719 }, "variant": { "": 827 } }, "reported": { "clips": 4, "reasons": { "other": 4 } }, "size": 203997185, "checksum": "c2454fe6cb43336629c2f1dd217b4c3571e11692d7fbf737679a8efa93f4f513", "generated_at": 
"2026-03-15T02:07:12.468Z" }, "qxp": { "locale": "qxp", "clips": 5101, "users": 94, "questions": { "total": 164, "validated": 110, "invalidated": 2, "other": 52, "has_audio": 150, "avg_recordings_per_question": 34.01 }, "audios": { "total": 5101, "transcribed_validated": 9, "transcribed_pending": 148, "not_transcribed": 4944 }, "transcriptions": { "total": 157, "validated": 9, "not_yet_validated": 148, "edited": 1, "edited_pct": 0.64 }, "duration": { "total_ms": 93539412, "total_hrs": 25.98, "validated_ms": 74844, "validated_hrs": 0.02, "avg_ms": 18337, "min_ms": 720, "max_ms": 1715616, "avg_chars_per_sec": 6.85 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0.0296, "twenties": 0.3446, "thirties": 0.2948, "fourties": 0.0902, "fifties": 0.1333, "sixties": 0.0782, "seventies": 0.0292, "eighties": 0, "nineties": 0 }, "gender": { "": 0.1919, "male_masculine": 0.0002, "female_feminine": 0.8079, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "user_defined": 987, "": 4141 }, "variant": { "": 5101 } }, "reported": { "clips": 0, "reasons": {} }, "size": 544762623, "checksum": "2a0eb5a15f3b4c4cae51a5a95212e49bef16b549c2b256a5bd992f1ae6105cb9", "generated_at": "2026-03-15T02:04:12.096Z" }, "rki": { "locale": "rki", "clips": 182, "users": 12, "questions": { "total": 180, "validated": 55, "invalidated": 0, "other": 125, "has_audio": 90, "avg_recordings_per_question": 2.02 }, "audios": { "total": 182, "transcribed_validated": 66, "transcribed_pending": 12, "not_transcribed": 104 }, "transcriptions": { "total": 78, "validated": 66, "not_yet_validated": 12, "edited": 18, "edited_pct": 23.08 }, "duration": { "total_ms": 1993392, "total_hrs": 0.55, "validated_ms": 432324, "validated_hrs": 0.12, "avg_ms": 10953, 
"min_ms": 1908, "max_ms": 106380, "avg_chars_per_sec": 9.85 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.011, "teens": 0.0055, "twenties": 0.1593, "thirties": 0.8187, "fourties": 0, "fifties": 0.0055, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.9121, "male_masculine": 0, "female_feminine": 0.0879, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 182 }, "variant": { "": 182 } }, "reported": { "clips": 0, "reasons": {} }, "size": 11713058, "checksum": "a9de0bda8aa074001d57ae00922a3c4f0e9bf29960f96ebb19e1eb9418cab4f7", "generated_at": "2026-03-15T02:07:34.221Z" }, "ru": { "locale": "ru", "clips": 176, "users": 6, "questions": { "total": 759, "validated": 666, "invalidated": 1, "other": 92, "has_audio": 331, "avg_recordings_per_question": 0.53 }, "audios": { "total": 176, "transcribed_validated": 109, "transcribed_pending": 52, "not_transcribed": 15 }, "transcriptions": { "total": 161, "validated": 109, "not_yet_validated": 52, "edited": 34, "edited_pct": 21.12 }, "duration": { "total_ms": 3668364, "total_hrs": 1.01, "validated_ms": 1590048, "validated_hrs": 0.44, "avg_ms": 20843, "min_ms": 2448, "max_ms": 72576, "avg_chars_per_sec": 8.45 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.0057, "teens": 0.1875, "twenties": 0.7841, "thirties": 0.0227, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.9943, "male_masculine": 0.0057, "female_feminine": 0, "intersex": 0, "transgender": 0, 
"non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "user_defined": 85, "": 91 }, "variant": { "": 176 } }, "reported": { "clips": 0, "reasons": {} }, "size": 21481414, "checksum": "0f2a392a54fc66cb68b2d9014622a89ae26d80637bb772d7fb30e98ebf28a926", "generated_at": "2026-03-15T02:06:31.023Z" }, "shi": { "locale": "shi", "clips": 191, "users": 7, "questions": { "total": 62, "validated": 0, "invalidated": 0, "other": 62, "has_audio": 60, "avg_recordings_per_question": 3.18 }, "audios": { "total": 191, "transcribed_validated": 0, "transcribed_pending": 2, "not_transcribed": 189 }, "transcriptions": { "total": 2, "validated": 0, "not_yet_validated": 2, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 5209884, "total_hrs": 1.44, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 27277, "min_ms": 3780, "max_ms": 105228, "avg_chars_per_sec": 6.86 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.5183, "thirties": 0.1937, "fourties": 0, "fifties": 0.288, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.8063, "male_masculine": 0, "female_feminine": 0.1937, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 191 }, "variant": { "": 191 } }, "reported": { "clips": 0, "reasons": {} }, "size": 30331184, "checksum": "3533e6ccddf612fba0633e74a03124fbc86b8bba6cdfc61b67d2b0c9c72c7d50", "generated_at": "2026-03-15T02:06:41.412Z" }, "si": { "locale": "si", "clips": 21, "users": 1, "questions": { "total": 60, "validated": 0, "invalidated": 0, "other": 60, "has_audio": 21, "avg_recordings_per_question": 1 }, "audios": { "total": 21, "transcribed_validated": 0, "transcribed_pending": 0, "not_transcribed": 21 }, "transcriptions": { "total": 0, "validated": 0, 
"not_yet_validated": 0, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 446148, "total_hrs": 0.12, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 21245, "min_ms": 9396, "max_ms": 28224, "avg_chars_per_sec": 0 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0, "thirties": 0, "fourties": 0, "fifties": 1, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0, "male_masculine": 0, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 1 }, "accent": { "": 21 }, "variant": { "": 21 } }, "reported": { "clips": 0, "reasons": {} }, "size": 2634068, "checksum": "aec1577de9d059f52fbe52ccafa4265c4bf3017dc27ef91e0b40de418afdfb89", "generated_at": "2026-03-15T02:07:41.074Z" }, "su": { "locale": "su", "clips": 207, "users": 2, "questions": { "total": 131, "validated": 0, "invalidated": 0, "other": 131, "has_audio": 131, "avg_recordings_per_question": 1.58 }, "audios": { "total": 207, "transcribed_validated": 14, "transcribed_pending": 159, "not_transcribed": 34 }, "transcriptions": { "total": 173, "validated": 14, "not_yet_validated": 159, "edited": 1, "edited_pct": 0.58 }, "duration": { "total_ms": 2258640, "total_hrs": 0.62, "validated_ms": 124524, "validated_hrs": 0.03, "avg_ms": 10911, "min_ms": 3348, "max_ms": 34956, "avg_chars_per_sec": 9.35 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 1, "thirties": 0, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 
0.3671, "male_masculine": 0, "female_feminine": 0.6329, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 207 }, "variant": { "": 207 } }, "reported": { "clips": 0, "reasons": {} }, "size": 13391810, "checksum": "408178a4f4ea7a4992f8378e913ddae4875c918806632dae29717366ec1a83a7", "generated_at": "2026-03-15T02:07:48.018Z" }, "th": { "locale": "th", "clips": 12, "users": 2, "questions": { "total": 61, "validated": 1, "invalidated": 0, "other": 60, "has_audio": 14, "avg_recordings_per_question": 0.86 }, "audios": { "total": 12, "transcribed_validated": 3, "transcribed_pending": 0, "not_transcribed": 9 }, "transcriptions": { "total": 3, "validated": 3, "not_yet_validated": 0, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 161856, "total_hrs": 0.04, "validated_ms": 11124, "validated_hrs": 0, "avg_ms": 13488, "min_ms": 3060, "max_ms": 22716, "avg_chars_per_sec": 6.41 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.75, "thirties": 0.25, "fourties": 0, "fifties": 0, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.25, "male_masculine": 0.75, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "user_defined": 3, "": 9 }, "variant": { "": 12 } }, "reported": { "clips": 0, "reasons": {} }, "size": 915232, "checksum": "51a1c800149d190b8fa5a53136d292a25a81d6185aa663897ab5563a4ae21c13", "generated_at": "2026-03-15T02:06:33.908Z" }, "tr": { "locale": "tr", "clips": 16, "users": 4, "questions": { "total": 141, "validated": 1, "invalidated": 0, "other": 140, "has_audio": 41, "avg_recordings_per_question": 0.39 }, "audios": { "total": 16, "transcribed_validated": 0, "transcribed_pending": 0, 
"not_transcribed": 16 }, "transcriptions": { "total": 0, "validated": 0, "not_yet_validated": 0, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 245592, "total_hrs": 0.06, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 15350, "min_ms": 5580, "max_ms": 46620, "avg_chars_per_sec": 0 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0.3125, "teens": 0.1875, "twenties": 0.4375, "thirties": 0, "fourties": 0, "fifties": 0, "sixties": 0.0625, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 0.5, "male_masculine": 0.0625, "female_feminine": 0.4375, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "user_defined": 7, "": 12 }, "variant": { "": 16 } }, "reported": { "clips": 0, "reasons": {} }, "size": 1426301, "checksum": "b7974bfa907f001b4d509708d7078b192ba0ebdc2a82f419ddca528f34c99eee", "generated_at": "2026-03-15T02:01:31.090Z" }, "tuq": { "locale": "tuq", "clips": 87, "users": 3, "questions": { "total": 70, "validated": 0, "invalidated": 0, "other": 70, "has_audio": 70, "avg_recordings_per_question": 1.24 }, "audios": { "total": 87, "transcribed_validated": 0, "transcribed_pending": 0, "not_transcribed": 87 }, "transcriptions": { "total": 0, "validated": 0, "not_yet_validated": 0, "edited": 0, "edited_pct": 0 }, "duration": { "total_ms": 2484000, "total_hrs": 0.69, "validated_ms": 0, "validated_hrs": 0, "avg_ms": 28552, "min_ms": 1080, "max_ms": 109476, "avg_chars_per_sec": 0 }, "buckets": { "train": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "dev": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 }, "test": { "clips": 0, "users": 0, "duration_ms": 0, "duration_hrs": 0 } }, "demographics": { "age": { "": 0, "teens": 0, "twenties": 0.1954, "thirties": 0, 
"fourties": 0, "fifties": 0.8046, "sixties": 0, "seventies": 0, "eighties": 0, "nineties": 0 }, "gender": { "": 1, "male_masculine": 0, "female_feminine": 0, "intersex": 0, "transgender": 0, "non-binary": 0, "do_not_wish_to_say": 0 }, "accent": { "": 87 }, "variant": { "": 87 } }, "reported": { "clips": 0, "reasons": {} }, "size": 14240916, "checksum": "0be8d0c8fc8011818198d490606d8193d4098fb1528a8a575521d9019137115c", "generated_at": "2026-03-15T02:07:38.686Z" } }, "totalDurationMs": 208610388, "totalValidDurationMs": 9728604, "totalHrs": 58, "totalValidHrs": 3 }

================================================
FILE: helpers/.eslintrc.json
================================================
{
  "env": {
    "node": true,
    "commonjs": false,
    "es2021": true
  },
  "extends": [
    "eslint:recommended"
  ],
  "parserOptions": {
    "ecmaVersion": 12
  },
  "rules": {
    "no-console": "off"
  }
}

================================================
FILE: helpers/README.md
================================================
# CV Dataset Helper Scripts

This directory contains helper scripts for processing and analyzing Common Voice dataset statistics.

## Dataset Types

- `scripted-speech` — Classic Common Voice (SCS)
- `spontaneous-speech` — Spontaneous Speech (SPS)
- `code-switching` — Code Switching (CS, not yet implemented)

## Shared Module

[`common.js`](common.js) contains shared constants and utilities used by all helpers:

- `DATASET_TYPES` — list of valid dataset types
- `buildFilePath` / `buildFolderPath` — path builders for `datasets//`
- `getLocaleFromFilename` — extracts locale from `stats_{locale}.json`
- `validateDatasetType` — validates and throws on unknown types

## Commands

All commands follow the pattern: `node helpers/