FoLiApy

An extensive library for processing FoLiA documents. FoLiA stands for Format for Linguistic Annotation and is a very rich XML-based format used by various Natural Language Processing tools.

Provided tools & services

FoLiApy

Type
  • Software Library

Tool suite: FoLiA

The following closely related tools are in a tool suite together with FoLiApy:

  • Command-line Application
  • 9 - Proven: Technology complete and proven in practice by real users.
  • Active: The project has reached a stable, usable state and is being actively developed.

FoLiA tools 2.5.7

  •   KNAW Humanities Cluster & CLST, Radboud University
FoLiA-tools contains various Python-based command line tools for working with FoLiA XML (Format for Linguistic Annotation) [view more]
  • Annotating
  • https://w3id.org/nwo-research-fields#ComputationalLinguisticsandPhilology
  • Textual and linguistic corpora
  • annotation
  • computational linguistics
  • folia
  • nlp
  • search
  • Bsd
  • Linux
  • Macos
  • Python
Created: 2011-01-14
Modified: 2024-05-14
  • Software Library
  • Inactive: The project has reached a stable, usable state but is no longer being actively developed; support/maintenance will be provided as time allows.

folia 0.0.6

  •   KNAW Humanities Cluster & CLST, Radboud University
High-performance library for handling the FoLiA XML format (Format for Linguistic Annotation) [view more]
  • ['science', 'text-processing']
  • annotation
  • linguistics
  • nlp
  • text-processing
  • xml
Created: 2019-06-08
Modified: 2020-11-16
  • Command-line Application
  • Active: The project has reached a stable, usable state and is being actively developed.

foliautils 0.22

Command-line utilities for working with the Format for Linguistic Annotation (FoLiA). [view more]
  • folia
  • linguistic annotation
  • natural language processing
  • nlp
  • xml
  • Posix
  • Command-line Application
  • Software Library
  • Active: The project has reached a stable, usable state and is being actively developed.

libfolia 2.20

This is a C++ Library for working with the Format for Linguistic Annotation (FoLiA). [view more]
  • folia
  • linguistic annotation
  • natural language processing
  • nlp
  • xml
  • Posix
  • Web Application
  • 8 - Complete: Technology complete and qualified, released for all end-users in scholarly environments.
  • Inactive: The project has reached a stable, usable state but is no longer being actively developed; support/maintenance will be provided as time allows.

piereling 0.4

  •   KNAW Humanities Cluster & CLST, Radboud University
Piereling is a webservice and web-application to convert between a variety of document formats, mostly from and to FoLiA XML. It is intended for NLP pipelines. [view more]
  • Internet > WWW/HTTP > WSGI > Application
  • Text Processing > Linguistic
  • webservice nlp computational_linguistics rest folia conversion
  • Bsd
  • Linux
  • Macos
  • Python
Created: 2019-10-18
Modified: 2023-11-01

Citation

You can cite this software using the following citation generated from its metadata:

(2024) FoLiApy 2.5.11 .
  • KNAW Humanities Cluster & CLST, Radboud University
.

Logs & Reviews

Name
Automatic software metadata validation report for FoLiApy 2.5.11
Author
  • codemetapy validator using software.ttl
Date
2024-09-16 03:06:45
Review
Please consult the CLARIAH Software Metadata Requirements at https://github.com/CLARIAH/clariah-plus/blob/main/requirements/software-metadata-requirements.md for an in-depth explanation of any found problems

Validation of FoLiApy 2.5.11 was successful (score=4/5), but there are some remarks which you may or may not want to address:

1. Info: Reference publications *SHOULD* be expressed, if any (This is missing in the metadata)
2. Info: The funder *SHOULD* be acknowledged (This is missing in the metadata)
Rating
★ ★ ★ ★ ☆
(log file starts at Mon Sep 16 03:06:30 UTC 2024)

[harvester info] --> Processing foliapy (https://github.com/proycon/foliapy) [Mon Sep 16 03:06:30 UTC 2024]

[harvester info] Git updating cached clone of https://github.com/proycon/foliapy...

[harvester info] Found release v2.5.11

[harvester info] Using 'v2.5.11'

[harvester info] Git reference: v2.5.11

[harvester info] Scanning directory /tmp/codemeta-harvester.cache/foliapy for harvestable resources...

[harvester info] found codemeta-harvest.json for foliapy (md5sum f8c2b03ebb3657288a8cbbb92f2ff776); values in here take precendence over (override) those in later detection stages

[harvester info] found python setup for foliapy, converting to codemeta

[harvester info] Looking for license....

[harvester info] Found license GPL-3.0-only

[harvester info] Parsing AUTHORS...

[harvester info] Getting contributors from git...

[harvester info] No git contributors found

[harvester info] Getting top contributor from git...

[harvester info] Git top contributor  will be assigned as author (and maintainer) if none are found in the metadata

[harvester info] Extracting last and first commit date from git log....

[harvester info] Date created: 2010-05-27T19:17:40Z+0000, date modified: 2024-03-28T17:23:49Z+0100

[harvester info] Querying Github/GitLab API (https://github.com/proycon/foliapy)

[harvester info] Adding URL for found README: README.rst

[harvester info] Found releaseNotes

[harvester info] Querying Zenodo API for DOI (access token provided)...

[harvester info] Converting README.rst to README.md

[harvester info] Looking for TRL information in README.md...

[harvester info] Looking for repostatus information in README.md...

[harvester info] Found repostatus https://www.repostatus.org/#active

[harvester info] Looking for continuous integration information in README.md...

[harvester info] Found CI https://github.com/proycon/foliapy/actions/workflows/foliapy.yml

[harvester info] Looking for documentation links in README.md...

[harvester info] Scraping title from http://foliapy.readthedocs.io/en/latest/?badge=latest

[harvester info] Found documentation at http://foliapy.readthedocs.io/en/latest/?badge=latest : "name": "FoLiA Python Library — FoLiA Python Library v2.5.11, FoLiA v2.5.3 documentation",

[harvester info] Scraping title from https://foliapy.readthedocs.io/en/latest/

[harvester info] Found documentation at https://foliapy.readthedocs.io/en/latest/ : "name": "FoLiA Python Library — FoLiA Python Library v2.5.11, FoLiA v2.5.3 documentation",

[harvester info] Falling back to git tag (v2.5.11) if no version number is specified...

[harvester info] Inferring repostatus information from git activity (used only as a fallback if not explicitly provided)...

[harvester info] Inferred repostatus https://www.repostatus.org/#active

[harvester info] Looking for repostatus information in README.rst in master branch...

[harvester info] Found repostatus (master branch) https://www.repostatus.org/#active

[harvester info] Setting group FoLiA

[harvester info] Reconciliating: codemetapy  --baseuri https://tools.dev.clariah.nl --baseuri https://tools.dev.clariah.nl --includecontext --addcontext https://w3id.org/nwo-research-fields --addcontext https://w3id.org/research-technology-readiness-levels --addcontextgraph https://vocabs.dariah.eu/rest/v1/tadirah/data?format=text/turtle --trl --identifier "foliapy" --codeRepository "https://github.com/proycon/foliapy" --validate /etc/software.ttl --released --enrich --textv "Please consult the CLARIAH Software Metadata Requirements at https://github.com/CLARIAH/clariah-plus/blob/main/requirements/software-metadata-requirements.md for an in-depth explanation of any found problems" -O /tmp/out/foliapy.codemeta.json /tmp/codemeta-harvester.cache//tmp/99-version.foliapy.codemeta.json /tmp/codemeta-harvester.cache//tmp/99-repostatus.foliapy.codemeta.json /tmp/codemeta-harvester.cache//tmp/90-authors.foliapy.codemeta.json /tmp/codemeta-harvester.cache//tmp/50-documentation.foliapy.codemeta.json /tmp/codemeta-harvester.cache//tmp/43-releasenotes.foliapy.codemeta.json /tmp/codemeta-harvester.cache//tmp/41-readme.foliapy.codemeta.json /tmp/codemeta-harvester.cache//tmp/40-gitapi.foliapy.codemeta.json /tmp/codemeta-harvester.cache//tmp/39-gitdate.foliapy.codemeta.json /tmp/codemeta-harvester.cache//tmp/31-authors.foliapy.codemeta.json /tmp/codemeta-harvester.cache//tmp/29-license.foliapy.codemeta.json /tmp/codemeta-harvester.cache//tmp/20-python.foliapy.codemeta.json /tmp/codemeta-harvester.cache//tmp/12-ci.foliapy.codemeta.json /tmp/codemeta-harvester.cache//tmp/11-repostatus.foliapy.codemeta.json /tmp/codemeta-harvester.cache//tmp/10-harvest.foliapy.codemeta.json /tmp/codemeta-harvester.cache//tmp/05-repostatus.foliapy.codemeta.json /tmp/codemeta-harvester.cache//tmp/04-applicationSuite.foliapy.codemeta.json 

-- begin log --

Passed 16 files/sources but specified 0 input types! Automatically guessing types...

Detected input types: [('/tmp/codemeta-harvester.cache//tmp/99-version.foliapy.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/99-repostatus.foliapy.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/90-authors.foliapy.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/50-documentation.foliapy.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/43-releasenotes.foliapy.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/41-readme.foliapy.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/40-gitapi.foliapy.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/39-gitdate.foliapy.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/31-authors.foliapy.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/29-license.foliapy.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/20-python.foliapy.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/12-ci.foliapy.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/11-repostatus.foliapy.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/10-harvest.foliapy.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/05-repostatus.foliapy.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/04-applicationSuite.foliapy.codemeta.json', 'json')]

Adding to contextgraph: /tmp/turtle

Initial URI automatically generated, may be overriden later: https://tools.dev.clariah.nl/foliapy

Processing source #1 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/99-version.foliapy.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.dev.clariah.nl/foliapy

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/foliapy)] processed 1 new triples, total is now 2

Processing source #2 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/99-repostatus.foliapy.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.dev.clariah.nl/foliapy

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/foliapy)] processed 1 new triples, total is now 3

Processing source #3 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/90-authors.foliapy.codemeta.json

    Found main resource with URI https://tools.dev.clariah.nl/foliapy.topcontributor/snapshot

    Injected (possibly temporary) URI https://tools.dev.clariah.nl/foliapy

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/foliapy)] processed 1 new triples, total is now 3

Processing source #4 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/50-documentation.foliapy.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.dev.clariah.nl/foliapy

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/foliapy)] processed 8 new triples, total is now 11

Processing source #5 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/43-releasenotes.foliapy.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.dev.clariah.nl/foliapy

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/foliapy)] processed 2 new triples, total is now 13

Processing source #6 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/41-readme.foliapy.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.dev.clariah.nl/foliapy

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/foliapy)] processed 1 new triples, total is now 14

Processing source #7 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/40-gitapi.foliapy.codemeta.json

    Found main resource with URI https://tools.dev.clariah.nl/foliapy/snapshot

    Injected (possibly temporary) URI https://tools.dev.clariah.nl/foliapy

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/foliapy)] processed 29 new triples, total is now 42

Processing source #8 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/39-gitdate.foliapy.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.dev.clariah.nl/foliapy

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/foliapy)] overriding old http://schema.org/dateCreated (2018-09-06T12:45:23Z -> 2010-05-27T19:17:40Z+0000)

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/foliapy)] overriding old http://schema.org/dateModified (2024-05-14T23:01:14Z -> 2024-03-28T17:23:49Z+0100)

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/foliapy)] processed 2 new triples, total is now 42

Processing source #9 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/31-authors.foliapy.codemeta.json

    Found main resource with URI https://tools.dev.clariah.nl/authors/snapshot

    Injected (possibly temporary) URI https://tools.dev.clariah.nl/foliapy

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/foliapy)] overriding old http://schema.org/author (https://tools.dev.clariah.nl/stub/H-78cda685abdb28ef -> https://tools.dev.clariah.nl/stub/H1e7359ffd8b3f3ec)

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/foliapy)] processed 8 new triples, total is now 48

Processing source #10 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/29-license.foliapy.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.dev.clariah.nl/foliapy

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/foliapy)] overriding old http://schema.org/license (http://spdx.org/licenses/GPL-3.0-only -> GPL-3.0-only)

[CODEMETA CORRECTION (https://tools.dev.clariah.nl/foliapy)] automatically converting license to spdx URI

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/foliapy)] processed 1 new triples, total is now 48

Processing source #11 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/20-python.foliapy.codemeta.json

    Found main resource with URI https://tools.dev.clariah.nl/folia/2.5.11

    Injected (possibly temporary) URI https://tools.dev.clariah.nl/foliapy

[CODEMETA COMPOSITION (folia)] overriding old http://schema.org/author (https://tools.dev.clariah.nl/stub/H1e7359ffd8b3f3ec -> https://tools.dev.clariah.nl/stub/H-62db4a22ac9261b2)

[CODEMETA COMPOSITION (folia)] overriding old http://schema.org/description (An extensive Python library for dealing with FoLiA (Format for Linguistic Annotation) documents, a rich XML-based format for linguistic annotation finding application in Natural Language Processing (NLP). This library was formerly part of PyNLPl. -> An extensive library for processing FoLiA documents. FoLiA stands for Format for Linguistic Annotation and is a very rich XML-based format used by various Natural Language Processing tools.)

[CODEMETA COMPOSITION (folia)] overriding old http://schema.org/keywords (pynlpl -> annotation)

[CODEMETA COMPOSITION (folia)] overriding old http://schema.org/keywords (computational-linguistics -> annotation)

[CODEMETA COMPOSITION (folia)] overriding old http://schema.org/keywords (clarin -> annotation)

[CODEMETA COMPOSITION (folia)] overriding old http://schema.org/keywords (clariah -> annotation)

[CODEMETA COMPOSITION (folia)] overriding old http://schema.org/name (foliapy -> FoLiA)

[CODEMETA COMPOSITION (folia)] overriding old http://schema.org/url (https://proycon.github.io/folia -> https://github.com/proycon/foliapy)

[CODEMETA COMPOSITION (folia)] overriding old http://schema.org/version (v2.5.11 -> 2.5.11)

[CODEMETA COMPOSITION (folia)] processed 65 new triples, total is now 92

Processing source #12 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/12-ci.foliapy.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.dev.clariah.nl/foliapy

[CODEMETA COMPOSITION (folia)] processed 1 new triples, total is now 93

Processing source #13 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/11-repostatus.foliapy.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.dev.clariah.nl/foliapy

[CODEMETA COMPOSITION (folia)] processed 1 new triples, total is now 93

Processing source #14 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/10-harvest.foliapy.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.dev.clariah.nl/foliapy

[CODEMETA COMPOSITION (folia)] overriding old http://schema.org/name (FoLiA -> FoLiApy)

[CODEMETA COMPOSITION (folia)] overriding old https://codemeta.github.io/terms/developmentStatus (https://w3id.org/research-technology-readiness-levels#Level8Complete -> https://w3id.org/research-technology-readiness-levels#Level9Proven)

[CODEMETA COMPOSITION (folia)] overriding old http://schema.org/applicationCategory (Text Processing > Linguistic -> https://vocabs.dariah.eu/tadirah/annotating)

[CODEMETA COMPOSITION (folia)] processed 6 new triples, total is now 95

Processing source #15 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/05-repostatus.foliapy.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.dev.clariah.nl/foliapy

[CODEMETA COMPOSITION (folia)] processed 1 new triples, total is now 95

Processing source #16 of 16

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/04-applicationSuite.foliapy.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.dev.clariah.nl/foliapy

[CODEMETA COMPOSITION (folia)] processed 1 new triples, total is now 96

Remapping URI to (possibly) new identifier and version component: https://tools.dev.clariah.nl/foliapy -> https://tools.dev.clariah.nl/foliapy/2.5.11

[CODEMETA VALIDATION (foliapy)] done

[CODEMETA ENRICHMENT (foliapy)] Guessing interface type https://w3id.org/software-types#SoftwareLibrary based on clues

[CODEMETA ENRICHMENT (foliapy)] automatically adding programmingLanguage Python derived from runtimePlatform Python

[CODEMETA ENRICHMENT (foliapy)] adding author https://tools.dev.clariah.nl/person/maarten-van-gompel as contributor

[CODEMETA ENRICHMENT (foliapy)] adding affiliation(s) of first author as producer

VALIDATION https://tools.dev.clariah.nl/foliapy/2.5.11 #1: Info: Reference publications *SHOULD* be expressed, if any (This is missing in the metadata)

VALIDATION https://tools.dev.clariah.nl/foliapy/2.5.11 #2: Info: The funder *SHOULD* be acknowledged (This is missing in the metadata)

-- end log --

[harvester info] Output written to /tmp/out/foliapy.codemeta.json

[harvester info] <-- Finished processing foliapy (https://github.com/proycon/foliapy) [Mon Sep 16 03:06:46 UTC 2024]

        

Metadata Properties

Version
2.5.11 (release notes)
Interface types
  • Software Library
Software website
Source code repository
 https://github.com/proycon/foliapy  Stars are an indicator of the popularity of this project on GitHub
Category
  • Annotating
  • https://w3id.org/nwo-research-fields#ComputationalLinguisticsandPhilology
  • Textual and linguistic corpora
Keywords
  • annotation
  • computational linguistics
  • folia
  • format
  • nlp
  • xml
Development Status
  • 9 - Proven: Technology complete and proven in practice by real users.
  • Active: The project has reached a stable, usable state and is being actively developed.
Issue Tracker (Support)
https://github.com/proycon/foliapy/issues  The number of open issues on the issue tracker  The number of closes issues on the issue tracker
Documentation
License
Author(s)
Maintainer(s)
Contributor(s)
Producer
  •   KNAW Humanities Cluster & CLST, Radboud University
Programming Language
  • Python
Continuous Integration Tests
https://github.com/proycon/foliapy/actions/workflows/foliapy.yml
Runtime Platform
  • Python 3
Operating System
  • BSD
  • Linux
  • macOS
Software dependencies
  • lxml
  • rdflib
  • requests
Metadata validation
★ ★ ★ ★ ☆
Created
2010-05-27 19:17:40 +0000
Last modified
2024-03-28 17:23:49 +0100  Last commit (main branch). Gives an indication of project development activity and rough indication of how up-to-date the latest release is.  Number of commits since the last release. Gives an indication of project development activity and rough indication of how up-to-date the latest release is.