stam

STAM is a powerful library for dealing with stand-off annotations on text. This is the Rust library.
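
To make the idea of stand-off annotation concrete, the following is a minimal sketch in plain Python. It does not use the stam library or its API; the names `Span`, `Annotation` and `select` are hypothetical and chosen only for illustration. The point it shows is that the text stays untouched while annotations live beside it and refer to it by character offsets.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Span:
    """Hypothetical offset pair: begin is inclusive, end is exclusive."""
    begin: int
    end: int


@dataclass
class Annotation:
    """Hypothetical key/value statement about a span, stored apart from the text."""
    target: Span
    key: str
    value: str


def select(text: str, span: Span) -> str:
    """Resolve a span against the unmodified text."""
    return text[span.begin:span.end]


text = "Hello world"

# The annotations never alter `text`; they only point into it.
annotations = [
    Annotation(Span(0, 5), "pos", "interjection"),
    Annotation(Span(6, 11), "pos", "noun"),
]

for a in annotations:
    print(f"{select(text, a.target)!r}: {a.key}={a.value}")
```

Because the annotations hold only offsets, several independent annotation layers can target the same text without conflicting markup.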

Provided tools & services

stam

Type
  • Software Library

Tool suite: STAM

The following closely related tools are in a tool suite together with stam:

  • Experimental: The technology is implemented and ready for experimental settings (beta), but requires further work and validation.
  • Active: The project has reached a stable, usable state and is being actively developed.

stam v1.1.0

Stand-off Text Annotation Model (STAM) is a data model for stand-off text annotation in which any information on a text is represented as an annotation. This repository contains the model's full specification, extensions, schemas, examples and documentation.
  • Annotating
  • Textual and content analysis
  • Textual and linguistic corpora
  • annotation
  • linguistics
  • stand-off
  • text
  • text-annotation
  • webannotation
Created: 2021-09-09
Modified: 2024-08-23
  • Software Library
  • 7 - Release Candidate: Technology ready enough and in initial use by end-users in intended scholarly environments. Further validation may be in progress.
  • Active: The project has reached a stable, usable state and is being actively developed.

stam 0.9.0

STAM is a library for dealing with standoff annotations on text. This is the Python binding.
  • Annotating
  • Textual and content analysis
  • Textual and linguistic corpora
  • annotation
  • linguistics
  • nlp
  • standoff
  • text-processing
Created: 2023-01-31
Modified: 2024-08-29
  • Command-line Application
  • 7 - Release Candidate: Technology ready enough and in initial use by end-users in intended scholarly environments. Further validation may be in progress.
  • Active: The project has reached a stable, usable state and is being actively developed.

stam-tools 0.8.0

Command-line tools for working with stand-off annotations on text (STAM)
  • Annotating
  • Textual and content analysis
  • Textual and linguistic corpora
  • annotation
  • linguistics
  • nlp
  • standoff
  • text-processing
Created: 2023-03-21
Modified: 2024-08-29

Citation

You can cite this software using the following citation generated from its metadata:

(2024). stam 0.15.0. KNAW Humanities Cluster.

Logs & Reviews

Name
Automatic software metadata validation report for stam 0.15.0
Author
  • codemetapy validator using software.ttl
Date
2024-09-16 03:14:39
Review
Please consult the CLARIAH Software Metadata Requirements at https://github.com/CLARIAH/clariah-plus/blob/main/requirements/software-metadata-requirements.md for an in-depth explanation of any found problems

Validation of stam 0.15.0 was successful (score=3/5), but there are some warnings which should be addressed:

1. Info: Software source code *SHOULD* link to a continuous integration service that builds the software and runs the software's tests (This is missing in the metadata)
2. Warning: Documentation *SHOULD* be expressed (The metadata does express this currently, but something is wrong in the way it is expressed. Is the type/class valid?)
3. Info: Reference publications *SHOULD* be expressed, if any (This is missing in the metadata)
Rating
★ ★ ★ ☆ ☆
(log file starts at Mon Sep 16 03:14:28 UTC 2024)

[harvester info] --> Processing stam-rust (https://github.com/annotation/stam-rust) [Mon Sep 16 03:14:28 UTC 2024]

[harvester info] Git updating cached clone of https://github.com/annotation/stam-rust...

[harvester info] Found release v0.15.0

[harvester info] Using 'v0.15.0'

[harvester info] Git reference: v0.15.0

[harvester info] Scanning directory /tmp/codemeta-harvester.cache/stam-rust for harvestable resources...

[harvester info] found codemeta-harvest.json for stam-rust (md5sum 47f7539dabba7c4fffebf125100d8e7a); values in here take precendence over (override) those in later detection stages

[harvester info] found Cargo.toml (rust) for stam-rust, converting to codemeta

[harvester info] Looking for license....

[harvester info] Found license GPL-3.0-only

[harvester info] Getting contributors from git...

[harvester info] No git contributors found

[harvester info] Getting top contributor from git...

[harvester info] Git top contributor  will be assigned as author (and maintainer) if none are found in the metadata

[harvester info] Extracting last and first commit date from git log....

[harvester info] Date created: 2023-01-03T17:56:32Z+0100, date modified: 2024-08-29T17:38:16Z+0200

[harvester info] Querying Github/GitLab API (https://github.com/annotation/stam-rust)

[harvester info] Adding URL for found README: README.md

[harvester info] Found releaseNotes

[harvester info] Querying Zenodo API for DOI (access token provided)...

[harvester info] Looking for TRL information in README.md...

[harvester info] Found TRL https://w3id.org/research-technology-readiness-levels#Level7ReleaseCandidate

[harvester info] Looking for repostatus information in README.md...

[harvester info] Found repostatus https://www.repostatus.org/#active

[harvester info] Looking for continuous integration information in README.md...

[harvester info] Looking for documentation links in README.md...

[harvester info] Scraping title from https://docs.rs/stam

[harvester info] Found documentation at https://docs.rs/stam : "name": "stam - Rust",

[harvester info] Scraping title from https://docs.rs/stam/

[harvester info] Found documentation at https://docs.rs/stam/ : "name": "stam - Rust",

[harvester info] Falling back to git tag (v0.15.0) if no version number is specified...

[harvester info] Inferring repostatus information from git activity (used only as a fallback if not explicitly provided)...

[harvester info] Inferred repostatus https://www.repostatus.org/#active

[harvester info] Looking for repostatus information in README.md in master branch...

[harvester info] Found repostatus (master branch) https://www.repostatus.org/#active

[harvester info] Setting group STAM

[harvester info] Reconciliating: codemetapy  --baseuri https://tools.dev.clariah.nl --baseuri https://tools.dev.clariah.nl --includecontext --addcontext https://w3id.org/nwo-research-fields --addcontext https://w3id.org/research-technology-readiness-levels --addcontextgraph https://vocabs.dariah.eu/rest/v1/tadirah/data?format=text/turtle --trl --identifier "stam-rust" --codeRepository "https://github.com/annotation/stam-rust" --validate /etc/software.ttl --released --enrich --textv "Please consult the CLARIAH Software Metadata Requirements at https://github.com/CLARIAH/clariah-plus/blob/main/requirements/software-metadata-requirements.md for an in-depth explanation of any found problems" -O /tmp/out/stam-rust.codemeta.json /tmp/codemeta-harvester.cache//tmp/99-version.stam-rust.codemeta.json /tmp/codemeta-harvester.cache//tmp/99-repostatus.stam-rust.codemeta.json /tmp/codemeta-harvester.cache//tmp/90-authors.stam-rust.codemeta.json /tmp/codemeta-harvester.cache//tmp/50-documentation.stam-rust.codemeta.json /tmp/codemeta-harvester.cache//tmp/43-releasenotes.stam-rust.codemeta.json /tmp/codemeta-harvester.cache//tmp/41-readme.stam-rust.codemeta.json /tmp/codemeta-harvester.cache//tmp/40-gitapi.stam-rust.codemeta.json /tmp/codemeta-harvester.cache//tmp/39-gitdate.stam-rust.codemeta.json /tmp/codemeta-harvester.cache//tmp/29-license.stam-rust.codemeta.json /tmp/codemeta-harvester.cache//tmp/23-rust.stam-rust.codemeta.json /tmp/codemeta-harvester.cache//tmp/11-trl.stam-rust.codemeta.json /tmp/codemeta-harvester.cache//tmp/11-repostatus.stam-rust.codemeta.json /tmp/codemeta-harvester.cache//tmp/10-harvest.stam-rust.codemeta.json /tmp/codemeta-harvester.cache//tmp/05-repostatus.stam-rust.codemeta.json /tmp/codemeta-harvester.cache//tmp/04-applicationSuite.stam-rust.codemeta.json 

-- begin log --

Passed 15 files/sources but specified 0 input types! Automatically guessing types...

Detected input types: [('/tmp/codemeta-harvester.cache//tmp/99-version.stam-rust.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/99-repostatus.stam-rust.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/90-authors.stam-rust.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/50-documentation.stam-rust.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/43-releasenotes.stam-rust.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/41-readme.stam-rust.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/40-gitapi.stam-rust.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/39-gitdate.stam-rust.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/29-license.stam-rust.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/23-rust.stam-rust.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/11-trl.stam-rust.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/11-repostatus.stam-rust.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/10-harvest.stam-rust.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/05-repostatus.stam-rust.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/04-applicationSuite.stam-rust.codemeta.json', 'json')]

Adding to contextgraph: /tmp/turtle

Initial URI automatically generated, may be overriden later: https://tools.dev.clariah.nl/stam-rust

Processing source #1 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/99-version.stam-rust.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.dev.clariah.nl/stam-rust

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/stam-rust)] processed 1 new triples, total is now 2

Processing source #2 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/99-repostatus.stam-rust.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.dev.clariah.nl/stam-rust

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/stam-rust)] processed 1 new triples, total is now 3

Processing source #3 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/90-authors.stam-rust.codemeta.json

    Found main resource with URI https://tools.dev.clariah.nl/stam-rust.topcontributor/snapshot

    Injected (possibly temporary) URI https://tools.dev.clariah.nl/stam-rust

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/stam-rust)] processed 1 new triples, total is now 3

Processing source #4 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/50-documentation.stam-rust.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.dev.clariah.nl/stam-rust

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/stam-rust)] processed 8 new triples, total is now 11

Processing source #5 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/43-releasenotes.stam-rust.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.dev.clariah.nl/stam-rust

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/stam-rust)] processed 2 new triples, total is now 13

Processing source #6 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/41-readme.stam-rust.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.dev.clariah.nl/stam-rust

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/stam-rust)] processed 1 new triples, total is now 14

Processing source #7 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/40-gitapi.stam-rust.codemeta.json

    Found main resource with URI https://tools.dev.clariah.nl/stam-rust/snapshot

    Injected (possibly temporary) URI https://tools.dev.clariah.nl/stam-rust

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/stam-rust)] processed 20 new triples, total is now 33

Processing source #8 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/39-gitdate.stam-rust.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.dev.clariah.nl/stam-rust

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/stam-rust)] overriding old http://schema.org/dateCreated (2023-01-03T12:34:52Z -> 2023-01-03T17:56:32Z+0100)

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/stam-rust)] overriding old http://schema.org/dateModified (2024-09-12T09:50:24Z -> 2024-08-29T17:38:16Z+0200)

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/stam-rust)] processed 2 new triples, total is now 33

Processing source #9 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/29-license.stam-rust.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.dev.clariah.nl/stam-rust

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/stam-rust)] overriding old http://schema.org/license (http://spdx.org/licenses/GPL-3.0-only -> GPL-3.0-only)

[CODEMETA CORRECTION (https://tools.dev.clariah.nl/stam-rust)] automatically converting license to spdx URI

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/stam-rust)] processed 1 new triples, total is now 33

Processing source #10 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/23-rust.stam-rust.codemeta.json

    Found main resource with URI https://tools.dev.clariah.nl/cargo.toml/0.15.0

    Injected (possibly temporary) URI https://tools.dev.clariah.nl/stam-rust

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/stam-rust)] overriding old http://schema.org/description (Programming library for the Standoff Text Annotation Model (STAM), written in Rust. This is the primary software library for STAM with a focus on performance. -> STAM is a powerful library for dealing with stand-off annotations on text. This is the Rust library.)

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/stam-rust)] overriding old http://schema.org/keywords (library -> annotation)

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/stam-rust)] overriding old http://schema.org/keywords (text -> annotation)

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/stam-rust)] overriding old http://schema.org/keywords (rust -> annotation)

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/stam-rust)] overriding old http://schema.org/name (stam-rust -> stam)

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/stam-rust)] overriding old https://codemeta.github.io/terms/readme (https://github.com/annotation/stam-rust/blob/v0.15.0//README.md -> https://tools.dev.clariah.nl/README.md)

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/stam-rust)] overriding old http://schema.org/softwareHelp (https://docs.rs/stam/ -> https://docs.rs/stam)

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/stam-rust)] overriding old http://schema.org/softwareHelp (https://docs.rs/stam -> https://docs.rs/stam)

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/stam-rust)] overriding old http://schema.org/url (https://annotation.github.io/stam -> https://github.com/annotation/stam)

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/stam-rust)] overriding old http://schema.org/version (v0.15.0 -> 0.15.0)

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/stam-rust)] processed 93 new triples, total is now 110

Processing source #11 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/11-trl.stam-rust.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.dev.clariah.nl/stam-rust

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/stam-rust)] processed 1 new triples, total is now 111

Processing source #12 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/11-repostatus.stam-rust.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.dev.clariah.nl/stam-rust

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/stam-rust)] processed 1 new triples, total is now 111

Processing source #13 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/10-harvest.stam-rust.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.dev.clariah.nl/stam-rust

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/stam-rust)] overriding old http://schema.org/producer (https://tools.dev.clariah.nl/org/annotation -> https://tools.dev.clariah.nl/stub/H64d680d4b989e80e)

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/stam-rust)] processed 15 new triples, total is now 125

Processing source #14 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/05-repostatus.stam-rust.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.dev.clariah.nl/stam-rust

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/stam-rust)] processed 1 new triples, total is now 125

Processing source #15 of 15

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/04-applicationSuite.stam-rust.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://tools.dev.clariah.nl/stam-rust

[CODEMETA COMPOSITION (https://tools.dev.clariah.nl/stam-rust)] processed 1 new triples, total is now 126

Remapping URI to (possibly) new identifier and version component: https://tools.dev.clariah.nl/stam-rust -> https://tools.dev.clariah.nl/stam-rust/0.15.0

[CODEMETA VALIDATION (stam-rust)] done

[CODEMETA ENRICHMENT (stam-rust)] Guessing interface type https://w3id.org/software-types#SoftwareLibrary based on clues

[CODEMETA ENRICHMENT (stam-rust)] adding author https://tools.dev.clariah.nl/person/maarten-van-gompel as contributor

[CODEMETA ENRICHMENT (stam-rust)] considering first author as maintainer

VALIDATION https://tools.dev.clariah.nl/stam-rust/0.15.0 #1: Info: Software source code *SHOULD* link to a continuous integration service that builds the software and runs the software's tests (This is missing in the metadata)

VALIDATION https://tools.dev.clariah.nl/stam-rust/0.15.0 #2: Warning: Documentation *SHOULD* be expressed (The metadata does express this currently, but something is wrong in the way it is expressed. Is the type/class valid?)

VALIDATION https://tools.dev.clariah.nl/stam-rust/0.15.0 #3: Info: Reference publications *SHOULD* be expressed, if any (This is missing in the metadata)

-- end log --

[harvester info] Output written to /tmp/out/stam-rust.codemeta.json

[harvester info] <-- Finished processing stam-rust (https://github.com/annotation/stam-rust) [Mon Sep 16 03:14:39 UTC 2024]
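
The composition steps in the log above process the sources in order, and a value from a later source replaces one set by an earlier source ("overriding old ..."). The sketch below illustrates that ordering rule only. It is not codemetapy's implementation; the function `compose` is hypothetical, and the assignment of the example values to particular source names is an assumption made for illustration.

```python
def compose(sources):
    """Merge (source_name, properties) pairs in order.

    A later source overrides a differing value set by an earlier one,
    and each override is recorded as a log message.
    """
    merged = {}
    log = []
    for _name, props in sources:
        for key, value in props.items():
            old = merged.get(key)
            if old is not None and old != value:
                log.append(f"overriding old {key} ({old} -> {value})")
            merged[key] = value
    return merged, log


# Values taken from the log; which source supplied which value is assumed.
sources = [
    ("99-version", {"version": "v0.15.0"}),
    ("40-gitapi", {"name": "stam-rust"}),
    ("23-rust", {"name": "stam", "version": "0.15.0"}),
]

merged, log = compose(sources)
for line in log:
    print(line)
print(merged)
```

Under this rule the order of the input files matters: sources passed later on the command line have the final say.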

        

Metadata Properties

Version
0.15.0 (release notes)
Interface types
  • Software Library
Software website
Source code repository
https://github.com/annotation/stam-rust
Category
  • Annotating
  • Textual and content analysis
  • Textual and linguistic corpora
Keywords
  • annotation
  • linguistics
  • nlp
  • standoff
  • text-processing
Development Status
  • 7 - Release Candidate: Technology ready enough and in initial use by end-users in intended scholarly environments. Further validation may be in progress.
  • Active: The project has reached a stable, usable state and is being actively developed.
Issue Tracker (Support)
https://github.com/annotation/stam-rust/issues
Documentation
https://docs.rs/stam
License
GPL-3.0-only
Author(s)
Maintainer(s)
Contributor(s)
Producer
Programming Language
  • Rust
Software dependencies
  • base16ct
  • chrono
  • csv
  • datasize
  • minicbor
  • nanoid
  • rayon
  • regex
  • sealed
  • serde
  • serde_json
  • serde_path_to_error
  • sha1
  • smallvec
Metadata validation
★ ★ ★ ☆ ☆
Created
2023-01-03 17:56:32 +0100
Last modified
2024-08-29 17:38:16 +0200
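
The harvester log reports these dates in the form `2023-01-03T17:56:32Z+0100`, which combines a `Z` with a numeric offset. A minimal sketch of reading that form with Python's standard library follows. It assumes the `Z` is a stray literal and that the numeric offset is the one that counts, as the Created and Last modified values above suggest; the function name is hypothetical.

```python
from datetime import datetime


def parse_harvester_timestamp(raw: str) -> datetime:
    """Parse a log timestamp, treating 'Z' as a literal and '+HHMM' as the UTC offset."""
    return datetime.strptime(raw, "%Y-%m-%dT%H:%M:%SZ%z")


created = parse_harvester_timestamp("2023-01-03T17:56:32Z+0100")

# Render in the style used by the Created / Last modified fields.
print(created.strftime("%Y-%m-%d %H:%M:%S %z"))
```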