Skip to content

Class: EveryEvalAIResult

An evaluation result from the Every Eval Ever dataset, capturing evaluation metadata and results from the EEE_datastore.

URI: nexus:everyevalairesult

 classDiagram
    class EveryEvalAIResult
    click EveryEvalAIResult href "../EveryEvalAIResult/"
      AiEvalResult <|-- EveryEvalAIResult
        click AiEvalResult href "../AiEvalResult/"

      EveryEvalAIResult : broad_mappings





        EveryEvalAIResult --> "*" Any : broad_mappings
        click Any href "../Any/"



      EveryEvalAIResult : close_mappings





        EveryEvalAIResult --> "*" Any : close_mappings
        click Any href "../Any/"



      EveryEvalAIResult : dateCreated

      EveryEvalAIResult : dateModified

      EveryEvalAIResult : description

      EveryEvalAIResult : evaluation_id

      EveryEvalAIResult : evaluation_timestamp

      EveryEvalAIResult : evidence

      EveryEvalAIResult : exact_mappings





        EveryEvalAIResult --> "*" Any : exact_mappings
        click Any href "../Any/"



      EveryEvalAIResult : hasAudience

      EveryEvalAIResult : hasDataFormat

      EveryEvalAIResult : hasDataSize

      EveryEvalAIResult : hasDataSource

      EveryEvalAIResult : hasDataType

      EveryEvalAIResult : hasDocumentation





        EveryEvalAIResult --> "*" Documentation : hasDocumentation
        click Documentation href "../Documentation/"



      EveryEvalAIResult : hasDomains

      EveryEvalAIResult : hasEvaluationResults





        EveryEvalAIResult --> "*" EvaluationResultRecord : hasEvaluationResults
        click EvaluationResultRecord href "../EvaluationResultRecord/"



      EveryEvalAIResult : hasGoal

      EveryEvalAIResult : hasLanguages

      EveryEvalAIResult : hasLimitations

      EveryEvalAIResult : hasMethods

      EveryEvalAIResult : hasMetrics

      EveryEvalAIResult : hasModelInfo





        EveryEvalAIResult --> "0..1" ModelInfo : hasModelInfo
        click ModelInfo href "../ModelInfo/"



      EveryEvalAIResult : hasRelatedRisk





        EveryEvalAIResult --> "*" Risk : hasRelatedRisk
        click Risk href "../Risk/"



      EveryEvalAIResult : hasResources

      EveryEvalAIResult : hasSourceMetadata





        EveryEvalAIResult --> "0..1" SourceMetadata : hasSourceMetadata
        click SourceMetadata href "../SourceMetadata/"



      EveryEvalAIResult : hasTasks

      EveryEvalAIResult : id

      EveryEvalAIResult : isCategorizedAs





        EveryEvalAIResult --> "*" Any : isCategorizedAs
        click Any href "../Any/"



      EveryEvalAIResult : isResultOf





        EveryEvalAIResult --> "0..1" AiEval : isResultOf
        click AiEval href "../AiEval/"



      EveryEvalAIResult : name

      EveryEvalAIResult : narrow_mappings





        EveryEvalAIResult --> "*" Any : narrow_mappings
        click Any href "../Any/"



      EveryEvalAIResult : related_mappings





        EveryEvalAIResult --> "*" Any : related_mappings
        click Any href "../Any/"



      EveryEvalAIResult : retrieved_timestamp

      EveryEvalAIResult : schema_version

      EveryEvalAIResult : url

      EveryEvalAIResult : value

Inheritance

Class Properties

Property Value
Class URI nexus:everyevalairesult

Slots

Name Cardinality and Range Description Inheritance
hasSourceMetadata 0..1
SourceMetadata
Source metadata for the evaluation direct
hasModelInfo 0..1
ModelInfo
Model information for the evaluation direct
hasEvaluationResults *
EvaluationResultRecord
Array of evaluation results direct
hasDataType *
String
The type of data used in the benchmark (e direct
hasDomains *
String
The specific domains or areas where the benchmark is applied (e direct
hasLanguages *
String
The languages included in the dataset used by the benchmark (e direct
hasTasks *
String
The tasks or evaluations the benchmark is intended to assess direct
hasDataSource *
String
The origin or source of the data used in the benchmark (e direct
hasDataSize 0..1
String
The size of the dataset, including the number of data points or examples direct
hasDataFormat *
String
The structure and modality of the data (e direct
hasMethods *
String
The evaluation techniques applied within the benchmark direct
hasMetrics *
String
The specific performance metrics used to assess models (e direct
hasLimitations *
String
Limitations in evaluating or addressing risks, such as gaps in demographic co... direct
hasGoal 0..1
String
The specific goal or primary use case the benchmark is designed for direct
hasAudience *
String
The intended audience, such as researchers, developers, policymakers, etc direct
hasResources *
String
Links to relevant resources, such as repositories or papers related to the be... direct
hasDocumentation *
Documentation
Indicates documentation associated with an entity direct
hasRelatedRisk *
Term or 
Risk or 
RiskConcept
A relationship where an entity relates to a risk direct
schema_version 0..1
String
Version of the evaluation schema direct
evaluation_id 0..1
String
Unique identifier for this evaluation direct
evaluation_timestamp 0..1
Datetime
ISO 8601 timestamp when evaluation was performed direct
retrieved_timestamp 0..1
String
Unix timestamp when the data was retrieved direct
isResultOf 0..1
AiEval
A relationship indicating that an entity is the result of an AI evaluation AiEvalResult
value 1
String
Some numeric or string value Fact
evidence 0..1
String
Evidence provides a source (typical a chunk, paragraph or link) describing wh... Fact
id 1
String
A unique identifier to this instance of the model element Entity
name 0..1
String
A text name of this instance Entity
description 0..1
String
The description of an entity Entity
url 0..1
Uri
An optional URL associated with this instance Entity
dateCreated 0..1
Date
The date on which the entity was created Entity
dateModified 0..1
Date
The date on which the entity was most recently modified Entity
exact_mappings *
Any
The property is used to link two concepts, indicating a high degree of confid... Entity
close_mappings *
Any
The property is used to link two concepts that are sufficiently similar that ... Entity
related_mappings *
Any
The property skos:relatedMatch is used to state an associative mapping link b... Entity
narrow_mappings *
Any
The property is used to state a hierarchical mapping link between two concept... Entity
broad_mappings *
Any
The property is used to state a hierarchical mapping link between two concept... Entity
isCategorizedAs *
Any
A relationship where an entity has been deemed to be categorized Entity

Usages

used by used in type used
EveryEvalAIResult hasSourceMetadata domain EveryEvalAIResult
EveryEvalAIResult hasModelInfo domain EveryEvalAIResult
EveryEvalAIResult hasEvaluationResults domain EveryEvalAIResult

Identifier and Mapping Information

Schema Source

  • from schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology

Mappings

Mapping Type Mapped Value
self nexus:everyevalairesult
native nexus:EveryEvalAIResult

LinkML Source

Direct

```yaml name: EveryEvalAIResult description: An evaluation result from the Every Eval Ever dataset, capturing evaluation metadata and results from the EEE_datastore. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology is_a: AiEvalResult slots: - hasSourceMetadata - hasModelInfo - hasEvaluationResults - hasDataType - hasDomains - hasLanguages - hasTasks - hasDataSource - hasDataSize - hasDataFormat - hasMethods - hasMetrics - hasLimitations - hasGoal - hasAudience - hasResources - hasDocumentation - hasRelatedRisk attributes: schema_version: name: schema_version description: Version of the evaluation schema from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai_eval rank: 1000 domain_of: - EveryEvalAIResult range: string evaluation_id: name: evaluation_id description: Unique identifier for this evaluation from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai_eval rank: 1000 domain_of: - EveryEvalAIResult range: string evaluation_timestamp: name: evaluation_timestamp description: ISO 8601 timestamp when evaluation was performed from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai_eval rank: 1000 domain_of: - EveryEvalAIResult range: datetime retrieved_timestamp: name: retrieved_timestamp description: Unix timestamp when the data was retrieved from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai_eval rank: 1000 domain_of: - EveryEvalAIResult range: string class_uri: nexus:everyevalairesult ````

Induced

```yaml name: EveryEvalAIResult description: An evaluation result from the Every Eval Ever dataset, capturing evaluation metadata and results from the EEE_datastore. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology is_a: AiEvalResult attributes: schema_version: name: schema_version description: Version of the evaluation schema from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai_eval rank: 1000 alias: schema_version owner: EveryEvalAIResult domain_of: - EveryEvalAIResult range: string evaluation_id: name: evaluation_id description: Unique identifier for this evaluation from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai_eval rank: 1000 alias: evaluation_id owner: EveryEvalAIResult domain_of: - EveryEvalAIResult range: string evaluation_timestamp: name: evaluation_timestamp description: ISO 8601 timestamp when evaluation was performed from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai_eval rank: 1000 alias: evaluation_timestamp owner: EveryEvalAIResult domain_of: - EveryEvalAIResult range: datetime retrieved_timestamp: name: retrieved_timestamp description: Unix timestamp when the data was retrieved from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai_eval rank: 1000 alias: retrieved_timestamp owner: EveryEvalAIResult domain_of: - EveryEvalAIResult range: string hasSourceMetadata: name: hasSourceMetadata description: Source metadata for the evaluation from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 domain: EveryEvalAIResult alias: hasSourceMetadata owner: EveryEvalAIResult domain_of: - EveryEvalAIResult range: SourceMetadata inlined: true hasModelInfo: name: hasModelInfo description: Model information for the evaluation from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 domain: EveryEvalAIResult alias: hasModelInfo owner: EveryEvalAIResult domain_of: - EveryEvalAIResult range: ModelInfo inlined: true hasEvaluationResults: name: hasEvaluationResults description: Array of evaluation results from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 domain: EveryEvalAIResult alias: hasEvaluationResults owner: EveryEvalAIResult domain_of: - EveryEvalAIResult range: EvaluationResultRecord multivalued: true inlined: true hasDataType: name: hasDataType description: The type of data used in the benchmark (e.g., text, images, or multi-modal) from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 alias: hasDataType owner: EveryEvalAIResult domain_of: - EveryEvalAIResult - BenchmarkMetadataCard range: string multivalued: true hasDomains: name: hasDomains description: The specific domains or areas where the benchmark is applied (e.g., natural language processing, computer vision). from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 alias: hasDomains owner: EveryEvalAIResult domain_of: - EveryEvalAIResult - BenchmarkMetadataCard range: string multivalued: true hasLanguages: name: hasLanguages description: The languages included in the dataset used by the benchmark (e.g., English, multilingual). from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 alias: hasLanguages owner: EveryEvalAIResult domain_of: - EveryEvalAIResult - BenchmarkMetadataCard range: string multivalued: true hasTasks: name: hasTasks description: The tasks or evaluations the benchmark is intended to assess. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 alias: hasTasks owner: EveryEvalAIResult domain_of: - AiEval - EveryEvalAIResult - BenchmarkMetadataCard range: string multivalued: true inlined: false hasDataSource: name: hasDataSource description: The origin or source of the data used in the benchmark (e.g., curated datasets, user submissions). from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 alias: hasDataSource owner: EveryEvalAIResult domain_of: - EveryEvalAIResult - BenchmarkMetadataCard range: string multivalued: true hasDataSize: name: hasDataSize description: The size of the dataset, including the number of data points or examples. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 alias: hasDataSize owner: EveryEvalAIResult domain_of: - EveryEvalAIResult - BenchmarkMetadataCard range: string hasDataFormat: name: hasDataFormat description: The structure and modality of the data (e.g., sentence pairs, question-answer format, tabular data). from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 alias: hasDataFormat owner: EveryEvalAIResult domain_of: - EveryEvalAIResult - BenchmarkMetadataCard range: string multivalued: true hasMethods: name: hasMethods description: The evaluation techniques applied within the benchmark. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 alias: hasMethods owner: EveryEvalAIResult domain_of: - EveryEvalAIResult - BenchmarkMetadataCard range: string multivalued: true hasMetrics: name: hasMetrics description: The specific performance metrics used to assess models (e.g., accuracy, F1 score, precision, recall). from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 alias: hasMetrics owner: EveryEvalAIResult domain_of: - EveryEvalAIResult - BenchmarkMetadataCard range: string multivalued: true hasLimitations: name: hasLimitations description: Limitations in evaluating or addressing risks, such as gaps in demographic coverage or specific domains. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 alias: hasLimitations owner: EveryEvalAIResult domain_of: - EveryEvalAIResult - BenchmarkMetadataCard range: string multivalued: true hasGoal: name: hasGoal description: The specific goal or primary use case the benchmark is designed for. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 alias: hasGoal owner: EveryEvalAIResult domain_of: - EveryEvalAIResult - BenchmarkMetadataCard range: string hasAudience: name: hasAudience description: The intended audience, such as researchers, developers, policymakers, etc. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 alias: hasAudience owner: EveryEvalAIResult domain_of: - EveryEvalAIResult - BenchmarkMetadataCard range: string multivalued: true hasResources: name: hasResources description: Links to relevant resources, such as repositories or papers related to the benchmark. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 alias: hasResources owner: EveryEvalAIResult domain_of: - EveryEvalAIResult - BenchmarkMetadataCard range: string multivalued: true hasDocumentation: name: hasDocumentation description: Indicates documentation associated with an entity. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 slot_uri: airo:hasDocumentation alias: hasDocumentation owner: EveryEvalAIResult domain_of: - Dataset - Vocabulary - Taxonomy - Concept - Group - Entry - Term - Principle - RiskTaxonomy - RiskControlGroupTaxonomy - Action - BaseAi - LargeLanguageModelFamily - AiEval - EveryEvalAIResult - BenchmarkMetadataCard - Adapter - LLMIntrinsic range: Documentation multivalued: true inlined: false hasRelatedRisk: name: hasRelatedRisk description: A relationship where an entity relates to a risk from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 domain: Any alias: hasRelatedRisk owner: EveryEvalAIResult domain_of: - Term - LLMQuestionPolicy - Action - AiSystem - AiEval - EveryEvalAIResult - BenchmarkMetadataCard - Adapter - LLMIntrinsic range: Risk multivalued: true inlined: false any_of: - range: RiskConcept - range: Term isResultOf: name: isResultOf description: A relationship indicating that an entity is the result of an AI evaluation. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 slot_uri: dqv:isMeasurementOf alias: isResultOf owner: EveryEvalAIResult domain_of: - AiEvalResult range: AiEval multivalued: false inlined: false value: name: value description: Some numeric or string value from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 alias: value owner: EveryEvalAIResult domain_of: - Fact range: string required: true evidence: name: evidence description: Evidence provides a source (typical a chunk, paragraph or link) describing where some value was found or how it was generated. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 alias: evidence owner: EveryEvalAIResult domain_of: - Fact range: string id: name: id description: A unique identifier to this instance of the model element. Example identifiers include UUID, URI, URN, etc. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 slot_uri: schema:identifier identifier: true alias: id owner: EveryEvalAIResult domain_of: - Entity range: string required: true name: name: name description: A text name of this instance. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 slot_uri: schema:name alias: name owner: EveryEvalAIResult domain_of: - Entity - BenchmarkMetadataCard range: string description: name: description description: The description of an entity from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 slot_uri: schema:description alias: description owner: EveryEvalAIResult domain_of: - Entity range: string url: name: url description: An optional URL associated with this instance. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 slot_uri: schema:url alias: url owner: EveryEvalAIResult domain_of: - Entity range: uri dateCreated: name: dateCreated description: The date on which the entity was created. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 slot_uri: schema:dateCreated alias: dateCreated owner: EveryEvalAIResult domain_of: - Entity range: date required: false dateModified: name: dateModified description: The date on which the entity was most recently modified. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 slot_uri: schema:dateModified alias: dateModified owner: EveryEvalAIResult domain_of: - Entity range: date required: false exact_mappings: name: exact_mappings description: The property is used to link two concepts, indicating a high degree of confidence that the concepts can be used interchangeably across a wide range of information retrieval applications from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 slot_uri: skos:exactMatch alias: exact_mappings owner: EveryEvalAIResult domain_of: - Entity range: Any multivalued: true inlined: false close_mappings: name: close_mappings description: The property is used to link two concepts that are sufficiently similar that they can be used interchangeably in some information retrieval applications. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 slot_uri: skos:closeMatch alias: close_mappings owner: EveryEvalAIResult domain_of: - Entity range: Any multivalued: true inlined: false related_mappings: name: related_mappings description: The property skos:relatedMatch is used to state an associative mapping link between two concepts. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 slot_uri: skos:relatedMatch alias: related_mappings owner: EveryEvalAIResult domain_of: - Entity range: Any multivalued: true inlined: false narrow_mappings: name: narrow_mappings description: The property is used to state a hierarchical mapping link between two concepts, indicating that the concept linked to, is a narrower concept than the originating concept. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 slot_uri: skos:narrowMatch alias: narrow_mappings owner: EveryEvalAIResult domain_of: - Entity range: Any multivalued: true inlined: false broad_mappings: name: broad_mappings description: The property is used to state a hierarchical mapping link between two concepts, indicating that the concept linked to, is a broader concept than the originating concept. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 slot_uri: skos:broadMatch alias: broad_mappings owner: EveryEvalAIResult domain_of: - Entity range: Any multivalued: true inlined: false isCategorizedAs: name: isCategorizedAs description: A relationship where an entity has been deemed to be categorized from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 slot_uri: nexus:isCategorizedAs alias: isCategorizedAs owner: EveryEvalAIResult domain_of: - Entity range: Any multivalued: true inlined: false class_uri: nexus:everyevalairesult ````