Class: EveryEvalAIResult
An evaluation result from the Every Eval Ever dataset, capturing evaluation metadata and results from the EEE_datastore.
classDiagram
class EveryEvalAIResult
click EveryEvalAIResult href "../EveryEvalAIResult/"
AiEvalResult <|-- EveryEvalAIResult
click AiEvalResult href "../AiEvalResult/"
EveryEvalAIResult : broad_mappings
EveryEvalAIResult --> "*" Any : broad_mappings
click Any href "../Any/"
EveryEvalAIResult : close_mappings
EveryEvalAIResult --> "*" Any : close_mappings
click Any href "../Any/"
EveryEvalAIResult : dateCreated
EveryEvalAIResult : dateModified
EveryEvalAIResult : description
EveryEvalAIResult : evaluation_id
EveryEvalAIResult : evaluation_timestamp
EveryEvalAIResult : evidence
EveryEvalAIResult : exact_mappings
EveryEvalAIResult --> "*" Any : exact_mappings
click Any href "../Any/"
EveryEvalAIResult : hasAudience
EveryEvalAIResult : hasDataFormat
EveryEvalAIResult : hasDataSize
EveryEvalAIResult : hasDataSource
EveryEvalAIResult : hasDataType
EveryEvalAIResult : hasDocumentation
EveryEvalAIResult --> "*" Documentation : hasDocumentation
click Documentation href "../Documentation/"
EveryEvalAIResult : hasDomains
EveryEvalAIResult : hasEvaluationResults
EveryEvalAIResult --> "*" EvaluationResultRecord : hasEvaluationResults
click EvaluationResultRecord href "../EvaluationResultRecord/"
EveryEvalAIResult : hasGoal
EveryEvalAIResult : hasLanguages
EveryEvalAIResult : hasLimitations
EveryEvalAIResult : hasMethods
EveryEvalAIResult : hasMetrics
EveryEvalAIResult : hasModelInfo
EveryEvalAIResult --> "0..1" ModelInfo : hasModelInfo
click ModelInfo href "../ModelInfo/"
EveryEvalAIResult : hasRelatedRisk
EveryEvalAIResult --> "*" Risk : hasRelatedRisk
click Risk href "../Risk/"
EveryEvalAIResult : hasResources
EveryEvalAIResult : hasSourceMetadata
EveryEvalAIResult --> "0..1" SourceMetadata : hasSourceMetadata
click SourceMetadata href "../SourceMetadata/"
EveryEvalAIResult : hasTasks
EveryEvalAIResult : id
EveryEvalAIResult : isCategorizedAs
EveryEvalAIResult --> "*" Any : isCategorizedAs
click Any href "../Any/"
EveryEvalAIResult : isResultOf
EveryEvalAIResult --> "0..1" AiEval : isResultOf
click AiEval href "../AiEval/"
EveryEvalAIResult : name
EveryEvalAIResult : narrow_mappings
EveryEvalAIResult --> "*" Any : narrow_mappings
click Any href "../Any/"
EveryEvalAIResult : related_mappings
EveryEvalAIResult --> "*" Any : related_mappings
click Any href "../Any/"
EveryEvalAIResult : retrieved_timestamp
EveryEvalAIResult : schema_version
EveryEvalAIResult : url
EveryEvalAIResult : value
Inheritance
- Entity
- AiEvalResult [ Fact]
- EveryEvalAIResult
Class Properties
| Property | Value |
|---|---|
| Class URI | nexus:everyevalairesult |
Slots
| Name | Cardinality and Range | Description | Inheritance |
|---|---|---|---|
| hasSourceMetadata | 0..1 SourceMetadata |
Source metadata for the evaluation | direct |
| hasModelInfo | 0..1 ModelInfo |
Model information for the evaluation | direct |
| hasEvaluationResults | * EvaluationResultRecord |
Array of evaluation results | direct |
| hasDataType | * String |
The type of data used in the benchmark (e | direct |
| hasDomains | * String |
The specific domains or areas where the benchmark is applied (e | direct |
| hasLanguages | * String |
The languages included in the dataset used by the benchmark (e | direct |
| hasTasks | * String |
The tasks or evaluations the benchmark is intended to assess | direct |
| hasDataSource | * String |
The origin or source of the data used in the benchmark (e | direct |
| hasDataSize | 0..1 String |
The size of the dataset, including the number of data points or examples | direct |
| hasDataFormat | * String |
The structure and modality of the data (e | direct |
| hasMethods | * String |
The evaluation techniques applied within the benchmark | direct |
| hasMetrics | * String |
The specific performance metrics used to assess models (e | direct |
| hasLimitations | * String |
Limitations in evaluating or addressing risks, such as gaps in demographic co... | direct |
| hasGoal | 0..1 String |
The specific goal or primary use case the benchmark is designed for | direct |
| hasAudience | * String |
The intended audience, such as researchers, developers, policymakers, etc | direct |
| hasResources | * String |
Links to relevant resources, such as repositories or papers related to the be... | direct |
| hasDocumentation | * Documentation |
Indicates documentation associated with an entity | direct |
| hasRelatedRisk | * Term or Risk or RiskConcept |
A relationship where an entity relates to a risk | direct |
| schema_version | 0..1 String |
Version of the evaluation schema | direct |
| evaluation_id | 0..1 String |
Unique identifier for this evaluation | direct |
| evaluation_timestamp | 0..1 Datetime |
ISO 8601 timestamp when evaluation was performed | direct |
| retrieved_timestamp | 0..1 String |
Unix timestamp when the data was retrieved | direct |
| isResultOf | 0..1 AiEval |
A relationship indicating that an entity is the result of an AI evaluation | AiEvalResult |
| value | 1 String |
Some numeric or string value | Fact |
| evidence | 0..1 String |
Evidence provides a source (typical a chunk, paragraph or link) describing wh... | Fact |
| id | 1 String |
A unique identifier to this instance of the model element | Entity |
| name | 0..1 String |
A text name of this instance | Entity |
| description | 0..1 String |
The description of an entity | Entity |
| url | 0..1 Uri |
An optional URL associated with this instance | Entity |
| dateCreated | 0..1 Date |
The date on which the entity was created | Entity |
| dateModified | 0..1 Date |
The date on which the entity was most recently modified | Entity |
| exact_mappings | * Any |
The property is used to link two concepts, indicating a high degree of confid... | Entity |
| close_mappings | * Any |
The property is used to link two concepts that are sufficiently similar that ... | Entity |
| related_mappings | * Any |
The property skos:relatedMatch is used to state an associative mapping link b... | Entity |
| narrow_mappings | * Any |
The property is used to state a hierarchical mapping link between two concept... | Entity |
| broad_mappings | * Any |
The property is used to state a hierarchical mapping link between two concept... | Entity |
| isCategorizedAs | * Any |
A relationship where an entity has been deemed to be categorized | Entity |
Usages
| used by | used in | type | used |
|---|---|---|---|
| EveryEvalAIResult | hasSourceMetadata | domain | EveryEvalAIResult |
| EveryEvalAIResult | hasModelInfo | domain | EveryEvalAIResult |
| EveryEvalAIResult | hasEvaluationResults | domain | EveryEvalAIResult |
Identifier and Mapping Information
Schema Source
- from schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology
Mappings
| Mapping Type | Mapped Value |
|---|---|
| self | nexus:everyevalairesult |
| native | nexus:EveryEvalAIResult |
LinkML Source
Direct
```yaml
name: EveryEvalAIResult
description: An evaluation result from the Every Eval Ever dataset, capturing evaluation
metadata and results from the EEE_datastore.
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology
is_a: AiEvalResult
slots:
- hasSourceMetadata
- hasModelInfo
- hasEvaluationResults
- hasDataType
- hasDomains
- hasLanguages
- hasTasks
- hasDataSource
- hasDataSize
- hasDataFormat
- hasMethods
- hasMetrics
- hasLimitations
- hasGoal
- hasAudience
- hasResources
- hasDocumentation
- hasRelatedRisk
attributes:
schema_version:
name: schema_version
description: Version of the evaluation schema
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai_eval
rank: 1000
domain_of:
- EveryEvalAIResult
range: string
evaluation_id:
name: evaluation_id
description: Unique identifier for this evaluation
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai_eval
rank: 1000
domain_of:
- EveryEvalAIResult
range: string
evaluation_timestamp:
name: evaluation_timestamp
description: ISO 8601 timestamp when evaluation was performed
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai_eval
rank: 1000
domain_of:
- EveryEvalAIResult
range: datetime
retrieved_timestamp:
name: retrieved_timestamp
description: Unix timestamp when the data was retrieved
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai_eval
rank: 1000
domain_of:
- EveryEvalAIResult
range: string
class_uri: nexus:everyevalairesult
````
Induced
```yaml
name: EveryEvalAIResult
description: An evaluation result from the Every Eval Ever dataset, capturing evaluation
metadata and results from the EEE_datastore.
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology
is_a: AiEvalResult
attributes:
schema_version:
name: schema_version
description: Version of the evaluation schema
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai_eval
rank: 1000
alias: schema_version
owner: EveryEvalAIResult
domain_of:
- EveryEvalAIResult
range: string
evaluation_id:
name: evaluation_id
description: Unique identifier for this evaluation
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai_eval
rank: 1000
alias: evaluation_id
owner: EveryEvalAIResult
domain_of:
- EveryEvalAIResult
range: string
evaluation_timestamp:
name: evaluation_timestamp
description: ISO 8601 timestamp when evaluation was performed
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai_eval
rank: 1000
alias: evaluation_timestamp
owner: EveryEvalAIResult
domain_of:
- EveryEvalAIResult
range: datetime
retrieved_timestamp:
name: retrieved_timestamp
description: Unix timestamp when the data was retrieved
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai_eval
rank: 1000
alias: retrieved_timestamp
owner: EveryEvalAIResult
domain_of:
- EveryEvalAIResult
range: string
hasSourceMetadata:
name: hasSourceMetadata
description: Source metadata for the evaluation
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology
rank: 1000
domain: EveryEvalAIResult
alias: hasSourceMetadata
owner: EveryEvalAIResult
domain_of:
- EveryEvalAIResult
range: SourceMetadata
inlined: true
hasModelInfo:
name: hasModelInfo
description: Model information for the evaluation
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology
rank: 1000
domain: EveryEvalAIResult
alias: hasModelInfo
owner: EveryEvalAIResult
domain_of:
- EveryEvalAIResult
range: ModelInfo
inlined: true
hasEvaluationResults:
name: hasEvaluationResults
description: Array of evaluation results
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology
rank: 1000
domain: EveryEvalAIResult
alias: hasEvaluationResults
owner: EveryEvalAIResult
domain_of:
- EveryEvalAIResult
range: EvaluationResultRecord
multivalued: true
inlined: true
hasDataType:
name: hasDataType
description: The type of data used in the benchmark (e.g., text, images, or multi-modal)
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology
rank: 1000
alias: hasDataType
owner: EveryEvalAIResult
domain_of:
- EveryEvalAIResult
- BenchmarkMetadataCard
range: string
multivalued: true
hasDomains:
name: hasDomains
description: The specific domains or areas where the benchmark is applied (e.g.,
natural language processing, computer vision).
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology
rank: 1000
alias: hasDomains
owner: EveryEvalAIResult
domain_of:
- EveryEvalAIResult
- BenchmarkMetadataCard
range: string
multivalued: true
hasLanguages:
name: hasLanguages
description: The languages included in the dataset used by the benchmark (e.g.,
English, multilingual).
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology
rank: 1000
alias: hasLanguages
owner: EveryEvalAIResult
domain_of:
- EveryEvalAIResult
- BenchmarkMetadataCard
range: string
multivalued: true
hasTasks:
name: hasTasks
description: The tasks or evaluations the benchmark is intended to assess.
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology
rank: 1000
alias: hasTasks
owner: EveryEvalAIResult
domain_of:
- AiEval
- EveryEvalAIResult
- BenchmarkMetadataCard
range: string
multivalued: true
inlined: false
hasDataSource:
name: hasDataSource
description: The origin or source of the data used in the benchmark (e.g., curated
datasets, user submissions).
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology
rank: 1000
alias: hasDataSource
owner: EveryEvalAIResult
domain_of:
- EveryEvalAIResult
- BenchmarkMetadataCard
range: string
multivalued: true
hasDataSize:
name: hasDataSize
description: The size of the dataset, including the number of data points or examples.
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology
rank: 1000
alias: hasDataSize
owner: EveryEvalAIResult
domain_of:
- EveryEvalAIResult
- BenchmarkMetadataCard
range: string
hasDataFormat:
name: hasDataFormat
description: The structure and modality of the data (e.g., sentence pairs, question-answer
format, tabular data).
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology
rank: 1000
alias: hasDataFormat
owner: EveryEvalAIResult
domain_of:
- EveryEvalAIResult
- BenchmarkMetadataCard
range: string
multivalued: true
hasMethods:
name: hasMethods
description: The evaluation techniques applied within the benchmark.
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology
rank: 1000
alias: hasMethods
owner: EveryEvalAIResult
domain_of:
- EveryEvalAIResult
- BenchmarkMetadataCard
range: string
multivalued: true
hasMetrics:
name: hasMetrics
description: The specific performance metrics used to assess models (e.g., accuracy,
F1 score, precision, recall).
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology
rank: 1000
alias: hasMetrics
owner: EveryEvalAIResult
domain_of:
- EveryEvalAIResult
- BenchmarkMetadataCard
range: string
multivalued: true
hasLimitations:
name: hasLimitations
description: Limitations in evaluating or addressing risks, such as gaps in demographic
coverage or specific domains.
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology
rank: 1000
alias: hasLimitations
owner: EveryEvalAIResult
domain_of:
- EveryEvalAIResult
- BenchmarkMetadataCard
range: string
multivalued: true
hasGoal:
name: hasGoal
description: The specific goal or primary use case the benchmark is designed for.
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology
rank: 1000
alias: hasGoal
owner: EveryEvalAIResult
domain_of:
- EveryEvalAIResult
- BenchmarkMetadataCard
range: string
hasAudience:
name: hasAudience
description: The intended audience, such as researchers, developers, policymakers,
etc.
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology
rank: 1000
alias: hasAudience
owner: EveryEvalAIResult
domain_of:
- EveryEvalAIResult
- BenchmarkMetadataCard
range: string
multivalued: true
hasResources:
name: hasResources
description: Links to relevant resources, such as repositories or papers related
to the benchmark.
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology
rank: 1000
alias: hasResources
owner: EveryEvalAIResult
domain_of:
- EveryEvalAIResult
- BenchmarkMetadataCard
range: string
multivalued: true
hasDocumentation:
name: hasDocumentation
description: Indicates documentation associated with an entity.
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology
rank: 1000
slot_uri: airo:hasDocumentation
alias: hasDocumentation
owner: EveryEvalAIResult
domain_of:
- Dataset
- Vocabulary
- Taxonomy
- Concept
- Group
- Entry
- Term
- Principle
- RiskTaxonomy
- RiskControlGroupTaxonomy
- Action
- BaseAi
- LargeLanguageModelFamily
- AiEval
- EveryEvalAIResult
- BenchmarkMetadataCard
- Adapter
- LLMIntrinsic
range: Documentation
multivalued: true
inlined: false
hasRelatedRisk:
name: hasRelatedRisk
description: A relationship where an entity relates to a risk
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology
rank: 1000
domain: Any
alias: hasRelatedRisk
owner: EveryEvalAIResult
domain_of:
- Term
- LLMQuestionPolicy
- Action
- AiSystem
- AiEval
- EveryEvalAIResult
- BenchmarkMetadataCard
- Adapter
- LLMIntrinsic
range: Risk
multivalued: true
inlined: false
any_of:
- range: RiskConcept
- range: Term
isResultOf:
name: isResultOf
description: A relationship indicating that an entity is the result of an AI evaluation.
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology
rank: 1000
slot_uri: dqv:isMeasurementOf
alias: isResultOf
owner: EveryEvalAIResult
domain_of:
- AiEvalResult
range: AiEval
multivalued: false
inlined: false
value:
name: value
description: Some numeric or string value
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology
rank: 1000
alias: value
owner: EveryEvalAIResult
domain_of:
- Fact
range: string
required: true
evidence:
name: evidence
description: Evidence provides a source (typical a chunk, paragraph or link) describing
where some value was found or how it was generated.
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology
rank: 1000
alias: evidence
owner: EveryEvalAIResult
domain_of:
- Fact
range: string
id:
name: id
description: A unique identifier to this instance of the model element. Example
identifiers include UUID, URI, URN, etc.
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology
rank: 1000
slot_uri: schema:identifier
identifier: true
alias: id
owner: EveryEvalAIResult
domain_of:
- Entity
range: string
required: true
name:
name: name
description: A text name of this instance.
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology
rank: 1000
slot_uri: schema:name
alias: name
owner: EveryEvalAIResult
domain_of:
- Entity
- BenchmarkMetadataCard
range: string
description:
name: description
description: The description of an entity
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology
rank: 1000
slot_uri: schema:description
alias: description
owner: EveryEvalAIResult
domain_of:
- Entity
range: string
url:
name: url
description: An optional URL associated with this instance.
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology
rank: 1000
slot_uri: schema:url
alias: url
owner: EveryEvalAIResult
domain_of:
- Entity
range: uri
dateCreated:
name: dateCreated
description: The date on which the entity was created.
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology
rank: 1000
slot_uri: schema:dateCreated
alias: dateCreated
owner: EveryEvalAIResult
domain_of:
- Entity
range: date
required: false
dateModified:
name: dateModified
description: The date on which the entity was most recently modified.
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology
rank: 1000
slot_uri: schema:dateModified
alias: dateModified
owner: EveryEvalAIResult
domain_of:
- Entity
range: date
required: false
exact_mappings:
name: exact_mappings
description: The property is used to link two concepts, indicating a high degree
of confidence that the concepts can be used interchangeably across a wide range
of information retrieval applications
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology
rank: 1000
slot_uri: skos:exactMatch
alias: exact_mappings
owner: EveryEvalAIResult
domain_of:
- Entity
range: Any
multivalued: true
inlined: false
close_mappings:
name: close_mappings
description: The property is used to link two concepts that are sufficiently similar
that they can be used interchangeably in some information retrieval applications.
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology
rank: 1000
slot_uri: skos:closeMatch
alias: close_mappings
owner: EveryEvalAIResult
domain_of:
- Entity
range: Any
multivalued: true
inlined: false
related_mappings:
name: related_mappings
description: The property skos:relatedMatch is used to state an associative mapping
link between two concepts.
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology
rank: 1000
slot_uri: skos:relatedMatch
alias: related_mappings
owner: EveryEvalAIResult
domain_of:
- Entity
range: Any
multivalued: true
inlined: false
narrow_mappings:
name: narrow_mappings
description: The property is used to state a hierarchical mapping link between
two concepts, indicating that the concept linked to, is a narrower concept than
the originating concept.
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology
rank: 1000
slot_uri: skos:narrowMatch
alias: narrow_mappings
owner: EveryEvalAIResult
domain_of:
- Entity
range: Any
multivalued: true
inlined: false
broad_mappings:
name: broad_mappings
description: The property is used to state a hierarchical mapping link between
two concepts, indicating that the concept linked to, is a broader concept than
the originating concept.
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology
rank: 1000
slot_uri: skos:broadMatch
alias: broad_mappings
owner: EveryEvalAIResult
domain_of:
- Entity
range: Any
multivalued: true
inlined: false
isCategorizedAs:
name: isCategorizedAs
description: A relationship where an entity has been deemed to be categorized
from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology
rank: 1000
slot_uri: nexus:isCategorizedAs
alias: isCategorizedAs
owner: EveryEvalAIResult
domain_of:
- Entity
range: Any
multivalued: true
inlined: false
class_uri: nexus:everyevalairesult
````