Class: EveryEvalAIResult

An evaluation result from the Every Eval Ever dataset, capturing evaluation metadata and results from the EEE_datastore.

URI: nexus:everyevalairesult

 classDiagram
    class EveryEvalAIResult
    click EveryEvalAIResult href "../EveryEvalAIResult/"
      AiEvalResult <|-- EveryEvalAIResult
        click AiEvalResult href "../AiEvalResult/"

      EveryEvalAIResult : broad_mappings





        EveryEvalAIResult --> "*" Any : broad_mappings
        click Any href "../Any/"



      EveryEvalAIResult : close_mappings





        EveryEvalAIResult --> "*" Any : close_mappings
        click Any href "../Any/"



      EveryEvalAIResult : dateCreated

      EveryEvalAIResult : dateModified

      EveryEvalAIResult : description

      EveryEvalAIResult : evaluation_id

      EveryEvalAIResult : evaluation_timestamp

      EveryEvalAIResult : evidence

      EveryEvalAIResult : exact_mappings





        EveryEvalAIResult --> "*" Any : exact_mappings
        click Any href "../Any/"



      EveryEvalAIResult : hasAudience

      EveryEvalAIResult : hasDataFormat

      EveryEvalAIResult : hasDataSize

      EveryEvalAIResult : hasDataSource

      EveryEvalAIResult : hasDataType

      EveryEvalAIResult : hasDocumentation





        EveryEvalAIResult --> "*" Documentation : hasDocumentation
        click Documentation href "../Documentation/"



      EveryEvalAIResult : hasDomains

      EveryEvalAIResult : hasEvaluationResults





        EveryEvalAIResult --> "*" EvaluationResultRecord : hasEvaluationResults
        click EvaluationResultRecord href "../EvaluationResultRecord/"



      EveryEvalAIResult : hasGoal

      EveryEvalAIResult : hasLanguages

      EveryEvalAIResult : hasLimitations

      EveryEvalAIResult : hasMethods

      EveryEvalAIResult : hasMetrics

      EveryEvalAIResult : hasModelInfo





        EveryEvalAIResult --> "0..1" ModelInfo : hasModelInfo
        click ModelInfo href "../ModelInfo/"



      EveryEvalAIResult : hasRelatedRisk





        EveryEvalAIResult --> "*" Risk : hasRelatedRisk
        click Risk href "../Risk/"



      EveryEvalAIResult : hasResources

      EveryEvalAIResult : hasSourceMetadata





        EveryEvalAIResult --> "0..1" SourceMetadata : hasSourceMetadata
        click SourceMetadata href "../SourceMetadata/"



      EveryEvalAIResult : hasTasks

      EveryEvalAIResult : id

      EveryEvalAIResult : isCategorizedAs





        EveryEvalAIResult --> "*" Any : isCategorizedAs
        click Any href "../Any/"



      EveryEvalAIResult : isResultOf





        EveryEvalAIResult --> "0..1" AiEval : isResultOf
        click AiEval href "../AiEval/"



      EveryEvalAIResult : name

      EveryEvalAIResult : narrow_mappings





        EveryEvalAIResult --> "*" Any : narrow_mappings
        click Any href "../Any/"



      EveryEvalAIResult : related_mappings





        EveryEvalAIResult --> "*" Any : related_mappings
        click Any href "../Any/"



      EveryEvalAIResult : retrieved_timestamp

      EveryEvalAIResult : schema_version

      EveryEvalAIResult : url

      EveryEvalAIResult : value

Inheritance

Entity
AiEvalResult [ Fact]
- EveryEvalAIResult

Class Properties

Property	Value
Class URI	nexus:everyevalairesult

Slots

Name	Cardinality and Range	Description	Inheritance
hasSourceMetadata	0..1 SourceMetadata	Source metadata for the evaluation	direct
hasModelInfo	0..1 ModelInfo	Model information for the evaluation	direct
hasEvaluationResults	* EvaluationResultRecord	Array of evaluation results	direct
hasDataType	* String	The type of data used in the benchmark (e	direct
hasDomains	* String	The specific domains or areas where the benchmark is applied (e	direct
hasLanguages	* String	The languages included in the dataset used by the benchmark (e	direct
hasTasks	* String	The tasks or evaluations the benchmark is intended to assess	direct
hasDataSource	* String	The origin or source of the data used in the benchmark (e	direct
hasDataSize	0..1 String	The size of the dataset, including the number of data points or examples	direct
hasDataFormat	* String	The structure and modality of the data (e	direct
hasMethods	* String	The evaluation techniques applied within the benchmark	direct
hasMetrics	* String	The specific performance metrics used to assess models (e	direct
hasLimitations	* String	Limitations in evaluating or addressing risks, such as gaps in demographic co...	direct
hasGoal	0..1 String	The specific goal or primary use case the benchmark is designed for	direct
hasAudience	* String	The intended audience, such as researchers, developers, policymakers, etc	direct
hasResources	* String	Links to relevant resources, such as repositories or papers related to the be...	direct
hasDocumentation	* Documentation	Indicates documentation associated with an entity	direct
hasRelatedRisk	* Term or Risk or RiskConcept	A relationship where an entity relates to a risk	direct
schema_version	0..1 String	Version of the evaluation schema	direct
evaluation_id	0..1 String	Unique identifier for this evaluation	direct
evaluation_timestamp	0..1 Datetime	ISO 8601 timestamp when evaluation was performed	direct
retrieved_timestamp	0..1 String	Unix timestamp when the data was retrieved	direct
isResultOf	0..1 AiEval	A relationship indicating that an entity is the result of an AI evaluation	AiEvalResult
value	1 String	Some numeric or string value	Fact
evidence	0..1 String	Evidence provides a source (typical a chunk, paragraph or link) describing wh...	Fact
id	1 String	A unique identifier to this instance of the model element	Entity
name	0..1 String	A text name of this instance	Entity
description	0..1 String	The description of an entity	Entity
url	0..1 Uri	An optional URL associated with this instance	Entity
dateCreated	0..1 Date	The date on which the entity was created	Entity
dateModified	0..1 Date	The date on which the entity was most recently modified	Entity
exact_mappings	* Any	The property is used to link two concepts, indicating a high degree of confid...	Entity
close_mappings	* Any	The property is used to link two concepts that are sufficiently similar that ...	Entity
related_mappings	* Any	The property skos:relatedMatch is used to state an associative mapping link b...	Entity
narrow_mappings	* Any	The property is used to state a hierarchical mapping link between two concept...	Entity
broad_mappings	* Any	The property is used to state a hierarchical mapping link between two concept...	Entity
isCategorizedAs	* Any	A relationship where an entity has been deemed to be categorized	Entity

Usages

used by	used in	type	used
EveryEvalAIResult	hasSourceMetadata	domain	EveryEvalAIResult
EveryEvalAIResult	hasModelInfo	domain	EveryEvalAIResult
EveryEvalAIResult	hasEvaluationResults	domain	EveryEvalAIResult

Identifier and Mapping Information

Schema Source

from schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology

Mappings

Mapping Type	Mapped Value
self	nexus:everyevalairesult
native	nexus:EveryEvalAIResult

LinkML Source

Direct

```yaml name: EveryEvalAIResult description: An evaluation result from the Every Eval Ever dataset, capturing evaluation metadata and results from the EEE_datastore. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology is_a: AiEvalResult slots: - hasSourceMetadata - hasModelInfo - hasEvaluationResults - hasDataType - hasDomains - hasLanguages - hasTasks - hasDataSource - hasDataSize - hasDataFormat - hasMethods - hasMetrics - hasLimitations - hasGoal - hasAudience - hasResources - hasDocumentation - hasRelatedRisk attributes: schema_version: name: schema_version description: Version of the evaluation schema from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai_eval rank: 1000 domain_of: - EveryEvalAIResult range: string evaluation_id: name: evaluation_id description: Unique identifier for this evaluation from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai_eval rank: 1000 domain_of: - EveryEvalAIResult range: string evaluation_timestamp: name: evaluation_timestamp description: ISO 8601 timestamp when evaluation was performed from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai_eval rank: 1000 domain_of: - EveryEvalAIResult range: datetime retrieved_timestamp: name: retrieved_timestamp description: Unix timestamp when the data was retrieved from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai_eval rank: 1000 domain_of: - EveryEvalAIResult range: string class_uri: nexus:everyevalairesult ````

Induced

```yaml name: EveryEvalAIResult description: An evaluation result from the Every Eval Ever dataset, capturing evaluation metadata and results from the EEE_datastore. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology is_a: AiEvalResult attributes: schema_version: name: schema_version description: Version of the evaluation schema from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai_eval rank: 1000 alias: schema_version owner: EveryEvalAIResult domain_of: - EveryEvalAIResult range: string evaluation_id: name: evaluation_id description: Unique identifier for this evaluation from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai_eval rank: 1000 alias: evaluation_id owner: EveryEvalAIResult domain_of: - EveryEvalAIResult range: string evaluation_timestamp: name: evaluation_timestamp description: ISO 8601 timestamp when evaluation was performed from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai_eval rank: 1000 alias: evaluation_timestamp owner: EveryEvalAIResult domain_of: - EveryEvalAIResult range: datetime retrieved_timestamp: name: retrieved_timestamp description: Unix timestamp when the data was retrieved from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai_eval rank: 1000 alias: retrieved_timestamp owner: EveryEvalAIResult domain_of: - EveryEvalAIResult range: string hasSourceMetadata: name: hasSourceMetadata description: Source metadata for the evaluation from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 domain: EveryEvalAIResult alias: hasSourceMetadata owner: EveryEvalAIResult domain_of: - EveryEvalAIResult range: SourceMetadata inlined: true hasModelInfo: name: hasModelInfo description: Model information for the evaluation from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 domain: EveryEvalAIResult alias: hasModelInfo owner: EveryEvalAIResult domain_of: - EveryEvalAIResult range: ModelInfo inlined: true hasEvaluationResults: name: hasEvaluationResults description: Array of evaluation results from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 domain: EveryEvalAIResult alias: hasEvaluationResults owner: EveryEvalAIResult domain_of: - EveryEvalAIResult range: EvaluationResultRecord multivalued: true inlined: true hasDataType: name: hasDataType description: The type of data used in the benchmark (e.g., text, images, or multi-modal) from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 alias: hasDataType owner: EveryEvalAIResult domain_of: - EveryEvalAIResult - BenchmarkMetadataCard range: string multivalued: true hasDomains: name: hasDomains description: The specific domains or areas where the benchmark is applied (e.g., natural language processing, computer vision). from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 alias: hasDomains owner: EveryEvalAIResult domain_of: - EveryEvalAIResult - BenchmarkMetadataCard range: string multivalued: true hasLanguages: name: hasLanguages description: The languages included in the dataset used by the benchmark (e.g., English, multilingual). from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 alias: hasLanguages owner: EveryEvalAIResult domain_of: - EveryEvalAIResult - BenchmarkMetadataCard range: string multivalued: true hasTasks: name: hasTasks description: The tasks or evaluations the benchmark is intended to assess. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 alias: hasTasks owner: EveryEvalAIResult domain_of: - AiEval - EveryEvalAIResult - BenchmarkMetadataCard range: string multivalued: true inlined: false hasDataSource: name: hasDataSource description: The origin or source of the data used in the benchmark (e.g., curated datasets, user submissions). from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 alias: hasDataSource owner: EveryEvalAIResult domain_of: - EveryEvalAIResult - BenchmarkMetadataCard range: string multivalued: true hasDataSize: name: hasDataSize description: The size of the dataset, including the number of data points or examples. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 alias: hasDataSize owner: EveryEvalAIResult domain_of: - EveryEvalAIResult - BenchmarkMetadataCard range: string hasDataFormat: name: hasDataFormat description: The structure and modality of the data (e.g., sentence pairs, question-answer format, tabular data). from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 alias: hasDataFormat owner: EveryEvalAIResult domain_of: - EveryEvalAIResult - BenchmarkMetadataCard range: string multivalued: true hasMethods: name: hasMethods description: The evaluation techniques applied within the benchmark. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 alias: hasMethods owner: EveryEvalAIResult domain_of: - EveryEvalAIResult - BenchmarkMetadataCard range: string multivalued: true hasMetrics: name: hasMetrics description: The specific performance metrics used to assess models (e.g., accuracy, F1 score, precision, recall). from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 alias: hasMetrics owner: EveryEvalAIResult domain_of: - EveryEvalAIResult - BenchmarkMetadataCard range: string multivalued: true hasLimitations: name: hasLimitations description: Limitations in evaluating or addressing risks, such as gaps in demographic coverage or specific domains. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 alias: hasLimitations owner: EveryEvalAIResult domain_of: - EveryEvalAIResult - BenchmarkMetadataCard range: string multivalued: true hasGoal: name: hasGoal description: The specific goal or primary use case the benchmark is designed for. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 alias: hasGoal owner: EveryEvalAIResult domain_of: - EveryEvalAIResult - BenchmarkMetadataCard range: string hasAudience: name: hasAudience description: The intended audience, such as researchers, developers, policymakers, etc. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 alias: hasAudience owner: EveryEvalAIResult domain_of: - EveryEvalAIResult - BenchmarkMetadataCard range: string multivalued: true hasResources: name: hasResources description: Links to relevant resources, such as repositories or papers related to the benchmark. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 alias: hasResources owner: EveryEvalAIResult domain_of: - EveryEvalAIResult - BenchmarkMetadataCard range: string multivalued: true hasDocumentation: name: hasDocumentation description: Indicates documentation associated with an entity. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 slot_uri: airo:hasDocumentation alias: hasDocumentation owner: EveryEvalAIResult domain_of: - Dataset - Vocabulary - Taxonomy - Concept - Group - Entry - Term - Principle - RiskTaxonomy - RiskControlGroupTaxonomy - Action - BaseAi - LargeLanguageModelFamily - AiEval - EveryEvalAIResult - BenchmarkMetadataCard - Adapter - LLMIntrinsic range: Documentation multivalued: true inlined: false hasRelatedRisk: name: hasRelatedRisk description: A relationship where an entity relates to a risk from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 domain: Any alias: hasRelatedRisk owner: EveryEvalAIResult domain_of: - Term - LLMQuestionPolicy - Action - AiSystem - AiEval - EveryEvalAIResult - BenchmarkMetadataCard - Adapter - LLMIntrinsic range: Risk multivalued: true inlined: false any_of: - range: RiskConcept - range: Term isResultOf: name: isResultOf description: A relationship indicating that an entity is the result of an AI evaluation. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 slot_uri: dqv:isMeasurementOf alias: isResultOf owner: EveryEvalAIResult domain_of: - AiEvalResult range: AiEval multivalued: false inlined: false value: name: value description: Some numeric or string value from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 alias: value owner: EveryEvalAIResult domain_of: - Fact range: string required: true evidence: name: evidence description: Evidence provides a source (typical a chunk, paragraph or link) describing where some value was found or how it was generated. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 alias: evidence owner: EveryEvalAIResult domain_of: - Fact range: string id: name: id description: A unique identifier to this instance of the model element. Example identifiers include UUID, URI, URN, etc. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 slot_uri: schema:identifier identifier: true alias: id owner: EveryEvalAIResult domain_of: - Entity range: string required: true name: name: name description: A text name of this instance. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 slot_uri: schema:name alias: name owner: EveryEvalAIResult domain_of: - Entity - BenchmarkMetadataCard range: string description: name: description description: The description of an entity from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 slot_uri: schema:description alias: description owner: EveryEvalAIResult domain_of: - Entity range: string url: name: url description: An optional URL associated with this instance. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 slot_uri: schema:url alias: url owner: EveryEvalAIResult domain_of: - Entity range: uri dateCreated: name: dateCreated description: The date on which the entity was created. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 slot_uri: schema:dateCreated alias: dateCreated owner: EveryEvalAIResult domain_of: - Entity range: date required: false dateModified: name: dateModified description: The date on which the entity was most recently modified. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 slot_uri: schema:dateModified alias: dateModified owner: EveryEvalAIResult domain_of: - Entity range: date required: false exact_mappings: name: exact_mappings description: The property is used to link two concepts, indicating a high degree of confidence that the concepts can be used interchangeably across a wide range of information retrieval applications from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 slot_uri: skos:exactMatch alias: exact_mappings owner: EveryEvalAIResult domain_of: - Entity range: Any multivalued: true inlined: false close_mappings: name: close_mappings description: The property is used to link two concepts that are sufficiently similar that they can be used interchangeably in some information retrieval applications. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 slot_uri: skos:closeMatch alias: close_mappings owner: EveryEvalAIResult domain_of: - Entity range: Any multivalued: true inlined: false related_mappings: name: related_mappings description: The property skos:relatedMatch is used to state an associative mapping link between two concepts. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 slot_uri: skos:relatedMatch alias: related_mappings owner: EveryEvalAIResult domain_of: - Entity range: Any multivalued: true inlined: false narrow_mappings: name: narrow_mappings description: The property is used to state a hierarchical mapping link between two concepts, indicating that the concept linked to, is a narrower concept than the originating concept. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 slot_uri: skos:narrowMatch alias: narrow_mappings owner: EveryEvalAIResult domain_of: - Entity range: Any multivalued: true inlined: false broad_mappings: name: broad_mappings description: The property is used to state a hierarchical mapping link between two concepts, indicating that the concept linked to, is a broader concept than the originating concept. from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 slot_uri: skos:broadMatch alias: broad_mappings owner: EveryEvalAIResult domain_of: - Entity range: Any multivalued: true inlined: false isCategorizedAs: name: isCategorizedAs description: A relationship where an entity has been deemed to be categorized from_schema: https://ibm.github.io/ai-atlas-nexus/ontology/ai-risk-ontology rank: 1000 slot_uri: nexus:isCategorizedAs alias: isCategorizedAs owner: EveryEvalAIResult domain_of: - Entity range: Any multivalued: true inlined: false class_uri: nexus:everyevalairesult ````