[Bug] Validate sparse tensors that use ORT's in-memory address markers #28617

### Describe the issue

ORT uses special markers on TensorProtos to indicate that an existing memory buffer contains the TensorProto external data:
https://github.com/microsoft/onnxruntime/blob/158bdef0183fa25f8ddf9416f964146f38cc60a9/onnxruntime/core/framework/tensorprotoutils.h#L225-L241

For dense TensorProtos, ORT already [validates](https://github.com/microsoft/onnxruntime/blob/ee444bdc3679b4002466a50643611fed4d9704cc/onnxruntime/core/graph/graph.cc#L3775) that such "in-memory" references point to valid memory.

However, for sparse tensors, ORT is missing this validation, which could trigger an invalid memory read. Specifically, [SparseTensorProtoToDenseTensorProto](https://github.com/microsoft/onnxruntime/blob/158bdef0183fa25f8ddf9416f964146f38cc60a9/onnxruntime/core/framework/tensorprotoutils.cc#L2392) passes these tensors to [UnpackInitializerData](https://github.com/microsoft/onnxruntime/blob/158bdef0183fa25f8ddf9416f964146f38cc60a9/onnxruntime/core/framework/tensorprotoutils.cc#L2672), which dereferences the supplied "in-memory" references without checks.
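
To make the missing check concrete, here is a minimal Python sketch of what validating a sparse tensor's component tensors could look like. It does not use ORT or ONNX APIs: `ExternalTensor`, `SparseTensor`, `validate_sparse_tensor`, and the `trusted_buffers` allow-list are hypothetical names invented for illustration, and the "address range must fall inside a buffer the host registered" policy is an assumption, not necessarily what ORT's dense-tensor validation does. Only the two marker strings come from `tensorprotoutils.h`.

```python
from dataclasses import dataclass, field

# Marker strings defined in ORT's tensorprotoutils.h.
MEM_ADDR_TAGS = ("/_ORT_MEM_ADDR_/", "/_ORT_NATIVE_ENDIAN_MEM_ADDR_/")


@dataclass
class ExternalTensor:
    """Hypothetical stand-in for the external-data fields of a TensorProto."""
    name: str
    # Key/value pairs such as {"location": ..., "offset": ..., "length": ...}.
    external_data: dict = field(default_factory=dict)


@dataclass
class SparseTensor:
    """Hypothetical stand-in for a SparseTensorProto (values + indices)."""
    values: ExternalTensor
    indices: ExternalTensor


def uses_memory_address_tag(tensor):
    """True if 'location' is one of the in-memory address markers."""
    return tensor.external_data.get("location") in MEM_ADDR_TAGS


def is_trusted_reference(tensor, trusted_buffers):
    """Assumed policy: [offset, offset + length) must lie inside a buffer
    that the host itself registered as (start_address, size)."""
    try:
        offset = int(tensor.external_data["offset"])
        length = int(tensor.external_data["length"])
    except (KeyError, ValueError):
        return False  # missing or malformed fields are never trusted
    if offset < 0 or length < 0:
        return False
    return any(
        start <= offset and offset + length <= start + size
        for start, size in trusted_buffers
    )


def validate_sparse_tensor(sparse, trusted_buffers):
    """Return the names of component tensors whose in-memory reference
    cannot be verified; an empty list means the sparse tensor passes."""
    return [
        t.name
        for t in (sparse.values, sparse.indices)
        if uses_memory_address_tag(t)
        and not is_trusted_reference(t, trusted_buffers)
    ]
```

The point of the sketch is that both the values tensor and the indices tensor need the check before anything like `UnpackInitializerData` dereferences the address, since either one can carry a marker.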

Note: this issue was discussed in a separate PR: https://github.com/microsoft/onnxruntime/pull/28408#discussion_r3261024343

### To reproduce

Create a model whose sparse tensors use an in-memory address marker pointing at an arbitrary memory address. The following example, added in the [PR](https://github.com/microsoft/onnxruntime/pull/26764) that introduced validation for dense tensors, can serve as a starting point but needs to be adapted for sparse tensors: https://github.com/microsoft/onnxruntime/blob/main/onnxruntime/test/testdata/test_evil_weights.py



### Urgency

_No response_

### Platform

Windows

### OS Version

Windows 11

### ONNX Runtime Installation

Built from Source

### ONNX Runtime Version or Commit ID

1.26.0

### ONNX Runtime API

Python

### Architecture

X64

### Execution Provider

Default CPU

### Execution Provider Library Version

_No response_

Snippet from `tensorprotoutils.h` (L225-L241), linked under "Describe the issue":

	/**
	Special marker used to indicate an existing memory buffer contains the TensorProto external data.
	If the 'location' field of the external data info is set to this marker, the 'offset' field should contain the
	address of the memory containing the data.

	This marker is used when data is always in little endian format.
	*/
	constexpr const ORTCHAR_T* kTensorProtoLittleEndianMemoryAddressTag = ORT_TSTR("/_ORT_MEM_ADDR_/");

	/**
	Special marker used to indicate an existing memory buffer contains the TensorProto external data.
	If the 'location' field of the external data info is set to this marker, the 'offset' field should contain the
	address of the memory containing the data.

	This marker is used when data is in native endian format, i.e. big endian on big endian systems.
	*/
	constexpr const ORTCHAR_T* kTensorProtoNativeEndianMemoryAddressTag = ORT_TSTR("/_ORT_NATIVE_ENDIAN_MEM_ADDR_/");
