A dependently Typed Assembly Language (Joint work with Robert Harper)

A Dependently Typed Assembly Language

The general motivation

Advantages of type systems

The goal of this work

Array bounds checking problem

Byte copy: A version in C

Dynamic array bounds checking

Some experimental results

Static array bounds checking

Static array bounds checking

What are dependent types?

Byte copy: A version in de Caml

Array bounds checking in mobile code

Some key applications of DTAL

Increment: A flow chart

Increment: An assembly version

State types

Register file

Stack

Increment: A version in DTAL

Type index objects

Types

Instructions in DTAL

Programs in DTAL

Memory allocation

Memory allocation: an example

Array types are non-variant

State types are contra-variant

Typing unconditional jumps

Typing conditional jumps

Byte copy: A flow chart

Byte copy: A version in DTAL

Byte copy: a version in DTAL

Operational semantics of DTAL

Soundness

Related work

Current status & Future work

Conclusion

Dostları ilə paylaş:

A dependently Typed Assembly Language (Joint work with Robert Harper)

A Dependently Typed Assembly Language

(Joint work with Robert Harper)

The general motivation

Q: Why do we want to type low level

languages?

A: We want to reap the benefits of

type systems at low levels.

Advantages of type systems

Capturing program errors at compile-time (well-known)

Enabling aggressive compiler optimizations (recent)

Supporting sophisticated module systems (SML)

Facilitating program verification (NuPRL, Coq, PVS)

Serving as program documentation

The goal of this work

The goal is to capture memory safety of assembly code through a type system

Memory Safety =

Type Safety + Safe Array Access

Array bounds checking problem

Array bounds checking refers to determining whether the value of an expression is within the bounds of an array when the expression is used to index the array.

Byte copy: A version in C

void

bcopy(int src[], int dst[]) {

int i;

if (length(src) != length(dst) {

printf “bcopy: unequal lengths\n”;

exit(1);

}

for(i=1; i < length(src), i++)

dst[i] = src[i];

}

Dynamic array bounds checking

is required for safe languages such as Java, Modula-3, ML, Pascal

can be expensive in practice (e.g. numerical computation)

bounds violation is a rich source of program errors in unsafe languages such as C, C++ (e.g. off-by-one error)

Some experimental results

Static array bounds checking

Flow Analysis

Static array bounds checking

Type-based approaches

What are dependent types?

Dependent types depend on the values of language expressions.

For instance,

type : dependent type

array : array(x)

int : int(x)

stack : stack(x)

Byte copy: A version in de Caml

let bcopy src dst = begin

for i = 0 to vect_length(src) - 1 do

dst..(i) <- src..(i)

done

end

withtype {n:nat} int vect(n) ->

int vect(n) -> unit

Array bounds checking in mobile code

It needs to be enforced for safety concerns

It is difficult to eliminate since the machine which executes the code may not trust the source of the code

It is time-consuming to be compiled away

Some key applications of DTAL

Compiler verification

Mobile code security

Mobile code efficiency

Increment: A flow chart

Increment: An assembly version

inc:

pop r1

add r1, r1, 1

pop r2

push r1

jmp r2

State types

A state type corresponds to code continuation. It records the type information about register file and stack.

For instance,

[r1: int(i), r2: int array(i)]

(‘a)[r1: ‘a, r2: [r1: ‘a]]

(‘a,‘b)[r1: ‘a, r2: ‘b,

r3: [r1: ‘a, r2: ‘b]]

{n:nat}[sp: [sp: stack(n)] :: stack(n)]

Register file

index **i,j ::= a | c | i+j | i-j | i*j | i/j**

index sort **gamma ::= int | {a: gamma | P}**

**{a: int | a >= 0}**

**Note: int is for {a:int}.int(a)**

**nat is for {a:nat}.int(a)**