闪电代写 -代写CS作业_CS代写_Finance代写_Economic代写_Statistics代写_代码代做_IT代写_加急帮助

Hello, dear friend, you can consult us at any time if you have any questions, add WeChat: daixieit

CSE 3341 Project 3 - CORE Interpreter

Overview

The goal of this project is to build an interpreter for the Core language discussed in class.

Your submission should compile and run in the standard environment on stdlinux. If you work in some other environment, it is your responsibility to port your code to stdlinux and make sure it works there.

How you interpreter should represent variables/memory, and the semantics of the Input and Output statements are given in the sections below. Other semantic details of this language should be for the most part obvious; if any aspects of the language are not, please bring it up on Piazza for discussion. Building from the project 2 parser, you need to write an executor for the language described. Every valid input program for this language should be executed correctly, and every invalid input program for this language should be rejected with an error message.

The interpreter should have three main components: a lexical analyzer (a.k.a scanner from project 1), a syntax analyzer (a.k.a. parser from project 2), and an executor. The executor should be written using the recursive descent approach discussed in class, similar to how the parser/printer in project 2 worked.

Main

Your project should have a main procedure in a ﬁle called main, which does the following in order:

1. Initializes the scanner.

2. Parses the input program.

3. Execute the input program.

You can add extra steps here if you like, for example you may choose to add some extra checks or modify the parse tree after parsing and before executing. You do not need to print the program or run your semantic checks from project 2.

The Executor

The goal of this project is to use recursive descent to walk over the parse tree built by your parser, and “execute” the Core program by performing the actions described. Essentially this means that for each “parse” function, you will need to make an “execute” function, which will execute its children and perform any action needed on the result of that execution.

For example, for your Procedure struct you will want execute function that is something like this:

v o i d e x e c u t e P r o c e d u r e ( s t r u c t nodeProcedure * p ) {

// Do something t o i n i t i a l i z e memory

// Execute t h e d e c l a r a t i o n se q u e nc e ,

// i . e . c r e a t e s p a c e i n memory f o r t h e v a r i a b l e s

e x e c u t e D e c l S e q ( p >ds ) ;

// Execute t h e s t a t e m e n t s e q u e n c e

executeStmtSeq ( p—>s s ) ;

}

Many of the execute functions will be this simple. Others will require something more complicated and you may decide to pass in values or have values returned. For example, for Expr, Term, Factor, and similar things it probably makes sense to have a return value so the execute function can look something like this:

i n t e x e c u t e E x p r ( s t r u c t nodeExpr * expr ) {

i n t v a l u e = executeTerm ( expr —>term ) ;

i f ( o p t i o n ==1) {

v a l u e = v a l u e + e x e c u t e E x p r ( expr >expr ) ;

} e l s e i f ( o p t i o n ==2) {

v a l u e = v a l u e e x e c u t e E x p r ( expr >expr ) ;

}

r e t u r n v a l u e ;

}

Memory and Variables

Your interpreter needs to simulate memory for our variables. Conceptually, the Core lan- guage has 2 “regions” of memory: Local and Heap. Our language will not support a global memory

1. Essentially we need a map, from identiﬁer to value. My suggestion is using arrays for this. Recall that you can assume there are at most 20 variables in the program.

2. We can use the C heap as our simulated heap.

I suggest putting these data structures in their own ﬁle, along with any helper functions you create to interact with them. You do not need to worry about freeing this memory! I want us to have memory leaks here, so we can implement a garbage collector in project 5.

With memory structured in this way, we have the following semantics for how variables are used:

1. An integer variable uses the value model of variables, and has similar semantics to the “int” type in C.

(a) When an integer variable is declared, it has initial value 0.

2. A record variable uses the reference model of variables, and has similar semantics as an “array of ints” in C.

(a) When a record variable is declared, it is initially a null reference.

(b) The “id[<expr>]” notation is how we access diﬀerent positions within the record,

starting from index 0.

(c) The “id = new record[<expr>]” type assignment is how we create a record, with <expr> positions. This should correspond to an array with indexes from 0 to the value of <expr>-1.

(d) When a record variable is used in a factor without “[<expr>]”, we treat it as a shorthand for the index 0.

(e) When a record variable appears on the left hand side of an “id := <expr>” type

assignment without “[<expr>]”, we treat it as a shorthand for the index 0.

(f) The “id := record id” type assignment should result in both ids having the same

reference value as the id on the right hand side.

Input To Your Interpreter, The In Function in Core

The input to the interpreter will come from two ASCII text ﬁles. The names of these ﬁles will be given as command line arguments to the interpreter.

The ﬁrst ﬁle will be a .code ﬁle that contains the program to be executed.

The second ﬁle will be a .data ﬁle that contains integer constants separated by spaces. Creating another instance of your scanner should make getting these constants easy.

During execution, each Core in() function will read the next data value from the second ﬁle each time it is executed, to simulate user input.

Output From Your Interpreter, The Out Statement in Core

All output should go to stdout. This includes error messages - do not print to stderr.

The scanner and parser should only produce output in the case of an error.

For the executor, each Core out statement should print on a new line the integer value of the expression, without any spaces/tabs before or after it. Other than that, the executor should only have output if there is an error. The output for error cases is described below.

Invalid Input

We will not be rechecking the error handling of your scanner and parser, you can focus on just catching runtime errors in your executor.

There are just a few kinds of runtime errors possible in the current version of Core:

1. An error is possible with the input function. If an input function is executing but all values in the .data ﬁle have already been used then your executor should print a meaningful error message and halt execution.

2. An attempted division by 0 should result in an error.

3. Errors around the use of records:

(a) Accessing any index of a null array should result in an error. (b) Accessing outside of the valid indexes should result in an error.

Testing Your Project

I will provide some test cases. The test cases I will provide are rather weak. You should do additional testing testing with your own cases. For each test case I provide there are three ﬁles: a .code ﬁle, a .data ﬁle, and a .expected ﬁle.

I will provide a “tester.sh” script that works similar to how the script from project 2 worked, and it will use the diﬀ command to compare your output to the expected output. Your output should exactly match what is in the .expected ﬁles.

Suggestions/Notes

There are many ways to approach this project. Here are some suggestions:

❼ Just like I suggested for the parser, take the executor in stages. Implement just enough

to execute a very simple program, then test, then implement more, ect.

❼ Before starting, spend some time thinking about how to to keep track of the value of

each variable (symbol table)? This will be the diﬃcult part of this project.

❼ While constants in Core are integers 0-1009, variables in Core can store positive and

negative integer values. You do not need to worry about the range or overﬂow here, just represent them using the built in int representation for C.

❼ Post questions on piazza, and read the questions other students post. You may ﬁnd

details you missed on your own. You are encouraged to share test cases with the class on piazza.

❼ Remember that the grammar enforces right-associativity. In Core we have 1−2−3 = 2,

not −4.

Project Submission

On or before 11:59 pm March 3rd, you should submit the following:

❼ Your complete source code.

❼ An ASCII text ﬁle named README.txt that contains:

– Your name on top

– The names of all the other ﬁles you are submitting and a brief description of each stating what the ﬁle contains

– Any special features or comments on your project

– A description of the overall design of the interpreter, in particular how you are handling variables w.r.t. tracking their values and hiding variables when entering a new scope

– A brief description of how you tested the interpreter and a list of known remaining bugs (if any)

Submit your project as a single zipped ﬁle to the Carmen dropbox for Project 3.

If the time stamp on your submission is 12:00 am on March 4th or later, you will receive a 10% reduction per day, for up to three days. If your submission is more than 3 days late, it will not be accepted and you will receive zero points for this project. If you resubmit your project, only the latest submission will be considered.

Grading

The project is worth 100 points. Correct functioning of the interpreter is worth 75 points. The handling of error conditions is worth 10 points. The implementation style and docu- mentation are worth 15 points.

Academic Integrity

The project you submit must be entirely your own work. Minor consultations with others in the class are OK, but they should be at a very high level, without any speciﬁc details. The work on the project should be entirely your own; all the design, programming, testing, and debugging should be done only by you, independently and from scratch. Sharing your code or documentation with others is not acceptable. Submissions that show excessive similarities (for code or documentation) will be taken as evidence of cheating and dealt with accordingly; this includes any similarities with projects submitted in previous instances of this course.

Academic misconduct is an extremely serious oﬀense with severe consequences. Addi- tional details on academic integrity are available from the Committee on Academic Mis- conduct (see http://oaa.osu.edu/coamresources.html). If you have any questions about uni- versity policies or what constitutes academic misconduct in this course, please contact me immediately.

Please note this is a language like C or Java where whitespaces have no meaning, and whitespace can be inserted between keywords, identifiers, constants, and specials to accommodate programmer style. This

grammar does not include formal rules about whitespace because that would add immense clutter. Recall epsilon is the empty string.

<procedure> ::= procedure ID is <decl-seq> begin <stmt-seq> end

<decl-seq> ::= <decl > | <decl><decl-seq>

<stmt-seq> ::= <stmt> | <stmt><stmt-seq>

<decl-integer> ::= integer ID ;

<decl-record> ::= record ID ;

<assign> ::= id <index> := <expr> ; | id := new record [ <expr> ]; | id := record id;