March 29, 2003

notes - a to do list

I have made the following observations, as a to do list for all. These are things we need to work on.

-directory structures for unix/windows/dos
it needs to create directories if needed
more bulletproof - can we find example of this kind of parser?

-integration of preprocessing after file read
need to do "gcc -C" preprocessing with leaving comments first
see: http://www.dis.com/gnu/gcc/gcc_14.html:

@gcctabopt{-C}
Do not discard comments. All comments are passed through to the output file, except for comments in processed directives, which are deleted along with the directive.
You should be prepared for side effects when using `-C'; it causes the preprocessor to treat comments as tokens in their own right. For example, comments appearing at the start of what would be a directive line have the effect of turning that line into an ordinary source line, since the first token on the line is no longer a `#'.

-parser quirks

bulletproof read from bracket start to bracket end
data structure creation (vector of vectors?)

-data structure stop word extraction
methods for addition and deletion

-integration!
it will take one week. This is for sure!

Right now, we have minimal file parsing, very minimal function break up, good file I/O, and wordnet functionality.

If you have any feedback, please post comments.

Posted by Charles Brian Quinn at March 29, 2003 04:15 PM
Comments
Post a comment