Package org.codehaus.groovy.antlr.java
Class JavaRecognizer
java.lang.Object
antlr.Parser
antlr.LLkParser
org.codehaus.groovy.antlr.java.JavaRecognizer
- All Implemented Interfaces:
JavaTokenTypes
public class JavaRecognizer extends LLkParser implements JavaTokenTypes
Java 1.5 Recognizer
Run 'java Main [-showtree] directory-full-of-java-files'
[The -showtree option pops up a Swing frame that shows
the AST constructed from the parser.]
Run 'java Main '
Contributing authors:
Jeremy Rayner groovy@ross-rayner.com
John Mitchell johnm@non.net
Terence Parr parrt@magelang.com
John Lilley jlilley@empathy.com
Scott Stanchfield thetick@magelang.com
Markus Mohnen mohnen@informatik.rwth-aachen.de
Peter Williams pete.williams@sun.com
Allan Jacobs Allan.Jacobs@eng.sun.com
Steve Messick messick@redhills.com
John Pybus john@pybus.org
Version 1.00 December 9, 1997 -- initial release
Version 1.01 December 10, 1997
fixed bug in octal def (0..7 not 0..8)
Version 1.10 August 1998 (parrt)
added tree construction
fixed definition of WS,comments for mac,pc,unix newlines
added unary plus
Version 1.11 (Nov 20, 1998)
Added "shutup" option to turn off last ambig warning.
Fixed inner class def to allow named class defs as statements
synchronized requires compound not simple statement
add [] after builtInType DOT class in primaryExpression
"const" is reserved but not valid..removed from modifiers
Version 1.12 (Feb 2, 1999)
Changed LITERAL_xxx to xxx in tree grammar.
Updated java.g to use tokens {...} now for 2.6.0 (new feature).
Version 1.13 (Apr 23, 1999)
Didn't have (stat)? for else clause in tree parser.
Didn't gen ASTs for interface extends. Updated tree parser too.
Updated to 2.6.0.
Version 1.14 (Jun 20, 1999)
Allowed final/abstract on local classes.
Removed local interfaces from methods
Put instanceof precedence where it belongs...in relationalExpr
It also had expr not type as arg; fixed it.
Missing ! on SEMI in classBlock
fixed: (expr) + "string" was parsed incorrectly (+ as unary plus).
fixed: didn't like Object[].class in parser or tree parser
Version 1.15 (Jun 26, 1999)
Screwed up rule with instanceof in it. :( Fixed.
Tree parser didn't like (expr).something; fixed.
Allowed multiple inheritance in tree grammar. oops.
Version 1.16 (August 22, 1999)
Extending an interface built a wacky tree: had extra EXTENDS.
Tree grammar didn't allow multiple superinterfaces.
Tree grammar didn't allow empty var initializer: {}
Version 1.17 (October 12, 1999)
ESC lexer rule allowed 399 max not 377 max.
java.tree.g didn't handle the expression of synchronized
statements.
Version 1.18 (August 12, 2001)
Terence updated to Java 2 Version 1.3 by
observing/combining work of Allan Jacobs and Steve
Messick. Handles 1.3 src. Summary:
o primary didn't include boolean.class kind of thing
o constructor calls parsed explicitly now:
see explicitConstructorInvocation
o add strictfp modifier
o missing objBlock after new expression in tree grammar
o merged local class definition alternatives, moved after declaration
o fixed problem with ClassName.super.field
o reordered some alternatives to make things more efficient
o long and double constants were not differentiated from int/float
o whitespace rule was inefficient: matched only one char
o add an examples directory with some nasty 1.3 cases
o made Main.java use buffered IO and a Reader for Unicode support
o supports UNICODE?
Using Unicode charVocabulay makes code file big, but only
in the bitsets at the end. I need to make ANTLR generate
unicode bitsets more efficiently.
Version 1.19 (April 25, 2002)
Terence added in nice fixes by John Pybus concerning floating
constants and problems with super() calls. John did a nice
reorg of the primary/postfix expression stuff to read better
and makes f.g.super() parse properly (it was METHOD_CALL not
a SUPER_CTOR_CALL). Also:
o "finally" clause was a root...made it a child of "try"
o Added stuff for asserts too for Java 1.4, but *commented out*
as it is not backward compatible.
Version 1.20 (October 27, 2002)
Terence ended up reorging John Pybus' stuff to
remove some nondeterminisms and some syntactic predicates.
Note that the grammar is stricter now; e.g., this(...) must
be the first statement.
Ternary ?: operator wasn't working as array name:
(isBig ? bigDigits : digits)[i];
Checked parser/tree parser on source for
Resin-2.0.5, jive-2.1.1, jdk 1.3.1, Lucene, antlr 2.7.2a4,
and the 110k-line jGuru server source.
Version 1.21 (October 17, 2003)
Fixed lots of problems including:
Ray Waldin: add typeDefinition to interfaceBlock in java.tree.g
He found a problem/fix with floating point that start with 0
Ray also fixed problem that (int.class) was not recognized.
Thorsten van Ellen noticed that \n are allowed incorrectly in strings.
TJP fixed CHAR_LITERAL analogously.
Version 1.21.2 (March, 2003)
Changes by Matt Quail to support generics (as per JDK1.5/JSR14)
Notes:
o We only allow the "extends" keyword and not the "implements"
keyword, since that's what JSR14 seems to imply.
o Thanks to Monty Zukowski for his help on the antlr-interest
mail list.
o Thanks to Alan Eliasen for testing the grammar over his
Fink source base
Version 1.22 (July, 2004)
Changes by Michael Studman to support Java 1.5 language extensions
Notes:
o Added support for annotations types
o Finished off Matt Quail's generics enhancements to support bound type arguments
o Added support for new for statement syntax
o Added support for static import syntax
o Added support for enum types
o Tested against JDK 1.5 source base and source base of jdigraph project
o Thanks to Matt Quail for doing the hard part by doing most of the generics work
Version 1.22.1 (July 28, 2004)
Bug/omission fixes for Java 1.5 language support
o Fixed tree structure bug with classOrInterface - thanks to Pieter Vangorpto for
spotting this
o Fixed bug where incorrect handling of SR and BSR tokens would cause type
parameters to be recognised as type arguments.
o Enabled type parameters on constructors, annotations on enum constants
and package definitions
o Fixed problems when parsing if ((char.class.equals(c))) {} - solution by Matt Quail at Cenqua
Version 1.22.2 (July 28, 2004)
Slight refactoring of Java 1.5 language support
o Refactored for/"foreach" productions so that original literal "for" literal
is still used but the for sub-clauses vary by token type
o Fixed bug where type parameter was not included in generic constructor's branch of AST
Version 1.22.3 (August 26, 2004)
Bug fixes as identified by Michael Stahl; clean up of tabs/spaces
and other refactorings
o Fixed typeParameters omission in identPrimary and newStatement
o Replaced GT reconciliation code with simple semantic predicate
o Adapted enum/assert keyword checking support from Michael Stahl's java15 grammar
o Refactored typeDefinition production and field productions to reduce duplication
Version 1.22.4 (October 21, 2004)
Small bux fixes
o Added typeArguments to explicitConstructorInvocation, e.g. new MyParameterised()
o Added typeArguments to postfixExpression productions for anonymous inner class super
constructor invocation, e.g. new Outer().super()
o Fixed bug in array declarations identified by Geoff Roy
Version 1.22.4.j.1
Changes by Jeremy Rayner to support java2groovy tool
o I have taken java.g for Java1.5 from Michael Studman (1.22.4)
and have made some changes to enable use by java2groovy tool (Jan 2007)
Based on an original grammar released in the PUBLIC DOMAIN
-
Field Summary
Fields inherited from class antlr.Parser
astFactory, inputState, returnAST, tokenNames, tokenTypeToASTClassMap, traceDepth
Fields inherited from interface org.codehaus.groovy.antlr.java.JavaTokenTypes
ABSTRACT, ANNOTATION, ANNOTATION_ARRAY_INIT, ANNOTATION_DEF, ANNOTATION_FIELD_DEF, ANNOTATION_MEMBER_VALUE_PAIR, ANNOTATIONS, ARRAY_DECLARATOR, ARRAY_INIT, ASSIGN, AT, BAND, BAND_ASSIGN, BLOCK, BNOT, BOR, BOR_ASSIGN, BSR, BSR_ASSIGN, BXOR, BXOR_ASSIGN, CASE_GROUP, CHAR_LITERAL, CLASS_DEF, COLON, COMMA, CTOR_CALL, DEC, DIGIT, DIGITS_WITH_UNDERSCORE, DIGITS_WITH_UNDERSCORE_OPT, DIV, DIV_ASSIGN, DOT, ELIST, EMPTY_STAT, ENUM_CONSTANT_DEF, ENUM_DEF, EOF, EQUAL, ESC, EXPONENT, EXPR, EXTENDS_CLAUSE, FINAL, FLOAT_SUFFIX, FOR_CONDITION, FOR_EACH_CLAUSE, FOR_INIT, FOR_ITERATOR, GE, GT, HEX_DIGIT, IDENT, IMPLEMENTS_CLAUSE, IMPORT, INC, INDEX_OP, INSTANCE_INIT, INTERFACE_DEF, LABELED_STAT, LAND, LBRACK, LCURLY, LE, LITERAL_assert, LITERAL_boolean, LITERAL_break, LITERAL_byte, LITERAL_case, LITERAL_catch, LITERAL_char, LITERAL_class, LITERAL_continue, LITERAL_default, LITERAL_do, LITERAL_double, LITERAL_else, LITERAL_enum, LITERAL_extends, LITERAL_false, LITERAL_finally, LITERAL_float, LITERAL_for, LITERAL_if, LITERAL_implements, LITERAL_import, LITERAL_instanceof, LITERAL_int, LITERAL_interface, LITERAL_long, LITERAL_native, LITERAL_new, LITERAL_null, LITERAL_package, LITERAL_private, LITERAL_protected, LITERAL_public, LITERAL_return, LITERAL_short, LITERAL_static, LITERAL_super, LITERAL_switch, LITERAL_synchronized, LITERAL_this, LITERAL_threadsafe, LITERAL_throw, LITERAL_throws, LITERAL_transient, LITERAL_true, LITERAL_try, LITERAL_void, LITERAL_volatile, LITERAL_while, LNOT, LOR, LPAREN, LT, METHOD_CALL, METHOD_DEF, MINUS, MINUS_ASSIGN, ML_COMMENT, MOD, MOD_ASSIGN, MODIFIERS, NOT_EQUAL, NULL_TREE_LOOKAHEAD, NUM_DOUBLE, NUM_FLOAT, NUM_INT, NUM_LONG, OBJBLOCK, PACKAGE_DEF, PARAMETER_DEF, PARAMETERS, PLUS, PLUS_ASSIGN, POST_DEC, POST_INC, QUESTION, RBRACK, RCURLY, RPAREN, SEMI, SL, SL_ASSIGN, SL_COMMENT, SLIST, SR, SR_ASSIGN, STAR, STAR_ASSIGN, STATIC_IMPORT, STATIC_INIT, STRICTFP, STRING_LITERAL, SUPER_CTOR_CALL, TRIPLE_DOT, TYPE, TYPE_ARGUMENT, TYPE_ARGUMENTS, TYPE_LOWER_BOUNDS, TYPE_PARAMETER, TYPE_PARAMETERS, TYPE_UPPER_BOUNDS, TYPECAST, UNARY_MINUS, UNARY_PLUS, VARIABLE_DEF, VARIABLE_PARAMETER_DEF, VOCAB, WILDCARD_TYPE, WS
-
Constructor Summary
Constructors Modifier Constructor Description JavaRecognizer(ParserSharedInputState state)
JavaRecognizer(TokenBuffer tokenBuf)
protected
JavaRecognizer(TokenBuffer tokenBuf, int k)
JavaRecognizer(TokenStream lexer)
protected
JavaRecognizer(TokenStream lexer, int k)
-
Method Summary
Methods inherited from class antlr.Parser
addMessageListener, addParserListener, addParserMatchListener, addParserTokenListener, addSemanticPredicateListener, addSyntacticPredicateListener, addTraceListener, consumeUntil, consumeUntil, defaultDebuggingSetup, getAST, getASTFactory, getFilename, getInputState, getTokenName, getTokenNames, getTokenTypeToASTClassMap, isDebugMode, mark, match, match, matchNot, panic, recover, removeMessageListener, removeParserListener, removeParserMatchListener, removeParserTokenListener, removeSemanticPredicateListener, removeSyntacticPredicateListener, removeTraceListener, reportError, reportError, reportWarning, rewind, setASTFactory, setASTNodeClass, setASTNodeType, setDebugMode, setIgnoreInvalidDebugCalls, setInputState, setTokenBuffer, traceIndent
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
-
Field Details
-
_tokenNames
public static final java.lang.String[] _tokenNames -
_tokenSet_0
-
_tokenSet_1
-
_tokenSet_2
-
_tokenSet_3
-
_tokenSet_4
-
_tokenSet_5
-
_tokenSet_6
-
_tokenSet_7
-
_tokenSet_8
-
_tokenSet_9
-
_tokenSet_10
-
_tokenSet_11
-
_tokenSet_12
-
_tokenSet_13
-
_tokenSet_14
-
_tokenSet_15
-
_tokenSet_16
-
_tokenSet_17
-
_tokenSet_18
-
_tokenSet_19
-
_tokenSet_20
-
_tokenSet_21
-
_tokenSet_22
-
_tokenSet_23
-
_tokenSet_24
-
_tokenSet_25
-
_tokenSet_26
-
_tokenSet_27
-
_tokenSet_28
-
_tokenSet_29
-
_tokenSet_30
-
_tokenSet_31
-
_tokenSet_32
-
_tokenSet_33
-
_tokenSet_34
-
_tokenSet_35
-
_tokenSet_36
-
_tokenSet_37
-
_tokenSet_38
-
_tokenSet_39
-
_tokenSet_40
-
_tokenSet_41
-
_tokenSet_42
-
_tokenSet_43
-
_tokenSet_44
-
_tokenSet_45
-
_tokenSet_46
-
_tokenSet_47
-
_tokenSet_48
-
_tokenSet_49
-
_tokenSet_50
-
_tokenSet_51
-
-
Constructor Details
-
JavaRecognizer
-
JavaRecognizer
-
JavaRecognizer
-
JavaRecognizer
-
-
Method Details
-
make
This factory is the correct way to wire together a Groovy parser and lexer. -
make
-
make
-
make
-
getLexer
-
setFilename
public void setFilename(java.lang.String f)- Overrides:
setFilename
in classParser
-
setSourceBuffer
-
create
Create an AST node with the token type and text passed in, but with the same background information as another supplied Token (e.g. line numbers) to be used in place of antlr tree construction syntax, i.e. #[TOKEN,"text"] becomes create(TOKEN,"text",anotherToken) todo - change antlr.ASTFactory to do this instead... -
compilationUnit
-
annotations
-
packageDefinition
-
importDefinition
-
typeDefinition
-
identifier
-
identifierStar
-
modifiers
-
typeDefinitionInternal
protected final void typeDefinitionInternal(AST mods) throws RecognitionException, TokenStreamException -
classDefinition
-
interfaceDefinition
public final void interfaceDefinition(AST modifiers) throws RecognitionException, TokenStreamException -
enumDefinition
-
annotationDefinition
public final void annotationDefinition(AST modifiers) throws RecognitionException, TokenStreamException -
declaration
-
typeSpec
-
variableDefinitions
public final void variableDefinitions(AST mods, AST t) throws RecognitionException, TokenStreamException -
classTypeSpec
public final void classTypeSpec(boolean addImagNode) throws RecognitionException, TokenStreamException -
builtInTypeSpec
public final void builtInTypeSpec(boolean addImagNode) throws RecognitionException, TokenStreamException -
classOrInterfaceType
public final void classOrInterfaceType(boolean addImagNode) throws RecognitionException, TokenStreamException -
typeArgumentsOrDiamond
-
typeArgumentSpec
-
builtInTypeArraySpec
public final void builtInTypeArraySpec(boolean addImagNode) throws RecognitionException, TokenStreamException -
typeArgument
-
wildcardType
-
typeArgumentBounds
-
typeArguments
-
typeArgumentsOrParametersEnd
protected final void typeArgumentsOrParametersEnd() throws RecognitionException, TokenStreamException -
builtInType
-
type
-
modifier
-
annotation
-
annotationArguments
-
annotationMemberValueInitializer
public final void annotationMemberValueInitializer() throws RecognitionException, TokenStreamException -
annotationMemberValuePairs
-
annotationMemberValuePair
-
conditionalExpression
-
annotationMemberArrayInitializer
public final void annotationMemberArrayInitializer() throws RecognitionException, TokenStreamException -
annotationMemberArrayValueInitializer
public final void annotationMemberArrayValueInitializer() throws RecognitionException, TokenStreamException -
superClassClause
-
typeParameters
-
implementsClause
-
classBlock
-
interfaceExtends
-
interfaceBlock
-
enumBlock
-
annotationBlock
-
typeParameter
-
typeParameterBounds
-
classField
-
interfaceField
-
annotationField
-
enumConstant
-
declaratorBrackets
-
argList
-
enumConstantBlock
-
enumConstantField
-
parameterDeclarationList
-
throwsClause
-
compoundStatement
-
ctorHead
-
constructorBody
-
explicitConstructorInvocation
Catch obvious constructor calls, but not the expr.super(...) calls -
statement
-
variableDeclarator
public final void variableDeclarator(AST mods, AST t) throws RecognitionException, TokenStreamExceptionDeclaration of a variable. This can be a class/instance variable, or a local variable in a method It can also include possible initialization. -
varInitializer
-
initializer
-
arrayInitializer
-
expression
-
parameterDeclaration
-
variableLengthParameterDeclaration
public final void variableLengthParameterDeclaration() throws RecognitionException, TokenStreamException -
parameterModifier
-
forStatement
-
casesGroup
-
tryBlock
-
forInit
-
traditionalForClause
-
forEachClause
-
forCond
-
forIter
-
aCase
-
caseSList
-
expressionList
-
resources
-
handler
-
finallyClause
-
resourceList
-
resource
-
multicatch
-
multicatch_types
-
assignmentExpression
-
logicalOrExpression
-
logicalAndExpression
-
inclusiveOrExpression
-
exclusiveOrExpression
-
andExpression
-
equalityExpression
-
relationalExpression
-
shiftExpression
-
additiveExpression
-
multiplicativeExpression
-
unaryExpression
-
unaryExpressionNotPlusMinus
-
postfixExpression
-
-
newExpression
object instantiation. Trees are built as illustrated by the following input/tree pairs: new T() new | T -- ELIST | arg1 -- arg2 -- .. -- argn new int[] new | int -- ARRAY_DECLARATOR new int[] {1,2} new | int -- ARRAY_DECLARATOR -- ARRAY_INIT | EXPR -- EXPR | | 1 2 new int[3] new | int -- ARRAY_DECLARATOR | EXPR | 3 new int[1][2] new | int -- ARRAY_DECLARATOR | ARRAY_DECLARATOR -- EXPR | | EXPR 1 | 2 -
identPrimary
Match a, a.b.c refs, a.b.c(...) refs, a.b.c[], a.b.c[].class, and a.b.c.class refs. Also this(...) and super(...). Match this or super. -
constant
-
newArrayDeclarator
-
buildTokenTypeASTClassMap
protected void buildTokenTypeASTClassMap()
-