implement C++11/C23 [[attributes]] #2840

mmomtchev · 2024-03-19T17:05:23Z

This PR adds support for C++11/C23 [[attributes]] and allows them to be matched with the existing match$ in %rename.

The C++ specification is deliberately very vague about where these attributes can be placed. I have tried to support all the cases that I have seen. All types of attributes are parsed and attached to the AST, but only declaration attributes for variables and functions are available to the end user at the moment.
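To give an idea of how many positions are involved, here is a small sketch of placements that standard C++17 accepts. It is only an illustration: the names (new_api, old_api, doubled, spare, LegacyPoint, Color, classify) are made up for this example and are not taken from the test suite, and the list is not exhaustive.

```cpp
#include <cassert>

// Attribute before the declaration: applies to the declared function.
[[nodiscard]] inline int new_api() { return 42; }

// Attribute on a function that is only declared (never called here).
[[deprecated("use new_api() instead")]] int old_api();

// Attribute on a single function parameter.
inline int doubled(int value, [[maybe_unused]] int hint) { return value * 2; }

// Attribute after the declarator-id of a variable.
static const int spare [[maybe_unused]] = 7;

// Attribute between the class-key and the class name.
struct [[deprecated]] LegacyPoint { int x; };

// Attribute on an enumerator (C++17).
enum Color { Red, Green [[deprecated]] };

// Attribute on a statement: the empty statement before the next case label.
inline int classify(int n) {
  switch (n) {
    case 0:
      [[fallthrough]];
    case 1:
      return 1;
    default:
      return 0;
  }
}
```

Each of these positions is a separate place where a parser has to expect an attribute, which is why missing one of them is a realistic risk.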

To safeguard against unexpected placement of attributes, -ignoreattrs reverts to the old behavior of ignoring all attributes directly in the tokenizer.

This PR also switches most of the larger bison code blocks to use named references, in order to make further modifications, and especially experimenting with the grammar, easier.

The new main grammar item, attribute, contains %empty and always has to precede storage_class, which also has %empty.

The only exception is function argument attributes, which use an enumeration without %empty in order to avoid shift/reduce conflicts with template parameters, which use the same items and do not support attributes.

The only other behavior change is the %rename directive where

$match$attribute$deprecated

without =value matches every occurrence of deprecated, which can be [[deprecated]] but also [[deprecated("reason")]].
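The matching rule described above can be modelled with a few lines of C++. This is a sketch of the semantics only, not the code in this PR: Attribute, AttributeRule and rule_matches are hypothetical names, and comparing the value against the raw argument text is an assumption made for the illustration.

```cpp
#include <cassert>
#include <string>

// Hypothetical model of one parsed attribute: a name plus the raw text of its
// argument list (empty when the attribute has no arguments).
struct Attribute {
  std::string name;
  std::string args;
};

// Hypothetical model of a match rule: has_value is false when the rule was
// written without "=value".
struct AttributeRule {
  std::string name;
  bool has_value;
  std::string value;
};

// Returns true when the rule selects the given attribute.
bool rule_matches(const AttributeRule &rule, const Attribute &attr) {
  if (rule.name != attr.name) return false;
  // Without "=value" the presence of the attribute is enough.
  if (!rule.has_value) return true;
  // With "=value" the argument text has to be identical (assumption).
  return rule.value == attr.args;
}
```

Under this model a rule for deprecated without a value selects both [[deprecated]] and [[deprecated("reason")]], while a rule with a value selects only the attribute carrying exactly that argument text.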

Res: #2837

Examples/test-suite/php/cpp11_attribute_specifiers_runme.php

ojwb · 2024-03-20T19:20:55Z

I really think we want to do the grammar change to use named references as a separate commit (as I said in the issue earlier) - firstly so it's easy to verify that the generated code is unchanged by this refactor, but also to make it actually feasible to review the functional changes to the grammar for this.

mmomtchev · 2024-03-21T12:49:17Z

I see that there is already a callparms grammar item, which should probably be used instead of a new item (attribute_arg_list). The difference is that callparms returns a string, while attribute_arg_list is a numbered hash. If I reduce the arguments to a string, this will make the syntax for most attributes simpler, but it won't allow matching on a single argument when there are multiple arguments, which is probably not a needed feature.

Source/Modules/main.cxx

ojwb · 2024-04-02T22:25:04Z

Source/Modules/main.cxx

@@ -105,6 +105,7 @@ static const char *usage2 = "\
     -I-             - Don't search the current directory\n\
     -I<dir>         - Look for SWIG files in directory <dir>\n\
     -ignoremissing  - Ignore missing include files\n\
+     -ignoreattrs    - Ignore C++11/C23 [[attributes]]\n\


Making this a command line option seems unhelpful - it's something the user is likely to need to specify for a particular interface file (because it's something that is likely to be needed based on particular uses of attributes in third party headers they're trying to wrap), so it would be better to be able to specify it in the interface file (if it's a global setting, perhaps as a parameter on %module; if it's something that's useful to control in a more fine-grained way, perhaps as a %feature). I can see one might theoretically want it on for one wrapped header but not another, but that may be a bit of a contrived example.

I did this mostly because the specification for [[attribute]] is very vague, and I am afraid that now that parsing is implemented and mandatory, there might be cases where the parser does not expect an [[attribute]] and the header won't get through SWIG at all.

It seems reasonable to have a way to switch attribute handling back to the current "just ignore them in the lexer", I'm just saying making it a command line option seems the wrong approach for how to specify this.

This is a parser option; it does not belong in the parsed text. Since an attribute cannot contain a %feature it can be made to work, but still, I don't think it belongs there.

ojwb · 2024-04-02T22:33:42Z

Examples/test-suite/cpp11_attribute_specifiers.i

@@ -22,9 +27,13 @@
 #pragma warning(disable : 5030) // attribute is not recognized ('likely' and 'unlikely')
 #endif

+#ifndef __BIGGEST_ALIGNMENT__
+#define __BIGGEST_ALIGNMENT__ 16
+#endif


Defining a macro with a reserved name makes the program ill-formed so it'd be better to use e.g. BIGGEST_ALIGNMENT. If it actually matters to use __BIGGEST_ALIGNMENT__ when it is defined by the compiler we could define it using

#ifdef __BIGGEST_ALIGNMENT__
#define BIGGEST_ALIGNMENT __BIGGEST_ALIGNMENT__
#else
#define BIGGEST_ALIGNMENT 16
#endif

and then use BIGGEST_ALIGNMENT below.

I don't think this is the only anti-pattern in the unit tests; this will only add clutter.

Please change this.

ojwb · 2024-04-02T22:37:56Z

Examples/test-suite/cpp11_attribute_specifiers.i

+  int data[1] = { 0 };
+  int *a = data;
+  int b = a[a[0]];
+}


The point here is to ensure that SWIG can parse int b = a[a[0]]; (in particular, that the ]] is broken apart properly). SWIG doesn't parse function bodies, so moving this code into a function body doesn't actually test what was intended here.

(The reason this part is currently deactivated is just that the parser doesn't currently actually handle a double array dereference in an expression, but once that is fixed we want to be testing this properly.)

It will still have to be tokenized if attribute parsing is disabled - this is the only case where this could fail.

The parser just skips tokens between the { at the start of a function body and its matching closing }, so putting it in a function body will parse OK even if the ]] is scanned as a single token.
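To make the ]] problem concrete, here is a minimal sketch of a scanner that emits ]] as a single token only when it closes an attribute opened by [[. It is not SWIG's scanner: scan_brackets is a hypothetical helper written for this illustration, and it reports only bracket tokens while skipping everything else.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical helper, not SWIG's scanner: returns only the bracket tokens of
// the input and emits "]]" as one token only when it closes an attribute.
std::vector<std::string> scan_brackets(const std::string &src) {
  std::vector<std::string> tokens;
  bool in_attribute = false;  // true between "[[" and its closing "]]"
  int nested = 0;             // single '[' opened inside the attribute
  for (std::size_t i = 0; i < src.size(); ++i) {
    const char c = src[i];
    const bool doubled = i + 1 < src.size() && src[i + 1] == c;
    if (c == '[' && doubled && !in_attribute) {
      tokens.push_back("[[");
      in_attribute = true;
      ++i;  // consume the second '['
    } else if (c == ']' && doubled && in_attribute && nested == 0) {
      tokens.push_back("]]");
      in_attribute = false;
      ++i;  // consume the second ']'
    } else if (c == '[' || c == ']') {
      // A lone bracket, or "]]" that must be split into two tokens.
      if (in_attribute) nested += (c == '[') ? 1 : -1;
      tokens.push_back(std::string(1, c));
    }
    // All other characters are irrelevant to this sketch and are skipped.
  }
  return tokens;
}
```

With this approach int b = a[a[0]]; yields four single-bracket tokens, whereas a scanner that always treats ]] as one token would hand the parser something it cannot match against the two opening brackets.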

The next time you permit yourself to make a comment in order to pass a criminal message, I will seriously reconsider my decision from last year to skip sending an email to the legal department of every institution mentioned in the SWIG license files to inform them of a copyright problem involving plagiarism for a criminal extortion related to a series of falsified legal procedures in the EU. I will be completely honest and I will attest that you most probably do not have anything to do with the organized judicial and police corruption, the sexual elements, the international prostitution ring or the drug trade taking part in this affair. My patience has limits.

@mmomtchev, this last comment appears threatening, is not constructive and is not welcome in the collaborative SWIG development community that we strive for. @ojwb is merely trying to help review and improve your (very welcome) contribution. I don't understand the comment as it appears out of context and I hope it has been posted on the wrong issue, in which case, please withdraw it. If not, and if you have some non-technical issue about SWIG that you would like to discuss, please raise a separate discussion thread. Otherwise, I ask you to kindly keep discussions on topic to C++11 attributes. Thank you for your understanding.

erezgeva · 2024-06-26T10:10:08Z

Pardon me for asking.
I understand in general the need to support C/C++ attributes.
I would like to ask some clarifying questions:

Do you mean parsing C/C++ code that uses attributes, or adding them to the generated code?
I understand C++11 is the default, but why C23? It seems like a big gap. Perhaps I missed it?
What about attributes from newer C++ standards, optionally? At least for parsing?

Erez

mmomtchev · 2024-06-26T10:17:28Z

Attributes exist in C++ starting with C++11 and C starting with C23. The attributes themselves are not interpreted in any way - they are simply made available to the end user for matching %feature. The first use case is libraries that depend on gnu::visibility to not export certain methods/symbols.
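As an illustration of that use case, a header might mark symbols as follows. This is a sketch only: internal_helper and public_entry are made-up names, and the gnu::visibility attribute is specific to GCC-compatible compilers (others will typically warn about an unknown attribute).

```cpp
#include <cassert>

// Hidden: not exported from a shared library, so a wrapper could not link to it.
[[gnu::visibility("hidden")]] int internal_helper() { return 1; }

// Default visibility: part of the exported interface.
[[gnu::visibility("default")]] int public_entry() { return internal_helper() + 1; }
```

A wrapper generated for such a header would want to skip internal_helper, since it is not exported from the shared library, and being able to match on the attribute is what makes that selection possible.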

implement C++11/C23 [[attributes]]

25c987b

mmomtchev force-pushed the cpp11-attributes branch from 0f6ae79 to 25c987b Compare March 24, 2024 19:39

remove the function-like has_cpp_attribute macro

30f7962

wsfulton mentioned this pull request Apr 18, 2024

swig 4.x can not parse large c enum #2876

Open
