
About the a_k, b, x_a, x_b in the paper #36

Open
@auto-Dog


Hi, I read your paper Attentional Pooling for Action Recognition and think it would be a great fit for the pooling in my network (a C3D network for video recognition). However, in your code I could not find clear traces of the a_k, b, x_a, x_b from the paper, or of the corresponding pooling module. All I can see is the "POSE_ATTENTION_LOGITS" part:

```python
if cfg.NET.USE_POSE_ATTENTION_LOGITS:
    with tf.variable_scope('PoseAttention'):
        # use the pose prediction as an attention map to get the features
        # step 1: split pose logits over channels
        pose_logits_parts = tf.split(
            pose_logits, pose_logits.get_shape().as_list()[-1],
            axis=pose_logits.get_shape().ndims - 1)
```
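
For context, here is my current reading of the paper's rank-1 pooling, written as a small NumPy sketch. This is my own interpretation, not code from your repository; the variable names follow the paper, and the sizes and random values are placeholders just to show the shapes:

```python
import numpy as np

# Second-order pooling scores class k as trace(W_k X^T X), where X is the
# n x f feature map (n spatial locations, f channels). The paper, as I read
# it, approximates W_k with the rank-1 factorization a_k b^T, which gives
#   score_k = b^T X^T X a_k = (X a_k)^T (X b) = x_a^T x_b,
# where x_a = X a_k is the class-specific (top-down) attention map and
# x_b = X b is the class-agnostic (bottom-up) saliency map.

n, f = 32 * 32, 128                  # toy sizes for illustration only
rng = np.random.default_rng(0)
X = rng.standard_normal((n, f))      # flattened conv feature map
a_k = rng.standard_normal((f, 1))    # top-down weights, one per class
b = rng.standard_normal((f, 1))      # bottom-up weights, shared by classes

x_a = X @ a_k                        # n x 1 class-specific attention map
x_b = X @ b                          # n x 1 bottom-up attention map
score_k = float(x_a.T @ x_b)         # scalar score for class k
```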

Could you give me a brief instruction on this, so that I can use your attention pooling module to pool a [bsz, 128, 16, 32, 32] feature into a [bsz, 128, 1, 32, 32] one?
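
In case it helps clarify what I am after, here is a rough NumPy sketch of the kind of temporal pooling I have in mind: an attention map over the T dimension, in the spirit of x_b = X b above. The random b stands in for a learned parameter, and the softmax over T is my own assumption, not something I am claiming the paper does:

```python
import numpy as np

def softmax(x, axis):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

# feats: [bsz, C, T, H, W]; pool over T with a bottom-up attention map.
bsz, C, T, H, W = 2, 128, 16, 32, 32
feats = np.random.randn(bsz, C, T, H, W).astype(np.float32)
b = np.random.randn(C).astype(np.float32)   # would be learned in practice

attn = np.einsum('bcthw,c->bthw', feats, b)         # [bsz, T, H, W] logits
attn = softmax(attn, axis=1)                        # normalize over T
pooled = np.einsum('bcthw,bthw->bchw', feats, attn) # weighted sum over T
pooled = pooled[:, :, None, :, :]                   # [bsz, C, 1, H, W]
```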
