Some questions on ViMON

Hello, I'm fairly new to this field and have some questions about the ViMON implementation (apologies if this is not the right place to ask).

1. Regarding the KL divergence between the prior and the posterior: I don't think that "KL between a product of Normals and a standard Normal" equals "sum of KLs between each Normal and a standard Normal".

![image](https://github.com/ecker-lab/object-centric-representation-benchmark/assets/75618251/757a8d41-765a-48f2-96b1-4a903ac0a589)

However, in the implementation [it is calculated simply as a sum over k](https://github.com/ecker-lab/object-centric-representation-benchmark/blob/master/ocrb/vimon/networks/model.py#L57-L58).

```python
def kl_div(mu, logvar):
    return - 0.5 * torch.sum(1 + logvar - mu.pow(2) - logvar.exp(), dim=3).mean(dim=(0, 1)).sum()
```

Should I assume that some approximation is happening here?
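
For what it's worth, a small numerical check may help frame this question. The sketch below assumes the posterior factorizes across slots and latent dimensions (one diagonal Gaussian per slot) and that the prior is a standard Normal. Under that assumption, the KL of the joint should match the sum of the per-slot KLs exactly, with no approximation. The helper names and values are hypothetical and not taken from the repository, and it uses plain Python rather than torch.

```python
import math

def kl_per_dim(mu, logvar):
    """Closed-form KL( N(mu, exp(logvar)) || N(0, 1) ) for one scalar latent."""
    return -0.5 * (1.0 + logvar - mu ** 2 - math.exp(logvar))

def kl_sum_over_slots(mus, logvars):
    """Sum of per-slot KLs; each slot is a diagonal Gaussian over D dims."""
    return sum(
        sum(kl_per_dim(m, lv) for m, lv in zip(mu_k, logvar_k))
        for mu_k, logvar_k in zip(mus, logvars)
    )

def kl_joint(mus, logvars):
    """KL of the joint Gaussian over all K*D latents against N(0, I).

    Uses the multivariate formula 0.5 * (tr(S) + mu^T mu - n - log det S),
    where S is diagonal because the latents are assumed independent.
    """
    mu = [m for mu_k in mus for m in mu_k]
    logvar = [lv for logvar_k in logvars for lv in logvar_k]
    n = len(mu)
    trace = sum(math.exp(lv) for lv in logvar)
    log_det = sum(logvar)
    return 0.5 * (trace + sum(m * m for m in mu) - n - log_det)

# Hypothetical values: K = 2 slots, D = 3 latent dimensions each.
mus = [[0.5, -1.0, 0.2], [1.5, 0.0, -0.3]]
logvars = [[0.1, -0.4, 0.0], [-1.0, 0.3, 0.2]]

kl_a = kl_sum_over_slots(mus, logvars)
kl_b = kl_joint(mus, logvars)
print("sum over slots:", kl_a)
print("joint:         ", kl_b)
```

If the posterior in the image is meant as something other than an independent joint over the slots, this check would not apply.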

<br>

2. Why isn't the scope calculated as in the paper [here](https://github.com/ecker-lab/object-centric-representation-benchmark/blob/master/ocrb/vimon/networks/model.py#L120-L123)? According to the paper, it should be:

```python
log_s_k = log_s_k + F.logsigmoid(1 - alpha_k)
```

My guess is that adding 1 to -alpha_k doesn't really matter in log-space (it only shifts the value within the negative range after logsigmoid), but is there a reason to deliberately omit the 1?
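
One possible reading, offered as an assumption rather than a statement about the repository: if `alpha_k` in the code holds pre-sigmoid attention logits and the mask is `sigmoid(alpha_k)`, then a scope update of the form `s_{k+1} = s_k * (1 - mask_k)` becomes `log_s_k + logsigmoid(-alpha_k)` in log-space, because `1 - sigmoid(x) = sigmoid(-x)`. The sketch below checks that identity using only the standard library. `logsigmoid` here is a hypothetical stand-in for `F.logsigmoid`, and the logit values are made up.

```python
import math

def sigmoid(x):
    """Logistic function."""
    return 1.0 / (1.0 + math.exp(-x))

def logsigmoid(x):
    """Numerically stable log(sigmoid(x)); stand-in for F.logsigmoid."""
    if x >= 0:
        return -math.log1p(math.exp(-x))
    return x - math.log1p(math.exp(x))

logits = [-3.0, -0.5, 0.0, 0.7, 4.0]  # hypothetical pre-sigmoid values

# log(1 - mask), with mask = sigmoid(logit): the log-space scope factor.
direct = [math.log(1.0 - sigmoid(x)) for x in logits]
# The same quantity written without the explicit "1 -".
via_identity = [logsigmoid(-x) for x in logits]
# What logsigmoid(1 - x) would give instead, for comparison.
shifted = [logsigmoid(1.0 - x) for x in logits]

for x, d, v, s in zip(logits, direct, via_identity, shifted):
    print(f"logit={x:+.1f}  log(1-mask)={d:.6f}  "
          f"logsigmoid(-x)={v:.6f}  logsigmoid(1-x)={s:.6f}")
```

Under this assumption the first two columns agree and the third does not, which would make the missing 1 exact rather than an approximation. If `alpha_k` is already a probability, the identity does not apply.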

<br>

Sorry for taking up your time, and thanks in advance!
