<p><a href="https://arxiv.org/abs/2504.19854">[Paper on ArXiv]</a> <a href="https://github.com/declare-lab/nora">[Code on GitHub]</a> <a href="https://huggingface.co/collections/declare-lab/nora-6811ba3e820ef362d9eca281">[Hugging Face]</a></p>
<font color="#061E61"><b>Figure 1:</b> NORA, as depicted in this figure, has three major components: (i) an image encoder, (ii) a vision-language model (VLM), and (iii) the FAST+ action tokenizer. The image encoder encodes the current state of the environment. The VLM then predicts the next action needed to accomplish the input goal, given the current state. Finally, FAST+ decodes the VLM's output tokens into executable robot actions.</font>
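For readers who prefer code to diagrams, the sketch below illustrates the three-stage inference flow described in the caption: encode the observation, let the VLM predict discrete action tokens for the goal, and decode those tokens into robot actions with FAST+. This is a minimal illustration, not NORA's confirmed API; the checkpoint IDs, the `trust_remote_code` loading path, and the `decode` call on the FAST+ processor are assumptions based on typical Hugging Face usage.

```python
# Illustrative sketch of the NORA inference pipeline (image -> VLM -> FAST+ -> actions).
# NOTE: the model IDs and the exact processor/decoder calls are assumptions, not
# NORA's documented entry points; consult the repository for the supported API.
import torch
from PIL import Image
from transformers import AutoModelForVision2Seq, AutoProcessor

# (i) + (ii) Load the vision-language model and its processor (which handles image encoding).
vlm_id = "declare-lab/nora"  # assumed checkpoint name
processor = AutoProcessor.from_pretrained(vlm_id, trust_remote_code=True)
vlm = AutoModelForVision2Seq.from_pretrained(vlm_id, torch_dtype=torch.bfloat16).eval()

# (iii) Load the FAST+ action tokenizer that maps VLM output tokens back to robot actions.
fast_plus = AutoProcessor.from_pretrained(
    "physical-intelligence/fast", trust_remote_code=True  # assumed tokenizer source
)

# Encode the current observation together with the language goal.
image = Image.open("observation.png")
goal = "pick up the red block"
inputs = processor(images=image, text=goal, return_tensors="pt")

# The VLM predicts discrete action tokens conditioned on the observation and goal.
with torch.no_grad():
    action_token_ids = vlm.generate(**inputs, max_new_tokens=64)

# FAST+ decodes the predicted tokens into a continuous action chunk for the robot.
actions = fast_plus.decode(action_token_ids.tolist())
print(actions)
```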