Brain Team AI Scientists are encouraged to spend **half of their working hours on each** of **Strategic Research**, research aligned with the company's business direction, and ~~**General Research**, independent research on topics of personal interest~~ (currently suspended due to company circumstances); this is one of the ways we strive to provide the best possible research environment.
If you would like to **attend conferences such as NeurIPS, ICLR, CVPR, ECCV, Interspeech, and ACL, or submit papers to them**, we actively support you! 💸 (For the papers accepted so far, please see the **[Publications](/publications)** tab!)
<features.PaperLinkItem paperLink="https://arxiv.org/abs/2503.23730" title="KOFFVQA: An Objectively Evaluated Free-form VQA Benchmark for Large Vision-Language Models in the Korean Language" />
<features.PaperDescription preview="Query expansion methods powered by large language models (LLMs) have demonstrated effectiveness in zero-shot retrieval tasks. "
description="These methods assume that LLMs can generate hypothetical documents that, when incorporated into a query vector, enhance the retrieval of real evidence. However, we challenge this assumption by investigating whether knowledge leakage in benchmarks contributes to the observed performance gains. Using fact verification as a testbed, we analyzed whether the generated documents contained information entailed by ground truth evidence and assessed their impact on performance. Our findings indicate that performance improvements occurred consistently only for claims whose generated documents included sentences entailed by ground truth evidence. This suggests that knowledge leakage may be present in these benchmarks, inflating the perceived performance of LLM-based query expansion methods, particularly in real-world scenarios that require retrieving niche or novel knowledge."/>
<features.PaperTitle paperLink="https://arxiv.org/abs/2503.23730" title="KOFFVQA: An Objectively Evaluated Free-form VQA Benchmark for Large Vision-Language Models in the Korean Language"/>
<features.PaperDescription preview="The recent emergence of Large Vision-Language Models(VLMs) has resulted in a variety of different benchmarks for evaluating such models. "
description="Despite this, we observe that most existing evaluation methods suffer from the fact that they either require the model to choose from pre-determined responses, sacrificing open-endedness, or evaluate responses using a judge model, resulting in subjective and unreliable evaluation. In addition, we observe a lack of benchmarks for VLMs in the Korean language, which are necessary as a separate metric from more common English language benchmarks, as the performance of generative language models can differ significantly based on the language being used. Therefore, we present KOFFVQA, a general-purpose free-form visual question answering benchmark in the Korean language for the evaluation of VLMs. Our benchmark consists of 275 carefully crafted questions each paired with an image and grading criteria covering 10 different aspects of VLM performance. The grading criteria eliminate the problem of unreliability by allowing the judge model to grade each response based on a pre-determined set of rules. By defining the evaluation criteria in an objective manner, even a small open-source model can be used to evaluate models on our benchmark reliably. In addition to evaluating a large number of existing VLMs on our benchmark, we also experimentally verify that our method of using pre-existing grading criteria for evaluation is much more reliable than existing methods. Our evaluation code is available at https://github.com/maum-ai/KOFFVQA."/>
<features.PaperTitle paperLink="https://arxiv.org/abs/2410.01273" title="CANVAS: Commonsense-Aware Navigation System for Intuitive Human-Robot Interaction"/>
<features.PaperDescription preview="Real-life robot navigation involves more than just reaching a destination; it requires optimizing movements while addressing scenario-specific goals. "
description="An intuitive way for humans to express these goals is through abstract cues like verbal commands or rough sketches. Such human guidance may lack details or be noisy. Nonetheless, we expect robots to navigate as intended. For robots to interpret and execute these abstract instructions in line with human expectations, they must share a common understanding of basic navigation concepts with humans. To this end, we introduce CANVAS, a novel framework that combines visual and linguistic instructions for commonsense-aware navigation. Its success is driven by imitation learning, enabling the robot to learn from human navigation behavior. We present COMMAND, a comprehensive dataset with human-annotated navigation results, spanning over 48 hours and 219 km, designed to train commonsense-aware navigation systems in simulated environments. Our experiments show that CANVAS outperforms the strong rule-based system ROS NavStack across all environments, demonstrating superior performance with noisy instructions. Notably, in the orchard environment, where ROS NavStack records a 0% total success rate, CANVAS achieves a total success rate of 67%. CANVAS also closely aligns with human demonstrations and commonsense constraints, even in unseen environments. Furthermore, real-world deployment of CANVAS showcases impressive Sim2Real transfer with a total success rate of 69%, highlighting the potential of learning from human demonstrations in simulated environments for real-world applications."/>
<features.PaperTitle paperLink="https://openreview.net/forum?id=U6wyOnPt1U" title="Integrating Visual and Linguistic Instructions for Context-Aware Navigation Agents"/>