Spaces: metacritical/DeepSeekPapers


metacritical committed on Feb 18

Commit 4af19e1 (verified) · 1 parent: 89f340d

Other style.


Files changed (1)

index.html +106 -102


@@ -29,9 +29,9 @@
     <div class="container is-max-desktop">
       <div class="columns is-centered">
         <div class="column has-text-centered">
-          <h1 class="title is-1 publication-title">DeepSeek Papers</h1>
           <div class="is-size-5 publication-authors">
-            Advancing Open-Source Language Models
           </div>
         </div>
       </div>
@@ -44,12 +44,11 @@
     <!-- Abstract. -->
     <div class="columns is-centered has-text-centered">
       <div class="column is-four-fifths">
-        <h2 class="title is-3">DeepSeek Research Contributions</h2>
         <div class="content has-text-justified">
           <p>
-            Below is a list of significant papers by DeepSeek detailing advancements in large language models (LLMs),
-            ordered by release date from most recent to oldest. Each paper includes a brief description and highlights
-            upcoming deep dives.
           </p>
         </div>
       </div>
@@ -57,111 +56,107 @@
     <!--/ Abstract. -->
     <!-- Paper Collection -->
-    <div class="columns is-centered">
       <div class="column is-four-fifths">
-        <div class="content">
-          <div class="publication-list">
-            <!-- Papers in chronological order -->
-            <div class="publication-item">
-              <div class="publication-title">
-                <a href="#">DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning</a>
-                <span class="tag is-info is-light">[Deep Dive Coming Soon]</span>
-              </div>
-              <div class="publication-info">
-                <strong>Release Date:</strong> January 20, 2025
-              </div>
-              <div class="publication-description">
-                The R1 model enhances reasoning capabilities through large-scale reinforcement learning, competing
-                directly with leading models like OpenAI's o1.
-              </div>
             </div>
-            <div class="publication-item">
-              <div class="publication-title">
-                <a href="#">DeepSeek-V3 Technical Report</a>
-                <span class="tag is-info is-light">[Deep Dive Coming Soon]</span>
-              </div>
-              <div class="publication-info">
-                <strong>Release Date:</strong> December 2024
-              </div>
-              <div class="publication-description">
-                This report discusses the scaling of sparse MoE networks to 671 billion parameters, utilizing mixed
-                precision training and HPC co-design strategies.
-              </div>
             </div>
-            <div class="publication-item">
-              <div class="publication-title">
-                <a href="#">DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model</a>
-                <span class="tag is-info is-light">[Deep Dive Coming Soon]</span>
-              </div>
-              <div class="publication-info">
-                <strong>Release Date:</strong> May 2024
-              </div>
-              <div class="publication-description">
-                This paper introduces a Mixture-of-Experts (MoE) architecture, enhancing performance while reducing
-                training costs by 42%.
-              </div>
             </div>
-            <div class="publication-item">
-              <div class="publication-title">
-                <a href="#">DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models</a>
-                <span class="tag is-info is-light">[Deep Dive Coming Soon]</span>
-              </div>
-              <div class="publication-info">
-                <strong>Release Date:</strong> April 2024
-              </div>
-              <div class="publication-description">
-                This paper presents methods to improve mathematical reasoning in LLMs, introducing the Group
-                Relative Policy Optimization (GRPO) algorithm.
-              </div>
             </div>
-            <div class="publication-item">
-              <div class="publication-title">
-                <a href="#">DeepSeekLLM: Scaling Open-Source Language Models with Longer-termism</a>
-                <span class="tag is-info is-light">[Deep Dive Coming Soon]</span>
-              </div>
-              <div class="publication-info">
-                <strong>Release Date:</strong> November 29, 2023
-              </div>
-              <div class="publication-description">
-                This foundational paper explores scaling laws and the trade-offs between data and model size,
-                establishing the groundwork for subsequent models.
-              </div>
             </div>
-            <div class="publication-item">
-              <div class="publication-title">
-                <a href="#">DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data</a>
-                <span class="tag is-info is-light">[Deep Dive Coming Soon]</span>
-              </div>
-              <div class="publication-description">
-                Focuses on enhancing theorem proving capabilities in language models using synthetic data for training.
-              </div>
-            </div>
-            <div class="publication-item">
-              <div class="publication-title">
-                <a href="#">DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence</a>
-                <span class="tag is-info is-light">[Deep Dive Coming Soon]</span>
-              </div>
-              <div class="publication-description">
-                This paper details advancements in code-related tasks with an emphasis on open-source methodologies,
-                improving upon earlier coding models.
-              </div>
-            </div>
-            <div class="publication-item">
-              <div class="publication-title">
-                <a href="#">DeepSeekMoE</a>
-                <span class="tag is-info is-light">[Deep Dive Coming Soon]</span>
-              </div>
-              <div class="publication-description">
-                Discusses the integration and benefits of the Mixture-of-Experts approach within the DeepSeek framework.
-              </div>
-            </div>
           </div>
         </div>
       </div>
@@ -172,10 +167,19 @@
 <footer class="footer">
   <div class="container">
     <div class="content has-text-centered">
-      <p>
-        This website is licensed under a <a rel="license" href="http://creativecommons.org/licenses/by-sa/4.0/">Creative
-        Commons Attribution-ShareAlike 4.0 International License</a>.
-      </p>
     </div>
   </div>
 </footer>

     <div class="container is-max-desktop">
       <div class="columns is-centered">
         <div class="column has-text-centered">
+          <h1 class="title is-1 publication-title">DeepSeek: Advancing Open-Source Language Models</h1>
           <div class="is-size-5 publication-authors">
+            A collection of groundbreaking research papers in AI and language models
           </div>
         </div>
       </div>
     <!-- Abstract. -->
     <div class="columns is-centered has-text-centered">
       <div class="column is-four-fifths">
+        <h2 class="title is-3">Overview</h2>
         <div class="content has-text-justified">
           <p>
+            DeepSeek has released a series of significant papers detailing advancements in large language models (LLMs).
+            Each paper represents a step forward in making AI more capable, efficient, and accessible.
           </p>
         </div>
       </div>
     <!--/ Abstract. -->
     <!-- Paper Collection -->
+    <div class="columns is-centered has-text-centered">
       <div class="column is-four-fifths">
+        <h2 class="title is-3">Research Papers</h2>
+        <!-- Paper 1 -->
+        <div class="publication-block">
+          <div class="publication-header">
+            <h3 class="title is-4">DeepSeekLLM: Scaling Open-Source Language Models with Longer-termism</h3>
+            <span class="tag is-primary is-medium">Deep Dive Coming Soon</span>
+            <div class="is-size-5 publication-authors">
+              Released: November 29, 2023
             </div>
+          </div>
+          <div class="content has-text-justified">
+            <p>This foundational paper explores scaling laws and the trade-offs between data and model size,
+            establishing the groundwork for subsequent models.</p>
+          </div>
+        </div>
+        <!-- Paper 2 -->
+        <div class="publication-block">
+          <div class="publication-header">
+            <h3 class="title is-4">DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model</h3>
+            <span class="tag is-primary is-medium">Deep Dive Coming Soon</span>
+            <div class="is-size-5 publication-authors">
+              Released: May 2024
             </div>
+          </div>
+          <div class="content has-text-justified">
+            <p>Introduces a Mixture-of-Experts (MoE) architecture, enhancing performance while reducing
+            training costs by 42%.</p>
+          </div>
+        </div>
+        <!-- Additional papers following same structure -->
+        <div class="publication-block">
+          <div class="publication-header">
+            <h3 class="title is-4">DeepSeek-V3 Technical Report</h3>
+            <span class="tag is-primary is-medium">Deep Dive Coming Soon</span>
+            <div class="is-size-5 publication-authors">
+              Released: December 2024
             </div>
+          </div>
+          <div class="content has-text-justified">
+            <p>Discusses the scaling of sparse MoE networks to 671 billion parameters.</p>
+          </div>
+        </div>
+        <div class="publication-block">
+          <div class="publication-header">
+            <h3 class="title is-4">DeepSeek-R1: Incentivizing Reasoning Capability in LLMs</h3>
+            <span class="tag is-primary is-medium">Deep Dive Coming Soon</span>
+            <div class="is-size-5 publication-authors">
+              Released: January 20, 2025
             </div>
+          </div>
+          <div class="content has-text-justified">
+            <p>Enhances reasoning capabilities through large-scale reinforcement learning.</p>
+          </div>
+        </div>
+        <div class="publication-block">
+          <div class="publication-header">
+            <h3 class="title is-4">DeepSeekMath: Pushing the Limits of Mathematical Reasoning</h3>
+            <span class="tag is-primary is-medium">Deep Dive Coming Soon</span>
+            <div class="is-size-5 publication-authors">
+              Released: April 2024
             </div>
+          </div>
+          <div class="content has-text-justified">
+            <p>Presents methods to improve mathematical reasoning in LLMs.</p>
+          </div>
+        </div>
+        <div class="publication-block">
+          <div class="publication-header">
+            <h3 class="title is-4">DeepSeek-Prover: Advancing Theorem Proving in LLMs</h3>
+            <span class="tag is-primary is-medium">Deep Dive Coming Soon</span>
+          </div>
+          <div class="content has-text-justified">
+            <p>Focuses on enhancing theorem proving capabilities using synthetic data for training.</p>
+          </div>
+        </div>
+        <div class="publication-block">
+          <div class="publication-header">
+            <h3 class="title is-4">DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models</h3>
+            <span class="tag is-primary is-medium">Deep Dive Coming Soon</span>
+          </div>
+          <div class="content has-text-justified">
+            <p>Details advancements in code-related tasks with emphasis on open-source methodologies.</p>
+          </div>
+        </div>
+        <div class="publication-block">
+          <div class="publication-header">
+            <h3 class="title is-4">DeepSeekMoE: Advancing Mixture-of-Experts Architecture</h3>
+            <span class="tag is-primary is-medium">Deep Dive Coming Soon</span>
+          </div>
+          <div class="content has-text-justified">
+            <p>Discusses the integration and benefits of the Mixture-of-Experts approach.</p>
           </div>
         </div>
       </div>
 <footer class="footer">
   <div class="container">
     <div class="content has-text-centered">
+      <a class="icon-link external-link" href="https://github.com/deepseek-ai" target="_blank">
+        <i class="fab fa-github"></i>
+      </a>
+    </div>
+    <div class="columns is-centered">
+      <div class="column is-8">
+        <div class="content">
+          <p>
+            This website is licensed under a <a rel="license" href="http://creativecommons.org/licenses/by-sa/4.0/">Creative
+            Commons Attribution-ShareAlike 4.0 International License</a>.
+          </p>
+        </div>
+      </div>
     </div>
   </div>
 </footer>