Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Paper Copilot
Skins
  • Light
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Collapse
CSPaper

CSPaper: review sidekick

Go to CCFDDL
Go to CSRankings
Go to OpenReview
  1. Home
  2. Anonymous Sharing & Supplementary Materials
  3. ARR-addon-results&codebase-nr0034je9

ARR-addon-results&codebase-nr0034je9

Scheduled Pinned Locked Moved Anonymous Sharing & Supplementary Materials
1 Posts 1 Posters 155 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • H Offline
    H Offline
    Hu8kKo34
    Super Users
    wrote on last edited by
    #1

    Impl. based on nr0034je9.zip .


    Table A: Model Performance on NLP Benchmarks

    Model SST-2 (Acc) MNLI (Acc) QNLI (Acc) CoLA (Matthews) Avg Score
    BERT-Base 91.2 84.6 90.1 58.2 81.0
    RoBERTa-Base 92.3 87.4 91.8 63.1 83.7
    GPT-3 (175B) 94.1 88.9 93.0 66.4 85.6
    Our Method 94.8 89.7 93.5 68.9 86.7

    Table B: Ablation Study on Model Components (Evaluated on MNLI)

    Configuration Attention Mechanism Pretraining Corpus MNLI (Acc)
    Full Model Multi-head Self-Attn Custom + Public 89.7
    – w/o Custom Corpus Multi-head Self-Attn Public Only 87.1
    – w/o Attention Refinement Block Basic Self-Attn Custom + Public 86.5
    – w/o Positional Embeddings Multi-head Self-Attn Custom + Public 85.2
    – Random Initialization — — 72.4
    1 Reply Last reply
    0
    Reply
    • Reply as topic
    Log in to reply
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes


    • Login

    • Don't have an account? Register

    • Login or register to search.
    © 2025 CSPaper.org Sidekick of Peer Reviews
    Debating the highs and lows of peer review in computer science.
    • First post
      Last post
    0
    • Categories
    • Recent
    • Tags
    • Popular
    • World
    • Paper Copilot