{ "Summary": "The paper introduces DualDiff, a dual-expert denoising architecture designed to enhance diffusion models' performance on low-dimensional datasets. Using a gating mechanism to dynamically combine two specialized expert networks, the model aims to improve the capture of multi-modal distributions. Extensive experiments on 2D datasets demonstrate the approach's efficacy, particularly on complex datasets like 'dino'.", "Strengths": [ "Novel combination of dual-expert networks and a dynamic gating mechanism.", "Significant improvements in capturing complex, multi-modal distributions.", "Thorough experimental evaluation with both quantitative and qualitative analysis." ], "Weaknesses": [ "Increased complexity and computational cost without full justification.", "Modest improvements on simpler datasets, questioning general applicability.", "Some sections could benefit from clearer explanations.", "Lack of comparison with recent state-of-the-art methods.", "Limited scope of datasets and depth of experimental analysis.", "Insufficient discussion on potential negative societal impacts." ], "Originality": 3, "Quality": 3, "Clarity": 2, "Significance": 3, "Questions": [ "Can the authors provide more details on the gating mechanism and its behavior?", "How does the diversity loss term specifically contribute to the overall performance?", "Can the authors justify the increased computational cost and complexity?", "How does the proposed methodology compare with more recent state-of-the-art methods in low-dimensional generative modeling?", "Can the authors provide additional insights into the trade-off between improved mode capture and increased computation time?", "What are the potential strategies for scaling the architecture to higher-dimensional datasets?", "Can the authors clarify the choice of datasets and whether these are representative of real-world low-dimensional data?", "How were the hyperparameters chosen and what was their impact on the results?", "Can the authors provide a broader set of experiments, including comparisons with other state-of-the-art methods?" ], "Limitations": [ "Increased model complexity leads to longer training and inference times.", "Modest improvements in simpler datasets suggest limited generalizability.", "The increased computational cost may not be suitable for time-sensitive applications.", "The diversity loss term, while improving mode capture, slightly affects reconstruction accuracy.", "The approach's benefit in simpler datasets is limited, suggesting a niche applicability primarily for complex datasets.", "Limited to low-dimensional datasets; scalability to higher dimensions not explored." ], "Ethical Concerns": false, "Soundness": 3, "Presentation": 2, "Contribution": 2, "Overall": 4, "Confidence": 4, "Decision": "Reject" }