Skip to content

Training and implement details for Chatlaw2 #87

@jacky-rui

Description

@jacky-rui

I am currently exploring the ChatLaw models and I have a few questions regarding their training schemes and roles within the ensemble model.

  1. Could you please provide detailed information about the training schemes used for ChatLaw2_plain and ChatLaw2E_plain? Specifically, I am interested in the datasets, preprocessing steps, model architectures, and any fine-tuning techniques applied.

  2. Additionally, I would like to understand the role that ChatLaw2_plain and ChatLaw2E_plain play within the ChatLaw2_MOE (Mixture of Experts) model. How do these models interact and contribute to the overall performance of ChatLaw2_MOE?

Thank you in advance for your assistance. I look forward to your response.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions