Really hope that LLM research will be more matured and big players (Meta, etc.) fully open their dataset source (not just pre-trained weight) of training data (publicly available) thus other researchers and users can really judge "how much unbiased" it was. Research in Computer Vision for example has shown many research that were developed based on fully open source dataset (ImageNet, COCO, Visual Object Tracking, etc). Reproducible for other researchers.

Reply to this note

Please Login to reply.

Discussion

No replies yet.