What's Changed
- [Open Source Internship] Speech2Text model migration by @imeet000 in #1725
- [Open Source Internship] Unispeech model migration by @liuyifan123123 in #1722
- Fix typing mistakes in documentation by @Yashbhatt786 in #1740
- support O2 on OrangePi, tinyllama 450ms -> 160ms by @lvyufeng in #1744
- fix no_grad state error by @lvyufeng in #1754
- avoid lack of cumsum operator on OrangePi by @lvyufeng in #1755
- fix enable_grad by @lvyufeng in #1757
- fix name for nn.Linear by @lvyufeng in #1758
- fix backward error caused by nll_loss by @lvyufeng in #1759
- fix out_channels of Linear by @lvyufeng in #1760
- add toc in readme.md by @adarsh-jha-dev in #1746
- safe_load_file uses mmap to speed up loading by @lvyufeng in #1763
- fix error caused by from_numpy by @lvyufeng in #1765
- fix llama embedding by @lvyufeng in #1774
- upgrade wav2vec by @lvyufeng in #1776
- update nn.utils.parametrizations by @lvyufeng in #1777
- fix sew and sew_d by @lvyufeng in #1782
- fix whisper ut by @lvyufeng in #1786
- fix llava on Ascend by @lvyufeng in #1788
- fix mixtral ut by @lvyufeng in #1790
- fix gamma on CPU and GPU by @lvyufeng in #1791
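- [Open Source Summer] add dpo trainer and support dpo training for mindnlp by @Trace2333 in #1701
- [Open Source Internship] GPT-J-6B model application development by @LuMH1027 in #1742
- [Open Source Internship] ImageGPT model application development by @Helloyouth2022 in #1700
- [Open Source Internship] X_CLIP model application development by @dyedd in #1694
- [Open Source Internship] UPerNet model application development by @Maysixi in #1717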
- update whisper stream by @lvyufeng in #1796
- [Open Source Internship] multilayer_perceptron_lora model fine-tuning by @barryyfli in #1749
- dino application development by @LuMH1027 in #1797
- support depth_estimation & doc_qa by @lvyufeng in #1800
- support fill_mask/image_classification/image_feature_extraction pipelines by @lvyufeng in #1801
- init mimm & move tests folder by @lvyufeng in #1804
- fix per_gpu args by @lvyufeng in #1809
- add mindspore infer function patch by @lvyufeng in #1810
- fix sync parallel and support low_cpu_mem_usage by @lvyufeng in #1814
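- [Open Source Internship] dpt model application development by @yegoling in #1813
- [Open Source Internship] LayoutLMv2 model application development by @Helloyouth2022 in #1805
- [Open Source Internship] BERT model application development by @KhunLounZai in #1781
- [Open Source Internship] peft_adalora_seq2seq model fine-tuning by @liuyifan123123 in #1747
- [Open Source Internship] Vision Transformer model application development by @KhunLounZai in #1643
- [Open Source Internship] SegFormer model application development by @KhunLounZai in #1663
- [Open Source Internship] BEiT model application development by @somecatw in #1680
- [Open Source Internship] SAM model application development by @KhunLounZai in #1688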
- Update prompt_direct.txt by @YadlaMani in #1732
- fix: resolve minicpm not being registered by @ResDream in #1821
- Module support H2D move by @lvyufeng in #1831
- fix low_cpu_mem_usage (contiguous) by @lvyufeng in #1832
- value_and_grad support attach grads, Parameter support accumulate a… by @lvyufeng in #1833
- fix update_and_allocate by @lvyufeng in #1834
- value_and_grad support kwargs by @lvyufeng in #1835
- fix blip inference error by @confused666 in #1838
- fix roll on CPU by @lvyufeng in #1840
- deprecate mindnlp.transformers.optimization by @lvyufeng in #1841
- fix optimizer args as same dtype on GPU by @lvyufeng in #1842
- image_classification_timm_peft_lora model fine-tuning by @chenxinxi in #1830
- [Open Source Internship] MobileViTV2 model migration by @oucfm in #1850
- rework deepseek_v2 model and add missing registration by @ShangJingLi in #1859
- fix: Fix problem using ops.ones in BigBird by @reeered in #1861
- #benchmark: add GLUE-QNLI benchmark, including inference accuracy comparison of 10 models by @xuhangscut in #1865
- #fix benchmark GLUE-QNLI: fix read_csv error and predict function, and modify readme description by @xuhangscut in #1868
- feat: add data parallel of native mindspore to mindnlp.Trainer.base by @Tridu33 in #1852
- Revert "image_classification_timm_peft_lora model fine-tuning" by @lvyufeng in #1871
- add minicpm3 model and dynamic inference demo by @xing-yiren in #1870
- fix baichuan finfo error by @lvyufeng in #1872
- fix sbert precision problem on mindnlp.sentence by @lvyufeng in #1873
- fix cell_ to module_ in mindnlp.peft by @lvyufeng in #1874
- fix cells to modules in mindnlp.peft by @lvyufeng in #1875
- fix named_modules by @lvyufeng in #1877
- fix sbert normalize_embeddings by @lvyufeng in #1879
- [Open Source Internship] MindSpore custom RWKV operator development (Python interface implementation) by @EliwiiKeeya in #1862
- fix llama and baichuan typo by @lvyufeng in #1883
- update core.ops with pyboost by @lvyufeng in #1884
- fix pytest error by @lvyufeng in #1885
- fix mobilebert register by @lvyufeng in #1890
- feat: add file lock for downloading remote files to a local path in multi-threaded environments by @Tridu33 in #1887
- fix ia3 by @lvyufeng in #1891
- expose prob as positional argument for bernoulli ops by @qhzhuang in #1949
- fix model llama for split function in line 852 by @zhuizhuzheming in #1941
- fix TensorPy for mindspore 2.5 by @lvyufeng in #1961
- fix TensorPy init empty Tensor by @lvyufeng in #1962
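- fix Blip2 loading and inference bugs #1902 #1904 #1905 by @Alemax067 in #1958
- [Open Source Internship] Mask2Former model application development by @linrx-ctrl in #1770
- [Open Source Internship] TAPAS model application development by @sjtu-weimang in #1839
- [Open Source Internship] MaskFormer model application development by @linrx-ctrl in #1864
- [Open Source Internship] YOLOS model application development by @chenxinxi in #1867
- [Open Source Internship] bert_japanese model fine-tuning by @zhuyuhua1 in #1892
- [Open Source Internship] bloom model fine-tuning by @guyueyuan in #1907
- [Open Source Internship] bert_generation model fine-tuning by @guyueyuan in #1911
- [Open Source Internship] bertweet model fine-tuning by @Alemax067 in #1964
- [Open Source Internship] blip_2 model fine-tuning by @Alemax067 in #1965
- ViTMAE model application development by @1hb6s7t in #1956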
- fix errors caused by mindspore 2.5-2.6 by @lvyufeng in #1985
- load safetensor back to numpy by @lvyufeng in #1986
- support triton self-defined op by @lvyufeng in #1990
- Open Source Internship: BEiT model fine-tuning by @4everImmortality in #1975
- [Open Source Internship] blenderbot model fine-tuning by @ZhFuGui in #1978
- [Open Source Internship] blenderbot_small model fine-tuning by @outbreak-sen in #1980
- fix bugs for ms2.5 by @lvyufeng in #1991
- Fix incorrect import in quick_start.md by @XueyanZhang in #1987
- use new safetensors loader by @lvyufeng in #1992
- fix ops.max by @lvyufeng in #1993
- [Open Source Internship] Bigbird pegasus model fine-tuning by @outbreak-sen in #1994
New Contributors
- @imeet000 made their first contribution in #1725
- @Yashbhatt786 made their first contribution in #1740
- @adarsh-jha-dev made their first contribution in #1746
- @Trace2333 made their first contribution in #1701
- @Helloyouth2022 made their first contribution in #1700
- @dyedd made their first contribution in #1694
- @yegoling made their first contribution in #1813
- @KhunLounZai made their first contribution in #1781
- @somecatw made their first contribution in #1680
- @YadlaMani made their first contribution in #1732
- @confused666 made their first contribution in #1838
- @chenxinxi made their first contribution in #1830
- @oucfm made their first contribution in #1850
- @reeered made their first contribution in #1861
- @xuhangscut made their first contribution in #1865
- @Tridu33 made their first contribution in #1852
- @EliwiiKeeya made their first contribution in #1862
- @zhuizhuzheming made their first contribution in #1941
- @Alemax067 made their first contribution in #1958
- @sjtu-weimang made their first contribution in #1839
- @zhuyuhua1 made their first contribution in #1892
- @guyueyuan made their first contribution in #1907
- @ZhFuGui made their first contribution in #1978
- @XueyanZhang made their first contribution in #1987
Full Changelog: v0.4.0...v0.4.1