no code implementations • 28 Mar 2025 • Feng Lin, Dong Jae Kim, Zhenhao Li, Jinqiu Yang, Tse-Hsun, Chen
When varying workflows, the tested LLMs show significantly different NFR-aware code generation capabilities between two workflows: (1) integrating NFRs and functional requirements into the initial prompt and (2) enhancing Function-Only-generated code with the same NFR.
no code implementations • 3 Mar 2025 • Mohammod N. I. Suvon, Shuo Zhou, Prasun C. Tripathi, Wenrui Fan, Samer Alabed, Bishesh Khanal, Venet Osmani, Andrew J. Swift, Chen, Haiping Lu
Several studies in PH have applied machine learning to low-cost diagnostic tools like 12-lead ECG (12L-ECG), but they mainly focus on areas with limited resources, overlooking areas with no diagnostic tools, such as rural primary healthcare in LMICs.
no code implementations • 2 Jan 2025 • Rui Hu, Luc, Chen, Yiwei Wang
The nature of modern data is increasingly real-time, making outlier detection crucial in any data-related field, such as finance for fraud detection and healthcare for monitoring patient vitals.
1 code implementation • 6 Dec 2024 • Yitian Zhang, Huseyin Coskun, Xu Ma, Huan Wang, Ke Ma, Xi, Chen, Derek Hao Hu, Yun Fu
Thus, we propose a general framework, named Scala, to enable a single network to represent multiple smaller ViTs with flexible inference capability, which aligns with the inherent design of ViT to vary from widths.
no code implementations • 7 Nov 2024 • Ke Xu, Rui Zhang, He, Chen
We employ a belief propagation approach to compute the posterior distributions of the positions of the scatterers and the transmitter.
1 code implementation • International Conference on Machine Learning (ICML) 2024 • Chen, Sijia and Li, Baochun
For instance, the solving rate of GPT-4 with TR outperforms the current best by $9\%$ on the \texttt{MATH} dataset.
no code implementations • 13 Jul 2024 • Gurpreet Gosal, Yishi Xu, Gokul Ramakrishnan, Rituraj Joshi, Avraham Sheinin, Zhiming, Chen, Biswajit Mishra, Natalia Vassilieva, Joel Hestness, Neha Sengupta, Sunil Kumar Sahu, Bokang Jia, Onkar Pandit, Satheesh Katipomu, Samta Kamboj, Samujjwal Ghosh, Rahul Pal, Parvez Mullah, Soundar Doraiswamy, Mohamed El Karim Chami, Preslav Nakov
By continually pre-training on a mix of Arabic and English corpora, the model retains its proficiency in English while acquiring capabilities in Arabic.
no code implementations • 10 Jul 2024 • Shahroz Khan, Zahid Masood, Muhammad Usama, Konstantinos Kostas, Panagiotis Kaklis, Wei, Chen
In this work, we propose a set of physics-informed geometric operators (GOs) to enrich the geometric data provided for training surrogate/discriminative models, dimension reduction, and generative models, typically employed for performance prediction, dimension reduction, and creating data-driven parameterisations, respectively.
no code implementations • 24 Apr 2024 • Harit Vishwakarma, Reid, Chen, Sui Jiet Tay, Satya Sai Srinath Namburi, Frederic Sala, Ramya Korlakai Vinayak
We develop a tractable version of the framework to obtain \texttt{Colander} (Confidence functions for Efficient and Reliable Auto-labeling), a new post-hoc method specifically designed to maximize performance in TBAL systems.
1 code implementation • 17 Apr 2024 • Rachel, Chen, Juheon Lee, Chuang Gan, Zijiang Yang, Mohammad Amin Nabian, Jun Zeng
Metal Sintering is a necessary step for Metal Injection Molded parts and binder jet such as HP's metal 3D printer.
no code implementations • 17 Apr 2024 • Rachel, Chen, Wenjia Zheng, Sandeep Jalui, Pavan Suri, Jun Zeng
With the advancements in 3D printing technologies, it is extremely important that the quality of 3D printed objects, and dimensional accuracies should meet the customer's specifications.
no code implementations • 23 Mar 2024 • Feng Lin, Dong Jae Kim, Tse-Husn, Chen
We use GPT3. 5 as our underlying LLM and several baselines (RawGPT, CodeT, Reflexion) to evaluate code generation on four benchmarks: HumanEval, HumanEval-ET, MBPP, and MBPP-ET.
Ranked #11 on
Code Generation
on MBPP
no code implementations • 1 Feb 2024 • Hsiang Hsu, Guihong Li, Shaohan Hu, Chun-Fu, Chen
Predictive multiplicity refers to the phenomenon in which classification tasks may admit multiple competing models that achieve almost-equally-optimal performance, yet generate conflicting outputs for individual samples.
1 code implementation • 3 Jan 2024 • Shahla Shaan Ahmed, Shaowei Wang, Yuan Tian, Tse-Hsun, Chen, Haoxiang Zhang
Conclusion: Our findings suggest that it is possible to develop recommendation models for highlighting information for answers with different formatting styles on Stack Overflow.
1 code implementation • ICCV 2023 • Kan Wu, Houwen Peng, Zhenghong Zhou, Bin Xiao, Mengchen Liu, Lu Yuan, Hong Xuan, Michael Valenzuela, Xi, Chen, Xinggang Wang, Hongyang Chao, Han Hu
In this paper, we propose a novel cross-modal distillation method, called TinyCLIP, for large-scale language-image pre-trained models.
1 code implementation • 20 Sep 2023 • Nolan Dey, Daria Soboleva, Faisal Al-Khateeb, Bowen Yang, Ribhu Pathria, Hemant Khachane, Shaheer Muhammad, Zhiming, Chen, Robert Myers, Jacob Robert Steeves, Natalia Vassilieva, Marvin Tom, Joel Hestness
BTLM-3B-8K is available under an Apache 2. 0 license on Hugging Face: https://huggingface. co/cerebras/btlm-3b-8k-base.
1 code implementation • Proceedings of the AAAI Conference on Artificial Intelligence 2023 • Zeng, D., Liu, Chen, W., Zhou, L., Zhang, M., & Qu, H
Despite the great achievements of Graph Neural Networks (GNNs) in graph learning, conventional GNNs struggle to break through the upper limit of the expressiveness of first-order Weisfeiler-Leman graph isomorphism test algorithm (1-WL) due to the consistency of the propagation paradigm of GNNs with the 1-WL. Based on the fact that it is easier to distinguish the original graph through subgraphs, we propose a novel framework neural network framework called Substructure Aware Graph Neural Networks (SAGNN) to address these issues.
Ranked #11 on
Graph Regression
on ZINC
2 code implementations • 6 Apr 2023 • Nolan Dey, Gurpreet Gosal, Zhiming, Chen, Hemant Khachane, William Marshall, Ribhu Pathria, Marvin Tom, Joel Hestness
We study recent research advances that improve large language models through efficient pre-training and scaling, and open datasets and tools.
no code implementations • 21 Feb 2023 • Zhao, Chen
In this study, we propose an incremental mutual information based improved swarm intelligent optimization method (IMIICSO), which uses rough set theory to calculate the importance of feature selection based on mutual information.
1 code implementation • COLING 2022 • Srinivas Sunkara, Maria Wang, Lijuan Liu, Gilles Baechler, Yu-Chung Hsiao, Jindong, Chen, Abhanshu Sharma, James Stout
Improving the accessibility and automation capabilities of mobile devices can have a significant positive impact on the daily lives of countless users.
no code implementations • 2 Oct 2022 • Avichai Snir, Haipeng, Chen, Daniel Levy
Using a large store level retail CPI data, we find that 0 ending prices are popular and rigid at convenience stores even when they offer little transaction convenience.
1 code implementation • CVPR 2022 • Yi Li, Rameswar Panda, Yoon Kim, Chun-Fu, Chen, Rogerio Feris, David Cox, Nuno Vasconcelos
In particular, given a source sentence an autoregressive hallucination transformer is used to predict a discrete visual representation from the input text, and the combined text and hallucinated representations are utilized to obtain the target translation.
no code implementations • 25 Apr 2022 • Quanfu Fan, Donghyun Kim, Chun-Fu, Chen, Stan Sclaroff, Kate Saenko, Sarah Adel Bargal
In this paper, we provide a deep analysis of temporal modeling for action recognition, an important but underexplored problem in the literature.
no code implementations • 21 Feb 2022 • Aixiang, Chen, Jinting Zhang, Zanbo Zhang, Zhihong Li
The fluctuation effect of gradient expectation and variance caused by parameter update between consecutive iterations is neglected or confusing by current mainstream gradient optimization algorithms. Using this fluctuation effect, combined with the stratified sampling strategy, this paper designs a novel \underline{M}emory \underline{S}tochastic s\underline{T}ratified Gradient Descend(\underline{MST}GD) algorithm with an exponential convergence rate.
1 code implementation • 12 Feb 2022 • Zhen Li, Guenevere, Chen, Chen Chen, Yayi Zou, Shouhuai Xu
Recent studies show that current source code authorship attribution methods can be compromised by attackers exploiting adversarial examples and coding style manipulation.
1 code implementation • ICLR 2022 • Quanfu Fan, Chun-Fu, Chen, Rameswar Panda
We explore a new perspective on video understanding by casting the video recognition problem as an image recognition task.
no code implementations • 15 Jan 2019 • Timo Kohlberger, Yun Liu, Melissa Moran, Po-Hsuan, Chen, Trissia Brown, Craig H. Mermel, Jason D. Hipp, Martin C. Stumpe
OOF is often only detected upon careful review, potentially causing rescanning and workflow delays.
no code implementations • 15 Nov 2018 • Kunal Nagpal, Davis Foote, Yun Liu, Po-Hsuan, Chen, Ellery Wulczyn, Fraser Tan, Niels Olson, Jenny L. Smith, Arash Mohtashamian, James H. Wren, Greg S. Corrado, Robert MacDonald, Lily H. Peng, Mahul B. Amin, Andrew J. Evans, Ankur R. Sangoi, Craig H. Mermel, Jason D. Hipp, Martin C. Stumpe
For prostate cancer patients, the Gleason score is one of the most important prognostic factors, potentially determining treatment independent of the stage.
no code implementations • 20 Jun 2017 • Jintao Ke, Hongyu Zheng, Hai Yang, Xiqun, Chen
The fusion of convolutional techniques and the LSTM network enables the proposed DL approach to better capture the spatio-temporal characteristics and correlations of explanatory variables.
no code implementations • 3 Jan 2017 • Nithyanand Kota, Abhishek Mishra, Sunil Srinivasa, Xi, Chen, Pieter Abbeel
The high variance issue in unbiased policy-gradient methods such as VPG and REINFORCE is typically mitigated by adding a baseline.
1 code implementation • 4 Dec 2015 • Xin, Chen, Jeffrey M Beck, John M. Pearson
Here, we present a Bayesian model for exploratory data analysis that is capable of automatically identifying the features present in unstructured stimuli based solely on neuronal responses.
no code implementations • IEEE Transactions on Geoscience and Remote Sensing 2008 • Lee, J. S., Wen, J. H., Ainsworth, T. L., Chen, K. S., & Chen, A. J
This improved sigma filter remains simple in concept and is computationally efficient but without the deficiencies of the original Lee sigma filter.