Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Changes special characters in samples names from incoming matrix file #2

Open
spacocha opened this issue Dec 18, 2015 · 1 comment
Open

Comments

@spacocha
Copy link
Owner

dbOTUcaller.py changes the headers if the headers contain periods (probably spaces or other characters too). Underscores (_) seem to be ok. This can create an issue with down stream analysis because you have different samples names than you thought. Make a pre-flight check in SmileTrain and dbOTUcaller.py which identifies this error and doesn't continue unless it's fixed.

@swo
Copy link

swo commented Dec 18, 2015

Does this happen because you load the table in R? If so, R's read.table function has a check.names option, which you can set to FALSE to keep whatever was in the file. Let as TRUE, the default, it would change 1234 to X1234, A+B to A.B, etc.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants