StackOverflow2013

Note that there are some explanatory texts on larger screens.

plurals

POHow to define a utilization function which can be called both by a CUDA kernel and a regular C++ function
primarykey
Id
18195066
data
AcceptedAnswerId
18195808
AnswerCount
1
ClosedDate
CommentCount
2
CommunityOwnedDate
CreationDate
2013-08-12T19:23:55.457
FavoriteCount
0
LastActivityDate
2013-08-13T20:26:08.823
LastEditDate
2013-08-13T18:15:29.017
LastEditorUserId
964314
OwnerUserId
964314
ParentId
0
PostTypeId
1
Score
0
ViewCount
1970
LastEditorDisplayName
text
Body
I'm working on a project which involves a lot of mathematics. For a single target problem( for example, gradient calculation), we always have two versions of implementations: one CPU version and one CUDA version. Now the CPU version is written in regular C++ and the kernel version is written in CUDA. If I want to define a small function, for example, vec_weight which returns the weight of a vector, I have to write one for the CPU compiled by g++ for the CPU version and one cuda version which has an "__device__" before it compiled by nvcc. I'm not trying to define a "__device__ __host__" function here. What I want is kind of a library which can be called by a regular C++ function and a CUDA kernel. I tried to use the "__CUDACC__" macro but it didn't work. Because we will have a lot of small utilization functions needed by both CPU version and GPU version, I think it is reasonable to combine them in to one. Writing the CPU version in .cu instead of .cpp may solve our problem but this is not what we want. So what should I do? Here is the code segment: head.h: <pre><code> 1 #ifndef HEAD_H 2 #define HEAD_H 3 #ifdef __cplusplus 4 extern "C"{ 5 #endif 6 __device__ __host__ void myprint(); 7 #ifdef __cplusplus 8 } 9 #endif 10 #endif </code></pre> head.cu: <pre><code> 1 #include "head.h" 2 #include <stdio.h> 3 void myprint(){ 4 // do something here 5 } </code></pre> main.cpp <pre><code> 1 #include "head.h" 2 int main(){ 3 myprint(); 4 } </code></pre> I compiled the head.cu by: <pre><code>nvcc -c head.cu </code></pre> Link them together by : <pre><code>g++ main.cpp head.o -o main ( The reason that I didn't use nvcc here is that we are using the PGI's pgcpp in our project and we need it to talk to the PGI's OMP library. But I'm sure that there is something wrong here but I don't know how to fix that. ) </code></pre> The error messages: <pre><code>In file included from main.cpp:18: head.h:6: error: ‘__device__’ does not name a type main.cpp: In function ‘int main()’: main.cpp:20: error: ‘myprint’ was not declared in this scope </code></pre> So I'm pretty sure that g++ couldn't recognize the "__device__" prefix here. But our project demands us to use PGCPP to compile the cpp file because this is the only way we can have omp directives works fine both in Fortran and C( Our project mixes C/C++, Fortran and CUDA). But here even the g++ can not work, so I think we have fix this first. 
Tags
<c++><cuda>
Title
How to define a utilization function which can be called both by a CUDA kernel and a regular C++ function
singulars
PostAcceptedAnswerId
1. PO
 singulars
 PostTypePostTypeId
 PTAnswer
PostParentId
1. This table or related slice is empty.
PostTypePostTypeId
1. PTQuestion
UserLastEditorUserId
1. USArcheosudoerus
UserOwnerUserId
1. USArcheosudoerus
plurals
PostLinksPostIdRelatedPostId
1. This table or related slice is empty.
PostLinksRelatedPostIdPostId
1. This table or related slice is empty.
PostsAcceptedAnswerId
1. This table or related slice is empty.
PostsParentIdCreationDate
1. PO
 singulars
 PostTypePostTypeId
 PTAnswer
VotesPostIdCreationDate
1. This table or related slice is empty.
CommentsPostId
1. COThis is exactly what `__host__ __device__` functions are for. Why are you trying to avoid creating one?
 singulars
 PostPostId
 POHow to define a utilization function which can be called both by a CUDA kernel and a regular C++ function
 UserUserId
 USJared Hoberock
2. CO@JaredHoberock What I'm trying to do is to make a common interface which can be called both by cuda and regular C++. We also don't want to use the NVCC to compile the C++ source file. Since regular C++ couldn't recognize "\__host\__" or "\__device\__", how can they help?
 singulars
 PostPostId
 POHow to define a utilization function which can be called both by a CUDA kernel and a regular C++ function
 UserUserId
 USArcheosudoerus

Querying!

Guidance

A row detail

Detail views are divided into sections. All the information in the data section comes from columns in the selected row. The other sections display data from other, related rows.

Related data can be related in a to-one or a to-many fashion. Captions of data related in a to-many fashion link to a list view showing a filtered view of the table.

Try moving around until you find a non-empty to-many entry and click on the label to get to one. You can move back to the root by clicking on the database name in the header.